Received: by 2002:a05:6358:16cc:b0:ea:6187:17c9 with SMTP id r12csp6420152rwl; Mon, 9 Jan 2023 08:08:02 -0800 (PST) X-Google-Smtp-Source: AMrXdXvNNGhDUbEro6VJ1uZ6xWZg3id+oc6IWgp4rOJJqSp/Stw2qIltIr3jrrjDLluX8GWt+a3u X-Received: by 2002:a17:90b:4b0a:b0:223:415c:2b82 with SMTP id lx10-20020a17090b4b0a00b00223415c2b82mr76007773pjb.27.1673280482276; Mon, 09 Jan 2023 08:08:02 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1673280482; cv=none; d=google.com; s=arc-20160816; b=IEnLakQH9bVse1WG0QrgQeK+9YdWI9nycMjMO7ttcOfzytzq2J/z3NluRIv62sELCt wtCJChkPqiyqWRuhy0dUUXjsI58YJrdKJWRgA7WiLyH9AcGSbr3sG5k845hlIJmgiZaV tc+gRobmhoDoyTDqdc/F6L6bkazvdTeOIOWtIAhfvKgEIlE6cXEhcYE+peT/XBT0hkAQ ztbylRRK+yqcbR3LgqkD59Avh1qvU12YOtqL83FSl4oJ8mwC6TTICWIwvpUwep+8QSH3 rEs7oNGYkZPdfv9W2iZWRRvNsYIud4Man6XhcxaPXE29GbHCR4vVcg8a9/r6Q2m91wh+ 1N0A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=V/cJ1YC6Qm4oi/+ePT1u46lkHDDlv6J9yjE28fdRIVI=; b=MIJf1FmXM6H3hpQ3dBv7IAZHWjU41WJXmDA3+NOzzJ+v/f6+srBbVM5mF2a0sSBRwm TJ8v9W734TUmsjD9lWO1174n9s1663eONmBNmqPR47CsfjSLxuN+kUl1BAqpPDjuHRxI 9w1MxiE+eiuiqxlbpSQKlqSCYyC2ti0JaWC9poM0UA2c+GQ1beHVcruU+GS3E4ghhcQX KvFL+6LUkmhchnNNpTkDzsteIBrNvcfgmG+wcIrKz+OKSrApCqr+31teg5Gydvao4EJy 0oB3nGXnt7VOpEEh/wYPstR+4zlKBk3WTWusmmrnbKM8imEuVOH6a1qBLGYtYTAMEZhH Jq/A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id lb18-20020a17090b4a5200b00218792fa2b3si14992705pjb.160.2023.01.09.08.07.55; Mon, 09 Jan 2023 08:08:02 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229643AbjAIP0t (ORCPT + 53 others); Mon, 9 Jan 2023 10:26:49 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41972 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231126AbjAIP0l (ORCPT ); Mon, 9 Jan 2023 10:26:41 -0500 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 6821F24F; Mon, 9 Jan 2023 07:26:40 -0800 (PST) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 240251042; Mon, 9 Jan 2023 07:27:22 -0800 (PST) Received: from FVFF77S0Q05N (unknown [10.57.37.103]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id EC3B03F587; Mon, 9 Jan 2023 07:26:37 -0800 (PST) Date: Mon, 9 Jan 2023 15:26:27 +0000 From: Mark Rutland To: Atish Patra Cc: linux-perf-users@vger.kernel.org, "linux-kernel@vger.kernel.org List" , Peter Zijlstra , Arnaldo Carvalho de Melo , Alexander Shishkin , Will Deacon , Stephane Eranian , Andi Kleen , Palmer Dabbelt , Beeman Strong , Atish Patra , Kan Liang , Anup Patel Subject: Re: Expected rdpmc behavior during context swtich and a RISC-V conundrum Message-ID: References: MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_MED, SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Jan 09, 2023 at 01:06:45AM -0800, Atish Patra wrote: > On Fri, Jan 6, 2023 at 4:02 AM Mark Rutland wrote: > > > > On Thu, Jan 05, 2023 at 11:59:24AM -0800, Atish Patra wrote: > > > Hi All, > > > There was a recent uabi update[1] for RISC-V that allows the users to > > > read cycle and instruction count without any checks. > > > We tried to restrict that behavior to address security concerns > > > earlier but it resulted in breakage for some user space > > > applications[2]. > > > Thus, previous behavior was restored where a user on RISC-V platforms > > > can directly read cycle or instruction count[3]. > > > > > > Comparison with other ISAs w.r.t user space access of counters: > > > ARM64 > > > -- Enabled/Disabled via (/proc/sys/kernel/perf_user_access) > > > -- Only for task bound events configured via perf. > > > > > > X86 > > > --- rdpmc instruction > > > --- Enable/Disable via “/sys/devices/cpu/rdpmc” > > > -- Before v4.0 > > > -- any process (even without active perf event) rdpmc > > > After v4.0 > > > -- Default behavior changed to support only active events in a > > > process’s context. > > > -- Configured through perf similar to ARM64 > > > -- Continue to maintain backward compatibility for unrestricted access > > > by writing 2 to “/sys/devices/cpu/rdpmc” > > > > > > IMO, RISC-V should only enable user space access through perf similar > > > to ARM64 and x86 (post v4.0). > > > However, we do have to support the legacy behavior to avoid > > > application breakage. > > > As per my understanding a direct user space access can lead to the > > > following problems: > > > > > > 1) There is no context switch support, so counts from other contexts are exposed > > > 2) If a perf user is allocated one of these counters, the counter > > > value will be written > > > > > > Looking at the x86 code as it continues to allow the above behavior, > > > rdpmc_always_available_key is enabled in the above case. However, > > > during the context switch (cr4_update_pce_mm) > > > only dirty counters are cleared. It only prevents leakage from perf > > > task to rdpmc task. > > > > > > How does the context switch of counters work for users who enable > > > unrestricted access by writing 2 to “/sys/devices/cpu/rdpmc” ? > > > Otherwise, rdpmc users likely get noise from other applications. Is > > > that expected ? > > > > Regardless of leakage, they're also going to get random jumps in the visible > > values of the cycle count and instruction count as the task is context-switched > > (and/or if those values get reset across idle, as can happen on arm64), so > > those aren't going to be useful unless a number of other constraints apply. > > > > Agreed. > > > AFAICT the affected package was actually a library of intrinsics; does this > > affect a real application, or was this just in tests? If it's the latter there > > might still be scope to properly lock this down... > > > > Unfortunately, there are real applications In RISC-V started using > cycle counters due to legacy reasons. > > Here is the short list from debian repo pointed out in [1] > https://codesearch.debian.net/search?q=%22rdcycle+%250%22 The first of those is GRUB, when running bare metal. The second is this library again, which is not a whole application. I see that it's used in some benchmarks in real applications, e.g. firefox, chrome. However, as above we know that's *broken* today. Is that code actually run? > Looking at aarch64 code in one of the application, it seems they rely > on reading "pmccntr_el0" to read time > https://sources.debian.org/src/chromium/108.0.5359.124-1/third_party/ffmpeg/libavutil/aarch64/timer.h/ That part is under ifdefs for mac os: | #if defined(__APPLE__) ... and it's nonsensical anyway, pmccntr_el0 is *not* a timer, and I don't know if Mac OS would bother to context-switch the value, so it's very likely broken anyhow. > AFAIK, any counter access from EL0 is disabled by default in > reset_pmuserenr_el0 and should be enabled via the > proc/sys/perf_user_access > in armv8pmu_enable_user_access. Is that correct ? Yes, we *only* enable access for tasks doing self-monitoring via perf. No other useage makes sense, since the value is arbitrarily reset, and it's not consistent across CPUs. It *cannot* be used as a timer. > I couldn't find any application actually enabling the access using > perf_user_access. Maybe I am missing something? > Otherwise, the above application would trap on access to pmccntr_el0. As above, that's on Mac OS, not Linux. Thanks, Mark. > > [1] https://groups.google.com/a/groups.riscv.org/g/sw-dev/c/REWcwYnzsKE?pli=1 > > Thanks, > > Mark. > > > > > This can be a security concern also where a rogue rdpmc user > > > application can monitor other critical applications to initiate side > > > channel attack. > > > > > > Am I missing something? Please correct my understanding of the x86 > > > implementation if it is wrong. > > > > > > [1] https://lore.kernel.org/lkml/20221201135110.3855965-1-conor.dooley@microchip.com/ > > > [2] https://groups.google.com/a/groups.riscv.org/g/sw-dev/c/REWcwYnzsKE?pli=1 > > > [3] https://lore.kernel.org/all/YxIzgYP3MujXdqwj@aurel32.net/T/ > > > > > > -- > > > Regards, > > > Atish > > > > -- > Regards, > Atish