Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp652749pxj; Thu, 27 May 2021 08:40:26 -0700 (PDT) X-Google-Smtp-Source: ABdhPJy0lr8pEUcKy5GNRVNb76O+Bxo//egp0rdzSOA5TrKyiTW9/pyhkDNcunt/rFZWq7hPEnfZ X-Received: by 2002:a92:d38d:: with SMTP id o13mr3317150ilo.45.1622130026347; Thu, 27 May 2021 08:40:26 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1622130026; cv=none; d=google.com; s=arc-20160816; b=nDqbxwfLx7zZ0ATHYk/X8fGS6TEW45wQQ9ozuoSGpg9GTiUCM5ffOE/OaxiIo+Dhjn qyDOS6V50XouNKcyHLJdnCkhFdCvEw0TM+HC/vQJC/rrR4HreOt0R6rp2mSjZGyvMtVp eKJUzSqdxUatuFLD5dpZx5J1Efkdhno6PD3PuxNGntYJGSDNjwNtHJYQvlMDMXTdqNGj unm6RAwu2He/uK48l6vEz/RIVf7/Ydhh6ipLwL57rqzuHKMdjRFHavApAEDmZdsh2nMw 0KFnaeaqjIuSyKAO/qfZ2vblhh+3sG5NMEjX51q64ASbVKzETZb4s62m09Mwh+2j8Ylr HC6Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:organization :references:cc:to:from:subject:ironport-sdr:ironport-sdr; bh=Xc4FAm/H6k2iXM1e8wF1O6YvOnK8G1vEKFG6NRKT4IQ=; b=tx9k3JclPcpQHJ6hd7L4TiuwX1hp9mSt+SaHx3g+KKmThmHN4k4ddZCRIBsv29SGJi n/xdm22Y379MbTcGEwA5pqe9/yAqP0crmv9t2vBbrghd9tDKRr095/I1DcaekyhuF1wB KXyAD15EDsNrInGJtVaFj//tTMKCJlZrkstZtBt+UcVgvayLqVm2cD2Acutu6tUa4tQ8 TfuUhP6bc1Ljxx/OJ+sbhiPybgZHgaXbcgAIA59GFd4tt4Q7sAmGCWMtdCqwGFDdCHqr MV2M/Cm2Pouq7HMaIdayNxlYSpZpyu4XWcJ/ATxx8XC9u0nTYEZFNgwJeP1nPqWOjUXg W6kA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id h17si2729201jas.28.2021.05.27.08.40.13; Thu, 27 May 2021 08:40:26 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235792AbhE0JZ2 (ORCPT + 99 others); Thu, 27 May 2021 05:25:28 -0400 Received: from mga05.intel.com ([192.55.52.43]:47318 "EHLO mga05.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235699AbhE0JZ1 (ORCPT ); Thu, 27 May 2021 05:25:27 -0400 IronPort-SDR: c5Exn7zCIbDzv9gxTLSAv726rzRPmMWjJ4QsTLDe66OjaoR5mHv6pEjaLDycsEaUsbODweFNsX 7Xmogk+kW/jw== X-IronPort-AV: E=McAfee;i="6200,9189,9996"; a="288277859" X-IronPort-AV: E=Sophos;i="5.82,334,1613462400"; d="scan'208";a="288277859" Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by fmsmga105.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 27 May 2021 02:23:55 -0700 IronPort-SDR: +zPwMvPqWjJaGITy6u17k9Dl02fHfHPahnIW4RE2PM1CJnKV0l4AorqSvgVeLC8NGTh+6NHOSp iGnIXKuq3Wkg== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.82,334,1613462400"; d="scan'208";a="480498295" Received: from ahunter-desktop.fi.intel.com (HELO [10.237.72.174]) ([10.237.72.174]) by fmsmga002.fm.intel.com with ESMTP; 27 May 2021 02:23:52 -0700 Subject: Re: [PATCH v1 1/2] perf auxtrace: Change to use SMP memory barriers From: Adrian Hunter To: Peter Zijlstra Cc: Leo Yan , Arnaldo Carvalho de Melo , Ingo Molnar , Mark Rutland , Alexander Shishkin , Namhyung Kim , Andi Kleen , linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org References: <20210519140319.1673043-1-leo.yan@linaro.org> <3c7dcd5d-fddd-5d3b-81ac-cb7b615b0338@intel.com> Organization: Intel Finland Oy, Registered Address: PL 281, 00181 Helsinki, Business Identity Code: 0357606 - 4, Domiciled in Helsinki Message-ID: <7cdc3578-a50e-89ef-477a-3dc1f84f96bb@intel.com> Date: Thu, 27 May 2021 12:24:15 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.10.1 MIME-Version: 1.0 In-Reply-To: <3c7dcd5d-fddd-5d3b-81ac-cb7b615b0338@intel.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 27/05/21 11:25 am, Adrian Hunter wrote: > On 27/05/21 11:11 am, Peter Zijlstra wrote: >> On Thu, May 27, 2021 at 10:54:56AM +0300, Adrian Hunter wrote: >>> On 19/05/21 5:03 pm, Leo Yan wrote: >>>> The AUX ring buffer's head and tail can be accessed from multiple CPUs >>>> on SMP system, so changes to use SMP memory barriers to replace the >>>> uniprocessor barriers. >>> >>> I don't think user space should attempt to be SMP-aware. >> >> Uhh, what? It pretty much has to. Since userspace cannot assume UP, it >> must assume SMP. > > Yeah that is what I meant, but consequently we generally shouldn't be > using functions called smp_ > >> >>> For perf tools, on __x86_64__ it looks like smp_rmb() is only a compiler barrier, whereas >>> rmb() is a "lfence" memory barrier instruction, so this patch does not >>> seem to do what the commit message says at least for x86. >> >> The commit message is somewhat confused; *mb() are not UP barriers >> (although they are available and useful on UP). They're device/dma >> barriers. >> >>> With regard to the AUX area, we don't know in general how data gets there, >>> so using memory barriers seems sensible. >> >> IIRC (but I ddn't check) the rule was that the kernel needs to ensure >> the AUX area is complete before it updates the head pointer. So if >> userspace can observe the head pointer, it must then also be able to >> observe the data. This is not something userspace can fix up anyway. >> >> The ordering here is between the head pointer and the data, and from a >> userspace perspective that's a regular smp ordering. Similar for the >> tail update, that's between our reading the data and writing the tail, >> regular cache coherent smp ordering. >> >> So ACK on the patch, it's sane and an optimization for both x86 and ARM. >> Just the Changelog needs work. > > If all we want is a compiler barrier, then shouldn't that be what we use? > i.e. barrier() I guess you are saying we still need to stop potential re-ordering across CPUs, so please ignore my comments. > >> >>>> Signed-off-by: Leo Yan >>>> --- >>>> tools/perf/util/auxtrace.h | 6 +++--- >>>> 1 file changed, 3 insertions(+), 3 deletions(-) >>>> >>>> diff --git a/tools/perf/util/auxtrace.h b/tools/perf/util/auxtrace.h >>>> index 472c0973b1f1..8bed284ccc82 100644 >>>> --- a/tools/perf/util/auxtrace.h >>>> +++ b/tools/perf/util/auxtrace.h >>>> @@ -452,7 +452,7 @@ static inline u64 auxtrace_mmap__read_snapshot_head(struct auxtrace_mmap *mm) >>>> u64 head = READ_ONCE(pc->aux_head); >>>> >>>> /* Ensure all reads are done after we read the head */ >>>> - rmb(); >>>> + smp_rmb(); >>>> return head; >>>> } >>>> >>>> @@ -466,7 +466,7 @@ static inline u64 auxtrace_mmap__read_head(struct auxtrace_mmap *mm) >>>> #endif >>>> >>>> /* Ensure all reads are done after we read the head */ >>>> - rmb(); >>>> + smp_rmb(); >>>> return head; >>>> } >>>> >>>> @@ -478,7 +478,7 @@ static inline void auxtrace_mmap__write_tail(struct auxtrace_mmap *mm, u64 tail) >>>> #endif >>>> >>>> /* Ensure all reads are done before we write the tail out */ >>>> - mb(); >>>> + smp_mb(); >>>> #if BITS_PER_LONG == 64 || !defined(HAVE_SYNC_COMPARE_AND_SWAP_SUPPORT) >>>> pc->aux_tail = tail; >>>> #else >>>> >>> >