Date: Tue, 14 Dec 2021 10:46:20 +0000
From: Will Deacon
To: Catalin Marinas
Cc: Xiongfeng Wang, mark.rutland@arm.com, linux-arm-kernel@lists.infradead.org,
 linux-kernel@vger.kernel.org, moyufeng@huawei.com
Subject: Re: [RFC PATCH v2] arm64: barrier: add macro dgh() to control memory accesses merging
Message-ID: <20211214104620.GA13682@willie-the-truck>
References: <20211015090511.92421-1-wangxiongfeng2@huawei.com>
 <3303413f-a8de-bd41-4095-80ffa98cf75b@huawei.com>

On Fri, Dec 03, 2021 at 12:47:47PM +0000, Catalin Marinas wrote:
> On Fri, Oct 22, 2021 at 09:51:40AM +0800, Xiongfeng Wang wrote:
> > On 2021/10/16 1:59, Catalin Marinas wrote:
> > > On Fri, Oct 15, 2021 at 05:05:11PM +0800, Xiongfeng Wang wrote:
> > >> diff --git a/arch/arm64/include/asm/barrier.h b/arch/arm64/include/asm/barrier.h
> > >> index 451e11e5fd23..d71a7457d619 100644
> > >> --- a/arch/arm64/include/asm/barrier.h
> > >> +++ b/arch/arm64/include/asm/barrier.h
> > >> @@ -18,6 +18,14 @@
> > >>  #define wfe()		asm volatile("wfe" : : : "memory")
> > >>  #define wfi()		asm volatile("wfi" : : : "memory")
> > >>
> > >> +/*
> > >> + * Data Gathering Hint:
> > >> + * This instruction prohibits merging memory accesses with Normal-NC or
> > >> + * Device-GRE attributes before the hint instruction with any memory accesses
> > >> + * appearing after the hint instruction.
> > >> + */
> > >> +#define dgh()		asm volatile("hint #6" : : : "memory")
> > >
> > > On its own, this patch doesn't do anything. It's more interesting to see
> > > how it will be used and maybe come up with a common name that other
> > > architectures would share (or just implement as no-opp). I'm not sure
> > > there was any conclusion last time we discussed this.
> >
> > In the last mail, I was suggested to investigate the code in other architecture
> > to find if there exists similar interface. I searched 'merg' in the code and
> > didn't find similar interface.
>
> Maybe no other architecture has such hint. They have write buffer
> draining but that's more expensive.
>
> > The only thing similar I found is in Intel Software Developer's Manual. It says
> > "Write Combining (WC) ...
> > Writes may be delayed and combined in the write
> > combining buffer (WC buffer) to reduce memory accesses. If the WC buffer is
> > partially filled, the writes may be delayed until the next occurrence of a
> > serializing event; such as an SFENCE or MFENCE instruction, CPUID or other
> > serializing instruction, a read or write to uncached memory, an interrupt
> > occurrence, or an execution of a LOCK instruction (including one with an
> > XACQUIRE or XRELEASE prefix)."
> > Maybe this is more like the write combine buffer flushing, not prevent merging.
> > Sorry I still didn't understand the difference clearly.
>
> IIUC those drivers on x86 just rely on the microarchitecture aspects of
> the draining (e.g. fill 64 bytes). On Arm we don't have such guarantee
> as there's a wide variation in implementations, hence the DGH
> instruction.
>
> > How about a common name called 'merge_prohibit_hint()'? Could you give me some
> > suggestions ?
>
> I think "prohibit" looks more like not allowing any write-buffer merge.
> Maybe stop_merge_hint(), stop_write_buffer_merge(),
> stop_write_combining() (any other ideas?). It would be a NOP on all
> other architectures.

Given that DGH only affects prior memory accesses to Normal-NC or Device-GRE
memory, we probably want to avoid making this too broad. We don't use GRE
in Linux and I think Normal-NC is only exposed as an explicit part of the
I/O accessors in ioremap_wc().

So I'd be inclined to add something like:

	io_stop_wc()

with corresponding documentation in Documentation/memory-barriers.txt.

Will
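
For illustration only, here is a rough user-space sketch of how the suggested io_stop_wc() could be shaped: it expands to the DGH encoding ("hint #6") on arm64 and to a no-op everywhere else, as discussed above. The helper push_desc() and its parameters are hypothetical and are not part of any posted patch; in a real driver the window would come from ioremap_wc() and the copy would use a kernel I/O helper such as __iowrite64_copy() rather than a plain loop.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Sketch under assumptions: io_stop_wc() as named in this thread.
 * arm64: DGH (hint #6), which executes as a NOP on CPUs without it.
 * Other architectures: empty statement.
 */
#if defined(__aarch64__)
#define io_stop_wc()	__asm__ volatile("hint #6" : : : "memory")
#else
#define io_stop_wc()	do { } while (0)
#endif

/*
 * Hypothetical helper: write one descriptor into a write-combining
 * window, then hint that these stores should not be merged with any
 * stores that follow.
 */
static inline void push_desc(volatile uint64_t *wc_win,
			     const uint64_t *desc, size_t nwords)
{
	size_t i;

	/* Plain stores stand in for a kernel I/O copy helper. */
	for (i = 0; i < nwords; i++)
		wc_win[i] = desc[i];

	/* Ends the merge window for the stores above. */
	io_stop_wc();
}
```

Placing the hint after the last store of each descriptor, rather than inside the loop, still lets the stores within one descriptor be combined and only stops merging across descriptors.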