Received: by 2002:a05:7412:b995:b0:f9:9502:5bb8 with SMTP id it21csp4900879rdb; Fri, 29 Dec 2023 19:08:19 -0800 (PST) X-Google-Smtp-Source: AGHT+IEjJfCx8QFM5hHE/eLaHUtwCRKzlyCbbnBQci/Vh3HUhOGC+cjGd8uFVTJ6ywUJKn+oZsxa X-Received: by 2002:a05:622a:308:b0:425:4043:8d2f with SMTP id q8-20020a05622a030800b0042540438d2fmr14041346qtw.74.1703905699689; Fri, 29 Dec 2023 19:08:19 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1703905699; cv=none; d=google.com; s=arc-20160816; b=rj+ughTbunF1a/xjgyTHtRn/Kv9hGrnGEty0b4FJ5NYQYdoPpivMvqf3qTF8VssGUC MhlW1ccOjYd3AxlOacDL61h5TccX9wF78aBMblTHrllgJ16CRrQTw2V+1qL5n0Ra7/jl kkZUqLPP2yR8nwhRCtcMJEPsR/G0bVcesl3jR2SizU9CroWbO1EKGzwi7r2cmmbmVUn0 hcaBfIYJWDLKFDY+v4CEIAAqeO/qg1BcTm86haSWHC9ES48ta7dHpspN+eDczms4aelM qsk/8lnu8GjFNBs/I7MVWara7JW6Pz4nPvDNpyQPPt90u7gbAjC62Mio5lsmnj8lfzst yPsQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:date:message-id:dkim-signature; bh=AmyPFQZTx3DBNTrvibOLHdlFPMZhkmpr5I+XfX8iSuk=; fh=pvHai+PZt5LffIZ++I0WChek9Xa9cRTyDX+3LL+snXY=; b=QgNpjca4Qz3UJmNeF8CcviyH0GNdlXZCo4DWNbvQqANumEgJUexq3zPaA4xlPYH4Of yu2M0DuoTiLKtQomiVD1oSPrZJ+jD0MDqHLvF5E2JqjWZWQ+ccxy/BKjL4dQ+9E3LoBh tF3ZO9b2ALwIxXq7BerMK4nNSV5kpjE7o4vbW7z1A+1DVYXbU9EgvjfXkmIJ2N8wxb96 sqPXOO4UzYsAFMHVfOG1HKa2GL+bRG54uH2L3gG6ZyjKv33+E3IfXPoS0I7Zw8u8NkSg 79xEg7Sa+9aBiCO6FP5xBPf+7uYjO1k7jQWxeYuzuBXpX/aeXt21tzeTJKw5VEQ84iSk xTew== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=K+RgksLq; spf=pass (google.com: domain of linux-kernel+bounces-13319-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-13319-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from ny.mirrors.kernel.org (ny.mirrors.kernel.org. [2604:1380:45d1:ec00::1]) by mx.google.com with ESMTPS id u10-20020a05622a14ca00b0042573335c80si20484755qtx.263.2023.12.29.19.08.19 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 29 Dec 2023 19:08:19 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-13319-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) client-ip=2604:1380:45d1:ec00::1; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=K+RgksLq; spf=pass (google.com: domain of linux-kernel+bounces-13319-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-13319-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ny.mirrors.kernel.org (Postfix) with ESMTPS id 5075D1C20A73 for ; Sat, 30 Dec 2023 03:08:19 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id F0520138E; Sat, 30 Dec 2023 03:08:13 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b="K+RgksLq" X-Original-To: linux-kernel@vger.kernel.org Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 959E310E9 for ; Sat, 30 Dec 2023 03:08:11 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=redhat.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=redhat.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1703905690; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=AmyPFQZTx3DBNTrvibOLHdlFPMZhkmpr5I+XfX8iSuk=; b=K+RgksLqf6HMR24zw0g4OTxXGRpQ6WceGs3VEQMc22hRptZJLy465N80edpGm668M515BC 0GHLmLZUYaXMjCprc5FvEuv3L5v+RPQJl54Vc55Fvzgr3LUql5ReadZQqkobP2N/MDAGGD 0grttZyxHIpA9qv+HuIPJ0ij3qg4HbU= Received: from mimecast-mx02.redhat.com (mx-ext.redhat.com [66.187.233.73]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-225-xf0T1TxuMLivzC5vapgPuA-1; Fri, 29 Dec 2023 22:08:04 -0500 X-MC-Unique: xf0T1TxuMLivzC5vapgPuA-1 Received: from smtp.corp.redhat.com (int-mx09.intmail.prod.int.rdu2.redhat.com [10.11.54.9]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id E58ED3C1CD0A; Sat, 30 Dec 2023 03:08:03 +0000 (UTC) Received: from [10.22.16.36] (unknown [10.22.16.36]) by smtp.corp.redhat.com (Postfix) with ESMTP id 4DE8B492BC6; Sat, 30 Dec 2023 03:08:03 +0000 (UTC) Message-ID: <346c40b5-686f-461e-a1e3-9f255418efb2@redhat.com> Date: Fri, 29 Dec 2023 22:08:03 -0500 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH next 4/5] locking/osq_lock: Optimise per-cpu data accesses. Content-Language: en-US To: David Laight , "'linux-kernel@vger.kernel.org'" , "'peterz@infradead.org'" Cc: "'mingo@redhat.com'" , "'will@kernel.org'" , "'boqun.feng@gmail.com'" , 'Linus Torvalds' , "'xinhui.pan@linux.vnet.ibm.com'" , "'virtualization@lists.linux-foundation.org'" , 'Zeng Heng' References: <73a4b31c9c874081baabad9e5f2e5204@AcuMS.aculab.com> From: Waiman Long In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Scanned-By: MIMEDefang 3.4.1 on 10.11.54.9 On 12/29/23 15:57, David Laight wrote: > this_cpu_ptr() is rather more expensive than raw_cpu_read() since > the latter can use an 'offset from register' (%gs for x86-84). > > Add a 'self' field to 'struct optimistic_spin_node' that can be > read with raw_cpu_read(), initialise on first call. > > Signed-off-by: David Laight > --- > kernel/locking/osq_lock.c | 14 +++++++++----- > 1 file changed, 9 insertions(+), 5 deletions(-) > > diff --git a/kernel/locking/osq_lock.c b/kernel/locking/osq_lock.c > index 9bb3a077ba92..b60b0add0161 100644 > --- a/kernel/locking/osq_lock.c > +++ b/kernel/locking/osq_lock.c > @@ -13,7 +13,7 @@ > */ > > struct optimistic_spin_node { > - struct optimistic_spin_node *next, *prev; > + struct optimistic_spin_node *self, *next, *prev; > int locked; /* 1 if lock acquired */ > int cpu; /* encoded CPU # + 1 value */ > }; > @@ -93,12 +93,16 @@ osq_wait_next(struct optimistic_spin_queue *lock, > > bool osq_lock(struct optimistic_spin_queue *lock) > { > - struct optimistic_spin_node *node = this_cpu_ptr(&osq_node); > + struct optimistic_spin_node *node = raw_cpu_read(osq_node.self); My gcc 11 compiler produces the following x86-64 code: 92        struct optimistic_spin_node *node = this_cpu_ptr(&osq_node);    0x0000000000000029 <+25>:    mov    %rcx,%rdx    0x000000000000002c <+28>:    add %gs:0x0(%rip),%rdx        # 0x34 Which looks pretty optimized for me. Maybe older compiler may generate more complex code. However, I do have some doubt as to the benefit of this patch at the expense of making the code a bit more complex. Cheers, Longman