Received: by 2002:a05:7412:31a9:b0:e2:908c:2ebd with SMTP id et41csp4829717rdb; Fri, 15 Sep 2023 13:43:41 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEel+tbFW+aHqG3EYXY+CQSoS7bhJL/HYhTs5pns2rw+SQMF8u1KGzR97TdHhXMX3Q3gTy6 X-Received: by 2002:a17:903:244a:b0:1c3:dafa:b1e9 with SMTP id l10-20020a170903244a00b001c3dafab1e9mr3233396pls.10.1694810621032; Fri, 15 Sep 2023 13:43:41 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1694810621; cv=none; d=google.com; s=arc-20160816; b=JCr2ww88uogYxE5q2FInVxJvu5mm5eN+1RPdOv63Gi39QPcRKai0xkUrUiGOEJR3eT KSBK/odBP+op5aAATxluaO1sx4nBIqyKO3gPhRs/1pzBfuO6KQyzkMlpHyGPHichB9Cy tXM1aqFdxTctE5DVJOPGicV5UhTLqDY+Yme47GjsYyOEOzuTqd6cBCuOLS/CwCcp8iFb YO/N4G/nTn+ux0qKkGWrFpSFACCshxtVb0eHfIoA2viv1F3hMGRIZRR/rOJ4FnOANu1s 3KhIBfEXDTFbFnaLJLwJbeAmpL6eR8v8lmKomuQXIEWEOmgJiyTSa6SZNzb/CFtHTXjO pQ8g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:sender:dkim-signature; bh=fCZPvKJZFLYsRKxBEXIvSXuAd3mfyz+vRzHl+fPG8MY=; fh=9NxIsWcYaPsi1kWNDMnMoRPHi/NPclxNM3HG4JvZM/U=; b=ZGU0GOBEZEVq1tObmF6AvpXI6YU9oPqWYE7t8ieUYeBmCO0gT9q4L3Nn5EQUOYUizb gytdG5/GdheE8QVQ6CVvOXC+mGRmxoEF6eS697tjq4sPP+khgDBVR4SUf3r+w3GEMeXQ qRZnpU1WbOQJ6mUojVI6SDvZKTKkcWpEMhV0CWMYBe4XDc04uKGKw22lLH7GiJ1hk8mx gmJkrRI8h3FTMDWw/k6bF8NBpGNvaBV8xusNL6PtiIaCJmgMoxyhDmmCj8KtDlPNkEqs 0Sv1ZONnaJg2tPDVNpUGxr2dGd+bcPIFUbMxZS0fXuFPjcSNWbGLX3dB9WU0m8FTxDxe fViQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20230601 header.b=Ho4Os4fu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.38 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from fry.vger.email (fry.vger.email. [23.128.96.38]) by mx.google.com with ESMTPS id bx29-20020a056a02051d00b005637e28270fsi4136563pgb.243.2023.09.15.13.43.40 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 15 Sep 2023 13:43:41 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.38 as permitted sender) client-ip=23.128.96.38; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20230601 header.b=Ho4Os4fu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.38 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by fry.vger.email (Postfix) with ESMTP id 2FA7D83B16C7; Fri, 15 Sep 2023 04:23:03 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at fry.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232147AbjIOLXD (ORCPT + 99 others); Fri, 15 Sep 2023 07:23:03 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34890 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230153AbjIOLXB (ORCPT ); Fri, 15 Sep 2023 07:23:01 -0400 Received: from mail-wm1-x336.google.com (mail-wm1-x336.google.com [IPv6:2a00:1450:4864:20::336]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id DE64C1AB for ; Fri, 15 Sep 2023 04:22:56 -0700 (PDT) Received: by mail-wm1-x336.google.com with SMTP id 5b1f17b1804b1-401ec23be82so20991825e9.0 for ; Fri, 15 Sep 2023 04:22:56 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1694776975; x=1695381775; darn=vger.kernel.org; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:sender:from:to:cc:subject:date:message-id :reply-to; bh=fCZPvKJZFLYsRKxBEXIvSXuAd3mfyz+vRzHl+fPG8MY=; b=Ho4Os4fue8m/1Fx9QHfplA7j4Xy9Dq9BnSmeONCy/Hhd16kO9zGh8HYud+q5ZhO6yL 6VPC3CQiBCvzbWj987MYIZUIcdtE525JY3OVuDI9Rk2culNVEm27UeMaOQUqPVwOP1Fo Y8/4dtsoSJoT2sHlfHPzSNk0Wb8sMmRDOIMcYhVCqJFGbGwI0PXeDx8NkKmYS+B1vsfw sssDTLZpk4SQWyShDmmpw//I+rZu8moLr7MgsRRFuSBG6NLc3DZj/qsiNd7W2roQCU+j VxDC9SEaNjMVhiaE6Ev/+z8bYWg81c7CBysfIFZH4vM+rs3d60LnQJhOjWBFKKB3jt4K 7CWg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1694776975; x=1695381775; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:sender:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=fCZPvKJZFLYsRKxBEXIvSXuAd3mfyz+vRzHl+fPG8MY=; b=uCCA69bCuXxKhwxZs/AmvsRg54ARWYuNVZRJUAFMPhHoJ+mIcAiteoIjP6uIXC3AmD QY1AKUmKAvHREhQiQifYPRkwCEvBnXWXFA++0t5Nbmw/jQmktThSDsR0hTLlmE4/iiak tbT/isYHj6qJtItqcu5UdSTdEGTYWfVLitiSnGqhgiqJivx0FUP3hgmc0MHPMrpCOXY0 86QhsLjTHCEHQafWcAKcSXspxtK59blLPWI1meiNcQwUGQ2kXTIUGk5HwvRPWZ2BmS46 kfNj2MjYX6D9ScXjFkKFi2Uqg7WVxzCC1M6ujcV0y5HGXt8MPpD10FaK6Bv3/++AWhtR jNFw== X-Gm-Message-State: AOJu0Yw49px6u9GlwSxarYTpZVOUU3/stoz5DWD3lqH9X/ATWuoNVgsX NCXBd4cIQmx8halepBbf4Mk= X-Received: by 2002:a7b:ce07:0:b0:401:bdd7:499d with SMTP id m7-20020a7bce07000000b00401bdd7499dmr1290628wmc.25.1694776975183; Fri, 15 Sep 2023 04:22:55 -0700 (PDT) Received: from gmail.com (1F2EF265.nat.pool.telekom.hu. [31.46.242.101]) by smtp.gmail.com with ESMTPSA id m10-20020a7bce0a000000b003fee53feab5sm4394256wmc.10.2023.09.15.04.22.53 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 15 Sep 2023 04:22:54 -0700 (PDT) Sender: Ingo Molnar Date: Fri, 15 Sep 2023 13:22:52 +0200 From: Ingo Molnar To: Uros Bizjak Cc: x86@kernel.org, linux-kernel@vger.kernel.org, Peter Zijlstra , Thomas Gleixner , Ingo Molnar , Borislav Petkov , Dave Hansen , "H. Peter Anvin" Subject: Re: [PATCH 2/2] x86/percpu: Use raw_cpu_try_cmpxchg in preempt_count_set Message-ID: References: <20230830151623.3900-1-ubizjak@gmail.com> <20230830151623.3900-2-ubizjak@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-1.0 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on fry.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (fry.vger.email [0.0.0.0]); Fri, 15 Sep 2023 04:23:03 -0700 (PDT) * Ingo Molnar wrote: > > * Ingo Molnar wrote: > > > > > * Uros Bizjak wrote: > > > > > Use raw_cpu_try_cmpxchg instead of raw_cpu_cmpxchg (*ptr, old, new) == old. > > > x86 CMPXCHG instruction returns success in ZF flag, so this change saves a > > > compare after cmpxchg (and related move instruction in front of cmpxchg). > > > > > > Also, raw_cpu_try_cmpxchg implicitly assigns old *ptr value to "old" when > > > cmpxchg fails. There is no need to re-read the value in the loop. > > > > > > No functional change intended. > > > > > > Cc: Peter Zijlstra > > > Cc: Thomas Gleixner > > > Cc: Ingo Molnar > > > Cc: Borislav Petkov > > > Cc: Dave Hansen > > > Cc: "H. Peter Anvin" > > > Signed-off-by: Uros Bizjak > > > --- > > > arch/x86/include/asm/preempt.h | 4 ++-- > > > 1 file changed, 2 insertions(+), 2 deletions(-) > > > > > > diff --git a/arch/x86/include/asm/preempt.h b/arch/x86/include/asm/preempt.h > > > index 2d13f25b1bd8..4527e1430c6d 100644 > > > --- a/arch/x86/include/asm/preempt.h > > > +++ b/arch/x86/include/asm/preempt.h > > > @@ -31,11 +31,11 @@ static __always_inline void preempt_count_set(int pc) > > > { > > > int old, new; > > > > > > + old = raw_cpu_read_4(pcpu_hot.preempt_count); > > > do { > > > - old = raw_cpu_read_4(pcpu_hot.preempt_count); > > > new = (old & PREEMPT_NEED_RESCHED) | > > > (pc & ~PREEMPT_NEED_RESCHED); > > > - } while (raw_cpu_cmpxchg_4(pcpu_hot.preempt_count, old, new) != old); > > > + } while (!raw_cpu_try_cmpxchg_4(pcpu_hot.preempt_count, &old, new)); > > > > It would be really nice to have a before/after comparison of generated > > assembly code in the changelog, to demonstrate the effectiveness of this > > optimization. > > Never mind, you did exactly that in the September 6 variation of these > changes. I'll apply those. I mean, this third patch of yours: [PATCH] x86/percpu: Define {raw,this}_cpu_try_cmpxchg{64,128} Had a proper disassembly comparison - so I've applied all 3 optimization patches to tip:x86/asm as: b8e3dfa16ec5 ("x86/percpu: Use raw_cpu_try_cmpxchg() in preempt_count_set()") 5f863897d964 ("x86/percpu: Define raw_cpu_try_cmpxchg and this_cpu_try_cmpxchg()") 54cd971c6f44 ("x86/percpu: Define {raw,this}_cpu_try_cmpxchg{64,128}") Thanks, Ingo