Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp2606992iob; Mon, 16 May 2022 01:58:22 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwftsrqI80MH+7JA9hS7ZntirCbEORTBwM86lfmKcFySNlYH3Rw2/VyGMWYNddriCowQRat X-Received: by 2002:a17:906:c28a:b0:6f5:30e7:faf0 with SMTP id r10-20020a170906c28a00b006f530e7faf0mr14103402ejz.183.1652691501903; Mon, 16 May 2022 01:58:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1652691501; cv=none; d=google.com; s=arc-20160816; b=CSp2PJpHEiBfQT4rIubMYUyPDi4+W35H/r0rkKQoJBCNZcrx1E4OT8k4wOfmR1+n9X WG71FeHX1Wzn6ItVeCZhfIn+/R/N18gmfC0cWv+FNtOkX1wNiN/LR6RCuwoSKBe8+Equ NmxqE1eho2YWqjyNHEXoQw5uQb4EKumsa4QihD/Iy750oS+bhx2aP+sowNbQ7BadduQ3 W4txZtLkxLJDeOExCUaTq+I2RqlVTgZwkhVHDm+vXHr18DpUaNRStQIE2i1kXFn9AceZ X0G8V6DBNz8OMm9DSGNsFlplEyGhtEDKSOlXyNN3HMWWJNOvWWyN0vqxCzhXk0kBileS auXg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:dkim-signature; bh=XyHLOKOVQbDpT7eahnROR3m4U53LPSqe6ZPVDJA+AE8=; b=TU8QYoP/G/Qc0xwfaOyPqmNDHwm5jaSuKRlOgdgbDqCuoyESA9OGyIu+BqcZRiFxnI k1krGh8+Ub6Z8hFURljpASiz4Ewvl6f2XNQH7UVTTJcyScWIJ52GQA/XdTstuRwg+B9d LT3hGDN7XbsitbsJ+EunX1CSMYiy8v2YrZch3WeRnPjbJN9WD2DTqCVo1qaIfWyNHXdC QSfPdEzqf0VuJ7WqOCFSaA0jA8bH5rizevbfeURhRmwxBOZF+0V13qzoXKHwYl+iyKnl ppxOUAPZWiW1LKc3llBFEgxzOd7OL0OT8Qikh21NZKoA3IyWr7CzFrsx198q6k7VT8kH 32iw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=JNYkq5Ec; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id hp18-20020a1709073e1200b006fe1209e97esi9610908ejc.740.2022.05.16.01.57.56; Mon, 16 May 2022 01:58:21 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=JNYkq5Ec; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238277AbiEOSm3 (ORCPT + 99 others); Sun, 15 May 2022 14:42:29 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35872 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236850AbiEOSm0 (ORCPT ); Sun, 15 May 2022 14:42:26 -0400 Received: from mail-ej1-x62a.google.com (mail-ej1-x62a.google.com [IPv6:2a00:1450:4864:20::62a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 7C529DE7 for ; Sun, 15 May 2022 11:42:24 -0700 (PDT) Received: by mail-ej1-x62a.google.com with SMTP id j6so24841179ejc.13 for ; Sun, 15 May 2022 11:42:24 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=XyHLOKOVQbDpT7eahnROR3m4U53LPSqe6ZPVDJA+AE8=; b=JNYkq5EcWWSWYB4B3v514DHG8EMVEqmr6E5cLyvmNEv3kcSa5l/GmIaJOPki2W1vae R2x9MrhyOd0p2JC2fk20WnPSyN0093UElSysa3FHDoHXpjmCyOMAZwIgQQg0FUTWfDEo +TrmR+VrXXiL4qdSIlVOXpVKZOpowhniQHtV1SRpO5GFj1xn+N50bawcpGlbtKodxT8l ofkBd3JM+ZqR0c2laA5Ic1bYBttGtozbVNHsQC2A+9VpTPhYVbSHRwBsBVNYuMDIy3tm 810Ogsx4fBqpkqQ2xxAD1QsAK+sWvjD+KUYY+gudLx6NuRK+eyYjDg6BS9s1VAG8xPTW lcvA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=XyHLOKOVQbDpT7eahnROR3m4U53LPSqe6ZPVDJA+AE8=; b=wkWsTk9dO+X1FmsrG1NIGdctcFfvj0Qj5hu7kCX/AdBl1TJd2uRyGdY0H/54DeUDol pTPO1GYG7RDgk1KSBF9cpKA5lNt7VSvtqAE3NeNTB4zxz8WbD6Uv1jLB9Ex/gE2+IhIS rHg2cq2VqHYLqx4BtBtV/B3Fmvu1nqBoz0cp2sYuFzg7MHC8LL5qlBRyWSyHBOcD9DKc TmPnN+U6dJ3kTt/S7b4LGTQYoA6wFIMPgzcmFkXAqpAeKTsK6kHHLBUP6TsOAKdcVguT tilNxNFHB8YXc/dll/x5/MrNlHSWehAJKqbHMPFypkz6igyBdXLy7ogzbV0VLMmsmY3y NEcA== X-Gm-Message-State: AOAM5300hDUIUSPPeBTEUNFFvLN24Y4Rxx4AbYROJ/X7JktAcRtrKtV+ i1FTT08dsbHcxO/49qh/TUU= X-Received: by 2002:a17:907:86a5:b0:6f9:aa0f:a838 with SMTP id qa37-20020a17090786a500b006f9aa0fa838mr12179163ejc.340.1652640142963; Sun, 15 May 2022 11:42:22 -0700 (PDT) Received: from localhost.localdomain (93-103-18-160.static.t-2.net. [93.103.18.160]) by smtp.gmail.com with ESMTPSA id hv7-20020a17090760c700b006f3ef214e62sm2861898ejc.200.2022.05.15.11.42.21 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 15 May 2022 11:42:22 -0700 (PDT) From: Uros Bizjak To: x86@kernel.org, linux-kernel@vger.kernel.org Cc: Uros Bizjak , Thomas Gleixner , Ingo Molnar , Borislav Petkov , Dave Hansen , "H. Peter Anvin" , Will Deacon , Peter Zijlstra , Boqun Feng , Mark Rutland , "Paul E. McKenney" , Marco Elver Subject: [PATCH v3 0/2] locking/atomic/x86: Introduce arch_try_cmpxchg64 Date: Sun, 15 May 2022 20:42:02 +0200 Message-Id: <20220515184205.103089-1-ubizjak@gmail.com> X-Mailer: git-send-email 2.35.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org The followign patchset introduces generic support for try_cmpxchg64 and introduces arch_try_cmpxchg64 for 64-bit and 32-bit targets. On 64-bit targets, the generated assembly improves from: ab: 89 c8 mov %ecx,%eax ad: 48 89 4c 24 60 mov %rcx,0x60(%rsp) b2: 83 e0 fd and $0xfffffffd,%eax b5: 89 54 24 64 mov %edx,0x64(%rsp) b9: 88 44 24 60 mov %al,0x60(%rsp) bd: 48 89 c8 mov %rcx,%rax c0: c6 44 24 62 f2 movb $0xf2,0x62(%rsp) c5: 48 8b 74 24 60 mov 0x60(%rsp),%rsi ca: f0 49 0f b1 34 24 lock cmpxchg %rsi,(%r12) d0: 48 39 c1 cmp %rax,%rcx d3: 75 cf jne a4 to: b3: 89 c2 mov %eax,%edx b5: 48 89 44 24 60 mov %rax,0x60(%rsp) ba: 83 e2 fd and $0xfffffffd,%edx bd: 89 4c 24 64 mov %ecx,0x64(%rsp) c1: 88 54 24 60 mov %dl,0x60(%rsp) c5: c6 44 24 62 f2 movb $0xf2,0x62(%rsp) ca: 48 8b 54 24 60 mov 0x60(%rsp),%rdx cf: f0 48 0f b1 13 lock cmpxchg %rdx,(%rbx) d4: 75 d5 jne ab where a move and a compare after cmpxchg is saved. The improvements for 32-bit targets are even more noticeable, because dual-word compare after cmpxchg8b gets eliminated. Changes since v2: * Remove invalid and unnecessary cast from 32-bit arch_try_cmpxchg64. Changes since v1: * Implement full support for try_cmpxchg64{,_acquire,_release,_relaxed} and their falbacks involving cmpxchg64. * Split patch to generic and target-dependant part. * Modernize __try_cmpxchg64 asm template with symbolic operand name. * Use generic fallback when arch_try_cmpxchg64 is not defined. Cc: Thomas Gleixner Cc: Ingo Molnar Cc: Borislav Petkov Cc: Dave Hansen Cc: "H. Peter Anvin" Cc: Will Deacon Cc: Peter Zijlstra Cc: Boqun Feng Cc: Mark Rutland Cc: "Paul E. McKenney" Cc: Marco Elver Uros Bizjak (2): locking/atomic: Add generic try_cmpxchg64 support locking/atomic/x86: Introduce arch_try_cmpxchg64 arch/x86/include/asm/cmpxchg_32.h | 21 ++++++ arch/x86/include/asm/cmpxchg_64.h | 6 ++ include/linux/atomic/atomic-arch-fallback.h | 72 ++++++++++++++++++++- include/linux/atomic/atomic-instrumented.h | 40 +++++++++++- scripts/atomic/gen-atomic-fallback.sh | 31 +++++---- scripts/atomic/gen-atomic-instrumented.sh | 2 +- 6 files changed, 156 insertions(+), 16 deletions(-) -- 2.35.1