From: "Rafael J. Wysocki"
Date: Wed, 22 Mar 2023 19:24:49 +0100
Subject: Re: [PATCH v2] x86/ACPI/boot: Improve __acpi_acquire_global_lock
To: Uros Bizjak
Cc: x86@kernel.org, linux-acpi@vger.kernel.org, linux-kernel@vger.kernel.org,
    "Rafael J. Wysocki", Len Brown, Thomas Gleixner, Ingo Molnar,
    Borislav Petkov, Dave Hansen, "H. Peter Anvin"
In-Reply-To: <20230320212012.12704-1-ubizjak@gmail.com>

On Mon, Mar 20, 2023 at 10:20 PM Uros Bizjak wrote:
>
> Improve __acpi_acquire_global_lock by using a temporary variable.
> This enables compiler to perform if-conversion and improves generated
> code from:
>
>  ...
>  72a:   d1 ea                   shr    %edx
>  72c:   83 e1 fc                and    $0xfffffffc,%ecx
>  72f:   83 e2 01                and    $0x1,%edx
>  732:   09 ca                   or     %ecx,%edx
>  734:   83 c2 02                add    $0x2,%edx
>  737:   f0 0f b1 17             lock cmpxchg %edx,(%rdi)
>  73b:   75 e9                   jne    726 <__acpi_acquire_global_lock+0x6>
>  73d:   83 e2 03                and    $0x3,%edx
>  740:   31 c0                   xor    %eax,%eax
>  742:   83 fa 03                cmp    $0x3,%edx
>  745:   0f 95 c0                setne  %al
>  748:   f7 d8                   neg    %eax
>
> to:
>
>  ...
>  72a:   d1 e9                   shr    %ecx
>  72c:   83 e2 fc                and    $0xfffffffc,%edx
>  72f:   83 e1 01                and    $0x1,%ecx
>  732:   09 ca                   or     %ecx,%edx
>  734:   83 c2 02                add    $0x2,%edx
>  737:   f0 0f b1 17             lock cmpxchg %edx,(%rdi)
>  73b:   75 e9                   jne    726 <__acpi_acquire_global_lock+0x6>
>  73d:   8d 41 ff                lea    -0x1(%rcx),%eax
>
> BTW: the compiler could generate:
>
>   lea    0x2(%rcx,%rdx,1),%edx
>
> instead of:
>
>   or     %ecx,%edx
>   add    $0x2,%edx
>
> but unwanted conversion from add to or when bits are known to be zero
> prevents this improvement. This is GCC PR108477.
>
> No functional change intended.
>
> Signed-off-by: Uros Bizjak
> Cc: "Rafael J. Wysocki"
> Cc: Len Brown
> Cc: Thomas Gleixner
> Cc: Ingo Molnar
> Cc: Borislav Petkov
> Cc: Dave Hansen
> Cc: "H. Peter Anvin"
> ---
> v2: Expand return statement.

Acked-by: Rafael J. Wysocki

or please let me know if you want me to pick this up (in which case it
will require an ACK from one of the x86 maintainers).

> ---
>  arch/x86/kernel/acpi/boot.c | 11 ++++++++---
>  1 file changed, 8 insertions(+), 3 deletions(-)
>
> diff --git a/arch/x86/kernel/acpi/boot.c b/arch/x86/kernel/acpi/boot.c
> index 1c38174b5f01..a08a4a7a03f8 100644
> --- a/arch/x86/kernel/acpi/boot.c
> +++ b/arch/x86/kernel/acpi/boot.c
> @@ -1853,13 +1853,18 @@ early_param("acpi_sci", setup_acpi_sci);
>
>  int __acpi_acquire_global_lock(unsigned int *lock)
>  {
> -	unsigned int old, new;
> +	unsigned int old, new, val;
>
>  	old = READ_ONCE(*lock);
>  	do {
> -		new = (((old & ~0x3) + 2) + ((old >> 1) & 0x1));
> +		val = (old >> 1) & 0x1;
> +		new = (old & ~0x3) + 2 + val;
>  	} while (!try_cmpxchg(lock, &old, new));
> -	return ((new & 0x3) < 3) ? -1 : 0;
> +
> +	if (val)
> +		return 0;
> +
> +	return -1;
>  }
>
>  int __acpi_release_global_lock(unsigned int *lock)
> --
> 2.39.2
>
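
For anyone who wants to convince themselves of the "no functional change"
claim outside the kernel, here is a rough userspace sketch. It is not the
kernel code: try_cmpxchg_sketch(), acquire_old(), acquire_new() and
count_mismatches() are names made up for illustration, READ_ONCE() is
replaced by atomic_load(), and the kernel's try_cmpxchg() is approximated
with C11 atomic_compare_exchange_strong(), which also refreshes the expected
value on failure. Since new & 0x3 is always 2 + val, the old test
"(new & 0x3) < 3" should be the same as "val == 0", and the loop below
checks that for a range of starting lock words.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-in for the kernel's try_cmpxchg(): on failure,
 * *old is updated with the value currently stored in *ptr. */
static bool try_cmpxchg_sketch(_Atomic unsigned int *ptr,
			       unsigned int *old, unsigned int new_val)
{
	return atomic_compare_exchange_strong(ptr, old, new_val);
}

/* Pre-patch form: the result is derived from the low two bits of new_val. */
static int acquire_old(_Atomic unsigned int *lock)
{
	unsigned int old, new_val;

	old = atomic_load(lock);
	do {
		new_val = (((old & ~0x3) + 2) + ((old >> 1) & 0x1));
	} while (!try_cmpxchg_sketch(lock, &old, new_val));
	return ((new_val & 0x3) < 3) ? -1 : 0;
}

/* Post-patch form: bit 1 of old is kept in val and tested directly. */
static int acquire_new(_Atomic unsigned int *lock)
{
	unsigned int old, new_val, val;

	old = atomic_load(lock);
	do {
		val = (old >> 1) & 0x1;
		new_val = (old & ~0x3) + 2 + val;
	} while (!try_cmpxchg_sketch(lock, &old, new_val));

	if (val)
		return 0;

	return -1;
}

/* Counts starting values for which the two forms disagree, either in
 * the return value or in the lock word they leave behind. */
unsigned int count_mismatches(void)
{
	unsigned int mismatches = 0;

	for (unsigned int start = 0; start < 1024; start++) {
		_Atomic unsigned int a;
		_Atomic unsigned int b;

		atomic_init(&a, start);
		atomic_init(&b, start);

		int ra = acquire_old(&a);
		int rb = acquire_new(&b);

		if (ra != rb || atomic_load(&a) != atomic_load(&b))
			mismatches++;
	}
	return mismatches;
}
```

Calling count_mismatches() from a small main() and checking that it returns
zero is enough to exercise all four combinations of the low two bits. This
only covers the single-threaded case, so it says nothing about the generated
code or about behaviour under contention.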