Received: by 2002:a05:7412:6592:b0:d7:7d3a:4fe2 with SMTP id m18csp824346rdg; Fri, 11 Aug 2023 00:18:10 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEGQjoq1KjVsFaprGxHgRWtLWU6S5BdIB4ourNa74LqQbdfTQIavVtuv6bgChfxsxHHyOu7 X-Received: by 2002:a05:6a00:a88:b0:687:4bd8:159a with SMTP id b8-20020a056a000a8800b006874bd8159amr1103775pfl.19.1691738289558; Fri, 11 Aug 2023 00:18:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1691738289; cv=none; d=google.com; s=arc-20160816; b=DvwPJJb49m3NLuElDwb7GvJqQicwCKAulfb+z08iaJeNZFb2T7qep2eYP16edGGXKv skBILe4mPSid+Wu0SFflycfkfTaARoOdgC+P21/Az372bIaZgMCRTsMplUZ8oToyNjjd YwKHlWoyxOMbw2MyIMj1Eiuza8fCYvHsujG+MWDUYcDMQWjlm/d+d9fLUE31rARY1wbn tFs/L6yZLe1KbDQuxQn1xhFY/9D3/vu/BtWKsGabLK36Sl2Yd6w3yxc80uj6EOzfNFp+ dVnMUrVNuc60tVJpJ32G2yu3+wTJ+mvo7YpGPCWJiXJdkielZHcFFPP/TzSzkYUTKrGx Au7A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=NM9p3OmMetCo+g86ZgCAE3y+mAro/y71LeXsFc8qyoo=; fh=frEKJYqJQluS8Xa6B9Sdr0X2i73y2o8v4ng20ahcfWA=; b=eE+dyVHdpxXv08Bfj0CTWM2ObeQUukYa/4+ZV1efWdEpxFThaD7m3ZcdDyKvS2DYXk LlaEgjjGj802brVSQVZvEN0A7bNZl6cGiedoNmw6stwB7oAyzUmyfmuJflHBDlzIRpbn b5QlA8WC/PzInaGkCYHwreaFQlagxMncT/5aWj1B5XBvKH61diIp9YRX//pEozKWiocf eN1bAP8k5wx7O3322MxYjvNQ5Nn6vmW2oaXVUawtpXPQgpOBEf+57n/HDdGII/tKgswA 550EGZRiih8Hhmb/p5wckSgJI7vjzikSoZFeSsv0OVpoo7loDcachcBi0Idl3Gg6hEed /6Cg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@microchip.com header.s=mchp header.b="L6a4LRB/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=microchip.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id fd20-20020a056a002e9400b006872ac0a222si3123621pfb.100.2023.08.11.00.17.57; Fri, 11 Aug 2023 00:18:09 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@microchip.com header.s=mchp header.b="L6a4LRB/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=microchip.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233754AbjHKG20 (ORCPT + 99 others); Fri, 11 Aug 2023 02:28:26 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:36190 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233777AbjHKG2Z (ORCPT ); Fri, 11 Aug 2023 02:28:25 -0400 Received: from esa.microchip.iphmx.com (esa.microchip.iphmx.com [68.232.154.123]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 286F72D41 for ; Thu, 10 Aug 2023 23:28:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=microchip.com; i=@microchip.com; q=dns/txt; s=mchp; t=1691735299; x=1723271299; h=date:from:to:cc:subject:message-id:references: mime-version:in-reply-to; bh=2OeaomGybcR9PEFYgBy/ntYKT5NkHHW6KabRuHr+4kY=; b=L6a4LRB/24cpLtmKqzXAuVZ8KcnXoM4cOc89QNqXwBpfSpo36AqZrepr 4kjYZNSTtqvtbJbgkUrFzJBi9c/l/j21TfePFt5ZvWsNuuTSRSGwzHNM0 CcozrwdIwLDrFC2W6EFy6DSJyIcrWU6HjgwirDaDpIO2BjCkJ0tej4U4t VZ5Zcmd6l67Q0dtpXcm1ERmVI1kQ9PaBaLTN5jOrprHvOQ7jxDIfXfsUe eYXI0K8ssQ2NJIyVhaziKERtyKq1dEulryKaqrjM0dYXeLNujV1cFmjqm qLxbN3xZwg4W4v52ZGUkUSDFBNrygJ/yW2wDUETfnql/14s+0ukn3FZuH g==; X-IronPort-AV: E=Sophos;i="6.01,164,1684825200"; d="asc'?scan'208";a="165961738" X-Amp-Result: UNKNOWN X-Amp-Original-Verdict: FILE UNKNOWN Received: from unknown (HELO email.microchip.com) ([170.129.1.10]) by esa6.microchip.iphmx.com with ESMTP/TLS/AES256-SHA256; 10 Aug 2023 23:28:17 -0700 Received: from chn-vm-ex01.mchp-main.com (10.10.85.143) by chn-vm-ex02.mchp-main.com (10.10.85.144) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2507.21; Thu, 10 Aug 2023 23:28:08 -0700 Received: from wendy (10.10.115.15) by chn-vm-ex01.mchp-main.com (10.10.85.143) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2507.21 via Frontend Transport; Thu, 10 Aug 2023 23:28:06 -0700 Date: Fri, 11 Aug 2023 07:27:28 +0100 From: Conor Dooley To: Guo Ren CC: Palmer Dabbelt , , Arnd Bergmann , Will Deacon , , , Mark Rutland , Paul Walmsley , , , , , Subject: Re: [RFC PATCH v5 5/5] riscv/cmpxchg: Implement xchg for variables of size 1 and 2 Message-ID: <20230811-overlook-displace-330af8289647@wendy> References: <98f523e515b2adc2aa7bb8d133353bad74e30897.camel@redhat.com> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="oNLbMQHR+XQMIo/L" Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, RCVD_IN_DNSWL_BLOCKED,RCVD_IN_MSPIKE_H5,RCVD_IN_MSPIKE_WL, SPF_HELO_PASS,SPF_NONE,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org --oNLbMQHR+XQMIo/L Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Fri, Aug 11, 2023 at 09:40:30AM +0800, Guo Ren wrote: > On Fri, Aug 11, 2023 at 12:23=E2=80=AFAM Palmer Dabbelt wrote: > > > > On Thu, 10 Aug 2023 09:04:04 PDT (-0700), leobras@redhat.com wrote: > > > On Thu, 2023-08-10 at 08:51 +0200, Arnd Bergmann wrote: > > >> On Thu, Aug 10, 2023, at 06:03, Leonardo Bras wrote: > > >> > xchg for variables of size 1-byte and 2-bytes is not yet available= for > > >> > riscv, even though its present in other architectures such as arm6= 4 and > > >> > x86. This could lead to not being able to implement some locking m= echanisms > > >> > or requiring some rework to make it work properly. > > >> > > > >> > Implement 1-byte and 2-bytes xchg in order to achieve parity with = other > > >> > architectures. > > >> > > > >> > Signed-off-by: Leonardo Bras > > >> > > > > > > Hello Arnd Bergmann, thanks for reviewing! > > > > > >> Parity with other architectures by itself is not a reason to do this, > > >> in particular the other architectures you listed have the instructio= ns > > >> in hardware while riscv does not. > > > > > > Sure, I understand RISC-V don't have native support for xchg on varia= bles of > > > size < 4B. My argument is that it's nice to have even an emulated ver= sion for > > > this in case any future mechanism wants to use it. > > > > > > Not having it may mean we won't be able to enable given mechanism in = RISC-V. > > > > IIUC the ask is to have a user within the kernel for these functions. > > That's the general thing to do, and last time this came up there was no > > in-kernel use of it -- the qspinlock stuff would, but we haven't enabled > > it yet because we're worried about the performance/fairness stuff that > > other ports have seen and nobody's got concrete benchmarks yet (though > > there's another patch set out that I haven't had time to look through, > > so that may have changed). > Conor doesn't agree with using an alternative as a detour mechanism > between qspinlock & ticket lock. Hold on a sec, I don't recall having a problem with alternatives - it was calling the stronger forward progress guarantee an erratum (which it isn't) and an ISA extension w/o any "abusing" that framework that I did not like. > So I'm preparing V11 with static_key > (jump_label) style. I don't think there's much point rushing into making it based on static keys when no progress has been made on implementing support for non-standard extensions. Changing to a static key doesn't change the detection mechanism, I've not got a problem with using alternatives for this stuff. Thanks, Conor. > Next version, I would separate paravirt_qspinlock > & CNA_qspinlock from V10. That would make it easy to review the > qspinlock patch series. You can review the next version V11. Now I'm > debugging a static_key init problem when load_modules, which is > triggered by our combo_qspinlock. >=20 > The qspinlock is being tested on the riscv platform [1] with 128 cores > with 8 NUMA nodes, next, I would update the comparison results of > qspinlock & ticket lock. >=20 > [1]: https://www.sophon.ai/ >=20 > > > > So if something uses these I'm happy to go look closer. > > > > >> Emulating the small xchg() through cmpxchg() is particularly tricky > > >> since it's easy to run into a case where this does not guarantee > > >> forward progress. > > >> > > > > > > Didn't get this part: > > > By "emulating small xchg() through cmpxchg()", did you mean like emul= ating an > > > xchg (usually 1 instruction) with lr & sc (same used in cmpxchg) ? > > > > > > If so, yeah, it's a fair point: in some extreme case we could have mu= ltiple > > > threads accessing given cacheline and have sc always failing. On the = other hand, > > > there are 2 arguments on that: > > > > > > 1 - Other architectures, (such as powerpc, arm and arm64 without LSE = atomics) > > > also seem to rely in this mechanism for every xchg size. Another arch= s like csky > > > and loongarch use asm that look like mine to handle size < 4B xchg. > > > > > > > > >> This is also something that almost no architecture > > >> specific code relies on (generic qspinlock being a notable exception= ). > > >> > > > > > > 2 - As you mentioned, there should be very little code that will actu= ally make > > > use of xchg for vars < 4B, so it should be safe to assume its fine to= not > > > guarantee forward progress for those rare usages (like some of above = mentioned > > > archs). > > > > > >> I would recommend just dropping this patch from the series, at least > > >> until there is a need for it. > > > > > > While I agree this is a valid point, I believe its more interesting t= o have it > > > implemented if any future mechanism wants to make use of this. > > > > > > > > > Thanks! > > > Leo >=20 >=20 >=20 > --=20 > Best Regards > Guo Ren --oNLbMQHR+XQMIo/L Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iHUEABYIAB0WIQRh246EGq/8RLhDjO14tDGHoIJi0gUCZNXUwwAKCRB4tDGHoIJi 0njeAP0XzI1GhdVXpbWuJQNRNBa9GCfL9m7RtjGSCV19+HBuhQEA3vjnv64H/Z9T 0EDmuCpggBZ4G4zVu0qursp5bTFG4gg= =epxT -----END PGP SIGNATURE----- --oNLbMQHR+XQMIo/L--