Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp770356rwd; Thu, 25 May 2023 03:49:18 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ4SVg5JapEv5rFh4zdzSP6bXSkTNdffM00SUAdBgRKinbY9kKTBLyuI5ZDHSYCAdv7UBS/G X-Received: by 2002:a05:6a20:6a0d:b0:10d:5c7:6608 with SMTP id p13-20020a056a206a0d00b0010d05c76608mr6690419pzk.30.1685011758540; Thu, 25 May 2023 03:49:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1685011758; cv=none; d=google.com; s=arc-20160816; b=Qjd7DksACG2bu598V+ydpMEa8NwPZGpEiLzR/el/vXdqbo+usUK1aRcMPUCOpYCSzR hMmTchaiqWUf7OGfo+hxH3Hhb1lAUBzbcbWrCnXhk/zF43dtbGOEC+oMF/bjCIxZtPsZ oPizsuE9Is+o9bOaGf/LLs2CTkgbgAm2ymttBEDojno3E1k0hRxtR65otrGaDU+XHgpW dt1DBHJC658/b9zb4dThfqLE/gHV9LngXAH33e9BzfAHoKEeSNDtZeS0CXv4i8WnVnvU ehBZhazCpdn1oCmoGiAlOJjdDoxbWOveu0YLYyjelnrmCUmfxA4Q7JeYCuQRVhTul/RK gWOg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=2b0PecxjqDga2q+GYKiWoz11cnthWs1G0M5GzyNmn34=; b=RMc5Pwswd4ozfP65cDN9RQhYrKnuEYLpAAISC1QUe/wc4eFplP0A2ocivVX93T9KNu atlOMn718huBpOQUUlZXN3LLzlswwOFAxCXRsg+nVsTWFuBXmMeRBcG7itbqFxlH6Gtk Q3DT3c5rNSkXNzqQXAWjVD/gWe3GAvt5dbkAyTNkRj3gskacsQlEKLkoTeqpxneLXH+n Fn8xz0u9GCq85emDUsRUQcmUjF5Vxyyk8IwGudrqRhWVtbqgL/FW8GNtkXaWH0jLmEv7 p2FPwY5CVs8eylkA8b5/TdSW13pSyEiYawGKun88O6NTMxxGZkHHNboDfAMU7+R/Q9r1 yj2Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@infradead.org header.s=casper.20170209 header.b=mpUeLnCW; spf=pass (google.com: domain of linux-crypto-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-crypto-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id j4-20020a17090a588400b002507cbb009bsi3252684pji.112.2023.05.25.03.48.56; Thu, 25 May 2023 03:49:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-crypto-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@infradead.org header.s=casper.20170209 header.b=mpUeLnCW; spf=pass (google.com: domain of linux-crypto-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-crypto-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240923AbjEYKbM (ORCPT + 99 others); Thu, 25 May 2023 06:31:12 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59324 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240752AbjEYKbL (ORCPT ); Thu, 25 May 2023 06:31:11 -0400 Received: from casper.infradead.org (casper.infradead.org [IPv6:2001:8b0:10b:1236::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A18B7E6; Thu, 25 May 2023 03:31:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=casper.20170209; h=In-Reply-To:Content-Type:MIME-Version: References:Message-ID:Subject:Cc:To:From:Date:Sender:Reply-To: Content-Transfer-Encoding:Content-ID:Content-Description; bh=2b0PecxjqDga2q+GYKiWoz11cnthWs1G0M5GzyNmn34=; b=mpUeLnCWo62XTpDTxX4burZAso 4r8I70jFGZVNknFGbGALewmKj5QyyPUMmDf6M1MerTWmgcKeUzUzEtXNEqaPjHyxCBnjAb3Lv70MZ l7lvU1Mx4eAPduZLZX2QpZh+U1lTy6ByReO1PgmZlX8E/iTEE/ijH39fUttPtdqP/o/tzowIaiOeF O0aQae7O5eq6uvs0Kg7tpmg/RNiZT6JSGFfKzy6eNymIVcT0hIlh6jkrpKYhbGwX6OJVtHYSd3YRk vziECqTmVDnUrcqh0m3alPjvQFSM6x2fyvISqjodfjXw1qSa6YkSNFLTMiO3Udk2pb+JeKz2f/QTx b49bAhdQ==; Received: from j130084.upc-j.chello.nl ([24.132.130.84] helo=noisy.programming.kicks-ass.net) by casper.infradead.org with esmtpsa (Exim 4.94.2 #2 (Red Hat Linux)) id 1q28EF-00C6Pg-2p; Thu, 25 May 2023 10:29:55 +0000 Received: from hirez.programming.kicks-ass.net (hirez.programming.kicks-ass.net [192.168.1.225]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by noisy.programming.kicks-ass.net (Postfix) with ESMTPS id 332303002C5; Thu, 25 May 2023 12:29:46 +0200 (CEST) Received: by hirez.programming.kicks-ass.net (Postfix, from userid 1000) id BE2BC24387C95; Thu, 25 May 2023 12:29:46 +0200 (CEST) Date: Thu, 25 May 2023 12:29:46 +0200 From: Peter Zijlstra To: torvalds@linux-foundation.org Cc: corbet@lwn.net, will@kernel.org, boqun.feng@gmail.com, mark.rutland@arm.com, catalin.marinas@arm.com, dennis@kernel.org, tj@kernel.org, cl@linux.com, hca@linux.ibm.com, gor@linux.ibm.com, agordeev@linux.ibm.com, borntraeger@linux.ibm.com, svens@linux.ibm.com, tglx@linutronix.de, mingo@redhat.com, bp@alien8.de, dave.hansen@linux.intel.com, x86@kernel.org, hpa@zytor.com, joro@8bytes.org, suravee.suthikulpanit@amd.com, robin.murphy@arm.com, dwmw2@infradead.org, baolu.lu@linux.intel.com, Arnd Bergmann , Herbert Xu , davem@davemloft.net, penberg@kernel.org, rientjes@google.com, iamjoonsoo.kim@lge.com, Andrew Morton , vbabka@suse.cz, roman.gushchin@linux.dev, 42.hyeyoo@gmail.com, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, linux-s390@vger.kernel.org, iommu@lists.linux.dev, linux-arch@vger.kernel.org, linux-crypto@vger.kernel.org, sfr@canb.auug.org.au, mpe@ellerman.id.au Subject: Re: [PATCH v3 08/11] slub: Replace cmpxchg_double() Message-ID: <20230525102946.GE38236@hirez.programming.kicks-ass.net> References: <20230515075659.118447996@infradead.org> <20230515080554.453785148@infradead.org> <20230524093246.GP83892@hirez.programming.kicks-ass.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20230524093246.GP83892@hirez.programming.kicks-ass.net> X-Spam-Status: No, score=-4.4 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED,SPF_HELO_NONE, SPF_NONE,T_SCC_BODY_TEXT_LINE,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-crypto@vger.kernel.org On Wed, May 24, 2023 at 11:32:47AM +0200, Peter Zijlstra wrote: > On Mon, May 15, 2023 at 09:57:07AM +0200, Peter Zijlstra wrote: > > > @@ -3008,6 +3029,22 @@ static inline bool pfmemalloc_match(stru > > } > > > > #ifndef CONFIG_SLUB_TINY > > +static inline bool > > +__update_cpu_freelist_fast(struct kmem_cache *s, > > + void *freelist_old, void *freelist_new, > > + unsigned long tid) > > +{ > > +#ifdef system_has_freelist_aba > > + freelist_aba_t old = { .freelist = freelist_old, .counter = tid }; > > + freelist_aba_t new = { .freelist = freelist_new, .counter = next_tid(tid) }; > > + > > + return this_cpu_cmpxchg_freelist(s->cpu_slab->freelist_tid.full, > > + old.full, new.full) == old.full; > > +#else > > + return false; > > +#endif > > +} > > + > > /* > > * Check the slab->freelist and either transfer the freelist to the > > * per cpu freelist or deactivate the slab. > > @@ -3359,11 +3396,7 @@ static __always_inline void *__slab_allo > > * against code executing on this cpu *not* from access by > > * other cpus. > > */ > > - if (unlikely(!this_cpu_cmpxchg_double( > > - s->cpu_slab->freelist, s->cpu_slab->tid, > > - object, tid, > > - next_object, next_tid(tid)))) { > > - > > + if (unlikely(!__update_cpu_freelist_fast(s, object, next_object, tid))) { > > note_cmpxchg_failure("slab_alloc", s, tid); > > goto redo; > > } > > @@ -3736,11 +3769,7 @@ static __always_inline void do_slab_free > > > > set_freepointer(s, tail_obj, freelist); > > > > - if (unlikely(!this_cpu_cmpxchg_double( > > - s->cpu_slab->freelist, s->cpu_slab->tid, > > - freelist, tid, > > - head, next_tid(tid)))) { > > - > > + if (unlikely(!__update_cpu_freelist_fast(s, freelist, head, tid))) { > > note_cmpxchg_failure("slab_free", s, tid); > > goto redo; > > } > > This isn't right; the this_cpu_cmpxchg_double() was unconditional and > relied on the local_irq_save() fallback when no native cmpxchg128 is > present. This then also means I need to look at this_cpu_cmpxchg128 and this_cpu_cmoxchg64 behaviour when we dont have the CPUID feature. Because current verions seem to assume the instruction is present.