Date: Mon, 28 Mar 2022 12:35:51 +0200
From: Peter Zijlstra
To: Nadav Amit
Cc: Thomas Gleixner, linux-kernel@vger.kernel.org, Nadav Amit, Dave Hansen, Ingo Molnar, Andy Lutomirski, x86@kernel.org
Subject: Re: [PATCH] x86/mm/tlb: avoid reading mm_tlb_gen when possible
Message-ID: <20220328103551.GY8939@worktop.programming.kicks-ass.net>
References: <20220322220757.8607-1-namit@vmware.com>
In-Reply-To: <20220322220757.8607-1-namit@vmware.com>

On Tue, Mar 22, 2022 at 10:07:57PM +0000, Nadav Amit wrote:
> From: Nadav Amit
>
> On extreme TLB shootdown storms, the mm's tlb_gen cacheline is highly
> contended and reading it should (arguably) be avoided as much as
> possible.
>
> Currently, flush_tlb_func() reads the mm's tlb_gen unconditionally,
> even when it is not necessary (e.g., the mm was already switched).
> This is wasteful.
>
> Moreover, one of the existing optimizations is to read mm's tlb_gen to
> see if there are additional in-flight TLB invalidations and flush the
> entire TLB in such a case. However, if the request's tlb_gen was already
> flushed, the benefit of checking the mm's tlb_gen is likely to be offset
> by the overhead of the check itself.
>
> Running will-it-scale with tlb_flush1_threads show a considerable
> benefit on 56-core Skylake (up to +24%):
>
> threads    Baseline (v5.17+)    +Patch
>  1         159960               160202
>  5         310808               308378 (-0.7%)
> 10         479110               490728
> 15         526771               562528
> 20         534495               587316
> 25         547462               628296
> 30         579616               666313
> 35         594134               701814
> 40         612288               732967
> 45         617517               749727
> 50         637476               735497
> 55         614363               778913 (+24%)
>

Acked-by: Peter Zijlstra (Intel)
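
For readers following the thread, the ordering described in the quoted changelog can be sketched in user-space C roughly as below. This is not the patch itself and not the kernel's flush_tlb_func(); the types mm_sketch and cpu_sketch, the function flush_sketch() and the shared_reads counter are hypothetical names invented for illustration. The sketch only shows the idea: handle the cases that need no flush first, and touch the shared, contended tlb_gen counter only once a flush is actually required.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the per-mm state: one shared counter. */
struct mm_sketch {
	_Atomic uint64_t tlb_gen;	/* contended under shootdown storms */
};

/* Hypothetical stand-in for the per-CPU state. */
struct cpu_sketch {
	uint64_t local_tlb_gen;	/* generation this CPU has caught up to */
	bool mm_switched;	/* the mm is no longer loaded on this CPU */
	unsigned int shared_reads;	/* how often the shared counter was read */
};

enum flush_action { FLUSH_NONE, FLUSH_PARTIAL, FLUSH_FULL };

/* Assumption: req_gen is the generation carried by the flush request. */
static enum flush_action flush_sketch(struct mm_sketch *mm,
				      struct cpu_sketch *cpu,
				      uint64_t req_gen)
{
	uint64_t mm_gen;
	enum flush_action act;

	assert(mm && cpu);

	/* Cheap, CPU-local checks first: no shared cacheline is touched. */
	if (cpu->mm_switched)
		return FLUSH_NONE;
	if (req_gen <= cpu->local_tlb_gen)
		return FLUSH_NONE;	/* request was already flushed */

	/* Only now pay for reading the contended counter. */
	mm_gen = atomic_load(&mm->tlb_gen);
	cpu->shared_reads++;

	/*
	 * If this request is the only outstanding one, a partial flush is
	 * enough; otherwise more invalidations are in flight, so flush
	 * everything and catch up to mm_gen in one go.
	 */
	if (req_gen == cpu->local_tlb_gen + 1 && req_gen == mm_gen)
		act = FLUSH_PARTIAL;
	else
		act = FLUSH_FULL;

	cpu->local_tlb_gen = mm_gen;
	return act;
}
```

Calling flush_sketch() with a request generation that the CPU has already reached returns FLUSH_NONE and leaves shared_reads unchanged, which is the saving the changelog argues for. The real function also has to deal with details such as lazy TLB mode and the flush range, which this sketch leaves out.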