Received: by 2002:a5d:9c59:0:0:0:0:0 with SMTP id 25csp2274778iof; Wed, 8 Jun 2022 01:07:34 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyMyazEAazumSRudSYgvF11+AyEtDKXheW4+EOY7g37mdG8SxwMw6e5zmlK5FYnJu2iHsKO X-Received: by 2002:a17:903:248:b0:155:e660:b774 with SMTP id j8-20020a170903024800b00155e660b774mr33389071plh.174.1654675654228; Wed, 08 Jun 2022 01:07:34 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1654675654; cv=none; d=google.com; s=arc-20160816; b=j/OzQXwSMGYRFsvkx/C3dzz3xJP8BCTxdRLv2RUNKX0fZo65w4GP7PnIUsQU+K18ce 8Aj0bt0dBfLySjodY7lNak91TUVr+hKvXHnW4e98/lrE32dGF377/9LNYCf0paTf7nL2 ryyLXiKNSdOEWrBN4pYiT7ckb1adT0FG6I51SHECogmf0w6L+GpzIZFFNzSJQGaKrKcB LX3rLK0bAe53vUEVEaTgVYoqgntbkzDIyjNcolX3SXYmHZDHOT2a6e2eAC0wYGmnQ6OB +muNFaP6Bk4Ax91D8SuXeRe444grB8bafTResNe+0kCEblkEPn8L5ubTQt9lQPrRgkTT s8Nw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:robot-unsubscribe :robot-id:message-id:mime-version:references:in-reply-to:cc:subject :to:reply-to:sender:from:dkim-signature:dkim-signature:date; bh=jngsz6dYDWW2zX55YZPRWzvjcIWvdvuKFA7feqYO/0c=; b=k0+EGs3c04pJhZ/1vftnm3GceJfhzsAf6eWhiPHrdKKpuVe3QoolKyk9y5PBOey1mx /XeJzXgOZx6HlwJ33qp/4nNp1xWeYsLGY3S0GjTYxWrYqj4EmD1zDgE60IQE5vzk8+I/ 1lECe//3MhlumBO+ssDjnOEKAYSzk+7nTukjHOYMPofhIlQh88BNqwEpELuxyF+Qk/69 WhG2TZ23O+XKiyZWVyix0+OyZ7oILBgSp5ysp227od9/xilFy3Ua30qUdGPzaLoQfekj gpJcdZwF+dJdhZzcG30G+7wkADUUN0x/F1WcnQQsUkPV9ktcKF4UP4q7Oqa7O0Xb0ZCl IJIQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=GjltvIMB; dkim=neutral (no key) header.i=@linutronix.de; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id k189-20020a6384c6000000b003fe2062e88fsi3378191pgd.833.2022.06.08.01.07.33 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 08 Jun 2022 01:07:34 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=GjltvIMB; dkim=neutral (no key) header.i=@linutronix.de; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id A25FA205FA3; Wed, 8 Jun 2022 00:38:57 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1345060AbiFGQir (ORCPT + 99 others); Tue, 7 Jun 2022 12:38:47 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39442 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1343997AbiFGQip (ORCPT ); Tue, 7 Jun 2022 12:38:45 -0400 Received: from galois.linutronix.de (Galois.linutronix.de [IPv6:2a0a:51c0:0:12e:550::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 7768F7E1F4; Tue, 7 Jun 2022 09:38:44 -0700 (PDT) Date: Tue, 07 Jun 2022 16:38:40 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020; t=1654619922; h=from:from:sender:sender:reply-to:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=jngsz6dYDWW2zX55YZPRWzvjcIWvdvuKFA7feqYO/0c=; b=GjltvIMB6VY/6I7I+8+lBegNmCrxvZXdNYFlxUsbQzDIag/6f+bUggWreW3T/FMVbNUgeG qMYs3vSV2L1GYvCo3prJwaAeuAsEYziG+AGLOjK+caAFs6U47yvqbtl1LLmKYUnE27h5pW mL2ejSaaFWgVrTs1mORCG9vyeteA9uxKkArftJ3BoOK3v1PDchx/KAjIdzwMTcumxolg38 iq8jg0BorP/+H2y63gjctwX1mwhN/DBLbIt+AckkgyvrKeh91HkI5/cd2v6WpUy9URuDq1 5nJD6qOdRPndl+lPXh8t6HnuGj7Yt3McPLyvuvYw/NLYIagkauaINQg201DIHg== DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020e; t=1654619922; h=from:from:sender:sender:reply-to:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=jngsz6dYDWW2zX55YZPRWzvjcIWvdvuKFA7feqYO/0c=; b=UmcgmJDiZQRRmbrYbrK6npKIRs3D31dwmnZTC7LfVFs3N3jpnGqux9wBv/P4gaK7veZUvj hnlRlXGISFYNW/Cw== From: "tip-bot2 for Nadav Amit" Sender: tip-bot2@linutronix.de Reply-to: linux-kernel@vger.kernel.org To: linux-tip-commits@vger.kernel.org Subject: [tip: x86/mm] x86/mm/tlb: Avoid reading mm_tlb_gen when possible Cc: Nadav Amit , Dave Hansen , "Peter Zijlstra (Intel)" , Andy Lutomirski , x86@kernel.org, linux-kernel@vger.kernel.org In-Reply-To: <20220606180123.2485171-1-namit@vmware.com> References: <20220606180123.2485171-1-namit@vmware.com> MIME-Version: 1.0 Message-ID: <165461992091.4207.8723057747625123650.tip-bot2@tip-bot2> Robot-ID: Robot-Unsubscribe: Contact to get blacklisted from these emails Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org The following commit has been merged into the x86/mm branch of tip: Commit-ID: aa44284960d550eb4d8614afdffebc68a432a9b4 Gitweb: https://git.kernel.org/tip/aa44284960d550eb4d8614afdffebc68a432a9b4 Author: Nadav Amit AuthorDate: Mon, 06 Jun 2022 11:01:23 -07:00 Committer: Dave Hansen CommitterDate: Tue, 07 Jun 2022 08:48:03 -07:00 x86/mm/tlb: Avoid reading mm_tlb_gen when possible On extreme TLB shootdown storms, the mm's tlb_gen cacheline is highly contended and reading it should (arguably) be avoided as much as possible. Currently, flush_tlb_func() reads the mm's tlb_gen unconditionally, even when it is not necessary (e.g., the mm was already switched). This is wasteful. Moreover, one of the existing optimizations is to read mm's tlb_gen to see if there are additional in-flight TLB invalidations and flush the entire TLB in such a case. However, if the request's tlb_gen was already flushed, the benefit of checking the mm's tlb_gen is likely to be offset by the overhead of the check itself. Running will-it-scale with tlb_flush1_threads show a considerable benefit on 56-core Skylake (up to +24%): threads Baseline (v5.17+) +Patch 1 159960 160202 5 310808 308378 (-0.7%) 10 479110 490728 15 526771 562528 20 534495 587316 25 547462 628296 30 579616 666313 35 594134 701814 40 612288 732967 45 617517 749727 50 637476 735497 55 614363 778913 (+24%) Signed-off-by: Nadav Amit Signed-off-by: Dave Hansen Acked-by: Peter Zijlstra (Intel) Acked-by: Andy Lutomirski Link: https://lkml.kernel.org/r/20220606180123.2485171-1-namit@vmware.com --- arch/x86/mm/tlb.c | 18 +++++++++++++++++- 1 file changed, 17 insertions(+), 1 deletion(-) diff --git a/arch/x86/mm/tlb.c b/arch/x86/mm/tlb.c index d400b6d..d9314cc 100644 --- a/arch/x86/mm/tlb.c +++ b/arch/x86/mm/tlb.c @@ -734,10 +734,10 @@ static void flush_tlb_func(void *info) const struct flush_tlb_info *f = info; struct mm_struct *loaded_mm = this_cpu_read(cpu_tlbstate.loaded_mm); u32 loaded_mm_asid = this_cpu_read(cpu_tlbstate.loaded_mm_asid); - u64 mm_tlb_gen = atomic64_read(&loaded_mm->context.tlb_gen); u64 local_tlb_gen = this_cpu_read(cpu_tlbstate.ctxs[loaded_mm_asid].tlb_gen); bool local = smp_processor_id() == f->initiating_cpu; unsigned long nr_invalidate = 0; + u64 mm_tlb_gen; /* This code cannot presently handle being reentered. */ VM_WARN_ON(!irqs_disabled()); @@ -771,6 +771,22 @@ static void flush_tlb_func(void *info) return; } + if (f->new_tlb_gen <= local_tlb_gen) { + /* + * The TLB is already up to date in respect to f->new_tlb_gen. + * While the core might be still behind mm_tlb_gen, checking + * mm_tlb_gen unnecessarily would have negative caching effects + * so avoid it. + */ + return; + } + + /* + * Defer mm_tlb_gen reading as long as possible to avoid cache + * contention. + */ + mm_tlb_gen = atomic64_read(&loaded_mm->context.tlb_gen); + if (unlikely(local_tlb_gen == mm_tlb_gen)) { /* * There's nothing to do: we're already up to date. This can