Received: by 2002:a05:7412:419a:b0:f3:1519:9f41 with SMTP id i26csp4316563rdh; Tue, 28 Nov 2023 19:22:25 -0800 (PST) X-Google-Smtp-Source: AGHT+IFgJqJZwuW3fBoG81WfJDnRWuiPyc7xfo1VyZPut8vBEJ46RpdNxehLlJGIa5gg35qAOwq2 X-Received: by 2002:a17:903:4d4:b0:1cf:636f:1bf8 with SMTP id jm20-20020a17090304d400b001cf636f1bf8mr24545970plb.24.1701228144961; Tue, 28 Nov 2023 19:22:24 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1701228144; cv=none; d=google.com; s=arc-20160816; b=Wyjxd5saYQ8+BbJM8j3J4VTJGVBbuEP4stGE5FyLIiNPwFjguGGJX2TbO6/7pVYGzQ E/aB1YYfuOq0ARq9QXFETIBCWMot6kxpQ7RUBZeHktoRQPaxHxim4C0t1y1sEASKf2T6 Z2QzD9QYt49uararYPm+RPc9TqMUF7cWk+v6wG9LKg0VhFkGS461RMJFG1AfOtPNBdPx QguGKrTzwZQGDWYTYVj6D+aDHrsy0r2Sl6vsySrp/5bokcV2icXg1dearMHq21WkkAky OIrW/IvvpodCreOWGFRWaj8cKp5C+0wvRYO+KE+3xZ+oHg3tRMl9D8JkT/iRwjGRPoLk CLKQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:from:subject:message-id:references :mime-version:in-reply-to:date:dkim-signature; bh=GMYQlWUldBV9LWasymxz6cRCo6mSsi6IzDXKk+8BatU=; fh=IfBS1eUR4Ph2d1rlykJm3WUUhflOgyoV/WQxPmHrs48=; b=CbKqlMomdBgM7vQNcOdlG5LfQodJbbASSj66w9QVrwv7vprqNADodKeAW7SWo6fvgQ i9K1gaUJM3QxHMh3C96wJJxwvABcJueK0X+tM+3/CfjmgZvEj0utA7E7aLzqiukhbgKX fZoxzbP8nZW4Mqjiu7AJmO8Eup6/AyLDjCMLU6Gbhsf9JBF2Qo0MC/ZTEzPe/nCfE+Fn q5Mn1HTxo3rcwnVfFtWsmDZxyPO6iz+txCzEMLqbR72FxnjfYZF/0j1VETApOE2zYp0u wHddwa0Kk1sZywLaRO1kh2RUJ9z88oCFLHi9UcLgKvU2uRY1JLpQH3aSeCA5u3Qbg4QO mIoQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20230601 header.b="AZ/yirB/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:7 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from snail.vger.email (snail.vger.email. [2620:137:e000::3:7]) by mx.google.com with ESMTPS id 11-20020a170902c14b00b001cf8e4e84besi12722512plj.175.2023.11.28.19.22.24 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 28 Nov 2023 19:22:24 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:7 as permitted sender) client-ip=2620:137:e000::3:7; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20230601 header.b="AZ/yirB/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:7 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by snail.vger.email (Postfix) with ESMTP id EF44D80C0380; Tue, 28 Nov 2023 19:22:23 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.11 at snail.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1376854AbjK2DWJ (ORCPT + 99 others); Tue, 28 Nov 2023 22:22:09 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53894 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234807AbjK2DWF (ORCPT ); Tue, 28 Nov 2023 22:22:05 -0500 Received: from mail-yw1-x114a.google.com (mail-yw1-x114a.google.com [IPv6:2607:f8b0:4864:20::114a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B871F19BC for ; Tue, 28 Nov 2023 19:22:08 -0800 (PST) Received: by mail-yw1-x114a.google.com with SMTP id 00721157ae682-5d1b431fa7bso20791357b3.1 for ; Tue, 28 Nov 2023 19:22:08 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20230601; t=1701228128; x=1701832928; darn=vger.kernel.org; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:from:to:cc:subject:date:message-id:reply-to; bh=GMYQlWUldBV9LWasymxz6cRCo6mSsi6IzDXKk+8BatU=; b=AZ/yirB/44st1c2ZHKBLXqzRbxEasy3KBBPGJALG7aHWkTWAW6SJwB49v+Nkz4U5m5 8IIjgEvQfBPuM34E2f2eiEXoluDlyAzXsDwmwHq2lCpCN78fr/FdAMJKvjGV6o4VJSbM 5dWuikSJx6swS7wEYfibZKtD3ymHsxK9YSOHGDo5qJWc6w8vyt/6Z3oW+RxU5JmU8NxG CrDWIZBk/vGYtRDZtO/3Kq9P+jZZ2iHbWCRNkOrgtDCc+TpnPP4EF4AvzRUg29AvLIIX kj1kBSMiSqYAlhE1ELnbhhLRirFG9jf9GMpRNbRcwkZq14LL3tXudmR6QEyqNVdkri3o Mpfw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1701228128; x=1701832928; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=GMYQlWUldBV9LWasymxz6cRCo6mSsi6IzDXKk+8BatU=; b=JVeNKMAaTkK83AnoUwoefzfbaw9Y6bvGhVGOqXo8OBXNchOmM0ncPxJR6LgUSy/fsT OD5tRUdDNVmJfUFyEIJOG/2Y9iyhzvInMYkn17v0jCShFBYieVpcSRITC5KhYrEi44gr i5SHQJjhjzFCOlohOnXyqPgUCHDj1lKataTKJeIvN3lIzyjqb82PJZBdLa9eGhmj4a6Z /pxX66UQMA8UrYwLs4j3NnWmEXrWDoR93tAvGxB4qgvckdUqF+0Oi9V6hgwAI7iShVc9 w4wlCwoi2w/WukubylRxRsag/i6x72ZnZpvApIyHvl/T8+JssdcvMA62p1l067rQN/+u daRQ== X-Gm-Message-State: AOJu0YzrxNzQFMWOs3PkFgLrZzX2XcnGhVSdPjDAj8aSkpvUSFPDFxtM M+Q17kWbJ0wWktYCt2Ciu3RlQ6nH1kFHiRMB X-Received: from yosry.c.googlers.com ([fda3:e722:ac3:cc00:20:ed76:c0a8:29b4]) (user=yosryahmed job=sendgmr) by 2002:a05:690c:f84:b0:5ca:ad72:2d78 with SMTP id df4-20020a05690c0f8400b005caad722d78mr634136ywb.8.1701228128017; Tue, 28 Nov 2023 19:22:08 -0800 (PST) Date: Wed, 29 Nov 2023 03:21:52 +0000 In-Reply-To: <20231129032154.3710765-1-yosryahmed@google.com> Mime-Version: 1.0 References: <20231129032154.3710765-1-yosryahmed@google.com> X-Mailer: git-send-email 2.43.0.rc1.413.gea7ed67945-goog Message-ID: <20231129032154.3710765-5-yosryahmed@google.com> Subject: [mm-unstable v4 4/5] mm: workingset: move the stats flush into workingset_test_recent() From: Yosry Ahmed To: Andrew Morton Cc: Johannes Weiner , Michal Hocko , Roman Gushchin , Shakeel Butt , Muchun Song , Ivan Babrou , Tejun Heo , "=?UTF-8?q?Michal=20Koutn=C3=BD?=" , Waiman Long , kernel-team@cloudflare.com, Wei Xu , Greg Thelen , Domenico Cerasuolo , linux-mm@kvack.org, cgroups@vger.kernel.org, linux-kernel@vger.kernel.org, Yosry Ahmed Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-9.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE, USER_IN_DEF_DKIM_WL autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (snail.vger.email [0.0.0.0]); Tue, 28 Nov 2023 19:22:24 -0800 (PST) The workingset code flushes the stats in workingset_refault() to get accurate stats of the eviction memcg. In preparation for more scoped flushed and passing the eviction memcg to the flush call, move the call to workingset_test_recent() where we have a pointer to the eviction memcg. The flush call is sleepable, and cannot be made in an rcu read section. Hence, minimize the rcu read section by also moving it into workingset_test_recent(). Furthermore, instead of holding the rcu read lock throughout workingset_test_recent(), only hold it briefly to get a ref on the eviction memcg. This allows us to make the flush call after we get the eviction memcg. As for workingset_refault(), nothing else there appears to be protected by rcu. The memcg of the faulted folio (which is not necessarily the same as the eviction memcg) is protected by the folio lock, which is held from all callsites. Add a VM_BUG_ON() to make sure this doesn't change from under us. No functional change intended. Signed-off-by: Yosry Ahmed Tested-by: Domenico Cerasuolo --- mm/workingset.c | 36 ++++++++++++++++++++++++------------ 1 file changed, 24 insertions(+), 12 deletions(-) diff --git a/mm/workingset.c b/mm/workingset.c index c17d45c6f29b0..dce41577a49d2 100644 --- a/mm/workingset.c +++ b/mm/workingset.c @@ -425,8 +425,16 @@ bool workingset_test_recent(void *shadow, bool file, bool *workingset) struct pglist_data *pgdat; unsigned long eviction; - if (lru_gen_enabled()) - return lru_gen_test_recent(shadow, file, &eviction_lruvec, &eviction, workingset); + rcu_read_lock(); + + if (lru_gen_enabled()) { + bool recent = lru_gen_test_recent(shadow, file, + &eviction_lruvec, &eviction, workingset); + + rcu_read_unlock(); + return recent; + } + unpack_shadow(shadow, &memcgid, &pgdat, &eviction, workingset); eviction <<= bucket_order; @@ -448,8 +456,16 @@ bool workingset_test_recent(void *shadow, bool file, bool *workingset) * configurations instead. */ eviction_memcg = mem_cgroup_from_id(memcgid); - if (!mem_cgroup_disabled() && !eviction_memcg) + if (!mem_cgroup_disabled() && + (!eviction_memcg || !mem_cgroup_tryget(eviction_memcg))) { + rcu_read_unlock(); return false; + } + + rcu_read_unlock(); + + /* Flush stats (and potentially sleep) outside the RCU read section */ + mem_cgroup_flush_stats_ratelimited(); eviction_lruvec = mem_cgroup_lruvec(eviction_memcg, pgdat); refault = atomic_long_read(&eviction_lruvec->nonresident_age); @@ -493,6 +509,7 @@ bool workingset_test_recent(void *shadow, bool file, bool *workingset) } } + mem_cgroup_put(eviction_memcg); return refault_distance <= workingset_size; } @@ -519,19 +536,16 @@ void workingset_refault(struct folio *folio, void *shadow) return; } - /* Flush stats (and potentially sleep) before holding RCU read lock */ - mem_cgroup_flush_stats_ratelimited(); - - rcu_read_lock(); - /* * The activation decision for this folio is made at the level * where the eviction occurred, as that is where the LRU order * during folio reclaim is being determined. * * However, the cgroup that will own the folio is the one that - * is actually experiencing the refault event. + * is actually experiencing the refault event. Make sure the folio is + * locked to guarantee folio_memcg() stability throughout. */ + VM_BUG_ON_FOLIO(!folio_test_locked(folio), folio); nr = folio_nr_pages(folio); memcg = folio_memcg(folio); pgdat = folio_pgdat(folio); @@ -540,7 +554,7 @@ void workingset_refault(struct folio *folio, void *shadow) mod_lruvec_state(lruvec, WORKINGSET_REFAULT_BASE + file, nr); if (!workingset_test_recent(shadow, file, &workingset)) - goto out; + return; folio_set_active(folio); workingset_age_nonresident(lruvec, nr); @@ -556,8 +570,6 @@ void workingset_refault(struct folio *folio, void *shadow) lru_note_cost_refault(folio); mod_lruvec_state(lruvec, WORKINGSET_RESTORE_BASE + file, nr); } -out: - rcu_read_unlock(); } /** -- 2.43.0.rc1.413.gea7ed67945-goog