Date: Fri, 12 Feb 2021 11:05:31 +0100
From: Michal Hocko
To: Johannes Weiner
Cc: Hugh Dickins, Andrew Morton, Shakeel Butt, Roman Gushchin,
 linux-mm@kvack.org, cgroups@vger.kernel.org,
 linux-kernel@vger.kernel.org, kernel-team@fb.com
Subject: Re: [PATCH v2] mm: page-writeback: simplify memcg handling in
 test_clear_page_writeback()
References:
 <20210209214543.112655-1-hannes@cmpxchg.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed 10-02-21 12:44:00, Johannes Weiner wrote:
[...]
> From 5bcc0f468460aa2670c40318bb657e8b08ef96d5 Mon Sep 17 00:00:00 2001
> From: Johannes Weiner
> Date: Tue, 9 Feb 2021 16:22:42 -0500
> Subject: [PATCH] mm: page-writeback: simplify memcg handling in
>  test_clear_page_writeback()
>
> Page writeback doesn't hold a page reference, which allows truncate to
> free a page the second PageWriteback is cleared. This used to require
> special attention in test_clear_page_writeback(), where we had to be
> careful not to rely on the unstable page->memcg binding and look up
> all the necessary information before clearing the writeback flag.
>
> Since commit 073861ed77b6 ("mm: fix VM_BUG_ON(PageTail) and
> BUG_ON(PageWriteback)") test_clear_page_writeback() is called with an
> explicit reference on the page, and this dance is no longer needed.
>
> Use unlock_page_memcg() and dec_lruvec_page_state() directly.
>
> This removes the last user of the lock_page_memcg() return value,
> change it to void. Touch up the comments in there as well. This also
> removes the last extern user of __unlock_page_memcg(), make it
> static. Further, it removes the last user of dec_lruvec_state(),
> delete it, along with a few other unused helpers.
>
> Signed-off-by: Johannes Weiner
> Acked-by: Hugh Dickins
> Reviewed-by: Shakeel Butt

Acked-by: Michal Hocko

Thanks!
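For readers following along without the tree at hand, here is a minimal
userspace sketch of the calling convention the changelog describes. It is
only an illustration under stated assumptions: every toy_* name is
hypothetical, the "lock" is a plain counter rather than RCU plus move_lock,
and nothing here is the kernel implementation. It shows only the shape of
the change: the lock function returns void, and the stat update and the
unlock both go through the page, which is safe because the caller holds a
page reference.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy stand-ins (hypothetical); not the kernel's real structures. */
struct toy_memcg {
	int nr_writeback;	/* models the NR_WRITEBACK counter */
	int lock_depth;		/* models the lock_page_memcg() section */
};

struct toy_page {
	struct toy_memcg *memcg;	/* models the page->memcg binding */
	int refcount;			/* reference held by the caller */
	bool writeback;			/* models PG_writeback */
};

/* Returns void: the caller needs no memcg pointer to unlock later. */
static void toy_lock_page_memcg(struct toy_page *page)
{
	if (page->memcg)
		page->memcg->lock_depth++;
}

static void toy_unlock_page_memcg(struct toy_page *page)
{
	if (page->memcg)
		page->memcg->lock_depth--;
}

/* Page-based stat update: resolves the memcg from the page itself. */
static void toy_dec_page_writeback_stat(struct toy_page *page)
{
	if (page->memcg)
		page->memcg->nr_writeback--;
}

/* Returns true if writeback was set and has now been cleared. */
static bool toy_test_clear_page_writeback(struct toy_page *page)
{
	bool ret;

	/* The caller's reference is what keeps the page usable below. */
	assert(page->refcount > 0);

	toy_lock_page_memcg(page);
	ret = page->writeback;
	page->writeback = false;
	if (ret)
		toy_dec_page_writeback_stat(page); /* after the flag is cleared */
	toy_unlock_page_memcg(page);
	return ret;
}
```

In this model, calling toy_test_clear_page_writeback() on a page with
writeback set and refcount of at least 1 clears the flag, decrements the
counter once, and leaves the lock balanced; a second call is a no-op that
returns false.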
> ---
>  include/linux/memcontrol.h | 10 ++--------
>  include/linux/vmstat.h     | 24 +++---------------------
>  mm/memcontrol.c            | 36 +++++++++++-------------------------
>  mm/page-writeback.c        |  9 +++------
>  4 files changed, 19 insertions(+), 60 deletions(-)
>
> diff --git a/include/linux/memcontrol.h b/include/linux/memcontrol.h
> index a44b2d51aecc..b17053af3287 100644
> --- a/include/linux/memcontrol.h
> +++ b/include/linux/memcontrol.h
> @@ -874,8 +874,7 @@ void mem_cgroup_print_oom_group(struct mem_cgroup *memcg);
>  extern bool cgroup_memory_noswap;
>  #endif
>
> -struct mem_cgroup *lock_page_memcg(struct page *page);
> -void __unlock_page_memcg(struct mem_cgroup *memcg);
> +void lock_page_memcg(struct page *page);
>  void unlock_page_memcg(struct page *page);
>
>  void __mod_memcg_state(struct mem_cgroup *memcg, int idx, int val);
> @@ -1269,12 +1268,7 @@ mem_cgroup_print_oom_meminfo(struct mem_cgroup *memcg)
>  {
>  }
>
> -static inline struct mem_cgroup *lock_page_memcg(struct page *page)
> -{
> -	return NULL;
> -}
> -
> -static inline void __unlock_page_memcg(struct mem_cgroup *memcg)
> +static inline void lock_page_memcg(struct page *page)
>  {
>  }
>
> diff --git a/include/linux/vmstat.h b/include/linux/vmstat.h
> index 506d625163a1..3299cd69e4ca 100644
> --- a/include/linux/vmstat.h
> +++ b/include/linux/vmstat.h
> @@ -512,16 +512,10 @@ static inline void mod_lruvec_page_state(struct page *page,
>
>  #endif /* CONFIG_MEMCG */
>
> -static inline void __inc_lruvec_state(struct lruvec *lruvec,
> -				      enum node_stat_item idx)
> -{
> -	__mod_lruvec_state(lruvec, idx, 1);
> -}
> -
> -static inline void __dec_lruvec_state(struct lruvec *lruvec,
> -				      enum node_stat_item idx)
> +static inline void inc_lruvec_state(struct lruvec *lruvec,
> +				    enum node_stat_item idx)
>  {
> -	__mod_lruvec_state(lruvec, idx, -1);
> +	mod_lruvec_state(lruvec, idx, 1);
>  }
>
>  static inline void __inc_lruvec_page_state(struct page *page,
> @@ -536,18 +530,6 @@ static inline void __dec_lruvec_page_state(struct page *page,
>  	__mod_lruvec_page_state(page, idx, -1);
>  }
>
> -static inline void inc_lruvec_state(struct lruvec *lruvec,
> -				    enum node_stat_item idx)
> -{
> -	mod_lruvec_state(lruvec, idx, 1);
> -}
> -
> -static inline void dec_lruvec_state(struct lruvec *lruvec,
> -				    enum node_stat_item idx)
> -{
> -	mod_lruvec_state(lruvec, idx, -1);
> -}
> -
>  static inline void inc_lruvec_page_state(struct page *page,
>  					 enum node_stat_item idx)
>  {
> diff --git a/mm/memcontrol.c b/mm/memcontrol.c
> index 9e455815fb7a..e29d3d64c27e 100644
> --- a/mm/memcontrol.c
> +++ b/mm/memcontrol.c
> @@ -2124,11 +2124,10 @@ void mem_cgroup_print_oom_group(struct mem_cgroup *memcg)
>   * This function protects unlocked LRU pages from being moved to
>   * another cgroup.
>   *
> - * It ensures lifetime of the returned memcg. Caller is responsible
> - * for the lifetime of the page; __unlock_page_memcg() is available
> - * when @page might get freed inside the locked section.
> + * It ensures lifetime of the locked memcg. Caller is responsible
> + * for the lifetime of the page.
>   */
> -struct mem_cgroup *lock_page_memcg(struct page *page)
> +void lock_page_memcg(struct page *page)
>  {
>  	struct page *head = compound_head(page); /* rmap on tail pages */
>  	struct mem_cgroup *memcg;
> @@ -2138,21 +2137,15 @@ struct mem_cgroup *lock_page_memcg(struct page *page)
>  	 * The RCU lock is held throughout the transaction. The fast
>  	 * path can get away without acquiring the memcg->move_lock
>  	 * because page moving starts with an RCU grace period.
> -	 *
> -	 * The RCU lock also protects the memcg from being freed when
> -	 * the page state that is going to change is the only thing
> -	 * preventing the page itself from being freed. E.g. writeback
> -	 * doesn't hold a page reference and relies on PG_writeback to
> -	 * keep off truncation, migration and so forth.
>  	 */
>  	rcu_read_lock();
>
>  	if (mem_cgroup_disabled())
> -		return NULL;
> +		return;
>  again:
>  	memcg = page_memcg(head);
>  	if (unlikely(!memcg))
> -		return NULL;
> +		return;
>
>  #ifdef CONFIG_PROVE_LOCKING
>  	local_irq_save(flags);
> @@ -2161,7 +2154,7 @@ struct mem_cgroup *lock_page_memcg(struct page *page)
>  #endif
>
>  	if (atomic_read(&memcg->moving_account) <= 0)
> -		return memcg;
> +		return;
>
>  	spin_lock_irqsave(&memcg->move_lock, flags);
>  	if (memcg != page_memcg(head)) {
> @@ -2170,24 +2163,17 @@ struct mem_cgroup *lock_page_memcg(struct page *page)
>  	}
>
>  	/*
> -	 * When charge migration first begins, we can have locked and
> -	 * unlocked page stat updates happening concurrently. Track
> -	 * the task who has the lock for unlock_page_memcg().
> +	 * When charge migration first begins, we can have multiple
> +	 * critical sections holding the fast-path RCU lock and one
> +	 * holding the slowpath move_lock. Track the task who has the
> +	 * move_lock for unlock_page_memcg().
>  	 */
>  	memcg->move_lock_task = current;
>  	memcg->move_lock_flags = flags;
> -
> -	return memcg;
>  }
>  EXPORT_SYMBOL(lock_page_memcg);
>
> -/**
> - * __unlock_page_memcg - unlock and unpin a memcg
> - * @memcg: the memcg
> - *
> - * Unlock and unpin a memcg returned by lock_page_memcg().
> - */
> -void __unlock_page_memcg(struct mem_cgroup *memcg)
> +static void __unlock_page_memcg(struct mem_cgroup *memcg)
>  {
>  	if (memcg && memcg->move_lock_task == current) {
>  		unsigned long flags = memcg->move_lock_flags;
> diff --git a/mm/page-writeback.c b/mm/page-writeback.c
> index eb34d204d4ee..f6c2c3165d4d 100644
> --- a/mm/page-writeback.c
> +++ b/mm/page-writeback.c
> @@ -2722,12 +2722,9 @@ EXPORT_SYMBOL(clear_page_dirty_for_io);
>  int test_clear_page_writeback(struct page *page)
>  {
>  	struct address_space *mapping = page_mapping(page);
> -	struct mem_cgroup *memcg;
> -	struct lruvec *lruvec;
>  	int ret;
>
> -	memcg = lock_page_memcg(page);
> -	lruvec = mem_cgroup_page_lruvec(page, page_pgdat(page));
> +	lock_page_memcg(page);
>  	if (mapping && mapping_use_writeback_tags(mapping)) {
>  		struct inode *inode = mapping->host;
>  		struct backing_dev_info *bdi = inode_to_bdi(inode);
> @@ -2755,11 +2752,11 @@ int test_clear_page_writeback(struct page *page)
>  		ret = TestClearPageWriteback(page);
>  	}
>  	if (ret) {
> -		dec_lruvec_state(lruvec, NR_WRITEBACK);
> +		dec_lruvec_page_state(page, NR_WRITEBACK);
>  		dec_zone_page_state(page, NR_ZONE_WRITE_PENDING);
>  		inc_node_page_state(page, NR_WRITTEN);
>  	}
> -	__unlock_page_memcg(memcg);
> +	unlock_page_memcg(page);
>  	return ret;
>  }
>
> --
> 2.30.0
>
--
Michal Hocko
SUSE Labs