Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp1194083pxb; Wed, 10 Feb 2021 02:26:01 -0800 (PST) X-Google-Smtp-Source: ABdhPJyGmh1bKLP8Eue2JnQDpSKCDNlqnstb2gQ9PPvHOkbFKa4TIL5jCAsDMUlrYdB0pLNeo1GT X-Received: by 2002:a05:6402:2683:: with SMTP id w3mr2486086edd.378.1612952760999; Wed, 10 Feb 2021 02:26:00 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1612952760; cv=none; d=google.com; s=arc-20160816; b=XR/7/vDf6D2SYONNunJCQJXA35hBLSl0AgGXR/4OxK5LltDf3OPMxSHiUp6MAe/JY2 WtMyarDcfV4CMrDfnGg0xfqRUXUuY34BA3vwZdIgSRAkCfj7y4wD4mXc6qiG0Ofl2Unr fYNm/8UnxTsItj+IQjjVu8rCS3Yp7wvNEtlNyQsgngvY40OhaD9PfJAOE22wCPFDArki RjJQY33JOGKBz5/i3zPpWEzjiqs2qXDtUckIV17GAYSyMDjcgVXcO0uVbYviKvOpt+1T vavA0rAkjeLe7DmSEdbG+e4pH91zOrqOaqMVFacomeNJTtlanzzZm/+uAb9AO1Bp15ou TYVg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=Ig9aYUdY0A4bRb/viKrqlRI25TdZpdS3B9s1FAQ8d5M=; b=u2xG5Jj6NKGSf/KY873yB8wbX/e9M1vXFzy2Dz3qgFK316Q8p5UFqSHu3/0PxuMG4+ K/kkyaftoy19JzuBuKVtpwe1OjhLS8+FO4hiM/gizcA2Nq1v/72b+zSyIda/28lSqRhk T/FY70O8iO7Az7MBTgboj0sIiVrPqptCmAx22wvS3oyTHr/XaV9vi0zaCW8V58b/z+ZX Z9mwgNw6ZoNpgIZadI2Zuh80jwFf2ReRBwPqNWG1KYB7Uqa8mDQ9ULh4xeDdTs3oyuBZ PO1FOh66t4OcTB1vT0UB0cmVffrwSWzQa4TCgV7hXlmjurgs2/tRiwix5Oba6fhQflj5 NQNw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b="RmTN3T8/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=NONE dis=NONE) header.from=suse.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id t21si1229479edi.124.2021.02.10.02.25.38; Wed, 10 Feb 2021 02:26:00 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b="RmTN3T8/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=NONE dis=NONE) header.from=suse.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230526AbhBJKWq (ORCPT + 99 others); Wed, 10 Feb 2021 05:22:46 -0500 Received: from mx2.suse.de ([195.135.220.15]:48952 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230497AbhBJKJU (ORCPT ); Wed, 10 Feb 2021 05:09:20 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=susede1; t=1612951696; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=Ig9aYUdY0A4bRb/viKrqlRI25TdZpdS3B9s1FAQ8d5M=; b=RmTN3T8/hICAOC6l1XMuobPQa8aGQBEt6sUVbhhqfPzbXPp+lhGVtVyXxTbsWfvsph963Y Y1kkh53nLblH/DL0vjjcxYLKFTWG2ENavtqu0YDAbpmAWSF91HpdIw5A9ksXepM6G9BQTb 2HvhpCJwB3kbezOlEomfoSNoAl56rhQ= Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id 5AA3DAE14; Wed, 10 Feb 2021 10:08:16 +0000 (UTC) Date: Wed, 10 Feb 2021 11:08:15 +0100 From: Michal Hocko To: Tim Chen Cc: Andrew Morton , Johannes Weiner , Vladimir Davydov , Dave Hansen , Ying Huang , linux-mm@kvack.org, cgroups@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH 3/3] mm: Fix missing mem cgroup soft limit tree updates Message-ID: References: <3b6e4e9aa8b3ee1466269baf23ed82d90a8f791c.1612902157.git.tim.c.chen@linux.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <3b6e4e9aa8b3ee1466269baf23ed82d90a8f791c.1612902157.git.tim.c.chen@linux.intel.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 09-02-21 12:29:47, Tim Chen wrote: > On a per node basis, the mem cgroup soft limit tree on each node tracks > how much a cgroup has exceeded its soft limit memory limit and sorts > the cgroup by its excess usage. On page release, the trees are not > updated right away, until we have gathered a batch of pages belonging to > the same cgroup. This reduces the frequency of updating the soft limit tree > and locking of the tree and associated cgroup. > > However, the batch of pages could contain pages from multiple nodes but > only the soft limit tree from one node would get updated. Change the > logic so that we update the tree in batch of pages, with each batch of > pages all in the same mem cgroup and memory node. An update is issued for > the batch of pages of a node collected till now whenever we encounter > a page belonging to a different node. I do agree with Johannes here. This shouldn't be done unconditionally for all memcgs. Wouldn't it be much better to do the fix up in the mem_cgroup_soft_reclaim path instead. Simply check the excess before doing any reclaim? Btw. have you seen this triggering a noticeable misbehaving? I would expect this to have a rather small effect considering how many sources of memcg_check_events we have. Unless I have missed something this has been introduced by 747db954cab6 ("mm: memcontrol: use page lists for uncharge batching"). Please add Fixes tag as well if this is really worth fixing. > Reviewed-by: Ying Huang > Signed-off-by: Tim Chen > --- > mm/memcontrol.c | 6 +++++- > 1 file changed, 5 insertions(+), 1 deletion(-) > > diff --git a/mm/memcontrol.c b/mm/memcontrol.c > index d72449eeb85a..f5a4a0e4e2ec 100644 > --- a/mm/memcontrol.c > +++ b/mm/memcontrol.c > @@ -6804,6 +6804,7 @@ struct uncharge_gather { > unsigned long pgpgout; > unsigned long nr_kmem; > struct page *dummy_page; > + int nid; > }; > > static inline void uncharge_gather_clear(struct uncharge_gather *ug) > @@ -6849,7 +6850,9 @@ static void uncharge_page(struct page *page, struct uncharge_gather *ug) > * exclusive access to the page. > */ > > - if (ug->memcg != page_memcg(page)) { > + if (ug->memcg != page_memcg(page) || > + /* uncharge batch update soft limit tree on a node basis */ > + (ug->dummy_page && ug->nid != page_to_nid(page))) { > if (ug->memcg) { > uncharge_batch(ug); > uncharge_gather_clear(ug); > @@ -6869,6 +6872,7 @@ static void uncharge_page(struct page *page, struct uncharge_gather *ug) > ug->pgpgout++; > > ug->dummy_page = page; > + ug->nid = page_to_nid(page); > page->memcg_data = 0; > css_put(&ug->memcg->css); > } > -- > 2.20.1 -- Michal Hocko SUSE Labs