Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp68143imm; Tue, 24 Jul 2018 14:09:09 -0700 (PDT) X-Google-Smtp-Source: AAOMgpfdCY4DlX+6kGKcDZ0boVqW8FakSpmldeaxsC40WQui5ICAMIj4gh19cX+vgf+60qV98xNH X-Received: by 2002:a63:5c52:: with SMTP id n18-v6mr17627405pgm.360.1532466549465; Tue, 24 Jul 2018 14:09:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1532466549; cv=none; d=google.com; s=arc-20160816; b=D25OQtOu6K4yEuawK9tU3VxP7xIsMy9lSXctsJQDgGsC9gEwQQnn6COr4ctHdcRskL feOH1UxmGtaoQOEuXCD7/jh2NpuMTwVimv5BogUKLiAXZlnGA78pZ/i8qUl/yvhpCt+5 aDFt9F89rhMKKC6k2BFEHI0YrKNsydDA7D7EvOucOXl9eWR7UZZh6ocrC+rqAD6AjSBs 8x0JnqWgFJXSaeXwWowQBu92OTtQehzwYdw+4lPgVVKnHngNA7Fb9PWm7ujaC04peZnR jxZ8bV9XwP+d2IQrh6hWlsun98e+cMUYjYwYg9I3aplh2hQq0ELRvJOuRqSR9yz+3Zy/ db0Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:references :message-id:in-reply-to:subject:cc:to:from:date:dkim-signature :arc-authentication-results; bh=GRpMvOtQ2l4EWsyFMosWvTYZzcMc5iTWpAY5ngEzkuw=; b=zFpZ9mFoh0Hecf3l8Iv9pDSJAYWtGFhL3BQ9GWYslhdFKDjQFC0vIgU3lTAtdXQuE9 4HorL0D2ULfH50sYS0pXY5Lk3FODDvhKazJTezpxjsoKUP+HZBUq7oUmzh81Ul6YahxI 5PO9lkDWmdwX09bHl/UitLAZQhLFumOvpF6YZRBb3+XqWAJdbCTcCQ31MOTQCMNrst4l aEarXUQbyhxsFUCACQY7BPut86/Rcsdroi3QNkasGvtU92QUO2Wmbn0ePkIbEpBCWPDu n5PmIM1HVy5+vxmbBlJmbyGro7P2rDlhiXqlnRj7BXCWXO0p/zdQ60cAEiw+dpxZMwuf y6Dw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b="lsN5H/OF"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f21-v6si12295735pgl.235.2018.07.24.14.08.53; Tue, 24 Jul 2018 14:09:09 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b="lsN5H/OF"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2388872AbeGXWQL (ORCPT + 99 others); Tue, 24 Jul 2018 18:16:11 -0400 Received: from mail-pg1-f193.google.com ([209.85.215.193]:39339 "EHLO mail-pg1-f193.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2388823AbeGXWQL (ORCPT ); Tue, 24 Jul 2018 18:16:11 -0400 Received: by mail-pg1-f193.google.com with SMTP id g2-v6so3725836pgs.6 for ; Tue, 24 Jul 2018 14:07:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=date:from:to:cc:subject:in-reply-to:message-id:references :user-agent:mime-version; bh=GRpMvOtQ2l4EWsyFMosWvTYZzcMc5iTWpAY5ngEzkuw=; b=lsN5H/OFLNNc7MzqgLfTH21qJ+pljulOpQ/hF45IXmXn1DY7NDIpU30dGn2eVKVub4 vRSVLOFimGJINmhyswNmdfjhBrwRgL3WcjolU8B08hK/Jk66m1NX/ubbAXNpGginmciV rTLsNUC0khsINDJUTDptuvEGc7aoMx9mloM1xgdam3tMHCYodVGxPmXRId6r6URAMm8s zdgXgWE8frwNYyQBw5StdMr/v+csKcZHkCzZFjpRC++cAgQk/+CR/T7F2RACT3IgN4rj 3v5o+BZFD6LlPWVU5z/P7BYDaPtuuaj018EoVa2z0yORwLqadQdGJUk0vOacRad7PUIR zyuA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:in-reply-to:message-id :references:user-agent:mime-version; bh=GRpMvOtQ2l4EWsyFMosWvTYZzcMc5iTWpAY5ngEzkuw=; b=tNVxiMYGuxrog6rBOe5OvzcUQ6+KQ9WN8UGgW9JSDlHTyaz9X/RDyF+oFZM1GVVPAb tLGeFe58rLQCk8J9UFrI58TtvbynBpVQ8QzpMrhNXKgS+qF3P4boalg5oLx/kGitZfQL xNjGDuY3tuCgUDYBndUOMTDKjpKOL96+GCRuCfbEL88BBR9f2xNWW5bI1O4CeraJMDJ1 6mKoXJgtk8hzUz+XUNmZAYkIJyhAkxvzWRHw1872suJegttC83XDyNkhaEknuSssAjTE CCf6b2GeVQXo3vXF/H4OWEPkpgGe4F6qFhKuH1m1WD5jAOKYpRx4+N7nlCsB77Hv8v1d WQqQ== X-Gm-Message-State: AOUpUlEOy9Cq+Zd7cS0dcJaOWRlVBg5c+dwwyfbl8WIygzjBf9l7+yYX U01hcxgAvzYjMgVqpM6XZt1UVSA9UBc= X-Received: by 2002:a65:658d:: with SMTP id u13-v6mr17985311pgv.20.1532466470849; Tue, 24 Jul 2018 14:07:50 -0700 (PDT) Received: from [2620:15c:17:3:3a5:23a7:5e32:4598] ([2620:15c:17:3:3a5:23a7:5e32:4598]) by smtp.gmail.com with ESMTPSA id 84-v6sm34654729pfj.33.2018.07.24.14.07.49 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Tue, 24 Jul 2018 14:07:50 -0700 (PDT) Date: Tue, 24 Jul 2018 14:07:49 -0700 (PDT) From: David Rientjes X-X-Sender: rientjes@chino.kir.corp.google.com To: Michal Hocko cc: Andrew Morton , LKML , linux-mm@kvack.org, "David (ChunMing) Zhou" , Paolo Bonzini , =?UTF-8?Q?Radim_Kr=C4=8Dm=C3=A1=C5=99?= , Alex Deucher , David Airlie , Jani Nikula , Joonas Lahtinen , Rodrigo Vivi , Doug Ledford , Jason Gunthorpe , Mike Marciniszyn , Dennis Dalessandro , Sudeep Dutt , Ashutosh Dixit , Dimitri Sivanich , Boris Ostrovsky , Juergen Gross , =?UTF-8?Q?J=C3=A9r=C3=B4me_Glisse?= , Andrea Arcangeli , Felix Kuehling , kvm@vger.kernel.org, amd-gfx@lists.freedesktop.org, dri-devel@lists.freedesktop.org, intel-gfx@lists.freedesktop.org, linux-rdma@vger.kernel.org, xen-devel@lists.xenproject.org, =?UTF-8?Q?Christian_K=C3=B6nig?= , Leon Romanovsky Subject: Re: [PATCH] mm, oom: distinguish blockable mode for mmu notifiers In-Reply-To: <20180724141747.GP28386@dhcp22.suse.cz> Message-ID: References: <20180716115058.5559-1-mhocko@kernel.org> <20180720170902.d1137060c23802d55426aa03@linux-foundation.org> <20180724141747.GP28386@dhcp22.suse.cz> User-Agent: Alpine 2.21 (DEB 202 2017-01-01) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 24 Jul 2018, Michal Hocko wrote: > oom_reap_task_mm should return false when __oom_reap_task_mm return > false. This is what my patch did but it seems this changed by > http://www.ozlabs.org/~akpm/mmotm/broken-out/mm-oom-remove-oom_lock-from-oom_reaper.patch > so that one should be fixed. > > diff --git a/mm/oom_kill.c b/mm/oom_kill.c > index 104ef4a01a55..88657e018714 100644 > --- a/mm/oom_kill.c > +++ b/mm/oom_kill.c > @@ -565,7 +565,7 @@ static bool oom_reap_task_mm(struct task_struct *tsk, struct mm_struct *mm) > /* failed to reap part of the address space. Try again later */ > if (!__oom_reap_task_mm(mm)) { > up_read(&mm->mmap_sem); > - return true; > + return false; > } > > pr_info("oom_reaper: reaped process %d (%s), now anon-rss:%lukB, file-rss:%lukB, shmem-rss:%lukB\n", > > > On top of that the proposed cleanup looks as follows: > > diff --git a/mm/oom_kill.c b/mm/oom_kill.c > index 88657e018714..4e185a282b3d 100644 > --- a/mm/oom_kill.c > +++ b/mm/oom_kill.c > @@ -541,8 +541,16 @@ bool __oom_reap_task_mm(struct mm_struct *mm) > return ret; > } > > +/* > + * Reaps the address space of the give task. > + * > + * Returns true on success and false if none or part of the address space > + * has been reclaimed and the caller should retry later. > + */ > static bool oom_reap_task_mm(struct task_struct *tsk, struct mm_struct *mm) > { > + bool ret = true; > + > if (!down_read_trylock(&mm->mmap_sem)) { > trace_skip_task_reaping(tsk->pid); > return false; > @@ -555,28 +563,28 @@ static bool oom_reap_task_mm(struct task_struct *tsk, struct mm_struct *mm) > * down_write();up_write() cycle in exit_mmap(). > */ > if (test_bit(MMF_OOM_SKIP, &mm->flags)) { > - up_read(&mm->mmap_sem); > trace_skip_task_reaping(tsk->pid); > - return true; > + goto out_unlock; > } > > trace_start_task_reaping(tsk->pid); > > /* failed to reap part of the address space. Try again later */ > - if (!__oom_reap_task_mm(mm)) { > - up_read(&mm->mmap_sem); > - return false; > - } > + ret = __oom_reap_task_mm(mm); > + if (!ret) > + goto out_finish; > > pr_info("oom_reaper: reaped process %d (%s), now anon-rss:%lukB, file-rss:%lukB, shmem-rss:%lukB\n", > task_pid_nr(tsk), tsk->comm, > K(get_mm_counter(mm, MM_ANONPAGES)), > K(get_mm_counter(mm, MM_FILEPAGES)), > K(get_mm_counter(mm, MM_SHMEMPAGES))); > +out_finish: > + trace_finish_task_reaping(tsk->pid); > +out_unlock: > up_read(&mm->mmap_sem); > > - trace_finish_task_reaping(tsk->pid); > - return true; > + return ret; > } > > #define MAX_OOM_REAP_RETRIES 10 I think we still want to trace when reaping was skipped to know that the oom reaper will retry again later. mm/oom_kill.c: clean up oom_reap_task_mm() fix indicate reaping has been partially skipped so we can expect future skips or another start before finish. Signed-off-by: David Rientjes --- mm/oom_kill.c | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/mm/oom_kill.c b/mm/oom_kill.c --- a/mm/oom_kill.c +++ b/mm/oom_kill.c @@ -569,10 +569,12 @@ static bool oom_reap_task_mm(struct task_struct *tsk, struct mm_struct *mm) trace_start_task_reaping(tsk->pid); - /* failed to reap part of the address space. Try again later */ ret = __oom_reap_task_mm(mm); - if (!ret) + if (!ret) { + /* Failed to reap part of the address space. Try again later */ + trace_skip_task_reaping(tsk->pid); goto out_finish; + } pr_info("oom_reaper: reaped process %d (%s), now anon-rss:%lukB, file-rss:%lukB, shmem-rss:%lukB\n", task_pid_nr(tsk), tsk->comm,