From: Edward Chron
Date: Thu, 22 Aug 2019 07:47:40 -0700
Subject: Re: [PATCH] mm/oom: Add oom_score_adj value to oom Killed process message
To: Michal Hocko
Cc: David Rientjes, Andrew Morton, Roman Gushchin, Johannes Weiner, Tetsuo Handa, Shakeel Butt, linux-mm@kvack.org, linux-kernel@vger.kernel.org, Ivan Delalande
References: <20190821001445.32114-1-echron@arista.com> <20190821064732.GW3111@dhcp22.suse.cz> <20190822071544.GC12785@dhcp22.suse.cz>
In-Reply-To: <20190822071544.GC12785@dhcp22.suse.cz>

On Thu, Aug 22, 2019 at 12:15 AM Michal Hocko wrote:
>
> On Wed 21-08-19 15:22:07, Edward Chron wrote:
> > On Wed, Aug 21, 2019 at 12:19 AM David Rientjes wrote:
> > >
> > > On Wed, 21 Aug 2019, Michal Hocko wrote:
> > >
> > > > > vm.oom_dump_tasks is pretty useful, however, so it's curious why you
> > > > > haven't left it enabled :/
> > > >
> > > > Because it generates a lot of output potentially. Think of a workload
> > > > with too many tasks which is not uncommon.
> > >
> > > Probably better to always print all the info for the victim so we don't
> > > need to duplicate everything between dump_tasks() and dump_oom_summary().
> > >
> > > Edward, how about this?
> > >
> > > diff --git a/mm/oom_kill.c b/mm/oom_kill.c
> > > --- a/mm/oom_kill.c
> > > +++ b/mm/oom_kill.c
> > > @@ -420,11 +420,17 @@ static int dump_task(struct task_struct *p, void *arg)
> > >   * State information includes task's pid, uid, tgid, vm size, rss,
> > >   * pgtables_bytes, swapents, oom_score_adj value, and name.
> > >   */
> > > -static void dump_tasks(struct oom_control *oc)
> > > +static void dump_tasks(struct oom_control *oc, struct task_struct *victim)
> > >  {
> > >  	pr_info("Tasks state (memory values in pages):\n");
> > >  	pr_info("[ pid ] uid tgid total_vm rss pgtables_bytes swapents oom_score_adj name\n");
> > >
> > > +	/* If vm.oom_dump_tasks is disabled, only show the victim */
> > > +	if (!sysctl_oom_dump_tasks) {
> > > +		dump_task(victim, oc);
> > > +		return;
> > > +	}
> > > +
> > >  	if (is_memcg_oom(oc))
> > >  		mem_cgroup_scan_tasks(oc->memcg, dump_task, oc);
> > >  	else {
> > > @@ -465,8 +471,8 @@ static void dump_header(struct oom_control *oc, struct task_struct *p)
> > >  		if (is_dump_unreclaim_slabs())
> > >  			dump_unreclaimable_slab();
> > >  	}
> > > -	if (sysctl_oom_dump_tasks)
> > > -		dump_tasks(oc);
> > > +	if (p || sysctl_oom_dump_tasks)
> > > +		dump_tasks(oc, p);
> > >  	if (p)
> > >  		dump_oom_summary(oc, p);
> > >  }
> >
> > I would be willing to accept this, though as Michal mentions in his
> > post, it would be very helpful to have the oom_score_adj on the Killed
> > process message.
> >
> > One reason for that is that the Killed process message is the one
> > message that is printed with error priority (pr_err),
> > and so that message can be filtered out and sent to notify support
> > that an OOM event occurred.
> > Putting any information that can be shared in that message is useful
> > from my experience, as it is the initial point of triage for an OOM event.
> > Even if the full log with per user process is available, it is the
> > starting point for triage for an OOM event.
> >
> > So from my perspective I would be happy having both, with David's
> > proposal providing a bit of extra information as shown here:
> >
> > Jul 21 20:07:48 linuxserver kernel: [ pid ] uid tgid total_vm
> > rss pgtables_bytes swapents oom_score_adj name
> > Jul 21 20:07:48 linuxserver kernel: [ 547] 0 547 31664
> > 615 299008 0 0
> > systemd-journal
> >
> > The OOM Killed process message will print as:
> >
> > Jul 21 20:07:48 linuxserver kernel: Out of memory: Killed process 2826
> > (oomprocs) total-vm:1056800kB, anon-rss:1052784kB, file-rss:4kB,
> > shmem-rss:0kB oom_score_adj:1000
> >
> > But if only one output change is allowed I'd favor the Killed
> > process message since that can be singled out due to its print priority
> > and forwarded.
> >
> > By the way, right now there is redundancy in that the Killed process
> > message is printing vm, rss even if vm.oom_dump_tasks is enabled.
> > I don't see why that is a big deal.
>
> There will always be redundancy there because dump_tasks part is there
> mostly to check the oom victim decision for potential wrong/unexpected
> selection. While "killed..." message is there to inform who has been
> killed. Most people really do care about that part only.
>
> > It is very useful to have all the information that is there.
> > Wouldn't mind also having pgtables too but we would be able to get
> > that from the output of dump_task if that is enabled.
>
> I am not against adding pgtable information there. That memory is going
> to be released when the task dies.

Oh, thank you. I will include that in the updated patch, as it is useful
information.

>
> > If it is acceptable to also add the dump_task for the killed process
> > for !sysctl_oom_dump_tasks I can repost the patch including that as
> > well.
>
> Well, I would rather focus on adding the missing pieces to the killed
> task message instead.
>

Will do.

> --
> Michal Hocko
> SUSE Labs
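
For illustration only, here is a minimal userspace sketch of the triage step
described above: picking the fields, including oom_score_adj, out of a
forwarded "Killed process" line. It assumes the message format exactly as
quoted in this thread, which is a proposal and may differ from what a given
kernel prints. The names struct oom_kill_info and parse_oom_kill_line() are
hypothetical and not part of any kernel or library API. A task name that
itself contains ')' is not handled.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical container for the fields of the proposed message. */
struct oom_kill_info {
	int pid;
	char comm[64];
	unsigned long total_vm_kb;
	unsigned long anon_rss_kb;
	unsigned long file_rss_kb;
	unsigned long shmem_rss_kb;
	int oom_score_adj;
};

/*
 * Parse one log line in the format quoted in this thread (an assumption).
 * Any syslog prefix before "Killed process" is skipped.
 * Returns 0 on success, -1 if the line does not match.
 */
int parse_oom_kill_line(const char *line, struct oom_kill_info *info)
{
	const char *msg = strstr(line, "Killed process ");
	int n;

	if (!msg)
		return -1;

	/* A space in the format matches any run of whitespace, so a
	 * mail-wrapped line that was rejoined with spaces still parses. */
	n = sscanf(msg,
		   "Killed process %d (%63[^)]) total-vm:%lukB, anon-rss:%lukB, "
		   "file-rss:%lukB, shmem-rss:%lukB oom_score_adj:%d",
		   &info->pid, info->comm, &info->total_vm_kb,
		   &info->anon_rss_kb, &info->file_rss_kb,
		   &info->shmem_rss_kb, &info->oom_score_adj);

	/* All seven fields must be present, oom_score_adj included. */
	return n == 7 ? 0 : -1;
}
```

A forwarding script could call parse_oom_kill_line() on each error-priority
line it receives and attach info.oom_score_adj to the support notification.
A line in the older format without the oom_score_adj field returns -1.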