Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp593571ybl; Thu, 22 Aug 2019 01:44:36 -0700 (PDT) X-Google-Smtp-Source: APXvYqz8senWUt4CO6TNACib//IU2FC1PfvVJDzW4NSewNLXbS7IwHLloEPSxx0L4hIh9s6uSNGM X-Received: by 2002:a17:90a:d598:: with SMTP id v24mr4398864pju.8.1566463476662; Thu, 22 Aug 2019 01:44:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1566463476; cv=none; d=google.com; s=arc-20160816; b=RuGtZQakJZneSolhojkFvcsKzQWbn8ljwOrWdJGpkTgA7cjpp+7pTPTfp43VxncFj/ l8RwnqfAPC3mUbmsvs0nQrIUC6vL9dXkOeJP6j8t/X4pGuzOh14cXBGT6Xr2eV7WYqvc vwQ6COTY8kEt8YC1Mvrd0k3QKBzKZxgZD4nHMbc8M1ye5EumunH3NAdVIkSlrgdvJ3rI eMozeFwq6dKlgaY6lW1/e2mSo+bgRAuBd8/YeLRpdeyBfTydiWYQJzadKaNUGnnqJq5o hdLoVwEkvK50NeBmxwZROL89yzH/N7Io+MdCBDNqZYjxikj+NGsFzLy4KL6SnkTJVup9 BB6A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=t1tknVyU6mJy/UfxuWlQahNDH32jDRp1ne+fmM+k4AE=; b=ZIRPbzKFRzEE1wR3QKUj5zeRKyqgMpzcmk84hAevvPNTxHRv0nYVNkWNMMWjXbE8By hXS2EqYIr7cdVDBVfhXw4+Y1mAIsrl0zA+oZGUppEh2c+biIfyAl5sT2ROz/iDmEWe0W K8IhDmaWL3gusTlQOcRcmsNmbEplhYN5H+urfwvGJsIJiF3TplpZ+GXt1a5CTD66eZGW PdiaNkFnIM9R4w44fni476auX9dgnsxjR+sp8QNyleFlgzbveJz7GNwihUwOSxQnggX0 VKHj71r2KSjmQTYDmpEj7xY0B4LSpj6aZVnqDssxLO/p0tiPSZdqPgp0hyhNV3kemsYb 9cUA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 142si16241519pgf.35.2019.08.22.01.44.21; Thu, 22 Aug 2019 01:44:36 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730733AbfHVHJX (ORCPT + 99 others); Thu, 22 Aug 2019 03:09:23 -0400 Received: from mx2.suse.de ([195.135.220.15]:59992 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1727401AbfHVHJW (ORCPT ); Thu, 22 Aug 2019 03:09:22 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 27E53AE00; Thu, 22 Aug 2019 07:09:20 +0000 (UTC) Date: Thu, 22 Aug 2019 09:09:19 +0200 From: Michal Hocko To: Edward Chron Cc: David Rientjes , Andrew Morton , Roman Gushchin , Johannes Weiner , Tetsuo Handa , Shakeel Butt , linux-mm@kvack.org, linux-kernel@vger.kernel.org, Ivan Delalande Subject: Re: [PATCH] mm/oom: Add oom_score_adj value to oom Killed process message Message-ID: <20190822070919.GB12785@dhcp22.suse.cz> References: <20190821001445.32114-1-echron@arista.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed 21-08-19 15:25:13, Edward Chron wrote: > On Tue, Aug 20, 2019 at 8:25 PM David Rientjes wrote: > > > > On Tue, 20 Aug 2019, Edward Chron wrote: > > > > > For an OOM event: print oom_score_adj value for the OOM Killed process to > > > document what the oom score adjust value was at the time the process was > > > OOM Killed. The adjustment value can be set by user code and it affects > > > the resulting oom_score so it is used to influence kill process selection. > > > > > > When eligible tasks are not printed (sysctl oom_dump_tasks = 0) printing > > > this value is the only documentation of the value for the process being > > > killed. Having this value on the Killed process message documents if a > > > miscconfiguration occurred or it can confirm that the oom_score_adj > > > value applies as expected. > > > > > > An example which illustates both misconfiguration and validation that > > > the oom_score_adj was applied as expected is: > > > > > > Aug 14 23:00:02 testserver kernel: Out of memory: Killed process 2692 > > > (systemd-udevd) total-vm:1056800kB, anon-rss:1052760kB, file-rss:4kB, > > > shmem-rss:0kB oom_score_adj:1000 > > > > > > The systemd-udevd is a critical system application that should have an > > > oom_score_adj of -1000. Here it was misconfigured to have a adjustment > > > of 1000 making it a highly favored OOM kill target process. The output > > > documents both the misconfiguration and the fact that the process > > > was correctly targeted by OOM due to the miconfiguration. Having > > > the oom_score_adj on the Killed message ensures that it is documented. > > > > > > Signed-off-by: Edward Chron > > > Acked-by: Michal Hocko > > > > Acked-by: David Rientjes > > > > vm.oom_dump_tasks is pretty useful, however, so it's curious why you > > haven't left it enabled :/ > > > > > diff --git a/mm/oom_kill.c b/mm/oom_kill.c > > > index eda2e2a0bdc6..c781f73b6cd6 100644 > > > --- a/mm/oom_kill.c > > > +++ b/mm/oom_kill.c > > > @@ -884,12 +884,13 @@ static void __oom_kill_process(struct task_struct *victim, const char *message) > > > */ > > > do_send_sig_info(SIGKILL, SEND_SIG_PRIV, victim, PIDTYPE_TGID); > > > mark_oom_victim(victim); > > > - pr_err("%s: Killed process %d (%s) total-vm:%lukB, anon-rss:%lukB, file-rss:%lukB, shmem-rss:%lukB\n", > > > + pr_err("%s: Killed process %d (%s) total-vm:%lukB, anon-rss:%lukB, file-rss:%lukB, shmem-rss:%lukB oom_score_adj:%ld\n", > > > message, task_pid_nr(victim), victim->comm, > > > K(victim->mm->total_vm), > > > K(get_mm_counter(victim->mm, MM_ANONPAGES)), > > > K(get_mm_counter(victim->mm, MM_FILEPAGES)), > > > - K(get_mm_counter(victim->mm, MM_SHMEMPAGES))); > > > + K(get_mm_counter(victim->mm, MM_SHMEMPAGES)), > > > + (long)victim->signal->oom_score_adj); > > > task_unlock(victim); > > > > > > /* > > > > Nit: why not just use %hd and avoid the cast to long? > > Sorry I may have accidently top posted my response to this. Here is > where my response should go: > ----------------------------------------------------------------------------------------------------------------------------------- > > Good point, I can post this with your correction. > > I will add your Acked-by: David Rientjes > > I am adding your Acked-by to the revised patch as this is what Michal > asked me to do (so I assume that is what I should do). > > Should I post as a separate fix again or simply post here? Andrew usually folds these small fixups automagically. If that doesn't happen here for some reason then just repost with acks and the fixup. Thanks! -- Michal Hocko SUSE Labs