From: Ami Fischman
Date: Tue, 17 Mar 2020 12:00:45 -0700
Subject: Re: [patch] mm, oom: make a last minute check to prevent unnecessary memcg oom kills
To: Robert Kolchmeyer
Cc: David Rientjes, Michal Hocko, Andrew Morton, Vlastimil Babka, linux-kernel@vger.kernel.org, linux-mm@kvack.org
References: <20200310221938.GF8447@dhcp22.suse.cz>

On Tue, Mar 17, 2020 at 11:26 AM Robert Kolchmeyer wrote:
>
> On Tue, Mar 10, 2020 at 3:54 PM David Rientjes wrote:
> >
> > Robert, could you elaborate on the user-visible effects of this issue that
> > caused it to initially get reported?
>
> Ami (now cc'ed) knows more, but here is my understanding.

Robert's description of the mechanics we observed is accurate.

We discovered this regression in the oom-killer's behavior when
attempting to upgrade our system. The fraction of the system that went
unhealthy due to this issue was approximately equal to the _sum_ of all
other causes of unhealth, which are many and varied, but each of which
contribute only a small amount of unhealth. This issue forced a
rollback to the previous kernel where we ~never see this behavior,
returning our unhealth levels to the previous background levels.

Cheers,
-a