Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755557AbdIGQPP (ORCPT ); Thu, 7 Sep 2017 12:15:15 -0400 Received: from gum.cmpxchg.org ([85.214.110.215]:54364 "EHLO gum.cmpxchg.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755334AbdIGQPN (ORCPT ); Thu, 7 Sep 2017 12:15:13 -0400 Date: Thu, 7 Sep 2017 12:14:57 -0400 From: Johannes Weiner To: Michal Hocko Cc: Roman Gushchin , linux-mm@kvack.org, Vladimir Davydov , Tetsuo Handa , David Rientjes , Andrew Morton , Tejun Heo , kernel-team@fb.com, cgroups@vger.kernel.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [v7 5/5] mm, oom: cgroup v2 mount option to disable cgroup-aware OOM killer Message-ID: <20170907161457.GA1728@cmpxchg.org> References: <20170904142108.7165-1-guro@fb.com> <20170904142108.7165-6-guro@fb.com> <20170905134412.qdvqcfhvbdzmarna@dhcp22.suse.cz> <20170905215344.GA27427@cmpxchg.org> <20170906082859.qlqenftxuib64j35@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20170906082859.qlqenftxuib64j35@dhcp22.suse.cz> User-Agent: Mutt/1.8.3 (2017-05-23) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2272 Lines: 44 On Wed, Sep 06, 2017 at 10:28:59AM +0200, Michal Hocko wrote: > On Tue 05-09-17 17:53:44, Johannes Weiner wrote: > > The cgroup-awareness in the OOM killer is exactly the same thing. It > > should have been the default from the beginning, because the user > > configures a group of tasks to be an interdependent, terminal unit of > > memory consumption, and it's undesirable for the OOM killer to ignore > > this intention and compare members across these boundaries. > > I would agree if that was true in general. I can completely see how the > cgroup awareness is useful in e.g. containerized environments (especially > with kill-all enabled) but memcgs are used in a large variety of > usecases and I cannot really say all of them really demand the new > semantic. Say I have a workload which doesn't want to see reclaim > interference from others on the same machine. Why should I kill a > process from that particular memcg just because it is the largest one > when there is a memory hog/leak outside of this memcg? Sure, it's always possible to come up with a config for which this isn't the optimal behavior. But this is about picking a default that makes sense to most users, and that type of cgroup usage just isn't the common case. > From my point of view the safest (in a sense of the least surprise) > way to go with opt-in for the new heuristic. I am pretty sure all who > would benefit from the new behavior will enable it while others will not > regress in unexpected way. This thinking simply needs to be balanced against the need to make an unsurprising and consistent final interface. The current behavior breaks isolation by letting tasks in different cgroups compete with each other during an OOM kill. While you can rightfully argue that it's possible for usecases to rely on this, you cannot tell me that this is the least-surprising thing we can offer users; certainly not new users, but also not many/most existing ones. > We can talk about the way _how_ to control these oom strategies, of > course. But I would be really reluctant to change the default which is > used for years and people got used to it. I really doubt there are many cgroup users that rely on that particular global OOM behavior. We have to agree to disagree, I guess.