Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp813189ybv; Thu, 13 Feb 2020 10:00:20 -0800 (PST) X-Google-Smtp-Source: APXvYqwdG3NDnJgwE7xOqYixs6w+rX8M3N4G10A0rFkQGdAPb9MYekVhJ1tRzvKX51ZBzF5m6p0q X-Received: by 2002:a05:6830:9a:: with SMTP id a26mr14671917oto.273.1581616820737; Thu, 13 Feb 2020 10:00:20 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1581616820; cv=none; d=google.com; s=arc-20160816; b=B84A4O2rn9DFraY38GBCfwEuqfx8h7tvSmPDKAygv+gNDUwbfR35LnQ3mqGqFoGVwx JtuIx1ItGQSWIou8MoIhzM/jvwfneQH/3CZrPFEyEv7DTINCOr29ToHu3jeXaLPu7QFF PV9RKFLRhHEzpLVL85Cyokcyo6O1KSj8hPz1XEb+3RLCOWZpLyvVbnM3dOMg4r0mlSYv +BDKnUsduH0yLrNjdz9+MBai3YY99wAkrYXiMr1wC98KjZdozYjrlu2N2hrYJ2P83o3M fcq36a25nLNDjN3eQCuY8qikUCkEdotr3oA2mAMeYw+wRfixs8DRsEndUYyqKFzYYztC 3cpQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=+ipGsEe3/yOsYlV8o1qjdWtTtOYX3RrBOQJnaAHZ74E=; b=K3lXJB4w9xKO7CCA96DLz0y8g2rsq7PDPjjAZ/rHRE70ntY4sVPMLhJ8GlTVxEXHMD lqePQiVbI0kskBWWB0Zb+2Xn5Hpw+2kd3cc5x5HaYzobUoMfbcZm4XsGbWoL0Ihw2hvr SLcNbLvmOV4taVUM6q8yay49mHjdcUDasaZ67v7BVVzERbt8fXZzAWdx+6Nm9Kppvses TKNrNnDvwUpKJ4si79EVuE4cg1GhlGkKGyq9IuSb42x5Vfw0HwHRKBQ1DYBtA/qg3BtG P44q4XmHWf19lcBUdmEz/2mYsA65teL99htiBHYjtmJY//ZD14vb5SJslm8/O5uzcIoC 1R8g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@cmpxchg-org.20150623.gappssmtp.com header.s=20150623 header.b=yyrJ5+jL; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=cmpxchg.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u21si1368590otq.137.2020.02.13.10.00.07; Thu, 13 Feb 2020 10:00:20 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@cmpxchg-org.20150623.gappssmtp.com header.s=20150623 header.b=yyrJ5+jL; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=cmpxchg.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728006AbgBMR6R (ORCPT + 99 others); Thu, 13 Feb 2020 12:58:17 -0500 Received: from mail-qt1-f194.google.com ([209.85.160.194]:33505 "EHLO mail-qt1-f194.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727972AbgBMR6Q (ORCPT ); Thu, 13 Feb 2020 12:58:16 -0500 Received: by mail-qt1-f194.google.com with SMTP id d5so5079660qto.0 for ; Thu, 13 Feb 2020 09:58:14 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cmpxchg-org.20150623.gappssmtp.com; s=20150623; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=+ipGsEe3/yOsYlV8o1qjdWtTtOYX3RrBOQJnaAHZ74E=; b=yyrJ5+jL6U0zkaNRdTW4XuwU+6h5CQnk2Kna+wx5jv/oLlIDtuaHbYHVYKCkC3AygR YWYwMBVcOwka2f+bwSLH5UGQ8QqtDRjbEfmYqCzn5CXOf9r+l8xySGHsDT97EoNmJL+Z BFqyHHG97LmrX/nEHoQVEjni7ym6/0qgt0zC+8whWuuLAw+gyCIlEm/f28W3B6cMhgm3 ZdgG+MHFDtKu2Hgkht8OtjfuDsoa/6E1Pt+/dXISLj3RRI2FkFHI1wENiBIY6Mw6tfKq weY9CBjvOOr6U205IE39Kt35Lchk9QylIsO67BL1UYFSaY/d0BvAGrrR7FrOAY0TIXHX cQ7w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=+ipGsEe3/yOsYlV8o1qjdWtTtOYX3RrBOQJnaAHZ74E=; b=mXp1FSdyPWAykZlHESypU1YyvN2ElrAo2vX/Cv953WzubKLMDuF17dkvo55AutiotT nPQpW5A/eUGnLRiB1xq1KibrfZjDkkKHn+KZI37ehE2gAQg6Z7dZ6FIJaO/tCctNwRqi yVs7o/byG5SamkVSudxeauMD2hfgesd1Zl21gLsAAIrqP3C9C5mzsYB46FJPfa3sti3u XisPe4/YxgDEdKJiVuzpXEndViZ2oUoD1cg7Si1QoCR+gPG7CbNdvXpiGtE5FEuQxSIg 3D37bUzQljX6FcogYJmcXLkC93bBv9o7qs9mTFEJVFh/0zL4TtMz5zgggkJGMFvDu7uz CQTw== X-Gm-Message-State: APjAAAUCQlumt6KcH/ci4ukvzb04rYkoqKgQj3SYuPb7Jq/OF+YpGdSb zgnkbzXeLtxrNNsRrBu8e4xx1A== X-Received: by 2002:aed:25a4:: with SMTP id x33mr13058777qtc.165.1581616694352; Thu, 13 Feb 2020 09:58:14 -0800 (PST) Received: from localhost ([2620:10d:c091:500::d837]) by smtp.gmail.com with ESMTPSA id d9sm1826312qth.34.2020.02.13.09.58.13 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 13 Feb 2020 09:58:13 -0800 (PST) Date: Thu, 13 Feb 2020 12:58:13 -0500 From: Johannes Weiner To: Michal Hocko Cc: Andrew Morton , Roman Gushchin , Tejun Heo , linux-mm@kvack.org, cgroups@vger.kernel.org, linux-kernel@vger.kernel.org, kernel-team@fb.com Subject: Re: [PATCH v2 3/3] mm: memcontrol: recursive memory.low protection Message-ID: <20200213175813.GA216470@cmpxchg.org> References: <20191219200718.15696-1-hannes@cmpxchg.org> <20191219200718.15696-4-hannes@cmpxchg.org> <20200130170020.GZ24244@dhcp22.suse.cz> <20200203215201.GD6380@cmpxchg.org> <20200211164753.GQ10636@dhcp22.suse.cz> <20200212170826.GC180867@cmpxchg.org> <20200213074049.GA31689@dhcp22.suse.cz> <20200213132317.GA208501@cmpxchg.org> <20200213154627.GD31689@dhcp22.suse.cz> <20200213174135.GC208501@cmpxchg.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200213174135.GC208501@cmpxchg.org> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Feb 13, 2020 at 12:41:36PM -0500, Johannes Weiner wrote: > On Thu, Feb 13, 2020 at 04:46:27PM +0100, Michal Hocko wrote: > > On Thu 13-02-20 08:23:17, Johannes Weiner wrote: > > > On Thu, Feb 13, 2020 at 08:40:49AM +0100, Michal Hocko wrote: > > > > On Wed 12-02-20 12:08:26, Johannes Weiner wrote: > > > > > On Tue, Feb 11, 2020 at 05:47:53PM +0100, Michal Hocko wrote: > > > > > > Unless I am missing something then I am afraid it doesn't. Say you have a > > > > > > default systemd cgroup deployment (aka deeper cgroup hierarchy with > > > > > > slices and scopes) and now you want to grant a reclaim protection on a > > > > > > leaf cgroup (or even a whole slice that is not really important). All the > > > > > > hierarchy up the tree has the protection set to 0 by default, right? You > > > > > > simply cannot get that protection. You would need to configure the > > > > > > protection up the hierarchy and that is really cumbersome. > > > > > > > > > > Okay, I think I know what you mean. Let's say you have a tree like > > > > > this: > > > > > > > > > > A > > > > > / \ > > > > > B1 B2 > > > > > / \ \ > > > > > C1 C2 C3 > > > > So let's see how that works in practice, say a multi workload setup > > > > with a complex/deep cgroup hierachies (e.g. your above example). No > > > > delegation point this time. > > > > > > > > C1 asks for low=1G while using 500M, C3 low=100M using 80M. B1 and > > > > B2 are completely independent workloads and the same applies to C2 which > > > > doesn't ask for any protection at all? C2 uses 100M. Now the admin has > > > > to propagate protection upwards so B1 low=1G, B2 low=100M and A low=1G, > > > > right? Let's say we have a global reclaim due to external pressure that > > > > originates from outside of A hierarchy (it is not overcommited on the > > > > protection). > > > > > > > > Unless I miss something C2 would get a protection even though nobody > > > > asked for it. > > > > > > Good observation, but I think you spotted an unintentional side effect > > > of how I implemented the "floating protection" calculation rather than > > > a design problem. > > > > > > My patch still allows explicit downward propagation. So if B1 sets up > > > 1G, and C1 explicitly claims those 1G (low>=1G, usage>=1G), C2 does > > > NOT get any protection. There is no "floating" protection left in B1 > > > that could get to C2. > > > > Yeah, the saturated protection works reasonably AFAICS. > > Hm, Tejun raises a good point though: even if you could direct memory > protection down to one targeted leaf, you can't do the same with IO or > CPU. Those follow non-conserving weight distribution, and whatever you "work-conserving", obviously. > allocate to a certain level is available at that level - if one child > doesn't consume it, the other children can. > > And we know that controlling memory without controlling IO doesn't > really work in practice. The sibling with less memory allowance will > just page more. > > So the question becomes: is this even a legit usecase? If every other > resource is distributed on a level-by-level method already, does it > buy us anything to make memory work differently?