Received: by 2002:a05:6a10:2726:0:0:0:0 with SMTP id ib38csp3538729pxb; Mon, 4 Apr 2022 20:27:07 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxGr90swPJADAwmHOwArMLZndGbNMmnAeW3IKnAPBk0umSqxI3EsbS0hQKPADZ5y8vBG3hX X-Received: by 2002:a63:794c:0:b0:398:2f41:4b0f with SMTP id u73-20020a63794c000000b003982f414b0fmr1124241pgc.448.1649129227817; Mon, 04 Apr 2022 20:27:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1649129227; cv=none; d=google.com; s=arc-20160816; b=hjXBBpylRYNeKzV66lvBxAGVq8yr8hjIhpNmdoQApFRXMXJBOSBc3ATYdphz1f57t3 9HstIruucG2iIi85YMFw9/eBG0j7XpIoVUiXkap5A2BTWvkNuK6zHLvm5gINKhaY3rDN +vO0VHSzWVnvKzLKDLTRDiPyJu8aI3DiSBDGxmsgXPbPB/vt+ZUMo99Cgj9p/Rk2BuJp 3UqpHa9EZ2/3tiahZBTJRTFvc90kSzKxu1QZDCZ21KpBenoxhhzaj+KjY84VT5gevj5I qI4I2PGC0eKS7MUATWz2nOTtN3ii+0CNMQEQT7IPAJYPE0KEYDj0zxxraASPjj0ffSOC /wNQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=LpYoL6AgjzJZ4tpGv3/xyzDAd+WQcpBXKtpN+zEEZ6M=; b=JZ2Xsu8jH3O/yY2x2K81CD4+Q4SDAZbxzNmpXLtxp0YyIGJTwOjO3kbl3vuROkuVty lC08nTL/KDn5t35pszZmscEHLmvCrHL9mtFuidG9sGixusHx+WwIPPSdQ6Yj/aFUnqal N/DATeHj70vK51H0uZtxn6+dnzcTEVXP/CAziaXxZ8GpR+NR/sEWzypDKcWE46XTvlKi G2w6Bkzc/mZndzX79W5rpPFZHZWJWo3FdGyZcRj7xGom5aTXocFcJyjLn54AeC22FR3V W60Y1WimozPAPlbFPNeh/UviMkZebchQqkTmNTMirCqsOBEYJLgM4Y7lMgcvHMIjZFxo ncTg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=URZHSoRi; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=suse.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id z23-20020a17090a541700b001c6454edecasi710421pjh.152.2022.04.04.20.27.07 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 04 Apr 2022 20:27:07 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=URZHSoRi; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=suse.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 78A26312B4E; Mon, 4 Apr 2022 18:26:46 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1354605AbiDDJip (ORCPT + 99 others); Mon, 4 Apr 2022 05:38:45 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42934 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1354650AbiDDJil (ORCPT ); Mon, 4 Apr 2022 05:38:41 -0400 Received: from smtp-out2.suse.de (smtp-out2.suse.de [195.135.220.29]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5B73D3BFBC; Mon, 4 Apr 2022 02:36:36 -0700 (PDT) Received: from relay2.suse.de (relay2.suse.de [149.44.160.134]) by smtp-out2.suse.de (Postfix) with ESMTP id 0F7091F37E; Mon, 4 Apr 2022 09:36:35 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=susede1; t=1649064995; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=LpYoL6AgjzJZ4tpGv3/xyzDAd+WQcpBXKtpN+zEEZ6M=; b=URZHSoRiL8o7b+yMQLyfEwpcnDVBQxC25DaiTXuXyRDL0IVDsBLTG0R5+BoVe4h2RwHe+8 846rB9SKsGl/yDW1VzKHMBpdDywbpp7qqlqIszK75ScfGaGQ/nc4UbHX9VFyfjHiaxZssh AyAChWqakgt161TVz0sPrchh0q2dSMg= Received: from suse.cz (unknown [10.100.201.86]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by relay2.suse.de (Postfix) with ESMTPS id CC8C7A3B92; Mon, 4 Apr 2022 09:36:34 +0000 (UTC) Date: Mon, 4 Apr 2022 11:36:33 +0200 From: Michal Hocko To: Zhaoyang Huang Cc: Suren Baghdasaryan , "zhaoyang.huang" , Andrew Morton , Johannes Weiner , Vladimir Davydov , "open list:MEMORY MANAGEMENT" , LKML , cgroups mailinglist , Ke Wang Subject: Re: [RFC PATCH] cgroup: introduce dynamic protection for memcg Message-ID: References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon 04-04-22 11:32:28, Michal Hocko wrote: > On Mon 04-04-22 17:23:43, Zhaoyang Huang wrote: > > On Mon, Apr 4, 2022 at 5:07 PM Zhaoyang Huang wrote: > > > > > > On Mon, Apr 4, 2022 at 4:51 PM Michal Hocko wrote: > > > > > > > > On Mon 04-04-22 10:33:58, Zhaoyang Huang wrote: > > > > [...] > > > > > > One thing that I don't understand in this approach is: why memory.low > > > > > > should depend on the system's memory pressure. It seems you want to > > > > > > allow a process to allocate more when memory pressure is high. That is > > > > > > very counter-intuitive to me. Could you please explain the underlying > > > > > > logic of why this is the right thing to do, without going into > > > > > > technical details? > > > > > What I want to achieve is make memory.low be positive correlation with > > > > > timing and negative to memory pressure, which means the protected > > > > > memcg should lower its protection(via lower memcg.low) for helping > > > > > system's memory pressure when it's high. > > > > > > > > I have to say this is still very confusing to me. The low limit is a > > > > protection against external (e.g. global) memory pressure. Decreasing > > > > the protection based on the external pressure sounds like it goes right > > > > against the purpose of the knob. I can see reasons to update protection > > > > based on refaults or other metrics from the userspace but I still do not > > > > see how this is a good auto-magic tuning done by the kernel. > > > > > > > > > The concept behind is memcg's > > > > > fault back of dropped memory is less important than system's latency > > > > > on high memory pressure. > > > > > > > > Can you give some specific examples? > > > For both of the above two comments, please refer to the latest test > > > result in Patchv2 I have sent. I prefer to name my change as focus > > > transfer under pressure as protected memcg is the focus when system's > > > memory pressure is low which will reclaim from root, this is not > > > against current design. However, when global memory pressure is high, > > > then the focus has to be changed to the whole system, because it > > > doesn't make sense to let the protected memcg out of everybody, it > > > can't > > > do anything when the system is trapped in the kernel with reclaiming work. > > Does it make more sense if I describe the change as memcg will be > > protect long as system pressure is under the threshold(partially > > coherent with current design) and will sacrifice the memcg if pressure > > is over the threshold(added change) > > No, not really. For one it is still really unclear why there should be any > difference in the semantic between global and external memory pressure > in general. The low limit is always a protection from the external > pressure. And what should be the actual threshold? Amount of the reclaim > performed, effectivness of the reclaim or what? Btw. you might want to have a look at http://lkml.kernel.org/r/20220331084151.2600229-1-yosryahmed@google.com where a new interface to allow pro-active memory reclaim is discussed. I think that this might turn out to be a better fit then an automagic kernel manipulation with a low limit. It will require a user agent to drive the reclaim though. -- Michal Hocko SUSE Labs