Received: by 2002:a25:868d:0:0:0:0:0 with SMTP id z13csp1370768ybk; Thu, 21 May 2020 05:26:03 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzrJYj5JDbTdkeqh/kdkh32TNKF61rXfVMMW9gwStc6uGtdXrCNIRwOdtFGRw106qAKFMoV X-Received: by 2002:a05:6402:948:: with SMTP id h8mr7303196edz.127.1590063963126; Thu, 21 May 2020 05:26:03 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1590063963; cv=none; d=google.com; s=arc-20160816; b=J8RMTujKElmdhBC9STIdUwCUoDyzidl2zj27THBPojEJoz4kbmPLz4DLZee/HiqU/T W5dT10MwmSDTGfl/4+bIaAme0EUBbfF0r/6hEzNTlUCFEI9qHfUFeVmAC5PZDDNEYuFX wCNrQg1D8rZ6osbT5wl3fHHUY2l0e0CWzVbly3qhziS7tuelzj8SOp3U7q5B4dOGs1Qm DDzxC3fUYOA2l2CdVar7Z01E792E+LSxtQR4+qy0OKpS80zwZV9KNB0D/jKIKUSXF8DB NN7erbmycbiCufC6ojvObwCz47pbSWfBNIBdn3RF3GKhbJhZuqkxVHkkO+L5nD/UpRfd PSRQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=uwOifwCrhhMbK5Uq7O2JD6i7Fke3H2izTnc0aJuCXEg=; b=xVftaRXn/XTRl9wziVW+pGnAJrAxopKagJJm/4ufsGvjNFgB199LHIXIcWeTYRW6Co pG7k/4j0QX4/vKNXI17WLiI+GK9SbNvr/kiX7Eag+0MLUjUjH11SM5cf1uB1VUvvH72W EoO0wSgeqyBcNNDYIhlXRKYZSa1wXUksdzQhcj/Szhr+iiKLG7utwaUfvPpQRquC6qZk /FheXOYnQo8VnN47hJvioNBb6LZ2W8sNh1jEM991iAwIgtSMx9c0YsoUOFFnMX+xvJuQ WFREZv83HLuOmOt+2AMwlHBFCkKx36p6wC/+ytfc1oFlnlOqvcv9j05H7vo1kV9Y4sdQ qj/w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@chrisdown.name header.s=google header.b=XPG2fmle; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=chrisdown.name Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id z16si3324037eja.213.2020.05.21.05.25.39; Thu, 21 May 2020 05:26:03 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@chrisdown.name header.s=google header.b=XPG2fmle; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=chrisdown.name Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729310AbgEUMXa (ORCPT + 99 others); Thu, 21 May 2020 08:23:30 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35178 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729304AbgEUMX3 (ORCPT ); Thu, 21 May 2020 08:23:29 -0400 Received: from mail-ed1-x543.google.com (mail-ed1-x543.google.com [IPv6:2a00:1450:4864:20::543]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 57770C061A0F for ; Thu, 21 May 2020 05:23:29 -0700 (PDT) Received: by mail-ed1-x543.google.com with SMTP id l25so6527837edj.4 for ; Thu, 21 May 2020 05:23:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=chrisdown.name; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=uwOifwCrhhMbK5Uq7O2JD6i7Fke3H2izTnc0aJuCXEg=; b=XPG2fmle0UHD0TI3rprNxDcmfUfFqTRmNpREToIIZtED2Tw1ccSUVY3s4nRvf2eeHI 6fyQbuYFMmtWFb/NIweeYDAxLIZQggq40/t2t6OcwndTJ9KfQbOi2cjxxCdmgouj8g5f PWs23lR8s68dqTVlmt13rZ3Mv+n9u0mXUNON0= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=uwOifwCrhhMbK5Uq7O2JD6i7Fke3H2izTnc0aJuCXEg=; b=Yf8VL8JIoUyGDgAkZb0V8eGaovj5zzHBmD4Gai1ulNAw+tMPPgyKhnvTcrUh9Kd3uk vUVPbZKdkggArRce/ll4PJpLnTymWCCG63rIkxugkXZyRCgRRbN5RDBj4lczA6rBTfN3 zaH95XPuQOQpZkYmuJfFKbI+zki42jV43uBKyCO3AvaucG6H2JAc4jJhXFpDT6QZGAt4 VcBa3vb5j5Q0hrA/H7CTxIIk66eZXBlJ4LL/fZGHoGUarTUUYXh0TG9QvEWr8hAi/Qkk NaBrFtWVR6RdEbM33M6wTBWswLPoozmAbUmb3UAhzLgtM7z3gfC8xZuzc1S6JSCyl758 w12A== X-Gm-Message-State: AOAM533KSn9U29LSLsfcSMX22wJBybJaollsuX2upEnwHeuevO8Za+Ht lswWy22QsIt3GJj+7orx/RnuUI9PxFG7VIoE X-Received: by 2002:a50:bf04:: with SMTP id f4mr7265805edk.91.1590063807770; Thu, 21 May 2020 05:23:27 -0700 (PDT) Received: from localhost ([2620:10d:c093:400::5:4262]) by smtp.gmail.com with ESMTPSA id h16sm4695618ejy.47.2020.05.21.05.23.27 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 21 May 2020 05:23:27 -0700 (PDT) Date: Thu, 21 May 2020 13:23:27 +0100 From: Chris Down To: Michal Hocko Cc: Andrew Morton , Johannes Weiner , Tejun Heo , linux-mm@kvack.org, cgroups@vger.kernel.org, linux-kernel@vger.kernel.org, kernel-team@fb.com Subject: Re: [PATCH] mm, memcg: reclaim more aggressively before high allocator throttling Message-ID: <20200521122327.GB990580@chrisdown.name> References: <20200520143712.GA749486@chrisdown.name> <20200520160756.GE6462@dhcp22.suse.cz> <20200520202650.GB558281@chrisdown.name> <20200521071929.GH6462@dhcp22.suse.cz> <20200521112711.GA990580@chrisdown.name> <20200521120455.GM6462@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Disposition: inline In-Reply-To: <20200521120455.GM6462@dhcp22.suse.cz> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org (I'll leave the dirty throttling discussion to Johannes, because I'm not so familiar with that code or its history.) Michal Hocko writes: >> > The main problem I see with that approach is that the loop could easily >> > lead to reclaim unfairness when a heavy producer which doesn't leave the >> > kernel (e.g. a large read/write call) can keep a different task doing >> > all the reclaim work. The loop is effectivelly unbound when there is a >> > reclaim progress and so the return to the userspace is by no means >> > proportional to the requested memory/charge. >> >> It's not unbound when there is reclaim progress, it stops when we are within >> the memory.high throttling grace period. Right after reclaim, we check if >> penalty_jiffies is less than 10ms, and abort and further reclaim or >> allocator throttling: > >Just imagine that you have parallel producers increasing the high limit >excess while somebody reclaims those. Sure in practice the loop will be >bounded but the reclaimer might perform much more work on behalf of >other tasks. A cgroup is a unit and breaking it down into "reclaim fairness" for individual tasks like this seems suspect to me. For example, if one task in a cgroup is leaking unreclaimable memory like crazy, everyone in that cgroup is going to be penalised by allocator throttling as a result, even if they aren't "responsible" for that reclaim. So the options here are as follows when a cgroup is over memory.high and a single reclaim isn't enough: 1. Decline further reclaim. Instead, throttle for up to 2 seconds. 2. Keep on reclaiming. Only throttle if we can't get back under memory.high. The outcome of your suggestion to decline further reclaim is case #1, which is significantly more practically "unfair" to that task. Throttling is extremely disruptive to tasks and should be a last resort when we've exhausted all other practical options. It shouldn't be something you get just because you didn't try to reclaim hard enough.