Received: by 2002:a05:7412:37c9:b0:e2:908c:2ebd with SMTP id jz9csp89079rdb; Mon, 18 Sep 2023 08:58:58 -0700 (PDT) X-Google-Smtp-Source: AGHT+IF5gTXxeMHFrlvYPtQUYDH5b/WafyNpGEvH1o85pmfzWjJyhUdzRjC8kgmOiHcJgAXVswKD X-Received: by 2002:a05:6a20:a110:b0:152:cb38:5b47 with SMTP id q16-20020a056a20a11000b00152cb385b47mr11076206pzk.55.1695052738108; Mon, 18 Sep 2023 08:58:58 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1695052738; cv=none; d=google.com; s=arc-20160816; b=dUrbkjcIs+xGiC6gi9OnZYwQg8xUUWtoSdv3wiqD7LnrngFBQwRjnUxUUraKc9n4Ai 4g+bNH4s5cTE31R3Rxu7Bxdrh8IVTjoya9n+OpsuAZqGsUWf4Gdm7ofTSFj/9uW4WXxW i9hV1a7+g5utKjW5c4Rt+MqUBGS5P7tUd66cO2vs3BcODOO/QI/xvSt45bakFPANCp+r APlV0OvcERy/op4PXE9t+97SspOWD8PpNQjoF6oIMmE2cy/kyxUkfyEIlDHlnehlBobt rBTZPELAP3/xncbgso6AgM9CnqB4qBtY394iw4untBegPKJoaUHi4jsZk7CIGlXIHByl UaXQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :content-language:references:cc:to:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=SmPJUeRmGlrr7ZOL7yO07ODjPeIo7fJ0V72dQGfG3EE=; fh=uMPlVuGaIiBEcMFK8CcyZawuQTkhdT77YRu3WBtITNs=; b=TLpafrvml/njEsUkcNQHgsWeCGdFSE3sKd+L1JDxtrmBfyhmw+RsrLRjXoTLl86X+n rQwveqPSr4rTgstjOAVdBmMJY1hMcKTrureRofxlXcpeGEhmLIXfI4ivFBnYltD00/Rf DbTmwGh4QmtJCkxKpuogdfh7FaYyi8oOG0TYNlG35x1NY7YrfSUOfvvZlHx/R9GUgCY5 Vl/2a/Vdye8mLC42QqgbLy834H0I7CjgRGunqyGTUUTJWhzwahvMimowue4J5pqGrBkV AiabLWdsBk86sATbIU8DtA2UNmC8tLhWFBejwP25vqieQY4rvqgLpuvVEMHb8iBXKfe3 9Vpw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=TLz+wWb6; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.31 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Return-Path: Received: from morse.vger.email (morse.vger.email. [23.128.96.31]) by mx.google.com with ESMTPS id s11-20020a65584b000000b00578a26aee75si399511pgr.295.2023.09.18.08.58.57 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 18 Sep 2023 08:58:58 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.31 as permitted sender) client-ip=23.128.96.31; Authentication-Results: mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=TLz+wWb6; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.31 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by morse.vger.email (Postfix) with ESMTP id F169680A1371; Mon, 18 Sep 2023 00:53:31 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at morse.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240288AbjIRHxC (ORCPT + 99 others); Mon, 18 Sep 2023 03:53:02 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:57470 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240512AbjIRHwm (ORCPT ); Mon, 18 Sep 2023 03:52:42 -0400 Received: from mail-pf1-x42a.google.com (mail-pf1-x42a.google.com [IPv6:2607:f8b0:4864:20::42a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 513E31AA for ; Mon, 18 Sep 2023 00:48:50 -0700 (PDT) Received: by mail-pf1-x42a.google.com with SMTP id d2e1a72fcca58-68c576d35feso3881248b3a.2 for ; Mon, 18 Sep 2023 00:48:50 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance.com; s=google; t=1695023330; x=1695628130; darn=vger.kernel.org; h=content-transfer-encoding:in-reply-to:from:content-language :references:cc:to:subject:user-agent:mime-version:date:message-id :from:to:cc:subject:date:message-id:reply-to; bh=SmPJUeRmGlrr7ZOL7yO07ODjPeIo7fJ0V72dQGfG3EE=; b=TLz+wWb6QjPaC20tJdpt2w5/ODGLJkdRUMwkxqsbRXDhC6uVzpRTg8ZOkdmhvtF15X bsDBF6qIysEqpvjvD/Jv8fxX5ErSArs2NU+tJceWDmzWuuZMnmrmexrp6zBruChRmn8c o2/fSH1+xFmw0okvvW9yOtcSSmE9mpLbUxp9xnbPRC5thLJY9TtpHU5i84kQk0kj2oDY 4kBU2iNZAt589b2LFptEmUiOUyBKNjsN8tb+azEiYpyQRX0P17r4dyG8uQDxPhttATNI L8R03Wsebn4SJrCusJ5unBMz7gp08MBMDSYEV9QhLB4ryLefWFe3KhgEJV/eOwZDgVet iqwg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1695023330; x=1695628130; h=content-transfer-encoding:in-reply-to:from:content-language :references:cc:to:subject:user-agent:mime-version:date:message-id :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=SmPJUeRmGlrr7ZOL7yO07ODjPeIo7fJ0V72dQGfG3EE=; b=s5Io32io9kT+jb6MqAu74tDIRz3V6I0AO4inM9aznJlUC/a6zQBEzAuJJMW49JZZFX csCNHxZFQh7MokLNVBUPyu0AK1A+6fCQ1eW+PAmGVysO7rC0AR7UdKFnbZ7QdptXZSVn elO5KivKpRn/w5J40PCbsoeTnxniT/i/kyEqlM/czQVkXh1QE4BKQunCHE3jbTkW6SJw 4yK11L5xOmtO6wsLUD/YK4Gbe5IiGCxT/94TduZKymUxh3aVu0AT+EVdg/Gq66/C+rJJ 091Sor4oEOklbRUYad04SeMbmyEt4jjH8tO8zQSlnmaGwZtZwPjoJi8mmUBXh0gs7ZFO D2dg== X-Gm-Message-State: AOJu0YzqMygxjNayc6hTXY1Mzmklm+MDBJYn4SPYQtJXiCnXNx/J099K cWe9+A6rCau8vL9cs7VMht6FbQ== X-Received: by 2002:a05:6a20:6a1a:b0:137:a08b:8bef with SMTP id p26-20020a056a206a1a00b00137a08b8befmr10484697pzk.44.1695023330281; Mon, 18 Sep 2023 00:48:50 -0700 (PDT) Received: from [10.254.1.7] ([139.177.225.229]) by smtp.gmail.com with ESMTPSA id q8-20020a170902a3c800b001c474d6a408sm2238768plb.201.2023.09.18.00.48.42 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 18 Sep 2023 00:48:49 -0700 (PDT) Message-ID: <8c470d4d-b972-3f43-9b0a-712ee882a402@bytedance.com> Date: Mon, 18 Sep 2023 15:48:39 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.15.1 Subject: Re: [RFC PATCH net-next 3/3] sock: Throttle pressure-aware sockets under pressure To: "David S. Miller" , Eric Dumazet , Jakub Kicinski , Paolo Abeni , Andrew Morton , Shakeel Butt , Roman Gushchin , Michal Hocko , Johannes Weiner , Yosry Ahmed , Yu Zhao , "Matthew Wilcox (Oracle)" , Kefeng Wang , Yafang Shao , Kuniyuki Iwashima , Martin KaFai Lau , Breno Leitao , Alexander Mikhalitsyn , David Howells , Jason Xing Cc: open list , "open list:NETWORKING [GENERAL]" , "open list:MEMORY MANAGEMENT" References: <20230901062141.51972-1-wuyun.abel@bytedance.com> <20230901062141.51972-4-wuyun.abel@bytedance.com> Content-Language: en-US From: Abel Wu In-Reply-To: <20230901062141.51972-4-wuyun.abel@bytedance.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-2.3 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, NICE_REPLY_A,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on morse.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (morse.vger.email [0.0.0.0]); Mon, 18 Sep 2023 00:53:32 -0700 (PDT) On 9/1/23 2:21 PM, Abel Wu wrote: > @@ -3087,8 +3100,20 @@ int __sk_mem_raise_allocated(struct sock *sk, int size, int amt, int kind) > if (sk_has_memory_pressure(sk)) { > u64 alloc; > > - if (!sk_under_memory_pressure(sk)) > + /* Be more conservative if the socket's memcg (or its > + * parents) is under reclaim pressure, try to possibly > + * avoid further memstall. > + */ > + if (under_memcg_pressure) > + goto suppress_allocation; > + > + if (!sk_under_global_memory_pressure(sk)) > return 1; > + > + /* Trying to be fair among all the sockets of same > + * protocal under global memory pressure, by allowing > + * the ones that under average usage to raise. > + */ > alloc = sk_sockets_allocated_read_positive(sk); > if (sk_prot_mem_limits(sk, 2) > alloc * > sk_mem_pages(sk->sk_wmem_queued + I totally agree with what Shakeel said in last reply and will try ebpf- based solution to let userspace inject proper strategies. But IMHO the above hunk is irrelevant to the idea of this patchset, and is the right thing to do, so maybe worth a separate patch? This hunk originally passes the allocation when this socket is below average usage even under global and/or memcg pressure. It makes sense to do so under global pressure, as the 'average' is in the scope of global, but it's really weird from a memcg's point of view. Actually this pass condition was present before memcg pressure was introduced. Please correct me if I missed something, thanks! Best, Abel