Received: by 2002:a05:6a10:eb17:0:0:0:0 with SMTP id hx23csp1510181pxb; Fri, 10 Sep 2021 07:31:14 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzOpKiriO8xlEJXvGNW8hDMigMEb7YM44kNUECM8k92MAK/XsjiIqUTWxXrs6VPsIIY1b07 X-Received: by 2002:a50:d713:: with SMTP id t19mr9282986edi.2.1631284274497; Fri, 10 Sep 2021 07:31:14 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1631284274; cv=none; d=google.com; s=arc-20160816; b=h3WSdx3U5Wn8mdrJ9rYcvcIzBcwpwrGF3OW5ovRj14KBIWZADhcGIhpiBH4igTzDe+ 02sA0908kNyPKyQqMk283qbF9Yu7vPcJN0fTy/Yg9VP39d3YRq8++baNCtY6viK237Dy 2VYI4mGjTmj3lryg+OIYmnDCe+qA3iD34GMdBG6QMNOrOt87i2RHhiq+dUurzVJCYej6 HHGo9O5Cyj6foS+5hMntrZiwF0gPbFpVjCINRzg5UL4x6ODGYZhi7GM2RGsj+OZMa+hu zGX1BBGeIz1al4hnCElUhMsiuIOdxm5X3s57WfyO1k+fXbD9Sdxmw+oMxZ2VDlY7cVr8 pBtQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:dkim-signature; bh=o86Ela2dKbz2GzHznXe/g7ScOOJeqlE0Zm5503NyBDQ=; b=TajqVyA5YiP5nhUOWpPbiv9sRvq3kCniJPJma9AL4ZLM2ojtYfVy62KKbUv2rmayJQ x4ZJW6EZH/78qvHVFO+hmaCoFpyjZKeDTkQ3UtZp51pWyH8Uzl/5F35ZqWqVBT0Vz61k kyBB2RMX9WjMBjbsRNCqEH2VRtQFcn+j2nSrdlPijzpcPBLYvEn4ZglHXjgnG0p1dvZp p3Mq3TXOFn//69lqer7DJMHzRZEINlR6oWxUXx+sstL3snRHsZ1fua3C6B9eTyuBDU8Z 1AMm/GP+idlCXm4g29KC0YWhmk/YdgobdQUaVFjlMY13kl+j13xg6ASTJYUenlcSevzO +v5A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=Bxn804R0; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id m3si5128991edq.359.2021.09.10.07.30.45; Fri, 10 Sep 2021 07:31:14 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=Bxn804R0; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233943AbhIJOah (ORCPT + 99 others); Fri, 10 Sep 2021 10:30:37 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:57430 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233120AbhIJOag (ORCPT ); Fri, 10 Sep 2021 10:30:36 -0400 Received: from mail-pl1-x633.google.com (mail-pl1-x633.google.com [IPv6:2607:f8b0:4864:20::633]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6632DC061574; Fri, 10 Sep 2021 07:29:25 -0700 (PDT) Received: by mail-pl1-x633.google.com with SMTP id d17so1245449plr.12; Fri, 10 Sep 2021 07:29:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=o86Ela2dKbz2GzHznXe/g7ScOOJeqlE0Zm5503NyBDQ=; b=Bxn804R0i4B8zYlx7gxgC59R+MuM6Dyg5UPT8+rmC10z0AGCD6J3RNEhD4t8PqcsUy XjMywQShlJsNUa+mJ9R3hTGfLxIvIo07GAicJZuvELS7AJouCi2m6hd1QT85YqLEVt3P XXQ9urfO7RBljpYSWvgX9x2iQRIUdVRgj0X77MuEt+hZ/M168SMbUg/Y5X/r+BBhbaSo V4w3Y1lCO9+JsoJHPpuhPazv6MIKsYMSV0vY+xb/4+r2sN927yPTM5HUm5qXYPk881E2 dIWPDgI/Ph4IXh9dIIRKZ8jG0g/FhwDO8RHAg45Co+dMRiaSOo7umTTON+QCRPQJZ3Sq ZVcQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=o86Ela2dKbz2GzHznXe/g7ScOOJeqlE0Zm5503NyBDQ=; b=XOqeU5aFyZX4rMXX/vZK6tDA61/qV1B3D3CipjHAFc41W39+asr+t0a0a/RDJSVZdk S5rBTGtLBnOaFmePXk373EtJWqF5xNDqNbiPY+nG6PvHSLrRDZIZJoglGFZl+GpP1sAS fbwWxSh3MsmquS1zKgnMnYCH0ZNtKXhcMQmv5P7h3W+w+1C+PmdeuQNWLV8OpSzn+sqp h8q2uRiL8xW+Ey9i7+1uPCryGGQxe3SxuPb4Litl4O1Ls6n/SuQ7p67VDRlqtbDr1bkS 3ySF5jT4QvbzvCBYDb0re47wNnu4LiiWvvKEUPO++GDgDE5RnuiLdMykYl5NlMWddeQV sTUw== X-Gm-Message-State: AOAM533YJq3FyHTYZyf7spEpMkJzzctBGYAPqVEu2OLtIh93aSF3eeJ9 VD1OQuqezNIXI4PIImppMF0g+ULd1ew= X-Received: by 2002:a17:902:e5cb:b0:13a:f7fd:cba2 with SMTP id u11-20020a170902e5cb00b0013af7fdcba2mr3855598plf.85.1631284164660; Fri, 10 Sep 2021 07:29:24 -0700 (PDT) Received: from [192.168.255.10] ([203.205.141.116]) by smtp.gmail.com with ESMTPSA id s17sm5134942pfg.4.2021.09.10.07.29.22 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 10 Sep 2021 07:29:24 -0700 (PDT) Subject: Re: [RFC PATCH 3/3] misc_cgroup: remove error log to avoid log flood To: =?UTF-8?Q?Michal_Koutn=c3=bd?= Cc: Vipin Sharma , tj@kernel.org, lizefan.x@bytedance.com, hannes@cmpxchg.org, linux-kernel@vger.kernel.org, cgroups@vger.kernel.org References: <988f340462a1a3c62b7dc2c64ceb89a4c0a00552.1631077837.git.brookxu@tencent.com> <86e89df640f2b4a65dd77bdbab8152fa8e8f5bf1.1631077837.git.brookxu@tencent.com> <20210909143720.GA14709@blackbody.suse.cz> <478e986c-bc69-62b8-936e-5b075f9270b4@gmail.com> <20210910092310.GA18084@blackbody.suse.cz> From: "brookxu.cn" Message-ID: <1679f995-5a6f-11b8-7870-54318db07d0d@gmail.com> Date: Fri, 10 Sep 2021 22:29:21 +0800 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:78.0) Gecko/20100101 Thunderbird/78.13.0 MIME-Version: 1.0 In-Reply-To: <20210910092310.GA18084@blackbody.suse.cz> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Thanks for your time. On 2021/9/10 5:23 PM, Michal Koutný wrote: > On Fri, Sep 10, 2021 at 01:30:46PM +0800, brookxu wrote: >> I am a bit confused here. For misc_cgroup, we can only be rejected when the count >> touch Limit, but there may be other more reasons for other subsystems. > > Sorry, I wasn't clear about that -- the failures I meant to be counted > here were only the ones caused by (an ancestor) limit. Maybe there's a > better naem for that. > >> Therefore, when we are rejected, does it mean that we have touch >> Limit? If so, do we still need to distinguish between max and fail? >> (for misc_cgroup) > > r > `- c1 > `- c2.max > `- c3 > `- c4.max > `- task t > `- c5 > > Assuming c2.max < c4.max, when a task t calls try_charge and it fails > because of c2.max, then the 'max' event is counted to c2 (telling that > the limit is perhaps low) and the 'fail' event is counted to c4 (telling > you where the troubles originated). That is my idea. Although in the > case of short-lived cgroups, you'd likely only get the hierarchically > aggregated 'fail' events from c3 or higher with lower (spatial) > precision. > What would be the type of information useful for your troubleshooting? Through events and events.local, we can determine which node has insufficient resources. For example, when the ‘events’ is large, we traverse down and use events.local to determine which node has insufficient resources. 'fail' counter does not seem to provide more effective information in this regard. When 'fail' is big, it seems that we still need to use events and events.local to determine the node of insufficient resources. I am not very sure what details can we learn through 'fail' counter. > > Cheers, > Michal >