Received: by 2002:a05:6602:18e:0:0:0:0 with SMTP id m14csp5491730ioo; Wed, 1 Jun 2022 06:43:33 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxYK0skXcGebGFJcMUPbNxZyg6SZqMEBov/1zWn4ARG9kLWqmmI2ND41iZ7hLkIHGVWmPJ7 X-Received: by 2002:a05:6402:42c6:b0:42d:ed84:6fe0 with SMTP id i6-20020a05640242c600b0042ded846fe0mr7258485edc.58.1654091013287; Wed, 01 Jun 2022 06:43:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1654091013; cv=none; d=google.com; s=arc-20160816; b=1DhohUuUpRhonLlhEBbPnpIBRAHV32Z+xLnvyrG1utyBGHIFhNxKkgJPoKBYRRRA+L Sajz9x+86MwSECakpXWpzrFZ+VYIFUayLzm08WZFQGl1vI3qDG6Zctwea3hD+6hCTwQn ncEtPyoVRvscFTp3s01YnhuyrxDK6Zh98vHol2z1x3fnmu66qhGQL6qsvGZL8M67kq2k QSmkM4aX7ikntVyASYdKrqFo2M+Etzj3apZc7uB01eAXkYlor/Mn3mY0QL6m8uTAxhKX 03n8Beo+6hknA8Em52lutzBLZDwQfNDwOdpCyHtqUD1iFpZrL0TIJYUwMZzauUyoziJi +wrA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=ynHr+f3HdbTV3fIx5fiKz75mjaU0slPzyjQUZFWsNNE=; b=Z8yPMu4Icd6F0wYImAR3xbcL7jeqUXCp8a5Jf8v2+ebqtlqC3wF7GlmymF4L+NPyqQ K1F7xfxbZuBeZKShawPBhZMfGDDs+u5Yu0AiPEdYhfI4vOlo1k8p6ayRffkAipmncwK+ bxsKh0QJgqOWBiewO8kDiIO2ET4T4pw2w/FzwDlCE8T2yEKCsTZ3HysuAWrmOVNgLBbl Y9GUNYzNYmX0hD092VdVb+3xmOtYz9D8wo+jT3wnhXcfbK6arQoGqOHSy+SmA0eKEB+9 af7dMo8dk3yT3X5hy5qV8sDdg4PfCp3e2VS4NMsmmo61bafVksnNv+a3C85GE2egSIUO WqAA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@bytedance-com.20210112.gappssmtp.com header.s=20210112 header.b=3BSBjnH3; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=bytedance.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id t12-20020a170906a10c00b00702d8adf775si1733319ejy.169.2022.06.01.06.43.05; Wed, 01 Jun 2022 06:43:33 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@bytedance-com.20210112.gappssmtp.com header.s=20210112 header.b=3BSBjnH3; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=bytedance.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S244514AbiEaH3k (ORCPT + 99 others); Tue, 31 May 2022 03:29:40 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39490 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S244507AbiEaH3i (ORCPT ); Tue, 31 May 2022 03:29:38 -0400 Received: from mail-pf1-x435.google.com (mail-pf1-x435.google.com [IPv6:2607:f8b0:4864:20::435]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 7D4EF92D2B for ; Tue, 31 May 2022 00:29:36 -0700 (PDT) Received: by mail-pf1-x435.google.com with SMTP id i18so12429426pfk.7 for ; Tue, 31 May 2022 00:29:36 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance-com.20210112.gappssmtp.com; s=20210112; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=ynHr+f3HdbTV3fIx5fiKz75mjaU0slPzyjQUZFWsNNE=; b=3BSBjnH3ZVmWwEIVHzHXEhghdgZpFnAo+yvYtAzpgRfIXkbyBVhTXgf5d/xjNpaQHS sz0UFZvQzv5e5HRgAt9iGDSpGTZx9tTCiezIhnMbEdJOAJ7K+A0IMSH0rp5jdGF1tm8K EIrFKWV2qoxlzowMCVlAVqGHTETYK99rGlF5k+vSuVBHe/TdQ3E8MnxKfl9JfYEjOtJH gnrlRHJsGlcnnMfhwsuKx9xFyDr/GEvdyf13nvjHmdiDKABo1mgxQFAJZxSAAZaNhZhd NeqvDimuZDLbss4uGUDTImQXiGHwgXTn0NpL9vNjO1nvo7gCeAaXNnUrMKHbLulytSh2 JmIg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=ynHr+f3HdbTV3fIx5fiKz75mjaU0slPzyjQUZFWsNNE=; b=59fCqR5e79NhWqGJ2tR3nwR1lL58ipGQ1lyJXQd7sJSuBHzbln3rG/ekLolfmBCIJt cnw+HlTFpcxLmU+Ueft4homgQ1aZL1B5se3B9fxRPwDazAKjrJhL1p+JZJ/8InGlFv7c wea6tqNuZwWb81y7pDmQsT0pvi2Rg468LvN15q2LcVwAbc0kB6XLYCvvDXt/djBqvCuG h+WTaUQ7DTv+A3xutQS48EWSDfwsG/YleWnnb/7Zpgd2ahM3DmE+z6AyFngrGmMnmIuQ JblyIcBcftr06SYR8ezcwxBfJ4lEBp73bARia9KB9+/kuO7Rsd8JXUdSd7eciUst/9r4 VQ0w== X-Gm-Message-State: AOAM531zi8irc3ln8hK+93sLSneGzwBzuWoQGoRA+bisDtdECbdU4oqA g/CPrql+3vRIqPnrJeOH8DrNgg== X-Received: by 2002:a63:7156:0:b0:3fb:fa23:480e with SMTP id b22-20020a637156000000b003fbfa23480emr9579581pgn.553.1653982176034; Tue, 31 May 2022 00:29:36 -0700 (PDT) Received: from localhost ([2408:8207:18da:2310:94c7:fca6:824f:4dab]) by smtp.gmail.com with ESMTPSA id j190-20020a6380c7000000b003fbea5453c5sm4901384pgd.9.2022.05.31.00.29.34 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 31 May 2022 00:29:35 -0700 (PDT) Date: Tue, 31 May 2022 15:29:27 +0800 From: Muchun Song To: Waiman Long Cc: hannes@cmpxchg.org, mhocko@kernel.org, roman.gushchin@linux.dev, shakeelb@google.com, akpm@linux-foundation.org, cgroups@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, duanxiongchun@bytedance.com Subject: Re: [PATCH v5 00/11] Use obj_cgroup APIs to charge the LRU pages Message-ID: References: <20220530074919.46352-1-songmuchun@bytedance.com> <1ecec7cb-035c-a4aa-3918-1a00ba48c6f9@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1ecec7cb-035c-a4aa-3918-1a00ba48c6f9@redhat.com> X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, May 30, 2022 at 10:41:30PM -0400, Waiman Long wrote: > On 5/30/22 03:49, Muchun Song wrote: > > This version is rebased on v5.18. > > > > Since the following patchsets applied. All the kernel memory are charged > > with the new APIs of obj_cgroup. > > > > [v17,00/19] The new cgroup slab memory controller [1] > > [v5,0/7] Use obj_cgroup APIs to charge kmem pages [2] > > > > But user memory allocations (LRU pages) pinning memcgs for a long time - > > it exists at a larger scale and is causing recurring problems in the real > > world: page cache doesn't get reclaimed for a long time, or is used by the > > second, third, fourth, ... instance of the same job that was restarted into > > a new cgroup every time. Unreclaimable dying cgroups pile up, waste memory, > > and make page reclaim very inefficient. > > > > We can convert LRU pages and most other raw memcg pins to the objcg direction > > to fix this problem, and then the LRU pages will not pin the memcgs. > > > > This patchset aims to make the LRU pages to drop the reference to memory > > cgroup by using the APIs of obj_cgroup. Finally, we can see that the number > > of the dying cgroups will not increase if we run the following test script. > > > > ```bash > > #!/bin/bash > > > > dd if=/dev/zero of=temp bs=4096 count=1 > > cat /proc/cgroups | grep memory > > > > for i in {0..2000} > > do > > mkdir /sys/fs/cgroup/memory/test$i > > echo $$ > /sys/fs/cgroup/memory/test$i/cgroup.procs > > cat temp >> log > > echo $$ > /sys/fs/cgroup/memory/cgroup.procs > > rmdir /sys/fs/cgroup/memory/test$i > > done > > > > cat /proc/cgroups | grep memory > > > > rm -f temp log > > ``` > > > > [1] https://lore.kernel.org/linux-mm/20200623015846.1141975-1-guro@fb.com/ > > [2] https://lore.kernel.org/linux-mm/20210319163821.20704-1-songmuchun@bytedance.com/ > > > > v4: https://lore.kernel.org/all/20220524060551.80037-1-songmuchun@bytedance.com/ > > v3: https://lore.kernel.org/all/20220216115132.52602-1-songmuchun@bytedance.com/ > > v2: https://lore.kernel.org/all/20210916134748.67712-1-songmuchun@bytedance.com/ > > v1: https://lore.kernel.org/all/20210814052519.86679-1-songmuchun@bytedance.com/ > > RFC v4: https://lore.kernel.org/all/20210527093336.14895-1-songmuchun@bytedance.com/ > > RFC v3: https://lore.kernel.org/all/20210421070059.69361-1-songmuchun@bytedance.com/ > > RFC v2: https://lore.kernel.org/all/20210409122959.82264-1-songmuchun@bytedance.com/ > > RFC v1: https://lore.kernel.org/all/20210330101531.82752-1-songmuchun@bytedance.com/ > > > > v5: > > - Lots of improvements from Johannes, Roman and Waiman. > > - Fix lockdep warning reported by kernel test robot. > > - Add two new patches to do code cleanup. > > - Collect Acked-by and Reviewed-by from Johannes and Roman. > > - I didn't replace local_irq_disable/enable() to local_lock/unlock_irq() since > > local_lock/unlock_irq() takes an parameter, it needs more thinking to transform > > it to local_lock. It could be an improvement in the future. > > My comment about local_lock/unlock is just a note that > local_irq_disable/enable() have to be eventually replaced. However, we need > to think carefully where to put the newly added local_lock. It is perfectly > fine to keep it as is and leave the conversion as a future follow-up. > Totally agree. > Thank you very much for your work on this patchset. > Thanks.