Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp5113937ybl; Wed, 22 Jan 2020 10:33:28 -0800 (PST) X-Google-Smtp-Source: APXvYqxRZk6svL2jRZLrz9BT1GgeJZ6ogKgk7pnBt6PyqzDXY4kEQXQW+yLTtIlm48zIagvfIJAz X-Received: by 2002:a9d:7d99:: with SMTP id j25mr7838824otn.226.1579718008061; Wed, 22 Jan 2020 10:33:28 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1579718008; cv=none; d=google.com; s=arc-20160816; b=z4BUU5UegSI1XG4KGCHw3wBuhJO00ViPUfwY6fKtyCdAzm5WWHEY1jqLtpPjmsXOns zNc0Jh+6dCTA12iXW3lE/Zk/c/vzAuuMl3FeTrHrdXcEQdxjinEbVrCcFgy3crr84qP0 TBXVZeTKa1OnHffNxH1oBfo3CG8gN6AkSfW7c3OUtmjH65hqsglo09F2klqh7JRuWx1x qS5WYRMCsCVuWgXpwS/M0nkmLFnWcSIIbPfGTuNRj9w2CtICdzw3+LDHMyV2TWjTaAMk sxJWbUnY1VlWiXiG1Ak3T4AfTlaq7yeZXFbL3NbvLLQ7rHIa86VteNAEibCkppUKkoGZ +iMw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=KsSmCaLhw2eH8Ck1bfIH7ogc6/ppRHUB+hk51YkiZzE=; b=cPGwRenLiQgrIjK5+PGg7YItn9sxyJ3wvVO3+b/l0uhoiUU48UpXyFxQoqZw62xZZZ KsWVOC7ndFt0yG61p1xNLVr/YWbTJfLqLEAPu6n/jZwx72zpwJ5B07PBFpPEBnvWzu2/ BZg0MzcpZ7Ta80+oOEZS57vTgGgURQNbSWrI7fimmuK3vpF3ThpjdMMwZNTlmR52qs/R aVcmq7dD/HPNixxenz/0L3YFAyzHFcy1JyGj0QlW2b2LNVPltAfJWinPgGvKjZTil7BL tu7yh31FL3JMXQIKomnL5eGMzxF29U1DvBHn4Blmyr0fzxVL5mcIF5pxWssBadY66ASn 35gg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@cmpxchg-org.20150623.gappssmtp.com header.s=20150623 header.b=u0FpVPQr; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=cmpxchg.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id p28si21545359oth.296.2020.01.22.10.33.15; Wed, 22 Jan 2020 10:33:28 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@cmpxchg-org.20150623.gappssmtp.com header.s=20150623 header.b=u0FpVPQr; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=cmpxchg.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726207AbgAVSbS (ORCPT + 99 others); Wed, 22 Jan 2020 13:31:18 -0500 Received: from mail-qv1-f65.google.com ([209.85.219.65]:43077 "EHLO mail-qv1-f65.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725836AbgAVSbR (ORCPT ); Wed, 22 Jan 2020 13:31:17 -0500 Received: by mail-qv1-f65.google.com with SMTP id p2so210706qvo.10 for ; Wed, 22 Jan 2020 10:31:16 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cmpxchg-org.20150623.gappssmtp.com; s=20150623; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=KsSmCaLhw2eH8Ck1bfIH7ogc6/ppRHUB+hk51YkiZzE=; b=u0FpVPQrv4MeJj0uTvYpr+Sr33hV6aI1K+Gy8Mixj42CY52n6sZTaMWDIYtZSSCP24 mk9AQBCjBEZZe/npyXvADUGSV7Bm/VG+E0ISXV7nQTLQINJU53OZm0CsA7WB6ZEmjmRK pO0ElN1AQ3/y67SssU92eOrzZRg0MKD3ME3UblFkLpoa73KGxdADtGQrWy9scDLLZEn1 cBQjulW+4DY/St0t8MIpZkDxNDNaQ6UYuWCw+oUB3dFEwi8KBcScjaWaD0a22oqbJREl xzWXKZUcI7ERm13Gre8OHCeZmsTaUtphwjUy+FCbkLULIvv3cbrV+hnaYWW+HYDN+Tso qRHw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=KsSmCaLhw2eH8Ck1bfIH7ogc6/ppRHUB+hk51YkiZzE=; b=Eu+HKIsmfTyT5ueaIzq2JwzJ78e88CBWHpnisZVa9sfHxLvcYLxSpG8ojNsMCyZuJz EJUaBx89GsC/yXLYUBOtGAvjuLgdLyM52aVCdY8+1x5j/V1A5trDcYtK94ifGD94rSH+ RSqk07N9KI9XXJYQ3KQ7OuWm5TThm1RUYllbWG5IsqVn/slaQC3XJi4RRc+sNnxKCcoc JzNI1fCS+W6/N3DYVZV4DLP3jDYMZwXMAIcepvsF7hroc57/PWZ6Lqt1xdZruzdd4l/X AWxNvBw9YQ0S5cwjA0d6fyZZcYYvRL221UA1zRmkCRYMaFG3foKL8ekCJ+iMVxx2VEJ7 y/bA== X-Gm-Message-State: APjAAAUAhWWi50TkkIkG6QA9/1DVZh5GVdyasmkUATIty6LV1G778Py4 usONVcksN6Ymql10QLjbN/c2cQ== X-Received: by 2002:a05:6214:982:: with SMTP id dt2mr11565088qvb.174.1579717875415; Wed, 22 Jan 2020 10:31:15 -0800 (PST) Received: from localhost ([2620:10d:c091:500::3:203c]) by smtp.gmail.com with ESMTPSA id x3sm21488705qts.35.2020.01.22.10.31.14 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 22 Jan 2020 10:31:14 -0800 (PST) Date: Wed, 22 Jan 2020 13:31:13 -0500 From: Johannes Weiner To: Alex Shi Cc: cgroups@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, akpm@linux-foundation.org, mgorman@techsingularity.net, tj@kernel.org, hughd@google.com, khlebnikov@yandex-team.ru, daniel.m.jordan@oracle.com, yang.shi@linux.alibaba.com, willy@infradead.org, shakeelb@google.com, Michal Hocko , Vladimir Davydov , Roman Gushchin , Chris Down , Thomas Gleixner , Vlastimil Babka , Qian Cai , Andrey Ryabinin , "Kirill A. Shutemov" , =?iso-8859-1?B?Suly9G1l?= Glisse , Andrea Arcangeli , David Rientjes , "Aneesh Kumar K.V" , swkhack , "Potyra, Stefan" , Mike Rapoport , Stephen Rothwell , Colin Ian King , Jason Gunthorpe , Mauro Carvalho Chehab , Peng Fan , Nikolay Borisov , Ira Weiny , Kirill Tkhai , Yafang Shao Subject: Re: [PATCH v8 03/10] mm/lru: replace pgdat lru_lock with lruvec lock Message-ID: <20200122183113.GA98452@cmpxchg.org> References: <1579143909-156105-1-git-send-email-alex.shi@linux.alibaba.com> <1579143909-156105-4-git-send-email-alex.shi@linux.alibaba.com> <20200116215222.GA64230@cmpxchg.org> <9ee80b68-a78f-714a-c727-1f6d2b4f87ea@linux.alibaba.com> <20200121160005.GA69293@cmpxchg.org> <0bd0a561-93cc-11b6-1eae-24b450b0f033@linux.alibaba.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <0bd0a561-93cc-11b6-1eae-24b450b0f033@linux.alibaba.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jan 22, 2020 at 08:01:29PM +0800, Alex Shi wrote: > Yes I understand isolatation would exclusive by PageLRU, but forgive my > stupid, I didn't figure out how a new page lruvec adding could be blocked. I don't see why we would need this. Can you elaborate where you think this is a problem? If compaction races with charging for example, compaction doesn't need to prevent a new page from being added to an lruvec. PageLRU is only set after page->mem_cgroup is updated, so there are two race outcomes: 1) TestClearPageLRU() fails. That means the page isn't (fully) created yet and cannot be migrated. We goto isolate_fail before even trying to lock the lruvec. 2) TestClearPageLRU() succeeds. That means the page was fully created and page->mem_cgroup has been set up. Anybody who now wants to change page->mem_cgroup needs PageLRU, but we have it, so lruvec is stable. I.e. cgroup charging does this: page->mem_cgroup = new_group lock(pgdat->lru_lock) SetPageLRU() add_page_to_lru_list() unlock(pgdat->lru_lock) and compaction currently does this: lock(pgdat->lru_lock) if (!PageLRU()) goto isolate_fail // __isolate_lru_page: if (!get_page_unless_zero()) goto isolate_fail ClearPageLRU() del_page_from_lru_list() unlock(pgdat->lru_lock) We can replace charging with this: page->mem_cgroup = new_group lock(lruvec->lru_lock) add_page_to_lru_list() unlock(lruvec->lru_lock) SetPageLRU() and the compaction sequence with something like this: if (!get_page_unless_zero()) goto isolate_fail if (!TestClearPageLRU()) goto isolate_fail_put // We got PageLRU, so charging is complete and nobody // can modify page->mem_cgroup until we set it again. lruvec = mem_cgroup_page_lruvec(page, pgdat) lock(lruvec->lru_lock) del_page_from_lru_list() unlock(lruvec->lru_lock)