Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp8952020ybl; Fri, 17 Jan 2020 04:00:47 -0800 (PST) X-Google-Smtp-Source: APXvYqw4MU6MZ/NNiopTlSc9UsWI3ycTn3yYatj7rvWepd1XT3x/VjxgpILwQG3M3UPhkqXabN1A X-Received: by 2002:aca:f507:: with SMTP id t7mr3167369oih.156.1579262446940; Fri, 17 Jan 2020 04:00:46 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1579262446; cv=none; d=google.com; s=arc-20160816; b=eZUnNVy+ICRjemuhegueaAL80z1FyXVG8iz4QdED10pdaOEULqhigLsnm9R6JYRYnk Tyke6N/ob5F0AvP6hAHNSe+k1H8nfI41pAnqVyUYTsggPRx99W2AIFQUqNuG5c1LvCtt 9mbHZMv2xkpZ0/PJsc3PFncqH7cC+TO4EIPyWadOP6f54MxC3TYveciYHGWKQByXt4j9 K7mAGqj7CkA2YGAtWVGvI7DnVN47TLAG30PTQACmH607cjGD0Y76x5mV21k5EtNLmzNK VCFGHmtUZrxg3Ms5XjUSLvaO9y8dBKt4J8QdbAnSu8lT/QnI4ogmD2wUg7aM5tVYO7d1 fXAQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=udlU6SpvxzvjKHENifeUg1Az8MLTSzxnXtnR1+RwbnI=; b=rAD6CNFpLDQPY+xU0A+6GvZ4VNlTGyapfAFL2MYgiI6tkWqC3h37TPDI02UOzbuI8X 1DoYuAaG5jhueQdg85ewmwnrqLvwRN+QUta7/YHPBufL8ytHe1sJHKFx9m26I1CcSlxu xC2ixoGAFT/A1nvQSX2qlhh3dYlRHl0vU4M4oXUzI18Kc/Xvk0whZC6MVuanfHagMZ5j AsDKJEz5VAAaIDK9qIpOosiDH359BXKWIofvfUlExpPVt8JBN1KI/dKqI91Vf6qpbxj0 JdpYw9YrzyYHUcZpdsv41R06aor6leW9FzhB2TlcQdlg/sym00ZvZYWxVuKE1jzzBlPW L8pA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w18si14623672otj.148.2020.01.17.04.00.33; Fri, 17 Jan 2020 04:00:46 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728778AbgAQL7j (ORCPT + 99 others); Fri, 17 Jan 2020 06:59:39 -0500 Received: from mail-wm1-f43.google.com ([209.85.128.43]:33770 "EHLO mail-wm1-f43.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726785AbgAQL7j (ORCPT ); Fri, 17 Jan 2020 06:59:39 -0500 Received: by mail-wm1-f43.google.com with SMTP id d139so9345971wmd.0; Fri, 17 Jan 2020 03:59:37 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=udlU6SpvxzvjKHENifeUg1Az8MLTSzxnXtnR1+RwbnI=; b=RJ1UNoEvmG3Oqy2LAy3L4HjXT2D3EBRlZoMfHXg5zEJKnyZjbd4g0z4Jm7t/Ixg7dy f2L/zv2GfX4blABsESuzUKstOf8ecHxINhy2f4h7kKBKVyFF6FIu6QHy0EbUlFWAbHRM NBD4NiJ2gmfNfBtMQsEXUGqyug8E3gWRZFbiyaj06VNpaXK4Od0Qdbri84ilXa3rWvdy /JfvkIcWefEB7i1PrnCF1rF5jKd0Zm92EMrO1F0tPSCkYv3qH9CpA5EpzOM6p7GSgVR7 N0/n9jkkIAA0zqPNTmTTDIMfrJU6DyjB81dcItUyv/b+boSmq1O1VbGDsX/GaDp/b/yF sKNg== X-Gm-Message-State: APjAAAVSZ/xU+DNPBM9/J48yR/m0N9SoVsHiGmDhFffE/h/EuqyVu91e Mt9ziUKNDH2l61uAtTxMVRU= X-Received: by 2002:a1c:1d1:: with SMTP id 200mr4248107wmb.181.1579262377136; Fri, 17 Jan 2020 03:59:37 -0800 (PST) Received: from localhost (prg-ext-pat.suse.com. [213.151.95.130]) by smtp.gmail.com with ESMTPSA id o129sm9364730wmb.1.2020.01.17.03.59.36 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 17 Jan 2020 03:59:36 -0800 (PST) Date: Fri, 17 Jan 2020 12:59:35 +0100 From: Michal Hocko To: Kirill Tkhai , g@suse.cz, kirill.shutemov@linux.intel.com Cc: David Rientjes , Wei Yang , hannes@cmpxchg.org, vdavydov.dev@gmail.com, akpm@linux-foundation.or, yang.shi@linux.alibaba.com, cgroups@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, alexander.duyck@gmail.com, stable@vger.kernel.org Subject: Re: [Patch v3] mm: thp: grab the lock before manipulation defer list Message-ID: <20200117115935.GW19428@dhcp22.suse.cz> References: <20200116013100.7679-1-richardw.yang@linux.intel.com> <0bb34c4a-97c7-0b3c-cf43-8af6cf9c4396@virtuozzo.com> <20200117091002.GM19428@dhcp22.suse.cz> <11ba0af7-c2b2-83f9-ac55-7793cedb8028@virtuozzo.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <11ba0af7-c2b2-83f9-ac55-7793cedb8028@virtuozzo.com> User-Agent: Mutt/1.12.2 (2019-09-21) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri 17-01-20 12:42:05, Kirill Tkhai wrote: > On 17.01.2020 12:32, David Rientjes wrote: > > On Fri, 17 Jan 2020, Kirill Tkhai wrote: > > > >>>> I think that's a good point, especially considering that the current code > >>>> appears to unconditionally place any compound page on the deferred split > >>>> queue of the destination memcg. The correct list that it should appear > >>>> on, I believe, depends on whether the pmd has been split for the process > >>>> being moved: note the MC_TARGET_PAGE caveat in > >>>> mem_cgroup_move_charge_pte_range() that does not move the charge for > >>>> compound pages with split pmds. So when mem_cgroup_move_account() is > >>>> called with compound == true, we're moving the charge of the entire > >>>> compound page: why would it appear on that memcg's deferred split queue? > >>> > >>> I believe Kirill asked how do we know that the page should be actually > >>> added to the deferred list just from the list_empty check. In other > >>> words what if the page hasn't been split at all? > >> > >> Yes, I'm talking about this. Function mem_cgroup_move_account() adds every > >> huge page to the deferred list, while we need to do that only for pages, > >> which are queued for splitting... > >> > > > > Yup, and that appears broken before Wei's patch. Since we only migrate > > charges of entire compound pages (we have a mapping pmd, the underlying > > page cannot be split), it should not appear on the deferred split queue > > for any memcg, right? > > Hm. Can't a huge page be mapped in two tasks: It can but it will get charged to only of the initially. I haven't checked the THP code in that aspect but from what I remember subpages shouldn't refer to different memcgs. Kirill Shutemov? -- Michal Hocko SUSE Labs