Received: by 10.213.65.68 with SMTP id h4csp1692229imn; Thu, 5 Apr 2018 02:01:10 -0700 (PDT) X-Google-Smtp-Source: AIpwx4+r8mJQKgtlxdCcawbjHDNtRqn63b+FI0S3C99FswEsoy9P+hopoPwpA556GQFnRM2KJsHW X-Received: by 10.99.97.138 with SMTP id v132mr14214849pgb.138.1522918870089; Thu, 05 Apr 2018 02:01:10 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1522918870; cv=none; d=google.com; s=arc-20160816; b=zVuFGy1GLg4icOiI9m1D61zY6ZMUZjJnHWEtwtiKIX/eE2RU2IhLH+QxxYUvRnoWqK JQzbML09f5AMg2g1PgkITQgG81NaNniEjckky7McoabYC9+3ZtSHd8yp5veqn7kNtlKj oDJ2aNuPL7CpZ4hfn8X6Qdr77EvRhPZAiHkE/wcty00O/Uvtu9Xc82n1NA5RECy3FStc d7wRj6b13ur6xQlZvXdDaPfWBA//RkdpmFv+hXhDNUBQk4bG+D35BYBxx4JN1/iZa1iP L77BMYW2bmxLkJ1sTj6DnpfxUdANaHbOmPLYcaS2JjAeDrGqRbgRwQ4aZ2UFFknNga5R tueQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=pnTApUSoMhHlxjZijdMGLL7J1R+jxXYawss30W2uMLA=; b=zkHT4ArTw49exhVlKLa/jfe9Gne0wG4KJfrVIFzwJ70u+1PUIvg7JHa9/DeWTI1wcT TCy08lMJm/zIw25Al3qDNaYn/HFrb2S/TM5jJlGflc1CDMnLIDP6qzQm0NyLfSMnNhIA jFLGGLMCxmtdtABVSitpYg1Wb2tH5hHBhKRPJychHSCXLJ51EgxPTeUnNIcFn/PrJxgI ilt5rcCDsc4TcJBMwC4X4XxVsCB+LV/toMy2zbWc4aO/14+TIkUoCvzgbRDTE3wnf41X UMwJdNNqg+Ys5ywWNCnaVfja7IfsOt5SpXLn4bcS0nRT0jqIw5/KJXCxb8+l8INGrPhf 6HUw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t5si5016093pgu.491.2018.04.05.02.00.55; Thu, 05 Apr 2018 02:01:10 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751544AbeDEI7c (ORCPT + 99 others); Thu, 5 Apr 2018 04:59:32 -0400 Received: from mx2.suse.de ([195.135.220.15]:52374 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751233AbeDEI73 (ORCPT ); Thu, 5 Apr 2018 04:59:29 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (charybdis-ext.suse.de [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id 5160FACC4; Thu, 5 Apr 2018 08:59:28 +0000 (UTC) Date: Thu, 5 Apr 2018 10:59:27 +0200 From: Michal Hocko To: "Kirill A. Shutemov" Cc: Naoya Horiguchi , "linux-mm@kvack.org" , Andrew Morton , Vlastimil Babka , "linux-kernel@vger.kernel.org" Subject: Re: [PATCH v1] mm: consider non-anonymous thp as unmovable page Message-ID: <20180405085927.GC6312@dhcp22.suse.cz> References: <1522730788-24530-1-git-send-email-n-horiguchi@ah.jp.nec.com> <20180403075928.GC5501@dhcp22.suse.cz> <20180403082405.GA23809@hori1.linux.bs1.fc.nec.co.jp> <20180403083451.GG5501@dhcp22.suse.cz> <20180403105411.hknofkbn6rzs26oz@node.shutemov.name> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180403105411.hknofkbn6rzs26oz@node.shutemov.name> User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 03-04-18 13:54:11, Kirill A. Shutemov wrote: > On Tue, Apr 03, 2018 at 10:34:51AM +0200, Michal Hocko wrote: > > On Tue 03-04-18 08:24:06, Naoya Horiguchi wrote: > > > On Tue, Apr 03, 2018 at 09:59:28AM +0200, Michal Hocko wrote: > > > > On Tue 03-04-18 13:46:28, Naoya Horiguchi wrote: > > > > > My testing for the latest kernel supporting thp migration found out an > > > > > infinite loop in offlining the memory block that is filled with shmem > > > > > thps. We can get out of the loop with a signal, but kernel should > > > > > return with failure in this case. > > > > > > > > > > What happens in the loop is that scan_movable_pages() repeats returning > > > > > the same pfn without any progress. That's because page migration always > > > > > fails for shmem thps. > > > > > > > > Why does it fail? Shmem pages should be movable without any issues. > > > > > > .. because try_to_unmap_one() explicitly skips unmapping for migration. > > > > > > #ifdef CONFIG_ARCH_ENABLE_THP_MIGRATION > > > /* PMD-mapped THP migration entry */ > > > if (!pvmw.pte && (flags & TTU_MIGRATION)) { > > > VM_BUG_ON_PAGE(PageHuge(page) || !PageTransCompound(page), page); > > > > > > if (!PageAnon(page)) > > > continue; > > > > > > set_pmd_migration_entry(&pvmw, page); > > > continue; > > > } > > > #endif > > > > > > When I implemented this code, I felt hard to work on both of anon thp > > > and shmem thp at one time, so I separated the proposal into smaller steps. > > > Shmem uses pagecache so we need some non-trivial effort (including testing) > > > to extend thp migration for shmem. But I think it's a reasonable next step. > > > > OK, I see. I have forgot about this part. Please be explicit about that > > in the changelog. Also the proper fix is to not use movable zone for > > shmem page THP rather than hack around it in the hotplug specific code > > IMHO. > > No. We should just split the page before running > try_to_unmap(TTU_MIGRATION) on the page. Something like this or it is completely broken. I completely forgot the whole page_vma_mapped_walk business. diff --git a/mm/rmap.c b/mm/rmap.c index 9eaa6354fe70..cbbfbcb08b83 100644 --- a/mm/rmap.c +++ b/mm/rmap.c @@ -1356,6 +1356,7 @@ static bool try_to_unmap_one(struct page *page, struct vm_area_struct *vma, return true; if (flags & TTU_SPLIT_HUGE_PMD) { +split: split_huge_pmd_address(vma, address, flags & TTU_SPLIT_FREEZE, page); } @@ -1375,7 +1376,7 @@ static bool try_to_unmap_one(struct page *page, struct vm_area_struct *vma, VM_BUG_ON_PAGE(PageHuge(page) || !PageTransCompound(page), page); if (!PageAnon(page)) - continue; + goto split; set_pmd_migration_entry(&pvmw, page); continue; -- Michal Hocko SUSE Labs