Received: by 10.223.148.5 with SMTP id 5csp7572240wrq; Thu, 18 Jan 2018 07:01:13 -0800 (PST) X-Google-Smtp-Source: ACJfBot9sBtOWPrsKm+B56rB0X0YUy6vrhU0eCoe5YzvMIx6JC6Da9ZDLMDvJVi6CKS8mRfmccS0 X-Received: by 10.98.246.8 with SMTP id x8mr4353710pfh.234.1516287673562; Thu, 18 Jan 2018 07:01:13 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1516287673; cv=none; d=google.com; s=arc-20160816; b=sZuhuH6RJ4B38eDbFD5VM3FeXfQC9RO97wVOdoxMiVJxqBUWtJK8pSHwSThYDbBMzy NqpSjO3F75HZHyiIHtb0Fh5CpCT4wpqX/SeAeeWNOVvAoqvEycCJEVMaXV0VNK9vh/8R It9M7Zlnxoife7gMXwipbx2kd84dFf7AfjFUIBvoVuhwSkfmWaE0wDN9C7WgIfl94WNd P0DwBY8Ju3FYC6IjjY2L4VuS9s1vw4GGYCGjAs9Nw24U7LF4SuQmUv96l/ENAAQM8sDN RefB+xrFPRcbsu4eLRgExh2kfPb1iDo3qJ67R6brsGiZQ6OSO4UWAnoAT+9hVxaeWshV Z0zg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=/pK68A7USb7pl1lMU41+TssMja+7hyOljBKH/EwG98c=; b=dqPWi6owAqeXu945wPzeDTr2BXBgMMNrErTfWZzu/mem7tP+PyJKHZiA+nFmFWFBY+ L5KTsK0IcZtax6PsiwvLXTTgdYsrTEeIA3wUPAvcGHLCem2aKNkYjVij8vlxr/9FTIvV NrHt/TqBnFLIWtVifOTqbXpsxxEkMcKEtQKYXMFbCHnHmGmovUEeqR5AyqhIvkwbGrCO R2GC9JBW8LTFMeJJ2lW8Fejsdq4phkII+KyTLSAKYlVvw7dg3tiQDLnv6TLAeSo2YuMe TLH+uRL3VyF2tAUyVe5c/2zw9N441XLFThwZbyngp5t50Zek+vguo9pyJcYh5797M1b8 k3sA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k62si6191580pga.539.2018.01.18.07.00.56; Thu, 18 Jan 2018 07:01:13 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755945AbeARO7E (ORCPT + 99 others); Thu, 18 Jan 2018 09:59:04 -0500 Received: from mx1.redhat.com ([209.132.183.28]:35172 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750972AbeARO7D (ORCPT ); Thu, 18 Jan 2018 09:59:03 -0500 Received: from smtp.corp.redhat.com (int-mx06.intmail.prod.int.phx2.redhat.com [10.5.11.16]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 6D6DAC0587E2; Thu, 18 Jan 2018 14:59:02 +0000 (UTC) Received: from mail.random (ovpn-116-144.ams2.redhat.com [10.36.116.144]) by smtp.corp.redhat.com (Postfix) with ESMTPS id EB5EC4B1; Thu, 18 Jan 2018 14:58:32 +0000 (UTC) Date: Thu, 18 Jan 2018 15:58:30 +0100 From: Andrea Arcangeli To: Dave Hansen Cc: "Kirill A. Shutemov" , Tetsuo Handa , torvalds@linux-foundation.org, kirill.shutemov@linux.intel.com, akpm@linux-foundation.org, hannes@cmpxchg.org, iamjoonsoo.kim@lge.com, mgorman@techsingularity.net, tony.luck@intel.com, vbabka@suse.cz, mhocko@kernel.org, hillf.zj@alibaba-inc.com, hughd@google.com, oleg@redhat.com, peterz@infradead.org, riel@redhat.com, srikar@linux.vnet.ibm.com, vdavydov.dev@gmail.com, mingo@kernel.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, x86@kernel.org Subject: Re: [mm 4.15-rc8] Random oopses under memory pressure. Message-ID: <20180118145830.GA6406@redhat.com> References: <201801160115.w0G1FOIG057203@www262.sakura.ne.jp> <201801170233.JDG21842.OFOJMQSHtOFFLV@I-love.SAKURA.ne.jp> <201801172008.CHH39543.FFtMHOOVSQJLFO@I-love.SAKURA.ne.jp> <201801181712.BFD13039.LtHOSVMFJQFOFO@I-love.SAKURA.ne.jp> <20180118122550.2lhsjx7hg5drcjo4@node.shutemov.name> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.9.2 (2017-12-15) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.16 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.32]); Thu, 18 Jan 2018 14:59:02 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jan 18, 2018 at 06:45:00AM -0800, Dave Hansen wrote: > On 01/18/2018 04:25 AM, Kirill A. Shutemov wrote: > > [ 10.084024] diff: -858690919 > > [ 10.084258] hpage_nr_pages: 1 > > [ 10.084386] check1: 0 > > [ 10.084478] check2: 0 > ... > > diff --git a/mm/page_vma_mapped.c b/mm/page_vma_mapped.c > > index d22b84310f6d..57b4397f1ea5 100644 > > --- a/mm/page_vma_mapped.c > > +++ b/mm/page_vma_mapped.c > > @@ -70,6 +70,14 @@ static bool check_pte(struct page_vma_mapped_walk *pvmw) > > } > > if (pte_page(*pvmw->pte) < pvmw->page) > > return false; > > + > > + if (pte_page(*pvmw->pte) - pvmw->page) { > > + printk("diff: %d\n", pte_page(*pvmw->pte) - pvmw->page); > > + printk("hpage_nr_pages: %d\n", hpage_nr_pages(pvmw->page)); > > + printk("check1: %d\n", pte_page(*pvmw->pte) - pvmw->page < 0); > > + printk("check2: %d\n", pte_page(*pvmw->pte) - pvmw->page >= hpage_nr_pages(pvmw->page)); > > + BUG(); > > + } > > This says that pte_page(*pvmw->pte) and pvmw->page are roughly 4GB away > from each other (858690919*4=0xccba559c0). That's not the compiler > being wonky, it just means that the virtual addresses of the memory > sections are that far apart. > > This won't happen when you have vmemmap or flatmem because the mem_map[] > is virtually contiguous and pointer arithmetic just works against all > 'struct page' pointers. But with classic sparsemem, it doesn't. > > You need to make sure that the PFNs are in the same section before you > can do the math that you want to do here. Isn't it simply that pvmw->page isn't a page or pte_page(*pvmw->pte) isn't a page? The distance cannot matter, MMU isn't involved, this is pure 64bit aritmetics, 1giga 1 terabyte, 48bits 5level pagetables are meaningless in this comparison. #include int main() { volatile long i; struct x { char a[4000000000]; }; for (i = 0; i < 4000000000*3; i += 4000000000) { printf("%ld\n", ((struct x *)0)-((((struct x *)i)))); } printf("xxxx\n"); for (i = 0; i < 4000000000; i += 1) { if (i==4) i = 4000000000; printf("%ld\n", ((struct x *)0)-((((struct x *)i)))); } return 0; } You need to add two debug checks on "pte_page(*pvmw->pte) % 64" and same for pvmw->page to find out the one of the two that isn't a page. If both are real pages there's a bug that allocates page structs not naturally aligned.