Received: by 2002:ac0:a581:0:0:0:0:0 with SMTP id m1-v6csp1462441imm; Fri, 29 Jun 2018 19:50:48 -0700 (PDT) X-Google-Smtp-Source: ADUXVKIqwKTKrf317aDVtfgesTe4Lc8QNaEdWZ8wKsqP047FuAYV/afVGtmc5SZn+Z/u3JIt6ZwB X-Received: by 2002:a65:6114:: with SMTP id z20-v6mr14720311pgu.312.1530327048876; Fri, 29 Jun 2018 19:50:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1530327048; cv=none; d=google.com; s=arc-20160816; b=MO5sSzmNiynkNwXeyP05ep0Ia/MdKvABwUUGE9r+fd8yhElKskW1dpImRHi2StHXug nup/hEEz08LeUc5PfF+3mSikLubgPacWDnea3ge9bcjj5hI6eI3Cu+bSslYot+O5clJy vOcjrLXJvoxp0CA+jLcCG5dVFwmDbsSfm+foz+uFtRLJcJFpGd4XhnFDLMUJl1rRIIHg q/wSIKg/9OFEv02itHze+REFaLZ1LIG8kI72PJJyxAA9mitWZ1cG2q9vWX/Ji+mECirk q3Sf2ddEs3S9FvRh9wCLW3ywvXcOfl6iI7hzOAePJ7OHbaihX7SOSAOcRufEJoZhXcd+ TdcQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date :arc-authentication-results; bh=Lz+m2CPTLcOyIN7jL5morkhJXhRA0WfZGjaqkD8CfSk=; b=vk2VwXZ7bF2AQUjozj7XdggZaWMV4COnFwxpNXF2sCzfHKlaS8r741cir4SsOb+Kkv BsxQwOxPq5yiUpBGx/Q1HCR78lLrB2/HPaz4MitF7IKy+rBnBBdCpQS7C0Wq0RZW12Re j9AsIB50qRfJ4BagHKxyXWrZov/UxJOBPL5q883ZErzpwFCgnTXFDSWnMniIgPVhbF86 X5Ex6y3O5iXwN7Sif1ABNzXFzaLnRx8Qwc6nELDPDB/TQozALB8VFILqbYtGVQvszfyy i9mJW969hkSpAuAKo9zJlBOWb6HYOedyxfKVX03vkMILVRRZw44efMI7S9fgFSKyyAQ0 fc1g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 33-v6si10447985plg.260.2018.06.29.19.50.34; Fri, 29 Jun 2018 19:50:48 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S935796AbeF3BfF (ORCPT + 99 others); Fri, 29 Jun 2018 21:35:05 -0400 Received: from mail.linuxfoundation.org ([140.211.169.12]:59190 "EHLO mail.linuxfoundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S935556AbeF3BfE (ORCPT ); Fri, 29 Jun 2018 21:35:04 -0400 Received: from localhost.localdomain (c-24-4-125-7.hsd1.ca.comcast.net [24.4.125.7]) by mail.linuxfoundation.org (Postfix) with ESMTPSA id 4FEBDCA1; Sat, 30 Jun 2018 01:35:03 +0000 (UTC) Date: Fri, 29 Jun 2018 18:35:01 -0700 From: Andrew Morton To: Yang Shi Cc: mhocko@kernel.org, willy@infradead.org, ldufour@linux.vnet.ibm.com, peterz@infradead.org, mingo@redhat.com, acme@kernel.org, alexander.shishkin@linux.intel.com, jolsa@redhat.com, namhyung@kernel.org, tglx@linutronix.de, hpa@zytor.com, linux-mm@kvack.org, x86@kernel.org, linux-kernel@vger.kernel.org Subject: Re: [RFC v3 PATCH 4/5] mm: mmap: zap pages with read mmap_sem for large mapping Message-Id: <20180629183501.9e30c26135f11853245c56c7@linux-foundation.org> In-Reply-To: <1530311985-31251-5-git-send-email-yang.shi@linux.alibaba.com> References: <1530311985-31251-1-git-send-email-yang.shi@linux.alibaba.com> <1530311985-31251-5-git-send-email-yang.shi@linux.alibaba.com> X-Mailer: Sylpheed 3.5.1 (GTK+ 2.24.31; x86_64-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, 30 Jun 2018 06:39:44 +0800 Yang Shi wrote: And... > diff --git a/mm/mmap.c b/mm/mmap.c > index 87dcf83..d61e08b 100644 > --- a/mm/mmap.c > +++ b/mm/mmap.c > @@ -2763,6 +2763,128 @@ static int munmap_lookup_vma(struct mm_struct *mm, struct vm_area_struct **vma, > return 1; > } > > +/* Consider PUD size or 1GB mapping as large mapping */ > +#ifdef HPAGE_PUD_SIZE > +#define LARGE_MAP_THRESH HPAGE_PUD_SIZE > +#else > +#define LARGE_MAP_THRESH (1 * 1024 * 1024 * 1024) > +#endif So this assumes that 32-bit machines cannot have 1GB mappings (fair enough) and this is the sole means by which we avoid falling into the "len >= LARGE_MAP_THRESH" codepath, which will behave very badly, at least because for such machines, VM_DEAD is zero. This is rather ugly and fragile. And, I guess, explains why we can't give all mappings this treatment: 32-bit machines can't do it. And we're adding a bunch of code to 32-bit kernels which will never be executed. I'm thinking it would be better to be much more explicit with "#ifdef CONFIG_64BIT" in this code, rather than relying upon the above magic. But I tend to think that the fact that we haven't solved anything on locked vmas or on uprobed mappings is a shostopper for the whole approach :(