Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp4365665imm; Mon, 6 Aug 2018 23:45:22 -0700 (PDT) X-Google-Smtp-Source: AAOMgpef9f3gTuBi5CGzb3+HKvAtrJ0eua9SdoaQ0YjWhF0lFcbv5PUZcypuYf2Wwu6aRJg0Qt7j X-Received: by 2002:a63:1f20:: with SMTP id f32-v6mr16794929pgf.84.1533624322680; Mon, 06 Aug 2018 23:45:22 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533624322; cv=none; d=google.com; s=arc-20160816; b=hvtzh4y+EDQWkYYatRrF/Czn0ICWcJOtkEnkFbSBjOMsucwoFg5j4znjCc46Eg/1qW WOYnjUcibR9EVDaI7rGhgtOpNJkyHk3MZCwFbeGfwji5AHZx9Jp7/NflpEifTe7o23dd si71z8ZcJMztl3ZBp4HDD7x8DYAlgnJWjtSXZvk6A20hQR7xQ3AmuUyg4ahaoWsUrSwm C2W1f5hlHDcrfpmHc3Z0ZKzJoVurD9/orFcCoiDO2Uk8/hsPpyOZgj0PQPgpag/3rNgS tKzcraXoBQdMnlIKRjSMvAb0VZGhJyL4rF3lFXUJNOvvj74BPO87HWHwCeWuYCorJWfD VYaw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=ioj6iZNC1Ej87F1NR8cpxmJY7HOTgmLQBK/tQsNv1pw=; b=ODf5CZH2YAu/1cXHSS55Y50DpgUv92AOap62qOVjxf3wMyox16TH00NJxxLytLQsny 6f+mo4z9h0IGOIY6zIGQaJ9/F+OzVe3pXdCaUeAsC7XTHMluJZNjDW7vfvJyY4RBKxmz kgQ4/FM0yF9xMkX3RKYDp4rJrpPgVJfcZ0OmcKR+vK0xjcCmbDhlvuBV+yWR6r9eLMxa kOQsv1u3BwEmJn/30cQ9lWaIRFCKKVRPqhmN3e3qXz7pA8z4FLtYJ8f9l0xR6nrgBhH+ k1B+7oqBgs3aI/8xEIo6UPe8QoQATAlNrZwrAS2YjdQd8UgQ6e1TQYuXstJ2N6ON0BYm g2kA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r7-v6si677726pgh.473.2018.08.06.23.45.08; Mon, 06 Aug 2018 23:45:22 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1733279AbeHGH6G (ORCPT + 99 others); Tue, 7 Aug 2018 03:58:06 -0400 Received: from mx2.suse.de ([195.135.220.15]:50190 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1727198AbeHGH6G (ORCPT ); Tue, 7 Aug 2018 03:58:06 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay1.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 96616AD0C; Tue, 7 Aug 2018 05:45:26 +0000 (UTC) Date: Tue, 7 Aug 2018 07:45:24 +0200 From: Michal Hocko To: Yang Shi Cc: willy@infradead.org, ldufour@linux.vnet.ibm.com, kirill@shutemov.name, akpm@linux-foundation.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org Subject: Re: [RFC v6 PATCH 2/2] mm: mmap: zap pages with read mmap_sem in munmap Message-ID: <20180807054524.GQ10003@dhcp22.suse.cz> References: <1532628614-111702-1-git-send-email-yang.shi@linux.alibaba.com> <1532628614-111702-3-git-send-email-yang.shi@linux.alibaba.com> <20180803090759.GI27245@dhcp22.suse.cz> <20180806094005.GG19540@dhcp22.suse.cz> <76c0fc2b-fca7-9f22-214a-920ee2537898@linux.alibaba.com> <20180806204119.GL10003@dhcp22.suse.cz> <28de768b-c740-37b3-ea5a-8e2cb07d2bdc@linux.alibaba.com> <20180806205232.GN10003@dhcp22.suse.cz> <0cdff13a-2713-c5be-a33e-28c07e093bcc@linux.alibaba.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <0cdff13a-2713-c5be-a33e-28c07e093bcc@linux.alibaba.com> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon 06-08-18 15:19:06, Yang Shi wrote: > > > On 8/6/18 1:52 PM, Michal Hocko wrote: > > On Mon 06-08-18 13:48:35, Yang Shi wrote: > > > > > > On 8/6/18 1:41 PM, Michal Hocko wrote: > > > > On Mon 06-08-18 09:46:30, Yang Shi wrote: > > > > > On 8/6/18 2:40 AM, Michal Hocko wrote: > > > > > > On Fri 03-08-18 14:01:58, Yang Shi wrote: > > > > > > > On 8/3/18 2:07 AM, Michal Hocko wrote: > > > > > > > > On Fri 27-07-18 02:10:14, Yang Shi wrote: > > > > [...] > > > > > > > > > If the vma has VM_LOCKED | VM_HUGETLB | VM_PFNMAP or uprobe, they are > > > > > > > > > considered as special mappings. They will be dealt with before zapping > > > > > > > > > pages with write mmap_sem held. Basically, just update vm_flags. > > > > > > > > Well, I think it would be safer to simply fallback to the current > > > > > > > > implementation with these mappings and deal with them on top. This would > > > > > > > > make potential issues easier to bisect and partial reverts as well. > > > > > > > Do you mean just call do_munmap()? It sounds ok. Although we may waste some > > > > > > > cycles to repeat what has done, it sounds not too bad since those special > > > > > > > mappings should be not very common. > > > > > > VM_HUGETLB is quite spread. Especially for DB workloads. > > > > > Wait a minute. In this way, it sounds we go back to my old implementation > > > > > with special handling for those mappings with write mmap_sem held, right? > > > > Yes, I would really start simple and add further enhacements on top. > > > If updating vm_flags with read lock is safe in this case, we don't have to > > > do this. The only reason for this special handling is about vm_flags update. > > Yes, maybe you are right that this is safe. I would still argue to have > > it in a separate patch for easier review, bisectability etc... > > Sorry, I'm a little bit confused. Do you mean I should have the patch > *without* handling the special case (just like to assume it is safe to > update vm_flags with read lock), then have the other patch on top of it, > which simply calls do_munmap() to deal with the special cases? Just skip those special cases in the initial implementation and handle each special case in its own patch on top. -- Michal Hocko SUSE Labs