Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp7258647imm; Tue, 24 Jul 2018 10:57:56 -0700 (PDT) X-Google-Smtp-Source: AAOMgpeVj53q5BR6XDTsD6dM9Msa0PcPPXRWdEJF2njc1zyGYqe0FXeVO0kuO/LkmSnyf6ORsGo+ X-Received: by 2002:a63:4663:: with SMTP id v35-v6mr16821788pgk.178.1532455076852; Tue, 24 Jul 2018 10:57:56 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1532455076; cv=none; d=google.com; s=arc-20160816; b=CuwHyrBpKDLm0dVbWr7tUaUHAI6XZMgxHH6W6Xh3xUo7CRV5m9SbvnuqZbL6T4u4Y6 hnJKg2/ly0qebx1lYpiO/aIQt4nd9qjP5+APd5Y9d1jwRY3DkWLD1yMMA36g0SOlOE0I fkVpB8up3Y0lH4Ii2bDl4Nb1pAjj4VtXcYm4XQiQjerIOj61cEuXeuhYZTQ0X7FGudq6 2apRH7CXrRcCbQxREmDnsCwDJEY4rbsNVnGcJrNIxtlncFZTqztM975kb5Qy27Eru11C oPa6MNa+kHUVSvMfYGH0OzID4uW0z5T6YRLubRkM7lGFyJew/iG3AHXw4cVoy6L1vlYN MiYw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=Yfym9Nh8ZRxvn5c1cWOcD71HHg+WmT+VwotueygALK4=; b=o6hbU4FRF3FwL7UH7gZEeTo1J4GBSTB6nPbCxjytipxp1UFigmjRKGcssw23oarIrl kHeVK3x0kfHel6RW5p5VmsSZse0A4gOdPoCH1lwMc++7YptfV2AcIvhVnuF+GpgL1/n+ JSxGoHZSQ2PdTEbBRdGTpPAhf1CHaRgBbFtD//Xxmd9RR53bUY/EaH1Eb4FVlFP69uCv YMfsOedlUjYMh3sJZ+8byglG+Np3lULMaBGCQo3wVt8mkqU0ROYxqcrWVXMAi/JeXhZ8 svYsLlEIrxEdSmVo5ciQZaYEAsFeR5F8BoZw5/62CyBPWMpagEE7OEurxvULKYiP/PBx i0MA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m68-v6si12676056pgm.637.2018.07.24.10.57.41; Tue, 24 Jul 2018 10:57:56 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2388502AbeGXTEV (ORCPT + 99 others); Tue, 24 Jul 2018 15:04:21 -0400 Received: from out30-131.freemail.mail.aliyun.com ([115.124.30.131]:43261 "EHLO out30-131.freemail.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2388386AbeGXTEV (ORCPT ); Tue, 24 Jul 2018 15:04:21 -0400 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R441e4;CH=green;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01e07487;MF=yang.shi@linux.alibaba.com;NM=1;PH=DS;RN=7;SR=0;TI=SMTPD_---0T5GDPxi_1532454991; Received: from US-143344MP.local(mailfrom:yang.shi@linux.alibaba.com fp:SMTPD_---0T5GDPxi_1532454991) by smtp.aliyun-inc.com(127.0.0.1); Wed, 25 Jul 2018 01:56:36 +0800 Subject: Re: [RFC v5 PATCH 2/2] mm: mmap: zap pages with read mmap_sem in munmap To: Laurent Dufour , mhocko@kernel.org, willy@infradead.org, kirill@shutemov.name, akpm@linux-foundation.org Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org References: <1531956101-8526-1-git-send-email-yang.shi@linux.alibaba.com> <1531956101-8526-3-git-send-email-yang.shi@linux.alibaba.com> <25fca2a1-0a55-13eb-0c75-6d0238fe780b@linux.vnet.ibm.com> From: Yang Shi Message-ID: <02228f42-438b-7840-5653-f076fc190f14@linux.alibaba.com> Date: Tue, 24 Jul 2018 10:56:31 -0700 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org >>>> +static int vm_munmap_zap_rlock(unsigned long start, size_t len) >>>> +{ >>>> +    int ret; >>>> +    struct mm_struct *mm = current->mm; >>>> +    LIST_HEAD(uf); >>>> + >>>> +    ret = do_munmap_zap_rlock(mm, start, len, &uf); >>>> +    userfaultfd_unmap_complete(mm, &uf); >>>> +    return ret; >>>> +} >>>> + >>>>   int vm_munmap(unsigned long start, size_t len) >>>>   { >>>>       int ret; >>> A stupid question, since the overhead of vm_munmap_zap_rlock() compared to >>> vm_munmap() is not significant, why not putting that in vm_munmap() instead of >>> introducing a new vm_munmap_zap_rlock() ? >> Since vm_munmap() is called in other paths too, i.e. drm driver, kvm, etc. I'm >> not quite sure if those paths are safe enough to this optimization. And, it >> looks they are not the main sources of the latency, so here I introduced >> vm_munmap_zap_rlock() for munmap() only. > For my information, what could be unsafe for these paths ? I'm just not sure if they are safe enough nor not, because I'm not knowledgeable enough to kvm and drm drivers. They might be safe, but I don't know how to prove that. So, since they might be not the main sources of latency (I haven't seen any hung report due to them), so it sounds safe to not touch them for now. > >> If someone reports or we see they are the sources of latency too, and the >> optimization is proved safe to them, we can definitely extend this to all >> vm_munmap() calls >> >> Thanks, >> Yang >> >>>> @@ -2855,10 +2939,9 @@ int vm_munmap(unsigned long start, size_t len) >>>>   SYSCALL_DEFINE2(munmap, unsigned long, addr, size_t, len) >>>>   { >>>>       profile_munmap(addr); >>>> -    return vm_munmap(addr, len); >>>> +    return vm_munmap_zap_rlock(addr, len); >>>>   } >>>> >>>> - >>>>   /* >>>>    * Emulation of deprecated remap_file_pages() syscall. >>>>    */ >>>> @@ -3146,7 +3229,7 @@ void exit_mmap(struct mm_struct *mm) >>>>       tlb_gather_mmu(&tlb, mm, 0, -1); >>>>       /* update_hiwater_rss(mm) here? but nobody should be looking */ >>>>       /* Use -1 here to ensure all VMAs in the mm are unmapped */ >>>> -    unmap_vmas(&tlb, vma, 0, -1); >>>> +    unmap_vmas(&tlb, vma, 0, -1, false); >>>>       free_pgtables(&tlb, vma, FIRST_USER_ADDRESS, USER_PGTABLES_CEILING); >>>>       tlb_finish_mmu(&tlb, 0, -1); >>>>