Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp3747600imm; Mon, 6 Aug 2018 09:54:48 -0700 (PDT) X-Google-Smtp-Source: AAOMgpcclY6Jw3wt58Ez/VWfBqgA+Ftq42VpCwFAKdFWvkRE1kn4h+5wlgs27fiOuJmroYWFe9DX X-Received: by 2002:a62:d1b:: with SMTP id v27-v6mr18152425pfi.87.1533574488108; Mon, 06 Aug 2018 09:54:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533574488; cv=none; d=google.com; s=arc-20160816; b=EzOUYADpmfI4W5EwebHNW/NeuRagmtRKSfSPVA+v4ux4hNv0eEkg4Xij3t4psd0mgI 3B0N/vCL9OKTWw4D7el2oLBKTSyot3GCI1Y0As99h4iJXs378BpYzRJnM6tb/NkyPuPS n95dt66GcJxDuaXFTJ6/+kl98BryRFf0OIhANwh2VBjIskH0UNEGvvpOFsYHITkkSPGd Y8Mx0Sfpxzz8xsbRC8wb5d7o05EmI6MOtHdUFjiZurRO0xbs1H85qmBolZmJno6bFdjM SyMWC37lzqRDeO3V9PwKWqWIWPqKFcY5GSmrT2l4DSCbwIZJabilH47xpuuAkStGd/EO uqkw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=gehcwgI2uy+qco3rXSBGBqS7YjtIngvTcGLfqvlkwcI=; b=yBHFFw7MrCcHfif/LxIm4QsdqwLG/2m6hVlIiLthKSOityO9DHgOIKQeITfAwY149v Dkj8p1r9LW4UDSSpbDnkwnErzuMZthipA6o8jbyfBzsIN3//fYSuE1F89Aqj/Dct78BK rfJsM76XN03c2wq5O2pE9fpmm4dLu/8J0ZmEUWyNe9+OjxITjQnJhUed8P4j1JknZ/1+ v66vJk4fTvL54WGU1TsGeq/hh5lWTbM1/CECuzZtxe1/kzZopJeSfc0c4rZhuX7cKjrv QQXbDVFoMt8T6M3f3kiYjyA9aCoIOzaD8fFlt1yHDXKYzXbymzD7T+WFpC1kqMa972mo OLwA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r28-v6si13653327pfb.65.2018.08.06.09.54.32; Mon, 06 Aug 2018 09:54:48 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730390AbeHFTDm (ORCPT + 99 others); Mon, 6 Aug 2018 15:03:42 -0400 Received: from out30-132.freemail.mail.aliyun.com ([115.124.30.132]:46399 "EHLO out30-132.freemail.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727834AbeHFTDm (ORCPT ); Mon, 6 Aug 2018 15:03:42 -0400 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R571e4;CH=green;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01e07487;MF=yang.shi@linux.alibaba.com;NM=1;PH=DS;RN=7;SR=0;TI=SMTPD_---0T68S4jH_1533574401; Received: from US-143344MP.local(mailfrom:yang.shi@linux.alibaba.com fp:SMTPD_---0T68S4jH_1533574401) by smtp.aliyun-inc.com(127.0.0.1); Tue, 07 Aug 2018 00:53:24 +0800 Subject: Re: [RFC v6 PATCH 1/2] mm: refactor do_munmap() to extract the common part To: Michal Hocko Cc: willy@infradead.org, ldufour@linux.vnet.ibm.com, kirill@shutemov.name, akpm@linux-foundation.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org References: <1532628614-111702-1-git-send-email-yang.shi@linux.alibaba.com> <1532628614-111702-2-git-send-email-yang.shi@linux.alibaba.com> <20180803085335.GH27245@dhcp22.suse.cz> <7b84088a-4e49-ed7c-e750-7aba5cc17f11@linux.alibaba.com> <20180806132657.GB22858@dhcp22.suse.cz> From: Yang Shi Message-ID: <6af33f8f-042a-888f-2dad-6023fa5533f0@linux.alibaba.com> Date: Mon, 6 Aug 2018 09:53:13 -0700 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 MIME-Version: 1.0 In-Reply-To: <20180806132657.GB22858@dhcp22.suse.cz> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 8/6/18 6:26 AM, Michal Hocko wrote: > On Fri 03-08-18 13:47:19, Yang Shi wrote: >> >> On 8/3/18 1:53 AM, Michal Hocko wrote: >>> On Fri 27-07-18 02:10:13, Yang Shi wrote: >>>> Introduces three new helper functions: >>>> * munmap_addr_sanity() >>>> * munmap_lookup_vma() >>>> * munmap_mlock_vma() >>>> >>>> They will be used by do_munmap() and the new do_munmap with zapping >>>> large mapping early in the later patch. >>>> >>>> There is no functional change, just code refactor. >>>> >>>> Reviewed-by: Laurent Dufour >>>> Signed-off-by: Yang Shi >>>> --- >>>> mm/mmap.c | 120 ++++++++++++++++++++++++++++++++++++++++++-------------------- >>>> 1 file changed, 82 insertions(+), 38 deletions(-) >>>> >>>> diff --git a/mm/mmap.c b/mm/mmap.c >>>> index d1eb87e..2504094 100644 >>>> --- a/mm/mmap.c >>>> +++ b/mm/mmap.c >>>> @@ -2686,34 +2686,44 @@ int split_vma(struct mm_struct *mm, struct vm_area_struct *vma, >>>> return __split_vma(mm, vma, addr, new_below); >>>> } >>>> -/* Munmap is split into 2 main parts -- this part which finds >>>> - * what needs doing, and the areas themselves, which do the >>>> - * work. This now handles partial unmappings. >>>> - * Jeremy Fitzhardinge >>>> - */ >>>> -int do_munmap(struct mm_struct *mm, unsigned long start, size_t len, >>>> - struct list_head *uf) >>>> +static inline bool munmap_addr_sanity(unsigned long start, size_t len) >>> munmap_check_addr? Btw. why does this need to have munmap prefix at all? >>> This is a general address space check. >> Just because I extracted this from do_munmap, no special consideration. It >> is definitely ok to use another name. >> >>>> { >>>> - unsigned long end; >>>> - struct vm_area_struct *vma, *prev, *last; >>>> - >>>> if ((offset_in_page(start)) || start > TASK_SIZE || len > TASK_SIZE-start) >>>> - return -EINVAL; >>>> + return false; >>>> - len = PAGE_ALIGN(len); >>>> - if (len == 0) >>>> - return -EINVAL; >>>> + if (PAGE_ALIGN(len) == 0) >>>> + return false; >>>> + >>>> + return true; >>>> +} >>>> + >>>> +/* >>>> + * munmap_lookup_vma: find the first overlap vma and split overlap vmas. >>>> + * @mm: mm_struct >>>> + * @vma: the first overlapping vma >>>> + * @prev: vma's prev >>>> + * @start: start address >>>> + * @end: end address >>> This really doesn't help me to understand how to use the function. >>> Why do we need both prev and vma etc... >> prev will be used by unmap_region later. > But what does it stand for? Why cannot you take prev from the returned > vma? In other words, if somebody reads this documentation how does he > know what the prev is supposed to be used for? > >>>> + * >>>> + * returns 1 if successful, 0 or errno otherwise >>> This is a really weird calling convention. So what does 0 tell? /me >>> checks the code. Ohh, it is nothing to do. Why cannot you simply return >>> the vma. NULL implies nothing to do, ERR_PTR on error. >> A couple of reasons why it is implemented as so: >> >>     * do_munmap returns 0 for both success and no suitable vma >> >>     * Since prev is needed by finding the start vma, and prev will be used >> by unmap_region later too, so I just thought it would look clean to have one >> function to return both start vma and prev. In this way, we can share as >> much as possible common code. >> >>     * In this way, we just need return 0, 1 or error no just as same as what >> do_munmap does currently. Then we know what is failure case exactly to just >> bail out right away. >> >> Actually, I tried the same approach as you suggested, but it had two >> problems: >> >>     * If it returns the start vma, we have to re-find its prev later, but >> the prev has been found during finding start vma. And, duplicate the code in >> do_munmap_zap_rlock. It sounds not that ideal. >> >>     * If it returns prev, it might be null (start vma is the first vma). We >> can't tell if null is a failure or success case > Even if you need to return both vma and prev then it would be better to > simply return vma directly than having this -errno, 0 or 1 return > semantic. OK, I will try to refactor the code. Thanks, Yang