Received: by 2002:ac0:a581:0:0:0:0:0 with SMTP id m1-v6csp970121imm; Wed, 20 Jun 2018 09:25:57 -0700 (PDT) X-Google-Smtp-Source: ADUXVKKTxLegddEzrsOXfKhOAWwVTrcyyJnRXzJEsLIcwAAvo3oGSbVq4QMAHV7cVBhgKF0aTPdi X-Received: by 2002:a63:a44a:: with SMTP id c10-v6mr19104483pgp.198.1529511957130; Wed, 20 Jun 2018 09:25:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1529511957; cv=none; d=google.com; s=arc-20160816; b=h0Mnny3i5oHHAWxCsn7z4XXbiZzB5nzh92APCH/bJ+vOd/L1YeV4FKRh8djisVauHC rFezk8gxrYQajQlUc/x6RJJ37j6X8IaHmJ/IG49/nz6mtH3pR9FELAptWnzpeDY95fMu zRcAo68J4rVPN7bVFn6wWZIo/HFRviVVhQpg0ReqiCUHjJN4rEvt77Gf1wTnumXSY3Pk Hp2kuuV7Bb87T2t84UuKW/oakk0IepFQUTdAhSvLxnELoHjZB0PccKHpw6zwFuD1tYpp y4YeKAnYru6Ta8eBj/RQAyRAisTy/31zRyzDUBSmz3Li1rMUAZrXYFxPPQ/zoIhkQY4R vkww== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=oeunq/k3kooaYVhMyyojPcWWyAGZnqjTa6q7v3ky+wo=; b=eEFW+eyh6pLbIalrWfEW/W6DTc/1sSTFO3Sq8h5axbqqFiDGGXeyUaYcxGnmJLjXRW /8e0dJ0KPfFi/RCKiAnUOWarDEzU48YVg8isbZoVbv7xI6+YSksOb5Uz+G+k2rPVxJIG 9t78cDXAKqH5YDFaYREoCug0FDksCX575WrHA6nIidmQK7HUpRhV6lTfyMaIiqRpPg05 RixuBKVHqaltTxJg93Hz/OyYB9/ZMDBPtw1ZLvu3LNQYe2k0UF9RF3d9tuhsAMkjr8Sl iFMR1+nLoV6dXhSeKfUGPscj9sZY70RTr3SEpQa7cw9WTLb7Y3yTL9NhJhFk0MAA6SC8 SElQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e12-v6si2177377pgn.171.2018.06.20.09.25.42; Wed, 20 Jun 2018 09:25:57 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754335AbeFTQXg (ORCPT + 99 others); Wed, 20 Jun 2018 12:23:36 -0400 Received: from out30-132.freemail.mail.aliyun.com ([115.124.30.132]:48500 "EHLO out30-132.freemail.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754065AbeFTQXf (ORCPT ); Wed, 20 Jun 2018 12:23:35 -0400 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R201e4;CH=green;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01f04446;MF=yang.shi@linux.alibaba.com;NM=1;PH=DS;RN=12;SR=0;TI=SMTPD_---0T33Bl7A_1529511799; Received: from US-143344MP.local(mailfrom:yang.shi@linux.alibaba.com fp:SMTPD_---0T33Bl7A_1529511799) by smtp.aliyun-inc.com(127.0.0.1); Thu, 21 Jun 2018 00:23:26 +0800 Subject: Re: [RFC v2 PATCH 2/2] mm: mmap: zap pages with read mmap_sem for large mapping To: Michal Hocko Cc: Peter Zijlstra , willy@infradead.org, ldufour@linux.vnet.ibm.com, akpm@linux-foundation.org, mingo@redhat.com, acme@kernel.org, alexander.shishkin@linux.intel.com, jolsa@redhat.com, namhyung@kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org References: <1529364856-49589-1-git-send-email-yang.shi@linux.alibaba.com> <1529364856-49589-3-git-send-email-yang.shi@linux.alibaba.com> <20180619100218.GN2458@hirez.programming.kicks-ass.net> <20180620071708.GI13685@dhcp22.suse.cz> From: Yang Shi Message-ID: <41456a0f-0091-dfdb-952b-9bf08b323ba6@linux.alibaba.com> Date: Wed, 20 Jun 2018 09:23:17 -0700 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 MIME-Version: 1.0 In-Reply-To: <20180620071708.GI13685@dhcp22.suse.cz> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 6/20/18 12:17 AM, Michal Hocko wrote: > On Tue 19-06-18 14:13:05, Yang Shi wrote: >> >> On 6/19/18 3:02 AM, Peter Zijlstra wrote: > [...] >>> Hold up, two things: you having to copy most of do_munmap() didn't seem >>> to suggest a helper function? And second, since when are we allowed to >> Yes, they will be extracted into a helper function in the next version. >> >> May bad, I don't think it is allowed. We could reform this to: >> >> acquire write mmap_sem >> vma lookup (split vmas) >> release write mmap_sem >> >> acquire read mmap_sem >> zap pages >> release read mmap_sem >> >> I'm supposed this is safe as what Michal said before. > I didn't get to read your patches carefully yet but I am wondering why > do you need to split in the first place. Why cannot you simply unmap the > range (madvise(DONTNEED)) under the read lock and then take the lock for > write to finish the rest? Yes, we can. I just thought splitting vma up-front sounds more straight forward. But, I neglected the write mmap_sem issue. Will move the vma split into later write mmap_sem in the next version. Thanks, Yang