Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp278002yba; Wed, 24 Apr 2019 00:39:39 -0700 (PDT) X-Google-Smtp-Source: APXvYqzPxsqRHjjxG3XQnEj9LTabUpMiqa+o4z14pJrOaTiBWJwQjyr5JeIcoa+X1sKNjfmPRjGV X-Received: by 2002:a63:ea52:: with SMTP id l18mr15485062pgk.124.1556091579468; Wed, 24 Apr 2019 00:39:39 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1556091579; cv=none; d=google.com; s=arc-20160816; b=MBvlx2Rpj2Yefhyw2SER9zkEfA5G0T9T5qcK3Q5BG8P6KLIJ/eUalcx0wfXqSRDf3z zzxwisQp/wkh+L0Y3feZfxHTjtI7QOJJny8qbYmDTKxJTUU7GmjhBT7FeF6fxj//IEPo k+5rRaBza2+4xYYg9WKVTnzRrAUcGp6XR+ARH8lqZeJLNmIk9v3l3ub7cVdgLTFX1yjH 4hIKEYhbi2V+iYLzv3Lg1jIUXjnJz/VFgujCl7iPTiiyzFc6gqUrC74tR5SwPcir8Jyz qZMxctLkwDvWPMiFYGK9bim2f3EnP31OYHP1lZZ4iJoTIcbKVW4KsIr6Ib0ZOxXSB6OY /nsw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date:from :references:cc:to:subject; bh=PGIeyyW0aHT7wAY531ZRN0buOrMMO3XQFst6xyKzV4c=; b=VdX3EMaBTKcQDAID9LU8XVxOpqG828t0vYLfy4eog8qJg8ccF0Skobm0EdUjRJWMu2 bicO7pcVxB4Yv9YgKI0y/+h7GXXE7rEubIyUH8lYlAv/eS0b9SgbEsgM2s/FznY/HWrK J39sfPyaPjcQKe05JMU3Si4w9dw/d+gU5BW1rJZpxk04M5AhNWofmWFlFmFRBTiZtPai Jh2X4RAEQJzKA24ghVFoMTMIycJq/cMveQb12JPvsXUoTQEq64zBQ2+hwBH7G98fyVvG y6CKeHA/ZxdsTaJSAcFvumZndP7ZW50xCcJuF+VfWWAb1An2s/8NiMXlfKWDmg8gW/uy KO0g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 10si9551175pgp.481.2019.04.24.00.39.23; Wed, 24 Apr 2019 00:39:39 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730222AbfDXHeD (ORCPT + 99 others); Wed, 24 Apr 2019 03:34:03 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:35566 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1729381AbfDXHeC (ORCPT ); Wed, 24 Apr 2019 03:34:02 -0400 Received: from pps.filterd (m0098417.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.27/8.16.0.27) with SMTP id x3O7VJsq120182 for ; Wed, 24 Apr 2019 03:34:01 -0400 Received: from e06smtp04.uk.ibm.com (e06smtp04.uk.ibm.com [195.75.94.100]) by mx0a-001b2d01.pphosted.com with ESMTP id 2s2j58m383-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Wed, 24 Apr 2019 03:34:01 -0400 Received: from localhost by e06smtp04.uk.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Wed, 24 Apr 2019 08:33:59 +0100 Received: from b06cxnps3074.portsmouth.uk.ibm.com (9.149.109.194) by e06smtp04.uk.ibm.com (192.168.101.134) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Wed, 24 Apr 2019 08:33:48 +0100 Received: from d06av23.portsmouth.uk.ibm.com (d06av23.portsmouth.uk.ibm.com [9.149.105.59]) by b06cxnps3074.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id x3O7XluQ58458312 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 24 Apr 2019 07:33:47 GMT Received: from d06av23.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 351E1A4053; Wed, 24 Apr 2019 07:33:47 +0000 (GMT) Received: from d06av23.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id C72CFA4059; Wed, 24 Apr 2019 07:33:44 +0000 (GMT) Received: from [9.145.184.124] (unknown [9.145.184.124]) by d06av23.portsmouth.uk.ibm.com (Postfix) with ESMTP; Wed, 24 Apr 2019 07:33:44 +0000 (GMT) Subject: Re: [PATCH v12 00/31] Speculative page faults To: Peter Zijlstra , Michel Lespinasse Cc: Andrew Morton , Michal Hocko , "Kirill A. Shutemov" , Andi Kleen , dave@stgolabs.net, Jan Kara , Matthew Wilcox , aneesh.kumar@linux.ibm.com, Benjamin Herrenschmidt , mpe@ellerman.id.au, Paul Mackerras , Thomas Gleixner , Ingo Molnar , "H. Peter Anvin" , Will Deacon , Sergey Senozhatsky , sergey.senozhatsky.work@gmail.com, Andrea Arcangeli , Alexei Starovoitov , kemi.wang@intel.com, Daniel Jordan , David Rientjes , Jerome Glisse , Ganesh Mahendran , Minchan Kim , Punit Agrawal , vinayak menon , Yang Shi , zhong jiang , Haiyan Song , Balbir Singh , sj38.park@gmail.com, Mike Rapoport , LKML , linux-mm , haren@linux.vnet.ibm.com, Nick Piggin , "Paul E. McKenney" , Tim Chen , linuxppc-dev@lists.ozlabs.org, x86@kernel.org References: <20190416134522.17540-1-ldufour@linux.ibm.com> <20190423093851.GJ11158@hirez.programming.kicks-ass.net> From: Laurent Dufour Date: Wed, 24 Apr 2019 09:33:44 +0200 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.14; rv:60.0) Gecko/20100101 Thunderbird/60.6.1 MIME-Version: 1.0 In-Reply-To: <20190423093851.GJ11158@hirez.programming.kicks-ass.net> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit X-TM-AS-GCONF: 00 x-cbid: 19042407-0016-0000-0000-00000272C3E4 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 19042407-0017-0000-0000-000032CF3377 Message-Id: <05df6720-7130-62fe-a71f-074b6fafff3e@linux.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:,, definitions=2019-04-24_05:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1810050000 definitions=main-1904240065 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Le 23/04/2019 à 11:38, Peter Zijlstra a écrit : > On Mon, Apr 22, 2019 at 02:29:16PM -0700, Michel Lespinasse wrote: >> The proposed spf mechanism only handles anon vmas. Is there a >> fundamental reason why it couldn't handle mapped files too ? >> My understanding is that the mechanism of verifying the vma after >> taking back the ptl at the end of the fault would work there too ? >> The file has to stay referenced during the fault, but holding the vma's >> refcount could be made to cover that ? the vm_file refcount would have >> to be released in __free_vma() instead of remove_vma; I'm not quite sure >> if that has more implications than I realize ? > > IIRC (and I really don't remember all that much) the trickiest bit was > vs unmount. Since files can stay open past the 'expected' duration, > umount could be delayed. > > But yes, I think I had a version that did all that just 'fine'. Like > mentioned, I didn't keep the refcount because it sucked just as hard as > the mmap_sem contention, but the SRCU callback did the fput() just fine > (esp. now that we have delayed_fput). I had to use a refcount for the VMA because I'm using RCU in place of SRCU and only protecting the RB tree using RCU. Regarding the file pointer, I decided to release it synchronously to avoid the latency of RCU during the file closing. As you mentioned this could delayed the umount but not only, as Linus Torvald demonstrated by the past [1]. Anyway, since the file support is not yet here there is no need for that currently. Regarding the file mapping support, the concern is to ensure that vm_ops->fault() will not try to release the mmap_sem. This is true for most of the file system operation using the generic one, but there is currently no clever way to identify that except by checking the vm_ops->fault pointer. Adding a flag to the vm_operations_struct structure is another option. that's doable as far as the underlying fault() function is not dealing with the mmap_sem, and I made a try by the past but was thinking that first the anonymous case should be accepted before moving forward this way. [1] https://lore.kernel.org/linux-mm/alpine.LFD.2.00.1001041904250.3630@localhost.localdomain/