Date: Wed, 31 Oct 2018 04:33:03 -0700
From: Matthew Wilcox
To:
 Peter Zijlstra
Cc: Andy Lutomirski, Kees Cook, Igor Stoppa, Mimi Zohar, Dave Chinner,
 James Morris, Michal Hocko, Kernel Hardening, linux-integrity,
 linux-security-module, Igor Stoppa, Dave Hansen, Jonathan Corbet,
 Laura Abbott, Randy Dunlap, Mike Rapoport, "open list:DOCUMENTATION",
 LKML, Thomas Gleixner
Subject: Re: [PATCH 10/17] prmem: documentation
Message-ID: <20181031113303.GJ10491@bombadil.infradead.org>
References: <20181023213504.28905-11-igor.stoppa@huawei.com>
 <20181026092609.GB3159@worktop.c.hoisthospitality.com>
 <20181028183126.GB744@hirez.programming.kicks-ass.net>
 <40cd77ce-f234-3213-f3cb-0c3137c5e201@gmail.com>
 <20181030152641.GE8177@hirez.programming.kicks-ass.net>
 <0A7AFB50-9ADE-4E12-B541-EC7839223B65@amacapital.net>
 <20181030175814.GB10491@bombadil.infradead.org>
 <20181031093659.GL744@hirez.programming.kicks-ass.net>
In-Reply-To: <20181031093659.GL744@hirez.programming.kicks-ass.net>

On Wed, Oct 31, 2018 at 10:36:59AM +0100, Peter Zijlstra wrote:
> On Tue, Oct 30, 2018 at 10:58:14AM -0700, Matthew Wilcox wrote:
> > On Tue, Oct 30, 2018 at 10:06:51AM -0700, Andy Lutomirski wrote:
> > > > On Oct 30, 2018, at 9:37 AM, Kees Cook wrote:
> > > I support the addition of a rare-write mechanism to the upstream kernel.
> > > And I think that there is only one sane way to implement it: using an
> > > mm_struct. That mm_struct, just like any sane mm_struct, should only
> > > differ from init_mm in that it has extra mappings in the *user* region.
> >
> > I'd like to understand this approach a little better. In a syscall path,
> > we run with the user task's mm.
> > What you're proposing is that when we
> > want to modify rare data, we switch to rare_mm which contains a
> > writable mapping to all the kernel data which is rare-write.
> >
> > So the API might look something like this:
> >
> >     void *p = rare_alloc(...);  /* writable pointer */
> >     p->a = x;
> >     q = rare_protect(p);        /* read-only pointer */
> >
> > To subsequently modify q,
> >
> >     p = rare_modify(q);
> >     p->a = y;
> >     rare_protect(p);
>
> Why would you have rare_alloc() imply rare_modify() ? Would you have the
> allocator meta data inside the rare section?

Normally when I allocate some memory I need to initialise it before
doing anything else with it ;-)

I mean, you could do:

    ro = rare_alloc(..);
    rare = rare_modify(ro);
    rare->a = x;
    rare_protect(rare);

but that's more typing.
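
For readers who want to poke at the shape of this API outside the kernel, here is a rough userspace sketch. It is only an analogy and rests on assumptions: the rare_alloc(), rare_protect() and rare_modify() bodies below are hypothetical stand-ins, not the prmem patch or any existing kernel interface, and they flip page permissions on a single mapping with mprotect() instead of switching to a separate mm that holds a writable alias. As a result the writable and read-only pointers share one address here, which would not be true of the rare_mm design discussed above. struct conf and rare_demo() are likewise made up for illustration.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <stddef.h>
#include <sys/mman.h>
#include <unistd.h>

/* Hypothetical payload; stands in for any rarely written structure. */
struct conf {
    int a;
};

/* One page per object keeps the sketch simple. */
static size_t rare_size(void)
{
    return (size_t)sysconf(_SC_PAGESIZE);
}

/* Hypothetical: allocate a page and return it writable, so the caller
 * can initialise it before protecting it. Returns NULL on failure. */
static void *rare_alloc(void)
{
    void *p = mmap(NULL, rare_size(), PROT_READ | PROT_WRITE,
                   MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    return p == MAP_FAILED ? NULL : p;
}

/* Hypothetical: drop write permission and return the read-only view.
 * Assumption: same address as the writable view, unlike a rare_mm alias. */
static const void *rare_protect(void *p)
{
    return mprotect(p, rare_size(), PROT_READ) == 0 ? p : NULL;
}

/* Hypothetical: make the page writable again and return a writable pointer. */
static void *rare_modify(const void *q)
{
    void *p = (void *)q;
    return mprotect(p, rare_size(), PROT_READ | PROT_WRITE) == 0 ? p : NULL;
}

/* Walks through the sequence from the mail: allocate writable, initialise,
 * protect, then reopen for one rare write. Returns the final value of a,
 * or -1 if any step fails. */
static int rare_demo(void)
{
    struct conf *p = rare_alloc();              /* writable pointer */
    if (!p)
        return -1;
    p->a = 1;                                   /* initialise first */

    const struct conf *q = rare_protect(p);     /* read-only pointer */
    if (!q)
        return -1;

    p = rare_modify(q);                         /* reopen for writing */
    if (!p)
        return -1;
    p->a = 2;
    q = rare_protect(p);
    if (!q)
        return -1;

    int result = q->a;                          /* reads are still allowed */
    munmap((void *)q, rare_size());
    return result;
}
```

Calling rare_demo() from a main() should return the updated value; writing through q between rare_protect() and rare_modify() would fault. Having rare_alloc() hand back a read-only pointer instead would only add one more rare_modify() call before the initialisation step.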