Received: by 2002:ac0:da4c:0:0:0:0:0 with SMTP id a12csp47240imi; Wed, 20 Jul 2022 16:47:49 -0700 (PDT) X-Google-Smtp-Source: AGRyM1vO1FH7eDMv24qOFqbkSZcgC7UwLFI6HgxVLXkTFnyRMadKiCOklfmErf577iwCtmdn+1vD X-Received: by 2002:a05:6402:254c:b0:43a:9e77:3b29 with SMTP id l12-20020a056402254c00b0043a9e773b29mr53399044edb.356.1658360868838; Wed, 20 Jul 2022 16:47:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1658360868; cv=none; d=google.com; s=arc-20160816; b=qlba65z2MZGeGEEmdH0z/Q7UfzAJymar5E9Kb7i84y1H9+F4uV0RRCjPv4A6BjWTd5 UW1gejb6RrwshlZfXojx7ax6Zht2LgxN03GxO8XcTdOZMxyr4QKCs6bZPACfVpbpz3vf 3pFkXmN5D602PCXKQWk0/Nt/uBvwbmxqbB4L32+kqvKQHlQeECu2jhryHD1MmuMhv+/D NGccBlLvcZrbrp4w0QtkA1qOuT8wDA+wQSoCKZvQ8Hen3bIe3RYvGUCMlzdIdBgm+/PD bBat5ElT+owT6j/MbK8Vcf5toDd31U+/2MOEwMPZGqocOmUOl1WXCqzZ+OIyiNQN4syK tY8Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=m1T4qbDxhSzNmOGzCQwvKdvIhsg9ftO1mwgvo87Nrw8=; b=YddZeaeUGtBk93CvKMPd35MF0KkjJ64Qudy1yJ0vjxm5fZ/SWnqv1mipyoKCrdtYah guHKKseNiqMo+QqcZJOkh3lOBdiMlBGaa8aarcYaXwJ3nSYDnf9/deJ+s+GjQyO9woFt QJJ+b8b9NbcceHQ3hp4kLCiAPXc1CieU/kobSLDVPGyIzTF4Fe5Syj/PnOrbyUhftjpe yz+moH0Ds12k1ohSY0vspIt73z41rt5pOJdi67GWlQDNvvOwoRR8F0o8IZVimEGBVzJm RjPuL9AEBbYRFLzgeJkwmnIklh5o2fFTPe8W33T4ydya5dFr/8UFbhqZTriag4tIWvuU +X4Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=bhybBTw1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id l20-20020a056402255400b0043a91cd3039si427033edb.628.2022.07.20.16.47.24; Wed, 20 Jul 2022 16:47:48 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=bhybBTw1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231387AbiGTXF0 (ORCPT + 99 others); Wed, 20 Jul 2022 19:05:26 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54696 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231389AbiGTXFW (ORCPT ); Wed, 20 Jul 2022 19:05:22 -0400 Received: from mail-io1-xd2f.google.com (mail-io1-xd2f.google.com [IPv6:2607:f8b0:4864:20::d2f]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6C577A191 for ; Wed, 20 Jul 2022 16:05:20 -0700 (PDT) Received: by mail-io1-xd2f.google.com with SMTP id h145so81806iof.9 for ; Wed, 20 Jul 2022 16:05:20 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=m1T4qbDxhSzNmOGzCQwvKdvIhsg9ftO1mwgvo87Nrw8=; b=bhybBTw1im+DhwPHQWv+PzdM9trpgtKREb+kM6QNvHD05jCfagtV+lxooUJ731yakd AtmTTjzC8ckolKZiHiuVoq5DcUcI7f0G3HBUlfBmed4XPivrb/aOGboWYwHTinfh/++2 KRREJxhfTSMu6dTk41fjcd+Bqsc5KPFtBHWAbuwg8IPFwwXIE//5nOy4HCjcrxn47h40 mLXHvQIlkmaXgPe8ZDzmvua0rHVH5BFWYJyCkSblCjWPH1/rObx2gAxsel7HdAEjQ3FR fmrAxAKRzPK/ulKi/nMXeHIsh58mE9P7jvbmC3e7oo0ga1aE+dxNq4jKi8+XsEwstDfN YTIw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=m1T4qbDxhSzNmOGzCQwvKdvIhsg9ftO1mwgvo87Nrw8=; b=ixzfURQ7P++xfis3s2NuqO+x/glgZmw07oxU2jqioMo41Gmzqz6e9N5wqQvMQh5dCy CuXjOrNeRhALvcU9pHegqgHq+2ABlhIE+/NyJ1CKZMNut2h8NhlelEwDmTeP7gz/VDIj zsmgu3eJyyGOO9uFMqjis2sv03aMVWkqlfsOPO+y6pd+2x/LmAwUqAjYhmwkxoJUZmZS 96GfDYQ7sOq5dGs2cJinQZwgBqgxk+IzJ6NHT54tWkFylzt/Su8NA7jtNDDt9jaVZTT0 HndFd/Bs+hoKoiTFo3ErqejdFNxxi/9zVxJ8I7r7Ny1nJrmRWPHDBGHb0A36gPKbd4dD JdMg== X-Gm-Message-State: AJIora92YF4KG6T4riu6vuUvQlsq9T/lAkj4VCll0ixZtmwSyw6wix3z fTEzW0NsXnTuvJLmYAIkavb4Ct6dkJrZbjW+7sM30A== X-Received: by 2002:a05:6602:2f03:b0:678:9c7c:97a5 with SMTP id q3-20020a0566022f0300b006789c7c97a5mr18761335iow.32.1658358319643; Wed, 20 Jul 2022 16:05:19 -0700 (PDT) MIME-Version: 1.0 References: <20220719195628.3415852-1-axelrasmussen@google.com> In-Reply-To: From: Axel Rasmussen Date: Wed, 20 Jul 2022 16:04:43 -0700 Message-ID: Subject: Re: [PATCH v4 0/5] userfaultfd: add /dev/userfaultfd for fine grained access control To: "Schaufler, Casey" Cc: Alexander Viro , Andrew Morton , Dave Hansen , "Dmitry V . Levin" , Gleb Fotengauer-Malinovskiy , Hugh Dickins , Jan Kara , Jonathan Corbet , Mel Gorman , Mike Kravetz , Mike Rapoport , "Amit, Nadav" , Peter Xu , Shuah Khan , Suren Baghdasaryan , Vlastimil Babka , zhangyi , "linux-doc@vger.kernel.org" , "linux-fsdevel@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "linux-mm@kvack.org" , "linux-kselftest@vger.kernel.org" , Andrea Arcangeli Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-17.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, USER_IN_DEF_DKIM_WL,USER_IN_DEF_SPF_WL autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jul 20, 2022 at 3:16 PM Schaufler, Casey wrote: > > > -----Original Message----- > > From: Axel Rasmussen > > Sent: Tuesday, July 19, 2022 12:56 PM > > To: Alexander Viro ; Andrew Morton > > ; Dave Hansen > > ; Dmitry V . Levin ; Gleb > > Fotengauer-Malinovskiy ; Hugh Dickins > > ; Jan Kara ; Jonathan Corbet > > ; Mel Gorman ; Mike > > Kravetz ; Mike Rapoport ; > > Amit, Nadav ; Peter Xu ; > > Shuah Khan ; Suren Baghdasaryan > > ; Vlastimil Babka ; zhangyi > > > > Cc: Axel Rasmussen ; linux- > > doc@vger.kernel.org; linux-fsdevel@vger.kernel.org; linux- > > kernel@vger.kernel.org; linux-mm@kvack.org; linux- > > kselftest@vger.kernel.org > > Subject: [PATCH v4 0/5] userfaultfd: add /dev/userfaultfd for fine grained > > access control > > I assume that leaving the LSM mailing list off of the CC is purely > accidental. Please, please include us in the next round. Honestly it just hadn't occurred to me, but I'm more than happy to CC it on future revisions. > > > > > This series is based on torvalds/master. > > > > The series is split up like so: > > - Patch 1 is a simple fixup which we should take in any case (even by itself). > > - Patches 2-6 add the feature, configurable selftest support, and docs. > > > > Why not ...? > > ============ > > > > - Why not /proc/[pid]/userfaultfd? The proposed use case for this is for one > > process to open a userfaultfd which can intercept another process' page > > faults. This seems to me like exactly what CAP_SYS_PTRACE is for, though, > > so I > > think this use case can simply use a syscall without the powers > > CAP_SYS_PTRACE > > grants being "too much". > > > > - Why not use a syscall? Access to syscalls is generally controlled by > > capabilities. We don't have a capability which is used for userfaultfd access > > without also granting more / other permissions as well, and adding a new > > capability was rejected [1]. > > > > - It's possible a LSM could be used to control access instead. I suspect > > adding a brand new one just for this would be rejected, > > You won't know if you don't ask. Fair enough - I wonder if MM folks (Andrew, Peter, Nadav especially) would find that approach more palatable than /proc/[pid]/userfaultfd? Would it make sense from our perspective to propose a userfaultfd- or MM-specific LSM for controlling access to certain features? I remember +Andrea saying Red Hat was also interested in some kind of access control mechanism like this. Would one or the other approach be more convenient for you? > > > but I think some > > existing ones like SELinux can be used to filter syscall access. Enabling > > SELinux for large production deployments which don't already use it is > > likely to be a huge undertaking though, and I don't think this use case by > > itself is enough to motivate that kind of architectural change. > > > > Changelog > > ========= > > > > v3->v4: > > - Picked up an Acked-by on 5/5. > > - Updated cover letter to cover "why not ...". > > - Refactored userfaultfd_allowed() into userfaultfd_syscall_allowed(). > > [Peter] > > - Removed obsolete comment from a previous version. [Peter] > > - Refactored userfaultfd_open() in selftest. [Peter] > > - Reworded admin-guide documentation. [Mike, Peter] > > - Squashed 2 commits adding /dev/userfaultfd to selftest and making > > selftest > > configurable. [Peter] > > - Added "syscall" test modifier (the default behavior) to selftest. [Peter] > > > > v2->v3: > > - Rebased onto linux-next/akpm-base, in order to be based on top of the > > run_vmtests.sh refactor which was merged previously. > > - Picked up some Reviewed-by's. > > - Fixed ioctl definition (_IO instead of _IOWR), and stopped using > > compat_ptr_ioctl since it is unneeded for ioctls which don't take a pointer. > > - Removed the "handle_kernel_faults" bool, simplifying the code. The result > > is > > logically equivalent, but simpler. > > - Fixed userfaultfd selftest so it returns KSFT_SKIP appropriately. > > - Reworded documentation per Shuah's feedback on v2. > > - Improved example usage for userfaultfd selftest. > > > > v1->v2: > > - Add documentation update. > > - Test *both* userfaultfd(2) and /dev/userfaultfd via the selftest. > > > > [1]: https://lore.kernel.org/lkml/686276b9-4530-2045-6bd8- > > 170e5943abe4@schaufler-ca.com/T/ > > > > Axel Rasmussen (5): > > selftests: vm: add hugetlb_shared userfaultfd test to run_vmtests.sh > > userfaultfd: add /dev/userfaultfd for fine grained access control > > userfaultfd: selftests: modify selftest to use /dev/userfaultfd > > userfaultfd: update documentation to describe /dev/userfaultfd > > selftests: vm: add /dev/userfaultfd test cases to run_vmtests.sh > > > > Documentation/admin-guide/mm/userfaultfd.rst | 41 +++++++++++- > > Documentation/admin-guide/sysctl/vm.rst | 3 + > > fs/userfaultfd.c | 69 ++++++++++++++++---- > > include/uapi/linux/userfaultfd.h | 4 ++ > > tools/testing/selftests/vm/run_vmtests.sh | 11 +++- > > tools/testing/selftests/vm/userfaultfd.c | 69 +++++++++++++++++--- > > 6 files changed, 169 insertions(+), 28 deletions(-) > > > > -- > > 2.37.0.170.g444d1eabd0-goog >