Received: by 2002:a05:6a10:16a7:0:0:0:0 with SMTP id gp39csp1367384pxb; Fri, 13 Nov 2020 10:47:22 -0800 (PST) X-Google-Smtp-Source: ABdhPJwOzWQLnLPVuUNqlyrftW55dvZUNnEoqhCoRz5LRyLc130ybK54hkOmuUjfcCIP4geB/3Xh X-Received: by 2002:a17:906:5f89:: with SMTP id a9mr3255683eju.262.1605293242055; Fri, 13 Nov 2020 10:47:22 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1605293242; cv=none; d=google.com; s=arc-20160816; b=GE9g3B2ikvdSEKck+2B/xOut/e73pQc+6AroFIiprBKakVDj4Sf/Vl1hXUOKxfdQvJ 2okx85pEGcMtQ5XBNYmjxXQZqYuoWrySTwag2F1QWfq86Y6yGzl7Td8RT1814rZncr0x LXjmoHg5beyNfqel49LfnK06QF8X6Iw3uQYyS6Gz+FMKnylaGaa/Ln1hXf1w0nUEy5R4 iL4IYh/zrgciIJ3lWX++ZeNvQU3qtKji5x8YLfN4TK+Yp7PUt6GFPZC7fH0kFpBKBSxn Ptn54YX8fyxr8/BowoZwNDKZIU7ihCI12NCo5rWS27clXEgtBZcxUC12PrNMYIQcmLzh N04g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=cTJlJ+7xtu0sUuJYplh5pdrvcoxPXi2kVPalslnbuBo=; b=Gp7lCAi8I7MVD7yQ888pGE0Omb+IbAX77zQLiN2n+wGdfCHcnhVqnZ6ui869g72mZ3 2QGmPWtYtfTxRehDlMWmhVXBWmZhG06I+MKHYnWeO9m9jGWeSQvdt1EBEpMNFFLFhczc LDUVP4Qr4GvnbjXmhg2p6GcB9d1ntFSFto2V93OvrYRIscuG/RnNFYyTAJJmReaJWYXN CTh2/IB1vAxWzqfzP9msFY6Tsb58JCNRR2xZIX01GkToDT8MM+rxvxf/rtbr0MBzEwR0 WUMz5u7nTeXAdtVJOoh8YBz9OjbsUUxS2FADQ/9UPh+hjlQGpmryZVNkQ7hNHzdvCyw4 n14A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@sargun.me header.s=google header.b="MXdK/hBF"; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id cc22si7303984edb.558.2020.11.13.10.46.46; Fri, 13 Nov 2020 10:47:22 -0800 (PST) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@sargun.me header.s=google header.b="MXdK/hBF"; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726239AbgKMSqo (ORCPT + 99 others); Fri, 13 Nov 2020 13:46:44 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43348 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726199AbgKMSqn (ORCPT ); Fri, 13 Nov 2020 13:46:43 -0500 Received: from mail-il1-x143.google.com (mail-il1-x143.google.com [IPv6:2607:f8b0:4864:20::143]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 8C0C4C0617A6 for ; Fri, 13 Nov 2020 10:46:43 -0800 (PST) Received: by mail-il1-x143.google.com with SMTP id h6so5732878ilj.8 for ; Fri, 13 Nov 2020 10:46:43 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sargun.me; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=cTJlJ+7xtu0sUuJYplh5pdrvcoxPXi2kVPalslnbuBo=; b=MXdK/hBFVZU9sIyi3jdmY6UGXxmLRQ+WPtbV+AuWQVNdIgqdvtprYRVMYDUfArJhYY V0/yWQy7fmHZh04SVLKUZwm4tfOG47Hfd1Y/w77U3XwqOFW5FmEBxhhI6GZ9BqgveJby t5bXvkXnG6oH4Zddp+WDRbVHtJO7TGT/wWc9s= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=cTJlJ+7xtu0sUuJYplh5pdrvcoxPXi2kVPalslnbuBo=; b=FaqsRY9A5Y19VAttY+dUcIDBSWZqEeeUa/L6RKxT2mlOQLPdbhSzMY/S55RONTQxnk 2WTaujrEwCW0TmMaL72IS9gT0iPinU9O2rG/Vn5C5drt23luJC+ZtAG1WGFFfTBKSWii 49pfCCB5sZQXp6RGDED9D7sIcZD2G/kMc65KhRRPzc5h7GiXP9iPRgEs+wpvdc9+1AVB OtZrcbL9Ys3l7XiHQ0+uhpfzq6yBiVU6XNmU0YUNhQ2PpuaYgOXD30DNiJi62lbeYJDz hGFaNzeLk3iWI/HMdDrj3gClJPf+7rwUD2xApAaxIwhwBYIaicVrbgJ8kFtgbbreDOv8 XMBQ== X-Gm-Message-State: AOAM530w6bBlDc3GdnE3J0evwZbMYDZ3cPcm9sTvWy7djB0lcRT1yY5n BUJBW2JFzRXKbQFUNTkQ/jrCNw== X-Received: by 2002:a92:6706:: with SMTP id b6mr975121ilc.42.1605293202608; Fri, 13 Nov 2020 10:46:42 -0800 (PST) Received: from ircssh-2.c.rugged-nimbus-611.internal (80.60.198.104.bc.googleusercontent.com. [104.198.60.80]) by smtp.gmail.com with ESMTPSA id k28sm1099897ilg.40.2020.11.13.10.46.42 (version=TLS1_2 cipher=ECDHE-ECDSA-CHACHA20-POLY1305 bits=256/256); Fri, 13 Nov 2020 10:46:42 -0800 (PST) Date: Fri, 13 Nov 2020 18:46:40 +0000 From: Sargun Dhillon To: "J . Bruce Fields" , Chuck Lever , Trond Myklebust , Anna Schumaker , David Howells , Scott Mayhew Cc: mauricio@kinvolk.io, Alban Crequy , linux-nfs@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Kyle Anderson Subject: Re: [PATCH v5 0/2] NFS: Fix interaction between fs_context and user namespaces Message-ID: <20201113184640.GA29286@ircssh-2.c.rugged-nimbus-611.internal> References: <20201112100952.3514-1-sargun@sargun.me> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20201112100952.3514-1-sargun@sargun.me> User-Agent: Mutt/1.9.4 (2018-02-28) Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org On Thu, Nov 12, 2020 at 02:09:50AM -0800, Sargun Dhillon wrote: > Right now, it is possible to mount NFS with an non-matching super block > user ns, and NFS sunrpc user ns. This (for the user) results in an awkward > set of interactions if using anything other than auth_null, where the UIDs > being sent to the server are different than the local UIDs being checked. > This can cause "breakage", where if you try to communicate with the NFS > server with any other set of mappings, it breaks. > > The reason for this is that you can call fsopen("nfs4") in the unprivileged > namespace, and that configures fs_context with all the right information > for that user namespace. In addition, it also keeps a gets a cred object > associated with the caller -- which should match the user namespace. > Unfortunately, the mount has to be finished in the init_user_ns because we > currently require CAP_SYS_ADMIN in the init user namespace to call fsmount. > This means that the superblock's user namespace is set "correctly" to the > container, but there's absolutely no way nfs4idmap to consume an > unprivileged user namespace because the cred / user_ns that's passed down > to nfs4idmap is the one at fsmount. > > How this actually exhibits is let's say that the UID 0 in the user > namespace is mapped to UID 1000 in the init user ns (and kuid space). What > will happen is that nfs4idmap will translate the UID 1000 into UID 0 on the > wire, even if the mount is in entirely in the mount / user namespace of the > container. > > So, it looks something like this > Client in unprivileged User NS (UID: 0, KUID: 0) > ->Perform open() > ...VFS / NFS bits... > nfs_map_uid_to_name -> > from_kuid_munged(init_user_ns, uid) (returns 0) > RPC with UID 0 > > This behaviour happens "the other way" as well, where the UID in the > container may be 0, but the corresponding kuid is 1000. When a response > from an NFS server comes in we decode it according to the idmap userns. > The way this exhibits is even more odd. > > Server responds with file attribute (UID: 0, GID: 0) > ->nfs_map_name_to_uid(..., 0) > ->make_kuid(init_user_ns, id) (returns 0) > ....VFS / NFS Bits... > ->from_kuid(container_ns, 0) -> invalid uid > -> EOVERFLOW > > This changes the nfs server to use the cred / userns from fs_context, which > is how idmap is constructed. This subsequently is used in the above > described flow of converting uids back-and-forth. > > Trond gave the feedback that this behaviour [implemented by this patch] is > how the legacy sys_mount() behaviour worked[1], and that the intended > behaviour is for UIDs to be plumbed through entirely, where the user > namespaces UIDs are what is sent over the wire, and not the init user ns. > > [1]: https://lore.kernel.org/linux-nfs/8feccf45f6575a204da03e796391cc135283eb88.camel@hammerspace.com/ > > Sargun Dhillon (2): > NFS: NFSv2/NFSv3: Use cred from fs_context during mount > NFSv4: Refactor to use user namespaces for nfs4idmap > > fs/nfs/client.c | 4 ++-- > fs/nfs/nfs4client.c | 2 +- > 2 files changed, 3 insertions(+), 3 deletions(-) > > > base-commit: 8c39076c276be0b31982e44654e2c2357473258a > -- > 2.25.1 > Trond, I was just thinking, since you said that this is the behaviour of the sys_mount API, would this be considered a regression? Should it go to stable (v5.9)?