Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp548624pxj; Tue, 18 May 2021 09:03:05 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxKM5u2maoe7k5qaGKwRu7m6L7MNN34G55Hr8/W31rUxxM47WOVANV8KhfzaO1O7mjFgzUG X-Received: by 2002:aa7:c7d5:: with SMTP id o21mr8033406eds.166.1621353784837; Tue, 18 May 2021 09:03:04 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1621353784; cv=none; d=google.com; s=arc-20160816; b=KMLOUw+KG31Pn/ctGgDNEt8Xu9GEnX7lbMg5LujIRCwR0xTvK5qKOgOLetUs414Q0i xxNC2ZigvSvtDIT/lZp6dqGDM1b9HVAPSzFmQYHDqtFQ3WirXokVL6Zo9Xskf6YLIvHM TGqPDX3V4SjuU9vQrSs58tar5IPl5bSk858Wk5vs+e9S698ek2vH5H7QSIv1nyJsg7MY TR/ogO1vSOp+VkaBX8kOafDYYoIIx0N7QwaJgeNeg6XY24RiePu65umFPf9h1Xx+I+wC MeTfsV72Ih3Or4NaTnDaMCxtUbWnyUqQO3EaX9nx4cGX9JJscEzxjMTt6zlnKN3aehKS Qk3Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=pILfy0NzTJFKfmrj2Qj/tW2uNngs8mCwTGY0GG6PwYc=; b=zfeN7AaaScTSQkFdFZpHah6vetCETMwfGiCxXQ03Ou5MVkAHtc+Rq3G+6R9KS6pPJ/ jo6TwAiGknLr0hT0sKpX899D3U+KoJ4QuT4geQ9TeLsId2WKcWaBnITOu8FAHB/jdhXX tcfObEbOweKuCzaas7OBVf1Ru1OEwchiWwcahPop8r1cYvd+bCBcCHCgNnhoHlqI1TO4 sgoT6yqvhboGi7736dXbnwgK5vlVjEdXoPYGY8nVh5OqQ4nPAt2ibJhSuciekJXVgceN UJnmEpJ+6qmAkCsfwmiUdWv1ZcJ9yUWJjHW7G8HAcyCrMBxNyCZONrmVZHzTJeec0kSR rEEw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=EK53ExbE; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id t10si3810309edw.323.2021.05.18.09.02.41; Tue, 18 May 2021 09:03:04 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=EK53ExbE; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1344947AbhEQQAK (ORCPT + 99 others); Mon, 17 May 2021 12:00:10 -0400 Received: from mail.kernel.org ([198.145.29.99]:52294 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1344012AbhEQPkL (ORCPT ); Mon, 17 May 2021 11:40:11 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 35ABB61D0C; Mon, 17 May 2021 14:41:47 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linuxfoundation.org; s=korg; t=1621262507; bh=lBmajCjIQ/09p1KgMeI9jgaURLMVarkzR9C9MZGAk2o=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=EK53ExbEcqPlvw8PwBNUtTr2A9c7zhOlfxfrH7zFRmPaPZ4sPHUtF7eErZ+/9BsBy rZT9IJ7Vnp/Bt122ffjDtyfC61vM7Lf8RIk4G/65mh6HOmeYd+dyd1wrXymlOx0rPs l4WcVD9/rDC+fHMKMzkAFzEUziRPoUmVFkj33bOg= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Peter Xu , Hugh Dickins , Mike Kravetz , "Joel Fernandes (Google)" , Andrew Morton , Linus Torvalds Subject: [PATCH 5.10 203/289] mm/hugetlb: fix F_SEAL_FUTURE_WRITE Date: Mon, 17 May 2021 16:02:08 +0200 Message-Id: <20210517140311.947246302@linuxfoundation.org> X-Mailer: git-send-email 2.31.1 In-Reply-To: <20210517140305.140529752@linuxfoundation.org> References: <20210517140305.140529752@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Peter Xu commit 22247efd822e6d263f3c8bd327f3f769aea9b1d9 upstream. Patch series "mm/hugetlb: Fix issues on file sealing and fork", v2. Hugh reported issue with F_SEAL_FUTURE_WRITE not applied correctly to hugetlbfs, which I can easily verify using the memfd_test program, which seems that the program is hardly run with hugetlbfs pages (as by default shmem). Meanwhile I found another probably even more severe issue on that hugetlb fork won't wr-protect child cow pages, so child can potentially write to parent private pages. Patch 2 addresses that. After this series applied, "memfd_test hugetlbfs" should start to pass. This patch (of 2): F_SEAL_FUTURE_WRITE is missing for hugetlb starting from the first day. There is a test program for that and it fails constantly. $ ./memfd_test hugetlbfs memfd-hugetlb: CREATE memfd-hugetlb: BASIC memfd-hugetlb: SEAL-WRITE memfd-hugetlb: SEAL-FUTURE-WRITE mmap() didn't fail as expected Aborted (core dumped) I think it's probably because no one is really running the hugetlbfs test. Fix it by checking FUTURE_WRITE also in hugetlbfs_file_mmap() as what we do in shmem_mmap(). Generalize a helper for that. Link: https://lkml.kernel.org/r/20210503234356.9097-1-peterx@redhat.com Link: https://lkml.kernel.org/r/20210503234356.9097-2-peterx@redhat.com Fixes: ab3948f58ff84 ("mm/memfd: add an F_SEAL_FUTURE_WRITE seal to memfd") Signed-off-by: Peter Xu Reported-by: Hugh Dickins Reviewed-by: Mike Kravetz Cc: Joel Fernandes (Google) Cc: Signed-off-by: Andrew Morton Signed-off-by: Linus Torvalds Signed-off-by: Greg Kroah-Hartman --- fs/hugetlbfs/inode.c | 5 +++++ include/linux/mm.h | 32 ++++++++++++++++++++++++++++++++ mm/shmem.c | 22 ++++------------------ 3 files changed, 41 insertions(+), 18 deletions(-) --- a/fs/hugetlbfs/inode.c +++ b/fs/hugetlbfs/inode.c @@ -131,6 +131,7 @@ static void huge_pagevec_release(struct static int hugetlbfs_file_mmap(struct file *file, struct vm_area_struct *vma) { struct inode *inode = file_inode(file); + struct hugetlbfs_inode_info *info = HUGETLBFS_I(inode); loff_t len, vma_len; int ret; struct hstate *h = hstate_file(file); @@ -146,6 +147,10 @@ static int hugetlbfs_file_mmap(struct fi vma->vm_flags |= VM_HUGETLB | VM_DONTEXPAND; vma->vm_ops = &hugetlb_vm_ops; + ret = seal_check_future_write(info->seals, vma); + if (ret) + return ret; + /* * page based offset in vm_pgoff could be sufficiently large to * overflow a loff_t when converted to byte offset. This can --- a/include/linux/mm.h +++ b/include/linux/mm.h @@ -3178,5 +3178,37 @@ unsigned long wp_shared_mapping_range(st extern int sysctl_nr_trim_pages; +/** + * seal_check_future_write - Check for F_SEAL_FUTURE_WRITE flag and handle it + * @seals: the seals to check + * @vma: the vma to operate on + * + * Check whether F_SEAL_FUTURE_WRITE is set; if so, do proper check/handling on + * the vma flags. Return 0 if check pass, or <0 for errors. + */ +static inline int seal_check_future_write(int seals, struct vm_area_struct *vma) +{ + if (seals & F_SEAL_FUTURE_WRITE) { + /* + * New PROT_WRITE and MAP_SHARED mmaps are not allowed when + * "future write" seal active. + */ + if ((vma->vm_flags & VM_SHARED) && (vma->vm_flags & VM_WRITE)) + return -EPERM; + + /* + * Since an F_SEAL_FUTURE_WRITE sealed memfd can be mapped as + * MAP_SHARED and read-only, take care to not allow mprotect to + * revert protections on such mappings. Do this only for shared + * mappings. For private mappings, don't need to mask + * VM_MAYWRITE as we still want them to be COW-writable. + */ + if (vma->vm_flags & VM_SHARED) + vma->vm_flags &= ~(VM_MAYWRITE); + } + + return 0; +} + #endif /* __KERNEL__ */ #endif /* _LINUX_MM_H */ --- a/mm/shmem.c +++ b/mm/shmem.c @@ -2256,25 +2256,11 @@ out_nomem: static int shmem_mmap(struct file *file, struct vm_area_struct *vma) { struct shmem_inode_info *info = SHMEM_I(file_inode(file)); + int ret; - if (info->seals & F_SEAL_FUTURE_WRITE) { - /* - * New PROT_WRITE and MAP_SHARED mmaps are not allowed when - * "future write" seal active. - */ - if ((vma->vm_flags & VM_SHARED) && (vma->vm_flags & VM_WRITE)) - return -EPERM; - - /* - * Since an F_SEAL_FUTURE_WRITE sealed memfd can be mapped as - * MAP_SHARED and read-only, take care to not allow mprotect to - * revert protections on such mappings. Do this only for shared - * mappings. For private mappings, don't need to mask - * VM_MAYWRITE as we still want them to be COW-writable. - */ - if (vma->vm_flags & VM_SHARED) - vma->vm_flags &= ~(VM_MAYWRITE); - } + ret = seal_check_future_write(info->seals, vma); + if (ret) + return ret; /* arm64 - allow memory tagging on RAM-based files */ vma->vm_flags |= VM_MTE_ALLOWED;