Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp347359pxj; Tue, 18 May 2021 04:55:13 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwjAXuH8hcorUsQjpc8VHiUXeBaaCh62RienPTHLTdeULJRPfM8fwEOZ86262LwD+lutTRV X-Received: by 2002:a05:6402:42cd:: with SMTP id i13mr6551225edc.34.1621338912739; Tue, 18 May 2021 04:55:12 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1621338912; cv=none; d=google.com; s=arc-20160816; b=vIp/V4Uqzk8Vz/SiYJEo+xNx3yP5DfsqBsmUoWoMFmRFjrYIQL9bY+hydw00bkANYQ xCzkj1gdXHvjhCnbyKdrfrMEyiOCXw2PVUMRvKYa65u/PimzJpllZzk840n9mnNIVk2D dk1+W+rYfIJHI1Fuq8esNzZ1tfPDbBfc4j68cRApoxn4/RThfYGaHNr8cIsyjp4ncL6Z YZhmBnhM4GJ9qQJ4kZpKzZdNotGXFA0gEPsjdjcuV5+w1QJDIEKiWimPl67DL9TdVfLI DApip+MSuAVg9qdvHUr+PT4+SwiU5DwfIdoLIOJ72QTcW3R4Kk12gxws6LU8WdHfo5dH iQWA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=sOhFp20Vptq52Ng1AUqsaYzdaS+vDNikJEAVmvzgDdo=; b=quzhF8PX1Bf0OdKzrSnJf5F0XoUM6fWKbP6fpQ5wG/CwoEU9tkg+gKp/xCTmV5Zvw3 EpspPOs5swwUXAaTu0b2zjavwMaBTG+P3ZkpOLmREKfLj9H6fcSJYJyiL+emG+ep5TIk tuH2Htd8QP//VHAd6fv3vYxuNmHoKgz7PjjELQkLFnFDQAivnhFPEroBRNG4nnhHOqS8 axj8aPcNNEyeXj9d5D6nmli5iBXT00P/L936aEw5MEeogM+v8HXkxUVHWUr/Nnm0W1QI g5Me20aevRWOMTd2StO2iSO9g4tcVKdlIZfURUC5CLGXOu64y1YZRlMXIMwIIqWvE40G QuVg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=uHCqxoS2; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id b8si18887444edd.8.2021.05.18.04.54.49; Tue, 18 May 2021 04:55:12 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=uHCqxoS2; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237496AbhEQPZY (ORCPT + 99 others); Mon, 17 May 2021 11:25:24 -0400 Received: from mail.kernel.org ([198.145.29.99]:41010 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S242494AbhEQPLf (ORCPT ); Mon, 17 May 2021 11:11:35 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 0F3AF61876; Mon, 17 May 2021 14:31:05 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linuxfoundation.org; s=korg; t=1621261866; bh=NUTveeO8K0bmPe5fcBuUXsxOCvDXFMHlPCWl3/R4p3o=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=uHCqxoS2XCS0gYGf7Q+IxBAIhbRCeZDhVVQNnf/DkpR+Bx3iNKmbBRa96QYCpKTgY 8wXoSYlTcstgprmjAbHGDRvEwaPBvZAih2eCM5tVv7/VL3PnMV7bBe4hyiwyB8+On1 37K8UMj9MENWVefSrvtB7QLVl7kjH9wfawKn9crQ= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Peter Xu , Hugh Dickins , Mike Kravetz , "Joel Fernandes (Google)" , Andrew Morton , Linus Torvalds Subject: [PATCH 5.4 102/141] mm/hugetlb: fix F_SEAL_FUTURE_WRITE Date: Mon, 17 May 2021 16:02:34 +0200 Message-Id: <20210517140246.214457854@linuxfoundation.org> X-Mailer: git-send-email 2.31.1 In-Reply-To: <20210517140242.729269392@linuxfoundation.org> References: <20210517140242.729269392@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Peter Xu commit 22247efd822e6d263f3c8bd327f3f769aea9b1d9 upstream. Patch series "mm/hugetlb: Fix issues on file sealing and fork", v2. Hugh reported issue with F_SEAL_FUTURE_WRITE not applied correctly to hugetlbfs, which I can easily verify using the memfd_test program, which seems that the program is hardly run with hugetlbfs pages (as by default shmem). Meanwhile I found another probably even more severe issue on that hugetlb fork won't wr-protect child cow pages, so child can potentially write to parent private pages. Patch 2 addresses that. After this series applied, "memfd_test hugetlbfs" should start to pass. This patch (of 2): F_SEAL_FUTURE_WRITE is missing for hugetlb starting from the first day. There is a test program for that and it fails constantly. $ ./memfd_test hugetlbfs memfd-hugetlb: CREATE memfd-hugetlb: BASIC memfd-hugetlb: SEAL-WRITE memfd-hugetlb: SEAL-FUTURE-WRITE mmap() didn't fail as expected Aborted (core dumped) I think it's probably because no one is really running the hugetlbfs test. Fix it by checking FUTURE_WRITE also in hugetlbfs_file_mmap() as what we do in shmem_mmap(). Generalize a helper for that. Link: https://lkml.kernel.org/r/20210503234356.9097-1-peterx@redhat.com Link: https://lkml.kernel.org/r/20210503234356.9097-2-peterx@redhat.com Fixes: ab3948f58ff84 ("mm/memfd: add an F_SEAL_FUTURE_WRITE seal to memfd") Signed-off-by: Peter Xu Reported-by: Hugh Dickins Reviewed-by: Mike Kravetz Cc: Joel Fernandes (Google) Cc: Signed-off-by: Andrew Morton Signed-off-by: Linus Torvalds Signed-off-by: Greg Kroah-Hartman --- fs/hugetlbfs/inode.c | 5 +++++ include/linux/mm.h | 32 ++++++++++++++++++++++++++++++++ mm/shmem.c | 22 ++++------------------ 3 files changed, 41 insertions(+), 18 deletions(-) --- a/fs/hugetlbfs/inode.c +++ b/fs/hugetlbfs/inode.c @@ -135,6 +135,7 @@ static void huge_pagevec_release(struct static int hugetlbfs_file_mmap(struct file *file, struct vm_area_struct *vma) { struct inode *inode = file_inode(file); + struct hugetlbfs_inode_info *info = HUGETLBFS_I(inode); loff_t len, vma_len; int ret; struct hstate *h = hstate_file(file); @@ -150,6 +151,10 @@ static int hugetlbfs_file_mmap(struct fi vma->vm_flags |= VM_HUGETLB | VM_DONTEXPAND; vma->vm_ops = &hugetlb_vm_ops; + ret = seal_check_future_write(info->seals, vma); + if (ret) + return ret; + /* * page based offset in vm_pgoff could be sufficiently large to * overflow a loff_t when converted to byte offset. This can --- a/include/linux/mm.h +++ b/include/linux/mm.h @@ -2925,5 +2925,37 @@ static inline int pages_identical(struct return !memcmp_pages(page1, page2); } +/** + * seal_check_future_write - Check for F_SEAL_FUTURE_WRITE flag and handle it + * @seals: the seals to check + * @vma: the vma to operate on + * + * Check whether F_SEAL_FUTURE_WRITE is set; if so, do proper check/handling on + * the vma flags. Return 0 if check pass, or <0 for errors. + */ +static inline int seal_check_future_write(int seals, struct vm_area_struct *vma) +{ + if (seals & F_SEAL_FUTURE_WRITE) { + /* + * New PROT_WRITE and MAP_SHARED mmaps are not allowed when + * "future write" seal active. + */ + if ((vma->vm_flags & VM_SHARED) && (vma->vm_flags & VM_WRITE)) + return -EPERM; + + /* + * Since an F_SEAL_FUTURE_WRITE sealed memfd can be mapped as + * MAP_SHARED and read-only, take care to not allow mprotect to + * revert protections on such mappings. Do this only for shared + * mappings. For private mappings, don't need to mask + * VM_MAYWRITE as we still want them to be COW-writable. + */ + if (vma->vm_flags & VM_SHARED) + vma->vm_flags &= ~(VM_MAYWRITE); + } + + return 0; +} + #endif /* __KERNEL__ */ #endif /* _LINUX_MM_H */ --- a/mm/shmem.c +++ b/mm/shmem.c @@ -2208,25 +2208,11 @@ out_nomem: static int shmem_mmap(struct file *file, struct vm_area_struct *vma) { struct shmem_inode_info *info = SHMEM_I(file_inode(file)); + int ret; - if (info->seals & F_SEAL_FUTURE_WRITE) { - /* - * New PROT_WRITE and MAP_SHARED mmaps are not allowed when - * "future write" seal active. - */ - if ((vma->vm_flags & VM_SHARED) && (vma->vm_flags & VM_WRITE)) - return -EPERM; - - /* - * Since an F_SEAL_FUTURE_WRITE sealed memfd can be mapped as - * MAP_SHARED and read-only, take care to not allow mprotect to - * revert protections on such mappings. Do this only for shared - * mappings. For private mappings, don't need to mask - * VM_MAYWRITE as we still want them to be COW-writable. - */ - if (vma->vm_flags & VM_SHARED) - vma->vm_flags &= ~(VM_MAYWRITE); - } + ret = seal_check_future_write(info->seals, vma); + if (ret) + return ret; file_accessed(file); vma->vm_ops = &shmem_vm_ops;