Received: by 2002:a05:6358:d09b:b0:dc:cd0c:909e with SMTP id jc27csp4879987rwb; Sun, 13 Nov 2022 16:14:06 -0800 (PST) X-Google-Smtp-Source: AA0mqf7n00g2cyjaaWAeZ4+/HvDzDzGRph+pVAhaA4jUULdtOz/soFKxvWcUdatSOykX3nKKfSYH X-Received: by 2002:a05:6402:4447:b0:461:9f41:2960 with SMTP id o7-20020a056402444700b004619f412960mr9197404edb.242.1668384846698; Sun, 13 Nov 2022 16:14:06 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1668384846; cv=none; d=google.com; s=arc-20160816; b=qRpK5YUMOcdbvxKPHTWEIyydqR99jqwmbN+g5x1NHtXgbc3UxP9y4Lr5BVBhkr68S6 RCwKKITTzOoE+88N48x+vNzdW/kXqcCzGlTaCXkNgwASFGqYnE1ocHH371kvn8ylTlSS NJXs+wZO+znnVwkwcE5+pUCa4T+djXyzM5F0biIgx844deU8d4bp1wBes6sPEGufTWDP FFn6AMExXT/zhTDldMDpUVWIGZbJRII1C/IW3Ku7UYpMVXhC8aN2/PkH+wGm1kOSd3Fx AYrimgRAQfquG2YzbgJR5mZwP3DDEEq8Ali2CC2gFavBc/nveRaMhNiSwrj6ewOiECOI zLBw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=E/uqXcWNCa1p19zapYQ5e7dpPK5NM6V2qbPw5ZdKILA=; b=x3DBGYB5GDdqKuhB9V7GuGvGoNgi37UNJJ0KuczZxEmPkP47aGnnPL6kvUW/yVT8rJ mwLqR2U/q2VF+EM8ndMMMcoULVhUoxRhzaxmmhFZl7ydG+Y1iKmqMiY90OOpzOehYnrf NAHZ9521LaIs/MQTMqOBd8XgwLzvu8EZK7PudKJJ26p8YUulFBbOrK+qnyUi8iA/MEYT tTKWPXVmZGtvGTezfW311kj0er5I2tS72jD2WnvZWOltc5+kEWSr9mKp8riEc+4WkIss WBO8dWaYvr/kipEn4bsDavUszlhC/BpetsVJVxWa0p5j0T2FES5bas5V4+Tk0gKAMV6E Manw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=VCcfQZab; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id r9-20020a05640251c900b00459c2ac54e2si6815682edd.255.2022.11.13.16.13.44; Sun, 13 Nov 2022 16:14:06 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=VCcfQZab; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235565AbiKNAFw (ORCPT + 90 others); Sun, 13 Nov 2022 19:05:52 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54180 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229692AbiKNAFs (ORCPT ); Sun, 13 Nov 2022 19:05:48 -0500 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2B302D2C3 for ; Sun, 13 Nov 2022 16:04:54 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1668384293; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=E/uqXcWNCa1p19zapYQ5e7dpPK5NM6V2qbPw5ZdKILA=; b=VCcfQZabfHrzrM/jtpQ5dyInD7YpWIN1QynGU1eie023bo7vmFn2x1ms+47pfqv2NaSQYQ xo+I77aK4B2+JxE2VFGBOPoM4BZA7PEwISd8wo+NCdyEBpzITv7NJrjX3Yp2wXd8N5oINT sR3lYVJsBkzLsCo4H2nFOau5dKCywD0= Received: from mail-qk1-f198.google.com (mail-qk1-f198.google.com [209.85.222.198]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_128_GCM_SHA256) id us-mta-507-7Zf8Fmx3PMCYGNb_SmPxHg-1; Sun, 13 Nov 2022 19:04:52 -0500 X-MC-Unique: 7Zf8Fmx3PMCYGNb_SmPxHg-1 Received: by mail-qk1-f198.google.com with SMTP id h13-20020a05620a244d00b006fb713618b8so1685253qkn.0 for ; Sun, 13 Nov 2022 16:04:52 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=E/uqXcWNCa1p19zapYQ5e7dpPK5NM6V2qbPw5ZdKILA=; b=QsJRbHdfUX1UMCLDU1+gB30eMrk0y5vsWk1u2ifaqWiiKyDIQG/KtKi3cdPFlq/7cE gt+6mWvi4Xpxtzu/2jOQLGwGH+hZSCkCp2cJBPDBlAIg6/z2DzmPYTC4b5RYI/dAFc0m E4yZ0YFYK9expH/6eIasLEXlCDpKiUiPkIw42xfMNGLilluHMOMP7zxPoKhnt25RYPYt lh8mm4cvc2ppKaMM2szLxIavCivzvPaKaF51gP5WhyX+EcMLnlZW/AJZVvopkW1Sfo82 14I2kCD3jgURL6R1U/UpXE62nz3Q7y29gQQTppilBDD5n6w2OkLSOcUNebcxvO5SiHtt 13nA== X-Gm-Message-State: ANoB5pmiZg9xVX77oEvSssayiZhd/IrXwA0qbSIm/G0t1SRnjU0ulxWl EntfUdTc5yfLSmLSYlKGB37Nxuhay9bfUf5IPLJ0ZvcOVMbj64YBzD/3KTG8Za/ddEx1huN6KzV JOCTDJVjmTPYntCDvnABQT78HitBlCYBoQUrdWNnoFt0wiupGNSeUcQG0e0EPFUQgcofoW2y0wA == X-Received: by 2002:a05:622a:1c0f:b0:3a5:47c8:3889 with SMTP id bq15-20020a05622a1c0f00b003a547c83889mr10331823qtb.66.1668384291141; Sun, 13 Nov 2022 16:04:51 -0800 (PST) X-Received: by 2002:a05:622a:1c0f:b0:3a5:47c8:3889 with SMTP id bq15-20020a05622a1c0f00b003a547c83889mr10331790qtb.66.1668384290766; Sun, 13 Nov 2022 16:04:50 -0800 (PST) Received: from x1n.redhat.com (bras-base-aurron9127w-grc-46-70-31-27-79.dsl.bell.ca. [70.31.27.79]) by smtp.gmail.com with ESMTPSA id cb5-20020a05622a1f8500b0039cc0fbdb61sm4870380qtb.53.2022.11.13.16.04.49 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 13 Nov 2022 16:04:50 -0800 (PST) From: Peter Xu To: linux-kernel@vger.kernel.org, linux-mm@kvack.org Cc: Mike Rapoport , Nadav Amit , Andrew Morton , peterx@redhat.com, Andrea Arcangeli , Ives van Hoorne , Axel Rasmussen , Alistair Popple , stable@vger.kernel.org Subject: [PATCH v3 1/2] mm/migrate: Fix read-only page got writable when recover pte Date: Sun, 13 Nov 2022 19:04:46 -0500 Message-Id: <20221114000447.1681003-2-peterx@redhat.com> X-Mailer: git-send-email 2.37.3 In-Reply-To: <20221114000447.1681003-1-peterx@redhat.com> References: <20221114000447.1681003-1-peterx@redhat.com> MIME-Version: 1.0 Content-type: text/plain Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2,SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Ives van Hoorne from codesandbox.io reported an issue regarding possible data loss of uffd-wp when applied to memfds on heavily loaded systems. The symptom is some read page got data mismatch from the snapshot child VMs. Here I can also reproduce with a Rust reproducer that was provided by Ives that keeps taking snapshot of a 256MB VM, on a 32G system when I initiate 80 instances I can trigger the issues in ten minutes. It turns out that we got some pages write-through even if uffd-wp is applied to the pte. The problem is, when removing migration entries, we didn't really worry about write bit as long as we know it's not a write migration entry. That may not be true, for some memory types (e.g. writable shmem) mk_pte can return a pte with write bit set, then to recover the migration entry to its original state we need to explicit wr-protect the pte or it'll has the write bit set if it's a read migration entry. For uffd it can cause write-through. The relevant code on uffd was introduced in the anon support, which is commit f45ec5ff16a7 ("userfaultfd: wp: support swap and page migration", 2020-04-07). However anon shouldn't suffer from this problem because anon should already have the write bit cleared always, so that may not be a proper Fixes target, while I'm adding the Fixes to be uffd shmem support. Cc: Andrea Arcangeli Cc: stable@vger.kernel.org Fixes: b1f9e876862d ("mm/uffd: enable write protection for shmem & hugetlbfs") Reported-by: Ives van Hoorne Reviewed-by: Alistair Popple Tested-by: Ives van Hoorne Signed-off-by: Peter Xu --- mm/migrate.c | 8 +++++++- 1 file changed, 7 insertions(+), 1 deletion(-) diff --git a/mm/migrate.c b/mm/migrate.c index dff333593a8a..8b6351c08c78 100644 --- a/mm/migrate.c +++ b/mm/migrate.c @@ -213,8 +213,14 @@ static bool remove_migration_pte(struct folio *folio, pte = pte_mkdirty(pte); if (is_writable_migration_entry(entry)) pte = maybe_mkwrite(pte, vma); - else if (pte_swp_uffd_wp(*pvmw.pte)) + else + /* NOTE: mk_pte can have write bit set */ + pte = pte_wrprotect(pte); + + if (pte_swp_uffd_wp(*pvmw.pte)) { + WARN_ON_ONCE(pte_write(pte)); pte = pte_mkuffd_wp(pte); + } if (folio_test_anon(folio) && !is_readable_migration_entry(entry)) rmap_flags |= RMAP_EXCLUSIVE; -- 2.37.3