Received: by 2002:a05:7412:d1aa:b0:fc:a2b0:25d7 with SMTP id ba42csp888816rdb; Tue, 30 Jan 2024 01:21:36 -0800 (PST) X-Google-Smtp-Source: AGHT+IGhEWjS0CeN0OQQZiEEoAWyEXs4R4/NH9H+X05BZTWg5rQ+uynirXZkKMtQDTIv8os4I3IG X-Received: by 2002:a17:902:ec87:b0:1d8:fcac:efe5 with SMTP id x7-20020a170902ec8700b001d8fcacefe5mr2288691plg.17.1706606496664; Tue, 30 Jan 2024 01:21:36 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1706606496; cv=pass; d=google.com; s=arc-20160816; b=BIiv1LlQ2H0PYSavdFgVKNnrFvDmFpjimQP8sIf2B3tKRk80XVDJSZQRz0A9FoTCIG jGF6LTV3Hejn4IE87pMMBec5btYmIFTRU0ER9VaKgpTp3kscZjJuePHZhM7YPmdPgI9I Q1PqcS4Lr9gozcXBRO13sANbH4hDcgbYsaZ3SZFhev++L2qULZ3aNL7248Bfx2C8yus9 oRm/dZwNvYP8/s2qyMOFR8SLIzCHaCqDCh2T8Z1XqE9hTFW6jnwjQpw+l2X5ghny+llu DblKahArhjRXt+/KGoH2Cl54A30wrP5vgmODQ4eYJzd4wYwTYENjgXfkp0VdEXVtrASJ /hSg== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:date:message-id; bh=Ks2+JNJg1ArNLj0KToMmFpPYxWEEQ0PiogT9em1TKh0=; fh=VVjuivqjwQTGD3KqdtfMHM2u5ZCac4G8eN8eVGHfbQw=; b=uKb2ZkB0hgyhC5ilOIYOW2S0S1ojCcuuFRDme5+OQeGekRiXHYmWnHH/je8XelzMJO 2x7ubuxqV5brfqgohnegoLzCZTahi54Mr4bcLo9FNDrNFWAAlrVclBQ7I5y8qZ49/sFM dgJJ2yz6JtsbpaaAyOxNMNI0dvyxk+efyXLQFY4qagg7tc/CCR3oFlcTX/udOR68AHc0 Th624T+K8UG646x2Pu0+Cq+GN+UDxpwsKuG3EzVPI1TnS0ZL6Wdyb3ffZfjWFsNgefc7 2ctfnINBfymDjvFaa5Cx208xhQPpbK825Uvz+Bgw8KapwQXpXE82opGaBcSPSVmaGNf1 p5sA== ARC-Authentication-Results: i=2; mx.google.com; arc=pass (i=1 spf=pass spfdomain=arm.com dmarc=pass fromdomain=arm.com); spf=pass (google.com: domain of linux-kernel+bounces-44170-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:40f1:3f00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-44170-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from sy.mirrors.kernel.org (sy.mirrors.kernel.org. [2604:1380:40f1:3f00::1]) by mx.google.com with ESMTPS id ju20-20020a170903429400b001d8ead4fbedsi2597921plb.505.2024.01.30.01.21.36 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 30 Jan 2024 01:21:36 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-44170-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:40f1:3f00::1 as permitted sender) client-ip=2604:1380:40f1:3f00::1; Authentication-Results: mx.google.com; arc=pass (i=1 spf=pass spfdomain=arm.com dmarc=pass fromdomain=arm.com); spf=pass (google.com: domain of linux-kernel+bounces-44170-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:40f1:3f00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-44170-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sy.mirrors.kernel.org (Postfix) with ESMTPS id E9E19B2D415 for ; Tue, 30 Jan 2024 08:46:43 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 49CD557874; Tue, 30 Jan 2024 08:45:52 +0000 (UTC) Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 57BF557861; Tue, 30 Jan 2024 08:45:48 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=217.140.110.172 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1706604351; cv=none; b=NvDhH/+bLjIB+1PSiiikbLNmiZ/B9fb/1EHI2t8CzavBecy5LH4rzDHzagWmvzLVLTL/jEeyF7HNbQAQvMQ0fXGneqQlGVJRk+kKpzKlVKADOnJQz3zPSFn4FXwxDF2D5ggHjda76Ip0fum+izUxzPmQiONPEBvSizDCUapWHLM= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1706604351; c=relaxed/simple; bh=YPivXGUwsgxM6K1OyqG9VqP6YDZRo0TgVHoytzvbpTk=; h=Message-ID:Date:MIME-Version:Subject:To:Cc:References:From: In-Reply-To:Content-Type; b=LBkSOvvy0GeFekgq5cJaCG9J1F1DTM8foyah8TljxyjFB8LFTp9IzAKORRZlwDpDFRFB+7ViSK6iuQmRLUikuWkPih5nz5mqPhbSaRPp1mm5rcp10c3eS6g5cpMD6+M33Ex0MHI2lTFKsrzQpvXaGtuehgPgpOo/MjhaWkr+tok= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=arm.com; spf=pass smtp.mailfrom=arm.com; arc=none smtp.client-ip=217.140.110.172 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=arm.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=arm.com Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 600E3DA7; Tue, 30 Jan 2024 00:46:31 -0800 (PST) Received: from [10.57.79.54] (unknown [10.57.79.54]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 8B93E3F738; Tue, 30 Jan 2024 00:45:43 -0800 (PST) Message-ID: Date: Tue, 30 Jan 2024 08:45:41 +0000 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v1 3/9] mm/memory: further separate anon and pagecache folio handling in zap_present_pte() Content-Language: en-GB To: David Hildenbrand , linux-kernel@vger.kernel.org Cc: linux-mm@kvack.org, Andrew Morton , Matthew Wilcox , Catalin Marinas , Will Deacon , "Aneesh Kumar K.V" , Nick Piggin , Peter Zijlstra , Michael Ellerman , Christophe Leroy , "Naveen N. Rao" , Heiko Carstens , Vasily Gorbik , Alexander Gordeev , Christian Borntraeger , Sven Schnelle , Arnd Bergmann , linux-arch@vger.kernel.org, linuxppc-dev@lists.ozlabs.org, linux-s390@vger.kernel.org References: <20240129143221.263763-1-david@redhat.com> <20240129143221.263763-4-david@redhat.com> <40cfb242-ceb0-44c6-afe7-c1744825dc62@arm.com> From: Ryan Roberts In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit On 30/01/2024 08:37, David Hildenbrand wrote: > On 30.01.24 09:31, Ryan Roberts wrote: >> On 29/01/2024 14:32, David Hildenbrand wrote: >>> We don't need up-to-date accessed-dirty information for anon folios and can >>> simply work with the ptent we already have. Also, we know the RSS counter >>> we want to update. >>> >>> We can safely move arch_check_zapped_pte() + tlb_remove_tlb_entry() + >>> zap_install_uffd_wp_if_needed() after updating the folio and RSS. >>> >>> While at it, only call zap_install_uffd_wp_if_needed() if there is even >>> any chance that pte_install_uffd_wp_if_needed() would do *something*. >>> That is, just don't bother if uffd-wp does not apply. >>> >>> Signed-off-by: David Hildenbrand >>> --- >>>   mm/memory.c | 16 +++++++++++----- >>>   1 file changed, 11 insertions(+), 5 deletions(-) >>> >>> diff --git a/mm/memory.c b/mm/memory.c >>> index 69502cdc0a7d..20bc13ab8db2 100644 >>> --- a/mm/memory.c >>> +++ b/mm/memory.c >>> @@ -1552,12 +1552,9 @@ static inline void zap_present_pte(struct mmu_gather >>> *tlb, >>>       folio = page_folio(page); >>>       if (unlikely(!should_zap_folio(details, folio))) >>>           return; >>> -    ptent = ptep_get_and_clear_full(mm, addr, pte, tlb->fullmm); >>> -    arch_check_zapped_pte(vma, ptent); >>> -    tlb_remove_tlb_entry(tlb, pte, addr); >>> -    zap_install_uffd_wp_if_needed(vma, addr, pte, details, ptent); >>>         if (!folio_test_anon(folio)) { >>> +        ptent = ptep_get_and_clear_full(mm, addr, pte, tlb->fullmm); >>>           if (pte_dirty(ptent)) { >>>               folio_mark_dirty(folio); >>>               if (tlb_delay_rmap(tlb)) { >>> @@ -1567,8 +1564,17 @@ static inline void zap_present_pte(struct mmu_gather >>> *tlb, >>>           } >>>           if (pte_young(ptent) && likely(vma_has_recency(vma))) >>>               folio_mark_accessed(folio); >>> +        rss[mm_counter(folio)]--; >>> +    } else { >>> +        /* We don't need up-to-date accessed/dirty bits. */ >>> +        ptep_get_and_clear_full(mm, addr, pte, tlb->fullmm); >>> +        rss[MM_ANONPAGES]--; >>>       } >>> -    rss[mm_counter(folio)]--; >>> +    arch_check_zapped_pte(vma, ptent); >> >> Isn't the x86 (only) implementation of this relying on the dirty bit? So doesn't >> that imply you still need get_and_clear for anon? (And in hindsight I think that >> logic would apply to the previous patch too?) > > x86 uses the encoding !writable && dirty to indicate special shadow stacks. That > is, the hw dirty bit is set by software (to create that combination), not by > hardware. > > So you don't have to sync against any hw changes of the hw dirty bit. What you > had in the original PTE you read is sufficient. > Right, got it. In that case: Reviewed-by: Ryan Roberts