Received: by 2002:a05:6500:1b41:b0:1fb:d597:ff75 with SMTP id cz1csp69957lqb; Tue, 4 Jun 2024 05:32:39 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCVfMdFj5p75PlpOMQdvCQaJfCaZM2n+O9PryRs26FnK0M1PCUXragK+pA9GZNv8RfVurmqDQmp5cxbjfepqQpEXoUqAPPEHvpbV+CMTJg== X-Google-Smtp-Source: AGHT+IEg3XdMr34W2Ca8ybYGYWB5Btg8Whv5xVkYGaP9YbNRc77CrAnmzIJByO1YRi0H9MKI7szq X-Received: by 2002:a05:6359:5fa9:b0:19c:5c1c:11b3 with SMTP id e5c5f4694b2df-19c5c1c133amr30383355d.22.1717504359466; Tue, 04 Jun 2024 05:32:39 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1717504359; cv=pass; d=google.com; s=arc-20160816; b=Ev6ByPMkxcAMGmUoU0HG0pYDhtHrwBCldPfKipoM/lAPZ1qXdn6HE12S4vRwpnUe8j Br35EqEKEQltAF6IIFL01a7ego5HcW9rlszPh7eqYvcf7IdpKuIU2lubywmeUoVwS87E Kh0PTj3JEp1PkiW0qZ3kc5gfcl8MpN6gLxtgexupFDuqqnvEbteJ2+IvbAYbY+dinS0u leBZTTqCo7PJm4vjCxSU90ggeHJNR+GMtrL6QouqEiRlgA7+sfqAlnANEPk/vkXAIX4W NfDukj5OqkgAY2cx7IaETLZ6XvD8kx9rIy5g33TQl8ILndfq7BqTzuKbu7dt6UC8bdPc h47w== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:organization:autocrypt :content-language:from:references:cc:to:subject:user-agent :mime-version:list-unsubscribe:list-subscribe:list-id:precedence :date:message-id:dkim-signature; bh=pxBbH0lhXHJMihWPKVtSiFVLyKo1HrHgO+/fn5W2KeM=; fh=Si+IIt6BqGcS5FtdWIS3jENCTMTBBMUXHLy1zdr/ADw=; b=D3y10UbW6gVH90fIoVr9jDgF6831mSl8O//Q3UjM8SKSwv4Z+bUrcgWT/aFEob2K/Y CDyRovY5E0cAi14SfAusYIxNC1DJycLZBniKvfO9XA9t4WPCt0AEXHGFfjKumLyHP82G dwLhe+LYcZmjq/EwBnxcz7Fi0ZhePzAjoq11sakycnvKDeswsh+c1qbsjcDd7/OIgBnJ eGJbg1kj6bcKhk7sduuOzxVX8RKWmYLCsR0zLy5+UHG1V6i0Q0Pa18cQQAjSutHW4ZTs ClcsZgcJ6Jko54yiRf3A2otl+kwOtsXTJ1bOiDUIhvnnSMdJCUnpleaF9H9rOe6YWvSE CL7g==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=N0yvPNgL; arc=pass (i=1 spf=pass spfdomain=redhat.com dkim=pass dkdomain=redhat.com dmarc=pass fromdomain=redhat.com); spf=pass (google.com: domain of linux-kernel+bounces-200601-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) smtp.mailfrom="linux-kernel+bounces-200601-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from sv.mirrors.kernel.org (sv.mirrors.kernel.org. [139.178.88.99]) by mx.google.com with ESMTPS id 41be03b00d2f7-6c35a0e6c78si8082207a12.373.2024.06.04.05.32.39 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 04 Jun 2024 05:32:39 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-200601-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) client-ip=139.178.88.99; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=N0yvPNgL; arc=pass (i=1 spf=pass spfdomain=redhat.com dkim=pass dkdomain=redhat.com dmarc=pass fromdomain=redhat.com); spf=pass (google.com: domain of linux-kernel+bounces-200601-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) smtp.mailfrom="linux-kernel+bounces-200601-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sv.mirrors.kernel.org (Postfix) with ESMTPS id 964CD282982 for ; Tue, 4 Jun 2024 12:32:35 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 0DF3C14659C; Tue, 4 Jun 2024 12:31:09 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b="N0yvPNgL" Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 7F1A2266A7 for ; Tue, 4 Jun 2024 12:31:06 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=170.10.133.124 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1717504268; cv=none; b=EtQI2NTR0yQIvM65YRheYv6IAcg8ttMN3BxhM39LD5DO1iwkUw9T3GaeS6Q7JmrfNPuBjfaqFW/t19/LT6WGiuyFUPwNVb+RRiKMrhKCA7ZJEPJfpyOZFwSu4Pccs+bOJjfg98myklpeIW2Sfm+p6BLStg9EU1C70HZA/PbE5us= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1717504268; c=relaxed/simple; bh=4VNm3M5GSBRbOrpZpaRQHRNTSeoJF3TPn3jbOyJWLoo=; h=Message-ID:Date:MIME-Version:Subject:To:Cc:References:From: In-Reply-To:Content-Type; b=TmUJ5xZvhJxxu3tuIiUe0zcH2dJOwzmnaTTBxcyPRZrP7KWcZ9vUiaHW3Zm3pKhwApDxDhJvV6zuJAqt+Sr2kEp2pHsFI38tsuMU00ZHKisQG3JUNym2A/MkaZdShRJ1d1yTR0ONjIT2glcE4dFzB/YZycWysPukqsecvsMlKQA= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=redhat.com; spf=pass smtp.mailfrom=redhat.com; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b=N0yvPNgL; arc=none smtp.client-ip=170.10.133.124 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=redhat.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=redhat.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1717504265; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:autocrypt:autocrypt; bh=pxBbH0lhXHJMihWPKVtSiFVLyKo1HrHgO+/fn5W2KeM=; b=N0yvPNgLZtF2/YzSNwCp0WEGtAkMbagpATjr3urQjSvNcltFEA0Qe/TrVYTWh/22Mtp8SA 2OS/Xzgp4MFQEmgkVJG9gwge82LBJpi1//OGfbCozU/bCUmo9TUkvxC7w+xg7zsrPjIBEq mIekHZWBJlVZUyjHiJJyUToLXiUPHFk= Received: from mail-lf1-f69.google.com (mail-lf1-f69.google.com [209.85.167.69]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-120-bT-2w5qePyyIh1uSFRdu1Q-1; Tue, 04 Jun 2024 08:31:03 -0400 X-MC-Unique: bT-2w5qePyyIh1uSFRdu1Q-1 Received: by mail-lf1-f69.google.com with SMTP id 2adb3069b0e04-52b7e000358so3236405e87.2 for ; Tue, 04 Jun 2024 05:31:02 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1717504261; x=1718109061; h=content-transfer-encoding:in-reply-to:organization:autocrypt :content-language:from:references:cc:to:subject:user-agent :mime-version:date:message-id:x-gm-message-state:from:to:cc:subject :date:message-id:reply-to; bh=pxBbH0lhXHJMihWPKVtSiFVLyKo1HrHgO+/fn5W2KeM=; b=g4Opy51Jxx/CP+iz5oFioNduw2Q0OQMnLA6QjMtBhWERBUPwH9cVDtg1BZTZq51g06 0vYVgD75zNolUZoR4WGKu5hq5LISFA3tqjnMKFrgQBlr0qlyaNHr7R1ekO0w5PbpDirl UC5XgNZE/ZVrSwle9iTCIgoH1euMChkhteyc91VhQym/6uag1dR0ZRfKRTihM5pr1cNr 6J/YmysQIM0CxiihhaF+++jLhbtTO7TZ29ve2zhfncoscHd9itmd6sw18TVZm0CbKqsA Sy2XgAEQo+ClFWukI/84weG+7GxeUqyNtUyNHSxM6LysQymqB9iychadZVWdirAC/TY/ iauw== X-Forwarded-Encrypted: i=1; AJvYcCVND5RH8TWkGx3+esemtcNtGTDB+cdSFao9UZL/uofhF5E5lNLD5uUa6/hXvNMdFc0jGGcX6fyFInffgGjd3VzbZrRc2HjynSb8jJD5 X-Gm-Message-State: AOJu0YzkDJzGHZPQNGn65la7zQdw1HfC+L7L7mvZ1ynrHlo89NiI5v9B FtrO+uIhREHjgQhFxGBY0oPbFlb0SrEsfZCwpgGZgzOkQtPpYZXdx+BckHzgF6Vtz2VnX67JjzV s1bN6s4xI8yEmmbVUqThiH7luxLaK1VuE4wdXGSLwcSxw6G5F5Q5h89ZpWnjghV8dad0T+Q== X-Received: by 2002:a05:6512:3b81:b0:52b:88b1:4808 with SMTP id 2adb3069b0e04-52b8949e166mr11394382e87.0.1717504261606; Tue, 04 Jun 2024 05:31:01 -0700 (PDT) X-Received: by 2002:a05:6512:3b81:b0:52b:88b1:4808 with SMTP id 2adb3069b0e04-52b8949e166mr11394348e87.0.1717504261085; Tue, 04 Jun 2024 05:31:01 -0700 (PDT) Received: from ?IPV6:2003:cb:c73a:3a00:a025:9b06:549e:c16b? (p200300cbc73a3a00a0259b06549ec16b.dip0.t-ipconnect.de. [2003:cb:c73a:3a00:a025:9b06:549e:c16b]) by smtp.gmail.com with ESMTPSA id ffacd0b85a97d-35dd04caf50sm11446760f8f.41.2024.06.04.05.31.00 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Tue, 04 Jun 2024 05:31:00 -0700 (PDT) Message-ID: <6b1485b6-c2a1-45b8-8afe-7b211689070b@redhat.com> Date: Tue, 4 Jun 2024 14:30:59 +0200 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v2 1/2] mm: clear pte for folios that are zero filled To: Usama Arif , akpm@linux-foundation.org Cc: hannes@cmpxchg.org, willy@infradead.org, yosryahmed@google.com, nphamcs@gmail.com, chengming.zhou@linux.dev, linux-mm@kvack.org, linux-kernel@vger.kernel.org, kernel-team@meta.com References: <20240604105950.1134192-1-usamaarif642@gmail.com> <20240604105950.1134192-2-usamaarif642@gmail.com> From: David Hildenbrand Content-Language: en-US Autocrypt: addr=david@redhat.com; keydata= xsFNBFXLn5EBEAC+zYvAFJxCBY9Tr1xZgcESmxVNI/0ffzE/ZQOiHJl6mGkmA1R7/uUpiCjJ dBrn+lhhOYjjNefFQou6478faXE6o2AhmebqT4KiQoUQFV4R7y1KMEKoSyy8hQaK1umALTdL QZLQMzNE74ap+GDK0wnacPQFpcG1AE9RMq3aeErY5tujekBS32jfC/7AnH7I0v1v1TbbK3Gp XNeiN4QroO+5qaSr0ID2sz5jtBLRb15RMre27E1ImpaIv2Jw8NJgW0k/D1RyKCwaTsgRdwuK Kx/Y91XuSBdz0uOyU/S8kM1+ag0wvsGlpBVxRR/xw/E8M7TEwuCZQArqqTCmkG6HGcXFT0V9 PXFNNgV5jXMQRwU0O/ztJIQqsE5LsUomE//bLwzj9IVsaQpKDqW6TAPjcdBDPLHvriq7kGjt WhVhdl0qEYB8lkBEU7V2Yb+SYhmhpDrti9Fq1EsmhiHSkxJcGREoMK/63r9WLZYI3+4W2rAc UucZa4OT27U5ZISjNg3Ev0rxU5UH2/pT4wJCfxwocmqaRr6UYmrtZmND89X0KigoFD/XSeVv jwBRNjPAubK9/k5NoRrYqztM9W6sJqrH8+UWZ1Idd/DdmogJh0gNC0+N42Za9yBRURfIdKSb B3JfpUqcWwE7vUaYrHG1nw54pLUoPG6sAA7Mehl3nd4pZUALHwARAQABzSREYXZpZCBIaWxk ZW5icmFuZCA8ZGF2aWRAcmVkaGF0LmNvbT7CwZgEEwEIAEICGwMGCwkIBwMCBhUIAgkKCwQW AgMBAh4BAheAAhkBFiEEG9nKrXNcTDpGDfzKTd4Q9wD/g1oFAl8Ox4kFCRKpKXgACgkQTd4Q 9wD/g1oHcA//a6Tj7SBNjFNM1iNhWUo1lxAja0lpSodSnB2g4FCZ4R61SBR4l/psBL73xktp rDHrx4aSpwkRP6Epu6mLvhlfjmkRG4OynJ5HG1gfv7RJJfnUdUM1z5kdS8JBrOhMJS2c/gPf wv1TGRq2XdMPnfY2o0CxRqpcLkx4vBODvJGl2mQyJF/gPepdDfcT8/PY9BJ7FL6Hrq1gnAo4 3Iv9qV0JiT2wmZciNyYQhmA1V6dyTRiQ4YAc31zOo2IM+xisPzeSHgw3ONY/XhYvfZ9r7W1l pNQdc2G+o4Di9NPFHQQhDw3YTRR1opJaTlRDzxYxzU6ZnUUBghxt9cwUWTpfCktkMZiPSDGd KgQBjnweV2jw9UOTxjb4LXqDjmSNkjDdQUOU69jGMUXgihvo4zhYcMX8F5gWdRtMR7DzW/YE BgVcyxNkMIXoY1aYj6npHYiNQesQlqjU6azjbH70/SXKM5tNRplgW8TNprMDuntdvV9wNkFs 9TyM02V5aWxFfI42+aivc4KEw69SE9KXwC7FSf5wXzuTot97N9Phj/Z3+jx443jo2NR34XgF 89cct7wJMjOF7bBefo0fPPZQuIma0Zym71cP61OP/i11ahNye6HGKfxGCOcs5wW9kRQEk8P9 M/k2wt3mt/fCQnuP/mWutNPt95w9wSsUyATLmtNrwccz63XOwU0EVcufkQEQAOfX3n0g0fZz Bgm/S2zF/kxQKCEKP8ID+Vz8sy2GpDvveBq4H2Y34XWsT1zLJdvqPI4af4ZSMxuerWjXbVWb T6d4odQIG0fKx4F8NccDqbgHeZRNajXeeJ3R7gAzvWvQNLz4piHrO/B4tf8svmRBL0ZB5P5A 2uhdwLU3NZuK22zpNn4is87BPWF8HhY0L5fafgDMOqnf4guJVJPYNPhUFzXUbPqOKOkL8ojk CXxkOFHAbjstSK5Ca3fKquY3rdX3DNo+EL7FvAiw1mUtS+5GeYE+RMnDCsVFm/C7kY8c2d0G NWkB9pJM5+mnIoFNxy7YBcldYATVeOHoY4LyaUWNnAvFYWp08dHWfZo9WCiJMuTfgtH9tc75 7QanMVdPt6fDK8UUXIBLQ2TWr/sQKE9xtFuEmoQGlE1l6bGaDnnMLcYu+Asp3kDT0w4zYGsx 5r6XQVRH4+5N6eHZiaeYtFOujp5n+pjBaQK7wUUjDilPQ5QMzIuCL4YjVoylWiBNknvQWBXS lQCWmavOT9sttGQXdPCC5ynI+1ymZC1ORZKANLnRAb0NH/UCzcsstw2TAkFnMEbo9Zu9w7Kv AxBQXWeXhJI9XQssfrf4Gusdqx8nPEpfOqCtbbwJMATbHyqLt7/oz/5deGuwxgb65pWIzufa N7eop7uh+6bezi+rugUI+w6DABEBAAHCwXwEGAEIACYCGwwWIQQb2cqtc1xMOkYN/MpN3hD3 AP+DWgUCXw7HsgUJEqkpoQAKCRBN3hD3AP+DWrrpD/4qS3dyVRxDcDHIlmguXjC1Q5tZTwNB boaBTPHSy/Nksu0eY7x6HfQJ3xajVH32Ms6t1trDQmPx2iP5+7iDsb7OKAb5eOS8h+BEBDeq 3ecsQDv0fFJOA9ag5O3LLNk+3x3q7e0uo06XMaY7UHS341ozXUUI7wC7iKfoUTv03iO9El5f XpNMx/YrIMduZ2+nd9Di7o5+KIwlb2mAB9sTNHdMrXesX8eBL6T9b+MZJk+mZuPxKNVfEQMQ a5SxUEADIPQTPNvBewdeI80yeOCrN+Zzwy/Mrx9EPeu59Y5vSJOx/z6OUImD/GhX7Xvkt3kq Er5KTrJz3++B6SH9pum9PuoE/k+nntJkNMmQpR4MCBaV/J9gIOPGodDKnjdng+mXliF3Ptu6 3oxc2RCyGzTlxyMwuc2U5Q7KtUNTdDe8T0uE+9b8BLMVQDDfJjqY0VVqSUwImzTDLX9S4g/8 kC4HRcclk8hpyhY2jKGluZO0awwTIMgVEzmTyBphDg/Gx7dZU1Xf8HFuE+UZ5UDHDTnwgv7E th6RC9+WrhDNspZ9fJjKWRbveQgUFCpe1sa77LAw+XFrKmBHXp9ZVIe90RMe2tRL06BGiRZr jPrnvUsUUsjRoRNJjKKA/REq+sAnhkNPPZ/NNMjaZ5b8Tovi8C0tmxiCHaQYqj7G2rgnT0kt WNyWQQ== Organization: Red Hat In-Reply-To: <20240604105950.1134192-2-usamaarif642@gmail.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit On 04.06.24 12:58, Usama Arif wrote: > Approximately 10-20% of pages to be swapped out are zero pages [1]. > Rather than reading/writing these pages to flash resulting > in increased I/O and flash wear, the pte can be cleared for those > addresses at unmap time while shrinking folio list. When this > causes a page fault, do_pte_missing will take care of this page. > With this patch, NVMe writes in Meta server fleet decreased > by almost 10% with conventional swap setup (zswap disabled). > > [1] https://lore.kernel.org/all/20171018104832epcms5p1b2232e2236258de3d03d1344dde9fce0@epcms5p1/ > > Signed-off-by: Usama Arif > --- > include/linux/rmap.h | 1 + > mm/rmap.c | 163 ++++++++++++++++++++++--------------------- > mm/vmscan.c | 89 ++++++++++++++++------- > 3 files changed, 150 insertions(+), 103 deletions(-) > > diff --git a/include/linux/rmap.h b/include/linux/rmap.h > index bb53e5920b88..b36db1e886e4 100644 > --- a/include/linux/rmap.h > +++ b/include/linux/rmap.h > @@ -100,6 +100,7 @@ enum ttu_flags { > * do a final flush if necessary */ > TTU_RMAP_LOCKED = 0x80, /* do not grab rmap lock: > * caller holds it */ > + TTU_ZERO_FOLIO = 0x100,/* zero folio */ > }; > > #ifdef CONFIG_MMU > diff --git a/mm/rmap.c b/mm/rmap.c > index 52357d79917c..d98f70876327 100644 > --- a/mm/rmap.c > +++ b/mm/rmap.c > @@ -1819,96 +1819,101 @@ static bool try_to_unmap_one(struct folio *folio, struct vm_area_struct *vma, > */ > dec_mm_counter(mm, mm_counter(folio)); > } else if (folio_test_anon(folio)) { > - swp_entry_t entry = page_swap_entry(subpage); > - pte_t swp_pte; > - /* > - * Store the swap location in the pte. > - * See handle_pte_fault() ... > - */ > - if (unlikely(folio_test_swapbacked(folio) != > - folio_test_swapcache(folio))) { > + if (flags & TTU_ZERO_FOLIO) { > + pte_clear(mm, address, pvmw.pte); > + dec_mm_counter(mm, MM_ANONPAGES); Is there an easy way to reduce the code churn and highlight the added code? Like } else if (folio_test_anon(folio) && (flags & TTU_ZERO_FOLIO)) { } else if (folio_test_anon(folio)) { Also to concerns that I want to spell out: (a) what stops the page from getting modified in the meantime? The CPU can write it until the TLB was flushed. (b) do you properly handle if the page is pinned (or just got pinned) and we must not discard it? -- Cheers, David / dhildenb