Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp5111624iob; Mon, 9 May 2022 08:50:34 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxVy3KowwjIlZ9Q2PeSIrBgvcCADsWYp5Hst6IN5p2MK03D/XXC0NnE1bRQMNR1bp92H1vb X-Received: by 2002:aca:4254:0:b0:322:953b:872 with SMTP id p81-20020aca4254000000b00322953b0872mr7536203oia.32.1652111434419; Mon, 09 May 2022 08:50:34 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1652111434; cv=none; d=google.com; s=arc-20160816; b=LJqVvUPZ8ZYolfu59weckpqBLFX6iOTJVLQepEA/8zMZVLZVcOPehaTLTM6EJoFZmE DL8iO81iAFk1AAQx79FUFsb5V7VQ1OrH1O1W582ry8Yc5UxS1QKzkgVp78YkOF83El8E XjHSIl3ixQnpUrjyjWVjLebG93zb4RfM4EHMZ4tJpovBeyY1VAdH1Nc8loLX7Iumfz5K LcpTpleI7QuiWGv259vffDQHNK0YDkKDBYleIFyqBxB5Y159IvKJ5EOQQ1r5OtYqOf1i p7VNtdjM018fSq7OO2UYz5mfFG113c+ZO8uH/LHY6eQSDdPMxQwKyw/XVR3BAtpgwwdn Im6g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature:dkim-signature; bh=G8T6pl1iznxs13Ey9cez14qQxbbfQ6jd4EKhTAHfONQ=; b=pOihbQJqaf81j2bF8KioLrzRXzNGPvsNgVAJczXRZtK8Fw+jlIiWQhsDvsQu8hoIJx 9K7jQTtdQbKRdMiEOajrvFCTw8XIEgTk7pXZ+unAbkO7yOnjMX4qD7Ipa5NWJI8grVhu 8P6WkpnZ5qhwoeZYQarpg10aHxWNGaiBxH1iAgPPXdMRwG9J8Jg68CiUPPKizwngMNuc GtAOorkGEI0r5xcuR/ChwOxiKwMmhcskHfCv7GflgiH1/VCiJCMEstVakYd9S7pL2vOo qjyM0kILsqa01eHrVNQSPFQCSAcDy5SDEO8uOCcH6/KQqeoPoSIDWGVsUKRvWOKRlv4S fo4Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.cz header.s=susede2_rsa header.b=OIX4wvit; dkim=neutral (no key) header.i=@suse.cz header.s=susede2_ed25519; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id z14-20020a056870e30e00b000eda37e0e95si12032761oad.267.2022.05.09.08.50.33 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 09 May 2022 08:50:34 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.cz header.s=susede2_rsa header.b=OIX4wvit; dkim=neutral (no key) header.i=@suse.cz header.s=susede2_ed25519; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 3835A1B7773; Mon, 9 May 2022 08:43:39 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238559AbiEIPr1 (ORCPT + 99 others); Mon, 9 May 2022 11:47:27 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60280 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238415AbiEIPrX (ORCPT ); Mon, 9 May 2022 11:47:23 -0400 Received: from smtp-out1.suse.de (smtp-out1.suse.de [195.135.220.28]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 963241B5F97; Mon, 9 May 2022 08:43:29 -0700 (PDT) Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by smtp-out1.suse.de (Postfix) with ESMTPS id 5125D21C54; Mon, 9 May 2022 15:43:28 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.cz; s=susede2_rsa; t=1652111008; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=G8T6pl1iznxs13Ey9cez14qQxbbfQ6jd4EKhTAHfONQ=; b=OIX4wvitWGHcNBQ8JmpcU3rPsZU1gR6+LHcNOwZdDXt/ctTFk3ovA/alCFpxrJgZHkU0jO j8H8ca2us26w1OtLpELOr66VCMVrsGfD9mSVAmr9PH2eHNxXsyMiymy4TEEYmN0veUZ4rO jmPneqJd3q3wmPSpiG3aBLWFqPbxo1E= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.cz; s=susede2_ed25519; t=1652111008; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=G8T6pl1iznxs13Ey9cez14qQxbbfQ6jd4EKhTAHfONQ=; b=jK3ZXCl1D7XaTIz1+9uo9WVUNqIOJAFF6WB0oqleV2JaHMus92O7168s829hFh2XwbYV9M 8GbAzUyq/BtpAbBw== Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by imap2.suse-dmz.suse.de (Postfix) with ESMTPS id 245CD13AA5; Mon, 9 May 2022 15:43:28 +0000 (UTC) Received: from dovecot-director2.suse.de ([192.168.254.65]) by imap2.suse-dmz.suse.de with ESMTPSA id b11GCKA2eWJIIgAAMHmgww (envelope-from ); Mon, 09 May 2022 15:43:28 +0000 Message-ID: <0d8f368a-54bb-b4e7-931f-9bf3ae24af4c@suse.cz> Date: Mon, 9 May 2022 17:43:27 +0200 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.8.1 Subject: Re: [v3 PATCH 8/8] mm: mmap: register suitable readonly file vmas for khugepaged Content-Language: en-US To: Yang Shi , kirill.shutemov@linux.intel.com, linmiaohe@huawei.com, songliubraving@fb.com, riel@surriel.com, willy@infradead.org, ziy@nvidia.com, tytso@mit.edu, akpm@linux-foundation.org Cc: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org References: <20220404200250.321455-1-shy828301@gmail.com> <20220404200250.321455-9-shy828301@gmail.com> From: Vlastimil Babka In-Reply-To: <20220404200250.321455-9-shy828301@gmail.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-3.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,NICE_REPLY_A,RDNS_NONE,SPF_HELO_NONE, T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 4/4/22 22:02, Yang Shi wrote: > The readonly FS THP relies on khugepaged to collapse THP for suitable > vmas. But it is kind of "random luck" for khugepaged to see the > readonly FS vmas (https://lore.kernel.org/linux-mm/00f195d4-d039-3cf2-d3a1-a2c88de397a0@suse.cz/) > since currently the vmas are registered to khugepaged when: > - Anon huge pmd page fault > - VMA merge > - MADV_HUGEPAGE > - Shmem mmap > > If the above conditions are not met, even though khugepaged is enabled > it won't see readonly FS vmas at all. MADV_HUGEPAGE could be specified > explicitly to tell khugepaged to collapse this area, but when khugepaged > mode is "always" it should scan suitable vmas as long as VM_NOHUGEPAGE > is not set. > > So make sure readonly FS vmas are registered to khugepaged to make the > behavior more consistent. > > Registering suitable vmas in common mmap path, that could cover both > readonly FS vmas and shmem vmas, so removed the khugepaged calls in > shmem.c. > > Still need to keep the khugepaged call in vma_merge() since vma_merge() > is called in a lot of places, for example, madvise, mprotect, etc. > > Reported-by: Vlastimil Babka > Signed-off-by: Yang Shi Acked-by: Vlastimil Babka > --- > mm/mmap.c | 6 ++++++ > mm/shmem.c | 4 ---- > 2 files changed, 6 insertions(+), 4 deletions(-) > > diff --git a/mm/mmap.c b/mm/mmap.c > index 604c8dece5dd..616ebbc2d052 100644 > --- a/mm/mmap.c > +++ b/mm/mmap.c > @@ -1842,6 +1842,12 @@ unsigned long mmap_region(struct file *file, unsigned long addr, > } > > vma_link(mm, vma, prev, rb_link, rb_parent); > + > + /* > + * vma_merge() calls khugepaged_enter_vma() either, the below > + * call covers the non-merge case. > + */ > + khugepaged_enter_vma(vma, vma->vm_flags); > /* Once vma denies write, undo our temporary denial count */ > unmap_writable: > if (file && vm_flags & VM_SHARED) > diff --git a/mm/shmem.c b/mm/shmem.c > index 92eca974771d..0c448080d210 100644 > --- a/mm/shmem.c > +++ b/mm/shmem.c > @@ -34,7 +34,6 @@ > #include > #include > #include > -#include > #include > #include > #include > @@ -2239,7 +2238,6 @@ static int shmem_mmap(struct file *file, struct vm_area_struct *vma) > > file_accessed(file); > vma->vm_ops = &shmem_vm_ops; > - khugepaged_enter_vma(vma, vma->vm_flags); > return 0; > } > > @@ -4132,8 +4130,6 @@ int shmem_zero_setup(struct vm_area_struct *vma) > vma->vm_file = file; > vma->vm_ops = &shmem_vm_ops; > > - khugepaged_enter_vma(vma, vma->vm_flags); > - > return 0; > } >