Received: by 2002:a6b:fb09:0:0:0:0:0 with SMTP id h9csp794901iog; Wed, 15 Jun 2022 12:29:27 -0700 (PDT) X-Google-Smtp-Source: AGRyM1uw96Sae9IZWEv1GQ7BLYe4T7q3iIHTTDxSXW56fyFmt2woFEYy+7Y3IJs0eqe3z7x8PYNt X-Received: by 2002:a17:90b:1809:b0:1e8:7495:3c6d with SMTP id lw9-20020a17090b180900b001e874953c6dmr12000801pjb.193.1655321367135; Wed, 15 Jun 2022 12:29:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1655321367; cv=none; d=google.com; s=arc-20160816; b=IxlZwHbbirKSCtdSmGEGR00esVP5d4UxToaQIRjQDDUKp7aYlScVmOnqtcB+kmrJf8 8VP0jidL8gS8ubQ88Z3Ha8Qu++hvC048qhEsCHqHXwLUyjO0mFx3H9qsE3d+8zXfSN5v /IDdG/KI8/nNsXXrTuOheTf3DqzSeFYtnSDx3ZCVvnOPDPEP6xoMEa20KkJDzXUvD0ma Iun8arculco12+ax5ur5tAzcmV/W/44G0qrBwExy2xNgWuNuayiHw6bs1nEoG1dS+UKR At+l7Ubj29KB+2wXukJNtgrNgwfG6Jm8tXYW08YzV2C4HtjwqbtkC4Mrgr+36vXcIE0j aRDQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=kpn2Q2CsireSptzmUqdySxnWjnGpd3Q3NSEkQU1AXEY=; b=Y3RSqcz1dmnKYfLejpL0/5mibV5nhSGZ7N9ckeeX8+sJ7YvNcPWFlT6sHHUGH8SyjC xYIUvIfU0rg2i+S2vm3cG046N/Phfv5nsdSviUXAV5Tj9fnhjkAOBE1NzTWUfkwcJG7+ ee6p7KuTIj9mIuSbxo/pLhyDkYUtqh5EZCVO5KBcGnfx2YMRrCi4JbElRUGTbwoFnWrc 4+3H/1t9saqidIIZ5jw9SOEEKPc0gsZt4uFM1H4lhPVSIDITXAJTOL7YfhfqVeLIn85M tpoWfuCfd8pFvcKwL5/9h6RaqpPjDPLfwBtFkm3KghSV4dEpRi1Q0QhYoJTo9r0Y54GW 3o1Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=I4NZ826O; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id x3-20020a17090a6b4300b001e294f66638si3238972pjl.0.2022.06.15.12.29.15; Wed, 15 Jun 2022 12:29:27 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=I4NZ826O; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1358040AbiFORvd (ORCPT + 99 others); Wed, 15 Jun 2022 13:51:33 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35554 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S242299AbiFORvb (ORCPT ); Wed, 15 Jun 2022 13:51:31 -0400 Received: from mail-pl1-x62e.google.com (mail-pl1-x62e.google.com [IPv6:2607:f8b0:4864:20::62e]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 073DC544C1 for ; Wed, 15 Jun 2022 10:51:29 -0700 (PDT) Received: by mail-pl1-x62e.google.com with SMTP id m14so635047plg.5 for ; Wed, 15 Jun 2022 10:51:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=kpn2Q2CsireSptzmUqdySxnWjnGpd3Q3NSEkQU1AXEY=; b=I4NZ826O2XxVNyXyxHS8+3OKpnYH2JJkLmwygLtw+BpcvgRcGWVOAtNSgP57L8I75l v+N7dKJcZFX0jjwzdnu5SRk6q1mAJlegmNuAR3QCtZAeCFRSLxfDP/g9T/aEH7QCaCg1 pDy/Jnel+l4y6YXdRuoply+AI7cSElKHUpFTbUdpbNCghczWG7WTLPOLVrWOEXDWLM6+ uXQJ1iJj+Gsf0As3WrV0UcEBjvQjWmyjEvid8c+tb40fUpdpsRlqhdeBEhyDuGSXHZ+3 raV/Vl2R0x+XqGXVwDnucEbgR6m/OeK3oTZsXZ4FqQBPCiAl5ARzP+wHhEh+tMl0uYoU HujA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=kpn2Q2CsireSptzmUqdySxnWjnGpd3Q3NSEkQU1AXEY=; b=d4rNmwRJWebq5HfZEB6JzYxub5ZvNilFna7ma7NijCx4pXfcnGPf4DVLxaDrbeGDG/ lGpbUhLEOQgG3hPWNWwYotmolzIA4qMOBuLIiP3/Tto/Sk/tF1t/Vm8gMEBpvparBqtO WCvspk/f38/iVVeGOGet4FyP06vgNQ+pOKyUVGgGRh3rNRyEqlK0HLQ0xy5Ya0/2Ft9B 93+2QsrQuKDiWECLeeQRqa1VQVaI90Tk1MlIs9UJydvBcyZXnpg9qyyNJFS0bFsLOC0A /X17XrGe9TXF6fKxoVy09ZDVCJf1/faQaiBBmilMxZXM9qLxML8/zXqVHBsuXHYEKfIN b9vw== X-Gm-Message-State: AJIora8cGrTxwbAyQyIzvYlSb4hvKER03CJ7Z7cFcBeQl1RkiwhVG4ZJ ymX51wKP8BIqDBgizeX9OWXmxxjm2gJtGcP6CaQ= X-Received: by 2002:a17:903:32c4:b0:167:6e6f:204b with SMTP id i4-20020a17090332c400b001676e6f204bmr682333plr.117.1655315489159; Wed, 15 Jun 2022 10:51:29 -0700 (PDT) MIME-Version: 1.0 References: <20220611084731.55155-1-linmiaohe@huawei.com> <20220611084731.55155-3-linmiaohe@huawei.com> In-Reply-To: From: Yang Shi Date: Wed, 15 Jun 2022 10:51:17 -0700 Message-ID: Subject: Re: [PATCH 2/7] mm/khugepaged: stop swapping in page when VM_FAULT_RETRY occurs To: "Zach O'Keefe" Cc: Miaohe Lin , Andrew Morton , Andrea Arcangeli , Matthew Wilcox , Vlastimil Babka , David Howells , NeilBrown , Alistair Popple , David Hildenbrand , Suren Baghdasaryan , Peter Xu , Linux MM , Linux Kernel Mailing List Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_ENVFROM_END_DIGIT, FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jun 15, 2022 at 8:14 AM Zach O'Keefe wrote: > > On 11 Jun 16:47, Miaohe Lin wrote: > > When do_swap_page returns VM_FAULT_RETRY, we do not retry here and thus > > swap entry will remain in pagetable. This will result in later failure. > > So stop swapping in pages in this case to save cpu cycles. > > > > Signed-off-by: Miaohe Lin > > --- > > mm/khugepaged.c | 19 ++++++++----------- > > 1 file changed, 8 insertions(+), 11 deletions(-) > > > > diff --git a/mm/khugepaged.c b/mm/khugepaged.c > > index 73570dfffcec..a8adb2d1e9c6 100644 > > --- a/mm/khugepaged.c > > +++ b/mm/khugepaged.c > > @@ -1003,19 +1003,16 @@ static bool __collapse_huge_page_swapin(struct mm_struct *mm, > > swapped_in++; > > ret = do_swap_page(&vmf); > > > > - /* do_swap_page returns VM_FAULT_RETRY with released mmap_lock */ > > + /* > > + * do_swap_page returns VM_FAULT_RETRY with released mmap_lock. > > + * Note we treat VM_FAULT_RETRY as VM_FAULT_ERROR here because > > + * we do not retry here and swap entry will remain in pagetable > > + * resulting in later failure. > > + */ > > if (ret & VM_FAULT_RETRY) { > > mmap_read_lock(mm); > > - if (hugepage_vma_revalidate(mm, haddr, &vma)) { > > - /* vma is no longer available, don't continue to swapin */ > > - trace_mm_collapse_huge_page_swapin(mm, swapped_in, referenced, 0); > > - return false; > > - } > > - /* check if the pmd is still valid */ > > - if (mm_find_pmd(mm, haddr) != pmd) { > > - trace_mm_collapse_huge_page_swapin(mm, swapped_in, referenced, 0); > > - return false; > > - } > > + trace_mm_collapse_huge_page_swapin(mm, swapped_in, referenced, 0); > > + return false; > > } > > if (ret & VM_FAULT_ERROR) { > > trace_mm_collapse_huge_page_swapin(mm, swapped_in, referenced, 0); > > -- > > 2.23.0 > > > > > > I've convinced myself this is correct, but don't understand how we got here. > AFAICT, we've always continued to fault in pages, and, as you mention, don't > retry ones that have failed with VM_FAULT_RETRY - so > __collapse_huge_page_isolate() should fail. I don't think (?) there is any > benefit to continuing to swap if we don't handle VM_FAULT_RETRY appropriately. > > So, I think this change looks good from that perspective. I suppose the only > other question would be: should we handle the VM_FAULT_RETRY case? Maybe 1 > additional attempt then fail? AFAIK, this mostly (?) happens when the page is > locked. Maybe it's not worth the extra complexity though.. It should be unnecessary for khugepaged IMHO since it will scan all the valid mm periodically, so it will come back eventually. >