Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp1850512iob; Thu, 19 May 2022 16:43:52 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzHhjKkeZyWNPM05eaf2Y72k7/o0pK6eLaArk+60MdeUWRxciqnGKXS97R3/iS4BLoiR2di X-Received: by 2002:a17:902:7296:b0:14b:4bc6:e81 with SMTP id d22-20020a170902729600b0014b4bc60e81mr6995862pll.132.1653003832578; Thu, 19 May 2022 16:43:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1653003832; cv=none; d=google.com; s=arc-20160816; b=LpE+BGSD/KRlGVZX5PhjnjVfBCJc4qHXzNuUPPZ4X523r4ICAcWSgr4JwiuACN0Q3G KlpTAsIwvW2TIlh7viO1QcfN+A637yx//5ZBPFFg7iSXXZPzd17YVkmFv+ABBC8JwOAp 0gEUbVMomdsFsX6w6K/5yuV39sO38YnVmdLm0TEvR4j73fuB8assW3vp45lhFhrPKl5r 4k0xpdbfk7wY0+6K1LKVqH3xNnbkCTUZmToT26AnBmCLuQG1a0Q9CmkIgbvAP513BdxL bIhfDenhic+/0p9v64KfiSAOsAbMbQAIKayjRo8ESWYmuL02Uufvn/7Ehoo/4pKuZ3Ur fbtA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=Q/iVT6eHGUL7xt25EZCD9Iruv1QN8td0bn1+mXm/2MM=; b=zYeWT20qDI5n326JkWFCqLxnQ/8mMsebDUWTx+zinEGcwlQq+U+NTICize+2kS21R9 SrtySAxEyhdxeDhc2/yA2NIpWO5CfPa8YLcH2cAGKw5E5EOwIugTm6v+Bl4CfsUq14xC VCjXscf1dqBnmxOdPx3vpIiOx+YI7yRY/KYnI1Htx6vtqVYFseNoWwG1SbciGR32pC7P srRs/bIAXDVXnNjw/7MpDxV6Izo6Qis4dtrr+cc9qMceHY378HBF0leIj9hfdia2Jnyj SGnWTDJ6JRfytg6/plTRjo55Ga9/YsvgmC9YJ7iOA0PQXxxCB+GMDmKrklUwSRLh5AWJ TSOw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcdkim header.b=auTvNIBT; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id h18-20020a170902f71200b001590829860dsi8045276plo.400.2022.05.19.16.43.36; Thu, 19 May 2022 16:43:52 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcdkim header.b=auTvNIBT; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S244985AbiESVFM (ORCPT + 99 others); Thu, 19 May 2022 17:05:12 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50992 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230385AbiESVFJ (ORCPT ); Thu, 19 May 2022 17:05:09 -0400 Received: from alexa-out-sd-01.qualcomm.com (alexa-out-sd-01.qualcomm.com [199.106.114.38]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id BC7EBED784 for ; Thu, 19 May 2022 14:05:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=quicinc.com; i=@quicinc.com; q=dns/txt; s=qcdkim; t=1652994308; x=1684530308; h=date:from:to:cc:subject:message-id:references: mime-version:in-reply-to; bh=Q/iVT6eHGUL7xt25EZCD9Iruv1QN8td0bn1+mXm/2MM=; b=auTvNIBTblJGAcXySRkBRkOmWubNHqjK7L4lGbL/u3GM5XNsA8DwnFsD XrHyNeolV2TjvFbJP1ayPGdzBKRu9S83k+nUTc5lPpmDqVTpXPS4Ij7YM PNlpPsD6H4uZiVwkVYEZbleI1bixAgEgFEoIA4R64cu/UdOo9+xqXvdL2 o=; Received: from unknown (HELO ironmsg04-sd.qualcomm.com) ([10.53.140.144]) by alexa-out-sd-01.qualcomm.com with ESMTP; 19 May 2022 14:05:08 -0700 X-QCInternal: smtphost Received: from nasanex01c.na.qualcomm.com ([10.47.97.222]) by ironmsg04-sd.qualcomm.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 19 May 2022 14:05:08 -0700 Received: from nalasex01a.na.qualcomm.com (10.47.209.196) by nasanex01c.na.qualcomm.com (10.47.97.222) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.986.22; Thu, 19 May 2022 14:05:07 -0700 Received: from qian (10.80.80.8) by nalasex01a.na.qualcomm.com (10.47.209.196) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.986.22; Thu, 19 May 2022 14:05:06 -0700 Date: Thu, 19 May 2022 17:05:04 -0400 From: Qian Cai To: "Paul E. McKenney" CC: Mel Gorman , Andrew Morton , Nicolas Saenz Julienne , Marcelo Tosatti , Vlastimil Babka , Michal Hocko , LKML , Linux-MM , , Subject: Re: [PATCH 0/6] Drain remote per-cpu directly v3 Message-ID: References: <20220512085043.5234-1-mgorman@techsingularity.net> <20220517233507.GA423@qian> <20220518125152.GQ3441@techsingularity.net> <20220518171503.GQ1790663@paulmck-ThinkPad-P17-Gen-1> <20220519191524.GC1790663@paulmck-ThinkPad-P17-Gen-1> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Disposition: inline In-Reply-To: <20220519191524.GC1790663@paulmck-ThinkPad-P17-Gen-1> X-Originating-IP: [10.80.80.8] X-ClientProxiedBy: nasanex01b.na.qualcomm.com (10.46.141.250) To nalasex01a.na.qualcomm.com (10.47.209.196) X-Spam-Status: No, score=-4.4 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, May 19, 2022 at 12:15:24PM -0700, Paul E. McKenney wrote: > Is the task doing offline_pages()->synchronize_rcu() doing this > repeatedly? Or is there a stalled RCU grace period? (From what > I can see, offline_pages() is not doing huge numbers of calls to > synchronize_rcu() in any of its loops, but I freely admit that I do not > know this code.) Yes, we are running into an endless loop in isolate_single_pageblock(). There was a similar issue happened not long ago, so I am wondering if we did not solve it entirely then. Anyway, I will continue the thread over there. https://lore.kernel.org/all/YoavU%2F+NfQIzQiDF@qian/ > Or is it possible that reverting those three patches simply decreases > the probability of failure, rather than eliminating the failure? > Such a decrease could be due to many things, for example, changes to > offsets and sizes of data structures. Entirely possible. Sorry for the false alarm. > Do you ever see RCU CPU stall warnings? No.