Date: Mon, 24 Oct 2022 13:19:53 -0700
From: "Paul E. McKenney"
To: Joel Fernandes
Cc: Uladzislau Rezki, rcu@vger.kernel.org, linux-kernel@vger.kernel.org, kernel-team@fb.com, rostedt@goodmis.org
Subject: Re: [PATCH rcu 13/14] workqueue: Make queue_rcu_work() use call_rcu_flush()
Message-ID: <20221024201953.GE5600@paulmck-ThinkPad-P17-Gen-1>
Reply-To: paulmck@kernel.org
References: <20221024153958.GY5600@paulmck-ThinkPad-P17-Gen-1> <20221024164819.GA5600@paulmck-ThinkPad-P17-Gen-1> <20221024173558.GC5600@paulmck-ThinkPad-P17-Gen-1>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, Oct 24, 2022 at 04:12:59PM -0400, Joel Fernandes wrote:
> On Mon, Oct 24, 2022 at 1:36 PM Paul E. McKenney wrote:
> >
> > On Mon, Oct 24, 2022 at 01:20:26PM -0400, Joel Fernandes wrote:
> > > On Mon, Oct 24, 2022 at 1:08 PM Uladzislau Rezki wrote:
> > > >
> > > > On Mon, Oct 24, 2022 at 06:55:16PM +0200, Uladzislau Rezki wrote:
> > > > > On Mon, Oct 24, 2022 at 09:48:19AM -0700, Paul E. McKenney wrote:
> > > > > > On Mon, Oct 24, 2022 at 06:25:30PM +0200, Uladzislau Rezki wrote:
> > > > > > > >
> > > > > > > > You guys might need to agree on the definition of "good" here.  Or maybe
> > > > > > > > understand the differences in your respective platforms' definitions of
> > > > > > > > "good".  ;-)
> > > > > > > >
> > > > > > > Indeed. Bad is when once per-millisecond infinitely :) At least in such use
> > > > > > > workload I can detect a power delta and power gain. Anyway, below is a new
> > > > > > > trace where I do not use "flush" variant for the kvfree_rcu():
> > > > > > >
> > > > > > > 1. Home screen swipe:
> > > > > > > rcuop/0-15 [003] d..1 1792.767750: rcu_batch_start: rcu_preempt CBs=1003 bl=10
> > > > > > > rcuop/2-33 [002] d..1 1792.771717: rcu_batch_start: rcu_preempt CBs=934 bl=10
> > > > > > > rcuop/3-40 [001] d..1 1794.811816: rcu_batch_start: rcu_preempt CBs=1508 bl=11
> > > > > > > rcuop/1-26 [003] d..1 1797.116382: rcu_batch_start: rcu_preempt CBs=2127 bl=16
> > > > > > > rcuop/4-48 [001] d..1 1797.124422: rcu_batch_start: rcu_preempt CBs=95 bl=10
> > > > > > > rcuop/5-55 [002] d..1 1797.124731: rcu_batch_start: rcu_preempt CBs=143 bl=10
> > > > > > > rcuop/6-62 [005] d..1 1798.911719: rcu_batch_start: rcu_preempt CBs=132 bl=10
> > > > > > > rcuop/2-33 [002] d..1 1803.003966: rcu_batch_start: rcu_preempt CBs=3797 bl=29
> > > > > > > rcuop/0-15 [003] d..1 1803.004707: rcu_batch_start: rcu_preempt CBs=2969 bl=23
> > >
> > > > > > > 2. App launches:
> > > > > > > rcuop/4-48 [005] d..1 1831.087612: rcu_batch_start: rcu_preempt CBs=6141 bl=47
> > > > > > > rcuop/7-69 [007] d..1 1831.095578: rcu_batch_start: rcu_preempt CBs=5464 bl=42
> > > > > > > rcuop/5-55 [004] d..1 1832.703571: rcu_batch_start: rcu_preempt CBs=8461 bl=66
> > > > > > > rcuop/0-15 [004] d..1 1833.731603: rcu_batch_start: rcu_preempt CBs=2548 bl=19
> > > > > > > rcuop/1-26 [006] d..1 1833.743691: rcu_batch_start: rcu_preempt CBs=2567 bl=20
> > > > > > > rcuop/2-33 [006] d..1 1833.744005: rcu_batch_start: rcu_preempt CBs=2359 bl=18
> > > > > > > rcuop/3-40 [006] d..1 1833.744286: rcu_batch_start: rcu_preempt CBs=3681 bl=28
> > > > > > > rcuop/4-48 [002] d..1 1838.079777: rcu_batch_start: rcu_preempt CBs=10444 bl=81
> > > > > > > rcuop/7-69 [001] d..1 1838.080375: rcu_batch_start: rcu_preempt CBs=12572 bl=98
> > > > > > > <...>-62 [002] d..1 1838.080646: rcu_batch_start: rcu_preempt CBs=14135 bl=110
> > > > > > > rcuop/6-62 [000] d..1 1838.087722: rcu_batch_start: rcu_preempt CBs=10839 bl=84
> > > > > > > <...>-62 [003] d..1 1839.227022: rcu_batch_start: rcu_preempt CBs=1834 bl=14
> > > > > > > <...>-26 [001] d..1 1839.963315: rcu_batch_start: rcu_preempt CBs=5769 bl=45
> > > > > > > rcuop/2-33 [001] d..1 1839.966485: rcu_batch_start: rcu_preempt CBs=3789 bl=29
> > > > > > > <...>-40 [001] d..1 1839.966596: rcu_batch_start: rcu_preempt CBs=6425 bl=50
> > > > > > > rcuop/2-33 [005] d..1 1840.541272: rcu_batch_start: rcu_preempt CBs=825 bl=10
> > > > > > > rcuop/2-33 [005] d..1 1840.547724: rcu_batch_start: rcu_preempt CBs=44 bl=10
> > > > > > > rcuop/2-33 [005] d..1 1841.075759: rcu_batch_start: rcu_preempt CBs=516 bl=10
> > > > > > > rcuop/0-15 [002] d..1 1841.695716: rcu_batch_start: rcu_preempt CBs=6312 bl=49
> > > > > > > rcuop/0-15 [003] d..1 1841.709714: rcu_batch_start: rcu_preempt CBs=39 bl=10
> > > > > > > rcuop/5-55 [004] d..1 1843.112442: rcu_batch_start: rcu_preempt CBs=16007 bl=125
> > > > > > > rcuop/5-55 [004] d..1 1843.115444: rcu_batch_start: rcu_preempt CBs=7901 bl=61
> > > > > > > rcuop/6-62 [001] dn.1 1843.123983: rcu_batch_start: rcu_preempt CBs=8427 bl=65
> > > > > > > rcuop/6-62 [006] d..1 1843.412383: rcu_batch_start: rcu_preempt CBs=981 bl=10
> > > > > > > rcuop/0-15 [003] d..1 1844.659812: rcu_batch_start: rcu_preempt CBs=1851 bl=14
> > > > > > > rcuop/0-15 [003] d..1 1844.667790: rcu_batch_start: rcu_preempt CBs=135 bl=10
> > >
> > > Definitely better, but I'd still ask why not just rely on the lazy
> > > batching that we now have, since it is a memory pressure related
> > > usecase. Or another approach could be, for CONFIG_RCU_LAZY, don't
> > > disturb the lazy-RCU batching by queuing these "free memory" CBs; and
> > > instead keep your improved kvfree_rcu() batching only for
> > > !CONFIG_RCU_LAZY.
> >
> > Given that making the kvfree_rcu()-level batching conditional on
> > CONFIG_RCU_LAZY would complicate the code, what bad thing happens when
> > keeping the kvfree_rcu-level batching unconditionally?
>
> The bad thing happening is power impact. There is a noticeable impact
> in our testing, and when we dropped this particular patch, it got much
> better results.
>
> I also run rcutop and I see without the patch that I have several
> seconds of laziness at a time, unlike with the patch.

Fair point, but is this visible at the power meter?

							Thanx, Paul

> Even in the beginning when I came up with an implementation for
> call_rcu_lazy(), I had to mark queue_rcu_work() as lazy as well since
> it was quite frequent (on ChromeOS). But when we introduced the
> flush() API, I forgot to not use flush() on it. But unfortunately
> this patch slipped into my last series when Vlad and I were debugging
> the SCSI issue, and did not really help for the SCSI issue itself.
>
> Thanks,
>
>  - Joel
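
For anyone wanting to compare traces like the ones quoted in this message, the rcu_batch_start lines can be reduced to a few numbers with a short script. What follows is only a sketch and not something posted in the thread: the function names parse_batches() and summarize() are made up for illustration, the regular expression assumes the line layout shown in the quoted trace, and the three sample lines are copied from the "Home screen swipe" section. It reports how many batches ran, how many callbacks they carried, and the longest quiet gap between batches, which is one rough way to look at the "seconds of laziness" mentioned in the discussion.

```python
import re
from statistics import median

# Assumed layout, taken from the quoted trace, e.g.:
#   rcuop/0-15 [003] d..1 1792.767750: rcu_batch_start: rcu_preempt CBs=1003 bl=10
# Leading "> " quote markers are harmless because the pattern is unanchored.
TRACE_RE = re.compile(
    r"(?P<ts>\d+\.\d+): rcu_batch_start: \S+ CBs=(?P<cbs>\d+) bl=(?P<bl>\d+)"
)


def parse_batches(text):
    """Return (timestamp, callbacks, batch_limit) tuples, one per trace line."""
    return [
        (float(m["ts"]), int(m["cbs"]), int(m["bl"]))
        for m in TRACE_RE.finditer(text)
    ]


def summarize(batches):
    """Summarize batch count, callbacks per batch, and gaps between batches."""
    times = [ts for ts, _, _ in batches]
    cbs = [n for _, n, _ in batches]
    # Gap between consecutive batch starts, in seconds (trace order assumed).
    gaps = [later - earlier for earlier, later in zip(times, times[1:])]
    return {
        "batches": len(batches),
        "total_cbs": sum(cbs),
        "median_cbs": median(cbs),
        "max_gap_s": max(gaps) if gaps else 0.0,
    }


# First three lines of the "Home screen swipe" trace from the thread.
sample = """\
rcuop/0-15 [003] d..1 1792.767750: rcu_batch_start: rcu_preempt CBs=1003 bl=10
rcuop/2-33 [002] d..1 1792.771717: rcu_batch_start: rcu_preempt CBs=934 bl=10
rcuop/3-40 [001] d..1 1794.811816: rcu_batch_start: rcu_preempt CBs=1508 bl=11
"""

summary = summarize(parse_batches(sample))
print(summary)
```

Feeding it the full swipe and app-launch sections separately, with and without the patch under discussion, would give comparable per-scenario figures. Those figures describe batching behavior only; they do not replace a power-meter measurement.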