Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp3595926ybl; Mon, 12 Aug 2019 03:13:28 -0700 (PDT) X-Google-Smtp-Source: APXvYqxIwdWYr2wn8W37kqBFhzIqEJrmuAPyPeg/GAeiWeQjAweVINocHmcy8IWh1DbUXru+pKBq X-Received: by 2002:a17:902:fe14:: with SMTP id g20mr30355372plj.54.1565604808332; Mon, 12 Aug 2019 03:13:28 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1565604808; cv=none; d=google.com; s=arc-20160816; b=SFGot/OLbC/YiV4aCt9hfEnL1I09cpxJwLlmNM5A3npG0NoZHZ9n2M7ZgC3TzA0IYd CybcneClqp1oA2fUYQFpicu3k8+kC5MarKII8WYh90em1XTFdE4+7E+zkb5f8ffWz6cx XmQLUEPwdDXluxEkXXcmhaxMMytSPPf4QQ8pCOYohfuomMjwMQnk9gXHe16l1iuDCvGd Jmgwce7RXxNyygFw9W+doGyxRgNvMKf//AUqM36ChblhelHyMrdHzBoX9XcBAoI3Qj0j Q8KJ0kXZIf8Hbs1k1zkZ+RhU4QKBNs7Xar075k/q411NE3lfPBr1ftiqWqwrIXdU7Sao 6BrA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=PAp8dW7biuhLgJ4F9NZncGTC2f5zGKFt/w2Q0zNqVUs=; b=FZaiNt86tvKEH/j3/IPRwCJUvdvktt9E/Yfqx4NPI3jL+F2EAxF+SOxNvnr45fxChL Qy5Ok6lYRWxiykGuPEFoGbfiLZo0qyvcABzCm2k+k7mg6AX6lqqlC0K8gLQhrSZHX0wT OZ+fJhaHsj4ZyGToZ/me360aYJNxUxU71TH48ZGOS1fR9NvmKFeIfQkxRWCRLuAAE6kc AXv7fIXTqA3m5Xj34j7ExCGMKC+PeRilReDbYgf+nphvESbQ4vKL8C5Armoac6sNRaGm pCRxSfULulcJVtUprBeFgKnktuvtJmzFdv3WXRO/VmGSA2IEnEK14W1JwOdKJJIUSEU/ LOPg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e8si13364005plt.288.2019.08.12.03.13.13; Mon, 12 Aug 2019 03:13:28 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727878AbfHLKM1 (ORCPT + 99 others); Mon, 12 Aug 2019 06:12:27 -0400 Received: from lgeamrelo13.lge.com ([156.147.23.53]:45414 "EHLO lgeamrelo11.lge.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1727843AbfHLKM0 (ORCPT ); Mon, 12 Aug 2019 06:12:26 -0400 Received: from unknown (HELO lgeamrelo04.lge.com) (156.147.1.127) by 156.147.23.53 with ESMTP; 12 Aug 2019 19:12:22 +0900 X-Original-SENDERIP: 156.147.1.127 X-Original-MAILFROM: byungchul.park@lge.com Received: from unknown (HELO X58A-UD3R) (10.177.222.33) by 156.147.1.127 with ESMTP; 12 Aug 2019 19:12:22 +0900 X-Original-SENDERIP: 10.177.222.33 X-Original-MAILFROM: byungchul.park@lge.com Date: Mon, 12 Aug 2019 19:10:52 +0900 From: Byungchul Park To: "Paul E. McKenney" Cc: Byungchul Park , Joel Fernandes , LKML , Rao Shoaib , kernel-team@android.com, kernel-team , Davidlohr Bueso , Josh Triplett , Lai Jiangshan , Mathieu Desnoyers , rcu , Steven Rostedt Subject: Re: [PATCH RFC v1 1/2] rcu/tree: Add basic support for kfree_rcu batching Message-ID: <20190812101052.GA10478@X58A-UD3R> References: <20190806235631.GU28441@linux.ibm.com> <20190807094504.GB169551@google.com> <20190807175215.GE28441@linux.ibm.com> <20190808095232.GA30401@X58A-UD3R> <20190808125607.GB261256@google.com> <20190808180916.GP28441@linux.ibm.com> <20190811083626.GA9486@X58A-UD3R> <20190811084950.GB9486@X58A-UD3R> <20190811234939.GC28441@linux.ibm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190811234939.GC28441@linux.ibm.com> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Aug 11, 2019 at 04:49:39PM -0700, Paul E. McKenney wrote: > Maybe. Note well that I said "potential issue". When I checked a few > years ago, none of the uses of rcu_barrier() cared about kfree_rcu(). > They cared instead about call_rcu() callbacks that accessed code or data > that was going to disappear soon, for example, due to module unload or > filesystem unmount. > > So it -might- be that rcu_barrier() can stay as it is, but with changes > as needed to documentation. > > It also -might- be, maybe now or maybe some time in the future, that > there will need to be a kfree_rcu_barrier() or some such. But if so, > let's not create it until it is needed. For one thing, it is reasonably > likely that something other than a kfree_rcu_barrier() would really > be what was needed. After all, the main point would be to make sure > that the old memory really was freed before allocating new memory. Now I fully understand what you meant thanks to you. Thank you for explaining it in detail. > But if the system had ample memory, why wait? In that case you don't > really need to wait for all the old memory to be freed, but rather for > sufficient memory to be available for allocation. Agree. Totally make sense. Thanks, Byungchul > > Thanx, Paul > > > Thanks, > > Byungchul > > > > > But now that we can see letting the list just grow works well, we don't > > > have to consider this one at the moment. Let's consider this method > > > again once we face the problem in the future by any chance. > > > > > > > We should therefore just let the second list grow. If experience shows > > > > a need for callbacks to be sent up more quickly, it should be possible > > > > to provide an additional list, so that two lists on a given CPU can both > > > > be waiting for a grace period at the same time. > > > > > > Or the third and fourth list might be needed in some system. But let's > > > talk about it later too. > > > > > > > > > I also agree. But this _FORCE thing will still not solve the issue Paul is > > > > > > raising which is doing this loop possibly in irq disabled / hardirq context. > > > > > > > > > > I added more explanation above. What I suggested is a way to avoid not > > > > > only heavy > > > > > work within the irq-disabled region of a single kfree_rcu() but also > > > > > too many requests > > > > > to be queued into ->head. > > > > > > > > But let's start simple, please! > > > > > > Yes. The simpler, the better. > > > > > > > > > We can't even cond_resched() here. In fact since _FORCE is larger, it will be > > > > > > even worse. Consider a real-time system with a lot of memory, in this case > > > > > > letting ->head grow large is Ok, but looping for long time in IRQ disabled > > > > > > would not be Ok. > > > > > > > > > > Please check the explanation above. > > > > > > > > > > > But I could make it something like: > > > > > > 1. Letting ->head grow if ->head_free busy > > > > > > 2. If head_free is busy, then just queue/requeue the monitor to try again. > > > > > > > > > > This is exactly what Paul said. The problem with this is ->head can grow too > > > > > much. That's why I suggested the above one. > > > > > > > > It can grow quite large, but how do you know that limiting its size will > > > > really help? Sure, you have limited the size, but does that really do > > > > > > To decide the size, we might have to refer to how much pressure on > > > memory and RCU there are at that moment and adjust it on runtime. > > > > > > > anything for the larger problem of extreme kfree_rcu() rates on the one > > > > hand and a desire for more efficient handling of kfree_rcu() on the other? > > > > > > Assuming current RCU logic handles extremly high rate well which is > > > anyway true, my answer is *yes*, because batching anyway has pros and > > > cons. One of major cons is there must be inevitable kfree_rcu() requests > > > that not even request to RCU. By allowing only the size of batching, the > > > situation can be mitigated. > > > > > > I just answered to you. But again, let's talk about it later once we > > > face the problem as you said. > > > > > > Thanks, > > > Byungchul > > > > > > > Thanx, Paul > > > > > > > > > > This would even improve performance, but will still risk going out of memory. > > > > > > > > > > > > Thoughts? > > > > > > > > > > > > thanks, > > > > > > > > > > > > - Joel > > > > > > > > > > > > > > > > > > > > This way, we can avoid both: > > > > > > > > > > > > > > 1. too many requests being queued and > > > > > > > 2. __call_rcu() bunch of requests within a single kfree_rcu(). > > > > > > > > > > > > > > Thanks, > > > > > > > Byungchul > > > > > > > > > > > > > > > > > > > > > > > But please feel free to come up with a better solution! > > > > > > > > > > > > > > > > [ . . . ] > > > > > > > > > > > > > > > > > > > > -- > > > > > Thanks, > > > > > Byungchul > > > > >