Received: by 2002:a05:6a10:22f:0:0:0:0 with SMTP id 15csp4716431pxk; Wed, 30 Sep 2020 09:49:56 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzp820ZMZCDS73ROBhczFbL0/bH/w93LJZhuNzfDM3EUJypLmDAn+8hPDoUNLxj9p5nGG4g X-Received: by 2002:a17:906:a198:: with SMTP id s24mr3687200ejy.154.1601484596184; Wed, 30 Sep 2020 09:49:56 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1601484596; cv=none; d=google.com; s=arc-20160816; b=Md2kb2V0CyCRKvQnvRJavKuM6z5Bo+udBK61+3R4OmNlnrUArjwXH/IdwgX3AJchKv oA5Vf0XRKJyze2pZRR7OHwpHATNvk/K3A17rby06dE5gP8e5AIR60Sgy1zNOKg1MHJpx vHE5+Fe3MZL6H9OanTm8SgCaHgOaqBg7whGm8JTGI0Q2VaxJ2lDjHy2lr1BzU2MbhbJ+ eLN0k7R2Bh7N20/WMt+x+DTY2mT7PDPBLJD2bv78I2vbCUsFNxKT2FTq9lGlNPXJksv2 GYmrapiVystFsigG6djG/DtB5hQ48dW3ZKOipAlLIvHUsf6GTJp5lCJPpouditG93+5a NMiA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=ZtN3pR3Z/oium5GfdXD2uz9/2bfNv63HFSibvAivduA=; b=XkQ1ST2U7y0s5VlcgoodduBKHEEWwGW/Nd1qkWsoLcxj29p00H+t0cwuFC8/p3BB97 YrqbZk49pgjVapVH6MlleeSRQunx0qzHUOFuxIRB5S0VwQfPvHvOC043VfoixLl3qZQN rXIY7SP1UfxEkweguh6kGr2aNYN9B135LC8VvW/94zmGmBSeWUOYHGZBt+cUd+vteP/j qB9w4wwqmpxWp/2gKNeFeuCghaUUz4egH+m0qf9F2IbensrkXLQ0VB4u+4zr4IV9+ic+ /AXdvCpiOUNQSbtNmTMWCbTthehHjEWVcsx4U/xrv7ZwbDtkApeFzZ7qkBG94dNIfyvm p+gw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=ieofj3pA; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=suse.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id d25si1537743eje.181.2020.09.30.09.49.30; Wed, 30 Sep 2020 09:49:56 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=ieofj3pA; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=suse.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726335AbgI3QsZ (ORCPT + 99 others); Wed, 30 Sep 2020 12:48:25 -0400 Received: from mx2.suse.de ([195.135.220.15]:46924 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725823AbgI3QsZ (ORCPT ); Wed, 30 Sep 2020 12:48:25 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=susede1; t=1601484503; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=ZtN3pR3Z/oium5GfdXD2uz9/2bfNv63HFSibvAivduA=; b=ieofj3pAkg5i6Sr6TjB0RUnbe67x+W/JYtJE8sv+gEAJGMujLFV76gqC6+h7tVN7LQ6OoL ClD+6YbWWbEVgIXXTgtUA/jAPy+RbYWK2LjHJIhvdgRWbnBbGxmx2WaR/glH8FtjG5gZGK R4tWOE6K525nzZztXdKKP697OLJn/h4= Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id 0B0A1AFA9; Wed, 30 Sep 2020 16:48:23 +0000 (UTC) Date: Wed, 30 Sep 2020 18:48:22 +0200 From: Michal Hocko To: Joel Fernandes Cc: Uladzislau Rezki , Mel Gorman , "Paul E. McKenney" , LKML , RCU , linux-mm@kvack.org, Andrew Morton , Peter Zijlstra , Vlastimil Babka , Thomas Gleixner , "Theodore Y . Ts'o" , Sebastian Andrzej Siewior , Oleksiy Avramchenko , Mel Gorman Subject: Re: [RFC-PATCH 2/4] mm: Add __rcu_alloc_page_lockless() func. Message-ID: <20200930164822.GX2277@dhcp22.suse.cz> References: <20200922075002.GU12990@dhcp22.suse.cz> <20200922131257.GA29241@pc636> <20200923103706.GJ3179@techsingularity.net> <20200923154105.GO29330@paulmck-ThinkPad-P72> <20200923232251.GK3179@techsingularity.net> <20200924081614.GA14819@pc636> <20200925080503.GC3389@dhcp22.suse.cz> <20200925153129.GB25350@pc636> <20200925154741.GI3389@dhcp22.suse.cz> <20200930152517.GA1470428@google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200930152517.GA1470428@google.com> User-Agent: Mutt/1.10.1 (2018-07-13) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed 30-09-20 11:25:17, Joel Fernandes wrote: > On Fri, Sep 25, 2020 at 05:47:41PM +0200, Michal Hocko wrote: > > On Fri 25-09-20 17:31:29, Uladzislau Rezki wrote: > > > > > > > > > > > > > > All good points! > > > > > > > > > > > > > > On the other hand, duplicating a portion of the allocator functionality > > > > > > > within RCU increases the amount of reserved memory, and needlessly most > > > > > > > of the time. > > > > > > > > > > > > > > > > > > > But it's very similar to what mempools are for. > > > > > > > > > > > As for dynamic caching or mempools. It requires extra logic on top of RCU > > > > > to move things forward and it might be not efficient way. As a side > > > > > effect, maintaining of the bulk arrays in the separate worker thread > > > > > will introduce other drawbacks: > > > > > > > > This is true but it is also true that it is RCU to require this special > > > > logic and we can expect that we might need to fine tune this logic > > > > depending on the RCU usage. We definitely do not want to tune the > > > > generic page allocator for a very specific usecase, do we? > > > > > > > I look at it in scope of GFP_ATOMIC/GFP_NOWAIT issues, i.e. inability > > > to provide a memory service for contexts which are not allowed to > > > sleep, RCU is part of them. Both flags used to provide such ability > > > before but not anymore. > > > > > > Do you agree with it? > > > > Yes this sucks. But this is something that we likely really want to live > > with. We have to explicitly _document_ that really atomic contexts in RT > > cannot use the allocator. From the past discussions we've had this is > > likely the most reasonable way forward because we do not really want to > > encourage anybody to do something like that and there should be ways > > around that. The same is btw. true also for !RT. The allocator is not > > NMI safe and while we should be able to make it compatible I am not > > convinced we really want to. > > > > Would something like this be helpful wrt documentation? > > > > diff --git a/include/linux/gfp.h b/include/linux/gfp.h > > index 67a0774e080b..9fcd47606493 100644 > > --- a/include/linux/gfp.h > > +++ b/include/linux/gfp.h > > @@ -238,7 +238,9 @@ struct vm_area_struct; > > * %__GFP_FOO flags as necessary. > > * > > * %GFP_ATOMIC users can not sleep and need the allocation to succeed. A lower > > - * watermark is applied to allow access to "atomic reserves" > > + * watermark is applied to allow access to "atomic reserves". > > + * The current implementation doesn't support NMI and other non-preemptive context > > + * (e.g. raw_spin_lock). > > I think documenting is useful. > > Could it be more explicit in what the issue is? Something like: > > * Even with GFP_ATOMIC, calls to the allocator can sleep on PREEMPT_RT > systems. Therefore, the current low-level allocator implementation does not > support being called from special contexts that are atomic on RT - such as > NMI and raw_spin_lock. Due to these constraints and considering calling code > usually has no control over the PREEMPT_RT configuration, callers of the > allocator should avoid calling the allocator from these cotnexts even in > non-RT systems. I do not mind documenting RT specific behavior but as mentioned in other reply, this should likely go via RT tree for now. There is likely more to clarify about atomicity for PREEMPT_RT. -- Michal Hocko SUSE Labs