Date: Mon, 26 Sep 2022 11:30:06 +0200
From: Michal Hocko
To: Florian Westphal
Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, vbabka@suse.cz,
	akpm@linux-foundation.org, urezki@gmail.com, netdev@vger.kernel.org,
	netfilter-devel@vger.kernel.org, Martin Zaharinov
Subject: Re: [PATCH mm] mm: fix BUG with kvzalloc+GFP_ATOMIC
References: <20220923103858.26729-1-fw@strlen.de>
	<20220923133512.GE22541@breakpoint.cc>
	<20220926075639.GA908@breakpoint.cc>

On Mon 26-09-22 10:58:00, Michal Hocko wrote:
[...]
> A better option to me seems to be reworking the rhashtable_insert_rehash
> to not rely on an atomic allocation. I am not familiar with that code
> but it seems to me that the only reason this allocation mode is used is
> due to rcu locking around rhashtable_try_insert. Is there any reason we
> cannot drop the rcu lock, allocate with the full GFP_KERNEL allocation
> power and retry with the pre allocated object? rhashtable_insert_slow is
> already doing that to implement its never fail semantic.

So a very blunt and likely not 100% correct take on this side of things.
But it should give an idea at least.

---
diff --git a/lib/rhashtable.c b/lib/rhashtable.c
index e12bbfb240b8..fc547c43b05d 100644
--- a/lib/rhashtable.c
+++ b/lib/rhashtable.c
@@ -437,31 +437,11 @@ static void rht_deferred_worker(struct work_struct *work)
 }
 
 static int rhashtable_insert_rehash(struct rhashtable *ht,
-				    struct bucket_table *tbl)
+				    struct bucket_table *tbl,
+				    struct bucket_table *new_tbl)
 {
-	struct bucket_table *old_tbl;
-	struct bucket_table *new_tbl;
-	unsigned int size;
 	int err;
 
-	old_tbl = rht_dereference_rcu(ht->tbl, ht);
-
-	size = tbl->size;
-
-	err = -EBUSY;
-
-	if (rht_grow_above_75(ht, tbl))
-		size *= 2;
-	/* Do not schedule more than one rehash */
-	else if (old_tbl != tbl)
-		goto fail;
-
-	err = -ENOMEM;
-
-	new_tbl = bucket_table_alloc(ht, size, GFP_ATOMIC | __GFP_NOWARN);
-	if (new_tbl == NULL)
-		goto fail;
-
 	err = rhashtable_rehash_attach(ht, tbl, new_tbl);
 	if (err) {
 		bucket_table_free(new_tbl);
@@ -471,17 +451,6 @@ static int rhashtable_insert_rehash(struct rhashtable *ht,
 	schedule_work(&ht->run_work);
 
 	return err;
-
-fail:
-	/* Do not fail the insert if someone else did a rehash. */
-	if (likely(rcu_access_pointer(tbl->future_tbl)))
-		return 0;
-
-	/* Schedule async rehash to retry allocation in process context. */
-	if (err == -ENOMEM)
-		schedule_work(&ht->run_work);
-
-	return err;
 }
 
 static void *rhashtable_lookup_one(struct rhashtable *ht,
@@ -619,9 +588,33 @@ static void *rhashtable_try_insert(struct rhashtable *ht, const void *key,
 		}
 	} while (!IS_ERR_OR_NULL(new_tbl));
 
-	if (PTR_ERR(data) == -EAGAIN)
-		data = ERR_PTR(rhashtable_insert_rehash(ht, tbl) ?:
+	if (PTR_ERR(data) == -EAGAIN) {
+		struct bucket_table *old_tbl;
+		unsigned int size;
+
+		old_tbl = rht_dereference_rcu(ht->tbl, ht);
+		size = tbl->size;
+
+		data = ERR_PTR(-EBUSY);
+
+		if (rht_grow_above_75(ht, tbl))
+			size *= 2;
+		/* Do not schedule more than one rehash */
+		else if (old_tbl != tbl)
+			return data;
+
+		data = ERR_PTR(-ENOMEM);
+
+		rcu_read_unlock();
+		new_tbl = bucket_table_alloc(ht, size, GFP_KERNEL);
+		rcu_read_lock();
+
+		if (!new_tbl)
+			return data;
+
+		data = ERR_PTR(rhashtable_insert_rehash(ht, tbl, new_tbl) ?:
 			       -EAGAIN);
+	}
 
 	return data;
 }
-- 
Michal Hocko
SUSE Labs
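
For readers outside the kernel, the "drop the lock, allocate with full power, retake the lock and retry" idea can be sketched in userspace C. This is only an illustration under stated assumptions: a pthread mutex stands in for the RCU read-side section (where sleeping is not allowed), malloc/calloc stand in for a GFP_KERNEL allocation, and every name here (struct table, struct set, table_alloc, table_free, set_init, set_insert) is hypothetical and not part of lib/rhashtable.c. The step the sketch tries to make explicit is the revalidation after the lock is retaken, since the table may have changed while it was dropped.

```c
#include <assert.h>
#include <pthread.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical growable table; NOT the kernel's struct bucket_table. */
struct table {
	size_t size;	/* capacity in slots */
	size_t used;	/* slots filled so far */
	int *slots;
};

struct set {
	pthread_mutex_t lock;	/* stand-in for a section that must not sleep */
	struct table *tbl;
};

static struct table *table_alloc(size_t size)
{
	struct table *t = malloc(sizeof(*t));

	if (!t)
		return NULL;
	t->slots = calloc(size, sizeof(*t->slots));
	if (!t->slots) {
		free(t);
		return NULL;
	}
	t->size = size;
	t->used = 0;
	return t;
}

static void table_free(struct table *t)
{
	if (t) {
		free(t->slots);
		free(t);
	}
}

static int set_init(struct set *s, size_t size)
{
	assert(size > 0);
	s->tbl = table_alloc(size);
	if (!s->tbl)
		return -1;
	return pthread_mutex_init(&s->lock, NULL) ? -1 : 0;
}

/* Insert a value; returns 0 on success, -1 if the allocation failed. */
static int set_insert(struct set *s, int value)
{
	pthread_mutex_lock(&s->lock);
	for (;;) {
		struct table *cur = s->tbl;
		struct table *new_tbl;
		size_t want;

		if (cur->used < cur->size) {
			cur->slots[cur->used++] = value;
			pthread_mutex_unlock(&s->lock);
			return 0;
		}

		/* Table is full: drop the lock so the allocation may block. */
		want = cur->size * 2;
		pthread_mutex_unlock(&s->lock);
		new_tbl = table_alloc(want);
		pthread_mutex_lock(&s->lock);

		if (!new_tbl) {
			pthread_mutex_unlock(&s->lock);
			return -1;
		}

		/*
		 * Revalidate: sizes only ever grow here, so a different size
		 * means another thread resized while the lock was dropped.
		 * Discard the preallocated table and retry from the top.
		 */
		cur = s->tbl;
		if (cur->size * 2 != want) {
			table_free(new_tbl);
			continue;
		}

		/* Still the table we sized for: migrate and publish. */
		memcpy(new_tbl->slots, cur->slots,
		       cur->used * sizeof(*cur->slots));
		new_tbl->used = cur->used;
		s->tbl = new_tbl;
		table_free(cur);
	}
}
```

Calling set_init(&s, 2) and then inserting ten values grows the table through sizes 4, 8 and 16. Unlike RCU, a mutex excludes concurrent writers entirely, so this sketch says nothing about whether dropping rcu_read_lock() inside rhashtable_try_insert is safe for its callers; that question is exactly what the patch above leaves open.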