Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp4296ybl; Thu, 30 Jan 2020 15:53:47 -0800 (PST) X-Google-Smtp-Source: APXvYqw2EmBO1PELmiXlCmSkSNfOwZ1OPzpSmAtexwyKl9LouUrTEKKZxA659Ql6oWEYqVfH31xr X-Received: by 2002:a9d:4e97:: with SMTP id v23mr5264617otk.201.1580428427390; Thu, 30 Jan 2020 15:53:47 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1580428427; cv=none; d=google.com; s=arc-20160816; b=Qx8Io9GvN/f5DgFUfWFcnbRX8dJqX3DxL20hh/K59KAQ5EVU3MJyLdX4WYRODoSnMR 0lBCxh/ZsCcpWBXUEVdii22X6P8wSgjHj+x52ELIT0egU4D0iKDYU5zcYgquA42SiDNi +3gwaGf+6CVRYXSvwyXQPVKS4SlUX36S6Nt+C8DocFma2BR3xNCODa2snXD1VugyWtUc YcZ4K0jGKzYXTLx3mecxCYt+xw7vY/wfP3YNbQe1SZ/Goexxs6zuPWLf4S3uiIvWiN7C S/MO21xdaT8hZuzgjqIW1bIrdVtStK08uyUrpknr0TyjWkElVVjze8uvdIzbST9ZU/Q/ Rdsw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=brmCQy+QkB8YBCzYoRpJDzbVFSJQ+oGn6d2Dd2Dta0U=; b=PYF90OeCnupo6Yfn+9zd1OcDtOZT9GhpTPdpN+BgYXSKmBOanz5TCVufB3tx1AYsf1 YMCTYlApPdNfUqqcZ8+YMgxLX46aakl79m0vQgP+f6GX2X3m0YFztOZrqKHq60KQAnAY tIEyoSwpVrd+MS5Eyne0HK+W56QYAJ8AHSHgBNO09XfWzDMvOXzFEyfh8iqZxo82tqNO cHDs/3nlF1K0IekkdsOb+o2OYcfLTJuaqGLwioQZ8a7LjHJKnVnZ6XxtSDTEp0PtW7Nx MYFzSNnWWMNLC3obLtozy4b2EzdEZca9MibzGfwQuBPkKukPTY2SJRZZSqWygBwrzhN9 6HeQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w18si3872180otl.54.2020.01.30.15.53.25; Thu, 30 Jan 2020 15:53:47 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727566AbgA3Xqy (ORCPT + 99 others); Thu, 30 Jan 2020 18:46:54 -0500 Received: from foss.arm.com ([217.140.110.172]:59020 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726026AbgA3Xqy (ORCPT ); Thu, 30 Jan 2020 18:46:54 -0500 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 280A11FB; Thu, 30 Jan 2020 15:46:53 -0800 (PST) Received: from [192.168.1.123] (unknown [172.31.20.19]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 794433F67D; Thu, 30 Jan 2020 15:46:51 -0800 (PST) Subject: Re: [PATCH] dma-debug: dynamic allocation of hash table To: Eric Dumazet , Christoph Hellwig , Joerg Roedel Cc: iommu@lists.linux-foundation.org, Eric Dumazet , Geert Uytterhoeven , linux-kernel References: <20200130191049.190569-1-edumazet@google.com> From: Robin Murphy Message-ID: Date: Thu, 30 Jan 2020 23:46:46 +0000 User-Agent: Mozilla/5.0 (Windows NT 10.0; rv:68.0) Gecko/20100101 Thunderbird/68.4.2 MIME-Version: 1.0 In-Reply-To: <20200130191049.190569-1-edumazet@google.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-GB Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Eric, On 2020-01-30 7:10 pm, Eric Dumazet via iommu wrote: > Increasing the size of dma_entry_hash size by 327680 bytes > has reached some bootloaders limitations. [ That might warrant some further explanation - I don't quite follow how this would relate to a bootloader specifically :/ ] > Simply use dynamic allocations instead, and take > this opportunity to increase the hash table to 65536 > buckets. Finally my 40Gbit mlx4 NIC can sustain > line rate with CONFIG_DMA_API_DEBUG=y. That's pretty cool, but I can't help but wonder if making the table bigger caused a problem in the first place, whether making it bigger yet again in the name of a fix is really the wisest move. How might this impact DMA debugging on 32-bit embedded systems with limited vmalloc space and even less RAM, for instance? More to the point, does vmalloc() even work for !CONFIG_MMU builds? Obviously we don't want things to be *needlessly* slow if avoidable, but is there a genuine justification for needing to optimise what is fundamentally an invasive heavyweight correctness check - e.g. has it helped expose race conditions that were otherwise masked? That said, by moving to dynamic allocation maybe there's room to be cleverer and make HASH_SIZE scale with, say, system memory size? (I assume from the context it's not something we can expand on-demand like we did for the dma_debug_entry pool) Robin. > Fixes: 5e76f564572b ("dma-debug: increase HASH_SIZE") > Signed-off-by: Eric Dumazet > Reported-by: Geert Uytterhoeven > Cc: Christoph Hellwig > --- > kernel/dma/debug.c | 10 ++++++++-- > 1 file changed, 8 insertions(+), 2 deletions(-) > > diff --git a/kernel/dma/debug.c b/kernel/dma/debug.c > index 2031ed1ad7fa109bb8a8c290bbbc5f825362baba..a310dbb1515e92c081f8f3f9a7290dd5e53fc889 100644 > --- a/kernel/dma/debug.c > +++ b/kernel/dma/debug.c > @@ -27,7 +27,7 @@ > > #include > > -#define HASH_SIZE 16384ULL > +#define HASH_SIZE 65536ULL > #define HASH_FN_SHIFT 13 > #define HASH_FN_MASK (HASH_SIZE - 1) > > @@ -90,7 +90,8 @@ struct hash_bucket { > }; > > /* Hash list to save the allocated dma addresses */ > -static struct hash_bucket dma_entry_hash[HASH_SIZE]; > +static struct hash_bucket *dma_entry_hash __read_mostly; > + > /* List of pre-allocated dma_debug_entry's */ > static LIST_HEAD(free_entries); > /* Lock for the list above */ > @@ -934,6 +935,10 @@ static int dma_debug_init(void) > if (global_disable) > return 0; > > + dma_entry_hash = vmalloc(HASH_SIZE * sizeof(*dma_entry_hash)); > + if (!dma_entry_hash) > + goto err; > + > for (i = 0; i < HASH_SIZE; ++i) { > INIT_LIST_HEAD(&dma_entry_hash[i].list); > spin_lock_init(&dma_entry_hash[i].lock); > @@ -950,6 +955,7 @@ static int dma_debug_init(void) > pr_warn("%d debug entries requested but only %d allocated\n", > nr_prealloc_entries, nr_total_entries); > } else { > +err: > pr_err("debugging out of memory error - disabled\n"); > global_disable = true; > >