Received: by 2002:a6b:fb09:0:0:0:0:0 with SMTP id h9csp392786iog; Thu, 30 Jun 2022 02:55:05 -0700 (PDT) X-Google-Smtp-Source: AGRyM1sRxU8ehjJ9QnblXgPPQtVSjsC/SAT2GuvgvMIshC9umJ+c6N6yqPbj0731RaY+gQy3+ooJ X-Received: by 2002:a17:90a:c718:b0:1eb:af0e:36fd with SMTP id o24-20020a17090ac71800b001ebaf0e36fdmr11221060pjt.99.1656582904874; Thu, 30 Jun 2022 02:55:04 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1656582904; cv=none; d=google.com; s=arc-20160816; b=I7BTGBkU/PTrIEKkasx4ruW/TUGkpar0VbEn58R+SKTem1QyJNcXxqxsgxaYjJUFAT 97RxldKHtQ5PGGAoG+5gPOFWu7HipQcgPt+6lOgZAQf3VhFDImT2iydUNHBdzam0ni3B Owj2DED2s9cm3Isys56hz8iMFAtl2AdbUw2hKrzuMsZYg4HkAvytdXUBU9q3XQ/xnwfg jSCe3Ht9I5dPvlOTuAgypsK8+B/eymqv+LBxHJfn31UC5jsUDhcDkEJx+W6qxd64Gayv 2RmuI4MEqU2VcSib4cSKiispwqH9XD7RqwA90xVaOsHCd2pRJnVtmIstKnmJ5NYnJz04 LndQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id; bh=lWJfGXIJO/6riYZY6WXj48E2VA79XEb1mGLj4eiFtz0=; b=fJPcFX3Nc9Sa1wYemkzr947JYRHC6mhHw1W+aB+Gf5tJtPbEbdDJs/4I5330YlMncq gRcWPCtDGAoCSTUxssXWBrKCKFULzozXYhYtx8C8I+kVEU7pg8HhMHpWz5nmQ5oB7L/e /0DgESwLbSE91qr8UDnwIN1egO289oeutbBZdRfAUD8PwxWu/80NFLXcLfGviiXUQxKX Bkwa2aUUG+EnOjekD0UVSj8hBIPWFxiQrowc4EPIIi0b5Kv7YpoKC8P31D81JzG1MuxB EA5E+VYlWTKq1q3BZregHAGKTYn0kWk5QN4c0BOnsmvJjdyZVAzntAk81ncGJN4jDotI XrRQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id rm1-20020a17090b3ec100b001e34f58374fsi2463950pjb.129.2022.06.30.02.54.53; Thu, 30 Jun 2022 02:55:04 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233849AbiF3JCK (ORCPT + 99 others); Thu, 30 Jun 2022 05:02:10 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59626 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229964AbiF3JCI (ORCPT ); Thu, 30 Jun 2022 05:02:08 -0400 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 9F1AAF585 for ; Thu, 30 Jun 2022 02:02:07 -0700 (PDT) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 92B291063; Thu, 30 Jun 2022 02:02:07 -0700 (PDT) Received: from [10.57.85.25] (unknown [10.57.85.25]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id C38B93F5A1; Thu, 30 Jun 2022 02:02:05 -0700 (PDT) Message-ID: <13db50bb-57c7-0d54-3857-84b8a4591d9e@arm.com> Date: Thu, 30 Jun 2022 10:02:00 +0100 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; rv:91.0) Gecko/20100101 Thunderbird/91.11.0 Subject: Re: [PATCH] iommu/iova: change IOVA_MAG_SIZE to 127 to save memory Content-Language: en-GB To: Feng Tang , Joerg Roedel , Will Deacon , iommu@lists.linux-foundation.org, iommu@lists.linux.dev, Andrew Morton , Christoph Lameter , Vlastimil Babka Cc: linux-mm@kvack.org, Paul Menzel , linux-kernel@vger.kernel.org References: <20220630073304.26945-1-feng.tang@intel.com> From: Robin Murphy In-Reply-To: <20220630073304.26945-1-feng.tang@intel.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-6.9 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_HI,SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2022-06-30 08:33, Feng Tang wrote: > kmalloc will round up the request size to power of 2, and current > iova_magazine's size is 1032 (1024+8) bytes, so each instance > allocated will get 2048 bytes from kmalloc, causing around 1KB > waste. > > And in some exstreme case, the memory wasted can trigger OOM as > reported in 2019 on a crash kernel with 256 MB memory [1]. I don't think it really needs pointing out that excessive memory consumption can cause OOM. Especially not in the particularly silly context of a system with only 2MB of RAM per CPU - that's pretty much guaranteed to be doomed one way or another. > [ 4.319253] iommu: Adding device 0000:06:00.2 to group 5 > [ 4.325869] iommu: Adding device 0000:20:01.0 to group 15 > [ 4.332648] iommu: Adding device 0000:20:02.0 to group 16 > [ 4.338946] swapper/0 invoked oom-killer: gfp_mask=0x6040c0(GFP_KERNEL|__GFP_COMP), nodemask=(null), order=0, oom_score_adj=0 > [ 4.350251] swapper/0 cpuset=/ mems_allowed=0 > [ 4.354618] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 4.19.57.mx64.282 #1 > [ 4.355612] Hardware name: Dell Inc. PowerEdge R7425/08V001, BIOS 1.9.3 06/25/2019 > [ 4.355612] Call Trace: > [ 4.355612] dump_stack+0x46/0x5b > [ 4.355612] dump_header+0x6b/0x289 > [ 4.355612] out_of_memory+0x470/0x4c0 > [ 4.355612] __alloc_pages_nodemask+0x970/0x1030 > [ 4.355612] cache_grow_begin+0x7d/0x520 > [ 4.355612] fallback_alloc+0x148/0x200 > [ 4.355612] kmem_cache_alloc_trace+0xac/0x1f0 > [ 4.355612] init_iova_domain+0x112/0x170 > [ 4.355612] amd_iommu_domain_alloc+0x138/0x1a0 > [ 4.355612] iommu_group_get_for_dev+0xc4/0x1a0 > [ 4.355612] amd_iommu_add_device+0x13a/0x610 > [ 4.355612] add_iommu_group+0x20/0x30 > [ 4.355612] bus_for_each_dev+0x76/0xc0 > [ 4.355612] bus_set_iommu+0xb6/0xf0 > [ 4.355612] amd_iommu_init_api+0x112/0x132 > [ 4.355612] state_next+0xfb1/0x1165 > [ 4.355612] amd_iommu_init+0x1f/0x67 > [ 4.355612] pci_iommu_init+0x16/0x3f > ... > [ 4.670295] Unreclaimable slab info: > ... > [ 4.857565] kmalloc-2048 59164KB 59164KB > > Change IOVA_MAG_SIZE from 128 to 127 to make size of 'iova_magazine' > 1024 bytes so that no memory will be wasted. > > [1]. https://lkml.org/lkml/2019/8/12/266 > > Signed-off-by: Feng Tang > --- > drivers/iommu/iova.c | 7 ++++++- > 1 file changed, 6 insertions(+), 1 deletion(-) > > diff --git a/drivers/iommu/iova.c b/drivers/iommu/iova.c > index db77aa675145b..27634ddd9b904 100644 > --- a/drivers/iommu/iova.c > +++ b/drivers/iommu/iova.c > @@ -614,7 +614,12 @@ EXPORT_SYMBOL_GPL(reserve_iova); > * dynamic size tuning described in the paper. > */ > > -#define IOVA_MAG_SIZE 128 > +/* > + * As kmalloc's buffer size is fixed to power of 2, 127 is chosen to > + * assure size of 'iova_magzine' to be 1024 bytes, so that no memory Typo: iova_magazine > + * will be wasted. > + */ > +#define IOVA_MAG_SIZE 127 The change itself seems perfectly reasonable, though. Acked-by: Robin Murphy > #define MAX_GLOBAL_MAGS 32 /* magazines per bin */ > > struct iova_magazine {