From: John Garry
CC: John Garry
Subject: [RFC PATCH 0/2] sbitmap: NUMA node spreading
Date: Tue, 10 May 2022 19:14:32 +0800
Message-ID:
 <1652181274-136198-1-git-send-email-john.garry@huawei.com>
X-Mailing-List: linux-kernel@vger.kernel.org

Hi Jens, guys,

I am sending this as an RFC to see if there is any future in it, or any
ideas on how to make it better. I also need to improve some items (as
mentioned in the 2/2 commit message) and test a lot more.

The general idea is that we change from allocating a single array of
sbitmap words to allocating a sub-array per NUMA node. Each CPU in that
node is then hinted to use that sub-array.

Initial performance looks decent.

Some figures:
System: 4 nodes (with memory on all nodes), 128 CPUs

null_blk config:
20 devs, submit_queues=NR_CPUS, shared_tags, shared_tag_bitmap,
hw_queue_depth=256

fio config:
bs=4096, iodepth=128, numjobs=10, cpus_allowed_policy=split, rw=read,
ioscheduler=none

Before:
7130K IOPS

After:
7630K IOPS

So a +7% IOPS gain.

Any comments welcome, thanks!

Based on v5.18-rc6.

John Garry (2):
  sbitmap: Make sbitmap.map a double pointer
  sbitmap: Spread sbitmap word allocation over NUMA nodes

 include/linux/sbitmap.h | 16 +++++---
 lib/sbitmap.c           | 83 +++++++++++++++++++++++++++++++++--------
 2 files changed, 79 insertions(+), 20 deletions(-)

-- 
2.26.2
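
For readers who want a concrete picture of the idea described in the cover letter (one sub-array of words per NUMA node, with each CPU hinted towards its own node's sub-array), here is a minimal userspace sketch. It is not the code from the patches. The struct node_bitmap type, all function names, the contiguous CPU-to-node numbering and the use of calloc() in place of a node-local allocator are assumptions made for illustration, and the sketch is single-threaded, whereas the real sbitmap sets bits atomically.

```c
#include <assert.h>
#include <stdlib.h>

#define BITS_PER_WORD 64

/* Hypothetical stand-in for struct sbitmap; not the kernel's layout. */
struct node_bitmap {
	unsigned int nr_nodes;
	unsigned int words_per_node;
	unsigned int cpus_per_node;
	unsigned long long **map;	/* map[node][word]: one sub-array per node */
};

static void node_bitmap_free(struct node_bitmap *nb)
{
	if (!nb->map)
		return;
	for (unsigned int n = 0; n < nb->nr_nodes; n++)
		free(nb->map[n]);
	free(nb->map);
	nb->map = NULL;
}

static int node_bitmap_init(struct node_bitmap *nb, unsigned int nr_nodes,
			    unsigned int words_per_node,
			    unsigned int cpus_per_node)
{
	assert(nr_nodes > 0 && words_per_node > 0 && cpus_per_node > 0);

	nb->nr_nodes = nr_nodes;
	nb->words_per_node = words_per_node;
	nb->cpus_per_node = cpus_per_node;
	nb->map = calloc(nr_nodes, sizeof(*nb->map));
	if (!nb->map)
		return -1;

	for (unsigned int n = 0; n < nr_nodes; n++) {
		/* Placeholder: a kernel version would allocate on node n. */
		nb->map[n] = calloc(words_per_node, sizeof(**nb->map));
		if (!nb->map[n]) {
			node_bitmap_free(nb);
			return -1;
		}
	}
	return 0;
}

/* Assumed topology: CPUs are numbered contiguously within each node. */
static unsigned int cpu_to_node_hint(const struct node_bitmap *nb,
				     unsigned int cpu)
{
	return (cpu / nb->cpus_per_node) % nb->nr_nodes;
}

/*
 * Claim the first free bit, starting in the sub-array of the CPU's own
 * node and falling back to the other nodes in order. Returns the global
 * bit number, or -1 if every bit is taken.
 */
static int node_bitmap_get(struct node_bitmap *nb, unsigned int cpu)
{
	unsigned int home = cpu_to_node_hint(nb, cpu);

	for (unsigned int i = 0; i < nb->nr_nodes; i++) {
		unsigned int node = (home + i) % nb->nr_nodes;

		for (unsigned int w = 0; w < nb->words_per_node; w++) {
			unsigned long long *word = &nb->map[node][w];

			if (*word == ~0ULL)
				continue;	/* word is full */
			for (unsigned int b = 0; b < BITS_PER_WORD; b++) {
				if (*word & (1ULL << b))
					continue;
				*word |= 1ULL << b;
				return (int)((node * nb->words_per_node + w) *
					     BITS_PER_WORD + b);
			}
		}
	}
	return -1;
}
```

With 4 nodes, 32 CPUs per node and one 64-bit word per node, CPU 40 maps to node 1, so its first allocation is global bit 64. Once node 1's word is full, the same CPU falls back to node 2. The fallback order here is simply "next node"; a distance-aware order would be a separate design choice that this sketch does not attempt.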