Received: by 2002:a25:d7c1:0:0:0:0:0 with SMTP id o184csp868003ybg; Fri, 18 Oct 2019 08:32:55 -0700 (PDT) X-Google-Smtp-Source: APXvYqxbqxXdrh2v1WVgoybzWpAI7Dbg8AKeNCkSDvlRcSfq8uINjSsUxiPdZYX72aX7smRtXZqF X-Received: by 2002:aa7:cf8e:: with SMTP id z14mr10322674edx.60.1571412775637; Fri, 18 Oct 2019 08:32:55 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1571412775; cv=none; d=google.com; s=arc-20160816; b=gGSonQEuva5RTut0MAXnVspI3oIXnDM/HD0qHAwLGWxJuMRvjFvvfL7O6qH+9cywub zGRmVx4ZKAdwPlsG2Zu3guhhF/FTUsKWw55oFlOQ5tCdKByFudNK052ica9BmkxdcT+d eZ1rssX7x3p2dUTYwvWCI5B8bj/J1CJ72JBd9bQi2NMXzMq4QjEQ0b4oivUs6nd0KwsF FOL/tR87XYR6uxhZIrvY9kInTEQy1YtaUOQGhEPiAWwGRDoEwNxNBuGGKmHYIASr2vIU aO5H2AQ3wvzldMuQtepso4aE8GbjdzaRK5fPU4d5No/4Pc5BqSe0/SslGyRzM8rbvOdZ XlvQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:subject:cc :to:from; bh=SgUc6JKeLoljISLQAgKcLP33MesVLYcd3wg/hB4bflQ=; b=qixGOvFWNo4/kYA3wrRZzPSxbpKTlXBDZxMk4g+Ltz4xxnNh9de9/lf3+zCm1QcwIr 6D65unkmwAXqHAWcG4gIfP96SzJgLOhfiNnL0w6UFylN78Hz4QaDRyF6xnTqRg9lbvSJ zMMD/8FyJUwDxdyIX78ajAX1f6MoWJJR+ZyrdS1R+UDneIYDocBgwdMvjbzOaZaO3pZU O1oOp4mxv7L7bVT9i0i6O/GMCpKrw9wV3+6DFnaeYk6euS3AFrX5ELW9bMfoXDsTUuQD XimsLmss6NsjjK76bbenB2kcXOjiNgYWHlFRkXIU4sPSh/T2QHGkuhFZqMddAy2KGlc8 odPA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u8si3952891edq.84.2019.10.18.08.32.32; Fri, 18 Oct 2019 08:32:55 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2405228AbfJQK0C (ORCPT + 99 others); Thu, 17 Oct 2019 06:26:02 -0400 Received: from szxga07-in.huawei.com ([45.249.212.35]:39294 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S2404947AbfJQK0B (ORCPT ); Thu, 17 Oct 2019 06:26:01 -0400 Received: from DGGEMS406-HUB.china.huawei.com (unknown [172.30.72.58]) by Forcepoint Email with ESMTP id E77D4691DBEF2CE84F46; Thu, 17 Oct 2019 18:25:58 +0800 (CST) Received: from localhost.localdomain (10.67.212.132) by DGGEMS406-HUB.china.huawei.com (10.3.19.206) with Microsoft SMTP Server id 14.3.439.0; Thu, 17 Oct 2019 18:25:52 +0800 From: Shaokun Zhang To: CC: yuqi jin , Andrew Morton , Mike Rapoport , Paul Burton , Michal Hocko , Michael Ellerman , Anshuman Khandual , Shaokun Zhang Subject: [RFC] lib: optimize cpumask_local_spread() Date: Thu, 17 Oct 2019 18:23:08 +0800 Message-ID: <1571307788-43169-1-git-send-email-zhangshaokun@hisilicon.com> X-Mailer: git-send-email 2.7.4 MIME-Version: 1.0 Content-Type: text/plain X-Originating-IP: [10.67.212.132] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: yuqi jin In the multi-processor and NUMA system, A device may have many numa nodes belonging to multiple cpus. When we get a local numa, it is better to find the node closest to the local numa node to return instead of going to the online cpu immediately. For example, In Huawei Kunpeng 920 system, there are 4 NUMA node(0 -3) in the 2-socket system(0 - 1). If the I/O device is in socket1 and the local NUMA node is 2, we shall choose the non-local node3 in the same socket when cpu core in NUMA node2 is less that I/O requirements. If we directly pick one cpu core from all online ones, it may be in the another socket and it is not friendly for performance. Cc: Andrew Morton Cc: Mike Rapoport Cc: Paul Burton Cc: Michal Hocko Cc: Michael Ellerman Cc: Anshuman Khandual Signed-off-by: yuqi jin Signed-off-by: Shaokun Zhang --- lib/cpumask.c | 78 ++++++++++++++++++++++++++++++++++++++++++++++++++--------- 1 file changed, 67 insertions(+), 11 deletions(-) diff --git a/lib/cpumask.c b/lib/cpumask.c index 0cb672eb107c..8f89c7cebfb0 100644 --- a/lib/cpumask.c +++ b/lib/cpumask.c @@ -192,6 +192,33 @@ void __init free_bootmem_cpumask_var(cpumask_var_t mask) } #endif +static void calc_node_distance(int *node_dist, int node) +{ + int i; + + for (i = 0; i < nr_node_ids; i++) + node_dist[i] = node_distance(node, i); +} + +static int find_nearest_node(int *node_dist, bool *used_flag) +{ + int i, min_dist = node_dist[0], node_id = -1; + + for (i = 0; i < nr_node_ids; i++) + if (used_flag[i] == 0) { + min_dist = node_dist[i]; + node_id = i; + break; + } + for (i = 0; i < nr_node_ids; i++) + if (node_dist[i] < min_dist && used_flag[i] == 0) { + min_dist = node_dist[i]; + node_id = i; + } + + return node_id; +} + /** * cpumask_local_spread - select the i'th cpu with local numa cpu's first * @i: index number @@ -205,7 +232,8 @@ void __init free_bootmem_cpumask_var(cpumask_var_t mask) */ unsigned int cpumask_local_spread(unsigned int i, int node) { - int cpu; + int cpu, j, id, *node_dist; + bool *used_flag; /* Wrap: we always want a cpu. */ i %= num_online_cpus(); @@ -215,19 +243,47 @@ unsigned int cpumask_local_spread(unsigned int i, int node) if (i-- == 0) return cpu; } else { - /* NUMA first. */ - for_each_cpu_and(cpu, cpumask_of_node(node), cpu_online_mask) - if (i-- == 0) - return cpu; + node_dist = kmalloc_array(nr_node_ids, + sizeof(int), GFP_KERNEL); + if (!node_dist) + for_each_cpu(cpu, cpu_online_mask) + if (i-- == 0) + return cpu; - for_each_cpu(cpu, cpu_online_mask) { - /* Skip NUMA nodes, done above. */ - if (cpumask_test_cpu(cpu, cpumask_of_node(node))) - continue; + used_flag = kmalloc_array(nr_node_ids, + sizeof(bool), GFP_KERNEL); + if (!used_flag) + for_each_cpu(cpu, cpu_online_mask) + if (i-- == 0) { + kfree(node_dist); + return cpu; + } + memset(used_flag, 0, nr_node_ids * sizeof(bool)); - if (i-- == 0) - return cpu; + calc_node_distance(node_dist, node); + for (j = 0; j < nr_node_ids; j++) { + id = find_nearest_node(node_dist, used_flag); + if (id < 0) + break; + for_each_cpu_and(cpu, + cpumask_of_node(id), cpu_online_mask) + if (i-- == 0) { + kfree(node_dist); + kfree(used_flag); + return cpu; + } + used_flag[id] = 1; } + + for_each_cpu(cpu, cpu_online_mask) + if (i-- == 0) { + kfree(node_dist); + kfree(used_flag); + return cpu; + } + + kfree(node_dist); + kfree(used_flag); } BUG(); } -- 2.7.4