Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp7110602ybi; Thu, 13 Jun 2019 09:43:46 -0700 (PDT) X-Google-Smtp-Source: APXvYqy/te9RFEtFKGiUO2LGK+q8b1pJ/PzPEq4XkKcDZ/0x6mjvDB9EAHZ1UGzUcSOwqEumH/if X-Received: by 2002:aa7:8248:: with SMTP id e8mr54563229pfn.155.1560444226833; Thu, 13 Jun 2019 09:43:46 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1560444226; cv=none; d=google.com; s=arc-20160816; b=OtUZaxTslxy0CHkxEQibu9JTXeH8g9+EkKR2Os+3x6e+lkjXGu+YDIJrLS0lgaaJqD tE9JlPoxoELg2dJWyTCmU032117GMCYLtwMRP3jl0iiISrRR/9TfoQculNPl/Kz7eJhS 9XLIE7jXZHtd7VaeI1Wv8rNJo3YLMygWS29H47/3TIQXbQEFg2Ub3Gu09YR2TPeFf0Wq iOTKAqBPU3wZYxeDQsCEda+H5ohQ0w2VrC7Gf+dR67vW9PXHzD0GvBlaW6GKUj6bH5Wp nQcTvkdp4/r4i6hbpgbj80w50g7oX2iR8ARZV8nJFsRQ/O+f6PLOLoHNgCn8YjNQqk+c ih4w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=GzhNgQxR09z+pStweMDW5yVMWaOXFVfHF5vEm/TcUJQ=; b=RDWqZzb1u0RD61TZ4qe6oCylNzK0NcWrc/LvjKkZeaX1ZNCHTIQcGT1cLCJFx0dx7i 9rEFJkEN3eMcFAIeGOMZh7NLpdodxQvk1MGmgqm2W8uEOYD/w0JMiRQHpFONeEpJEhOn 0KOysv6y9jIqxlyf0d0JPn0z3fNptsBzDPJvLSQVIVRwgDzj7joaug2kK0ci8ORAN+4y 4/hJpn9s9D2E10CwHnntGBWFDcKRIlJmPUMcEhOAhMrKLq3McKUc7CYi6vGd7414coEi tseCS0RlrBQDIOAaGspehZLFSuR5dYVX7QYb6Ce4P6pANMWO+L4ZURlavlC+7rRLKyR+ KA7w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=NONE dis=NONE) header.from=vmware.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j88si155244pje.35.2019.06.13.09.43.32; Thu, 13 Jun 2019 09:43:46 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=NONE dis=NONE) header.from=vmware.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2392722AbfFMQmO (ORCPT + 99 others); Thu, 13 Jun 2019 12:42:14 -0400 Received: from mail-pg1-f196.google.com ([209.85.215.196]:35290 "EHLO mail-pg1-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730515AbfFMGtK (ORCPT ); Thu, 13 Jun 2019 02:49:10 -0400 Received: by mail-pg1-f196.google.com with SMTP id s27so10364392pgl.2 for ; Wed, 12 Jun 2019 23:49:09 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references:mime-version:content-transfer-encoding; bh=GzhNgQxR09z+pStweMDW5yVMWaOXFVfHF5vEm/TcUJQ=; b=T+bAgR7oFSZ88LrPG5Dwl9wLOxZEezUb0RQoutukYfEWKL4E/o3OFPa7Q5KWPM/gcp IcXrAFgXbudl0nOhIdIZZ861SnvjHHLk/8IWoMMv2d6+YTooeMei+zN8Z0ArYZTBtpmg Q0K7CXp7gJtD1c+QbYFKTuXBQ9Q2LesSiGDDtvJZJf8dUPs5L8EWRpZllXjG6smOa9Bp kUVrOfmRpU6+oupI8Cej8wqvMHv+q9+X6eXvvk4ZGT6YoT0l307ZExZMbYv3hJcRqY/+ ILYcnIlG/LuwJJHQP9cJUZUGTHd4gfp36+KW9qkKY53vxcxoCNLcnllGAOn5+o4vVRHl P3pA== X-Gm-Message-State: APjAAAUV78jxhHG7XY9uIsWGgV+6OoAVwS8vYl7arbFgr0+SGi/iAE1/ hNqMS9J53U3nk7qT+fqKm7Q= X-Received: by 2002:a63:8f09:: with SMTP id n9mr28590963pgd.249.1560408549306; Wed, 12 Jun 2019 23:49:09 -0700 (PDT) Received: from htb-2n-eng-dhcp405.eng.vmware.com ([66.170.99.1]) by smtp.gmail.com with ESMTPSA id i3sm1559973pfa.175.2019.06.12.23.48.59 (version=TLS1_3 cipher=AEAD-AES256-GCM-SHA384 bits=256/256); Wed, 12 Jun 2019 23:49:08 -0700 (PDT) From: Nadav Amit To: Peter Zijlstra , Andy Lutomirski Cc: linux-kernel@vger.kernel.org, Ingo Molnar , Borislav Petkov , x86@kernel.org, Thomas Gleixner , Dave Hansen , Nadav Amit , Rik van Riel , Josh Poimboeuf Subject: [PATCH 2/9] smp: Run functions concurrently in smp_call_function_many() Date: Wed, 12 Jun 2019 23:48:06 -0700 Message-Id: <20190613064813.8102-3-namit@vmware.com> X-Mailer: git-send-email 2.20.1 In-Reply-To: <20190613064813.8102-1-namit@vmware.com> References: <20190613064813.8102-1-namit@vmware.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Currently, on_each_cpu() and similar functions do not exploit the potential of concurrency: the function is first executed remotely and only then it is executed locally. Functions such as TLB flush can take considerable time, so this provides an opportunity for performance optimization. To do so, introduce __smp_call_function_many(), which allows the callers to provide local and remote functions that should be executed, and run them concurrently. Keep smp_call_function_many() semantic as it is today for backward compatibility: the called function is not executed in this case locally. __smp_call_function_many() does not use the optimized version for a single remote target that smp_call_function_single() implements. For synchronous function call, smp_call_function_single() keeps a call_single_data (which is used for synchronization) on the stack. Interestingly, it seems that not using this optimization provides greater performance improvements (greater speedup with a single remote target than with multiple ones). Presumably, holding data structures that are intended for synchronization on the stack can introduce overheads due to TLB misses and false-sharing when the stack is used for other purposes. Adding support to run the functions concurrently required to remove a micro-optimization in on_each_cpu() that disabled/enabled IRQs instead of saving/restoring them. The benefit of running the local and remote code concurrently is expected to be greater. Cc: Peter Zijlstra Cc: Dave Hansen Cc: Rik van Riel Cc: Thomas Gleixner Cc: Andy Lutomirski Cc: Josh Poimboeuf Signed-off-by: Nadav Amit --- include/linux/smp.h | 27 ++++++--- kernel/smp.c | 133 +++++++++++++++++++++----------------------- 2 files changed, 83 insertions(+), 77 deletions(-) diff --git a/include/linux/smp.h b/include/linux/smp.h index bb8b451ab01f..b69abc88259d 100644 --- a/include/linux/smp.h +++ b/include/linux/smp.h @@ -32,11 +32,6 @@ extern unsigned int total_cpus; int smp_call_function_single(int cpuid, smp_call_func_t func, void *info, int wait); -/* - * Call a function on all processors - */ -void on_each_cpu(smp_call_func_t func, void *info, int wait); - /* * Call a function on processors specified by mask, which might include * the local one. @@ -44,6 +39,15 @@ void on_each_cpu(smp_call_func_t func, void *info, int wait); void on_each_cpu_mask(const struct cpumask *mask, smp_call_func_t func, void *info, bool wait); +/* + * Call a function on all processors. May be used during early boot while + * early_boot_irqs_disabled is set. + */ +static inline void on_each_cpu(smp_call_func_t func, void *info, int wait) +{ + on_each_cpu_mask(cpu_online_mask, func, info, wait); +} + /* * Call a function on each processor for which the supplied function * cond_func returns a positive value. This may include the local @@ -102,8 +106,17 @@ extern void smp_cpus_done(unsigned int max_cpus); * Call a function on all other processors */ void smp_call_function(smp_call_func_t func, void *info, int wait); -void smp_call_function_many(const struct cpumask *mask, - smp_call_func_t func, void *info, bool wait); + +void __smp_call_function_many(const struct cpumask *mask, + smp_call_func_t remote_func, + smp_call_func_t local_func, + void *info, bool wait); + +static inline void smp_call_function_many(const struct cpumask *mask, + smp_call_func_t func, void *info, bool wait) +{ + __smp_call_function_many(mask, func, NULL, info, wait); +} int smp_call_function_any(const struct cpumask *mask, smp_call_func_t func, void *info, int wait); diff --git a/kernel/smp.c b/kernel/smp.c index 708d2ac1db72..d9189e4d6464 100644 --- a/kernel/smp.c +++ b/kernel/smp.c @@ -388,9 +388,13 @@ int smp_call_function_any(const struct cpumask *mask, EXPORT_SYMBOL_GPL(smp_call_function_any); /** - * smp_call_function_many(): Run a function on a set of other CPUs. + * __smp_call_function_many(): Run a function on a set of CPUs. * @mask: The set of cpus to run on (only runs on online subset). - * @func: The function to run. This must be fast and non-blocking. + * @remote_func: The function to run on remote cores. This must be fast and + * non-blocking. + * @local_func: The function that should be run on this CPU. This must be + * fast and non-blocking. If NULL is provided, no function will + * be executed on this CPU. * @info: An arbitrary pointer to pass to the function. * @wait: If true, wait (atomically) until function has completed * on other CPUs. @@ -401,11 +405,16 @@ EXPORT_SYMBOL_GPL(smp_call_function_any); * hardware interrupt handler or from a bottom half handler. Preemption * must be disabled when calling this function. */ -void smp_call_function_many(const struct cpumask *mask, - smp_call_func_t func, void *info, bool wait) +void __smp_call_function_many(const struct cpumask *mask, + smp_call_func_t remote_func, + smp_call_func_t local_func, + void *info, bool wait) { + int cpu, last_cpu, this_cpu = smp_processor_id(); struct call_function_data *cfd; - int cpu, next_cpu, this_cpu = smp_processor_id(); + bool run_remote = false; + bool run_local = false; + int nr_cpus = 0; /* * Can deadlock when called with interrupts disabled. @@ -413,55 +422,62 @@ void smp_call_function_many(const struct cpumask *mask, * send smp call function interrupt to this cpu and as such deadlocks * can't happen. */ - WARN_ON_ONCE(cpu_online(this_cpu) && irqs_disabled() - && !oops_in_progress && !early_boot_irqs_disabled); + if (cpu_online(this_cpu) && !oops_in_progress && !early_boot_irqs_disabled) + lockdep_assert_irqs_enabled(); + + /* Check if we need local execution. */ + if (local_func && cpumask_test_cpu(this_cpu, mask)) + run_local = true; - /* Try to fastpath. So, what's a CPU they want? Ignoring this one. */ + /* Check if we need remote execution, i.e., any CPU excluding this one. */ cpu = cpumask_first_and(mask, cpu_online_mask); if (cpu == this_cpu) cpu = cpumask_next_and(cpu, mask, cpu_online_mask); + if (cpu < nr_cpu_ids) + run_remote = true; - /* No online cpus? We're done. */ - if (cpu >= nr_cpu_ids) - return; + if (run_remote) { + cfd = this_cpu_ptr(&cfd_data); - /* Do we have another CPU which isn't us? */ - next_cpu = cpumask_next_and(cpu, mask, cpu_online_mask); - if (next_cpu == this_cpu) - next_cpu = cpumask_next_and(next_cpu, mask, cpu_online_mask); - - /* Fastpath: do that cpu by itself. */ - if (next_cpu >= nr_cpu_ids) { - smp_call_function_single(cpu, func, info, wait); - return; - } + cpumask_and(cfd->cpumask, mask, cpu_online_mask); + __cpumask_clear_cpu(this_cpu, cfd->cpumask); - cfd = this_cpu_ptr(&cfd_data); - - cpumask_and(cfd->cpumask, mask, cpu_online_mask); - __cpumask_clear_cpu(this_cpu, cfd->cpumask); + cpumask_clear(cfd->cpumask_ipi); + for_each_cpu(cpu, cfd->cpumask) { + call_single_data_t *csd = per_cpu_ptr(cfd->csd, cpu); + + nr_cpus++; + last_cpu = cpu; + + csd_lock(csd); + if (wait) + csd->flags |= CSD_FLAG_SYNCHRONOUS; + csd->func = remote_func; + csd->info = info; + if (llist_add(&csd->llist, &per_cpu(call_single_queue, cpu))) + __cpumask_set_cpu(cpu, cfd->cpumask_ipi); + } - /* Some callers race with other cpus changing the passed mask */ - if (unlikely(!cpumask_weight(cfd->cpumask))) - return; + /* + * Choose the most efficient way to send an IPI. Note that the + * number of CPUs might be zero due to concurrent changes to the + * provided mask. + */ + if (nr_cpus == 1) + arch_send_call_function_single_ipi(last_cpu); + else if (likely(nr_cpus > 1)) + arch_send_call_function_ipi_mask(cfd->cpumask_ipi); + } - cpumask_clear(cfd->cpumask_ipi); - for_each_cpu(cpu, cfd->cpumask) { - call_single_data_t *csd = per_cpu_ptr(cfd->csd, cpu); + if (run_local) { + unsigned long flags; - csd_lock(csd); - if (wait) - csd->flags |= CSD_FLAG_SYNCHRONOUS; - csd->func = func; - csd->info = info; - if (llist_add(&csd->llist, &per_cpu(call_single_queue, cpu))) - __cpumask_set_cpu(cpu, cfd->cpumask_ipi); + local_irq_save(flags); + local_func(info); + local_irq_restore(flags); } - /* Send a message to all CPUs in the map */ - arch_send_call_function_ipi_mask(cfd->cpumask_ipi); - - if (wait) { + if (run_remote && wait) { for_each_cpu(cpu, cfd->cpumask) { call_single_data_t *csd; @@ -470,7 +486,7 @@ void smp_call_function_many(const struct cpumask *mask, } } } -EXPORT_SYMBOL(smp_call_function_many); +EXPORT_SYMBOL(__smp_call_function_many); /** * smp_call_function(): Run a function on all other CPUs. @@ -587,24 +603,6 @@ void __init smp_init(void) smp_cpus_done(setup_max_cpus); } -/* - * Call a function on all processors. May be used during early boot while - * early_boot_irqs_disabled is set. Use local_irq_save/restore() instead - * of local_irq_disable/enable(). - */ -void on_each_cpu(void (*func) (void *info), void *info, int wait) -{ - unsigned long flags; - - preempt_disable(); - smp_call_function(func, info, wait); - local_irq_save(flags); - func(info); - local_irq_restore(flags); - preempt_enable(); -} -EXPORT_SYMBOL(on_each_cpu); - /** * on_each_cpu_mask(): Run a function on processors specified by * cpumask, which may include the local processor. @@ -624,16 +622,11 @@ EXPORT_SYMBOL(on_each_cpu); void on_each_cpu_mask(const struct cpumask *mask, smp_call_func_t func, void *info, bool wait) { - int cpu = get_cpu(); + preempt_disable(); - smp_call_function_many(mask, func, info, wait); - if (cpumask_test_cpu(cpu, mask)) { - unsigned long flags; - local_irq_save(flags); - func(info); - local_irq_restore(flags); - } - put_cpu(); + __smp_call_function_many(mask, func, func, info, wait); + + preempt_enable(); } EXPORT_SYMBOL(on_each_cpu_mask); -- 2.20.1