Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752388AbcJUCeZ (ORCPT ); Thu, 20 Oct 2016 22:34:25 -0400 Received: from mx2.suse.de ([195.135.220.15]:50166 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750997AbcJUCeY (ORCPT ); Thu, 20 Oct 2016 22:34:24 -0400 Date: Thu, 20 Oct 2016 19:34:14 -0700 From: Davidlohr Bueso To: Sebastian Andrzej Siewior Cc: Arnaldo Carvalho de Melo , Jiri Olsa , Jiri Olsa , Peter Zijlstra , Ingo Molnar , linux-kernel@vger.kernel.org, Davidlohr Bueso Subject: Re: [PATCH 2/2 v2] perf bench futex: add NUMA support Message-ID: <20161021023414.GA8512@linux-80c1.suse> References: <20161016190803.3392-1-bigeasy@linutronix.de> <20161016190803.3392-2-bigeasy@linutronix.de> <20161017143821.GO12815@kernel.org> <20161017150123.GA18595@krava> <20161017150442.GQ12815@kernel.org> <20161017153331.pkzaij5a3ma5c5s5@linutronix.de> <20161019181616.GC30561@linux-80c1.suse> <20161019183717.hsdvutfsizxk3gb6@linutronix.de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Disposition: inline In-Reply-To: <20161019183717.hsdvutfsizxk3gb6@linutronix.de> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1377 Lines: 37 On Wed, 19 Oct 2016, Sebastian Andrzej Siewior wrote: >On 2016-10-19 11:16:16 [-0700], Davidlohr Bueso wrote: >> On Mon, 17 Oct 2016, Sebastian Andrzej Siewior wrote: >> >> > By default the application uses malloc() and all available CPUs. This >> > patch introduces NUMA support which means: >> > - memory is allocated node local via numa_alloc_local() >> > - all CPUs of the specified NUMA node are used. This is also true if the >> > number of threads set is greater than the number of CPUs available on >> > this node. >> >> Can't we just use numactl to bind cpus and memory to be node-local? > >something like > numactl --cpunodebind=$NODE --membind=$NODE perf ??? >? Yes. >This should work for memory however since we use > pthread_attr_setaffinity_np(&thread_attr, sizeof(cpu_set_t), &cpu); >we would need to query the affinity mask, and deploy threads based on >that mask. Ah right. I also considered getting rid of the affinity, but that would probably hurt more than help (or at least alter) for non-numa options. >Using NUMA support within this bench-tool has also the side effect that >the output gives all the option used. So if we are going to support the numa option for the benchmark, could you please move the new code into futex.h instead of futex-hash.c? That way we can integrate the support for the other futex programs as well. Thanks, Davidlohr