Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753025Ab0ATGwO (ORCPT ); Wed, 20 Jan 2010 01:52:14 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752008Ab0ATGwM (ORCPT ); Wed, 20 Jan 2010 01:52:12 -0500 Received: from www.tglx.de ([62.245.132.106]:56075 "EHLO www.tglx.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751822Ab0ATGwL (ORCPT ); Wed, 20 Jan 2010 01:52:11 -0500 Date: Wed, 20 Jan 2010 07:51:41 +0100 (CET) From: Thomas Gleixner To: Clark Williams cc: RT , LKML , Carsten Emde , John Kacur Subject: Re: RFC: NUMA modifications to cyclictest In-Reply-To: <20100119171439.24726f2e@torg> Message-ID: References: <20100119171439.24726f2e@torg> User-Agent: Alpine 2.00 (LFD 1167 2008-08-23) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1533 Lines: 33 On Tue, 19 Jan 2010, Clark Williams wrote: > RT-ers, > > Lately we've been struggling with some performance issues on high-core > count (>16 cores) NUMA machines with the RT kernel. During the course > of troubleshooting this issue, we tried using the 'numactl' program to > constrain our measurement testing tool (rteval) to a particular memory > node, rather than letting everything float. Doing so showed marked > improvement in both max latency and jitter. While this doesn't solve > our performance problems I thought it might make sense to have a --numa > mode for cylictest that compliments the --smp mode just added. > > The big difference here is that when using --numa, each measurement > thread (one per cpu) has it's stack allocated from the memory node > associated with it's cpu. Also, the major data structures for each > thread (parameter block, statistics block and histogram) are allocated > from the appropriate node. This is done with calls into libnuma, > which means this will add a dependency on libnuma. That might cause some trouble for embedded folks. :( > The intent is to measure latency on a numa system in the same way a > well-written RT application would run on a NUMA machine, that is > minimizing the off-node memory references. Agreed. tglx -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/