Date: Tue, 19 Jan 2010 17:14:39 -0600
From: Clark Williams
To: RT
Cc: LKML, Carsten Emde, John Kacur, Thomas Gleixner
Subject: RFC: NUMA modifications to cyclictest
Message-ID: <20100119171439.24726f2e@torg>
Organization: Red Hat, Inc
X-Mailing-List: linux-kernel@vger.kernel.org

RT-ers,

Lately we've been struggling with some performance issues on
high-core-count (>16 cores) NUMA machines with the RT kernel. During
the course of troubleshooting this issue, we tried using the 'numactl'
program to constrain our measurement testing tool (rteval) to a
particular memory node, rather than letting everything float. Doing so
showed marked improvement in both max latency and jitter.

While this doesn't solve our performance problems, I thought it might
make sense to have a --numa mode for cyclictest that complements the
--smp mode just added.

The big difference here is that when using --numa, each measurement
thread (one per CPU) has its stack allocated from the memory node
associated with its CPU. Also, the major data structures for each
thread (parameter block, statistics block and histogram) are allocated
from the appropriate node.
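
A rough sketch of what node-local allocation of the per-thread blocks
might look like is given here. It is not the code from the numa branch:
the struct names, the function names and the USE_LIBNUMA macro are all
made up for illustration. The libnuma calls assumed are
numa_available(), numa_node_of_cpu(), numa_alloc_onnode() and
numa_free(). Built without -DUSE_LIBNUMA, the helpers fall back to
calloc() and node 0 so the sketch still compiles on a box without
libnuma.

```c
/* Sketch only: node-local allocation of per-thread data.
 * Build with -DUSE_LIBNUMA -lnuma to use libnuma; otherwise the
 * helpers fall back to calloc()/free() and always report node 0.
 * All struct and function names here are hypothetical. */
#include <assert.h>
#include <stdlib.h>
#include <string.h>
#ifdef USE_LIBNUMA
#include <numa.h>
#endif

#define HIST_BUCKETS 64		/* hypothetical histogram size */

struct thread_param { int cpu; int node; };
struct thread_stat  { long min, max, cycles; };

struct thread_blocks {
	struct thread_param *par;	/* parameter block */
	struct thread_stat  *stat;	/* statistics block */
	long                *hist;	/* latency histogram */
};

/* Memory node that owns 'cpu' (always 0 without libnuma). */
int node_of_cpu(int cpu)
{
#ifdef USE_LIBNUMA
	int node = numa_node_of_cpu(cpu);
	return node < 0 ? 0 : node;
#else
	(void)cpu;
	return 0;
#endif
}

/* Zeroed allocation placed on 'node' (plain calloc without libnuma). */
void *alloc_on_node(size_t size, int node)
{
#ifdef USE_LIBNUMA
	void *p = numa_alloc_onnode(size, node);
	if (p)
		memset(p, 0, size);
	return p;
#else
	(void)node;
	return calloc(1, size);
#endif
}

/* Release memory obtained from alloc_on_node(); NULL is ignored. */
void free_on_node(void *p, size_t size)
{
	if (!p)
		return;
#ifdef USE_LIBNUMA
	numa_free(p, size);
#else
	(void)size;
	free(p);
#endif
}

void free_thread_blocks(struct thread_blocks *tb)
{
	free_on_node(tb->par, sizeof(*tb->par));
	free_on_node(tb->stat, sizeof(*tb->stat));
	free_on_node(tb->hist, HIST_BUCKETS * sizeof(*tb->hist));
	memset(tb, 0, sizeof(*tb));
}

/* Allocate all three blocks for the thread pinned to 'cpu' from that
 * CPU's memory node. Returns 0 on success, -1 on failure. */
int setup_thread_blocks(struct thread_blocks *tb, int cpu)
{
	int node;

	assert(tb != NULL && cpu >= 0);
	memset(tb, 0, sizeof(*tb));
#ifdef USE_LIBNUMA
	if (numa_available() == -1)	/* no NUMA support on this system */
		return -1;
#endif
	node = node_of_cpu(cpu);
	tb->par  = alloc_on_node(sizeof(*tb->par), node);
	tb->stat = alloc_on_node(sizeof(*tb->stat), node);
	tb->hist = alloc_on_node(HIST_BUCKETS * sizeof(*tb->hist), node);
	if (!tb->par || !tb->stat || !tb->hist) {
		free_thread_blocks(tb);
		return -1;
	}
	tb->par->cpu = cpu;
	tb->par->node = node;
	return 0;
}
```

A caller would run setup_thread_blocks(&tb, cpu) once per measurement
thread before creating it, and free_thread_blocks(&tb) after joining
it. The thread stack could presumably be handled the same way, by
taking a suitably sized region from alloc_on_node() and handing it to
pthread_attr_setstack(), but that part is left out of the sketch.
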
The node-local allocation is done with calls into libnuma, which means
this will add a dependency on libnuma.

The intent is to measure latency on a NUMA system in the same way a
well-written RT application would run on a NUMA machine, that is,
minimizing the off-node memory references.

If you're interested in looking at this, please pull the numa branch
from my git repo at:

  git://git.kernel.org/pub/scm/linux/kernel/git/clrkwllms/rt-tests.git

and let me know if you find bugs or disagree with the approach.

Thanks,
Clark