Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755887AbZLCRHS (ORCPT ); Thu, 3 Dec 2009 12:07:18 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755258AbZLCRHS (ORCPT ); Thu, 3 Dec 2009 12:07:18 -0500 Received: from mga09.intel.com ([134.134.136.24]:22225 "EHLO mga09.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750790AbZLCRHQ (ORCPT ); Thu, 3 Dec 2009 12:07:16 -0500 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.47,335,1257148800"; d="scan'208";a="575378229" Date: Thu, 3 Dec 2009 09:07:21 -0800 (Pacific Standard Time) From: "Waskiewicz Jr, Peter P" To: Dimitri Sivanich cc: "Waskiewicz Jr, Peter P" , Arjan van de Ven , "Eric W. Biederman" , Thomas Gleixner , Peter Zijlstra , Ingo Molnar , "Siddha, Suresh B" , Yinghai Lu , LKML , Jesse Barnes , David Miller , "H. Peter Anvin" Subject: Re: [PATCH v6] x86/apic: limit irq affinity In-Reply-To: <20091203170149.GA15151@sgi.com> Message-ID: References: <20091122011457.GA16910@sgi.com> <1259069986.4531.1453.camel@laptop> <20091124065022.6933be1a@infradead.org> <20091125074033.4c46c1b0@infradead.org> <20091203165004.GA14665@sgi.com> <20091203170149.GA15151@sgi.com> ReplyTo: "Waskiewicz Jr, Peter P" X-X-Sender: amrppwaskie@imapmail.glb.intel.com MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2508 Lines: 51 On Thu, 3 Dec 2009, Dimitri Sivanich wrote: > On Thu, Dec 03, 2009 at 08:53:23AM -0800, Waskiewicz Jr, Peter P wrote: > > On Thu, 3 Dec 2009, Dimitri Sivanich wrote: > > > > > On Wed, Nov 25, 2009 at 07:40:33AM -0800, Arjan van de Ven wrote: > > > > On Tue, 24 Nov 2009 09:41:18 -0800 > > > > ebiederm@xmission.com (Eric W. Biederman) wrote: > > > > > Oii. > > > > > > > > > > I don't think it is bad to export information to applications like > > > > > irqbalance. > > > > > > > > > > I think it pretty horrible that one of the standard ways I have heard > > > > > to improve performance on 10G nics is to kill irqbalance. > > > > > > > > irqbalance does not move networking irqs; if it does there's something > > > > evil going on in the system. But thanks for the bugreport ;) > > > > > > It does move networking irqs. > > > > > > > > > > > we had that; it didn't work. > > > > what I'm asking for is for the kernel to expose the numa information; > > > > right now that is the piece that is missing. > > > > > > > > > > I'm wondering if we should expose that numa information in the form of a node or the set of allowed cpus, or both? > > > > > > I'm guessing 'both' is the correct answer, so that apps like irqbalance can make a qualitative decision based on the node (affinity to cpus on this node is better), but an absolute decision based on allowed cpus (I cannot change affinity to anything but this set of cpus). > > > > That's exactly what my patch in the thread "irq: Add node_affinity CPU > > masks for smarter irqbalance hints" is doing. I've also done the > > irqbalance changes based on that kernel patch, and Arjan currently has > > that patch. > > So if I understand correctly, you're patch takes care of the qualitative portion of it (we prefer to set affinity to these cpus, which may be on more than one node), but not the restrictive part of it (we cannot change affinity to anything but these cpus)? That is correct. The patch provides an interface to both the kernel (functions) and /proc for userspace to set a CPU mask. That is the preferred mask for the interrupt to be balanced on. Then irqbalance will make decisions on how to balance within that provided mask, if it in fact has been provided. Cheers, -PJ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/