Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751086AbWAJOvs (ORCPT ); Tue, 10 Jan 2006 09:51:48 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751090AbWAJOvs (ORCPT ); Tue, 10 Jan 2006 09:51:48 -0500 Received: from hellhawk.shadowen.org ([80.68.90.175]:45066 "EHLO hellhawk.shadowen.org") by vger.kernel.org with ESMTP id S1751086AbWAJOvr (ORCPT ); Tue, 10 Jan 2006 09:51:47 -0500 Message-ID: <43C3CA28.5020502@shadowen.org> Date: Tue, 10 Jan 2006 14:52:24 +0000 From: Andy Whitcroft User-Agent: Debian Thunderbird 1.0.7 (X11/20051017) X-Accept-Language: en-us, en MIME-Version: 1.0 To: Martin Bligh , Andrew Morton CC: linux-kernel@vger.kernel.org, greg@kroah.com Subject: Re: Problems with 2.6.15-mm1 and mm2. References: <43C2A48F.6030407@google.com> <20060109154127.6a7e6972.akpm@osdl.org> <43C2FA2E.2040704@google.com> <20060109164653.23b2676e.akpm@osdl.org> <43C304E7.7000508@google.com> In-Reply-To: <43C304E7.7000508@google.com> Content-Type: multipart/mixed; boundary="------------020102070002080100020203" Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 8271 Lines: 202 This is a multi-part message in MIME format. --------------020102070002080100020203 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit > It fixes the symptom, yes. Greg complained it's papering over the > problem and not a real fix, which is fair enough, but ... > > yes, it did seem to be a newly introduced problem in -mm1 or -mm2. To my > mind, if we don't have a proper fix, we ought to drop the patch that > caused the problem in the first place? Andy .. can you finger which > patch that was, I forget ... The fix I proposed (also attached to this email) fixes a bug introduced in the pci domain patches in greg's tree, specifically this patch: gregkh-pci-x86-pci-domain-support-struct-pci_sysdata.patch I believe that my fix is complete in the context of those changes, in that it returns the original semantics. As greg points out this exposes a couple of places where we do have proper domain and node information where it is not supplied. I have a patch or two waiting to be tested which tries to fill in some of the gaps exposed, but the boxes I was testing on have dropped off the network and am waiting for that to be fixed before sending them in. That said I think that if the pci domain patches are accepted then this minimal fix should be applied with them. -apw --------------020102070002080100020203 Content-Type: text/plain; name="pci-device-ensure-sysdata-initialised" Content-Transfer-Encoding: 7bit Content-Disposition: inline; filename="pci-device-ensure-sysdata-initialised" pci device ensure sysdata initialised [Ok, here is a patch to ensure sysdata is valid for all busses.] We have been seeing panic's on NUMA systems in pci_call_probe() in 2.6.15-rc5-mm2 and -mm3. It seems that some changes have occured to the meaning of the 'sysdata' for a device such that it is no longer just an integer containing the node, it is now a structure containing the node and other data. However, it seems that we do not always initialise this sysdata before we probe the device. Below are three examples from a boot with this checked for. The attached patch ensures that we supply a valid sysdata for system busses. Currently we take no account of the node for this bus for no ACPI configured systems. This is unchanged from the -mm1 code. Intel(R) PRO/1000 Network Driver - version 6.1.16-k2 Copyright (c) 1999-2005 Intel Corporation. pci_call_probe: starting drv dev id pci_call_probe: dev->bus pci_call_probe: dev->bus->sysdata<00000000> pci_call_probe: node<-1> e1000: eth0: e1000_probe: Intel(R) PRO/1000 Network Connection pci_call_probe: starting drv dev id pci_call_probe: dev->bus pci_call_probe: dev->bus->sysdata<00000000> pci_call_probe: node<-1> Linux Tulip driver version 1.1.13 (December 15, 2004) input: AT Translated Set 2 keyboard as /class/input/input0 tulip0: EEPROM default media type Autosense. tulip0: Index #0 - Media 10baseT (#0) described by a 21140 non-MII (0) block. tulip0: Index #1 - Media 100baseTx (#3) described by a 21140 non-MII (0) block. tulip0: Index #2 - Media 10baseT-FDX (#4) described by a 21140 non-MII (0) block. tulip0: Index #3 - Media 100baseTx-FDX (#5) described by a 21140 non-MII (0) block. eth1: Digital DS21140 Tulip rev 33 at 0001fc00, 00:00:BC:0F:08:96, IRQ 28. pci_call_probe: starting drv dev id pci_call_probe: dev->bus pci_call_probe: dev->bus->sysdata pci_call_probe: node<0> qla1280: QLA1040 found on PCI bus 0, dev 11 Signed-off-by: Andy Whitcroft --- arch/i386/pci/common.c | 2 ++ arch/i386/pci/fixup.c | 8 +++++--- arch/i386/pci/legacy.c | 3 ++- arch/i386/pci/numa.c | 8 +++++--- arch/i386/pci/visws.c | 4 ++-- include/asm-i386/pci.h | 1 + 6 files changed, 17 insertions(+), 9 deletions(-) diff -upN reference/arch/i386/pci/common.c current/arch/i386/pci/common.c --- reference/arch/i386/pci/common.c +++ current/arch/i386/pci/common.c @@ -29,6 +29,8 @@ unsigned long pirq_table_addr; struct pci_bus *pci_root_bus; struct pci_raw_ops *raw_pci_ops; +struct pci_sysdata pci_default_sysdata = { .node = -1 }; + static int pci_read(struct pci_bus *bus, unsigned int devfn, int where, int size, u32 *value) { return raw_pci_ops->read(pci_domain_nr(bus), bus->number, diff -upN reference/arch/i386/pci/fixup.c current/arch/i386/pci/fixup.c --- reference/arch/i386/pci/fixup.c +++ current/arch/i386/pci/fixup.c @@ -25,9 +25,11 @@ static void __devinit pci_fixup_i450nx(s pci_read_config_byte(d, reg++, &subb); DBG("i450NX PXB %d: %02x/%02x/%02x\n", pxb, busno, suba, subb); if (busno) - pci_scan_bus(busno, &pci_root_ops, NULL); /* Bus A */ + pci_scan_bus(busno, &pci_root_ops, + &pci_default_sysdata); /* Bus A */ if (suba < subb) - pci_scan_bus(suba+1, &pci_root_ops, NULL); /* Bus B */ + pci_scan_bus(suba+1, &pci_root_ops, + &pci_default_sysdata); /* Bus B */ } pcibios_last_bus = -1; } @@ -42,7 +44,7 @@ static void __devinit pci_fixup_i450gx(s u8 busno; pci_read_config_byte(d, 0x4a, &busno); printk(KERN_INFO "PCI: i440KX/GX host bridge %s: secondary bus %02x\n", pci_name(d), busno); - pci_scan_bus(busno, &pci_root_ops, NULL); + pci_scan_bus(busno, &pci_root_ops, &pci_default_sysdata); pcibios_last_bus = -1; } DECLARE_PCI_FIXUP_HEADER(PCI_VENDOR_ID_INTEL, PCI_DEVICE_ID_INTEL_82454GX, pci_fixup_i450gx); diff -upN reference/arch/i386/pci/legacy.c current/arch/i386/pci/legacy.c --- reference/arch/i386/pci/legacy.c +++ current/arch/i386/pci/legacy.c @@ -26,7 +26,8 @@ static void __devinit pcibios_fixup_peer l != 0x0000 && l != 0xffff) { DBG("Found device at %02x:%02x [%04x]\n", n, devfn, l); printk(KERN_INFO "PCI: Discovered peer bus %02x\n", n); - pci_scan_bus(n, &pci_root_ops, NULL); + pci_scan_bus(n, &pci_root_ops, + &pci_default_sysdata); break; } } diff -upN reference/arch/i386/pci/numa.c current/arch/i386/pci/numa.c --- reference/arch/i386/pci/numa.c +++ current/arch/i386/pci/numa.c @@ -97,9 +97,11 @@ static void __devinit pci_fixup_i450nx(s pci_read_config_byte(d, reg++, &subb); DBG("i450NX PXB %d: %02x/%02x/%02x\n", pxb, busno, suba, subb); if (busno) - pci_scan_bus(QUADLOCAL2BUS(quad,busno), &pci_root_ops, NULL); /* Bus A */ + pci_scan_bus(QUADLOCAL2BUS(quad,busno), &pci_root_ops, + &pci_default_sysdata); /* Bus A */ if (suba < subb) - pci_scan_bus(QUADLOCAL2BUS(quad,suba+1), &pci_root_ops, NULL); /* Bus B */ + pci_scan_bus(QUADLOCAL2BUS(quad,suba+1), &pci_root_ops, + &pci_default_sysdata); /* Bus B */ } pcibios_last_bus = -1; } @@ -124,7 +126,7 @@ static int __init pci_numa_init(void) printk("Scanning PCI bus %d for quad %d\n", QUADLOCAL2BUS(quad,0), quad); pci_scan_bus(QUADLOCAL2BUS(quad,0), - &pci_root_ops, NULL); + &pci_root_ops, &pci_default_sysdata); } return 0; } diff -upN reference/arch/i386/pci/visws.c current/arch/i386/pci/visws.c --- reference/arch/i386/pci/visws.c +++ current/arch/i386/pci/visws.c @@ -102,8 +102,8 @@ static int __init pcibios_init(void) "bridge B (PIIX4) bus: %u\n", pci_bus1, pci_bus0); raw_pci_ops = &pci_direct_conf1; - pci_scan_bus(pci_bus0, &pci_root_ops, NULL); - pci_scan_bus(pci_bus1, &pci_root_ops, NULL); + pci_scan_bus(pci_bus0, &pci_root_ops, &pci_default_sysdata); + pci_scan_bus(pci_bus1, &pci_root_ops, &pci_default_sysdata); pci_fixup_irqs(visws_swizzle, visws_map_irq); pcibios_resource_survey(); return 0; diff -upN reference/include/asm-i386/pci.h current/include/asm-i386/pci.h --- reference/include/asm-i386/pci.h +++ current/include/asm-i386/pci.h @@ -9,6 +9,7 @@ struct pci_sysdata { int domain; /* PCI domain */ int node; /* NUMA node */ }; +extern struct pci_sysdata pci_default_sysdata; #ifdef CONFIG_PCI_DOMAINS static inline int pci_domain_nr(struct pci_bus *bus) --------------020102070002080100020203-- - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/