Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1760345AbXHGJF1 (ORCPT ); Tue, 7 Aug 2007 05:05:27 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755459AbXHGJFI (ORCPT ); Tue, 7 Aug 2007 05:05:08 -0400 Received: from mx10.go2.pl ([193.17.41.74]:58220 "EHLO poczta.o2.pl" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1755239AbXHGJFF (ORCPT ); Tue, 7 Aug 2007 05:05:05 -0400 Date: Tue, 7 Aug 2007 11:05:33 +0200 From: Jarek Poplawski To: Jean-Baptiste Vignaud Cc: cebbert , mingo , "marcin\.slusarz" , tglx , torvalds , linux-kernel , shemminger , linux-net , netdev , akpm , alan Subject: Re: 2.6.20->2.6.21 - networking dies after random time Message-ID: <20070807090533.GC2120@ff.dom.local> References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.2i Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2659 Lines: 74 On Tue, Aug 07, 2007 at 10:10:34AM +0200, Jean-Baptiste Vignaud wrote: > > > BTW: Jean-Babtiste, could you send or point to you current configs? Oops! I'm very sorry for misspelling! > > I mean at least proc/interrupts, but with dmesg and .config it would > > be even better. (I assume this last report was about the revert patch > > mentioned by Chuck, not the one below your message?) > > Sure. > > Last reports are with the 2.6.22.1-41.fc7 kernel, which has in changelog : > > * Sat Jul 28 2007 Chuck Ebbert > - revert upstream "genirq: do not mask interrupts by default" > > > * interrupts (i use irqbalance, but problem was the same without) I wonder if you tried without SMP too? > > [root@loki ~]# cat /proc/interrupts > CPU0 CPU1 ... > 16: 72625 96 IO-APIC-fasteoi eth1 > 17: 4667 128 IO-APIC-fasteoi eth2 > 20: 4156 39870 IO-APIC-fasteoi sata_nv > 21: 34794 9177 IO-APIC-fasteoi sata_nv > 22: 0 0 IO-APIC-fasteoi ehci_hcd:usb2 > 23: 6005 1565 IO-APIC-fasteoi ohci_hcd:usb1, sata_nv > 2297: 3 492180 PCI-MSI-edge eth0 > NMI: 0 0 > LOC: 4915345 4915282 > ERR: 0 So, here it's not about irq sharing... > > problems are with eth1 and eth2 here. never had any problems with the onboard (eth0). ... > > * .config > > i dont have it, it was the standard fedora one. > > i'm not sure that the problem is related to 3com, because i replaced those cards by old card i had in spare : > > 01:06.0 Ethernet controller: VIA Technologies, Inc. VT6102 [Rhine-II] (rev 42) > 01:07.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8029(AS) > > and i had the exact same problem. > > Those 3com cards were working 24/24 before i went to fedora 7 (and kernel 2.6.21 then). It seems from 2.6.21 the problems are mainly about 'older' network chips on x86_64. This reverted patch should mean only for those using disable_irq, but I see forcedeth could use this too so it's not clear yet, and btw. there where other changes around irqs and pci, so everybody could have something a bit different with similar time outs logs... BTW, Jean-Baptiste and Chuck - it seems, unless you have too much time, there is no use for testing my "genirq: fix simple and fasteoi irq handlers" patch. Thanks, Jarek P. - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/