Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933414AbXHHHV2 (ORCPT ); Wed, 8 Aug 2007 03:21:28 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S932965AbXHHHUq (ORCPT ); Wed, 8 Aug 2007 03:20:46 -0400 Received: from mx2.go2.pl ([193.17.41.42]:56648 "EHLO poczta.o2.pl" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S932361AbXHHHUn (ORCPT ); Wed, 8 Aug 2007 03:20:43 -0400 Date: Wed, 8 Aug 2007 09:21:14 +0200 From: Jarek Poplawski To: Jean-Baptiste Vignaud Cc: cebbert , mingo , "marcin\.slusarz" , tglx , torvalds , linux-kernel , shemminger , linux-net , netdev , akpm , alan Subject: Re: 2.6.20->2.6.21 - networking dies after random time Message-ID: <20070808072114.GA1731@ff.dom.local> References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.2i Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3308 Lines: 71 On Tue, Aug 07, 2007 at 07:16:33PM +0200, Jean-Baptiste Vignaud wrote: ... > So this afternoon i compiled 2.6.23-rc2 with same options as 2.6.23-rc1 > and edited grub.conf to add nosmp but after reboot the box did not > responded. Back home, i saw that the kernel failed because it was unable > to find the partitions (mdadm failed, then LVM). After a few tests, > removing nosmp let the kernel boot correctly. It seems that even the > fedora provided kernels have the same behavior > (well at least 2.6.22.1-41.fc7). Sorry: it seems there is some implementation error or some modules don't check CONFIG_SMP enough... Of course testing this with smp should be precious too. Only, after finding some problems, you should consider smp is quite a new and complicated technology, at least regarding such old designs as 3c905. BTW: I didn't notice this yesterday, but your forcedeth uses new type of irq handling (MSI), so it should explain why it's not affected. Jean-Baptiste: I'm not sure how much of this testing you can afford? If you can spare some time for this and your box isn't for 'production' it could be very precious to diagnose such reproducible bug. Then, I'd have a few suggestions (you could choose any of them) like: - trying these last test patches prepared for Marcin, too (but only with kernels 2.6.21 - 2.6.23-rc1), - trying to find the last kernel version, which works for you: Marcin has done this with successfully using the most professional way: git bisect (which btw. I did learn yet), but, IMHO, it could be very usable to try a "poor man's" bisect too older kernels like this: 2.6.18, so to try again this version of previos Fedora, but preferably in "vanilla" version (there could be some problems if something in your configs or hardware has changed); then if OK: 2.6.20; if OK 2.6.21-rc1 or -rc2 (there are usually heavy changes in the beginning of a cycle); ithen try to jump forward or backward around the middle of the range eg. -rc4. You should use each time the same, current config and remember to 'make oldconfig' before make. In my opinion it would be very precious even after some long time, so there is no need to hurry and do this now. The most important: if nothing has changed with your hardware in the meantime, you should find 'the culprit' for sure. But, if there are any problems about such testing, don't bother! It could be really a lot of hard and maybe boring work. If you would like to read something more about testing (then of course my suggestions could occur invalid - I'm a very bad tester myself...) you can try this: http://www.stardust.webpages.pl/files/handbook/ If you would need some additional advice you can mail me privately too (but my response could take a few days). Of course, if your find something interresting I'd be glad to know about this, but let's be honest - I'm not any authority about these drivers, so cc-ing a maintainer should always be more usable. Thanks, Jarek P. PS: it would be nice if you could fix your mail program on line breaking (or try to do this manually). - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/