Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755204AbYGUOCE (ORCPT ); Mon, 21 Jul 2008 10:02:04 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752372AbYGUOBu (ORCPT ); Mon, 21 Jul 2008 10:01:50 -0400 Received: from mx2.mail.elte.hu ([157.181.151.9]:44831 "EHLO mx2.mail.elte.hu" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752334AbYGUOBt (ORCPT ); Mon, 21 Jul 2008 10:01:49 -0400 Date: Mon, 21 Jul 2008 16:01:28 +0200 From: Ingo Molnar To: Evgeniy Polyakov Cc: Pekka Enberg , linux-kernel@vger.kernel.org, netdev@vger.kernel.org, Vegard Nossum , "Rafael J. Wysocki" , cl@linux-foundation.org, davem@davemloft.net Subject: Re: [bug, netconsole, SLUB] BUG skbuff_head_cache: Poison overwritten Message-ID: <20080721140128.GA32245@elte.hu> References: <20080717214222.GA29449@elte.hu> <20080718091146.GQ6875@elte.hu> <20080721094110.GA16029@elte.hu> <84144f020807210252k68d5cf65i8c7ae3c11cecc046@mail.gmail.com> <20080721100627.GA5953@2ka.mipt.ru> <20080721105051.GA5830@elte.hu> <20080721112515.GA4777@2ka.mipt.ru> <20080721115555.GA24176@elte.hu> <20080721125747.GA3056@2ka.mipt.ru> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20080721125747.GA3056@2ka.mipt.ru> User-Agent: Mutt/1.5.18 (2008-05-17) X-ELTE-VirusStatus: clean X-ELTE-SpamScore: -1.5 X-ELTE-SpamLevel: X-ELTE-SpamCheck: no X-ELTE-SpamVersion: ELTE 2.0 X-ELTE-SpamCheck-Details: score=-1.5 required=5.9 tests=BAYES_00 autolearn=no SpamAssassin version=3.2.3 -1.5 BAYES_00 BODY: Bayesian spam probability is 0 to 1% [score: 0.0000] Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2239 Lines: 52 * Evgeniy Polyakov wrote: > On Mon, Jul 21, 2008 at 01:55:55PM +0200, Ingo Molnar (mingo@elte.hu) wrote: > > > > I could try run tests with netconsole deactivated, if you think > > > > that's a worthwile line of probing this problem. (although that > > > > would make me do blind tests in essence - having kernel log output > > > > is really essential.) > > > > > > Let's try this way first. If system will continue to crash, we will > > > add some debug options in various pathes. Existing reports do not > > > contain enough information unfortunately, so we will not lose too > > > much. > > > > ok. I've turned off netconsole - 8 successful bootups in a row so far. > > The box is a slow booter/builder with an 8 kernels/hour test throughput, > > so if everything goes fine we should have meaningful results in about 10 > > hours. > > > > ( there are other, faster testboxes in -tip testing with 33 kernels/hour > > build+boot throughput where we'd have to wait only 2 hours - but as > > per Murphy's law they dont trigger this bug ;-) > > Since 2.6.25 there was only single change in netpoll.c: > f5184d267c1aedb9b7a8cc44e08ff6b8d382c3b5 > Which looks innocent. > > Is your driver e1000 or e1000e? Can you check different one? i cannot check e1000 anymore due to this upstream commit: | d03157babed7424f5391af43200593768ce69c9a is first bad commit | commit d03157babed7424f5391af43200593768ce69c9a | Author: Auke Kok | Date: Sun Jun 22 15:21:29 2008 -0700 | | e1000: remove PCI Express device IDs | | We do not want to prolong the situation much longer that e1000 | and e1000e support these devices at the same time. As a result, | take out the bandage that was added for the interim period | and remove all the PCI Express device IDs from e1000. but yes, this box was using e1000 for a long time, and recently migrated to e1000e. I'm not sure there's any connection, do you think there is? Ingo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/