Date: Mon, 30 Mar 2009 21:52:55 +0300 (EEST)
From: Ilpo Järvinen
To: Markus Trippelsdorf
cc: Netdev, LKML, corbet@lwn.net
Subject: Re: WARNING: at net/ipv4/tcp_input.c:2927 tcp_ack+0xd55/0x1991()
In-Reply-To: <20090330164035.GA2652@gentoox2.trippelsdorf.de>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, 30 Mar 2009, Markus Trippelsdorf wrote:

> On Mon, Mar 30, 2009 at 07:01:22PM +0300, Ilpo Järvinen wrote:
> > On Sat, 28 Mar 2009, Markus Trippelsdorf wrote:
> > > On Sat, Mar 28, 2009 at 10:29:58AM +0200, Ilpo Järvinen wrote:
> > > > On Sat, 28 Mar 2009, Markus Trippelsdorf wrote:
> > > > > On Sat, Mar 28, 2009 at 01:05:09AM +0200, Ilpo Järvinen wrote:
> > > > > > On Fri, 27 Mar 2009, Markus Trippelsdorf wrote:
> > > > > >
> > > > > > > I'm running the latest git kernel (2.6.29-03321-gbe0ea69) and I've got
> > > > > > > this warning twice in the last few hours:
> > > > > > >
> > > > > > > The machine hangs afterwards.
> > > > > >
> > > > > > Is it really related to the warning for sure? I find it hard to
> > > > > > believe...
> > > > >
> > > > > The machine is normally running stable for days. Switching back to 2.6.29
> > > > > solves the problem...
> > > >
> > > > Sure, but does it hang right after printing that warning or much later on,
> > > > e.g., one minute is already a very long time for the crash to be related
> > > > to that warning... Even 5 seconds is a long time but I'd immediately say
> > > > it's not related then :-).
> > >
> > > I really can't tell you. In both occurrences of the warning the machine
> > > was already unusable when I noticed. I then rebooted and the last entry
> > > in the logs was that warning.
> >
> > ...And, let me guess, you're in X and therefore unable to catch a final
> > oops if any would be printed? It would be nice to get around that as well,
> > either use serial/netconsole or hang in text mode while waiting for the
> > crash (shouldn't be too hard if you are able to set up the workload first
> > and then switch away from X and if reproducing takes about an hour)...
>
> OK, I will try this later.

Let's hope that gives some clue where it ends up going boom (if it is caused
by TCP we certainly should see something more sensible in console than just
a hang)...

...I once again read through the tcp commits but just cannot find anything
that could cause a fackets_out miscount, not to speak of crash-prone changes,
so we'll just have to wait and see...

> > Arguably the presence of that warning at both times is somewhat alarming.
>
> Make that four times:
>
> kernel # for i in log*; do cat $i | grep "tcp_input"; done
> Mar 27 19:57:40 [kernel] WARNING: at net/ipv4/tcp_input.c:2927 tcp_ack+0xd55/0x1991()
> Mar 27 21:37:00 [kernel] WARNING: at net/ipv4/tcp_input.c:2927 tcp_ack+0xd55/0x1991()
> Mar 28 18:41:03 [kernel] WARNING: at net/ipv4/tcp_input.c:2927 tcp_ack+0xd55/0x1994()
> Mar 29 11:32:26 [kernel] WARNING: at net/ipv4/tcp_input.c:2927 tcp_ack+0xd5b/0x19a9()
>
> The symptoms are always the same...

And the first logged entries date to something much earlier?

> Another observation is that all four WARNINGs involve only one of my
> two network adapters, namely skge (built into my mobo). The other one
> (r8169) never occurs in the call trace.

Nothing interesting seemed to happen for skge in v2.6.29.. timeframe... :-(

> I will also try to reproduce the WARNING with Jon's patch applied.

Thanks.

--
 i.