Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S272791AbTG3HNo (ORCPT ); Wed, 30 Jul 2003 03:13:44 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S272792AbTG3HNo (ORCPT ); Wed, 30 Jul 2003 03:13:44 -0400 Received: from twilight.cs.hut.fi ([130.233.40.5]:61175 "EHLO twilight.cs.hut.fi") by vger.kernel.org with ESMTP id S272791AbTG3HNm (ORCPT ); Wed, 30 Jul 2003 03:13:42 -0400 Date: Wed, 30 Jul 2003 10:13:21 +0300 From: Ville Herva To: linux-kernel@vger.kernel.org, gibbs@scsiguy.com Subject: 2.4.22pre8 hangs too (Re: 2.4.21-jam1, aic7xxx-6.2.36: solid hangs) Message-ID: <20030730071321.GV150921@niksula.cs.hut.fi> Mail-Followup-To: Ville Herva , linux-kernel@vger.kernel.org, gibbs@scsiguy.com References: <20030729073948.GD204266@niksula.cs.hut.fi> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20030729073948.GD204266@niksula.cs.hut.fi> User-Agent: Mutt/1.4i Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1623 Lines: 40 On Tue, Jul 29, 2003 at 10:39:48AM +0300, you [Ville Herva] wrote: > After about a year of stable operation, a server begun acting up. First it > begun hanging up solid during the nightly oracle backup (that had run > successfully for a year), the I got some aic7xxx-related crashes on boot. > > Initially, the box ran 2.4.20pre7 kernel with aic7xxx version 6.4.8. When > the hangs started happening, I upgraded to 2.4.21-jam1 (basically 2.4.21 > vanilla + -aa patch + some minor stuff) that includes aic7xxx version 6.2.36. > It did not help. > > I enabled kmsgdump and nmi watchdog, but when the box hangs, it hangs solid: > no ctrl-alt-del, no caps lock led, no alt-sysrq-b, no kmsgdump, nmi watchdog > doesn't trigger. Only the cursor on the console blinks, but no messages from > the kernel appear. (Apart from "spurious 8259A interrupt: IRQ7." that > always happens sometime after boot on this box, but way before the hang.) Herbert P?tzl indicted that he'd had similar lockups with fairly similar hw up until 2.4.22pre6. He suggested I should try 2.4.22pre8. 2.4.22pre8 locked up the same way in about 10 hours. > Any ideas on how to to debug this kind of hang? The question still stands; how do I debug this? > Does it sound kernel/driver or hw related? Are the two crashes related to > the hang? Is the hang related to aic7xxx? Any ideas? -- v -- v@iki.fi - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/