Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932393Ab1D1Img (ORCPT ); Thu, 28 Apr 2011 04:42:36 -0400 Received: from gw3.lbox.cz ([62.245.111.133]:39695 "EHLO pcnci.linuxbox.cz" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S932124Ab1D1Ima (ORCPT ); Thu, 28 Apr 2011 04:42:30 -0400 X-Greylist: delayed 720 seconds by postgrey-1.27 at vger.kernel.org; Thu, 28 Apr 2011 04:42:30 EDT Date: Thu, 28 Apr 2011 10:26:25 +0200 From: Nikola Ciprich To: linux-kernel mlist Cc: linux-stable mlist Subject: 2.6.32.21 - uptime related crashes? Message-ID: <20110428082625.GA23293@pcnci.linuxbox.cz> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="YZ5djTAD1cGYuMQK" Content-Disposition: inline User-Agent: Mutt/1.5.19 (2009-01-05) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4485 Lines: 114 --YZ5djTAD1cGYuMQK Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Hello everybody, I'm trying to solve strange issue, today, my fourth machine running 2.6.32.= 21 just crashed. What makes the cases similar, apart fromn same kernel vers= ion is that all boxes had very similar uptimes: 214, 216, 216, and 224 days= =2E This might just be a coincidence, but I think this might be important. Unfortunately I only have backtraces of two crashes (and those are trimmed,= sorry), and they do not look as similar as I'd like, but still maybe there= is something in common: [] pollwake+0x57/0x60=20 [] ? default_wake_function+0x0/0x10=20 [] __wake_up_common+0x5a/0x90=20 [] __wake_up+0x43/0x70=20 [] process_masterspan+0x643/0x670 [dahdi]=20 [] coretimer_func+0x135/0x1d0 [dahdi]=20 [] run_timer_softirq+0x15d/0x320=20 [] ? coretimer_func+0x0/0x1d0 [dahdi]=20 [] __do_softirq+0xcc/0x220=20 [] call_softirq+0x1c/0x30=20 [] do_softirq+0x4a/0x80=20 [] irq_exit+0x87/0x90=20 [] do_IRQ+0x77/0xf0=20 [] ret_from_intr+0x0/Oxa=20 [] ? acpi_idle_enter_bm+0x273/0x2a1 [processor]=20 [] ? acpi_idle_enter_bm+0x269/0x2a1 [processor]=20 [] ? cpuidle_idle_call+0xa5/0x150=20 [] ? cpu_idle+0x4f/0x90=20 [] ? rest_init+0x75/0x80=20 [] ? start_kernel+0x2ef/0x390=20 [] ? x86_64_start_reservations+0x81/0xc0=20 [] ? x86_64_start_kernel+0xd6/0x100=20 this box (actually two of the crashed ones) is using dahdi_dummy module to = generate timing for asterisk SW pbx, so maybe it's related to it. [] handle_IRQ_event+0x63/0x1c0 [] handle_edge_irq+0xce/0x160 [] handle_irq+0x1f/0x30 = = =20 [] do_IRQ+0x6e/0xf0 [] ret_from_intr+0x0/Oxa [] ? _spin_un1ock_irq+0xf/0x40 [] ? _spin_un1ock_irq+0x9/0x40 [] ? exit_signals+0x8a/0x130 [] ? do_exit+0x7e/0x7d0 [] ? oops_end+0xa7/0xb0 [] ? die+0x56/0x90 [] ? do_trap+0x130/0x150 [] ? do_divide_error+0x8a/0xa0 [] ? find_busiest_group+0x3d7/0xa00 [] ? cpuacct_charge+0x6b/0x90 [] ? divide_error+0x15/0x20 [] ? find_busiest_group+0x3d7/0xa00 [] ? find_busiest_group+0x1af/0xa00 [] ? thread_return+0x4ce/0x7bb [] ? do_nanosleep+0x75/0x30 [] ? hrtimer_nanosleep+0x9e/0x120 [] ? hrtimer_wakeup+0x0/0x30 [] ? sys_nanosleep+0x6f/0x80 another two don't use it. only similarity I see here is that it seems to be= IRQ handling related, but both issues don't have anything in common. Does anybody have an idea on where should I look? Of course I should update= all those boxes to (at least) latest 2.6.32.x, and I'll do it for sure, bu= t still I'd first like to know where the problem was, and if it has been fi= xed, or how to fix it... I'd be gratefull for any help... BR nik --=20 ------------------------------------- Ing. Nikola CIPRICH LinuxBox.cz, s.r.o. 28. rijna 168, 709 01 Ostrava tel.: +420 596 603 142 fax: +420 596 621 273 mobil: +420 777 093 799 www.linuxbox.cz mobil servis: +420 737 238 656 email servis: servis@linuxbox.cz ------------------------------------- --YZ5djTAD1cGYuMQK Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) iEYEARECAAYFAk25JLEACgkQ3xdJJrLygV7gFwCcD6H7MpQF5ZhSQqFhlbGwMY/I sG0AoIwXKtOo5GgX8l8oivkQhts9jUXT =o8H+ -----END PGP SIGNATURE----- --YZ5djTAD1cGYuMQK-- -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/