Date: Thu, 26 Jun 2008 15:38:57 -0500
From: David Teigland
To: Kamalesh Babulal
Cc: Stephen Rothwell, linux-next@vger.kernel.org, LKML, linux-scsi@vger.kernel.org, Eric.Moore@lsi.com, Andy Whitcroft
Subject: Re: [BUG] linux-next: Tree for March 25 kernel oops, when loading mpt fusion driver - regression
Message-ID: <20080626203857.GB3815@redhat.com>
References: <20080325175224.3ee62b09.sfr@canb.auug.org.au> <47E9F0B0.4060006@linux.vnet.ibm.com>
In-Reply-To: <47E9F0B0.4060006@linux.vnet.ibm.com>

On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
> Hi Stephen,
>
> Kernel bug is hit while booting up the next-20080325 kernel with MPT
> Fusion driver built in. This was reported previously for the
> next-20080320 kernel
> http://marc.info/?l=linux-next&m=120601013920868&w=2

Hi, did you ever get this fixed?
I've been having the same problem,
http://marc.info/?l=linux-scsi&m=121061780821823&w=4
still exists on 2.6.26-rc8 for me,

Loading scsi_transport_spi.ko module
Loading mptscsih.ko module
Loading mptspi.ko module
Fusion MPT SPI Host driver 3.04.06
ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
mptbase: ioc0: Initiating bringup
ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
mptbase: ioc0: PCI-MSI enabled
mptbase: ioc0: Initiating recovery
BUG: unable to handle kernel NULL pointer dereference at 0000000000000948
IP: [] :mptspi:mptspi_dv_renegotiate_work+0x13/0xc3
PGD 7e981067 PUD 7e982067 PMD 0
Oops: 0000 [1] SMP
CPU 1
Modules linked in: mptspi(+) mptscsih scsi_transport_spi mptbase sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
Pid: 16, comm: events/1 Not tainted 2.6.26-rc8 #2
RIP: 0010:[] [] :mptspi:mptspi_dv_renegotiate_work+0x13/0xc3
RSP: 0000:ffff81007f479e50 EFLAGS: 00010286
RAX: ffffffff802429f7 RBX: ffff81007f479e90 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff81007f424138
RBP: ffff81007f479e80 R08: 0000000000000002 R09: 0000000000000000
R10: ffffffff802429f7 R11: ffff81007ffddde0 R12: ffff81007ffbcd90
R13: 0000000000000948 R14: ffffffffa00e5e15 R15: 0000000000000000
FS: 0000000000680850(0000) GS:ffff81007ff5fbe8(0000) knlGS:0000000000000000
CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 0000000000000948 CR3: 000000007e979000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process events/1 (pid: 16, threadinfo ffff81007f478000, task ffff81007f476480)
Stack: ffffffffa00e5e15 0000000000000000 ffff81007f479e90 ffff81007ffbcd90
 ffff81007f424138 ffffffffa00e5e15 ffff81007f479ed0 ffffffff80242a46
 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a
Call Trace:
 [] ? :mptspi:mptspi_dv_renegotiate_work+0x0/0xc3
 [] ? :mptspi:mptspi_dv_renegotiate_work+0x0/0xc3
 [] run_workqueue+0xee/0x1f6
 [] worker_thread+0xdb/0xe8
 [] ? autoremove_wake_function+0x0/0x38
 [] ? worker_thread+0x0/0xe8
 [] kthread+0x49/0x78
 [] child_rip+0xa/0x12
 [] ? kthreadd+0x1a6/0x1cb
 [] ? kthread+0x0/0x78
 [] ? child_rip+0x0/0x12

Code: 8b bc 24 f8 00 00 00 e8 83 f7 ff ff 5a 5b 41 5c 41 5d 41 5e 41 5f c9 c3 55 48 89 e5 41 56 41 55 41 54 53 48 83 ec 10 4c 8b 6f 40 <4d> 8b 75 00 e8 6a 9e 1a e0 66 41 83 bd fa 02 00 00 00 49 8b be
RIP [] :mptspi:mptspi_dv_renegotiate_work+0x13/0xc3
 RSP
CR2: 0000000000000948
---[ end trace 9714d7078ea4157a ]---
mptbase: ioc0: Initiating recovery
mptbase: ioc0: Initiating recovery
mptbase: ioc0: Initiating recovery
mptbase: ioc0: Initiating recovery
mptbase: ioc0: Initiating recovery
scsi0 : ioc0: LSI53C1030 B2, FwRev=01032700h, Ports=1, MaxQ=255, IRQ=8412
target0:0:0: mptspi: ioc0: dma_alloc_coherent for parameters failed
mptscsih: ioc0: attempting task abort! (sc=ffff81007f450d80)
scsi 0:0:0:0: CDB: Inquiry: 12 00 00 00 24 00
mptbase: ioc0: Initiating recovery
scsi 0:0:0:0: mptscsih: ioc0: completing cmds: fw_channel 0, fw_id 0, sc=ffff81007f450d80, mf = ffff81007ea42ce0, idx=d
mptscsih: ioc0: Issue of TaskMgmt failed!
mptscsih: ioc0: task abort: FAILED (sc=ffff81007f450d80)
mptscsih: ioc0: attempting target reset! (sc=ffff81007f450d80)
...

> Loading mptscsih.ko module
> Loading mptspi.ko module
> [ 6.591066] Fusion MPT SPI Host driver 3.04.06
> [ 6.592181] ACPI: PCI Interrupt 0000:01:01.0[A] -> GSI 22 (level, low) -> IRQ 22
> [ 6.593991] mptbase: ioc0: Initiating bringup
> [ 6.718342] ioc0: LSI53C1030 B2: Capabilities={Initiator}
> [ 6.722484] mptbase: ioc0: PCI-MSI enabled
> [ 16.902699] mptbase: ioc0: Initiating recovery
> [ 16.903618] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
> [ 16.904618] mptbase: ioc0: WARNING - FAULT code = 8112h
> [ 21.909082] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
> [ 39.152711] mptbase: ioc0: Recovered from IOC FAULT
> [ 61.630538] BUG: unable to handle kernel NULL pointer dereference at 00000528
> [ 61.632545] IP: [] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
> [ 61.634545] *pde = 00000000
> [ 61.636219] Oops: 0000 [#1] SMP
> [ 61.636537] last sysfs file: /sys/block/ram15/dev
> [ 61.636537] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> [ 61.636537]
> [ 61.636537] Pid: 17, comm: events/2 Not tainted (2.6.25-rc6-next-20080325-autotest #1)
> [ 61.636537] EIP: 0060:[] EFLAGS: 00010282 CPU: 2
> [ 61.636537] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
> [ 61.636537] EAX: f79e5868 EBX: f79e586c ECX: f78c308c EDX: 00000001
> [ 61.636537] ESI: f7867e38 EDI: 00000528 EBP: f78a2f78 ESP: f78a2f58
> [ 61.636537] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
> [ 61.636537] Process events/2 (pid: 17, ti=f78a2000 task=f78c29a0 task.ti=f78a2000)
> [ 61.636537] Stack: 00000000 00000002 00000000 c0430b45 f78a2f90 f79e586c f7867e38 f79e5868
> [ 61.636537] f78a2fac c0430b80 00000000 00000002 c0430b45 f881ccbd f8821588 c08ee870
> [ 61.636537] f881d870 00000002 f7867e38 c043140a f7867e60 f78a2fd0 c04314be 00000000
> [ 61.636537] Call Trace:
> [ 61.636537] [] run_workqueue+0x80/0x186
> [ 61.636537] [] run_workqueue+0xbb/0x186
> [ 61.636537] [] run_workqueue+0x80/0x186
> [ 61.636537] [] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
> [ 61.636537] [] worker_thread+0x0/0xbf
> [ 61.636537] [] worker_thread+0xb4/0xbf
> [ 61.636537] [] autoremove_wake_function+0x0/0x33
> [ 61.636537] [] kthread+0x3b/0x64
> [ 61.636537] [] kthread+0x0/0x64
> [ 61.636537] [] kernel_thread_helper+0x7/0x10
> [ 61.636537] =======================
> [ 61.636537] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
> [ 61.636537] EIP: [] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a2f58
> [ 61.636550] ---[ end trace c0dc9c06e06bc602 ]---
> [ 47.107291] mptbase: ioc0: Initiating recovery
> [ 47.108284] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
> [ 47.109284] mptbase: ioc0: WARNING - FAULT code = 8112h
> [ 52.122242] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
> [ 69.374395] mptbase: ioc0: Recovered from IOC FAULT
> [ 69.448422] Clocksource tsc unstable (delta = 18746181568 ns)
> [ 91.888899] BUG: unable to handle kernel NULL pointer dereference at 00000528
> [ 91.890902] IP: [] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
> [ 91.892902] *pde = 00000000
> [ 91.894904] Oops: 0000 [#2] SMP
> [ 91.895898] last sysfs file: /sys/block/ram15/dev
> [ 91.895898] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> [ 91.895898]
> [ 91.895898] Pid: 15, comm: events/0 Tainted: G D (2.6.25-rc6-next-20080325-autotest #1)
> [ 91.895898] EIP: 0060:[] EFLAGS: 00010282 CPU: 0
> [ 91.895898] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
> [ 91.895898] EAX: f7a427b8 EBX: f7a427bc ECX: 00000000 EDX: 00000000
> [ 91.895898] ESI: f7867f68 EDI: 00000528 EBP: f7877f78 ESP: f7877f58
> [ 91.895898] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
> [ 91.895898] Process events/0 (pid: 15, ti=f7877000 task=f789e8a0 task.ti=f7877000)
> [ 91.895898] Stack: 00000000 00000002 00000000 c0430b45 f7877f90 f7a427bc f7867f68 f7a427b8
> [ 91.895898] f7877fac c0430b80 00000000 00000002 c0430b45 f881ccbd 5a5a5a5a 5a5a5a5a
> [ 91.895898] 5a5a5a5a 5a5a5a5a f7867f68 c043140a f7867f90 f7877fd0 c04314be 00000000
> [ 91.895898] Call Trace:
> [ 91.895898] [] run_workqueue+0x80/0x186
> [ 91.895898] [] run_workqueue+0xbb/0x186
> [ 91.895898] [] run_workqueue+0x80/0x186
> [ 91.895898] [] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
> [ 91.895898] [] worker_thread+0x0/0xbf
> [ 91.895898] [] worker_thread+0xb4/0xbf
> [ 91.895898] [] autoremove_wake_function+0x0/0x33
> [ 91.895898] [] kthread+0x3b/0x64
> [ 91.895898] [] kthread+0x0/0x64
> [ 91.895898] [] kernel_thread_helper+0x7/0x10
> [ 91.895898] =======================
> [ 91.895898] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
> [ 91.895898] EIP: [] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f7877f58
> [ 91.895903] ---[ end trace c0dc9c06e06bc602 ]---
> [ 82.434031] mptbase: ioc0: Initiating recovery
> [ 82.435028] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
> [ 82.436028] mptbase: ioc0: WARNING - FAULT code = 8112h
> [ 87.440153] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
> [ 104.682001] mptbase: ioc0: Recovered from IOC FAULT
> [ 127.157135] BUG: unable to handle kernel NULL pointer dereference at 00000528
> [ 127.159138] IP: [] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
> [ 127.161139] *pde = 00000000
> [ 127.163139] Oops: 0000 [#3] SMP
> [ 127.164134] last sysfs file: /sys/block/ram15/dev
> [ 127.164134] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> [ 127.164134]
> [ 127.164134] Pid: 16, comm: events/1 Tainted: G D (2.6.25-rc6-next-20080325-autotest #1)
> [ 127.164134] EIP: 0060:[] EFLAGS: 00010282 CPU: 1
> [ 127.164134] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
> [ 127.164134] EAX: f7a42fa0 EBX: f7a42fa4 ECX: 00000000 EDX: 00000000
> [ 127.164134] ESI: f7867ed0 EDI: 00000528 EBP: f78a1f78 ESP: f78a1f58
> [ 127.164134] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
> [ 127.164134] Process events/1 (pid: 16, ti=f78a1000 task=f78c0920 task.ti=f78a1000)
> [ 127.164134] Stack: 00000000 00000002 00000000 c0430b45 f78a1f90 f7a42fa4 f7867ed0 f7a42fa0
> [ 127.164134] f78a1fac c0430b80 00000000 00000002 c0430b45 f881ccbd 5a5a5a5a 5a5a5a5a
> [ 127.164134] 5a5a5a5a 5a5a5a5a f7867ed0 c043140a f7867ef8 f78a1fd0 c04314be 00000000
> [ 127.164134] Call Trace:
> [ 127.164134] [] run_workqueue+0x80/0x186
> [ 127.164134] [] run_workqueue+0xbb/0x186
> [ 127.164134] [] run_workqueue+0x80/0x186
> [ 127.164134] [] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
> [ 127.164134] [] worker_thread+0x0/0xbf
> [ 127.164134] [] worker_thread+0xb4/0xbf
> [ 127.164134] [] autoremove_wake_function+0x0/0x33
> [ 127.164134] [] kthread+0x3b/0x64
> [ 127.164134] [] kthread+0x0/0x64
> [ 127.164134] [] kernel_thread_helper+0x7/0x10
> [ 127.164134] =======================
> [ 127.164134] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
> [ 127.164134] EIP: [] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a1f58
> [ 127.164147] ---[ end trace c0dc9c06e06bc602 ]---