Hi all,
I have created today's linux-next tree at
git://git.kernel.org/pub/scm/linux/kernel/git/sfr/linux-next.git
(tar balls at
http://www.kernel.org/pub/linux/kernel/people/sfr/linux-next/).
You can see which trees have been included by looking in the Next/Trees
file in the source. There are also quilt-import.log and merge.log files
in the Next directory. Between each merge, the tree was built with
a ppc64_defconfig for powerpc and an allmodconfig for x86_64.
There were a few merge conflicts (fairly trivial) and a couple of build
failures that have been notified.
I have added POWERPC-really-export-empty_zero_page.patch and reverted
async-tx commit 9974693221954ef1019f87d32ae26a0757d2ee85 ("dmaengine: ack
to flags: make use of the unused bits in the 'ack' field") as these are
still needed to make powerpc allmodconfig build (in particular ext4 and
drivers/dma/fsldma.c). These have been reported previously.
We are up to 48 trees, more are welcome (even if they are currently
empty). The influx of new trees has slowed completely. Thanks to those
who have contributed, and to those who haven't, please do.
In particular, I have very few of the architecture trees ....
Status of my local build tests is at
http://kisskb.ellerman.id.au/linux-next. If maintainers want to give
advice about cross compilers/configs that work, we are always open to add
more builds.
Thanks to Jan Dittmer for adding the linux-next tree to his build tests
at http://l4x.org/k/.
--
Cheers,
Stephen Rothwell [email protected]
On Tue, 25 Mar 2008 17:52:24 +1100 Stephen Rothwell wrote:
> Hi all,
>
> I have created today's linux-next tree at
> git://git.kernel.org/pub/scm/linux/kernel/git/sfr/linux-next.git
> (tar balls at
> http://www.kernel.org/pub/linux/kernel/people/sfr/linux-next/).
ocfs2 link/build problem with
CONFIG_OCFS2_FS=y
CONFIG_OCFS2_FS_O2CB=m
ERROR: "dlmunlock" [fs/ocfs2/ocfs2_stack_o2cb.ko] undefined!
ERROR: "dlm_setup_eviction_cb" [fs/ocfs2/ocfs2_stack_o2cb.ko] undefined!
ERROR: "dlm_register_eviction_cb" [fs/ocfs2/ocfs2_stack_o2cb.ko] undefined!
ERROR: "dlm_register_domain" [fs/ocfs2/ocfs2_stack_o2cb.ko] undefined!
ERROR: "dlm_unregister_domain" [fs/ocfs2/ocfs2_stack_o2cb.ko] undefined!
ERROR: "dlm_unregister_eviction_cb" [fs/ocfs2/ocfs2_stack_o2cb.ko] undefined!
ERROR: "dlmlock" [fs/ocfs2/ocfs2_stack_o2cb.ko] undefined!
ERROR: "dlm_print_one_lock" [fs/ocfs2/ocfs2_stack_o2cb.ko] undefined!
make[2]: *** [__modpost] Error 1
---
~Randy
Hi Stephen,
Kernel bug is hit while booting up the next-20080325 kernel with MPT Fusion driver built in.This was reported previously
for the next-20080320 kernel http://marc.info/?l=linux-next&m=120601013920868&w=2
Loading mptscsih.ko module
Loading mptspi.ko module
[ 6.591066] Fusion MPT SPI Host driver 3.04.06
[ 6.592181] ACPI: PCI Interrupt 0000:01:01.0[A] -> GSI 22 (level, low) -> IRQ 22
[ 6.593991] mptbase: ioc0: Initiating bringup
[ 6.718342] ioc0: LSI53C1030 B2: Capabilities={Initiator}
[ 6.722484] mptbase: ioc0: PCI-MSI enabled
[ 16.902699] mptbase: ioc0: Initiating recovery
[ 16.903618] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
[ 16.904618] mptbase: ioc0: WARNING - FAULT code = 8112h
[ 21.909082] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
[ 39.152711] mptbase: ioc0: Recovered from IOC FAULT
[ 61.630538] BUG: unable to handle kernel NULL pointer dereference at 00000528
[ 61.632545] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
[ 61.634545] *pde = 00000000
[ 61.636219] Oops: 0000 [#1] SMP
[ 61.636537] last sysfs file: /sys/block/ram15/dev
[ 61.636537] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
[ 61.636537]
[ 61.636537] Pid: 17, comm: events/2 Not tainted (2.6.25-rc6-next-20080325-autotest #1)
[ 61.636537] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 2
[ 61.636537] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
[ 61.636537] EAX: f79e5868 EBX: f79e586c ECX: f78c308c EDX: 00000001
[ 61.636537] ESI: f7867e38 EDI: 00000528 EBP: f78a2f78 ESP: f78a2f58
[ 61.636537] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[ 61.636537] Process events/2 (pid: 17, ti=f78a2000 task=f78c29a0 task.ti=f78a2000)
[ 61.636537] Stack: 00000000 00000002 00000000 c0430b45 f78a2f90 f79e586c f7867e38 f79e5868
[ 61.636537] f78a2fac c0430b80 00000000 00000002 c0430b45 f881ccbd f8821588 c08ee870
[ 61.636537] f881d870 00000002 f7867e38 c043140a f7867e60 f78a2fd0 c04314be 00000000
[ 61.636537] Call Trace:
[ 61.636537] [<c0430b45>] run_workqueue+0x80/0x186
[ 61.636537] [<c0430b80>] run_workqueue+0xbb/0x186
[ 61.636537] [<c0430b45>] run_workqueue+0x80/0x186
[ 61.636537] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
[ 61.636537] [<c043140a>] worker_thread+0x0/0xbf
[ 61.636537] [<c04314be>] worker_thread+0xb4/0xbf
[ 61.636537] [<c043393d>] autoremove_wake_function+0x0/0x33
[ 61.636537] [<c043387b>] kthread+0x3b/0x64
[ 61.636537] [<c0433840>] kthread+0x0/0x64
[ 61.636537] [<c040468f>] kernel_thread_helper+0x7/0x10
[ 61.636537] =======================
[ 61.636537] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
[ 61.636537] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a2f58
[ 61.636550] ---[ end trace c0dc9c06e06bc602 ]---
[ 47.107291] mptbase: ioc0: Initiating recovery
[ 47.108284] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
[ 47.109284] mptbase: ioc0: WARNING - FAULT code = 8112h
[ 52.122242] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
[ 69.374395] mptbase: ioc0: Recovered from IOC FAULT
[ 69.448422] Clocksource tsc unstable (delta = 18746181568 ns)
[ 91.888899] BUG: unable to handle kernel NULL pointer dereference at 00000528
[ 91.890902] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
[ 91.892902] *pde = 00000000
[ 91.894904] Oops: 0000 [#2] SMP
[ 91.895898] last sysfs file: /sys/block/ram15/dev
[ 91.895898] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
[ 91.895898]
[ 91.895898] Pid: 15, comm: events/0 Tainted: G D (2.6.25-rc6-next-20080325-autotest #1)
[ 91.895898] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 0
[ 91.895898] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
[ 91.895898] EAX: f7a427b8 EBX: f7a427bc ECX: 00000000 EDX: 00000000
[ 91.895898] ESI: f7867f68 EDI: 00000528 EBP: f7877f78 ESP: f7877f58
[ 91.895898] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[ 91.895898] Process events/0 (pid: 15, ti=f7877000 task=f789e8a0 task.ti=f7877000)
[ 91.895898] Stack: 00000000 00000002 00000000 c0430b45 f7877f90 f7a427bc f7867f68 f7a427b8
[ 91.895898] f7877fac c0430b80 00000000 00000002 c0430b45 f881ccbd 5a5a5a5a 5a5a5a5a
[ 91.895898] 5a5a5a5a 5a5a5a5a f7867f68 c043140a f7867f90 f7877fd0 c04314be 00000000
[ 91.895898] Call Trace:
[ 91.895898] [<c0430b45>] run_workqueue+0x80/0x186
[ 91.895898] [<c0430b80>] run_workqueue+0xbb/0x186
[ 91.895898] [<c0430b45>] run_workqueue+0x80/0x186
[ 91.895898] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
[ 91.895898] [<c043140a>] worker_thread+0x0/0xbf
[ 91.895898] [<c04314be>] worker_thread+0xb4/0xbf
[ 91.895898] [<c043393d>] autoremove_wake_function+0x0/0x33
[ 91.895898] [<c043387b>] kthread+0x3b/0x64
[ 91.895898] [<c0433840>] kthread+0x0/0x64
[ 91.895898] [<c040468f>] kernel_thread_helper+0x7/0x10
[ 91.895898] =======================
[ 91.895898] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
[ 91.895898] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f7877f58
[ 91.895903] ---[ end trace c0dc9c06e06bc602 ]---
[ 82.434031] mptbase: ioc0: Initiating recovery
[ 82.435028] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
[ 82.436028] mptbase: ioc0: WARNING - FAULT code = 8112h
[ 87.440153] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
[ 104.682001] mptbase: ioc0: Recovered from IOC FAULT
[ 127.157135] BUG: unable to handle kernel NULL pointer dereference at 00000528
[ 127.159138] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
[ 127.161139] *pde = 00000000
[ 127.163139] Oops: 0000 [#3] SMP
[ 127.164134] last sysfs file: /sys/block/ram15/dev
[ 127.164134] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
[ 127.164134]
[ 127.164134] Pid: 16, comm: events/1 Tainted: G D (2.6.25-rc6-next-20080325-autotest #1)
[ 127.164134] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 1
[ 127.164134] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
[ 127.164134] EAX: f7a42fa0 EBX: f7a42fa4 ECX: 00000000 EDX: 00000000
[ 127.164134] ESI: f7867ed0 EDI: 00000528 EBP: f78a1f78 ESP: f78a1f58
[ 127.164134] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[ 127.164134] Process events/1 (pid: 16, ti=f78a1000 task=f78c0920 task.ti=f78a1000)
[ 127.164134] Stack: 00000000 00000002 00000000 c0430b45 f78a1f90 f7a42fa4 f7867ed0 f7a42fa0
[ 127.164134] f78a1fac c0430b80 00000000 00000002 c0430b45 f881ccbd 5a5a5a5a 5a5a5a5a
[ 127.164134] 5a5a5a5a 5a5a5a5a f7867ed0 c043140a f7867ef8 f78a1fd0 c04314be 00000000
[ 127.164134] Call Trace:
[ 127.164134] [<c0430b45>] run_workqueue+0x80/0x186
[ 127.164134] [<c0430b80>] run_workqueue+0xbb/0x186
[ 127.164134] [<c0430b45>] run_workqueue+0x80/0x186
[ 127.164134] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
[ 127.164134] [<c043140a>] worker_thread+0x0/0xbf
[ 127.164134] [<c04314be>] worker_thread+0xb4/0xbf
[ 127.164134] [<c043393d>] autoremove_wake_function+0x0/0x33
[ 127.164134] [<c043387b>] kthread+0x3b/0x64
[ 127.164134] [<c0433840>] kthread+0x0/0x64
[ 127.164134] [<c040468f>] kernel_thread_helper+0x7/0x10
[ 127.164134] =======================
[ 127.164134] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
[ 127.164134] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a1f58
[ 127.164147] ---[ end trace c0dc9c06e06bc602 ]---
--
Thanks & Regards,
Kamalesh Babulal,
Linux Technology Center,
IBM, ISTL.
On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
> Hi Stephen,
>
> Kernel bug is hit while booting up the next-20080325 kernel with MPT
> Fusion driver built in.This was reported previously for the
> next-20080320 kernel
> http://marc.info/?l=linux-next&m=120601013920868&w=2
Hi, did you ever get this fixed? I have the same problem every time I
try to boot any 2.6.26-rc kernel.
http://marc.info/?l=linux-scsi&m=121061780821823&w=4
Loading scsi_transport_spi.ko module
Loading mptscsih.ko module
Loading mptspi.ko module
Fusion MPT SPI Host driver 3.04.06
ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
mptbase: ioc0: Initiating bringup
ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
mptbase: ioc0: PCI-MSI enabled
mptbase: ioc0: Initiating recovery
BUG: unable to handle kernel NULL pointer dereference at 0000000000000948
IP: [<ffffffffa00e5e28>] :mptspi:mptspi_dv_renegotiate_work+0x13/0xc3
PGD 7e981067 PUD 7e982067 PMD 0
Oops: 0000 [1] SMP
CPU 1
Modules linked in: mptspi(+) mptscsih scsi_transport_spi mptbase sd_mod
scsi_mod
ext3 jbd ehci_hcd ohci_hcd uhci_hcd
Pid: 16, comm: events/1 Not tainted 2.6.26-rc8 #2
RIP: 0010:[<ffffffffa00e5e28>] [<ffffffffa00e5e28>]
:mptspi:mptspi_dv_renegotia
te_work+0x13/0xc3
RSP: 0000:ffff81007f479e50 EFLAGS: 00010286
RAX: ffffffff802429f7 RBX: ffff81007f479e90 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff81007f424138
RBP: ffff81007f479e80 R08: 0000000000000002 R09: 0000000000000000
R10: ffffffff802429f7 R11: ffff81007ffddde0 R12: ffff81007ffbcd90
R13: 0000000000000948 R14: ffffffffa00e5e15 R15: 0000000000000000
FS: 0000000000680850(0000) GS:ffff81007ff5fbe8(0000)
knlGS:0000000000000000
CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 0000000000000948 CR3: 000000007e979000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process events/1 (pid: 16, threadinfo ffff81007f478000, task
ffff81007f476480)
Stack: ffffffffa00e5e15 0000000000000000 ffff81007f479e90
ffff81007ffbcd90
ffff81007f424138 ffffffffa00e5e15 ffff81007f479ed0 ffffffff80242a46
5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a
Call Trace:
[<ffffffffa00e5e15>] ? :mptspi:mptspi_dv_renegotiate_work+0x0/0xc3
[<ffffffffa00e5e15>] ? :mptspi:mptspi_dv_renegotiate_work+0x0/0xc3
[<ffffffff80242a46>] run_workqueue+0xee/0x1f6
[<ffffffff802435d3>] worker_thread+0xdb/0xe8
[<ffffffff80246254>] ? autoremove_wake_function+0x0/0x38
[<ffffffff802434f8>] ? worker_thread+0x0/0xe8
[<ffffffff80246131>] kthread+0x49/0x78
[<ffffffff8020cd98>] child_rip+0xa/0x12
[<ffffffff80245fac>] ? kthreadd+0x1a6/0x1cb
[<ffffffff802460e8>] ? kthread+0x0/0x78
[<ffffffff8020cd8e>] ? child_rip+0x0/0x12
Code: 8b bc 24 f8 00 00 00 e8 83 f7 ff ff 5a 5b 41 5c 41 5d 41 5e 41 5f c9
c3 55
48 89 e5 41 56 41 55 41 54 53 48 83 ec 10 4c 8b 6f 40 <4d> 8b 75 00 e8 6a
9e 1a
e0 66 41 83 bd fa 02 00 00 00 49 8b be
RIP [<ffffffffa00e5e28>] :mptspi:mptspi_dv_renegotiate_work+0x13/0xc3
RSP <ffff81007f479e50>
CR2: 0000000000000948
---[ end trace 9714d7078ea4157a ]---
> Loading mptscsih.ko module
> Loading mptspi.ko module
> [ 6.591066] Fusion MPT SPI Host driver 3.04.06
> [ 6.592181] ACPI: PCI Interrupt 0000:01:01.0[A] -> GSI 22 (level, low) -> IRQ 22
> [ 6.593991] mptbase: ioc0: Initiating bringup
> [ 6.718342] ioc0: LSI53C1030 B2: Capabilities={Initiator}
> [ 6.722484] mptbase: ioc0: PCI-MSI enabled
> [ 16.902699] mptbase: ioc0: Initiating recovery
> [ 16.903618] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
> [ 16.904618] mptbase: ioc0: WARNING - FAULT code = 8112h
> [ 21.909082] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
> [ 39.152711] mptbase: ioc0: Recovered from IOC FAULT
> [ 61.630538] BUG: unable to handle kernel NULL pointer dereference at 00000528
> [ 61.632545] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
> [ 61.634545] *pde = 00000000
> [ 61.636219] Oops: 0000 [#1] SMP
> [ 61.636537] last sysfs file: /sys/block/ram15/dev
> [ 61.636537] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> [ 61.636537]
> [ 61.636537] Pid: 17, comm: events/2 Not tainted (2.6.25-rc6-next-20080325-autotest #1)
> [ 61.636537] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 2
> [ 61.636537] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
> [ 61.636537] EAX: f79e5868 EBX: f79e586c ECX: f78c308c EDX: 00000001
> [ 61.636537] ESI: f7867e38 EDI: 00000528 EBP: f78a2f78 ESP: f78a2f58
> [ 61.636537] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
> [ 61.636537] Process events/2 (pid: 17, ti=f78a2000 task=f78c29a0 task.ti=f78a2000)
> [ 61.636537] Stack: 00000000 00000002 00000000 c0430b45 f78a2f90 f79e586c f7867e38 f79e5868
> [ 61.636537] f78a2fac c0430b80 00000000 00000002 c0430b45 f881ccbd f8821588 c08ee870
> [ 61.636537] f881d870 00000002 f7867e38 c043140a f7867e60 f78a2fd0 c04314be 00000000
> [ 61.636537] Call Trace:
> [ 61.636537] [<c0430b45>] run_workqueue+0x80/0x186
> [ 61.636537] [<c0430b80>] run_workqueue+0xbb/0x186
> [ 61.636537] [<c0430b45>] run_workqueue+0x80/0x186
> [ 61.636537] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
> [ 61.636537] [<c043140a>] worker_thread+0x0/0xbf
> [ 61.636537] [<c04314be>] worker_thread+0xb4/0xbf
> [ 61.636537] [<c043393d>] autoremove_wake_function+0x0/0x33
> [ 61.636537] [<c043387b>] kthread+0x3b/0x64
> [ 61.636537] [<c0433840>] kthread+0x0/0x64
> [ 61.636537] [<c040468f>] kernel_thread_helper+0x7/0x10
> [ 61.636537] =======================
> [ 61.636537] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
> [ 61.636537] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a2f58
> [ 61.636550] ---[ end trace c0dc9c06e06bc602 ]---
On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
> Hi Stephen,
>
> Kernel bug is hit while booting up the next-20080325 kernel with MPT
> Fusion driver built in.This was reported previously for the
> next-20080320 kernel
> http://marc.info/?l=linux-next&m=120601013920868&w=2
Hi, did you ever get this fixed? I've been having the same problem,
http://marc.info/?l=linux-scsi&m=121061780821823&w=4
still exists on 2.6.26-rc8 for me,
Loading scsi_transport_spi.ko module
Loading mptscsih.ko module
Loading mptspi.ko module
Fusion MPT SPI Host driver 3.04.06
ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
mptbase: ioc0: Initiating bringup
ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
mptbase: ioc0: PCI-MSI enabled
mptbase: ioc0: Initiating recovery
BUG: unable to handle kernel NULL pointer dereference at 0000000000000948
IP: [<ffffffffa00e5e28>] :mptspi:mptspi_dv_renegotiate_work+0x13/0xc3
PGD 7e981067 PUD 7e982067 PMD 0
Oops: 0000 [1] SMP
CPU 1
Modules linked in: mptspi(+) mptscsih scsi_transport_spi mptbase sd_mod
scsi_mod
ext3 jbd ehci_hcd ohci_hcd uhci_hcd
Pid: 16, comm: events/1 Not tainted 2.6.26-rc8 #2
RIP: 0010:[<ffffffffa00e5e28>] [<ffffffffa00e5e28>]
:mptspi:mptspi_dv_renegotia
te_work+0x13/0xc3
RSP: 0000:ffff81007f479e50 EFLAGS: 00010286
RAX: ffffffff802429f7 RBX: ffff81007f479e90 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff81007f424138
RBP: ffff81007f479e80 R08: 0000000000000002 R09: 0000000000000000
R10: ffffffff802429f7 R11: ffff81007ffddde0 R12: ffff81007ffbcd90
R13: 0000000000000948 R14: ffffffffa00e5e15 R15: 0000000000000000
FS: 0000000000680850(0000) GS:ffff81007ff5fbe8(0000)
knlGS:0000000000000000
CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 0000000000000948 CR3: 000000007e979000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process events/1 (pid: 16, threadinfo ffff81007f478000, task
ffff81007f476480)
Stack: ffffffffa00e5e15 0000000000000000 ffff81007f479e90
ffff81007ffbcd90
ffff81007f424138 ffffffffa00e5e15 ffff81007f479ed0 ffffffff80242a46
5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a
Call Trace:
[<ffffffffa00e5e15>] ? :mptspi:mptspi_dv_renegotiate_work+0x0/0xc3
[<ffffffffa00e5e15>] ? :mptspi:mptspi_dv_renegotiate_work+0x0/0xc3
[<ffffffff80242a46>] run_workqueue+0xee/0x1f6
[<ffffffff802435d3>] worker_thread+0xdb/0xe8
[<ffffffff80246254>] ? autoremove_wake_function+0x0/0x38
[<ffffffff802434f8>] ? worker_thread+0x0/0xe8
[<ffffffff80246131>] kthread+0x49/0x78
[<ffffffff8020cd98>] child_rip+0xa/0x12
[<ffffffff80245fac>] ? kthreadd+0x1a6/0x1cb
[<ffffffff802460e8>] ? kthread+0x0/0x78
[<ffffffff8020cd8e>] ? child_rip+0x0/0x12
Code: 8b bc 24 f8 00 00 00 e8 83 f7 ff ff 5a 5b 41 5c 41 5d 41 5e 41 5f c9
c3 55
48 89 e5 41 56 41 55 41 54 53 48 83 ec 10 4c 8b 6f 40 <4d> 8b 75 00 e8 6a
9e 1a
e0 66 41 83 bd fa 02 00 00 00 49 8b be
RIP [<ffffffffa00e5e28>] :mptspi:mptspi_dv_renegotiate_work+0x13/0xc3
RSP <ffff81007f479e50>
CR2: 0000000000000948
---[ end trace 9714d7078ea4157a ]---
mptbase: ioc0: Initiating recovery
mptbase: ioc0: Initiating recovery
mptbase: ioc0: Initiating recovery
mptbase: ioc0: Initiating recovery
mptbase: ioc0: Initiating recovery
scsi0 : ioc0: LSI53C1030 B2, FwRev=01032700h, Ports=1, MaxQ=255, IRQ=8412
target0:0:0: mptspi: ioc0: dma_alloc_coherent for parameters failed
mptscsih: ioc0: attempting task abort! (sc=ffff81007f450d80)
scsi 0:0:0:0: CDB: Inquiry: 12 00 00 00 24 00
mptbase: ioc0: Initiating recovery
scsi 0:0:0:0: mptscsih: ioc0: completing cmds: fw_channel 0, fw_id 0,
sc=ffff81007f450d80, mf = ffff81007ea42ce0, idx=d
mptscsih: ioc0: Issue of TaskMgmt failed!
mptscsih: ioc0: task abort: FAILED (sc=ffff81007f450d80)
mptscsih: ioc0: attempting target reset! (sc=ffff81007f450d80)
...
> Loading mptscsih.ko module
> Loading mptspi.ko module
> [ 6.591066] Fusion MPT SPI Host driver 3.04.06
> [ 6.592181] ACPI: PCI Interrupt 0000:01:01.0[A] -> GSI 22 (level, low) -> IRQ 22
> [ 6.593991] mptbase: ioc0: Initiating bringup
> [ 6.718342] ioc0: LSI53C1030 B2: Capabilities={Initiator}
> [ 6.722484] mptbase: ioc0: PCI-MSI enabled
> [ 16.902699] mptbase: ioc0: Initiating recovery
> [ 16.903618] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
> [ 16.904618] mptbase: ioc0: WARNING - FAULT code = 8112h
> [ 21.909082] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
> [ 39.152711] mptbase: ioc0: Recovered from IOC FAULT
> [ 61.630538] BUG: unable to handle kernel NULL pointer dereference at 00000528
> [ 61.632545] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
> [ 61.634545] *pde = 00000000
> [ 61.636219] Oops: 0000 [#1] SMP
> [ 61.636537] last sysfs file: /sys/block/ram15/dev
> [ 61.636537] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> [ 61.636537]
> [ 61.636537] Pid: 17, comm: events/2 Not tainted (2.6.25-rc6-next-20080325-autotest #1)
> [ 61.636537] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 2
> [ 61.636537] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
> [ 61.636537] EAX: f79e5868 EBX: f79e586c ECX: f78c308c EDX: 00000001
> [ 61.636537] ESI: f7867e38 EDI: 00000528 EBP: f78a2f78 ESP: f78a2f58
> [ 61.636537] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
> [ 61.636537] Process events/2 (pid: 17, ti=f78a2000 task=f78c29a0 task.ti=f78a2000)
> [ 61.636537] Stack: 00000000 00000002 00000000 c0430b45 f78a2f90 f79e586c f7867e38 f79e5868
> [ 61.636537] f78a2fac c0430b80 00000000 00000002 c0430b45 f881ccbd f8821588 c08ee870
> [ 61.636537] f881d870 00000002 f7867e38 c043140a f7867e60 f78a2fd0 c04314be 00000000
> [ 61.636537] Call Trace:
> [ 61.636537] [<c0430b45>] run_workqueue+0x80/0x186
> [ 61.636537] [<c0430b80>] run_workqueue+0xbb/0x186
> [ 61.636537] [<c0430b45>] run_workqueue+0x80/0x186
> [ 61.636537] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
> [ 61.636537] [<c043140a>] worker_thread+0x0/0xbf
> [ 61.636537] [<c04314be>] worker_thread+0xb4/0xbf
> [ 61.636537] [<c043393d>] autoremove_wake_function+0x0/0x33
> [ 61.636537] [<c043387b>] kthread+0x3b/0x64
> [ 61.636537] [<c0433840>] kthread+0x0/0x64
> [ 61.636537] [<c040468f>] kernel_thread_helper+0x7/0x10
> [ 61.636537] =======================
> [ 61.636537] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
> [ 61.636537] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a2f58
> [ 61.636550] ---[ end trace c0dc9c06e06bc602 ]---
> [ 47.107291] mptbase: ioc0: Initiating recovery
> [ 47.108284] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
> [ 47.109284] mptbase: ioc0: WARNING - FAULT code = 8112h
> [ 52.122242] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
> [ 69.374395] mptbase: ioc0: Recovered from IOC FAULT
> [ 69.448422] Clocksource tsc unstable (delta = 18746181568 ns)
> [ 91.888899] BUG: unable to handle kernel NULL pointer dereference at 00000528
> [ 91.890902] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
> [ 91.892902] *pde = 00000000
> [ 91.894904] Oops: 0000 [#2] SMP
> [ 91.895898] last sysfs file: /sys/block/ram15/dev
> [ 91.895898] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> [ 91.895898]
> [ 91.895898] Pid: 15, comm: events/0 Tainted: G D (2.6.25-rc6-next-20080325-autotest #1)
> [ 91.895898] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 0
> [ 91.895898] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
> [ 91.895898] EAX: f7a427b8 EBX: f7a427bc ECX: 00000000 EDX: 00000000
> [ 91.895898] ESI: f7867f68 EDI: 00000528 EBP: f7877f78 ESP: f7877f58
> [ 91.895898] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
> [ 91.895898] Process events/0 (pid: 15, ti=f7877000 task=f789e8a0 task.ti=f7877000)
> [ 91.895898] Stack: 00000000 00000002 00000000 c0430b45 f7877f90 f7a427bc f7867f68 f7a427b8
> [ 91.895898] f7877fac c0430b80 00000000 00000002 c0430b45 f881ccbd 5a5a5a5a 5a5a5a5a
> [ 91.895898] 5a5a5a5a 5a5a5a5a f7867f68 c043140a f7867f90 f7877fd0 c04314be 00000000
> [ 91.895898] Call Trace:
> [ 91.895898] [<c0430b45>] run_workqueue+0x80/0x186
> [ 91.895898] [<c0430b80>] run_workqueue+0xbb/0x186
> [ 91.895898] [<c0430b45>] run_workqueue+0x80/0x186
> [ 91.895898] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
> [ 91.895898] [<c043140a>] worker_thread+0x0/0xbf
> [ 91.895898] [<c04314be>] worker_thread+0xb4/0xbf
> [ 91.895898] [<c043393d>] autoremove_wake_function+0x0/0x33
> [ 91.895898] [<c043387b>] kthread+0x3b/0x64
> [ 91.895898] [<c0433840>] kthread+0x0/0x64
> [ 91.895898] [<c040468f>] kernel_thread_helper+0x7/0x10
> [ 91.895898] =======================
> [ 91.895898] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
> [ 91.895898] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f7877f58
> [ 91.895903] ---[ end trace c0dc9c06e06bc602 ]---
> [ 82.434031] mptbase: ioc0: Initiating recovery
> [ 82.435028] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
> [ 82.436028] mptbase: ioc0: WARNING - FAULT code = 8112h
> [ 87.440153] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
> [ 104.682001] mptbase: ioc0: Recovered from IOC FAULT
> [ 127.157135] BUG: unable to handle kernel NULL pointer dereference at 00000528
> [ 127.159138] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
> [ 127.161139] *pde = 00000000
> [ 127.163139] Oops: 0000 [#3] SMP
> [ 127.164134] last sysfs file: /sys/block/ram15/dev
> [ 127.164134] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> [ 127.164134]
> [ 127.164134] Pid: 16, comm: events/1 Tainted: G D (2.6.25-rc6-next-20080325-autotest #1)
> [ 127.164134] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 1
> [ 127.164134] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
> [ 127.164134] EAX: f7a42fa0 EBX: f7a42fa4 ECX: 00000000 EDX: 00000000
> [ 127.164134] ESI: f7867ed0 EDI: 00000528 EBP: f78a1f78 ESP: f78a1f58
> [ 127.164134] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
> [ 127.164134] Process events/1 (pid: 16, ti=f78a1000 task=f78c0920 task.ti=f78a1000)
> [ 127.164134] Stack: 00000000 00000002 00000000 c0430b45 f78a1f90 f7a42fa4 f7867ed0 f7a42fa0
> [ 127.164134] f78a1fac c0430b80 00000000 00000002 c0430b45 f881ccbd 5a5a5a5a 5a5a5a5a
> [ 127.164134] 5a5a5a5a 5a5a5a5a f7867ed0 c043140a f7867ef8 f78a1fd0 c04314be 00000000
> [ 127.164134] Call Trace:
> [ 127.164134] [<c0430b45>] run_workqueue+0x80/0x186
> [ 127.164134] [<c0430b80>] run_workqueue+0xbb/0x186
> [ 127.164134] [<c0430b45>] run_workqueue+0x80/0x186
> [ 127.164134] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
> [ 127.164134] [<c043140a>] worker_thread+0x0/0xbf
> [ 127.164134] [<c04314be>] worker_thread+0xb4/0xbf
> [ 127.164134] [<c043393d>] autoremove_wake_function+0x0/0x33
> [ 127.164134] [<c043387b>] kthread+0x3b/0x64
> [ 127.164134] [<c0433840>] kthread+0x0/0x64
> [ 127.164134] [<c040468f>] kernel_thread_helper+0x7/0x10
> [ 127.164134] =======================
> [ 127.164134] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
> [ 127.164134] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a1f58
> [ 127.164147] ---[ end trace c0dc9c06e06bc602 ]---
David Teigland wrote:
> On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
>> Hi Stephen,
>>
>> Kernel bug is hit while booting up the next-20080325 kernel with MPT
>> Fusion driver built in.This was reported previously for the
>> next-20080320 kernel
>> http://marc.info/?l=linux-next&m=120601013920868&w=2
>
> Hi, did you ever get this fixed? I've been having the same problem,
> http://marc.info/?l=linux-scsi&m=121061780821823&w=4
> still exists on 2.6.26-rc8 for me,
>
Hi David,
No,there were no follow ups after that, I did not try any testing on that box
for more than 2 month now. I will try to reproduce the oops by Monday with latest
kernel available.
> Loading scsi_transport_spi.ko module
> Loading mptscsih.ko module
> Loading mptspi.ko module
> Fusion MPT SPI Host driver 3.04.06
> ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
> mptbase: ioc0: Initiating bringup
> ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
> mptbase: ioc0: PCI-MSI enabled
> mptbase: ioc0: Initiating recovery
> BUG: unable to handle kernel NULL pointer dereference at 0000000000000948
> IP: [<ffffffffa00e5e28>] :mptspi:mptspi_dv_renegotiate_work+0x13/0xc3
> PGD 7e981067 PUD 7e982067 PMD 0
> Oops: 0000 [1] SMP
> CPU 1
> Modules linked in: mptspi(+) mptscsih scsi_transport_spi mptbase sd_mod
> scsi_mod
> ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> Pid: 16, comm: events/1 Not tainted 2.6.26-rc8 #2
> RIP: 0010:[<ffffffffa00e5e28>] [<ffffffffa00e5e28>]
> :mptspi:mptspi_dv_renegotia
> te_work+0x13/0xc3
> RSP: 0000:ffff81007f479e50 EFLAGS: 00010286
> RAX: ffffffff802429f7 RBX: ffff81007f479e90 RCX: 0000000000000000
> RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff81007f424138
> RBP: ffff81007f479e80 R08: 0000000000000002 R09: 0000000000000000
> R10: ffffffff802429f7 R11: ffff81007ffddde0 R12: ffff81007ffbcd90
> R13: 0000000000000948 R14: ffffffffa00e5e15 R15: 0000000000000000
> FS: 0000000000680850(0000) GS:ffff81007ff5fbe8(0000)
> knlGS:0000000000000000
> CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> CR2: 0000000000000948 CR3: 000000007e979000 CR4: 00000000000006e0
> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> Process events/1 (pid: 16, threadinfo ffff81007f478000, task
> ffff81007f476480)
> Stack: ffffffffa00e5e15 0000000000000000 ffff81007f479e90
> ffff81007ffbcd90
> ffff81007f424138 ffffffffa00e5e15 ffff81007f479ed0 ffffffff80242a46
> 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a 5a5a5a5a5a5a5a5a
> Call Trace:
> [<ffffffffa00e5e15>] ? :mptspi:mptspi_dv_renegotiate_work+0x0/0xc3
> [<ffffffffa00e5e15>] ? :mptspi:mptspi_dv_renegotiate_work+0x0/0xc3
> [<ffffffff80242a46>] run_workqueue+0xee/0x1f6
> [<ffffffff802435d3>] worker_thread+0xdb/0xe8
> [<ffffffff80246254>] ? autoremove_wake_function+0x0/0x38
> [<ffffffff802434f8>] ? worker_thread+0x0/0xe8
> [<ffffffff80246131>] kthread+0x49/0x78
> [<ffffffff8020cd98>] child_rip+0xa/0x12
> [<ffffffff80245fac>] ? kthreadd+0x1a6/0x1cb
> [<ffffffff802460e8>] ? kthread+0x0/0x78
> [<ffffffff8020cd8e>] ? child_rip+0x0/0x12
>
>
> Code: 8b bc 24 f8 00 00 00 e8 83 f7 ff ff 5a 5b 41 5c 41 5d 41 5e 41 5f c9
> c3 55
> 48 89 e5 41 56 41 55 41 54 53 48 83 ec 10 4c 8b 6f 40 <4d> 8b 75 00 e8 6a
> 9e 1a
> e0 66 41 83 bd fa 02 00 00 00 49 8b be
> RIP [<ffffffffa00e5e28>] :mptspi:mptspi_dv_renegotiate_work+0x13/0xc3
> RSP <ffff81007f479e50>
> CR2: 0000000000000948
> ---[ end trace 9714d7078ea4157a ]---
> mptbase: ioc0: Initiating recovery
> mptbase: ioc0: Initiating recovery
> mptbase: ioc0: Initiating recovery
> mptbase: ioc0: Initiating recovery
> mptbase: ioc0: Initiating recovery
> scsi0 : ioc0: LSI53C1030 B2, FwRev=01032700h, Ports=1, MaxQ=255, IRQ=8412
> target0:0:0: mptspi: ioc0: dma_alloc_coherent for parameters failed
> mptscsih: ioc0: attempting task abort! (sc=ffff81007f450d80)
> scsi 0:0:0:0: CDB: Inquiry: 12 00 00 00 24 00
> mptbase: ioc0: Initiating recovery
> scsi 0:0:0:0: mptscsih: ioc0: completing cmds: fw_channel 0, fw_id 0,
> sc=ffff81007f450d80, mf = ffff81007ea42ce0, idx=d
> mptscsih: ioc0: Issue of TaskMgmt failed!
> mptscsih: ioc0: task abort: FAILED (sc=ffff81007f450d80)
> mptscsih: ioc0: attempting target reset! (sc=ffff81007f450d80)
>
> ...
>
>
>
>> Loading mptscsih.ko module
>> Loading mptspi.ko module
>> [ 6.591066] Fusion MPT SPI Host driver 3.04.06
>> [ 6.592181] ACPI: PCI Interrupt 0000:01:01.0[A] -> GSI 22 (level, low) -> IRQ 22
>> [ 6.593991] mptbase: ioc0: Initiating bringup
>> [ 6.718342] ioc0: LSI53C1030 B2: Capabilities={Initiator}
>> [ 6.722484] mptbase: ioc0: PCI-MSI enabled
>> [ 16.902699] mptbase: ioc0: Initiating recovery
>> [ 16.903618] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
>> [ 16.904618] mptbase: ioc0: WARNING - FAULT code = 8112h
>> [ 21.909082] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
>> [ 39.152711] mptbase: ioc0: Recovered from IOC FAULT
>> [ 61.630538] BUG: unable to handle kernel NULL pointer dereference at 00000528
>> [ 61.632545] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
>> [ 61.634545] *pde = 00000000
>> [ 61.636219] Oops: 0000 [#1] SMP
>> [ 61.636537] last sysfs file: /sys/block/ram15/dev
>> [ 61.636537] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
>> [ 61.636537]
>> [ 61.636537] Pid: 17, comm: events/2 Not tainted (2.6.25-rc6-next-20080325-autotest #1)
>> [ 61.636537] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 2
>> [ 61.636537] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
>> [ 61.636537] EAX: f79e5868 EBX: f79e586c ECX: f78c308c EDX: 00000001
>> [ 61.636537] ESI: f7867e38 EDI: 00000528 EBP: f78a2f78 ESP: f78a2f58
>> [ 61.636537] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
>> [ 61.636537] Process events/2 (pid: 17, ti=f78a2000 task=f78c29a0 task.ti=f78a2000)
>> [ 61.636537] Stack: 00000000 00000002 00000000 c0430b45 f78a2f90 f79e586c f7867e38 f79e5868
>> [ 61.636537] f78a2fac c0430b80 00000000 00000002 c0430b45 f881ccbd f8821588 c08ee870
>> [ 61.636537] f881d870 00000002 f7867e38 c043140a f7867e60 f78a2fd0 c04314be 00000000
>> [ 61.636537] Call Trace:
>> [ 61.636537] [<c0430b45>] run_workqueue+0x80/0x186
>> [ 61.636537] [<c0430b80>] run_workqueue+0xbb/0x186
>> [ 61.636537] [<c0430b45>] run_workqueue+0x80/0x186
>> [ 61.636537] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
>> [ 61.636537] [<c043140a>] worker_thread+0x0/0xbf
>> [ 61.636537] [<c04314be>] worker_thread+0xb4/0xbf
>> [ 61.636537] [<c043393d>] autoremove_wake_function+0x0/0x33
>> [ 61.636537] [<c043387b>] kthread+0x3b/0x64
>> [ 61.636537] [<c0433840>] kthread+0x0/0x64
>> [ 61.636537] [<c040468f>] kernel_thread_helper+0x7/0x10
>> [ 61.636537] =======================
>> [ 61.636537] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
>> [ 61.636537] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a2f58
>> [ 61.636550] ---[ end trace c0dc9c06e06bc602 ]---
>> [ 47.107291] mptbase: ioc0: Initiating recovery
>> [ 47.108284] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
>> [ 47.109284] mptbase: ioc0: WARNING - FAULT code = 8112h
>> [ 52.122242] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
>> [ 69.374395] mptbase: ioc0: Recovered from IOC FAULT
>> [ 69.448422] Clocksource tsc unstable (delta = 18746181568 ns)
>> [ 91.888899] BUG: unable to handle kernel NULL pointer dereference at 00000528
>> [ 91.890902] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
>> [ 91.892902] *pde = 00000000
>> [ 91.894904] Oops: 0000 [#2] SMP
>> [ 91.895898] last sysfs file: /sys/block/ram15/dev
>> [ 91.895898] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
>> [ 91.895898]
>> [ 91.895898] Pid: 15, comm: events/0 Tainted: G D (2.6.25-rc6-next-20080325-autotest #1)
>> [ 91.895898] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 0
>> [ 91.895898] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
>> [ 91.895898] EAX: f7a427b8 EBX: f7a427bc ECX: 00000000 EDX: 00000000
>> [ 91.895898] ESI: f7867f68 EDI: 00000528 EBP: f7877f78 ESP: f7877f58
>> [ 91.895898] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
>> [ 91.895898] Process events/0 (pid: 15, ti=f7877000 task=f789e8a0 task.ti=f7877000)
>> [ 91.895898] Stack: 00000000 00000002 00000000 c0430b45 f7877f90 f7a427bc f7867f68 f7a427b8
>> [ 91.895898] f7877fac c0430b80 00000000 00000002 c0430b45 f881ccbd 5a5a5a5a 5a5a5a5a
>> [ 91.895898] 5a5a5a5a 5a5a5a5a f7867f68 c043140a f7867f90 f7877fd0 c04314be 00000000
>> [ 91.895898] Call Trace:
>> [ 91.895898] [<c0430b45>] run_workqueue+0x80/0x186
>> [ 91.895898] [<c0430b80>] run_workqueue+0xbb/0x186
>> [ 91.895898] [<c0430b45>] run_workqueue+0x80/0x186
>> [ 91.895898] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
>> [ 91.895898] [<c043140a>] worker_thread+0x0/0xbf
>> [ 91.895898] [<c04314be>] worker_thread+0xb4/0xbf
>> [ 91.895898] [<c043393d>] autoremove_wake_function+0x0/0x33
>> [ 91.895898] [<c043387b>] kthread+0x3b/0x64
>> [ 91.895898] [<c0433840>] kthread+0x0/0x64
>> [ 91.895898] [<c040468f>] kernel_thread_helper+0x7/0x10
>> [ 91.895898] =======================
>> [ 91.895898] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
>> [ 91.895898] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f7877f58
>> [ 91.895903] ---[ end trace c0dc9c06e06bc602 ]---
>> [ 82.434031] mptbase: ioc0: Initiating recovery
>> [ 82.435028] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
>> [ 82.436028] mptbase: ioc0: WARNING - FAULT code = 8112h
>> [ 87.440153] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
>> [ 104.682001] mptbase: ioc0: Recovered from IOC FAULT
>> [ 127.157135] BUG: unable to handle kernel NULL pointer dereference at 00000528
>> [ 127.159138] IP: [<f881ccc9>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
>> [ 127.161139] *pde = 00000000
>> [ 127.163139] Oops: 0000 [#3] SMP
>> [ 127.164134] last sysfs file: /sys/block/ram15/dev
>> [ 127.164134] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
>> [ 127.164134]
>> [ 127.164134] Pid: 16, comm: events/1 Tainted: G D (2.6.25-rc6-next-20080325-autotest #1)
>> [ 127.164134] EIP: 0060:[<f881ccc9>] EFLAGS: 00010282 CPU: 1
>> [ 127.164134] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
>> [ 127.164134] EAX: f7a42fa0 EBX: f7a42fa4 ECX: 00000000 EDX: 00000000
>> [ 127.164134] ESI: f7867ed0 EDI: 00000528 EBP: f78a1f78 ESP: f78a1f58
>> [ 127.164134] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
>> [ 127.164134] Process events/1 (pid: 16, ti=f78a1000 task=f78c0920 task.ti=f78a1000)
>> [ 127.164134] Stack: 00000000 00000002 00000000 c0430b45 f78a1f90 f7a42fa4 f7867ed0 f7a42fa0
>> [ 127.164134] f78a1fac c0430b80 00000000 00000002 c0430b45 f881ccbd 5a5a5a5a 5a5a5a5a
>> [ 127.164134] 5a5a5a5a 5a5a5a5a f7867ed0 c043140a f7867ef8 f78a1fd0 c04314be 00000000
>> [ 127.164134] Call Trace:
>> [ 127.164134] [<c0430b45>] run_workqueue+0x80/0x186
>> [ 127.164134] [<c0430b80>] run_workqueue+0xbb/0x186
>> [ 127.164134] [<c0430b45>] run_workqueue+0x80/0x186
>> [ 127.164134] [<f881ccbd>] mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
>> [ 127.164134] [<c043140a>] worker_thread+0x0/0xbf
>> [ 127.164134] [<c04314be>] worker_thread+0xb4/0xbf
>> [ 127.164134] [<c043393d>] autoremove_wake_function+0x0/0x33
>> [ 127.164134] [<c043387b>] kthread+0x3b/0x64
>> [ 127.164134] [<c0433840>] kthread+0x0/0x64
>> [ 127.164134] [<c040468f>] kernel_thread_helper+0x7/0x10
>> [ 127.164134] =======================
>> [ 127.164134] Code: ff 8b 87 8c 00 00 00 e8 b0 6c 03 00 8b 87 8c 00 00 00 e8 6e f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 87 2a c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
>> [ 127.164134] EIP: [<f881ccc9>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a1f58
>> [ 127.164147] ---[ end trace c0dc9c06e06bc602 ]---
> --
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/
--
Thanks & Regards,
Kamalesh Babulal,
Linux Technology Center,
IBM, ISTL.
On Thu, 2008-06-26 at 15:38 -0500, David Teigland wrote:
> On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
> > Hi Stephen,
> >
> > Kernel bug is hit while booting up the next-20080325 kernel with MPT
> > Fusion driver built in.This was reported previously for the
> > next-20080320 kernel
> > http://marc.info/?l=linux-next&m=120601013920868&w=2
>
> Hi, did you ever get this fixed? I've been having the same problem,
> http://marc.info/?l=linux-scsi&m=121061780821823&w=4
> still exists on 2.6.26-rc8 for me,
>
> Loading scsi_transport_spi.ko module
> Loading mptscsih.ko module
> Loading mptspi.ko module
> Fusion MPT SPI Host driver 3.04.06
> ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
> mptbase: ioc0: Initiating bringup
> ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
> mptbase: ioc0: PCI-MSI enabled
I speculate that this might be related to the problem. The fusion
driver was recently flipped to default to MSI enable (previously it had
disabled them). Could you boot it with the fusion module parameter
mpt_msi_enable=0
if it actually boots that will be a good confirmation that MSI is to
blame.
> mptbase: ioc0: Initiating recovery
This I suspect is because of a lost interrupt.
> BUG: unable to handle kernel NULL pointer dereference at 0000000000000948
And this one looks to be a second bug: The MSI fault path is actually
causing this oops.
James
On Sun, Jul 06, 2008 at 02:23:35PM -0500, James Bottomley wrote:
> On Thu, 2008-06-26 at 15:38 -0500, David Teigland wrote:
> > On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
> > > Hi Stephen,
> > >
> > > Kernel bug is hit while booting up the next-20080325 kernel with MPT
> > > Fusion driver built in.This was reported previously for the
> > > next-20080320 kernel
> > > http://marc.info/?l=linux-next&m=120601013920868&w=2
> >
> > Hi, did you ever get this fixed? I've been having the same problem,
> > http://marc.info/?l=linux-scsi&m=121061780821823&w=4
> > still exists on 2.6.26-rc8 for me,
> >
> > Loading scsi_transport_spi.ko module
> > Loading mptscsih.ko module
> > Loading mptspi.ko module
> > Fusion MPT SPI Host driver 3.04.06
> > ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
> > mptbase: ioc0: Initiating bringup
> > ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
> > mptbase: ioc0: PCI-MSI enabled
>
> I speculate that this might be related to the problem. The fusion
> driver was recently flipped to default to MSI enable (previously it had
> disabled them). Could you boot it with the fusion module parameter
>
> mpt_msi_enable=0
>
> if it actually boots that will be a good confirmation that MSI is to
> blame.
It all works using that option.
Thanks,
Dave
On Mon, 2008-07-07 at 10:27 -0500, David Teigland wrote:
> On Sun, Jul 06, 2008 at 02:23:35PM -0500, James Bottomley wrote:
> > On Thu, 2008-06-26 at 15:38 -0500, David Teigland wrote:
> > > On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
> > > > Hi Stephen,
> > > >
> > > > Kernel bug is hit while booting up the next-20080325 kernel with MPT
> > > > Fusion driver built in.This was reported previously for the
> > > > next-20080320 kernel
> > > > http://marc.info/?l=linux-next&m=120601013920868&w=2
> > >
> > > Hi, did you ever get this fixed? I've been having the same problem,
> > > http://marc.info/?l=linux-scsi&m=121061780821823&w=4
> > > still exists on 2.6.26-rc8 for me,
> > >
> > > Loading scsi_transport_spi.ko module
> > > Loading mptscsih.ko module
> > > Loading mptspi.ko module
> > > Fusion MPT SPI Host driver 3.04.06
> > > ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
> > > mptbase: ioc0: Initiating bringup
> > > ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
> > > mptbase: ioc0: PCI-MSI enabled
> >
> > I speculate that this might be related to the problem. The fusion
> > driver was recently flipped to default to MSI enable (previously it had
> > disabled them). Could you boot it with the fusion module parameter
> >
> > mpt_msi_enable=0
> >
> > if it actually boots that will be a good confirmation that MSI is to
> > blame.
>
> It all works using that option.
It sounds like your motherboard might need to be added to the
drivers/pci/quirks.c with quirk_disable_all_msi ... can you try a
different card with MSI just to see if it's something specific either to
the fusion or the location in the system?
Thanks,
James
James Bottomley wrote:
> On Thu, 2008-06-26 at 15:38 -0500, David Teigland wrote:
>> On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
>>> Hi Stephen,
>>>
>>> Kernel bug is hit while booting up the next-20080325 kernel with MPT
>>> Fusion driver built in.This was reported previously for the
>>> next-20080320 kernel
>>> http://marc.info/?l=linux-next&m=120601013920868&w=2
>> Hi, did you ever get this fixed? I've been having the same problem,
>> http://marc.info/?l=linux-scsi&m=121061780821823&w=4
>> still exists on 2.6.26-rc8 for me,
>>
>> Loading scsi_transport_spi.ko module
>> Loading mptscsih.ko module
>> Loading mptspi.ko module
>> Fusion MPT SPI Host driver 3.04.06
>> ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
>> mptbase: ioc0: Initiating bringup
>> ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
>> mptbase: ioc0: PCI-MSI enabled
>
> I speculate that this might be related to the problem. The fusion
> driver was recently flipped to default to MSI enable (previously it had
> disabled them). Could you boot it with the fusion module parameter
>
> mpt_msi_enable=0
>
> if it actually boots that will be a good confirmation that MSI is to
> blame.
>
>> mptbase: ioc0: Initiating recovery
>
> This I suspect is because of a lost interrupt.
>
>> BUG: unable to handle kernel NULL pointer dereference at 0000000000000948
>
> And this one looks to be a second bug: The MSI fault path is actually
> causing this oops.
>
> James
>
Hi James,
Kernel Bug is still reproducible with 2.6.26-rc9-git1 kernel, I tried booting with mpt_msi_enable=0
as boot paramenter, but the call trace remains the same
oading mptbase.ko module[ 6.766853] Fusion MPT base driver 3.04.06
[ 6.783482] Copyright (c) 1999-2007 LSI Corporation
Loading mptscsih.ko module
Loading mptspi.ko module
[ 6.813157] Fusion MPT SPI Host driver 3.04.06
[ 6.827369] ACPI: PCI Interrupt 0000:01:01.0[A] -> GSI 22 (level, low) -> IRQ 22
[ 6.850624] mptbase: ioc0: Initiating bringup
[ 6.985601] ioc0: LSI53C1030 B2: Capabilities={Initiator}
[ 7.002711] mptbase: ioc0: PCI-MSI enabled
[ 17.193038] mptbase: ioc0: Initiating recovery
[ 17.206490] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
[ 17.224489] mptbase: ioc0: WARNING - FAULT code = 8112h
[ 22.243634] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
[ 39.507735] mptbase: ioc0: Recovered from IOC FAULT
[ 39.595707] Clocksource tsc unstable (delta = 18747111256 ns)
[ 39.624226] BUG: unable to handle kernel NULL pointer dereference at 00000528
[ 39.650488] IP: [<f881ccc3>] :mptspi:mptspi_dv_renegotiate_work+0xc/0xab
[ 39.676987] *pde = 00000000
[ 39.685169] Oops: 0000 [#1] SMP
[ 39.695127] Modules linked in: mptspi(+) mptscsih mptbase scsi_transport_spi sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
[ 39.730967]
[ 39.730967] Pid: 17, comm: events/2 Not tainted (2.6.26-rc9-git1-autokern1 #1)
[ 39.730967] EIP: 0060:[<f881ccc3>] EFLAGS: 00010282 CPU: 2
[ 39.730967] EIP is at mptspi_dv_renegotiate_work+0xc/0xab [mptspi]
[ 39.730967] EAX: f7a2c188 EBX: f7a2c18c ECX: f78a0500 EDX: 00000001
[ 39.730967] ESI: f7866c70 EDI: 00000528 EBP: f78a1f78 ESP: f78a1f58
[ 39.730967] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[ 39.730967] Process events/2 (pid: 17, ti=f78a1000 task=f78a0020 task.ti=f78a1000)
[ 39.730967] Stack: 00000000 00000002 00000000 c043193d f78a1f90 f7a2c18c f7866c70 f7a2c188
[ 39.730967] f78a1fac c0431978 00000000 00000002 c043193d f881ccb7 f8820308 c091289c
[ 39.730967] f881d86c 00000002 f7866c70 c04321dd f7866c98 f78a1fd0 c0432291 00000000
[ 39.730967] Call Trace:
[ 39.730967] [<c043193d>] ? run_workqueue+0x80/0x186
[ 39.730967] [<c0431978>] ? run_workqueue+0xbb/0x186
[ 39.730967] [<c043193d>] ? run_workqueue+0x80/0x186
[ 39.730967] [<f881ccb7>] ? mptspi_dv_renegotiate_work+0x0/0xab [mptspi]
[ 39.730967] [<c04321dd>] ? worker_thread+0x0/0xbf
[ 39.730967] [<c0432291>] ? worker_thread+0xb4/0xbf
[ 39.730967] [<c04346f9>] ? autoremove_wake_function+0x0/0x33
[ 39.730967] [<c0434638>] ? kthread+0x3b/0x63
[ 39.730967] [<c04345fd>] ? kthread+0x0/0x63
[ 39.730967] [<c0404613>] ? kernel_thread_helper+0x7/0x10
[ 39.730967] =======================
[ 39.730967] Code: ff 8b 87 8c 00 00 00 e8 bd 27 03 00 8b 87 8c 00 00 00 e8 74 f8 ff ff 8d 65 f4 5b 5e 5f 5d c3 55 89 e5 57 56 53 83 ec 14 8b 78 20 <8b> 17 89 55 e0 e8 89 4c c5 c7 8b 55 e0 66 83 bf b2 02 00 00 00
[ 40.184778] EIP: [<f881ccc3>] mptspi_dv_renegotiate_work+0xc/0xab [mptspi] SS:ESP 0068:f78a1f58
[ 40.649493] ---[ end trace 8850b96835b44d26 ]---
[ 50.659034] mptbase: ioc0: Initiating recovery
[ 50.672478] mptbase: ioc0: WARNING - IOC is in FAULT state!!!
[ 50.690478] mptbase: ioc0: WARNING - FAULT code = 8112h
[ 55.709559] mptbase: ioc0: ERROR - Doorbell ACK timeout (count=4999), IntStatus=80000009!
[ 72.976560] mptbase: ioc0: Recovered from IOC FAULT
0xcc3 is in mptspi_dv_renegotiate_work (drivers/message/fusion/mptspi.c:1228).
1223 struct _MPT_SCSI_HOST *hd = wqw->hd;
1224 struct scsi_device *sdev;
1225 struct scsi_target *starget;
1226 struct _CONFIG_PAGE_SCSI_DEVICE_1 pg1;
1227 u32 nego;
1228 MPT_ADAPTER *ioc = hd->ioc;
1229
1230 kfree(wqw);
1231
1232 if (hd->spi_pending) {
--
Thanks & Regards,
Kamalesh Babulal,
Linux Technology Center,
IBM, ISTL.
On Tue, 2008-07-08 at 01:47 +0530, Kamalesh Babulal wrote:
> James Bottomley wrote:
> > On Thu, 2008-06-26 at 15:38 -0500, David Teigland wrote:
> >> On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
> >>> Hi Stephen,
> >>>
> >>> Kernel bug is hit while booting up the next-20080325 kernel with MPT
> >>> Fusion driver built in.This was reported previously for the
> >>> next-20080320 kernel
> >>> http://marc.info/?l=linux-next&m=120601013920868&w=2
> >> Hi, did you ever get this fixed? I've been having the same problem,
> >> http://marc.info/?l=linux-scsi&m=121061780821823&w=4
> >> still exists on 2.6.26-rc8 for me,
> >>
> >> Loading scsi_transport_spi.ko module
> >> Loading mptscsih.ko module
> >> Loading mptspi.ko module
> >> Fusion MPT SPI Host driver 3.04.06
> >> ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
> >> mptbase: ioc0: Initiating bringup
> >> ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
> >> mptbase: ioc0: PCI-MSI enabled
> >
> > I speculate that this might be related to the problem. The fusion
> > driver was recently flipped to default to MSI enable (previously it had
> > disabled them). Could you boot it with the fusion module parameter
> >
> > mpt_msi_enable=0
> >
> > if it actually boots that will be a good confirmation that MSI is to
> > blame.
> >
> >> mptbase: ioc0: Initiating recovery
> >
> > This I suspect is because of a lost interrupt.
> >
> >> BUG: unable to handle kernel NULL pointer dereference at 0000000000000948
> >
> > And this one looks to be a second bug: The MSI fault path is actually
> > causing this oops.
> >
> > James
> >
> Hi James,
>
> Kernel Bug is still reproducible with 2.6.26-rc9-git1 kernel, I tried booting with mpt_msi_enable=0
> as boot paramenter, but the call trace remains the same
>
> oading mptbase.ko module[ 6.766853] Fusion MPT base driver 3.04.06
>
> [ 6.783482] Copyright (c) 1999-2007 LSI Corporation
> Loading mptscsih.ko module
> Loading mptspi.ko module
> [ 6.813157] Fusion MPT SPI Host driver 3.04.06
> [ 6.827369] ACPI: PCI Interrupt 0000:01:01.0[A] -> GSI 22 (level, low) -> IRQ 22
> [ 6.850624] mptbase: ioc0: Initiating bringup
> [ 6.985601] ioc0: LSI53C1030 B2: Capabilities={Initiator}
> [ 7.002711] mptbase: ioc0: PCI-MSI enabled
^^^^^^^^^^^^^^^
If you had successfully specified mpt_msi_enable=0 this line wouldn't
appear.
My best guess is that you incorrectly specified it as a kernel command
line parameter rather than a module parameter.
James
James Bottomley wrote:
> On Tue, 2008-07-08 at 01:47 +0530, Kamalesh Babulal wrote:
>> James Bottomley wrote:
>>> On Thu, 2008-06-26 at 15:38 -0500, David Teigland wrote:
>>>> On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
>>>>> Hi Stephen,
>>>>>
>>>>> Kernel bug is hit while booting up the next-20080325 kernel with MPT
>>>>> Fusion driver built in.This was reported previously for the
>>>>> next-20080320 kernel
>>>>> http://marc.info/?l=linux-next&m=120601013920868&w=2
>>>> Hi, did you ever get this fixed? I've been having the same problem,
>>>> http://marc.info/?l=linux-scsi&m=121061780821823&w=4
>>>> still exists on 2.6.26-rc8 for me,
>>>>
>>>> Loading scsi_transport_spi.ko module
>>>> Loading mptscsih.ko module
>>>> Loading mptspi.ko module
>>>> Fusion MPT SPI Host driver 3.04.06
>>>> ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
>>>> mptbase: ioc0: Initiating bringup
>>>> ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
>>>> mptbase: ioc0: PCI-MSI enabled
>>> I speculate that this might be related to the problem. The fusion
>>> driver was recently flipped to default to MSI enable (previously it had
>>> disabled them). Could you boot it with the fusion module parameter
>>>
>>> mpt_msi_enable=0
>>>
>>> if it actually boots that will be a good confirmation that MSI is to
>>> blame.
>>>
>>>> mptbase: ioc0: Initiating recovery
>>> This I suspect is because of a lost interrupt.
>>>
>>>> BUG: unable to handle kernel NULL pointer dereference at 0000000000000948
>>> And this one looks to be a second bug: The MSI fault path is actually
>>> causing this oops.
>>>
>>> James
>>>
>> Hi James,
>>
>> Kernel Bug is still reproducible with 2.6.26-rc9-git1 kernel, I tried booting with mpt_msi_enable=0
>> as boot paramenter, but the call trace remains the same
>>
>> oading mptbase.ko module[ 6.766853] Fusion MPT base driver 3.04.06
>>
>> [ 6.783482] Copyright (c) 1999-2007 LSI Corporation
>> Loading mptscsih.ko module
>> Loading mptspi.ko module
>> [ 6.813157] Fusion MPT SPI Host driver 3.04.06
>> [ 6.827369] ACPI: PCI Interrupt 0000:01:01.0[A] -> GSI 22 (level, low) -> IRQ 22
>> [ 6.850624] mptbase: ioc0: Initiating bringup
>> [ 6.985601] ioc0: LSI53C1030 B2: Capabilities={Initiator}
>> [ 7.002711] mptbase: ioc0: PCI-MSI enabled
> ^^^^^^^^^^^^^^^
>
> If you had successfully specified mpt_msi_enable=0 this line wouldn't
> appear.
>
> My best guess is that you incorrectly specified it as a kernel command
> line parameter rather than a module parameter.
>
> James
>
Hi James,
Sorry my fault :(, the kernel bug is not reproducible after disabling it.
--
Thanks & Regards,
Kamalesh Babulal,
Linux Technology Center,
IBM, ISTL.
On Mon, Jul 07, 2008 at 10:43:14AM -0500, James Bottomley wrote:
> On Mon, 2008-07-07 at 10:27 -0500, David Teigland wrote:
> > On Sun, Jul 06, 2008 at 02:23:35PM -0500, James Bottomley wrote:
> > > On Thu, 2008-06-26 at 15:38 -0500, David Teigland wrote:
> > > > On Wed, Mar 26, 2008 at 12:14:00PM +0530, Kamalesh Babulal wrote:
> > > > > Hi Stephen,
> > > > >
> > > > > Kernel bug is hit while booting up the next-20080325 kernel with MPT
> > > > > Fusion driver built in.This was reported previously for the
> > > > > next-20080320 kernel
> > > > > http://marc.info/?l=linux-next&m=120601013920868&w=2
> > > >
> > > > Hi, did you ever get this fixed? I've been having the same problem,
> > > > http://marc.info/?l=linux-scsi&m=121061780821823&w=4
> > > > still exists on 2.6.26-rc8 for me,
> > > >
> > > > Loading scsi_transport_spi.ko module
> > > > Loading mptscsih.ko module
> > > > Loading mptspi.ko module
> > > > Fusion MPT SPI Host driver 3.04.06
> > > > ACPI: PCI Interrupt 0000:86:01.0[A] -> GSI 32 (level, low) -> IRQ 32
> > > > mptbase: ioc0: Initiating bringup
> > > > ioc0: LSI53C1030 B2: Capabilities={Initiator,Target}
> > > > mptbase: ioc0: PCI-MSI enabled
> > >
> > > I speculate that this might be related to the problem. The fusion
> > > driver was recently flipped to default to MSI enable (previously it had
> > > disabled them). Could you boot it with the fusion module parameter
> > >
> > > mpt_msi_enable=0
> > >
> > > if it actually boots that will be a good confirmation that MSI is to
> > > blame.
> >
> > It all works using that option.
>
> It sounds like your motherboard might need to be added to the
> drivers/pci/quirks.c with quirk_disable_all_msi ... can you try a
> different card with MSI just to see if it's something specific either to
> the fusion or the location in the system?
I don't have easy access to the hardware, it's just some lab machine I was
told I could use for unrelated testing. Until this came up I'd never
heard of "MSI", or "Fusion" before, and still don't have much idea about
they are :-)
Dave