2009-07-04 07:16:05

by Mikko C.

[permalink] [raw]
Subject: kernel BUG at fs/notify/notification.c:93!

Hi,
since upgrading to 2.6.31-rc1 I randomly get this:

kernel BUG at
fs/notify/notification.c:93!

invalid opcode: 0000 [#1] PREEMPT
SMP

last sysfs file:
/sys/devices/LNXSYSTM:00/device:00/PNP0C0A:00/power_supply/BAT0/charge_full

CPU
0

Modules linked in: radeon drm snd_seq snd_seq_device vfat fat coretemp
hwmon cpufreq_ondemand fan snd_hda_codec_idt iwl3945 iwlcore mac80211
snd_hda_intel snd_hda_codec led_class usb_storage snd_pcm snd_timer
cfg80211 psmouse dell_laptop snd uhci_hcd evdev rfkill soundcore
ehci_hcd snd_page_alloc ac thermal wmi battery
button
Pid: 5658, comm: krunner Not tainted 2.6.31-rc1 #2
MM061
RIP: 0010:[<ffffffff810db63a>] [<ffffffff810db63a>]
fsnotify_put_event+0x5a/0x60
RSP: 0018:ffff8800658b9e58 EFLAGS:
00010287

RAX: ffff880073fd4c18 RBX: ffff880073fd4bc0 RCX:
0000000000000000
RDX: 0000000000000001 RSI: ffffffff8162bf53 RDI:
ffff880073fd4bc0
RBP: 0000000000000040 R08: 0000000001b47300 R09:
ffff88007efd1f00
R10: 0000000001a92490 R11: 0000000000000246 R12:
ffff880073fd4bc0
R13: 00007fffa83d7e80 R14: 0000000000000003 R15:
ffff88007ef599c0
FS: 00007f824ea26750(0000) GS:ffff880001651000(0000)
knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0:
000000008005003b

CR2: 00007f63b12c5000 CR3: 00000000658c4000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process krunner (pid: 5658, threadinfo ffff8800658b8000, task
ffff88007f9ead20)
Stack:
0000000000000030 ffffffff810ddf35 0000000000000240 00007fffa83d7e80
<0> ffff88007efd1f00 ffff88007ef59a18 ffff88007ef599e8 0000002da83d91ac
<0> 0000000000000000 ffff88007f9ead20 ffffffff81051b90 ffff88007ef59a20
Call Trace:
[<ffffffff810ddf35>] ? inotify_read+0x2e5/0x310
[<ffffffff81051b90>] ? autoremove_wake_function+0x0/0x30
[<ffffffff810ba30b>] ? do_vfs_ioctl+0x3fb/0x580
[<ffffffff810ac028>] ? vfs_read+0xc8/0x1a0
[<ffffffff810ac203>] ? sys_read+0x53/0xa0
[<ffffffff8100b3eb>] ? system_call_fastpath+0x16/0x1b
Code: ff 48 89 de 48 8b 3d 4e 08 55 00 5b e9 a0 a7 fc ff 5b c3 66 0f 1f
44 00 00 48 8d 7f 28 e8 4f 8d fd ff 0f 1f 80 00 00 00 00 eb c3 <0f> 0b
eb fe 66 90 41 57 4c 8d 7f 28 41 56 41 55 41 54 55 48 89
RIP [<ffffffff810db63a>] fsnotify_put_event+0x5a/0x60
RSP <ffff8800658b9e58>
---[ end trace b3964c675905b0bb ]---

Sorry if it's been reported already, I couldn't find it.
I'm not subscribed so please CC me if you answer.
Thanks.

Mikko


2009-07-04 14:41:12

by Eric Paris

[permalink] [raw]
Subject: Re: kernel BUG at fs/notify/notification.c:93!

On Sat, 2009-07-04 at 10:16 +0200, Mikko C. wrote:
> Hi,
> since upgrading to 2.6.31-rc1 I randomly get this:
>
> kernel BUG at
> fs/notify/notification.c:93!

This is the first time I've heard about it. I'm looking for the problem
but I don't see it right offhand. Is there anything you know of that
causes it? I'm looking though....

-Eric

2009-07-04 14:50:18

by Mikko C.

[permalink] [raw]
Subject: Re: kernel BUG at fs/notify/notification.c:93!

Eric Paris wrote:
>
> This is the first time I've heard about it. I'm looking for the problem
> but I don't see it right offhand. Is there anything you know of that
> causes it? I'm looking though....
>

No, sorry - it seems very random to me. Most of the times the related
process is krunner.
But sometimes it can be something else, for example firefox:

Jul 2 23:16:43 gentoo kernel BUG at
fs/notify/notification.c:93!

Jul 2 23:16:43 gentoo invalid opcode: 0000 [#2] PREEMPT
SMP
Jul 2 23:16:43 gentoo last sysfs file:
/sys/devices/LNXSYSTM:00/device:00/PNP0C0A:00/power_supply/BAT0/charge_full

Jul 2 23:16:43 gentoo CPU
0

Jul 2 23:16:43 gentoo Modules linked in: radeon drm snd_seq
snd_seq_device vfat fat coretemp hwmon cpufreq_ondemand fan iwl3945
iwlcore usb_storage snd_hda_codec_idt snd_hda_intel mac80211
snd_hda_codec led_class snd_pcm snd_timer snd cfg80211 soundcore
snd_page_alloc dell_laptop psmouse rfkill ehci_hcd button battery ac
thermal uhci_hcd evdev wmi
Jul 2 23:16:43 gentoo Pid: 5736, comm: firefox Tainted: G D
2.6.31-rc1 #2 MM061
Jul 2 23:16:43 gentoo RIP: 0010:[<ffffffff810db63a>]
[<ffffffff810db63a>] fsnotify_put_event+0x5a/0x60
Jul 2 23:16:43 gentoo RSP: 0018:ffff880063d7de58 EFLAGS:
00010283
Jul 2 23:16:43 gentoo RAX: ffff88007faac258 RBX: ffff88007faac200 RCX:
00007fa00dc5e820
Jul 2 23:16:43 gentoo RDX: 0000000000000001 RSI: ffffffff8162bf50 RDI:
ffff88007faac200
Jul 2 23:16:43 gentoo RBP: 0000000000000020 R08: ffff880063d7c000 R09:
ffff88006e27b828
Jul 2 23:16:43 gentoo R10: 000000000000000a R11: 0000000000000000 R12:
ffff88007faac200
Jul 2 23:16:43 gentoo R13: 00007fa00dc5e800 R14: 0000000000000000 R15:
ffff88007fab9cc0
Jul 2 23:16:43 gentoo FS: 00007fa02cb0f700(0000)
GS:ffff880001651000(0000) knlGS:0000000000000000
Jul 2 23:16:43 gentoo CS: 0010 DS: 0000 ES: 0000 CR0:
0000000080050033
Jul 2 23:16:43 gentoo CR2: 00007fce069899c4 CR3: 0000000063d7b000 CR4:
00000000000006f0
Jul 2 23:16:43 gentoo DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
Jul 2 23:16:43 gentoo DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
Jul 2 23:16:43 gentoo Process firefox (pid: 5736, threadinfo
ffff880063d7c000, task ffff88007f98a3c0)
Jul 2 23:16:43 gentoo
Stack:

Jul 2 23:16:43 gentoo 0000000000000010 ffffffff810ddf35
0000000000000400 00007fa00dc5e800
Jul 2 23:16:43 gentoo <0> ffff880057737800 ffff88007fab9d18
ffff88007fab9ce8 0000001080b907cc
Jul 2 23:16:43 gentoo <0> 0000000000000000 ffff88007f98a3c0
ffffffff81051b90 ffff88007fab9d20
Jul 2 23:16:43 gentoo Call
Trace:

Jul 2 23:16:43 gentoo [<ffffffff810ddf35>] ?
inotify_read+0x2e5/0x310

Jul 2 23:16:43 gentoo [<ffffffff81051b90>] ?
autoremove_wake_function+0x0/0x30

Jul 2 23:16:43 gentoo [<ffffffff810ba30b>] ?
do_vfs_ioctl+0x3fb/0x580

Jul 2 23:16:43 gentoo [<ffffffff810ac028>] ?
vfs_read+0xc8/0x1a0

Jul 2 23:16:43 gentoo [<ffffffff810ac203>] ?
sys_read+0x53/0xa0

Jul 2 23:16:43 gentoo [<ffffffff8100b3eb>] ?
system_call_fastpath+0x16/0x1b

Jul 2 23:16:43 gentoo Code: ff 48 89 de 48 8b 3d 4e 08 55 00 5b e9 a0
a7 fc ff 5b c3 66 0f 1f 44 00 00 48 8d 7f 28 e8 4f 8d fd ff 0f 1f 80 00
00 00 00 eb c3 <0f> 0b eb fe 66 90 41 57 4c 8d 7f 28 41 56 41 55 41 54
55 48 89
Jul 2 23:16:43 gentoo RIP [<ffffffff810db63a>]
fsnotify_put_event+0x5a/0x60
Jul 2 23:16:43 gentoo RSP
<ffff880063d7de58>

Jul 2 23:16:43 gentoo ---[ end trace ac2beffdbb4340c9 ]---

Please let me know if you need more info.

2009-07-06 20:30:44

by Eric Paris

[permalink] [raw]
Subject: Re: kernel BUG at fs/notify/notification.c:93!

On Sat, 2009-07-04 at 16:50 +0200, Mikko C. wrote:
> Eric Paris wrote:
> >
> > This is the first time I've heard about it. I'm looking for the problem
> > but I don't see it right offhand. Is there anything you know of that
> > causes it? I'm looking though....
> >
>
> No, sorry - it seems very random to me. Most of the times the related
> process is krunner.
> But sometimes it can be something else, for example firefox:

Mikko sent a third trace offlist, even more interesting since this one
looks like we are hitting things with LIST_POISON1, which looks friggin
impossible to me as I never call list_del() on the anchor of the list
and the only way we run the list is via the list_for_each_entry() macro
starting at the anchor. So how in the hell can LIST_POISON1 ever be
possible? I'm at a loss, but haven't given up yet. All running,
addition, and removal of the list are done holding a spin_lock declared
in the same struct as the list anchor (struct fsnotify_event -> lock).

Anyone have suggestions on common ways someone can end up with
LIST_POISON1 biting them?

BUG: unable to handle kernel paging request at 0000000000100100
IP: [<ffffffff810dba99>] fsnotify_remove_priv_from_event+0x29/0x70
PGD 68f2b067 PUD 68f2c067 PMD 0
Oops: 0000 [#1] PREEMPT SMP
last sysfs file: 00/device:00/PNP0C0A:00/power_supply/BAT0/charge_full
CPU 1
Modules linked in: radeon drm snd_seq snd_seq_device vfat fat coretemp hwmon cpufreq_ondemand fan iwl3945 usb_storage iwlcore mac80211 led_class snd_hda_codec_idt snd_hda_intel cfg80211 snd_hda_codec snd_pcm snd_timer snd psmouse dell_laptop soundcore evdev uhci_hcd snd_page_alloc rfkill ehci_hcd wmi ac button battery thermal
Pid: 5653, comm: dolphin Not tainted 2.6.31-rc2 #1 MM061
RIP: 0010:[<ffffffff810dba99>] [<ffffffff810dba99>] fsnotify_remove_priv_from_event+0x29/0x70
RSP: 0018:ffff880068ddfe60 EFLAGS: 00010206
RAX: ffff880072ef6ab8 RBX: ffff880072e2e5c0 RCX: 0000000000100100
RDX: 00000000001000f8 RSI: ffff880072e2e600 RDI: ffff88007eee00c0
RBP: 0000000000000020 R08: 0000000002244470 R09: ffff88007b348540
R10: 000000000219b080 R11: 0000000000000246 R12: ffff880072e2e5a8
R13: 00007ffff5ac6970 R14: 0000000000000000 R15: ffff88007eee00c0
FS: 00007f1afd69e750(0000) GS:ffff88000166a000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000100100 CR3: 0000000068f2a000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process dolphin (pid: 5653, threadinfo ffff880068dde000, task ffff88007f8f4140)
Stack:

ffffffff810de304 0000000000000020 00007ffff5ac6970 ffff88007b348540
<0> ffff88007eee0118 ffff88007eee00e8 00007ffff5ac7c9c 0000000000000000
<0> ffff88007f8f4140 ffffffff81051c20 ffff88007eee0120 ffff88007eee0120
Call Trace:

[<ffffffff810de304>] ? inotify_read+0x184/0x310
[<ffffffff81051c20>] ? autoremove_wake_function+0x0/0x30
[<ffffffff810ba80b>] ? do_vfs_ioctl+0x3fb/0x580
[<ffffffff810ac528>] ? vfs_read+0xc8/0x1a0
[<ffffffff810ac703>] ? sys_read+0x53/0xa0
[<ffffffff8100b3eb>] ? system_call_fastpath+0x16/0x1b