2004-01-26 23:47:06

by David Martínez Moreno

[permalink] [raw]
Subject: kernel BUG at include/linux/list.h:148!

Hello, I'm using -mm branch since 2.6.0-pre kernels, and now I'm finding
problems (well, *another* type of problems) since 2.6.1-rc1-mm2. Last kernel
without this error was 2.6.1-rc2-mm1.

The error is always the same (at least to me, poor non kernel-hacker):

------------[ cut here ]------------
kernel BUG at include/linux/list.h:148!
invalid operand: 0000 [#1]
CPU: 0
EIP: 0060:[<c012ee1d>] Not tainted VLI
EFLAGS: 00010203
EIP is at __remove_from_page_cache+0x71/0x7b
eax: c13e22b8 ebx: dd2058bc ecx: c13e22c0 edx: c1122c90
esi: c13e22b8 edi: dfdb5e60 ebp: dd2058bc esp: dfdb5d88
ds: 007b es: 007b ss: 0068
Process kswapd0 (pid: 8, threadinfo=dfdb4000 task=dfdbace0)
Stack: dd2058c0 000145cd 00000001 c13e22b8 c0137e0b c13e22b8 c02fce0a d6df1480
00000001 000000b1 00000000 dfdb5db4 dfdb5db4 dfdb5dc0 00000003 c04ade88
00000001 c10502f8 c03d58b4 00000003 c04b0560 00000001 00000001 c13a7c80
Call Trace:
[<c0137e0b>] shrink_list+0x2c0/0x476
[<c02fce0a>] __kfree_skb+0x68/0xd9
[<c013813f>] shrink_cache+0x17e/0x2df
[<c015b74b>] shrink_dcache_memory+0x23/0x25
[<c0137a76>] shrink_slab+0x11b/0x15e
[<c0138b4a>] balance_pgdat+0x18e/0x21e
[<c0138cec>] kswapd+0x112/0x122
[<c011add5>] autoremove_wake_function+0x0/0x4f
[<c011add5>] autoremove_wake_function+0x0/0x4f
[<c0138bda>] kswapd+0x0/0x122
[<c0108d45>] kernel_thread_helper+0x5/0xb

Code: 01 10 00 c7 46 10 00 00 00 00 83 6b 30 01 83 05 80 04 4b c0 ff 8b 74 24 0c 8b 5c 24 08 83 c4 10 c3 0f 0b 95 00 f6 e8 38 c0 eb c5 <0f> 0b 94 00 f6 e8 38 c0 eb b3 8b 54 24 04 8b 02 f7 d0 a8 01 75

This error, for example, is printed with the server's current kernel,
2.6.2-rc1-mm1.

The machine is P4 with XFS over RAID0 by software, Apache2 in thread mode,
and root filesystem on ext3.

Please don't hesitate to request further info.

Kind regards,


Ender.


2004-01-27 00:14:58

by Andrew Morton

[permalink] [raw]
Subject: Re: kernel BUG at include/linux/list.h:148!

David Mart?nez Moreno <[email protected]> wrote:
>
> Hello, I'm using -mm branch since 2.6.0-pre kernels, and now I'm finding
> problems (well, *another* type of problems) since 2.6.1-rc1-mm2. Last kernel
> without this error was 2.6.1-rc2-mm1.
>
> The error is always the same (at least to me, poor non kernel-hacker):
>
> ------------[ cut here ]------------
> kernel BUG at include/linux/list.h:148!
> invalid operand: 0000 [#1]
> CPU: 0
> EIP: 0060:[<c012ee1d>] Not tainted VLI
> EFLAGS: 00010203
> EIP is at __remove_from_page_cache+0x71/0x7b
> eax: c13e22b8 ebx: dd2058bc ecx: c13e22c0 edx: c1122c90
> esi: c13e22b8 edi: dfdb5e60 ebp: dd2058bc esp: dfdb5d88
> ds: 007b es: 007b ss: 0068
> Process kswapd0 (pid: 8, threadinfo=dfdb4000 task=dfdbace0)
> Stack: dd2058c0 000145cd 00000001 c13e22b8 c0137e0b c13e22b8 c02fce0a d6df1480
> 00000001 000000b1 00000000 dfdb5db4 dfdb5db4 dfdb5dc0 00000003 c04ade88
> 00000001 c10502f8 c03d58b4 00000003 c04b0560 00000001 00000001 c13a7c80
> Call Trace:
> [<c0137e0b>] shrink_list+0x2c0/0x476
> [<c02fce0a>] __kfree_skb+0x68/0xd9
> [<c013813f>] shrink_cache+0x17e/0x2df
> [<c015b74b>] shrink_dcache_memory+0x23/0x25
> [<c0137a76>] shrink_slab+0x11b/0x15e

Someone else was seeing something similar. Reverting

ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.2-rc1/2.6.2-rc1-mm1/broken-out/sysfs-pin-kobject.patch

apparently fixed it.

2004-01-27 00:28:21

by David Martínez Moreno

[permalink] [raw]
Subject: Re: kernel BUG at include/linux/list.h:148!

El Martes, 27 de Enero de 2004 01:16, Andrew Morton escribi?:
> Someone else was seeing something similar. Reverting
>
>ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.2-rc1/2.6.2-rc1-mm1/broken-out/sysfs-pin-kobject.patch
>
> apparently fixed it.

Andrew, thank you very much for your prompt response.

I'll try tomorrow, as it seemed to had hung and I need to go to the
university to reboot it.

Have you kicked it out of your next release? I see that is into
2.6.2-rc1-mm3.

Anyway I'll take a look and will report to you.

Thanks again,


Ender.
--
Network engineer
Debian developer

2004-01-27 15:23:51

by David Martínez Moreno

[permalink] [raw]
Subject: Re: kernel BUG at include/linux/list.h:148!

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

El Martes, 27 de Enero de 2004 01:16, Andrew Morton escribi?:

Andrew, sorry, but it seems that sysfs-pin-object is not guilty:

[...]
b44: eth0: Flow control is on for TX and on for RX.
eth0: no IPv6 routers present
- ------------[ cut here ]------------
kernel BUG at include/linux/list.h:148!
invalid operand: 0000 [#1]
CPU: 0
EIP: 0060:[<c012ee1d>] Not tainted VLI
EFLAGS: 00010202
EIP is at __remove_from_page_cache+0x71/0x7b
eax: c13e22b8 ebx: c844b13c ecx: c13e22c0 edx: c12ed480
esi: c13e22b8 edi: dfdb5e60 ebp: c844b13c esp: dfdb5d88
ds: 007b es: 007b ss: 0068
Process kswapd0 (pid: 8, threadinfo=dfdb4000 task=dfdbace0)
Stack: c844b140 00005850 00000001 c13e22b8 c0137e0b c13e22b8 c02fcdbe cc636380
00000001 00000029 00000000 dfdb5db4 dfdb5db4 dfdb5dc0 0000012c 00000001
00000001 c10e73b0 c03d58b4 00000001 c04b05c0 00000009 00000001 c114a078
Call Trace:
[<c0137e0b>] shrink_list+0x2c0/0x476
[<c02fcdbe>] __kfree_skb+0x68/0xd9
[<c013813f>] shrink_cache+0x17e/0x2df
[<c0138774>] shrink_zone+0x77/0x9b
[<c0138b4a>] balance_pgdat+0x18e/0x21e
[<c0138cec>] kswapd+0x112/0x122
[<c011add5>] autoremove_wake_function+0x0/0x4f
[<c011add5>] autoremove_wake_function+0x0/0x4f
[<c0138bda>] kswapd+0x0/0x122
[<c0108d45>] kernel_thread_helper+0x5/0xb

Code: 01 10 00 c7 46 10 00 00 00 00 83 6b 30 01 83 05 80 04 4b c0 ff 8b 74 24 0c 8b 5c 24 08 83 c4 10 c3 0f 0b 95 00 96 e8 38 c0 eb c5 <0f> 0b 94 00 96 e8 38 c0 eb b3 8b 54 24 04 8b 02 f7 d0 a8 01 75

The box keeps running, but it seems that starts to
degrade itself until becomes unresponsive.

Now it's running 2.6.2-rc1-mm1 without sysfs-pin-kobject.patch.

Any other hint?

> David Mart?nez Moreno <[email protected]> wrote:
> > Hello, I'm using -mm branch since 2.6.0-pre kernels, and now I'm finding
> > problems (well, *another* type of problems) since 2.6.1-rc1-mm2. Last
> > kernel without this error was 2.6.1-rc2-mm1.
> >
> > The error is always the same (at least to me, poor non kernel-hacker):
> >
> > ------------[ cut here ]------------
> > kernel BUG at include/linux/list.h:148!
> > invalid operand: 0000 [#1]
> > CPU: 0
> > EIP: 0060:[<c012ee1d>] Not tainted VLI
> > EFLAGS: 00010203
> > EIP is at __remove_from_page_cache+0x71/0x7b
> > eax: c13e22b8 ebx: dd2058bc ecx: c13e22c0 edx: c1122c90
> > esi: c13e22b8 edi: dfdb5e60 ebp: dd2058bc esp: dfdb5d88
> > ds: 007b es: 007b ss: 0068
> > Process kswapd0 (pid: 8, threadinfo=dfdb4000 task=dfdbace0)
> > Stack: dd2058c0 000145cd 00000001 c13e22b8 c0137e0b c13e22b8 c02fce0a
> > d6df1480 00000001 000000b1 00000000 dfdb5db4 dfdb5db4 dfdb5dc0 00000003
> > c04ade88 00000001 c10502f8 c03d58b4 00000003 c04b0560 00000001 00000001
> > c13a7c80 Call Trace:
> > [<c0137e0b>] shrink_list+0x2c0/0x476
> > [<c02fce0a>] __kfree_skb+0x68/0xd9
> > [<c013813f>] shrink_cache+0x17e/0x2df
> > [<c015b74b>] shrink_dcache_memory+0x23/0x25
> > [<c0137a76>] shrink_slab+0x11b/0x15e
>
> Someone else was seeing something similar. Reverting
>
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.2-rc1/2.
>6.2-rc1-mm1/broken-out/sysfs-pin-kobject.patch
>
> apparently fixed it.

- --
Yes I'm old. Old enough to remember when the MCP was just a chess program!
-- Dumont (Tron)
- --
Servicios de red - Network services
RedIRIS - Spanish Academic Network for Research and Development
Madrid (Spain)
Tlf (+34) 91.585.51.50
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.3 (GNU/Linux)

iD8DBQFAFoJ9Ws/EhA1iABsRAv1RAJ0ZKC74oEWXWpU/JtDdSozixeGgHACglUUe
20+W3HIHAgSZIv4I8hXeLHM=
=s8BQ
-----END PGP SIGNATURE-----