2002-01-17 14:23:52

by Matt Bernstein

[permalink] [raw]
Subject: probably very irrelevant oops

Hi,

I built a fairly pathological kernel based on 2.4.17 with sched-O1-H7,
ext3-0.9.17, XFS, jfs-1.0.12 and Intel's e100. (Things are orthogonal
enough that they patch together easily :)

It boots fine (but not with devfs) and I can use all four journaled
filesystems together happily. So I thought I'd try two very stupid stress
tests.

find /lib/modules/2.4.17-expt/kernel/ -type f|while read i; do insmod $i; done

[Great. A few hundred modules load, no doubt some clashing with others.
However, lsmod seems to suggest we've overrun a buffer as the output
looks truncated (my find produced 597 lines, but lsmod only 300 or so)]

awk '{print $1}' /proc/modules|while read i; do rmmod $i; done

[BOOM. But I was asking for it.. the first three warnings are the modules
in my initrd]

ksymoops 2.4.1 on i686 2.4.17-expt. Options used
-V (default)
-k /proc/ksyms (default)
-l /proc/modules (default)
-o /lib/modules/2.4.17-expt/ (default)
-m /boot/System.map-2.4.17-expt (default)

Warning: You did not tell me where to find symbol information. I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc. ksymoops -h explains the options.

Warning (expand_objects): object /lib/modules/2.4.17-expt/kernel/fs/reiserfs/reiserfs.o for module reiserfs has changed since load
Warning (expand_objects): object /lib/modules/2.4.17-expt/kernel/drivers/ide/ide-disk.o for module ide-disk has changed since load
Warning (expand_objects): object /lib/modules/2.4.17-expt/kernel/drivers/ide/ide-mod.o for module ide-mod has changed since load
Warning (compare_ksyms_lsmod): module ip_conntrack is in lsmod but not in ksyms, probably no symbols exported
Warning (compare_ksyms_lsmod): module nls_cp9 is in lsmod but not in ksyms, probably no symbols exported
Error (compare_ksyms_lsmod): module nls_cp950 is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module nls_big5 is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module nls_cp1255 is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module nls_cp1251 is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module hfs is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module fat is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module efs is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module bfs is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module e100 is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module xfs is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module xfs_dmapi is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module xfs_support is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module pagebuf is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module nfs is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module nfsd is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module lockd is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module sunrpc is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module ext3 is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module jbd is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module nls_iso8859-1 is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module aic7xxx is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module sd_mod is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module scsi_mod is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module jfs is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module autofs4 is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module af_packet is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module mousedev is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module hid is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module usbmouse is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module input is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module usb-uhci is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module usbcore is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module unix is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module reiserfs is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module ide-disk is in ksyms but not in lsmod
Error (compare_ksyms_lsmod): module ide-mod is in ksyms but not in lsmod
Warning (compare_maps): mismatch on symbol irlan_state , irlan says d0ee5900, /lib/modules/2.4.17-expt/kernel/net/irda/irlan.o says d0ee58b8. Ignoring /lib/modules/2.4.17-expt/kernel/net/irda/irlan.o entry
Warning (compare_maps): mismatch on symbol proc_irda , irda says d0ed6084, /lib/modules/2.4.17-expt/kernel/net/irda/irda.o says d0ed56c4. Ignoring /lib/modules/2.4.17-expt/kernel/net/irda/irda.o entry
Warning (compare_maps): mismatch on symbol icmpv6_socket , ipv6 says d0ebeb40, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc7e0. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol icmpv6_statistics , ipv6 says d0ebea40, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc6e0. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inet6_dev_count , ipv6 says d0ebe5a0, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc240. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inet6_ifa_count , ipv6 says d0ebe5a4, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc244. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inet6_protos , ipv6 says d0ebe9c0, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc660. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inetsw6 , ipv6 says d0ebe540, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc1e0. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol ip6_ra_chain , ipv6 says d0ebe840, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc4e0. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol ipv6_statistics , ipv6 says d0ebe740, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc3e0. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol raw_v6_htable , ipv6 says d0ebe940, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc5e0. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol rt6_stats , ipv6 says d0ebe708, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc3a8. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol udp_stats_in6 , ipv6 says d0ebe8c0, /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o says d0ebc560. Ignoring /lib/modules/2.4.17-expt/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol vxfs_inode_cachep , freevxfs says d0e2733c, /lib/modules/2.4.17-expt/kernel/fs/freevxfs/freevxfs.o says d0e26ee0. Ignoring /lib/modules/2.4.17-expt/kernel/fs/freevxfs/freevxfs.o entry
Warning (compare_maps): mismatch on symbol cii_cache , coda says d0dab608, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa7a8. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_cache_inv_stat , coda says d0dab7c8, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa968. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_callstats , coda says d0dab6e0, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa880. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_comms , coda says d0dab540, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa6e0. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_debug , coda says d0dab6bc, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa85c. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_fake_statfs , coda says d0dab6c4, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa864. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_hard , coda says d0dab520, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa6c0. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_hasmknod , coda says d0dab620, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa7c0. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_permission_stat , coda says d0dab7c0, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa960. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_print_entry , coda says d0dab6c0, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa860. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_upcall_stat , coda says d0dab800, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa9a0. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol coda_vfs_stat , coda says d0dab780, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa920. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol cred_cache , coda says d0dab60c, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa7ac. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol proc_fs_coda , coda says d0dab9bc, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daab5c. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol upc_cache , coda says d0dab610, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa7b0. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol use_coda_close , coda says d0dab624, /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o says d0daa7c4. Ignoring /lib/modules/2.4.17-expt/kernel/fs/coda/coda.o entry
Warning (compare_maps): mismatch on symbol presto_sym_fops , intermezzo says d0d8f920, /lib/modules/2.4.17-expt/kernel/fs/intermezzo/intermezzo.o says d0d8de20. Ignoring /lib/modules/2.4.17-expt/kernel/fs/intermezzo/intermezzo.o entry
Warning (compare_maps): mismatch on symbol proc_fs_intermezzo , intermezzo says d0d8f900, /lib/modules/2.4.17-expt/kernel/fs/intermezzo/intermezzo.o says d0d8de00. Ignoring /lib/modules/2.4.17-expt/kernel/fs/intermezzo/intermezzo.o entry
Warning (compare_maps): mismatch on symbol nlmsvc_ops , lockd says d0bb39f0, /lib/modules/2.4.17-expt/kernel/fs/lockd/lockd.o says d0bb2e58. Ignoring /lib/modules/2.4.17-expt/kernel/fs/lockd/lockd.o entry
Warning (compare_maps): mismatch on symbol nfs_debug , sunrpc says d0ba6a44, /lib/modules/2.4.17-expt/kernel/net/sunrpc/sunrpc.o says d0ba6724. Ignoring /lib/modules/2.4.17-expt/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol nfsd_debug , sunrpc says d0ba6a48, /lib/modules/2.4.17-expt/kernel/net/sunrpc/sunrpc.o says d0ba6728. Ignoring /lib/modules/2.4.17-expt/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol nlm_debug , sunrpc says d0ba6a4c, /lib/modules/2.4.17-expt/kernel/net/sunrpc/sunrpc.o says d0ba672c. Ignoring /lib/modules/2.4.17-expt/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol rpc_debug , sunrpc says d0ba6a40, /lib/modules/2.4.17-expt/kernel/net/sunrpc/sunrpc.o says d0ba6720. Ignoring /lib/modules/2.4.17-expt/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol journal_enable_debug , jbd says d0ae129c, /lib/modules/2.4.17-expt/kernel/fs/jbd/jbd.o says d0ae1288. Ignoring /lib/modules/2.4.17-expt/kernel/fs/jbd/jbd.o entry
Warning (compare_maps): mismatch on symbol aic7xxx_detect_complete , aic7xxx says d0ad41a4, /lib/modules/2.4.17-expt/kernel/drivers/scsi/aic7xxx/aic7xxx.o says d0ad1a64. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/aic7xxx/aic7xxx.o entry
Warning (compare_maps): mismatch on symbol aic7xxx_no_probe , aic7xxx says d0ad41a0, /lib/modules/2.4.17-expt/kernel/drivers/scsi/aic7xxx/aic7xxx.o says d0ad1a60. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/aic7xxx/aic7xxx.o entry
Warning (compare_maps): mismatch on symbol aic7xxx_periodic_otag , aic7xxx says d0ad41b0, /lib/modules/2.4.17-expt/kernel/drivers/scsi/aic7xxx/aic7xxx.o says d0ad1a70. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/aic7xxx/aic7xxx.o entry
Warning (compare_maps): mismatch on symbol aic7xxx_verbose , aic7xxx says d0ad41a8, /lib/modules/2.4.17-expt/kernel/drivers/scsi/aic7xxx/aic7xxx.o says d0ad1a68. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/aic7xxx/aic7xxx.o entry
Warning (compare_maps): mismatch on symbol sd , sd_mod says d0a14aec, /lib/modules/2.4.17-expt/kernel/drivers/scsi/sd_mod.o says d0a14980. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/sd_mod.o entry
Warning (compare_maps): mismatch on symbol proc_scsi , scsi_mod says d0ab3b68, /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o says d0ab2310. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o entry
Warning (compare_maps): mismatch on symbol scsi_devicelist , scsi_mod says d0ab3b94, /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o says d0ab233c. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o entry
Warning (compare_maps): mismatch on symbol scsi_hostlist , scsi_mod says d0ab3b90, /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o says d0ab2338. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o entry
Warning (compare_maps): mismatch on symbol scsi_hosts , scsi_mod says d0ab3b98, /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o says d0ab2340. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o entry
Warning (compare_maps): mismatch on symbol scsi_logging_level , scsi_mod says d0ab3b64, /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o says d0ab230c. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/scsi/scsi_mod.o entry
Warning (compare_maps): mismatch on symbol packet_socks_nr , af_packet says d09f6fe8, /lib/modules/2.4.17-expt/kernel/net/packet/af_packet.o says d09f6de4. Ignoring /lib/modules/2.4.17-expt/kernel/net/packet/af_packet.o entry
Warning (compare_maps): mismatch on symbol usb_devfs_handle , usbcore says d091f2f4, /lib/modules/2.4.17-expt/kernel/drivers/usb/usbcore.o says d091edb4. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/usb/usbcore.o entry
Warning (compare_maps): mismatch on symbol unix_socket_table , unix says d08f2520, /lib/modules/2.4.17-expt/kernel/net/unix/unix.o says d08f2140. Ignoring /lib/modules/2.4.17-expt/kernel/net/unix/unix.o entry
Warning (compare_maps): mismatch on symbol ide_devfs_handle , ide-mod says d0812e00, /lib/modules/2.4.17-expt/kernel/drivers/ide/ide-mod.o says d0811fec. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/ide/ide-mod.o entry
Warning (compare_maps): mismatch on symbol ide_hwifs , ide-mod says d0810f60, /lib/modules/2.4.17-expt/kernel/drivers/ide/ide-mod.o says d081014c. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/ide/ide-mod.o entry
Warning (compare_maps): mismatch on symbol ide_probe , ide-mod says d0810f40, /lib/modules/2.4.17-expt/kernel/drivers/ide/ide-mod.o says d081012c. Ignoring /lib/modules/2.4.17-expt/kernel/drivers/ide/ide-mod.o entry
Reading Oops report from the terminal
Unable to handle kernel paging request at virtual address d11cd2a0
c017184d
*pde = 0563e067
Oops: 0002
CPU: 0
EIP: 0010:[<c017184d>] Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00210246
eax: d11cd2a0 ebx: d1191000 ecx: 00000000 edx: cf005f64
esi: d1197540 edi: 00000000 ebp: 00000003 esp: c3b27f04
ds: 0018 es: 0018 ss: 0018
Process modprobe (pid: 5881, stackpage=c3b27000)
Stack: d1191000 00000003 00000003 d11968ca d1197540 00000000 d1191000 00000003
00000003 d1191000 c0115945 d119763c c0ff4000 00006580 c05da000 00000060
ffffffea 00000006 cb8e1214 00000060 d118d000 d1191060 00006680 00000000
Call Trace: [<d11968ca>] [<d1197540>] [<c0115945>] [<d119763c>] [<d1191060>]
[<c0106ebb>]
Code: 89 30 8b 1d 28 6a 1e c0 81 fb 28 6a 1e c0 74 23 8d 76 00 53

>>EIP; c017184d <pci_register_driver+1d/60> <=====
Trace; d11968ca <[es1371]init_es1371+2a/50>
Trace; d1197540 <[es1371]es1371_driver+0/3f>
Trace; c0115945 <sys_init_module+525/5e0>
Trace; d119763c <[es1371].data.end+bd/dae1>
Trace; d1191060 <[es1371]wait_src_ready+0/3c>
Trace; c0106ebb <system_call+33/38>
Code; c017184d <pci_register_driver+1d/60>
00000000 <_EIP>:
Code; c017184d <pci_register_driver+1d/60> <=====
0: 89 30 mov %esi,(%eax) <=====
Code; c017184f <pci_register_driver+1f/60>
2: 8b 1d 28 6a 1e c0 mov 0xc01e6a28,%ebx
Code; c0171855 <pci_register_driver+25/60>
8: 81 fb 28 6a 1e c0 cmp $0xc01e6a28,%ebx
Code; c017185b <pci_register_driver+2b/60>
e: 74 23 je 33 <_EIP+0x33> c0171880 <pci_register_driver+50/60>
Code; c017185d <pci_register_driver+2d/60>
10: 8d 76 00 lea 0x0(%esi),%esi
Code; c0171860 <pci_register_driver+30/60>
13: 53 push %ebx


60 warnings and 36 errors issued. Results may not be reliable.



2002-01-17 21:01:09

by Andrew Morton

[permalink] [raw]
Subject: Re: probably very irrelevant oops

Matt Bernstein wrote:
>
> Hi,
>
> I built a fairly pathological kernel based on 2.4.17 with sched-O1-H7,
> ext3-0.9.17, XFS, jfs-1.0.12 and Intel's e100. (Things are orthogonal
> enough that they patch together easily :)
>
> It boots fine (but not with devfs) and I can use all four journaled
> filesystems together happily. So I thought I'd try two very stupid stress
> tests.
>
> find /lib/modules/2.4.17-expt/kernel/ -type f|while read i; do insmod $i; done

You're sick. I like you.

> [Great. A few hundred modules load, no doubt some clashing with others.
> However, lsmod seems to suggest we've overrun a buffer as the output
> looks truncated (my find produced 597 lines, but lsmod only 300 or so)]

Well a lot of the modules won't stick, because they won't find the
required hardware. Plus the /proc/modules output gets chopped after
4 kbytes.

> awk '{print $1}' /proc/modules|while read i; do rmmod $i; done
>
> [BOOM. But I was asking for it.. the first three warnings are the modules
> in my initrd]
>
> ...
>
> Unable to handle kernel paging request at virtual address d11cd2a0
> c017184d
> *pde = 0563e067
> Oops: 0002
> CPU: 0
> EIP: 0010:[<c017184d>] Not tainted
> Using defaults from ksymoops -t elf32-i386 -a i386
> EFLAGS: 00210246
> eax: d11cd2a0 ebx: d1191000 ecx: 00000000 edx: cf005f64
> esi: d1197540 edi: 00000000 ebp: 00000003 esp: c3b27f04
> ds: 0018 es: 0018 ss: 0018
> Process modprobe (pid: 5881, stackpage=c3b27000)
> Stack: d1191000 00000003 00000003 d11968ca d1197540 00000000 d1191000 00000003
> 00000003 d1191000 c0115945 d119763c c0ff4000 00006580 c05da000 00000060
> ffffffea 00000006 cb8e1214 00000060 d118d000 d1191060 00006680 00000000
> Call Trace: [<d11968ca>] [<d1197540>] [<c0115945>] [<d119763c>] [<d1191060>]
> [<c0106ebb>]
> Code: 89 30 8b 1d 28 6a 1e c0 81 fb 28 6a 1e c0 74 23 8d 76 00 53
>
> >>EIP; c017184d <pci_register_driver+1d/60> <=====
> Trace; d11968ca <[es1371]init_es1371+2a/50>
> Trace; d1197540 <[es1371]es1371_driver+0/3f>
> Trace; c0115945 <sys_init_module+525/5e0>
> Trace; d119763c <[es1371].data.end+bd/dae1>
> Trace; d1191060 <[es1371]wait_src_ready+0/3c>
> Trace; c0106ebb <system_call+33/38>

It appears that one of the modules failed to unregister itself
from the global driver list. Then when the next module went
walking the list, it tried to refer to the bad module's vmalloc'ed
space.

One strange thing: why was it `modprobe' which crashed? Were you
not purely running `rmmod' at the time?

The bug probably is in the module which immediately preceded
es1371 in your /proc/modules output. Could you please load
them all up again, send me your /proc/modules output?

Also, change your scripts to print out the names of the modules
as they're being loaded and unloaded, run the test again and
see which modules were being loaded/unloaded shortly before the
crash.

Thanks.

2002-01-17 21:34:51

by Richard B. Johnson

[permalink] [raw]
Subject: Re: probably very irrelevant oops

On Thu, 17 Jan 2002, Andrew Morton wrote:

> Matt Bernstein wrote:
> >
> > Hi,
> >
> > I built a fairly pathological kernel based on 2.4.17 with sched-O1-H7,
> > ext3-0.9.17, XFS, jfs-1.0.12 and Intel's e100. (Things are orthogonal
> > enough that they patch together easily :)
> >
> > It boots fine (but not with devfs) and I can use all four journaled
> > filesystems together happily. So I thought I'd try two very stupid stress
> > tests.
> >
> > find /lib/modules/2.4.17-expt/kernel/ -type f|while read i; do insmod $i; done
>
> You're sick. I like you.

[SNIPPED....]

More sickness follows...

The following script. Used to insert modules, many wouldn't install of
course... In fact, if I would run it several times, I could get the
modules that had to be loaded in a specific order to be installed, then
use `modprobe -c` to make a current listing for /etc/modules.conf.


VER=`uname -r`

for x in `find /lib/modules/${VER} -name "*.o"` ; do
echo $x
insmod $x
done


But with Linux-2.4.7, the system Crashes to a hard-stop on:

MIDI Loopback device driver.
XM 3812 and ODL3

Another OOps at
cs98x0.c


That's about all I could see. There were no logs upon boot, in fact,
some SCSI disk driver installed itself instead of my BusLogic so
no disk I/O was possible!!!



Cheers,
Dick Johnson

Penguin : Linux version 2.4.7 on an i686 machine (797.90 BogoMips).

I was going to compile a list of innovations that could be
attributed to Microsoft. Once I realized that Ctrl-Alt-Del
was handled in the BIOS, I found that there aren't any.


2002-01-17 22:53:54

by Matt Bernstein

[permalink] [raw]
Subject: Re: probably very irrelevant oops

At 12:53 -0800 Andrew Morton wrote:

>> find /lib/modules/2.4.17-expt/kernel/ -type f|while read i; do insmod $i; done

[etc..]

>You're sick. I like you.

I'm only using the sledgehammers you guys give me :)

[snip]
>
>It appears that one of the modules failed to unregister itself
>from the global driver list. Then when the next module went
>walking the list, it tried to refer to the bad module's vmalloc'ed
>space.
>
>One strange thing: why was it `modprobe' which crashed? Were you
>not purely running `rmmod' at the time?

I was, but maybe magicdev or something got in the way. I'll try it with no
GUI or any crazy daemons loaded.

>The bug probably is in the module which immediately preceded
>es1371 in your /proc/modules output. Could you please load
>them all up again, send me your /proc/modules output?

I will do, when I'm back at work. I don't fancy remote crashing my
machine!

>Also, change your scripts to print out the names of the modules
>as they're being loaded and unloaded, run the test again and
>see which modules were being loaded/unloaded shortly before the
>crash.

Sure :)

2002-01-18 16:22:21

by Matt Bernstein

[permalink] [raw]
Subject: Re: probably very irrelevant oops

>The bug probably is in the module which immediately preceded
>es1371 in your /proc/modules output. Could you please load
>them all up again, send me your /proc/modules output?

Further exploration revealed an oops on loading the modules. Here's a clip
from find /lib/modules/2.4.17-expt/kernel -type f

/lib/modules/2.4.17-expt/kernel/drivers/net/plip.o
/lib/modules/2.4.17-expt/kernel/drivers/net/8139cp.o
/lib/modules/2.4.17-expt/kernel/drivers/net/slhc.o
/lib/modules/2.4.17-expt/kernel/drivers/net/slip.o
/lib/modules/2.4.17-expt/kernel/drivers/net/wireless/hermes.o
/lib/modules/2.4.17-expt/kernel/drivers/net/wireless/orinoco.o
/lib/modules/2.4.17-expt/kernel/drivers/net/wireless/airo.o
/lib/modules/2.4.17-expt/kernel/drivers/net/wireless/orinoco_plx.o
/lib/modules/2.4.17-expt/kernel/drivers/net/arlan.o
/lib/modules/2.4.17-expt/kernel/drivers/net/8390.o
/lib/modules/2.4.17-expt/kernel/drivers/net/3c509.o
/lib/modules/2.4.17-expt/kernel/drivers/net/3c59x.o
/lib/modules/2.4.17-expt/kernel/drivers/net/shaper.o
/lib/modules/2.4.17-expt/kernel/drivers/net/pppoe.o
/lib/modules/2.4.17-expt/kernel/drivers/net/pppox.o
/lib/modules/2.4.17-expt/kernel/drivers/net/de4x5.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/maps/physmap.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/maps/elan-104nc.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/maps/pnc2000.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/maps/netsc520.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/maps/sbc_gxx.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/maps/sc520cdp.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/nand/nand.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/nand/nand_ecc.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/devices/mtdram.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/devices/slram.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/devices/blkmtd.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/devices/doc1000.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/devices/doc2000.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/devices/doc2001.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/devices/docprobe.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/devices/pmc551.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/devices/docecc.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/chips/gen_probe.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/chips/map_absent.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/chips/jedec_probe.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/chips/chipreg.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/chips/cfi_probe.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/chips/map_ram.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/chips/map_rom.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/chips/cfi_cmdset_0001.o
/lib/modules/2.4.17-expt/kernel/drivers/mtd/chips/cfi_cmdset_0002.o

..and the corresponding oops as logged by sysklogd..
(NB my find -exec insmod "{}" \; was run more than once)

8139cp 10/100 PCI Ethernet driver v0.0.5 (Oct 19, 2001)
arlan: no devices found
physmap flash device: 4000000 at 8000000
Failed to ioremap
ELAN-104NC flash: IO:0x22-0x23 MEM:0xb0000-0xb7fff
CFI: Found no ELAN-104NC flash device at location zero
Trying to free nonexistent resource <00000022-00000023>
Photron PNC-2000 flash mapping: 400000 at bf000000
Unable to handle kernel paging request at virtual address bf000000
printing eip:
d12c4108
*pde = 00000000
Oops: 0002
CPU: 0
EIP: 0010:[reiserfs:__insmod_reiserfs_S.bss_L6624+11033736/350043552] Not tainted
EIP: 0010:[<d12c4108>] Not tainted
EFLAGS: 00010246
eax: 00000000 ebx: d12c4300 ecx: 00000004 edx: 00f000f0
esi: 00f000f0 edi: 00000004 ebp: ca609e94 esp: ca609ca4
ds: 0018 es: 0018 ss: 0018
Process insmod (pid: 3228, stackpage=ca609000)
Stack: d1004105 d12c4300 00f000f0 00000000 00000978 c014a19a 0805e688 0805e688
0805ed60 00000000 ca608000 c9aba914 0805dcc8 00000000 00000000 ca609e94
d12c4300 d1005030 d1005030 d1000377 d12c4300 00000000 00000000 ca609e94
Call Trace: [reiserfs:__insmod_reiserfs_S.bss_L6624+8150149/352927139] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048] [load_elf_binary+2362/2688] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048] [reiserfs:__insmod_reiserfs_S.bss_L6624+8154032/352923256]
Call Trace: [<d1004105>] [<d12c4300>] [<c014a19a>] [<d12c4300>] [<d1005030>]
[reiserfs:__insmod_reiserfs_S.bss_L6624+8154032/352923256] [reiserfs:__insmod_reiserfs_S.bss_L6624+8134391/352942897] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048] [reiserfs:__insmod_reiserfs_S.bss_L6624+8133748/352943540] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048]
[<d1005030>] [<d1000377>] [<d12c4300>] [<d12c4300>] [<d10000f4>] [<d12c4300>]
[reiserfs:__insmod_reiserfs_S.bss_L6624+8154032/352923256] [e100:__insmod_e100_S.text_L35088+11122/35424] [e100:__insmod_e100_S.rodata_L96+8216/9184] [e100:__insmod_e100_S.text_L35088+9210/35424] [getblk+24/64] [e100:__insmod_e100_O/lib/modules/2.4.17-expt/kernel/drivers/net/+-1949807/96]
[<d1005030>] [<d0a0cbd2>] [<d0a14ad8>] [<d0a0c45a>] [<c0132e08>] [<d082df91>]
[e100:__insmod_e100_O/lib/modules/2.4.17-expt/kernel/drivers/net/+-1947579/96] [e100:__insmod_e100_O/lib/modules/2.4.17-expt/kernel/drivers/net/+-1930449/96] [e100:__insmod_e100_O/lib/modules/2.4.17-expt/kernel/drivers/net/+-1885122/96] [e100:__insmod_e100_O/lib/modules/2.4.17-expt/kernel/drivers/net/+-1950321/96] [fast_clear_page+10/96] [do_no_page+49/272]
[<d082e845>] [<d0832b2f>] [<d083dc3e>] [<d082dd8f>] [<c01baaca>] [<c0121d11>]
[reiserfs:__insmod_reiserfs_S.bss_L6624+8154040/352923248] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048] [reiserfs:__insmod_reiserfs_S.bss_L6624+8133616/352943672] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048] [reiserfs:__insmod_reiserfs_S.bss_L6624+8154032/352923256] [reiserfs:__insmod_reiserfs_S.bss_L6624+8154040/352923248]
[<d1005038>] [<d12c4300>] [<d1000070>] [<d12c4300>] [<d1005030>] [<d1005038>]
[reiserfs:__insmod_reiserfs_S.bss_L6624+11034212/350043076] [reiserfs:__insmod_reiserfs_S.bss_L6624+8153534/352923754] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048] [reiserfs:__insmod_reiserfs_S.bss_L6624+8154032/352923256] [reiserfs:__insmod_reiserfs_S.bss_L6624+8142022/352935266] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048]
[<d12c42e4>] [<d1004e3e>] [<d12c4300>] [<d1005030>] [<d1002146>] [<d12c4300>]
[reiserfs:__insmod_reiserfs_S.bss_L6624+11033859/350043429] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034212/350043076] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034240/350043048] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034080/350043208] [sys_init_module+1317/1504] [reiserfs:__insmod_reiserfs_S.bss_L6624+11034676/350042612]
[<d12c4183>] [<d12c42e4>] [<d12c4300>] [<d12c4260>] [<c0115945>] [<d12c44b4>]
[reiserfs:__insmod_reiserfs_S.bss_L6624+11033568/350043720] [system_call+51/56]
[<d12c4060>] [<c0106ebb>]

Code: 89 90 00 00 00 bf c3 90 57 56 83 ec 10 8b 44 24 28 8b 7c 24
<5>NetSc520 flash device: 100000 at 200000
Failed to ioremap_nocache
SBC-GXx flash: IO:0x258-0x259 MEM:0xdc000-0xdffff
CFI: Found no SBC-GXx flash device at location zero
Could not find PAR responsible for SC520CDP Flash Bank #0
Trying default address 0x8400000
Could not find PAR responsible for SC520CDP Flash Bank #1
Trying default address 0x8c00000
Could not find PAR responsible for SC520CDP DIL Flash
Trying default address 0x9400000
SC520 CDP flash device: 800000 at 8400000
Failed to ioremap_nocache
mtd: Giving out device 0 to mtdram test device



I don't think that reiserfs (which btw is my root fs) is the culprit,
rather that it's been trampled on by the "Photron PNC-2000 flash mapping".

Ought I to take the next step, I guess to write a program along the lines
of:

for (<all modules>) {
syslog(<module name>);
fsync();
modprobe(<module name>);
fsync();
}

..?

2002-01-18 21:21:31

by Horst von Brand

[permalink] [raw]
Subject: Re: probably very irrelevant oops

Matt Bernstein <[email protected]> said:
> >The bug probably is in the module which immediately preceded
> >es1371 in your /proc/modules output. Could you please load
> >them all up again, send me your /proc/modules output?
>
> Further exploration revealed an oops on loading the modules. Here's a clip
> from find /lib/modules/2.4.17-expt/kernel -type f

Could you try just loading that last module (and anything it might depend
on)? I looks like a problem with the module expecting some parameter it
doesn't get, and then screwing up. [Just a random guess].
--
Horst von Brand http://counter.li.org # 22616