Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S262333AbTINIQy (ORCPT ); Sun, 14 Sep 2003 04:16:54 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S262337AbTINIQy (ORCPT ); Sun, 14 Sep 2003 04:16:54 -0400 Received: from mail.gmx.de ([213.165.64.20]:38347 "HELO mail.gmx.net") by vger.kernel.org with SMTP id S262333AbTINIQe (ORCPT ); Sun, 14 Sep 2003 04:16:34 -0400 From: "Sebastian Piecha" To: linux-kernel@vger.kernel.org Date: Sun, 14 Sep 2003 10:16:38 +0200 MIME-Version: 1.0 Subject: PROBLEM: oops in 2.4.23pre1, Promise-ide, samba Message-ID: <3F644006.22303.31C480A@localhost> X-mailer: Pegasus Mail for Windows (v4.12a) Content-type: text/plain; charset=US-ASCII Content-transfer-encoding: 7BIT Content-description: Mail message body Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 20961 Lines: 564 Hello, several times I got an OOPS. Here's a description of what has happened. Any help would be appreciated. Please CC me on all further mail traffic. ############################################################### 1) one line summary: When moving data (more than 4GB) from a Windows XP Client to a samba share or checking data stored on the samba share (Powerquest DriveImage images, 56 files, each ~700MB of size, checking with the Powerquest Image Explorer) the kernel panics with an OOPS. Linux has to be resetted hard. When accessing the data via NFS no OOPS is occurring. The error occured on kernel 2.4.20 with samba 2.2.7a and 2.2.8a. Now I tried kernel 2.4.23pre1 with samba 2.2.8a and again the error occurred. I already posted the kernel 2.4.20 problem. The subjects were "PROBLEM: Powerquest Drive Image let the kernel panic" and "PROBLEM: kernel panic when accessing data via samba". ############################################################### 2) full description: I'm using Samba to distribute some shares to Windows clients. One of the shares is an Image-directory where I'm storing PQDI Images of Windows clients. One of the created images is about 40GB of size and is split up to 56 files each of same size. When verifying this image from a Win XP client, PQDI stops with an error (error 1811, "Could not read from image file") and the Linux kernel panics. Verifying this image from DOS (with MS network client) is done without any error. Also verifying smaller images is done without any error. Verifying this Image via NFS is also done without an error. Another PQDI version (7.0) also reports an error and the Linux Kernel panics. Copying more than 4 GB to the samba share also lets the kernel panic with an OOPS. Copying data locally from the Linux console is done without an error. In the beginning I thought that the Promise controller is the source of problem, now I'm not sure. Maybe it's samba or the combination of samba and a Promise controller. The share is lying in a directory on a Reiser filesystem: share Images ReiserFS LVM (on /dev/md0 only, 120GB) RAID1 /dev/md0 (120GB) /dev/hda1 + /dev/hde1 (one primary partition of 120GB on each drive) /dev/hda + /dev/hde (each 120GB) IDE UDMA133-controller As IDE-controller I first used a Promise FastTrak TX2000 (which supports "hardware"-RAID). I tried the binary Promise-driver (1.03.0.1) and the source code-driver (1.02.0.25), both without success. All time the OOPS occurred. Then I replaced the controller and both Samsung SP1203N-hard drives (each 120GB) against a Promise UltraTrak 133 TX2 and two Maxtor drives (6Y120P0, each 1 20GB) and installed a Linux native software-RAID without any Promise-driver. But again the OOPS occurred. Of course I updated the Promise-firmware to the latest level. To eliminate the RAID and LVM-drivers as the source of problem I installed just a Reiser FS on one 120GB-primary partition on one of both Maxtor disks (after removing the drive from the RAID). But again the Linux kernel panicked. Trying ext3 instead of reiserfs didn't help. As I do not have enough space on my scsi-disks I can't verify this big image from a scsi-disk. Sometimes the Linux kernel panic occurs immediately some minutes after starting the verify, sometimes it happens after reading half of all image files. Samba doesn't report any error. I also tried a different PCI-slot for the Promise- adapter without any success. Next thing would be to try a different IDE-controller... ############################################################### 3) keywords: Suse Linux 8.20, kernel 2.4.23pre1, Promise Ultra 133 TX2, samba 2.28a ############################################################### 4) /proc/version: Linux version 2.4.23pre1-usbtest (2.4.23pre1@USB-test.suse.de) (gcc version 3.2.2) #1 Wed Aug 27 19:43:12 UTC 2003 ############################################################### 5) OOPS-message: Oops: 0000 CPU: 0 EIP: 0010:[] Not tainted Using defaults from ksymoops -t elf32-i386 -a i386 EFLAGS: 00010206 eax: c40866a0 ebx: 00200000 ecx: c40866a0 edx: 00200000 esi: cec57360 edi: fffffff9 ebp: 00000046 esp: c0303f2c ds: 0018 es: 0018 ss: 0018 Process swapper (pid: 0, stackpage=c0303000) Stack: cec57360 c0219d6e cec57360 cec57360 cec57360 c0219dab cec57360 cec57360 c0219efc cec57360 cf49cb20 c021e173 cec57360 00000003 c032c568 c0120629 c032c568 00000006 0000000e c0303f98 d3e02e40 c010a091 c0106f40 c0302000 Call Trace: [] [] [] [] [] [] [] [] [] [] [] [] Code: 8b 1b 8b 42 74 48 74 0a ff 4a 74 0f 94 c0 84 c0 74 07 52 e8 >>EIP; c0219cd7 <===== >>eax; c40866a0 <_end+3cf81fc/14e64bbc> >>ecx; c40866a0 <_end+3cf81fc/14e64bbc> >>esi; cec57360 <_end+e8c8ebc/14e64bbc> >>esp; c0303f2c Trace; c0219d6e Trace; c0219dab Trace; c0219efc <__kfree_skb+ec/150> Trace; c021e173 Trace; c0120629 Trace; c010a091 Trace; c0106f40 Trace; c010c4e8 Trace; c0106f40 Trace; c0106f64 Trace; c0106fd2 Trace; c0105000 <_stext+0/0> Code; c0219cd7 00000000 <_EIP>: Code; c0219cd7 <===== 0: 8b 1b mov (%ebx),%ebx <===== Code; c0219cd9 2: 8b 42 74 mov 0x74(%edx),%eax Code; c0219cdc 5: 48 dec %eax Code; c0219cdd 6: 74 0a je 12 <_EIP+0x12> Code; c0219cdf 8: ff 4a 74 decl 0x74(%edx) Code; c0219ce2 b: 0f 94 c0 sete %al Code; c0219ce5 e: 84 c0 test %al,%al Code; c0219ce7 10: 74 07 je 19 <_EIP+0x19> Code; c0219ce9 12: 52 push %edx Code; c0219cea 13: e8 00 00 00 00 call 18 <_EIP+0x18> <0>Kernel panic: Aiee, killing interrupt handler! ############################################################### 6) shell script: no way ############################################################### 7) environment: Dell Optiplex GX1 400MTbr+, Intel II 400 MHz, 320 MB RAM Adaptec AHA 2940UW as PCI-adapter with two hard drives (20GB and 4 GB, /boot is on the first scsi-drive) and a Plextor CD-writer onboard LAN (3com) Promise Ultra133 TX2 as PCI-adapter with two Maxtor-drives (each 120GB) DVD-ROM at the onboard-IDE ############################################################### 7.1) ver_linux: Linux server01 2.4.23pre1-usbtest #1 Wed Aug 27 19:43:12 UTC 2003 i686 unknown unknown GNU/Linux Gnu C 3.3.1 Gnu make 3.80 util-linux 2.11z mount 2.11z modutils 2.4.22 e2fsprogs 1.28 jfsutils 1.1.1 Linux C Library x 1 root root 1475331 Mar 27 21:39 /lib/libc.so.6 Dynamic linker (ldd) 2.3.2 Procps 3.1.6 Net-tools 1.60 Kbd 1.06 Sh-utils 4.5.8 Modules Loaded isa-pnp usbserial ipv6 nfsd autofs st sr_mod sg mousedev joydev evdev input usb-uhci usbcore raw1394 ieee1394 3c59x ide-cd cdrom lvm-mod raid1 reiserfs aic7xxx ############################################################### 7.2) cpuinfo: processor : 0 vendor_id : GenuineIntel cpu family : 6 model : 5 model name : Pentium II (Deschutes) stepping : 2 cpu MHz : 398.780 cache size : 512 KB fdiv_bug : no hlt_bug : no f00f_bug : no coma_bug : no fpu : yes fpu_exception : yes cpuid level : 2 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr bogomips : 796.26 ############################################################### 7.3) modules: isa-pnp 30180 0 (unused) usbserial 18332 0 (autoclean) (unused) ipv6 161172 -1 (autoclean) nfsd 65808 4 (autoclean) autofs 10100 1 (autoclean) st 27564 0 (autoclean) (unused) sr_mod 14680 0 (autoclean) sg 26540 0 (autoclean) mousedev 4084 0 (unused) joydev 5728 0 (unused) evdev 4384 0 (unused) input 3328 0 [mousedev joydev evdev] usb-uhci 21772 0 (unused) usbcore 57664 1 [usbserial usb-uhci] raw1394 17108 0 (unused) ieee1394 42660 0 [raw1394] 3c59x 26864 1 ide-cd 30112 0 (autoclean) cdrom 26560 0 (autoclean) [sr_mod ide-cd] lvm-mod 56192 6 (autoclean) raid1 12748 1 (autoclean) reiserfs 182992 4 aic7xxx 139948 5 ############################################################### 7.4) ioports, iomem: 0000-001f : dma1 0020-003f : pic1 0040-005f : timer 0060-006f : keyboard 0070-007f : rtc 0080-008f : dma page reg 00a0-00bf : pic2 00c0-00df : dma2 00f0-00ff : fpu 0170-0177 : ide1 02f8-02ff : serial(auto) 0376-0376 : ide1 03c0-03df : vesafb 03f8-03ff : serial(auto) 0800-083f : Intel Corp. 82371AB/EB/MB PIIX4 ACPI 0840-085f : Intel Corp. 82371AB/EB/MB PIIX4 ACPI 0cf8-0cff : PCI conf1 cc00-cc7f : 3Com Corporation 3c905B 100BaseTX [Cyclone] cc00-cc7f : 00:11.0 cca0-ccaf : Promise Technology, Inc. 20269 cca0-cca7 : ide0 cca8-ccaf : ide2 ccb8-ccbb : Promise Technology, Inc. 20269 ccba-ccba : ide2 ccc0-ccc7 : Promise Technology, Inc. 20269 ccc0-ccc7 : ide2 ccd0-ccd3 : Promise Technology, Inc. 20269 ccd2-ccd2 : ide0 ccd8-ccdf : Promise Technology, Inc. 20269 ccd8-ccdf : ide0 cce0-ccff : Intel Corp. 82371AB/EB/MB PIIX4 USB cce0-ccff : usb-uhci d000-dfff : PCI Bus #02 d800-d8ff : Adaptec AHA-2940U/UW / AHA-39xx / AIC-7895 (#2) dc00-dcff : Adaptec AHA-2940U/UW / AHA-39xx / AIC-7895 e000-efff : PCI Bus #01 ec00-ecff : ATI Technologies Inc 3D Rage Pro AGP 1X/2X ffa0-ffaf : Intel Corp. 82371AB/EB/MB PIIX4 IDE ffa8-ffaf : ide1 00000000-0009ffff : System RAM 000a0000-000bffff : Video RAM area 000c0000-000c7fff : Video ROM 000c8000-000cc7ff : Extension ROM 000d0000-000d7fff : Extension ROM 000d8000-000da7ff : Extension ROM 000f0000-000fffff : System ROM 00100000-13ffffff : System RAM 00100000-002700a5 : Kernel code 002700a6-00301703 : Kernel data f0000000-f3ffffff : Intel Corp. 440BX/ZX/DX - 82443BX/ZX/DX Host bridge f5000000-f5ffffff : PCI Bus #02 f6000000-f6ffffff : PCI Bus #01 fa000000-fbffffff : PCI Bus #02 faffe000-faffefff : Adaptec AHA-2940U/UW / AHA-39xx / AIC-7895 (#2) faffe000-faffefff : aic7xxx fafff000-faffffff : Adaptec AHA-2940U/UW / AHA-39xx / AIC-7895 fafff000-faffffff : aic7xxx fc000000-feffffff : PCI Bus #01 fcfff000-fcffffff : ATI Technologies Inc 3D Rage Pro AGP 1X/2X fcfff000-fcffffff : atyfb fd000000-fdffffff : ATI Technologies Inc 3D Rage Pro AGP 1X/2X fd000000-fdffffff : atyfb ff000000-ff003fff : Promise Technology, Inc. 20269 ff004000-ff00407f : 3Com Corporation 3c905B 100BaseTX [Cyclone] ffe00000-ffffffff : reserved ############################################################### 7.5) PCI: 00:00.0 Host bridge: Intel Corp. 440BX/ZX/DX - 82443BX/ZX/DX Host bridge (rev 03) Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap+ 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- SERR- 00:01.0 PCI bridge: Intel Corp. 440BX/ZX/DX - 82443BX/ZX/DX AGP bridge (rev 03) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap- 66Mhz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- SERR- Reset- FastB2B+ 00:07.0 ISA bridge: Intel Corp. 82371AB/EB/MB PIIX4 ISA (rev 02) Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- SERR- TAbort- SERR- TAbort- SERR- TAbort- SERR- TAbort- SERR- TAbort- SERR- Reset- FastB2B- Capabilities: [dc] Power Management version 1 Flags: PMEClk- DSI- D1- D2- AuxCurrent=220mA PME(D0-,D1-,D2-,D3hot- ,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Bridge: PM- B3+ 00:11.0 Ethernet controller: 3Com Corporation 3c905B 100BaseTX [Cyclone] (rev 24) Subsystem: Dell Computer Corporation 3C905B Fast Etherlink XL 10/100 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap+ 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- SERR- TAbort- SERR- [disabled] [size=128K] Capabilities: [50] AGP version 1.0 Status: RQ=256 Iso- ArqSz=0 Cal=0 SBA+ ITACoh- GART64- HTrans- 64bit- FW- AGP3- Rate=x1,x2 Command: RQ=1 ArqSz=0 Cal=0 SBA- AGP- GART64- 64bit- FW- Rate= 02:0a.0 SCSI storage controller: Adaptec AHA-2940U/UW / AHA-39xx / AIC-7895 (rev 03) Subsystem: Adaptec AHA-2940U/2940UW Dual Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- SERR- TAbort- SERR-