2005-09-30 06:54:31

by Danny ter Haar

[permalink] [raw]
Subject: 2.6.14-rc2-git7 crashed on amd64 (usenet gateway) after 18 hours

Known story:
last stable kernel for this machine is/was 2.6.12-mm1

Every 2.6.1[34] kernel panics so far.
Sometimes after 40 minutes, sometimes after 4 days.

Config/more info at:
http://newsgate.newsserver.nl/kernel/2.6.14-rc2-git7/

This time it was scsi system again which "blew-up"

Obvious difference i see between 2.6.12 en 2.6.1[34] is that the
scsi/ethernet cards have different IRQ's set.
If i'm correct acpi code changed in 2.6.13, right ?

Since the 2.6.12-mm1 kernel survived 3 weeks+ and gave super performance
i'm convinced it's not a hardware issue.

Just giving feedback.

Will try git8 and mm2 and of course report here when they fail.

Danny


2005-09-30 07:13:36

by Andrew Morton

[permalink] [raw]
Subject: Re: 2.6.14-rc2-git7 crashed on amd64 (usenet gateway) after 18 hours

[email protected] (Danny ter Haar) wrote:
>
> Known story:
> last stable kernel for this machine is/was 2.6.12-mm1
>
> Every 2.6.1[34] kernel panics so far.
> Sometimes after 40 minutes, sometimes after 4 days.
>
> Config/more info at:
> http://newsgate.newsserver.nl/kernel/2.6.14-rc2-git7/

The aic79xx driver shat itself.

The log buffer starts with

scsi1 (4:0): rejecting I/O to offline device
scsi1 (4:0): rejecting I/O to offline device
scsi1 (4:0): rejecting I/O to offline device

So we've probably lost the info which will tell us how the problems
started. A serial console would be nice.

> This time it was scsi system again which "blew-up"
>
> Obvious difference i see between 2.6.12 en 2.6.1[34] is that the
> scsi/ethernet cards have different IRQ's set.
> If i'm correct acpi code changed in 2.6.13, right ?
>
> Since the 2.6.12-mm1 kernel survived 3 weeks+ and gave super performance
> i'm convinced it's not a hardware issue.
>
> Just giving feedback.
>
> Will try git8 and mm2 and of course report here when they fail.

Are all the failures due to the aic79xx driver failing in this manner? If
not then please report the different failures separately, thanks.

2005-09-30 07:40:13

by Danny ter Haar

[permalink] [raw]
Subject: Re: 2.6.14-rc2-git7 crashed on amd64 (usenet gateway) after 18 hours

Quoting Andrew Morton ([email protected]):
> So we've probably lost the info which will tell us how the problems
> started. A serial console would be nice.


This is from serial console.
Unfortunatly it's a portmaster2 which has no buffer.
What i do is start a screen, do a telnet to the portmaster
and attach the serial console. I can't go back anyfurther
than this since the output of the new kernel booting replaced it.
Mayby i can tell screen to have more lines of history ?!

> Are all the failures due to the aic79xx driver failing in this manner? If
> not then please report the different failures separately, thanks.

So far, all new kernels indeed barfed with scsi errors.
It somehow seems to miss a scsi event and a chain of events occur
which also sometimes lead to shutdown ethernet interfaces.

This is the difference in IRQ settings i talked about earlier:

Linux 2.6.14-rc2-git7 (root@newsgate) (gcc 4.0.2 ) #1 1CPU
irq 0: 3736032 timer irq 12: 3
irq 1: 8 i8042 irq 16: 4519576 aic79xx
irq 4: 449 serial irq 17: 34901765 aic79xx, eth3
irq 9: 0 acpi irq 18: 89100231 acenic

vs

Linux 2.6.12-mm1 (root@newsgate) (gcc 3.3.6 ) #1 Mon Jun 20 11:13:18 CEST 2005 1CPU [newsgate.(none)]
irq 0: 3822422 timer irq 12: 3
irq 1: 8 i8042 irq 16: 82787217 acenic
irq 4: 353 serial irq 17: 4194166 aic79xx
irq 9: 0 acpi irq 18: 28628408 aic79xx, eth3





--
Police aren't effective because of their uniforms, badges, guns, or
nightsticks, they're effective because of their radios [seen on /.]

2005-09-30 11:33:50

by Danny ter Haar

[permalink] [raw]
Subject: Re: 2.6.14-rc2-git7 crashed on amd64 (usenet gateway) after 18 hours

Andrew Morton <[email protected]> wrote:
>The log buffer starts with
>scsi1 (4:0): rejecting I/O to offline device
>scsi1 (4:0): rejecting I/O to offline device
>scsi1 (4:0): rejecting I/O to offline device
>So we've probably lost the info which will tell us how the problems
>started. A serial console would be nice.

My colleque found in kern.log the real first announcement:

Sep 30 00:17:17 newsgate kernel: hw tcp v4 csum failed
Sep 30 00:20:48 newsgate kernel: hw tcp v4 csum failed
Sep 30 00:26:54 newsgate kernel: scsi1: Transmission error detected
Sep 30 00:26:54 newsgate kernel: LQISTAT1[0x8]:(LQICRCI_NLQ) LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE)
Sep 30 00:26:54 newsgate kernel: SCSISIGI[0x60]:(P_DATAIN_DT) PERRDIAG[0x4]:(CRCERR)
Sep 30 00:26:54 newsgate kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
Sep 30 00:26:54 newsgate kernel: scsi1: Dumping Card State at program address 0xf Mode 0x33
Sep 30 00:26:54 newsgate kernel: Card was paused
Sep 30 00:26:54 newsgate kernel: HS_MAILBOX[0x0] INTCTL[0xc0]:(SWTMINTEN|SWTMINTMASK)
Sep 30 00:26:54 newsgate kernel: SEQINTSTAT[0x10]:(SEQ_SWTMRTO) SAVED_MODE[0x11]
Sep 30 00:26:54 newsgate kernel: DFFSTAT[0x24]:(CURRFIFO_0|FIFO1FREE) SCSISIGI[0xb6]:(P_MESGOUT|REQI|BSYI|ATNI)
Sep 30 00:26:54 newsgate kernel: SCSIPHASE[0x4]:(MSG_OUT_PHASE) SCSIBUS[0xf7] LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE)
Sep 30 00:26:54 newsgate kernel: SCSISEQ0[0x0] SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI)
Sep 30 00:26:54 newsgate kernel: SEQCTL0[0x0] SEQINTCTL[0x0] SEQ_FLAGS[0x0] SEQ_FLAGS2[0x0]
Sep 30 00:26:54 newsgate kernel: SSTAT0[0x2]:(SPIORDY) SSTAT1[0x19]:(REQINIT|BUSFREE|PHASEMIS)
Sep 30 00:26:54 newsgate kernel: SSTAT2[0x20]:(NONPACKREQ) SSTAT3[0x0] PERRDIAG[0x0]
Sep 30 00:26:54 newsgate kernel: SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
Sep 30 00:26:54 newsgate kernel: LQISTAT0[0x0] LQISTAT1[0x8]:(LQICRCI_NLQ) LQISTAT2[0xc0]:(LQIPHASE_OUTPKT|PACKETIZED)
Sep 30 00:26:54 newsgate kernel: LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0xe1]:(LQOSTOP0|LQOPKT)
Sep 30 00:26:54 newsgate kernel:
Sep 30 00:26:54 newsgate kernel: SCB Count = 128 CMDS_PENDING = 6 LASTSCB 0x6b CURRSCB 0x14 NEXTSCB 0xff80
Sep 30 00:26:54 newsgate kernel: qinstart = 16079 qinfifonext = 16079
Sep 30 00:26:54 newsgate kernel: QINFIFO:
Sep 30 00:26:54 newsgate kernel: WAITING_TID_QUEUES:
Sep 30 00:26:54 newsgate kernel: Pending list:
Sep 30 00:26:54 newsgate kernel: 20 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x47]
Sep 30 00:26:54 newsgate kernel: 107 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x47]
Sep 30 00:26:54 newsgate kernel: 74 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x47]
Sep 30 00:26:54 newsgate kernel: 3 FIFO_USE[0x1] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x47]
Sep 30 00:26:54 newsgate kernel: 28 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xf7]:(TID)
Sep 30 00:26:54 newsgate kernel: 62 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xf7]:(TID)
Sep 30 00:26:54 newsgate kernel: Total 6
Sep 30 00:26:54 newsgate kernel: Kernel Free SCB list: 116 115 105 35 111 108 29 106 13 76 112 34 17 69 84 51 122 23 87 99 43 32 91 88 75 11 92
+40 46 30 5 102 52 70 16 41 15 97 36 0 119 19 100 126 22 54 65 39 93 55 117 50 61 121 2 25 26 60 27 45 8 123 53 57 125 78 47 66 49 4 103 101 77
+24 37 72 90 82 48 79 56 86 118 1 120 59 94 73 21 58 113 64 85 89 81 18 98 96 6 12 31 95 114 7 44 14 127 124 67 33 71 109 63 68 9 104 42 10 80 83+110 38
Sep 30 00:26:54 newsgate kernel: Sequencer Complete DMA-inprog list:
Sep 30 00:26:54 newsgate kernel: Sequencer Complete list:
Sep 30 00:26:54 newsgate kernel: Sequencer DMA-Up and Complete list:
Sep 30 00:26:54 newsgate kernel:
Sep 30 00:26:54 newsgate kernel: scsi1: FIFO0 Active, LONGJMP == 0x233, SCB 0x4a
Sep 30 00:26:54 newsgate kernel: SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
Sep 30 00:26:54 newsgate kernel: SEQINTSRC[0x0] DFCNTRL[0x8]:(HDMAEN) DFSTATUS[0xc8]:(HDONE|PKT_PRELOAD_AVAIL|PRELOAD_AVAIL)
Sep 30 00:26:54 newsgate kernel: SG_CACHE_SHADOW[0x50] SG_STATE[0x3]:(SEGS_AVAIL|LOADING_NEEDED)
Sep 30 00:26:54 newsgate kernel: DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x46]:(DATAINFIFO|DLZERO|SHCNTNEGATIVE)
Sep 30 00:26:54 newsgate kernel: SHADDR = 0x0d65c9200, SHCNT = 0xfffe00 HADDR = 0x0d65c9000, HCNT = 0x0
Sep 30 00:26:54 newsgate kernel: CCSGCTL[0x10]:(SG_CACHE_AVAIL)
Sep 30 00:26:54 newsgate kernel: scsi1: FIFO1 Free, LONGJMP == 0x823a, SCB 0x6a
Sep 30 00:26:54 newsgate kernel: SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
Sep 30 00:26:54 newsgate kernel: SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
Sep 30 00:26:54 newsgate kernel: SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
Sep 30 00:26:54 newsgate kernel: SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
Sep 30 00:26:54 newsgate kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x0]
Sep 30 00:26:54 newsgate kernel: LQIN: 0x4 0x0 0x0 0x4a 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x28 0x0 0x0 0x0 0x2 0x0
Sep 30 00:26:54 newsgate kernel: scsi1: LQISTATE = 0x2b, LQOSTATE = 0x0, OPTIONMODE = 0x52
Sep 30 00:26:54 newsgate kernel: scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x2
Sep 30 00:26:54 newsgate kernel: SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
Sep 30 00:26:54 newsgate kernel: CCSCBCTL[0x0]
Sep 30 00:26:54 newsgate kernel: scsi1: REG0 == 0x14, SINDEX = 0x150, DINDEX = 0x10a
Sep 30 00:26:54 newsgate kernel: scsi1: SCBPTR == 0x74, SCB_NEXT == 0xff40, SCB_NEXT2 == 0xff74
Sep 30 00:26:54 newsgate kernel: CDB 28 0 7 80 8 a0
Sep 30 00:26:54 newsgate kernel: STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
Sep 30 00:26:54 newsgate kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
Sep 30 00:26:54 newsgate kernel: LQICRC_NLQ
Sep 30 00:26:54 newsgate kernel: scsi1: Unexpected PKT busfree condition
Sep 30 00:26:54 newsgate kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
Sep 30 00:26:54 newsgate kernel: scsi1: Dumping Card State at program address 0xf Mode 0x33
Sep 30 00:26:54 newsgate kernel: Card was paused
Sep 30 00:26:54 newsgate kernel: HS_MAILBOX[0x0] INTCTL[0xc0]:(SWTMINTEN|SWTMINTMASK)
Sep 30 00:26:54 newsgate kernel: SEQINTSTAT[0x10]:(SEQ_SWTMRTO) SAVED_MODE[0x11]
Sep 30 00:26:54 newsgate kernel: DFFSTAT[0x24]:(CURRFIFO_0|FIFO1FREE) SCSISIGI[0xb6]:(P_MESGOUT|REQI|BSYI|ATNI)
Sep 30 00:26:54 newsgate kernel: SCSIPHASE[0x4]:(MSG_OUT_PHASE) SCSIBUS[0xf7] LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE)
Sep 30 00:26:54 newsgate kernel: SCSISEQ0[0x0] SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI)
Sep 30 00:26:54 newsgate kernel: SEQCTL0[0x0] SEQINTCTL[0x0] SEQ_FLAGS[0x0] SEQ_FLAGS2[0x0]
Sep 30 00:26:54 newsgate kernel: SSTAT0[0x2]:(SPIORDY) SSTAT1[0x19]:(REQINIT|BUSFREE|PHASEMIS)
Sep 30 00:26:54 newsgate kernel: SSTAT2[0x20]:(NONPACKREQ) SSTAT3[0x0] PERRDIAG[0x0]
Sep 30 00:26:54 newsgate kernel: SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
Sep 30 00:26:54 newsgate kernel: LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0xc0]:(LQIPHASE_OUTPKT|PACKETIZED)
Sep 30 00:26:54 newsgate kernel: LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0xe1]:(LQOSTOP0|LQOPKT)
Sep 30 00:26:54 newsgate kernel:
Sep 30 00:26:54 newsgate kernel: SCB Count = 128 CMDS_PENDING = 6 LASTSCB 0x6b CURRSCB 0x14 NEXTSCB 0xff80
Sep 30 00:26:54 newsgate kernel: qinstart = 16079 qinfifonext = 16079
Sep 30 00:26:54 newsgate kernel: QINFIFO:
Sep 30 00:26:54 newsgate kernel: WAITING_TID_QUEUES:
Sep 30 00:26:54 newsgate kernel: Pending list:
Sep 30 00:26:54 newsgate kernel: 20 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x47]
Sep 30 00:26:54 newsgate kernel: 107 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x47]
Sep 30 00:26:54 newsgate kernel: 74 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x47]
Sep 30 00:26:54 newsgate kernel: 3 FIFO_USE[0x1] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x47]
Sep 30 00:26:54 newsgate kernel: 28 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xf7]:(TID)
Sep 30 00:26:54 newsgate kernel: 62 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xf7]:(TID)
Sep 30 00:26:54 newsgate kernel: Total 6
Sep 30 00:26:54 newsgate kernel: Kernel Free SCB list: 116 115 105 35 111 108 29 106 13 76 112 34 17 69 84 51 122 23 87 99 43 32 91 88
+75 11 92 40 46 30 5 102 52 70 16 41 15 97 36 0 119 19 100 126 22 54 65 39 93 55 117 50 61 121 2 25 26 60 27 45 8 123 53 57 125 78 47
+66 49 4 103 101 77 24 37 72 90 82 48 79 56 86 118 1 120 59 94 73 21 58 113 64 85 89 81 18 98 96 6 12 31 95 114 7 44 14 127 124 67 33
+71 109 63 68 9 104 42 10 80 83 110 38
Sep 30 00:26:54 newsgate kernel: Sequencer Complete DMA-inprog list:
Sep 30 00:26:54 newsgate kernel: Sequencer Complete list:
Sep 30 00:26:54 newsgate kernel: Sequencer DMA-Up and Complete list:
Sep 30 00:26:54 newsgate kernel:
Sep 30 00:26:54 newsgate kernel: scsi1: FIFO0 Active, LONGJMP == 0x233, SCB 0x4a
Sep 30 00:26:54 newsgate kernel: SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
Sep 30 00:26:54 newsgate kernel: SEQINTSRC[0x0] DFCNTRL[0x8]:(HDMAEN) DFSTATUS[0xc8]:(HDONE|PKT_PRELOAD_AVAIL|PRELOAD_AVAIL)
Sep 30 00:26:54 newsgate kernel: SG_CACHE_SHADOW[0x50] SG_STATE[0x3]:(SEGS_AVAIL|LOADING_NEEDED)
Sep 30 00:26:54 newsgate kernel: DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x46]:(DATAINFIFO|DLZERO|SHCNTNEGATIVE)
Sep 30 00:26:54 newsgate kernel: SHADDR = 0x0d65c9200, SHCNT = 0xfffe00 HADDR = 0x0d65c9000, HCNT = 0x0
Sep 30 00:26:54 newsgate kernel: CCSGCTL[0x10]:(SG_CACHE_AVAIL)
Sep 30 00:26:54 newsgate kernel: scsi1: FIFO1 Free, LONGJMP == 0x823a, SCB 0x6a
Sep 30 00:26:54 newsgate kernel: SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
Sep 30 00:26:54 newsgate kernel: SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
Sep 30 00:26:54 newsgate kernel: SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
Sep 30 00:26:54 newsgate kernel: SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
Sep 30 00:26:54 newsgate kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x0]
Sep 30 00:26:54 newsgate kernel: LQIN: 0x4 0x0 0x0 0x4a 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x28 0x0 0x0 0x0 0x2 0x0
Sep 30 00:26:54 newsgate kernel: scsi1: LQISTATE = 0x2b, LQOSTATE = 0x0, OPTIONMODE = 0x52
Sep 30 00:26:54 newsgate kernel: scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x2
Sep 30 00:26:54 newsgate kernel: SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
Sep 30 00:26:54 newsgate kernel: CCSCBCTL[0x0]
Sep 30 00:26:54 newsgate kernel: scsi1: REG0 == 0x14, SINDEX = 0x150, DINDEX = 0x10a
Sep 30 00:26:54 newsgate kernel: scsi1: SCBPTR == 0x74, SCB_NEXT == 0xff40, SCB_NEXT2 == 0xff74
Sep 30 00:26:54 newsgate kernel: CDB 28 0 7 80 8 a0
Sep 30 00:26:54 newsgate kernel: STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
Sep 30 00:26:54 newsgate kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
Sep 30 00:27:28 newsgate kernel: scsi1:0:15:0: Attempting to queue an ABORT message:CDB: 0x28 0x0 0x9 0xd4 0xb 0xd7 0x0 0x0 0x80 0x0
Sep 30 00:27:28 newsgate kernel: scsi1: At time of recovery, card was not paused
Sep 30 00:27:28 newsgate kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
Sep 30 00:27:28 newsgate kernel: scsi1: Dumping Card State at program address 0x25 Mode 0x11
Sep 30 00:27:28 newsgate kernel: Card was paused
Sep 30 00:27:28 newsgate kernel: HS_MAILBOX[0x0] INTCTL[0xc0]:(SWTMINTEN|SWTMINTMASK)
Sep 30 00:27:28 newsgate kernel: SEQINTSTAT[0x10]:(SEQ_SWTMRTO) SAVED_MODE[0x33]
Sep 30 00:27:28 newsgate kernel: DFFSTAT[0x34]:(CURRFIFO_0|FIFO0FREE|FIFO1FREE)
Sep 30 00:27:28 newsgate kernel: SCSISIGI[0x66]:(P_DATAIN_DT|REQI|BSYI) SCSIPHASE[0x0]
Sep 30 00:27:28 newsgate kernel: SCSIBUS[0x9b] LASTPHASE[0x60]:(P_DATAIN_DT) SCSISEQ0[0x0]
Sep 30 00:27:28 newsgate kernel: SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) SEQCTL0[0x0]
Sep 30 00:27:28 newsgate kernel: SEQINTCTL[0x0] SEQ_FLAGS[0x20]:(DPHASE) SEQ_FLAGS2[0x0]
Sep 30 00:27:28 newsgate kernel: SSTAT0[0x0] SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0]
Sep 30 00:27:28 newsgate kernel: PERRDIAG[0x0] SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
Sep 30 00:27:28 newsgate kernel: LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0xc0]:(LQIPHASE_OUTPKT|PACKETIZED)
Sep 30 00:27:28 newsgate kernel: LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0xe1]:(LQOSTOP0|LQOPKT)

etc...

Hope this gives more info on the crash..

Danny

2005-09-30 12:44:52

by Sander

[permalink] [raw]
Subject: Re: 2.6.14-rc2-git7 crashed on amd64 (usenet gateway) after 18 hours

Danny ter Haar wrote (ao):
> Quoting Andrew Morton ([email protected]):
> > So we've probably lost the info which will tell us how the problems
> > started. A serial console would be nice.
>
> This is from serial console.
> Unfortunatly it's a portmaster2 which has no buffer.
> What i do is start a screen, do a telnet to the portmaster
> and attach the serial console. I can't go back anyfurther
> than this since the output of the new kernel booting replaced it.
> Mayby i can tell screen to have more lines of history ?!

This might help?

>From 'man screen':

C-a H (log) Begins/ends logging of the current window to
the file "screenlog.n".

--
Humilis IT Services and Solutions
http://www.humilis.net

2005-09-30 16:22:31

by Danny ter Haar

[permalink] [raw]
Subject: Re: 2.6.14-rc2-git7 crashed on amd64 (usenet gateway) after 18 hours

Sander <[email protected]> wrote:
>This might help?
>From 'man screen':
> C-a H (log) Begins/ends logging of the current window to
> the file "screenlog.n".

*thanks*

done so.

If it crashes i know now at leas i will have _all_ output!

git8 stil running:
-------------
newsgate:~# procinfo
Linux 2.6.14-rc2-git8 (root@newsgate) (gcc 4.0.2 ) #1 1CPU [newsgate.(none)]

Memory: Total Used Free Shared Buffers
Mem: 4062040 4041212 20828 0 568
Swap: 0 0 0

Bootup: Fri Sep 30 10:21:34 2005 Load average: 4.73 4.60 4.47 5/74 23214

user : 0:57:57.56 12.1% page in : 0
nice : 0:08:26.32 1.8% page out: 0
system: 2:23:25.64 29.9% swap in : 0
idle : 0:01:16.27 0.3% swap out: 0
uptime: 7:59:59.89 context : 77591356

irq 0: 7195064 timer irq 12: 3
irq 1: 8 i8042 irq 16: 7430483 aic79xx
irq 4: 400 serial irq 17: 73394066 aic79xx, eth3
irq 9: 0 acpi irq 18: 138587870 acenic

--------------

Danny