2005-09-02 12:54:41

by Peter Williams

[permalink] [raw]
Subject: 2.6.13-mm1: hangs during boot ...

... at the the point indicated by the following output:

[ 8.197224] Freeing unused kernel memory: 288k freed
[ 8.428217] SCSI subsystem initialized
[ 8.510376] sym0: <810a> rev 0x23 at pci 0000:00:08.0 irq 11
[ 8.587731] sym0: No NVRAM, ID 7, Fast-10, SE, parity checking
[ 8.671531] sym0: SCSI BUS has been reset.
[ 8.725530] scsi0 : sym-2.2.1
[ 17.256480] 0:0:0:0: ABORT operation started.
[ 22.323534] 0:0:0:0: ABORT operation timed-out.
[ 22.384348] 0:0:0:0: DEVICE RESET operation started.
[ 27.458702] 0:0:0:0: DEVICE RESET operation timed-out.
[ 27.527544] 0:0:0:0: BUS RESET operation started.
[ 32.533775] 0:0:0:0: BUS RESET operation timed-out.
[ 32.599173] 0:0:0:0: HOST RESET operation started.
[ 32.669659] sym0: SCSI BUS has been reset.

Peter
--
Peter Williams [email protected]

"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce


2005-09-02 20:13:06

by Andrew Morton

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Peter Williams <[email protected]> wrote:
>
> ... at the the point indicated by the following output:
>
> [ 8.197224] Freeing unused kernel memory: 288k freed
> [ 8.428217] SCSI subsystem initialized
> [ 8.510376] sym0: <810a> rev 0x23 at pci 0000:00:08.0 irq 11
> [ 8.587731] sym0: No NVRAM, ID 7, Fast-10, SE, parity checking
> [ 8.671531] sym0: SCSI BUS has been reset.
> [ 8.725530] scsi0 : sym-2.2.1
> [ 17.256480] 0:0:0:0: ABORT operation started.
> [ 22.323534] 0:0:0:0: ABORT operation timed-out.
> [ 22.384348] 0:0:0:0: DEVICE RESET operation started.
> [ 27.458702] 0:0:0:0: DEVICE RESET operation timed-out.
> [ 27.527544] 0:0:0:0: BUS RESET operation started.
> [ 32.533775] 0:0:0:0: BUS RESET operation timed-out.
> [ 32.599173] 0:0:0:0: HOST RESET operation started.
> [ 32.669659] sym0: SCSI BUS has been reset.
>

Is there no response from sysrq-T?

Maybe adding initcall_debug to the boot command line will show extra info?

The .config would be useful, thanks.

2005-09-03 03:51:39

by Peter Williams

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

#
# Automatically generated make config: don't edit
# Linux kernel version: 2.6.13-mm1-PS
# Fri Sep 2 21:22:06 2005
#
CONFIG_X86=y
CONFIG_SEMAPHORE_SLEEPERS=y
CONFIG_MMU=y
CONFIG_UID16=y
CONFIG_GENERIC_ISA_DMA=y
CONFIG_GENERIC_IOMAP=y

#
# Code maturity level options
#
CONFIG_EXPERIMENTAL=y
CONFIG_CLEAN_COMPILE=y
CONFIG_LOCK_KERNEL=y
CONFIG_INIT_ENV_ARG_LIMIT=32

#
# General setup
#
CONFIG_LOCALVERSION=""
CONFIG_LOCALVERSION_AUTO=y
CONFIG_SWAP=y
CONFIG_SYSVIPC=y
# CONFIG_POSIX_MQUEUE is not set
CONFIG_BSD_PROCESS_ACCT=y
# CONFIG_BSD_PROCESS_ACCT_V3 is not set
CONFIG_SYSCTL=y
# CONFIG_AUDIT is not set
CONFIG_HOTPLUG=y
CONFIG_KOBJECT_UEVENT=y
# CONFIG_IKCONFIG is not set
CONFIG_CPUSETS=y
CONFIG_INITRAMFS_SOURCE=""
# CONFIG_EMBEDDED is not set
CONFIG_KALLSYMS=y
CONFIG_KALLSYMS_ALL=y
CONFIG_KALLSYMS_EXTRA_PASS=y
CONFIG_PRINTK=y
CONFIG_BUG=y
CONFIG_BASE_FULL=y
CONFIG_FUTEX=y
CONFIG_EPOLL=y
CONFIG_SHMEM=y
CONFIG_CC_ALIGN_FUNCTIONS=0
CONFIG_CC_ALIGN_LABELS=0
CONFIG_CC_ALIGN_LOOPS=0
CONFIG_CC_ALIGN_JUMPS=0
# CONFIG_TINY_SHMEM is not set
CONFIG_BASE_SMALL=0

#
# Loadable module support
#
CONFIG_MODULES=y
CONFIG_MODULE_UNLOAD=y
# CONFIG_MODULE_FORCE_UNLOAD is not set
CONFIG_OBSOLETE_MODPARM=y
# CONFIG_MODVERSIONS is not set
# CONFIG_MODULE_SRCVERSION_ALL is not set
CONFIG_KMOD=y
CONFIG_STOP_MACHINE=y

#
# Processor type and features
#
# CONFIG_X86_PC is not set
# CONFIG_X86_ELAN is not set
# CONFIG_X86_VOYAGER is not set
# CONFIG_X86_NUMAQ is not set
# CONFIG_X86_SUMMIT is not set
# CONFIG_X86_BIGSMP is not set
# CONFIG_X86_VISWS is not set
CONFIG_X86_GENERICARCH=y
# CONFIG_X86_ES7000 is not set
CONFIG_X86_CYCLONE_TIMER=y
# CONFIG_M386 is not set
# CONFIG_M486 is not set
# CONFIG_M586 is not set
# CONFIG_M586TSC is not set
# CONFIG_M586MMX is not set
# CONFIG_M686 is not set
CONFIG_MPENTIUMII=y
# CONFIG_MPENTIUMIII is not set
# CONFIG_MPENTIUMM is not set
# CONFIG_MPENTIUM4 is not set
# CONFIG_MK6 is not set
# CONFIG_MK7 is not set
# CONFIG_MK8 is not set
# CONFIG_MCRUSOE is not set
# CONFIG_MEFFICEON is not set
# CONFIG_MWINCHIPC6 is not set
# CONFIG_MWINCHIP2 is not set
# CONFIG_MWINCHIP3D is not set
# CONFIG_MGEODEGX1 is not set
# CONFIG_MCYRIXIII is not set
# CONFIG_MVIAC3_2 is not set
CONFIG_X86_GENERIC=y
CONFIG_X86_CMPXCHG=y
CONFIG_X86_XADD=y
CONFIG_X86_L1_CACHE_SHIFT=7
CONFIG_RWSEM_XCHGADD_ALGORITHM=y
CONFIG_GENERIC_CALIBRATE_DELAY=y
CONFIG_X86_WP_WORKS_OK=y
CONFIG_X86_INVLPG=y
CONFIG_X86_BSWAP=y
CONFIG_X86_POPAD_OK=y
CONFIG_X86_GOOD_APIC=y
CONFIG_X86_INTEL_USERCOPY=y
CONFIG_X86_USE_PPRO_CHECKSUM=y
CONFIG_HPET_TIMER=y
CONFIG_HPET_EMULATE_RTC=y
CONFIG_SMP=y
CONFIG_NR_CPUS=32
# CONFIG_SCHED_SMT is not set
# CONFIG_PREEMPT_NONE is not set
CONFIG_PREEMPT_VOLUNTARY=y
# CONFIG_PREEMPT is not set
CONFIG_PREEMPT_BKL=y
CONFIG_X86_LOCAL_APIC=y
CONFIG_X86_IO_APIC=y
CONFIG_X86_TSC=y
CONFIG_X86_MCE=y
# CONFIG_X86_MCE_NONFATAL is not set
# CONFIG_X86_MCE_P4THERMAL is not set
# CONFIG_TOSHIBA is not set
# CONFIG_I8K is not set
# CONFIG_X86_REBOOTFIXUPS is not set
# CONFIG_MICROCODE is not set
# CONFIG_X86_MSR is not set
# CONFIG_X86_CPUID is not set

#
# Firmware Drivers
#
# CONFIG_EDD is not set
# CONFIG_DELL_RBU is not set
CONFIG_DCDBAS=m
# CONFIG_NOHIGHMEM is not set
CONFIG_HIGHMEM4G=y
# CONFIG_HIGHMEM64G is not set
CONFIG_HIGHMEM=y
CONFIG_SELECT_MEMORY_MODEL=y
CONFIG_FLATMEM_MANUAL=y
# CONFIG_DISCONTIGMEM_MANUAL is not set
# CONFIG_SPARSEMEM_MANUAL is not set
CONFIG_FLATMEM=y
CONFIG_FLAT_NODE_MEM_MAP=y
# CONFIG_SPARSEMEM_STATIC is not set
CONFIG_HIGHPTE=y
# CONFIG_MATH_EMULATION is not set
CONFIG_MTRR=y
# CONFIG_EFI is not set
# CONFIG_IRQBALANCE is not set
CONFIG_HAVE_DEC_LOCK=y
CONFIG_ATOMIC_TABLE_OPS=y
CONFIG_REGPARM=y
CONFIG_SECCOMP=y
# CONFIG_HZ_100 is not set
CONFIG_HZ_250=y
# CONFIG_HZ_1000 is not set
CONFIG_HZ=250
CONFIG_PHYSICAL_START=0x100000
# CONFIG_KEXEC is not set

#
# Performance-monitoring counters support
#
CONFIG_PERFCTR=y
# CONFIG_PERFCTR_INIT_TESTS is not set
CONFIG_PERFCTR_VIRTUAL=y
CONFIG_PERFCTR_INTERRUPT_SUPPORT=y
CONFIG_PERFCTR_CPUS_FORBIDDEN_MASK=y

#
# Power management options (ACPI, APM)
#
CONFIG_PM=y
# CONFIG_PM_DEBUG is not set
# CONFIG_SOFTWARE_SUSPEND is not set
CONFIG_SUSPEND_SMP=y

#
# ACPI (Advanced Configuration and Power Interface) Support
#
CONFIG_ACPI=y
CONFIG_ACPI_SLEEP=y
CONFIG_ACPI_SLEEP_PROC_FS=y
# CONFIG_ACPI_SLEEP_PROC_SLEEP is not set
# CONFIG_ACPI_AC is not set
# CONFIG_ACPI_BATTERY is not set
CONFIG_ACPI_BUTTON=m
CONFIG_ACPI_VIDEO=m
CONFIG_ACPI_HOTKEY=m
CONFIG_ACPI_FAN=y
CONFIG_ACPI_PROCESSOR=y
CONFIG_ACPI_HOTPLUG_CPU=y
CONFIG_ACPI_THERMAL=y
# CONFIG_ACPI_ASUS is not set
CONFIG_ACPI_IBM=m
# CONFIG_ACPI_TOSHIBA is not set
CONFIG_ACPI_BLACKLIST_YEAR=0
# CONFIG_ACPI_DEBUG is not set
CONFIG_ACPI_EC=y
CONFIG_ACPI_POWER=y
CONFIG_ACPI_SYSTEM=y
# CONFIG_X86_PM_TIMER is not set
CONFIG_ACPI_CONTAINER=y

#
# APM (Advanced Power Management) BIOS Support
#
CONFIG_APM=y
# CONFIG_APM_IGNORE_USER_SUSPEND is not set
# CONFIG_APM_DO_ENABLE is not set
CONFIG_APM_CPU_IDLE=y
# CONFIG_APM_DISPLAY_BLANK is not set
CONFIG_APM_RTC_IS_GMT=y
# CONFIG_APM_ALLOW_INTS is not set
# CONFIG_APM_REAL_MODE_POWER_OFF is not set

#
# CPU Frequency scaling
#
# CONFIG_CPU_FREQ is not set

#
# Bus options (PCI, PCMCIA, EISA, MCA, ISA)
#
CONFIG_PCI=y
# CONFIG_PCI_GOBIOS is not set
# CONFIG_PCI_GOMMCONFIG is not set
# CONFIG_PCI_GODIRECT is not set
CONFIG_PCI_GOANY=y
CONFIG_PCI_BIOS=y
CONFIG_PCI_DIRECT=y
CONFIG_PCI_MMCONFIG=y
# CONFIG_PCIEPORTBUS is not set
# CONFIG_PCI_MSI is not set
CONFIG_PCI_LEGACY_PROC=y
# CONFIG_PCI_DEBUG is not set
CONFIG_ISA_DMA_API=y
CONFIG_ISA=y
CONFIG_EISA=y
CONFIG_EISA_PCI_EISA=y
CONFIG_EISA_NAMES=y
# CONFIG_MCA is not set
# CONFIG_SCx200 is not set
CONFIG_HOTPLUG_CPU=y

#
# PCCARD (PCMCIA/CardBus) support
#
# CONFIG_PCCARD is not set

#
# PCI Hotplug Support
#
# CONFIG_HOTPLUG_PCI is not set

#
# Executable file formats
#
CONFIG_BINFMT_ELF=y
# CONFIG_BINFMT_AOUT is not set
CONFIG_BINFMT_MISC=m

#
# Networking
#
CONFIG_NET=y

#
# Networking options
#
CONFIG_PACKET=m
# CONFIG_PACKET_MMAP is not set
CONFIG_UNIX=y
CONFIG_XFRM=y
# CONFIG_XFRM_USER is not set
# CONFIG_NET_KEY is not set
CONFIG_INET=y
CONFIG_IP_MULTICAST=y
# CONFIG_IP_ADVANCED_ROUTER is not set
CONFIG_IP_FIB_HASH=y
# CONFIG_IP_PNP is not set
# CONFIG_NET_IPIP is not set
# CONFIG_NET_IPGRE is not set
# CONFIG_IP_MROUTE is not set
# CONFIG_ARPD is not set
CONFIG_SYN_COOKIES=y
CONFIG_INET_AH=m
CONFIG_INET_ESP=m
CONFIG_INET_IPCOMP=m
CONFIG_INET_TUNNEL=m
CONFIG_INET_DIAG=y
CONFIG_INET_TCP_DIAG=y
CONFIG_TCP_CONG_ADVANCED=y

#
# TCP congestion control
#
CONFIG_TCP_CONG_BIC=y
CONFIG_TCP_CONG_WESTWOOD=y
CONFIG_TCP_CONG_HTCP=y
# CONFIG_TCP_CONG_HSTCP is not set
# CONFIG_TCP_CONG_HYBLA is not set
# CONFIG_TCP_CONG_VEGAS is not set
# CONFIG_TCP_CONG_SCALABLE is not set
# CONFIG_IPV6 is not set
# CONFIG_NETFILTER is not set

#
# DCCP Configuration (EXPERIMENTAL)
#
# CONFIG_IP_DCCP is not set

#
# SCTP Configuration (EXPERIMENTAL)
#
# CONFIG_IP_SCTP is not set
# CONFIG_ATM is not set
# CONFIG_BRIDGE is not set
# CONFIG_VLAN_8021Q is not set
# CONFIG_DECNET is not set
# CONFIG_LLC2 is not set
# CONFIG_IPX is not set
# CONFIG_ATALK is not set
# CONFIG_X25 is not set
# CONFIG_LAPB is not set
# CONFIG_NET_DIVERT is not set
# CONFIG_ECONET is not set
# CONFIG_WAN_ROUTER is not set
# CONFIG_NET_SCHED is not set
# CONFIG_NET_CLS_ROUTE is not set

#
# Network testing
#
# CONFIG_NET_PKTGEN is not set
# CONFIG_NETFILTER_NETLINK is not set
# CONFIG_HAMRADIO is not set
# CONFIG_IRDA is not set
# CONFIG_BT is not set
# CONFIG_IEEE80211 is not set

#
# Device Drivers
#

#
# Generic Driver Options
#
CONFIG_STANDALONE=y
CONFIG_PREVENT_FIRMWARE_BUILD=y
# CONFIG_FW_LOADER is not set
# CONFIG_DEBUG_DRIVER is not set

#
# Memory Technology Devices (MTD)
#
# CONFIG_MTD is not set

#
# Parallel port support
#
CONFIG_PARPORT=m
CONFIG_PARPORT_PC=m
CONFIG_PARPORT_SERIAL=m
# CONFIG_PARPORT_PC_FIFO is not set
# CONFIG_PARPORT_PC_SUPERIO is not set
# CONFIG_PARPORT_GSC is not set
CONFIG_PARPORT_1284=y

#
# Plug and Play support
#
CONFIG_PNP=y
# CONFIG_PNP_DEBUG is not set

#
# Protocols
#
CONFIG_ISAPNP=y
# CONFIG_PNPBIOS is not set
CONFIG_PNPACPI=y

#
# Block devices
#
CONFIG_BLK_DEV_FD=m
# CONFIG_BLK_DEV_XD is not set
# CONFIG_PARIDE is not set
# CONFIG_BLK_CPQ_DA is not set
# CONFIG_BLK_CPQ_CISS_DA is not set
# CONFIG_BLK_DEV_DAC960 is not set
# CONFIG_BLK_DEV_UMEM is not set
# CONFIG_BLK_DEV_COW_COMMON is not set
CONFIG_BLK_DEV_LOOP=y
CONFIG_BLK_DEV_CRYPTOLOOP=m
# CONFIG_BLK_DEV_NBD is not set
# CONFIG_BLK_DEV_SX8 is not set
CONFIG_BLK_DEV_RAM=y
CONFIG_BLK_DEV_RAM_COUNT=16
CONFIG_BLK_DEV_RAM_SIZE=16384
CONFIG_BLK_DEV_INITRD=y
# CONFIG_LBD is not set
# CONFIG_CDROM_PKTCDVD is not set

#
# IO Schedulers
#
CONFIG_IOSCHED_NOOP=y
CONFIG_IOSCHED_AS=y
CONFIG_IOSCHED_DEADLINE=y
CONFIG_IOSCHED_CFQ=y
# CONFIG_ATA_OVER_ETH is not set

#
# ATA/ATAPI/MFM/RLL support
#
CONFIG_IDE=y
CONFIG_BLK_DEV_IDE=y

#
# Please see Documentation/ide.txt for help/info on IDE drives
#
# CONFIG_BLK_DEV_IDE_SATA is not set
# CONFIG_BLK_DEV_HD_IDE is not set
CONFIG_BLK_DEV_IDEDISK=y
CONFIG_IDEDISK_MULTI_MODE=y
CONFIG_BLK_DEV_IDECD=y
# CONFIG_BLK_DEV_IDETAPE is not set
CONFIG_BLK_DEV_IDEFLOPPY=y
# CONFIG_BLK_DEV_IDESCSI is not set
# CONFIG_IDE_TASK_IOCTL is not set

#
# IDE chipset support/bugfixes
#
CONFIG_IDE_GENERIC=y
# CONFIG_BLK_DEV_CMD640 is not set
CONFIG_BLK_DEV_IDEPNP=y
CONFIG_BLK_DEV_IDEPCI=y
CONFIG_IDEPCI_SHARE_IRQ=y
# CONFIG_BLK_DEV_OFFBOARD is not set
CONFIG_BLK_DEV_GENERIC=y
# CONFIG_BLK_DEV_OPTI621 is not set
CONFIG_BLK_DEV_RZ1000=y
CONFIG_BLK_DEV_IDEDMA_PCI=y
# CONFIG_BLK_DEV_IDEDMA_FORCED is not set
CONFIG_IDEDMA_PCI_AUTO=y
# CONFIG_IDEDMA_ONLYDISK is not set
# CONFIG_BLK_DEV_AEC62XX is not set
# CONFIG_BLK_DEV_ALI15X3 is not set
# CONFIG_BLK_DEV_AMD74XX is not set
# CONFIG_BLK_DEV_ATIIXP is not set
# CONFIG_BLK_DEV_CMD64X is not set
# CONFIG_BLK_DEV_TRIFLEX is not set
# CONFIG_BLK_DEV_CY82C693 is not set
# CONFIG_BLK_DEV_CS5520 is not set
# CONFIG_BLK_DEV_CS5530 is not set
# CONFIG_BLK_DEV_HPT34X is not set
# CONFIG_BLK_DEV_HPT366 is not set
# CONFIG_BLK_DEV_SC1200 is not set
CONFIG_BLK_DEV_PIIX=y
# CONFIG_BLK_DEV_IT821X is not set
# CONFIG_BLK_DEV_NS87415 is not set
# CONFIG_BLK_DEV_PDC202XX_OLD is not set
# CONFIG_BLK_DEV_PDC202XX_NEW is not set
# CONFIG_BLK_DEV_SVWKS is not set
# CONFIG_BLK_DEV_SIIMAGE is not set
# CONFIG_BLK_DEV_SIS5513 is not set
# CONFIG_BLK_DEV_SLC90E66 is not set
# CONFIG_BLK_DEV_TRM290 is not set
# CONFIG_BLK_DEV_VIA82CXXX is not set
# CONFIG_IDE_ARM is not set
# CONFIG_IDE_CHIPSETS is not set
CONFIG_BLK_DEV_IDEDMA=y
# CONFIG_IDEDMA_IVB is not set
CONFIG_IDEDMA_AUTO=y
# CONFIG_BLK_DEV_HD is not set

#
# SCSI device support
#
# CONFIG_RAID_ATTRS is not set
CONFIG_SCSI=m
CONFIG_SCSI_PROC_FS=y

#
# SCSI support type (disk, tape, CD-ROM)
#
CONFIG_BLK_DEV_SD=m
# CONFIG_CHR_DEV_ST is not set
# CONFIG_CHR_DEV_OSST is not set
CONFIG_BLK_DEV_SR=m
# CONFIG_BLK_DEV_SR_VENDOR is not set
# CONFIG_CHR_DEV_SG is not set
CONFIG_CHR_DEV_SCH=m

#
# Some SCSI devices (e.g. CD jukebox) support multiple LUNs
#
# CONFIG_SCSI_MULTI_LUN is not set
# CONFIG_SCSI_CONSTANTS is not set
# CONFIG_SCSI_LOGGING is not set

#
# SCSI Transport Attributes
#
CONFIG_SCSI_SPI_ATTRS=m
CONFIG_SCSI_FC_ATTRS=m
# CONFIG_SCSI_ISCSI_ATTRS is not set

#
# SCSI low-level drivers
#
# CONFIG_BLK_DEV_3W_XXXX_RAID is not set
# CONFIG_SCSI_3W_9XXX is not set
# CONFIG_SCSI_ARCMSR is not set
# CONFIG_SCSI_7000FASST is not set
# CONFIG_SCSI_ACARD is not set
# CONFIG_SCSI_AHA152X is not set
# CONFIG_SCSI_AHA1542 is not set
# CONFIG_SCSI_AHA1740 is not set
# CONFIG_SCSI_AACRAID is not set
# CONFIG_SCSI_AIC7XXX is not set
# CONFIG_SCSI_AIC7XXX_OLD is not set
# CONFIG_SCSI_AIC79XX is not set
# CONFIG_SCSI_DPT_I2O is not set
# CONFIG_SCSI_IN2000 is not set
# CONFIG_MEGARAID_NEWGEN is not set
# CONFIG_MEGARAID_LEGACY is not set
# CONFIG_SCSI_SATA is not set
# CONFIG_SCSI_BUSLOGIC is not set
# CONFIG_SCSI_DMX3191D is not set
# CONFIG_SCSI_DTC3280 is not set
# CONFIG_SCSI_EATA is not set
# CONFIG_SCSI_FUTURE_DOMAIN is not set
# CONFIG_SCSI_GDTH is not set
# CONFIG_SCSI_GENERIC_NCR5380 is not set
# CONFIG_SCSI_GENERIC_NCR5380_MMIO is not set
# CONFIG_SCSI_IPS is not set
# CONFIG_SCSI_INITIO is not set
# CONFIG_SCSI_INIA100 is not set
# CONFIG_SCSI_PPA is not set
# CONFIG_SCSI_IMM is not set
# CONFIG_SCSI_NCR53C406A is not set
CONFIG_SCSI_SYM53C8XX_2=m
CONFIG_SCSI_SYM53C8XX_DMA_ADDRESSING_MODE=1
CONFIG_SCSI_SYM53C8XX_DEFAULT_TAGS=16
CONFIG_SCSI_SYM53C8XX_MAX_TAGS=64
# CONFIG_SCSI_SYM53C8XX_IOMAPPED is not set
# CONFIG_SCSI_IPR is not set
# CONFIG_SCSI_PAS16 is not set
# CONFIG_SCSI_PSI240I is not set
# CONFIG_SCSI_QLOGIC_FAS is not set
# CONFIG_SCSI_QLOGIC_FC is not set
# CONFIG_SCSI_QLOGIC_1280 is not set
CONFIG_SCSI_QLA2XXX=m
# CONFIG_SCSI_QLA21XX is not set
# CONFIG_SCSI_QLA22XX is not set
# CONFIG_SCSI_QLA2300 is not set
# CONFIG_SCSI_QLA2322 is not set
# CONFIG_SCSI_QLA6312 is not set
# CONFIG_SCSI_QLA24XX is not set
# CONFIG_SCSI_LPFC is not set
# CONFIG_SCSI_SIM710 is not set
# CONFIG_SCSI_SYM53C416 is not set
# CONFIG_SCSI_DC395x is not set
# CONFIG_SCSI_DC390T is not set
# CONFIG_SCSI_T128 is not set
# CONFIG_SCSI_U14_34F is not set
# CONFIG_SCSI_ULTRASTOR is not set
# CONFIG_SCSI_NSP32 is not set
# CONFIG_SCSI_DEBUG is not set

#
# Old CD-ROM drivers (not SCSI, not IDE)
#
# CONFIG_CD_NO_IDESCSI is not set

#
# Multi-device support (RAID and LVM)
#
# CONFIG_MD is not set

#
# Fusion MPT device support
#
# CONFIG_FUSION is not set
# CONFIG_FUSION_SPI is not set
# CONFIG_FUSION_FC is not set

#
# IEEE 1394 (FireWire) support
#
# CONFIG_IEEE1394 is not set

#
# I2O device support
#
CONFIG_I2O=m
CONFIG_I2O_EXT_ADAPTEC=y
CONFIG_I2O_CONFIG=m
CONFIG_I2O_CONFIG_OLD_IOCTL=y
# CONFIG_I2O_BUS is not set
CONFIG_I2O_BLOCK=m
CONFIG_I2O_SCSI=m
CONFIG_I2O_PROC=m

#
# Network device support
#
CONFIG_NETDEVICES=y
CONFIG_DUMMY=m
# CONFIG_BONDING is not set
# CONFIG_EQUALIZER is not set
# CONFIG_TUN is not set
# CONFIG_NET_SB1000 is not set

#
# ARCnet devices
#
# CONFIG_ARCNET is not set

#
# PHY device support
#
# CONFIG_PHYLIB is not set

#
# Ethernet (10 or 100Mbit)
#
CONFIG_NET_ETHERNET=y
CONFIG_MII=m
# CONFIG_HAPPYMEAL is not set
# CONFIG_SUNGEM is not set
# CONFIG_NET_VENDOR_3COM is not set
# CONFIG_LANCE is not set
# CONFIG_NET_VENDOR_SMC is not set
# CONFIG_NET_VENDOR_RACAL is not set

#
# Tulip family network device support
#
# CONFIG_NET_TULIP is not set
# CONFIG_AT1700 is not set
# CONFIG_DEPCA is not set
# CONFIG_HP100 is not set
# CONFIG_NET_ISA is not set
CONFIG_NET_PCI=y
# CONFIG_PCNET32 is not set
# CONFIG_AMD8111_ETH is not set
# CONFIG_ADAPTEC_STARFIRE is not set
# CONFIG_AC3200 is not set
# CONFIG_APRICOT is not set
# CONFIG_B44 is not set
# CONFIG_FORCEDETH is not set
# CONFIG_CS89x0 is not set
# CONFIG_DGRS is not set
# CONFIG_EEPRO100 is not set
# CONFIG_E100 is not set
# CONFIG_LNE390 is not set
# CONFIG_FEALNX is not set
CONFIG_NATSEMI=m
# CONFIG_NE2K_PCI is not set
# CONFIG_NE3210 is not set
# CONFIG_ES3210 is not set
# CONFIG_8139CP is not set
# CONFIG_8139TOO is not set
# CONFIG_SIS900 is not set
# CONFIG_EPIC100 is not set
# CONFIG_SUNDANCE is not set
# CONFIG_TLAN is not set
# CONFIG_VIA_RHINE is not set
# CONFIG_NET_POCKET is not set

#
# Ethernet (1000 Mbit)
#
# CONFIG_ACENIC is not set
# CONFIG_DL2K is not set
# CONFIG_E1000 is not set
CONFIG_NS83820=m
# CONFIG_HAMACHI is not set
# CONFIG_YELLOWFIN is not set
# CONFIG_R8169 is not set
# CONFIG_SIS190 is not set
# CONFIG_SKGE is not set
# CONFIG_SKY2 is not set
# CONFIG_SK98LIN is not set
# CONFIG_VIA_VELOCITY is not set
# CONFIG_TIGON3 is not set
# CONFIG_BNX2 is not set

#
# Ethernet (10000 Mbit)
#
# CONFIG_CHELSIO_T1 is not set
# CONFIG_IXGB is not set
# CONFIG_S2IO is not set

#
# Token Ring devices
#
# CONFIG_TR is not set

#
# Wireless LAN (non-hamradio)
#
# CONFIG_NET_RADIO is not set

#
# Wan interfaces
#
# CONFIG_WAN is not set
# CONFIG_FDDI is not set
# CONFIG_HIPPI is not set
# CONFIG_PLIP is not set
# CONFIG_PPP is not set
# CONFIG_SLIP is not set
# CONFIG_NET_FC is not set
# CONFIG_SHAPER is not set
# CONFIG_NETCONSOLE is not set
# CONFIG_KGDBOE is not set
# CONFIG_NETPOLL is not set
# CONFIG_NETPOLL_RX is not set
# CONFIG_NETPOLL_TRAP is not set
# CONFIG_NET_POLL_CONTROLLER is not set

#
# ISDN subsystem
#
# CONFIG_ISDN is not set

#
# Telephony Support
#
# CONFIG_PHONE is not set

#
# Input device support
#
CONFIG_INPUT=y

#
# Userland interfaces
#
CONFIG_INPUT_MOUSEDEV=y
CONFIG_INPUT_MOUSEDEV_PSAUX=y
CONFIG_INPUT_MOUSEDEV_SCREEN_X=1024
CONFIG_INPUT_MOUSEDEV_SCREEN_Y=768
# CONFIG_INPUT_JOYDEV is not set
# CONFIG_INPUT_TSDEV is not set
CONFIG_INPUT_EVDEV=y
# CONFIG_INPUT_EVBUG is not set

#
# Input Device Drivers
#
CONFIG_INPUT_KEYBOARD=y
CONFIG_KEYBOARD_ATKBD=y
# CONFIG_KEYBOARD_SUNKBD is not set
# CONFIG_KEYBOARD_LKKBD is not set
# CONFIG_KEYBOARD_XTKBD is not set
# CONFIG_KEYBOARD_NEWTON is not set
CONFIG_INPUT_MOUSE=y
CONFIG_MOUSE_PS2=y
# CONFIG_MOUSE_SERIAL is not set
# CONFIG_MOUSE_INPORT is not set
# CONFIG_MOUSE_LOGIBM is not set
# CONFIG_MOUSE_PC110PAD is not set
# CONFIG_MOUSE_VSXXXAA is not set
# CONFIG_INPUT_JOYSTICK is not set
# CONFIG_INPUT_TOUCHSCREEN is not set
# CONFIG_INPUT_MISC is not set

#
# Hardware I/O ports
#
CONFIG_SERIO=y
CONFIG_SERIO_I8042=y
# CONFIG_SERIO_SERPORT is not set
# CONFIG_SERIO_CT82C710 is not set
# CONFIG_SERIO_PARKBD is not set
# CONFIG_SERIO_PCIPS2 is not set
CONFIG_SERIO_LIBPS2=y
# CONFIG_SERIO_RAW is not set
# CONFIG_GAMEPORT is not set

#
# Character devices
#
CONFIG_VT=y
CONFIG_VT_CONSOLE=y
CONFIG_HW_CONSOLE=y
# CONFIG_SERIAL_NONSTANDARD is not set

#
# Serial drivers
#
CONFIG_SERIAL_8250=y
CONFIG_SERIAL_8250_CONSOLE=y
# CONFIG_SERIAL_8250_ACPI is not set
CONFIG_SERIAL_8250_NR_UARTS=4
# CONFIG_SERIAL_8250_EXTENDED is not set

#
# Non-8250 serial port support
#
CONFIG_SERIAL_CORE=y
CONFIG_SERIAL_CORE_CONSOLE=y
# CONFIG_SERIAL_JSM is not set
CONFIG_UNIX98_PTYS=y
# CONFIG_LEGACY_PTYS is not set
CONFIG_PRINTER=m
CONFIG_LP_CONSOLE=y
# CONFIG_PPDEV is not set
# CONFIG_TIPAR is not set

#
# IPMI
#
# CONFIG_IPMI_HANDLER is not set

#
# Watchdog Cards
#
# CONFIG_WATCHDOG is not set
# CONFIG_HW_RANDOM is not set
CONFIG_NVRAM=m
CONFIG_RTC=y
# CONFIG_DTLK is not set
# CONFIG_R3964 is not set
# CONFIG_APPLICOM is not set
# CONFIG_SONYPI is not set

#
# Ftape, the floppy tape device driver
#
# CONFIG_AGP is not set
# CONFIG_DRM is not set
# CONFIG_RAW_DRIVER is not set
# CONFIG_HPET is not set
# CONFIG_HANGCHECK_TIMER is not set

#
# TPM devices
#
# CONFIG_TCG_TPM is not set

#
# I2C support
#
# CONFIG_I2C is not set

#
# Dallas's 1-wire bus
#
# CONFIG_W1 is not set

#
# Hardware Monitoring support
#
CONFIG_HWMON=y
# CONFIG_HWMON_VID is not set
# CONFIG_SENSORS_HDAPS is not set
# CONFIG_HWMON_DEBUG_CHIP is not set

#
# Misc devices
#

#
# Multimedia Capabilities Port drivers
#

#
# Multimedia devices
#
# CONFIG_VIDEO_DEV is not set

#
# Digital Video Broadcasting Devices
#
# CONFIG_DVB is not set

#
# Graphics support
#
# CONFIG_FB is not set
CONFIG_VIDEO_SELECT=y

#
# Console display driver support
#
CONFIG_VGA_CONSOLE=y
CONFIG_MDA_CONSOLE=m
CONFIG_DUMMY_CONSOLE=y

#
# Speakup console speech
#
# CONFIG_SPEAKUP is not set

#
# Sound
#
# CONFIG_SOUND is not set

#
# USB support
#
CONFIG_USB_ARCH_HAS_HCD=y
CONFIG_USB_ARCH_HAS_OHCI=y
# CONFIG_USB is not set

#
# USB Gadget Support
#
# CONFIG_USB_GADGET is not set

#
# MMC/SD Card support
#
# CONFIG_MMC is not set

#
# InfiniBand support
#
# CONFIG_INFINIBAND is not set

#
# SN Devices
#

#
# Distributed Lock Manager
#
# CONFIG_DLM is not set

#
# File systems
#
CONFIG_EXT2_FS=y
CONFIG_EXT2_FS_XATTR=y
CONFIG_EXT2_FS_POSIX_ACL=y
CONFIG_EXT2_FS_SECURITY=y
# CONFIG_EXT2_FS_XIP is not set
CONFIG_EXT3_FS=y
CONFIG_EXT3_FS_XATTR=y
CONFIG_EXT3_FS_POSIX_ACL=y
CONFIG_EXT3_FS_SECURITY=y
CONFIG_JBD=y
# CONFIG_JBD_DEBUG is not set
CONFIG_FS_MBCACHE=y
# CONFIG_REISER4_FS is not set
CONFIG_REISERFS_FS=m
# CONFIG_REISERFS_CHECK is not set
CONFIG_REISERFS_PROC_INFO=y
# CONFIG_REISERFS_FS_XATTR is not set
# CONFIG_JFS_FS is not set
CONFIG_FS_POSIX_ACL=y

#
# XFS support
#
# CONFIG_XFS_FS is not set
# CONFIG_OCFS2_FS is not set
# CONFIG_MINIX_FS is not set
# CONFIG_ROMFS_FS is not set
CONFIG_INOTIFY=y
# CONFIG_QUOTA is not set
CONFIG_DNOTIFY=y
CONFIG_AUTOFS_FS=m
CONFIG_AUTOFS4_FS=m
# CONFIG_FUSE_FS is not set

#
# CD-ROM/DVD Filesystems
#
CONFIG_ISO9660_FS=y
CONFIG_JOLIET=y
CONFIG_ZISOFS=y
CONFIG_ZISOFS_FS=y
# CONFIG_UDF_FS is not set

#
# DOS/FAT/NT Filesystems
#
CONFIG_FAT_FS=m
CONFIG_MSDOS_FS=m
CONFIG_VFAT_FS=m
CONFIG_FAT_DEFAULT_CODEPAGE=437
CONFIG_FAT_DEFAULT_IOCHARSET="iso8859-1"
# CONFIG_NTFS_FS is not set

#
# Pseudo filesystems
#
CONFIG_PROC_FS=y
CONFIG_PROC_KCORE=y
CONFIG_SYSFS=y
CONFIG_TMPFS=y
# CONFIG_HUGETLBFS is not set
# CONFIG_HUGETLB_PAGE is not set
CONFIG_RAMFS=y
# CONFIG_CONFIGFS_FS is not set
CONFIG_RELAYFS_FS=y

#
# Miscellaneous filesystems
#
# CONFIG_ADFS_FS is not set
# CONFIG_AFFS_FS is not set
# CONFIG_ASFS_FS is not set
# CONFIG_HFS_FS is not set
# CONFIG_HFSPLUS_FS is not set
# CONFIG_BEFS_FS is not set
# CONFIG_BFS_FS is not set
# CONFIG_EFS_FS is not set
# CONFIG_CRAMFS is not set
# CONFIG_VXFS_FS is not set
# CONFIG_HPFS_FS is not set
# CONFIG_QNX4FS_FS is not set
# CONFIG_SYSV_FS is not set
# CONFIG_UFS_FS is not set

#
# Network File Systems
#
CONFIG_NFS_FS=m
CONFIG_NFS_V3=y
# CONFIG_NFS_V3_ACL is not set
CONFIG_NFS_V4=y
CONFIG_NFS_DIRECTIO=y
CONFIG_NFSD=m
CONFIG_NFSD_V3=y
# CONFIG_NFSD_V3_ACL is not set
# CONFIG_NFSD_V4 is not set
# CONFIG_NFSD_TCP is not set
CONFIG_LOCKD=m
CONFIG_LOCKD_V4=y
CONFIG_EXPORTFS=m
CONFIG_NFS_COMMON=y
CONFIG_SUNRPC=m
CONFIG_SUNRPC_GSS=m
CONFIG_RPCSEC_GSS_KRB5=m
# CONFIG_RPCSEC_GSS_SPKM3 is not set
CONFIG_SMB_FS=m
# CONFIG_SMB_NLS_DEFAULT is not set
# CONFIG_CIFS is not set
# CONFIG_NCP_FS is not set
# CONFIG_CODA_FS is not set
# CONFIG_AFS_FS is not set
# CONFIG_9P_FS is not set

#
# Partition Types
#
CONFIG_PARTITION_ADVANCED=y
# CONFIG_ACORN_PARTITION is not set
# CONFIG_OSF_PARTITION is not set
# CONFIG_AMIGA_PARTITION is not set
# CONFIG_ATARI_PARTITION is not set
# CONFIG_MAC_PARTITION is not set
CONFIG_MSDOS_PARTITION=y
# CONFIG_BSD_DISKLABEL is not set
# CONFIG_MINIX_SUBPARTITION is not set
# CONFIG_SOLARIS_X86_PARTITION is not set
# CONFIG_UNIXWARE_DISKLABEL is not set
# CONFIG_LDM_PARTITION is not set
# CONFIG_SGI_PARTITION is not set
# CONFIG_ULTRIX_PARTITION is not set
# CONFIG_SUN_PARTITION is not set
# CONFIG_EFI_PARTITION is not set

#
# Native Language Support
#
CONFIG_NLS=y
CONFIG_NLS_DEFAULT="utf8"
CONFIG_NLS_CODEPAGE_437=m
# CONFIG_NLS_CODEPAGE_737 is not set
# CONFIG_NLS_CODEPAGE_775 is not set
# CONFIG_NLS_CODEPAGE_850 is not set
# CONFIG_NLS_CODEPAGE_852 is not set
# CONFIG_NLS_CODEPAGE_855 is not set
# CONFIG_NLS_CODEPAGE_857 is not set
# CONFIG_NLS_CODEPAGE_860 is not set
# CONFIG_NLS_CODEPAGE_861 is not set
# CONFIG_NLS_CODEPAGE_862 is not set
# CONFIG_NLS_CODEPAGE_863 is not set
# CONFIG_NLS_CODEPAGE_864 is not set
# CONFIG_NLS_CODEPAGE_865 is not set
# CONFIG_NLS_CODEPAGE_866 is not set
# CONFIG_NLS_CODEPAGE_869 is not set
# CONFIG_NLS_CODEPAGE_936 is not set
# CONFIG_NLS_CODEPAGE_950 is not set
# CONFIG_NLS_CODEPAGE_932 is not set
# CONFIG_NLS_CODEPAGE_949 is not set
# CONFIG_NLS_CODEPAGE_874 is not set
# CONFIG_NLS_ISO8859_8 is not set
# CONFIG_NLS_CODEPAGE_1250 is not set
# CONFIG_NLS_CODEPAGE_1251 is not set
# CONFIG_NLS_ASCII is not set
# CONFIG_NLS_ISO8859_1 is not set
# CONFIG_NLS_ISO8859_2 is not set
# CONFIG_NLS_ISO8859_3 is not set
# CONFIG_NLS_ISO8859_4 is not set
# CONFIG_NLS_ISO8859_5 is not set
# CONFIG_NLS_ISO8859_6 is not set
# CONFIG_NLS_ISO8859_7 is not set
# CONFIG_NLS_ISO8859_9 is not set
# CONFIG_NLS_ISO8859_13 is not set
# CONFIG_NLS_ISO8859_14 is not set
# CONFIG_NLS_ISO8859_15 is not set
# CONFIG_NLS_KOI8_R is not set
# CONFIG_NLS_KOI8_U is not set
# CONFIG_NLS_UTF8 is not set

#
# Profiling support
#
CONFIG_PROFILING=y
# CONFIG_OPROFILE is not set

#
# Kernel hacking
#
CONFIG_PRINTK_TIME=y
CONFIG_DEBUG_KERNEL=y
CONFIG_MAGIC_SYSRQ=y
CONFIG_LOG_BUF_SHIFT=17
CONFIG_DETECT_SOFTLOCKUP=y
CONFIG_SCHEDSTATS=y
# CONFIG_DEBUG_SLAB is not set
# CONFIG_DEBUG_SPINLOCK is not set
CONFIG_DEBUG_SPINLOCK_SLEEP=y
# CONFIG_DEBUG_KOBJECT is not set
CONFIG_DEBUG_HIGHMEM=y
CONFIG_DEBUG_BUGVERBOSE=y
CONFIG_DEBUG_INFO=y
# CONFIG_PAGE_OWNER is not set
# CONFIG_DEBUG_FS is not set
CONFIG_FRAME_POINTER=y
CONFIG_EARLY_PRINTK=y
CONFIG_DEBUG_STACKOVERFLOW=y
# CONFIG_KPROBES is not set
# CONFIG_DEBUG_STACK_USAGE is not set
# CONFIG_DEBUG_PAGEALLOC is not set
# CONFIG_4KSTACKS is not set
CONFIG_X86_FIND_SMP_CONFIG=y
CONFIG_X86_MPPARSE=y
# CONFIG_KGDB is not set

#
# Security options
#
# CONFIG_KEYS is not set
# CONFIG_SECURITY is not set

#
# Cryptographic options
#
CONFIG_CRYPTO=y
CONFIG_CRYPTO_HMAC=y
# CONFIG_CRYPTO_NULL is not set
# CONFIG_CRYPTO_MD4 is not set
CONFIG_CRYPTO_MD5=m
CONFIG_CRYPTO_SHA1=m
# CONFIG_CRYPTO_SHA256 is not set
# CONFIG_CRYPTO_SHA512 is not set
# CONFIG_CRYPTO_WP512 is not set
# CONFIG_CRYPTO_TGR192 is not set
CONFIG_CRYPTO_DES=m
# CONFIG_CRYPTO_BLOWFISH is not set
# CONFIG_CRYPTO_TWOFISH is not set
# CONFIG_CRYPTO_SERPENT is not set
# CONFIG_CRYPTO_AES_586 is not set
# CONFIG_CRYPTO_CAST5 is not set
# CONFIG_CRYPTO_CAST6 is not set
# CONFIG_CRYPTO_TEA is not set
# CONFIG_CRYPTO_ARC4 is not set
# CONFIG_CRYPTO_KHAZAD is not set
# CONFIG_CRYPTO_ANUBIS is not set
CONFIG_CRYPTO_DEFLATE=m
# CONFIG_CRYPTO_MICHAEL_MIC is not set
# CONFIG_CRYPTO_CRC32C is not set
# CONFIG_CRYPTO_TEST is not set

#
# Hardware crypto devices
#
# CONFIG_CRYPTO_DEV_PADLOCK is not set

#
# Library routines
#
# CONFIG_CRC_CCITT is not set
# CONFIG_CRC16 is not set
CONFIG_CRC32=y
# CONFIG_LIBCRC32C is not set
CONFIG_ZLIB_INFLATE=y
CONFIG_ZLIB_DEFLATE=m
CONFIG_GENERIC_HARDIRQS=y
CONFIG_GENERIC_IRQ_PROBE=y
CONFIG_GENERIC_PENDING_IRQ=y
CONFIG_X86_SMP=y
CONFIG_X86_HT=y
CONFIG_X86_BIOS_REBOOT=y
CONFIG_X86_TRAMPOLINE=y
CONFIG_PC=y


Attachments:
hang.output (21.47 kB)
origma.config (25.33 kB)
Download all attachments

2005-09-03 04:17:58

by Andrew Morton

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Peter Williams <[email protected]> wrote:
>
> Andrew Morton wrote:
> > Peter Williams <[email protected]> wrote:
> >
> >>... at the the point indicated by the following output:
> >>
> >>[ 8.197224] Freeing unused kernel memory: 288k freed
> >>[ 8.428217] SCSI subsystem initialized
> >>[ 8.510376] sym0: <810a> rev 0x23 at pci 0000:00:08.0 irq 11
> >>[ 8.587731] sym0: No NVRAM, ID 7, Fast-10, SE, parity checking
> >>[ 8.671531] sym0: SCSI BUS has been reset.
> >>[ 8.725530] scsi0 : sym-2.2.1
> >>[ 17.256480] 0:0:0:0: ABORT operation started.
> >>[ 22.323534] 0:0:0:0: ABORT operation timed-out.
> >>[ 22.384348] 0:0:0:0: DEVICE RESET operation started.
> >>[ 27.458702] 0:0:0:0: DEVICE RESET operation timed-out.
> >>[ 27.527544] 0:0:0:0: BUS RESET operation started.
> >>[ 32.533775] 0:0:0:0: BUS RESET operation timed-out.
> >>[ 32.599173] 0:0:0:0: HOST RESET operation started.
> >>[ 32.669659] sym0: SCSI BUS has been reset.
> >>
> >
> >
> > Is there no response from sysrq-T?
>
> Now that I've tried it there is a response. I've attached the complete
> output from the boot including the sysrq-T output in the hang.output
> attachment to this e-mail.

Thanks.

> ...
> [ 278.990398] Call Trace:
> [ 279.024761] [<c02279ff>] serio_thread+0xbf/0xf0
> [ 279.085573] [<c013b2a6>] kthread+0xa6/0xb0
> [ 279.140552] [<c01034d9>] kernel_thread_helper+0x5/0xc
> [ 279.208130] insmod D C171DCC0 0 227 1 232 70 )[ 279.309031] d7f33b04 d7f33ab8 d8836bb0 c171dcc0 00001055 0fbf64f3 00000000 d
> [ 279.408678] 0000e83b d7f33acc c01da354 d750e6ac d750e570 c130d160 0a9
> [ 279.518639] d7f32000 0a72aa15 00000002 00000246 c172de50 c172de50 d7f
> [ 279.628599] Call Trace:
> [ 279.662960] [<c02d5c74>] wait_for_completion+0xa4/0x110
> [ 279.732934] [<c0245c16>] blk_execute_rq+0x66/0xb0
> [ 279.796035] [<d8836eb6>] scsi_execute+0xb6/0xd0 [scsi_mod]
> [ 279.869446] [<d8836f4d>] scsi_execute_req+0x7d/0xb0 [scsi_mod]
> [ 279.947438] [<d88393f6>] scsi_probe_lun+0xb6/0x1d0 [scsi_mod]
> [ 280.024285] [<d883995e>] scsi_probe_and_add_lun+0xde/0x1e0 [scsi_mod]
> [ 280.110295] [<d883a119>] scsi_scan_target+0xc9/0x140 [scsi_mod]
> [ 280.189431] [<d883a208>] scsi_scan_channel+0x78/0x90 [scsi_mod]
> [ 280.268569] [<d883a2e9>] scsi_scan_host_selected+0xc9/0x120 [scsi_mod]
> [ 280.355722] [<d883a362>] scsi_scan_host+0x22/0x30 [scsi_mod]
> [ 280.431425] [<d8864e45>] sym2_probe+0xf5/0x120 [sym53c8xx]
> [ 280.504835] [<c01e6ced>] pci_call_probe+0xd/0x10
> [ 280.566791] [<c01e6d39>] __pci_device_probe+0x49/0x60
> [ 280.634369] [<c01e6d79>] pci_device_probe+0x29/0x50
> [ 280.699657] [<c023e0ee>] driver_probe_device+0x3e/0xc0
> [ 280.768486] [<c023e25f>] __driver_attach+0x5f/0x70
> [ 280.832628] [<c023d7d3>] bus_for_each_dev+0x43/0x70
> [ 280.897916] [<c023e289>] driver_attach+0x19/0x20
> [ 280.959770] [<c023dc5b>] bus_add_driver+0x7b/0xd0
> [ 281.022767] [<c023e692>] driver_register+0x42/0x50
> [ 281.086910] [<c01e6fd0>] pci_register_driver+0x70/0x90
> [ 281.155635] [<d880202b>] sym2_init+0x2b/0x45 [sym53c8xx]
> [ 281.226752] [<c014340c>] sys_init_module+0xec/0x230
> [ 281.292042] [<c0105199>] syscall_call+0x7/0xb
> [ 281.350458] scsi_eh_0 D 00000000 0 232 1 227 )[ 281.451357] d7a51ea0 d7a51e64 1e62bb57 00000000 d7a50000 1e62c494 00000000 d
> [ 281.551005] 00000106 d79b0ab0 c130d160 d79b0bec d79b0ab0 c130d160 9de
> [ 281.660963] d7a50000 9de05c44 00000007 d7a50000 d7a51ef4 d7a51ef0 d7a
> [ 281.770923] Call Trace:
> [ 281.805288] [<c02d5c74>] wait_for_completion+0xa4/0x110
> [ 281.875159] [<d8863490>] sym_eh_handler+0x240/0x290 [sym53c8xx]
> [ 281.954293] [<d88635fd>] sym53c8xx_eh_host_reset_handler+0x2d/0x50 [sym53c8][ 282.050611] [<d8835e9b>] scsi_try_host_reset+0x2b/0xa0 [scsi_mod]
> [ 282.132041] [<d883602c>] scsi_eh_host_reset+0x1c/0xa0 [scsi_mod]
> [ 282.212324] [<d88363f7>] scsi_eh_ready_devs+0x57/0x70 [scsi_mod]
> [ 282.292604] [<d883654f>] scsi_unjam_host+0x9f/0xc0 [scsi_mod]
> [ 282.369451] [<d8836629>] scsi_error_handler+0xb9/0xe0 [scsi_mod]
> [ 282.449734] [<c01034d9>] kernel_thread_helper+0x5/0xc
>

scsi went ga-ga during insertion of the sym2 driver. Usual culprits cc'ed ;)

2005-09-03 04:23:01

by Brown, Len

[permalink] [raw]
Subject: RE: 2.6.13-mm1: hangs during boot ...

> > [ 279.662960] [<c02d5c74>] wait_for_completion+0xa4/0x110

possibly a missing interrupt?

> CONFIG_ACPI=y

any difference if booted with "acpi=off" or "acpi=noirq"?

2005-09-03 04:58:53

by Peter Williams

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

[ 8.345086] SCSI subsystem initialized
[ 8.427503] sym0: <810a> rev 0x23 at pci 0000:00:08.0 irq 16
[ 8.504636] sym0: No NVRAM, ID 7, Fast-10, SE, parity checking
[ 8.588216] sym0: SCSI BUS has been reset.
[ 8.642194] scsi0 : sym-2.2.1
[ 12.368622] Vendor: PIONEER Model: DVD-ROM DVD-303R Rev: 2.00
[ 12.450118] Type: CD-ROM ANSI SCSI revision:2[ 12.546506] target0:0:2: Beginning Domain Validation
[ 12.613354] target0:0:2: asynchronous.
[ 12.667699] target0:0:2: Domain Validation skipping write tests
[ 12.747629] target0:0:2: FAST-10 SCSI 10.0 MB/s ST (100 ns, offset 8)
[ 12.837395] target0:0:2: Ending Domain Validation
[ 13.256875] Vendor: SONY Model: CD-RW CRX140S Rev: 1.0e
[ 13.338323] Type: CD-ROM ANSI SCSI revision:4[ 13.434891] target0:0:4: Beginning Domain Validation
[ 13.503101] target0:0:4: asynchronous.
[ 13.602931] target0:0:4: Domain Validation skipping write tests
[ 13.683605] target0:0:4: FAST-10 SCSI 10.0 MB/s ST (100 ns, offset 8)
[ 13.777934] target0:0:4: Ending Domain Validation
[ 14.884703] Device not ready.
[ 15.763312] kjournald starting. Commit interval 5 seconds
[ 15.835612] EXT3-fs: mounted filesystem with ordered data mode.


Fedora Core release 4 (Stentz)
Kernel 2.6.13-mm1 on an i686

origma.pw.nest login:

[ 101.886572] DEBUG: Failed to load PF_NETLINK protocol 9
[ 101.963572] DEBUG: Failed to load PF_NETLINK protocol 9


Attachments:
acpi=off.output (1.48 kB)

2005-09-03 05:26:57

by Andrew Morton

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Peter Williams <[email protected]> wrote:
>
> Brown, Len wrote:
> >>>[ 279.662960] [<c02d5c74>] wait_for_completion+0xa4/0x110
> >
> >
> > possibly a missing interrupt?
> >
> >
> >>CONFIG_ACPI=y
> >
> >
> > any difference if booted with "acpi=off" or "acpi=noirq"?
>
> Yes. In both cases, the system appears to boot normally

OK, we can pass this ball over to the ACPI team.

> but I'm unable
> to login or connect via ssh. Also there's a "device not ready" message
> after the scsi initialization which I don't normally see. I've attached
> the scsi initialization output. The PF_NETLINK error messages after the
> login prompt in this output are created whenever I try to log in or
> connect via ssh.

Linus hit that too - it's an interaction between PAM and a modified netlink
error code.

Dave, where are we up to with the fix for that?

2005-09-03 05:37:22

by Brown, Len

[permalink] [raw]
Subject: RE: 2.6.13-mm1: hangs during boot ...


>Brown, Len wrote:
>>>>[ 279.662960] [<c02d5c74>] wait_for_completion+0xa4/0x110
>>
>>
>> possibly a missing interrupt?
>>
>>
>>>CONFIG_ACPI=y
>>
>>
>> any difference if booted with "acpi=off" or "acpi=noirq"?
>
>Yes. In both cases, the system appears to boot normally but
>I'm unable
>to login or connect via ssh. Also there's a "device not
>ready" message
>after the scsi initialization which I don't normally see.
>I've attached
>the scsi initialization output. The PF_NETLINK error messages
>after the
>login prompt in this output are created whenever I try to log in or
>connect via ssh.

Please confirm that vanilla 2.6.13 has none of these symptoms.
Please apply just the ACPI part of the 2.6.13-mm1 patch to see if
these issues are caused by that or if they are caused by something
else in the mm patch.

http://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.13/2.6.13-mm1/broken-out/git-acpi.patch

thanks,
-Len

2005-09-03 06:18:41

by Peter Williams

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Brown, Len wrote:
>
>
>>Brown, Len wrote:
>>
>>>>>[ 279.662960] [<c02d5c74>] wait_for_completion+0xa4/0x110
>>>
>>>
>>>possibly a missing interrupt?
>>>
>>>
>>>
>>>>CONFIG_ACPI=y
>>>
>>>
>>>any difference if booted with "acpi=off" or "acpi=noirq"?
>>
>>Yes. In both cases, the system appears to boot normally but
>>I'm unable
>>to login or connect via ssh. Also there's a "device not
>>ready" message
>>after the scsi initialization which I don't normally see.
>>I've attached
>>the scsi initialization output. The PF_NETLINK error messages
>>after the
>>login prompt in this output are created whenever I try to log in or
>>connect via ssh.
>
>
> Please confirm that vanilla 2.6.13 has none of these symptoms.

That's correct. 2.6.13 exhibits none of these symptoms.

> Please apply just the ACPI part of the 2.6.13-mm1 patch to see if
> these issues are caused by that or if they are caused by something
> else in the mm patch.
>
> http://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.13/2.6.13-mm1/broken-out/git-acpi.patch

OK. I'll get back to you shortly.

Peter
--
Peter Williams [email protected]

"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce

2005-09-03 07:18:22

by Peter Williams

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Peter Williams wrote:
> Brown, Len wrote:
>
>>
>>
>>> Brown, Len wrote:
>>>
>>>>>> [ 279.662960] [<c02d5c74>] wait_for_completion+0xa4/0x110
>>>>
>>>>
>>>>
>>>> possibly a missing interrupt?
>>>>
>>>>
>>>>
>>>>> CONFIG_ACPI=y
>>>>
>>>>
>>>>
>>>> any difference if booted with "acpi=off" or "acpi=noirq"?
>>>
>>>
>>> Yes. In both cases, the system appears to boot normally but I'm
>>> unable to login or connect via ssh. Also there's a "device not
>>> ready" message after the scsi initialization which I don't normally
>>> see. I've attached the scsi initialization output. The PF_NETLINK
>>> error messages after the login prompt in this output are created
>>> whenever I try to log in or connect via ssh.
>>
>>
>>
>> Please confirm that vanilla 2.6.13 has none of these symptoms.
>
>
> That's correct. 2.6.13 exhibits none of these symptoms.
>
>> Please apply just the ACPI part of the 2.6.13-mm1 patch to see if
>> these issues are caused by that or if they are caused by something
>> else in the mm patch.
>>
>> http://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.13/2.6.13-mm1/broken-out/git-acpi.patch
>>
>
>
> OK. I'll get back to you shortly.

I am able to confirm that the problem occurs with vanilla 2.5.13 after I
apply the above patch.

Peter
--
Peter Williams [email protected]

"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce

2005-09-03 08:16:58

by Brown, Len

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

On Sat, 2005-09-03 at 03:18 -0400, Peter Williams wrote:
>
> http://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.13/2.6.13-mm1/broken-out/git-acpi.patch
> >>

> I am able to confirm that the problem occurs with vanilla 2.5.13 after
> I apply the above patch.

Thanks.

Please then try the latest ACPI patch here:
http://ftp.kernel.org/pub/linux/kernel/people/lenb/acpi/patches/release/2.6.13/acpi-20050902-2.6.13.diff.gz
It should apply to vanilla 2.6.13 with a reject in ia64/Kconfig
that you can ignore.

If this works, then we munged git-acpi.patch in 2.6.13-mm1 somehow.

If this fails, then please confirm it still fails with pnpacpi=off

if it still fails, then please open a bugzilla here:
http://bugzilla.kernel.org/enter_bug.cgi?product=ACPI
component=config-interrupts

build the failing kernel with CONFIG_ACPI_DEBUG=y
boot it with "acpi=noirq" and attach the output from
dmesg -s64000
lspci -vv
cat /proc/interrupts
acpidump, available in the latest pmtools here:
http://ftp.kernel.org/pub/linux/kernel/people/lenb/acpi/utils/

also include the dmesg -s64000 from the successful
acpi-enabled 2.6.13 boot, along with its /proc/interrupts.

If you have a serial console and can then capture the
failing console log with "debug", that would be ideal.

Where we got from there will depend what we see...

thanks,
-Len


2005-09-03 09:28:17

by Peter Williams

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Len Brown wrote:
> On Sat, 2005-09-03 at 03:18 -0400, Peter Williams wrote:
>
>>http://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.13/2.6.13-mm1/broken-out/git-acpi.patch
>>
>
>>I am able to confirm that the problem occurs with vanilla 2.5.13 after
>>I apply the above patch.
>
>
> Thanks.
>
> Please then try the latest ACPI patch here:
> http://ftp.kernel.org/pub/linux/kernel/people/lenb/acpi/patches/release/2.6.13/acpi-20050902-2.6.13.diff.gz
> It should apply to vanilla 2.6.13 with a reject in ia64/Kconfig
> that you can ignore.
>
> If this works, then we munged git-acpi.patch in 2.6.13-mm1 somehow.

There were no problems with this patch applied. So it looks like the
munge theory is correct.

>
> If this fails, then please confirm it still fails with pnpacpi=off
>
> if it still fails, then please open a bugzilla here:
> http://bugzilla.kernel.org/enter_bug.cgi?product=ACPI
> component=config-interrupts
>
> build the failing kernel with CONFIG_ACPI_DEBUG=y
> boot it with "acpi=noirq" and attach the output from
> dmesg -s64000
> lspci -vv
> cat /proc/interrupts
> acpidump, available in the latest pmtools here:
> http://ftp.kernel.org/pub/linux/kernel/people/lenb/acpi/utils/
>
> also include the dmesg -s64000 from the successful
> acpi-enabled 2.6.13 boot, along with its /proc/interrupts.
>
> If you have a serial console and can then capture the
> failing console log with "debug", that would be ideal.
>
> Where we got from there will depend what we see...
>
> thanks,
> -Len
>

Peter
--
Peter Williams [email protected]

"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce

2005-09-03 09:50:29

by Andrew Morton

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Peter Williams <[email protected]> wrote:
>
> > Please then try the latest ACPI patch here:
> > http://ftp.kernel.org/pub/linux/kernel/people/lenb/acpi/patches/release/2.6.13/acpi-20050902-2.6.13.diff.gz
> > It should apply to vanilla 2.6.13 with a reject in ia64/Kconfig
> > that you can ignore.
> >
> > If this works, then we munged git-acpi.patch in 2.6.13-mm1 somehow.
>
> There were no problems with this patch applied. So it looks like the
> munge theory is correct.

That diff is significantly different from the diff I plucked from
master.kernel.org:/pub/scm/linux/kernel/git/lenb/linux-acpi-2.6.git#test
for 2.6.13-mm1.

Doing (patch -R | grep FAILED) on 2.6.13-mm1 says:

Hunk #3 FAILED at 356.
1 out of 3 hunks FAILED -- saving rejects to file arch/ia64/Kconfig.rej
Hunk #6 FAILED at 190.
Hunk #8 FAILED at 221.
Hunk #10 FAILED at 254.
Hunk #11 FAILED at 357.
Hunk #15 FAILED at 474.
Hunk #17 FAILED at 569.
6 out of 17 hunks FAILED -- saving rejects to file drivers/acpi/dispatcher/dsmethod.c.rej
Hunk #19 FAILED at 468.
Hunk #29 FAILED at 701.
2 out of 38 hunks FAILED -- saving rejects to file drivers/acpi/dispatcher/dswload.c.rej
Hunk #14 FAILED at 321.
Hunk #43 FAILED at 1159.
2 out of 44 hunks FAILED -- saving rejects to file drivers/acpi/osl.c.rej
Hunk #17 FAILED at 1134.
1 out of 18 hunks FAILED -- saving rejects to file drivers/acpi/parser/psparse.c.rej
Hunk #3 FAILED at 74.
1 out of 3 hunks FAILED -- saving rejects to file drivers/acpi/parser/psxface.c.rej
Hunk #1 FAILED at 35.
1 out of 15 hunks FAILED -- saving rejects to file drivers/acpi/pci_bind.c.rej
Hunk #5 FAILED at 220.
Hunk #15 FAILED at 412.
Hunk #16 FAILED at 425.
Hunk #17 FAILED at 446.
Hunk #19 FAILED at 484.
5 out of 36 hunks FAILED -- saving rejects to file drivers/acpi/processor_core.c.rej
Hunk #1 FAILED at 41.
Hunk #2 FAILED at 71.
Hunk #4 FAILED at -55.
Hunk #5 FAILED at 30.
Hunk #6 FAILED at 40.
Hunk #7 FAILED at 69.
Hunk #9 FAILED at 317.
Hunk #10 FAILED at 344.
Hunk #12 FAILED at 289.
Hunk #14 FAILED at 523.
Hunk #15 FAILED at 607.
Hunk #16 FAILED at 618.
Hunk #17 FAILED at 645.
Hunk #19 FAILED at 534.
Hunk #20 FAILED at 686.
Hunk #22 FAILED at 916.
Hunk #23 FAILED at 968.
Hunk #25 FAILED at 881.
Hunk #26 FAILED at 891.
Hunk #29 FAILED at 953.
20 out of 29 hunks FAILED -- saving rejects to file drivers/acpi/resources/rsaddr.c.rej
Hunk #11 FAILED at 289.
Hunk #16 FAILED at 407.
Hunk #17 FAILED at 425.
Hunk #18 FAILED at 434.
Hunk #20 FAILED at 470.
Hunk #21 FAILED at 527.
6 out of 21 hunks FAILED -- saving rejects to file drivers/acpi/resources/rsirq.c.rej
Hunk #27 FAILED at 553.
1 out of 61 hunks FAILED -- saving rejects to file drivers/acpi/scan.c.rej
Hunk #1 FAILED at 41.
1 out of 34 hunks FAILED -- saving rejects to file drivers/acpi/utilities/utmisc.c.rej
Hunk #5 FAILED at 291.
1 out of 76 hunks FAILED -- saving rejects to file drivers/acpi/video.c.rej
Hunk #2 FAILED at 64.
1 out of 8 hunks FAILED -- saving rejects to file include/acpi/acconfig.h.rej
Hunk #1 FAILED at 41.
1 out of 1 hunk FAILED -- saving rejects to file include/acpi/acdispat.h.rej
Hunk #29 FAILED at 1078.
Hunk #30 FAILED at 1240.
2 out of 31 hunks FAILED -- saving rejects to file include/acpi/actypes.h.rej

2005-09-03 13:24:36

by Reuben Farrelly

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Hi Peter,

On 3/09/2005 4:59 a.m., Peter Williams wrote:
> Brown, Len wrote:
>>>> [ 279.662960] [<c02d5c74>] wait_for_completion+0xa4/0x110
>>
>>
>> possibly a missing interrupt?
>>
>>
>>> CONFIG_ACPI=y
>>
>>
>> any difference if booted with "acpi=off" or "acpi=noirq"?
>
> Yes. In both cases, the system appears to boot normally but I'm unable
> to login or connect via ssh. Also there's a "device not ready" message

Are you seeing this "Device not ready" message appear over and over, or just
the once?

I am seeing it fill up my messages log as it is logging 1 or so messages each
minute. I've emailed the SCSI maintainer James Bottomley twice about it but
had no response either time.

The SCSI device I have is:

Sep 3 22:14:40 tornado kernel: Vendor: SONY Model: CD-RW CRX145S Rev: 1.0b

As for the inability to log in, this bug may be relevant, given I also had
that problem:

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=166422

There are fixes in the pipeline for util-linux audit interaction in Fedora as
well. I know because I reported those too ;)

> after the scsi initialization which I don't normally see. I've attached
> the scsi initialization output. The PF_NETLINK error messages after the
> login prompt in this output are created whenever I try to log in or
> connect via ssh.

The workaround by enabling audit support, but obviously a better fix is in the
pipeline..

I'm surprised more people aren't discovering these 'interactions' due to
having audit not turned on. Does everyone build audit into their kernels?

reuben

2005-09-03 13:51:49

by Peter Williams

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Reuben Farrelly wrote:
> Hi Peter,
>
> On 3/09/2005 4:59 a.m., Peter Williams wrote:
>
>> Brown, Len wrote:
>>
>>>>> [ 279.662960] [<c02d5c74>] wait_for_completion+0xa4/0x110
>>>
>>>
>>>
>>> possibly a missing interrupt?
>>>
>>>
>>>> CONFIG_ACPI=y
>>>
>>>
>>>
>>> any difference if booted with "acpi=off" or "acpi=noirq"?
>>
>>
>> Yes. In both cases, the system appears to boot normally but I'm
>> unable to login or connect via ssh. Also there's a "device not ready"
>> message
>
>
> Are you seeing this "Device not ready" message appear over and over, or
> just the once?

Just the once.

>
> I am seeing it fill up my messages log as it is logging 1 or so messages
> each minute. I've emailed the SCSI maintainer James Bottomley twice
> about it but had no response either time.
>
> The SCSI device I have is:
>
> Sep 3 22:14:40 tornado kernel: Vendor: SONY Model: CD-RW CRX145S
> Rev: 1.0b
>
> As for the inability to log in, this bug may be relevant, given I also
> had that problem:
>
> https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=166422
>
> There are fixes in the pipeline for util-linux audit interaction in
> Fedora as well. I know because I reported those too ;)
>
>> after the scsi initialization which I don't normally see. I've
>> attached the scsi initialization output. The PF_NETLINK error
>> messages after the login prompt in this output are created whenever I
>> try to log in or connect via ssh.
>
>
> The workaround by enabling audit support, but obviously a better fix is
> in the pipeline..
>
> I'm surprised more people aren't discovering these 'interactions' due to
> having audit not turned on. Does everyone build audit into their kernels?
>
> reuben


--
Peter Williams [email protected]

"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce

2005-09-03 15:57:22

by Brown, Len

[permalink] [raw]
Subject: RE: 2.6.13-mm1: hangs during boot ...

>> > Please then try the latest ACPI patch here:
>> >
>http://ftp.kernel.org/pub/linux/kernel/people/lenb/acpi/patches
>/release/2.6.13/acpi-20050902-2.6.13.diff.gz
>> > It should apply to vanilla 2.6.13 with a reject in ia64/Kconfig
>> > that you can ignore.
>> >
>> > If this works, then we munged git-acpi.patch in
>2.6.13-mm1 somehow.
>>
>> There were no problems with this patch applied. So it
>looks like the
>> munge theory is correct.
>
>That diff is significantly different from the diff I plucked from
>master.kernel.org:/pub/scm/linux/kernel/git/lenb/linux-acpi-2.6
>.git#test
>for 2.6.13-mm1.
>
>Doing (patch -R | grep FAILED) on 2.6.13-mm1 says:

Right.
2.6.13/acpi-20050902-2.6.13.diff.gz
is newers than 2.6.13-rc1's git-acpi.patch

2.6.13/acpi-20050815-2.6.13.diff.gz
is a closer match -- though not exact.

Peter, it might be illustrative if you have a moment
if you can also test 2.6.13/acpi-20050815-2.6.13.diff.gz
all by itself.

If it fails, then I broke -mm1
with acpi-20050815-2.6.13.diff.gz, but fixed
it by acpi-20050902-2.6.13.diff.gz.

If it succeeds, then the issue lies in the relatively small delta
between acpi-20050815-2.6.13.diff.gz 2.6.13-mm1's git-acpi.patch.

thanks,
-Len

2005-09-03 17:08:20

by James Bottomley

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

On Sat, 2005-09-03 at 23:51 +1000, Peter Williams wrote:
> > Are you seeing this "Device not ready" message appear over and over, or
> > just the once?
>
> Just the once.

OK, I finally have a theory about this. It's the everything goes via
bios code. Previously there were several levels at which commands could
exit the SCSI stack; now we make everything go via bios, so they all
come out at the top.

get_capabilities() in sr.c is sending a TEST_UNIT_READY which will get
NOT_READY back. Previously this was completing before it got to
scsi_io_completion(); now it doesn't. There must be quite a few cases
like this. The best fix is probably to use and respect REQ_QUIET for
internally generated commands.

James


2005-09-03 22:48:59

by Peter Williams

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Brown, Len wrote:
>>>>Please then try the latest ACPI patch here:
>>>
>>> >
>>
>>http://ftp.kernel.org/pub/linux/kernel/people/lenb/acpi/patches
>>/release/2.6.13/acpi-20050902-2.6.13.diff.gz
>>
>>> > It should apply to vanilla 2.6.13 with a reject in ia64/Kconfig
>>> > that you can ignore.
>>> >
>>> > If this works, then we munged git-acpi.patch in
>>
>>2.6.13-mm1 somehow.
>>
>>> There were no problems with this patch applied. So it
>>
>>looks like the
>>
>>> munge theory is correct.
>>
>>That diff is significantly different from the diff I plucked from
>>master.kernel.org:/pub/scm/linux/kernel/git/lenb/linux-acpi-2.6
>>.git#test
>>for 2.6.13-mm1.
>>
>>Doing (patch -R | grep FAILED) on 2.6.13-mm1 says:
>
>
> Right.
> 2.6.13/acpi-20050902-2.6.13.diff.gz
> is newers than 2.6.13-rc1's git-acpi.patch
>
> 2.6.13/acpi-20050815-2.6.13.diff.gz
> is a closer match -- though not exact.
>
> Peter, it might be illustrative if you have a moment
> if you can also test 2.6.13/acpi-20050815-2.6.13.diff.gz
> all by itself.
>
> If it fails,

It does.

> then I broke -mm1
> with acpi-20050815-2.6.13.diff.gz, but fixed
> it by acpi-20050902-2.6.13.diff.gz.

So you did.

>
> If it succeeds, then the issue lies in the relatively small delta
> between acpi-20050815-2.6.13.diff.gz 2.6.13-mm1's git-acpi.patch.
>
> thanks,
> -Len
>

My pleasure
Peter
--
Peter Williams [email protected]

"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce

2005-09-04 16:33:29

by James Bottomley

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

On Sun, 2005-09-04 at 01:24 +1200, Reuben Farrelly wrote:
> I am seeing it fill up my messages log as it is logging 1 or so messages each
> minute. I've emailed the SCSI maintainer James Bottomley twice about it but
> had no response either time.

OK, can you try this ... it should confirm the theory if the messages go
away.

Thanks,

James

diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c
--- a/drivers/scsi/scsi_lib.c
+++ b/drivers/scsi/scsi_lib.c
@@ -315,7 +315,7 @@ int scsi_execute(struct scsi_device *sde
req->sense = sense;
req->sense_len = 0;
req->timeout = timeout;
- req->flags |= flags | REQ_BLOCK_PC | REQ_SPECIAL;
+ req->flags |= flags | REQ_BLOCK_PC | REQ_SPECIAL | REQ_QUIET;

/*
* head injection *required* here otherwise quiesce won't work
@@ -927,17 +927,20 @@ void scsi_io_completion(struct scsi_cmnd
scsi_requeue_command(q, cmd);
return;
}
- printk(KERN_INFO "Device %s not ready.\n",
- req->rq_disk ? req->rq_disk->disk_name : "");
+ if (!(req->flags & REQ_QUIET))
+ dev_printk(KERN_INFO,
+ &cmd->device->sdev_gendev,
+ "Device not ready.\n");
cmd = scsi_end_request(cmd, 0, this_count, 1);
return;
case VOLUME_OVERFLOW:
- printk(KERN_INFO "Volume overflow <%d %d %d %d> CDB: ",
- cmd->device->host->host_no,
- (int)cmd->device->channel,
- (int)cmd->device->id, (int)cmd->device->lun);
- __scsi_print_command(cmd->data_cmnd);
- scsi_print_sense("", cmd);
+ if (!(req->flags & REQ_QUIET)) {
+ dev_printk(KERN_INFO,
+ &cmd->device->sdev_gendev,
+ "Volume overflow, CDB: ");
+ __scsi_print_command(cmd->data_cmnd);
+ scsi_print_sense("", cmd);
+ }
cmd = scsi_end_request(cmd, 0, block_bytes, 1);
return;
default:
@@ -954,15 +957,13 @@ void scsi_io_completion(struct scsi_cmnd
return;
}
if (result) {
- if (!(req->flags & REQ_SPECIAL))
- printk(KERN_INFO "SCSI error : <%d %d %d %d> return code "
- "= 0x%x\n", cmd->device->host->host_no,
- cmd->device->channel,
- cmd->device->id,
- cmd->device->lun, result);
+ if (!(req->flags & REQ_QUIET)) {
+ dev_printk(KERN_INFO, &cmd->device->sdev_gendev,
+ "SCSI error: return code = 0x%x\n", result);

- if (driver_byte(result) & DRIVER_SENSE)
- scsi_print_sense("", cmd);
+ if (driver_byte(result) & DRIVER_SENSE)
+ scsi_print_sense("", cmd);
+ }
/*
* Mark a single buffer as not uptodate. Queue the remainder.
* We sometimes get this cruft in the event that a medium error


2005-09-05 08:37:56

by Reuben Farrelly

[permalink] [raw]
Subject: Re: 2.6.13-mm1: hangs during boot ...

Hi,

On 5/09/2005 4:32 a.m., James Bottomley wrote:
> On Sun, 2005-09-04 at 01:24 +1200, Reuben Farrelly wrote:
>> I am seeing it fill up my messages log as it is logging 1 or so messages each
>> minute. I've emailed the SCSI maintainer James Bottomley twice about it but
>> had no response either time.
>
> OK, can you try this ... it should confirm the theory if the messages go
> away.
>
> Thanks,
>
> James
>
> diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c
> --- a/drivers/scsi/scsi_lib.c
> +++ b/drivers/scsi/scsi_lib.c
> @@ -315,7 +315,7 @@ int scsi_execute(struct scsi_device *sde
> req->sense = sense;
> req->sense_len = 0;
> req->timeout = timeout;
> - req->flags |= flags | REQ_BLOCK_PC | REQ_SPECIAL;
> + req->flags |= flags | REQ_BLOCK_PC | REQ_SPECIAL | REQ_QUIET;
>
> /*
> * head injection *required* here otherwise quiesce won't work
> @@ -927,17 +927,20 @@ void scsi_io_completion(struct scsi_cmnd
> scsi_requeue_command(q, cmd);
> return;
> }
> - printk(KERN_INFO "Device %s not ready.\n",
> - req->rq_disk ? req->rq_disk->disk_name : "");
> + if (!(req->flags & REQ_QUIET))
> + dev_printk(KERN_INFO,
> + &cmd->device->sdev_gendev,
> + "Device not ready.\n");
> cmd = scsi_end_request(cmd, 0, this_count, 1);
> return;
> case VOLUME_OVERFLOW:
> - printk(KERN_INFO "Volume overflow <%d %d %d %d> CDB: ",
> - cmd->device->host->host_no,
> - (int)cmd->device->channel,
> - (int)cmd->device->id, (int)cmd->device->lun);
> - __scsi_print_command(cmd->data_cmnd);
> - scsi_print_sense("", cmd);
> + if (!(req->flags & REQ_QUIET)) {
> + dev_printk(KERN_INFO,
> + &cmd->device->sdev_gendev,
> + "Volume overflow, CDB: ");
> + __scsi_print_command(cmd->data_cmnd);
> + scsi_print_sense("", cmd);
> + }
> cmd = scsi_end_request(cmd, 0, block_bytes, 1);
> return;
> default:
> @@ -954,15 +957,13 @@ void scsi_io_completion(struct scsi_cmnd
> return;
> }
> if (result) {
> - if (!(req->flags & REQ_SPECIAL))
> - printk(KERN_INFO "SCSI error : <%d %d %d %d> return code "
> - "= 0x%x\n", cmd->device->host->host_no,
> - cmd->device->channel,
> - cmd->device->id,
> - cmd->device->lun, result);
> + if (!(req->flags & REQ_QUIET)) {
> + dev_printk(KERN_INFO, &cmd->device->sdev_gendev,
> + "SCSI error: return code = 0x%x\n", result);
>
> - if (driver_byte(result) & DRIVER_SENSE)
> - scsi_print_sense("", cmd);
> + if (driver_byte(result) & DRIVER_SENSE)
> + scsi_print_sense("", cmd);
> + }
> /*
> * Mark a single buffer as not uptodate. Queue the remainder.
> * We sometimes get this cruft in the event that a medium error

This patch fixes it, and there was no message during boot about not being
ready, nor after the machine had fully booted. Great ;-)

However, I did get an oops when warm booting the kernel, I suspect this may be
the oops that I get every now and then when warm rebooting, with no real
pattern, and possibly isn't related to the patch. As my serial console wasn't
set up at the time, I took a photo instead, at
http://www.reub.net/kernel/scsi-oops.jpg

Thanks
reuben