2007-06-05 03:50:29

by Linus Torvalds

Subject: Linux 2.6.22-rc4


So -rc4 is out there now, hopefully shrinking the regression list further.

The diffstat (for those that look at those kinds of things) tells the
story: lots of small stuff to random files. I think the single biggest
file change was the patch-checking script, along with some sparc64 fixes.
But the bulk of it all is just a lot of small random things.

Shortlog appended to give kind of an overview, nothing really stands out
there. Mostly driver fixes, with some architecture updates.

I'd ask that people involved with the known regressions please test
whether they got fixed, and if you wrote a patch and it's still pending,
please make sure to push it upstream..

Linus

---

Akinobu Mita (3):
ieee80211: fix incomplete error message
softmac: alloc_ieee80211() NULL check
[POWERPC] Fix return from pte_alloc_one() in out-of-memory case

Alan Cox (2):
[WATCHDOG] clean-up watchdog documentation
[ARM] enable arbitary speed tty ioctls and split input/output speed

Alan Stern (2):
[SCSI] sd: fix refcounting regression in suspend/resume routines
USB: replace flush_workqueue with cancel_sync_work

Alex Tomas (1):
When ext4_ext_insert_extent() fails to insert new blocks

Alexey Dobriyan (1):
Fix vmi.c compilation

Amit Arora (1):
ext4: Extent overlap bugfix

Andrea Paterniani (1):
SPI: Freescale iMX SPI controller driver fixes

Andrew Morton (6):
sh: support older gcc's
[SPARC32]: Build fix.
ntfs_init_locked_inode(): fix array indexing
afs: needs sched.h
mtrr atomicity fix
vanishing ioctl handler debugging

Andrew Victor (2):
[WATCHDOG] ks8695_wdt.c - new KS8695 watchdog driver
[ARM] 4411/1: KS8695: Another serial driver fix

Andy Gospodarek (1):
PCI: disable MSI by default on systems with Serverworks HT1000 chips

Andy Whitcroft (1):
add a trivial patch style checker

Antonino A. Daplas (1):
neofb: Fix pseudo_palette array overrun in neofb_setcolreg

Arnaldo Carvalho de Melo (1):
[SOCK]: Shrink struct sock by 8 bytes on 64-bit.

Artem Bityutskiy (1):
[JFFS2] Fix buffer length calculations in jffs2_get_inode_nodes()

Auke Kok (1):
e1000: disable polling before registering netdevice

Avuton Olrich (1):
cfag12864bfb: Use sys_ instead of cfb_ framebuffer accessors

Baruch Even (2):
[ALSA] hda-codec - Add quirk for MSI S420
[BRIDGE]: Reduce frequency of forwarding cleanup timer in bridge.

Ben Dooks (6):
[ARM] 4401/1: S3C2443: Add definitions for port GPIOJ
[ARM] 4402/1: S3C2443: Add physical address of HSMMC controller
[ARM] 4412/1: S3C2412: reset errata fix
[ARM] 4414/1: S3C2443: sparse fix for clock.c
[ARM] 4415/1: AML5900: fix sparse warnings from map_io
[ARM] 4416/1: NWFPE: fix undeclared symbols

Ben Gardner (1):
PCI: Fix pci_find_present

Benjamin Herrenschmidt (2):
[POWERPC] Fix ppc32 single-stepping out of syscalls
[POWERPC] Fix possible access to free pages

Bill Nottingham (2):
[SCSI] qla2xxx: fix timeout in qla2x00_down_timeout
[NET]: Fix comparisons of unsigned < 0.

Björn Steinbrink (2):
prism54: fix monitor mode oops
timer statistics: fix race

Bob Moore (1):
ACPICA: Support for external package objects as method arguments

Brandon Craig Rhodes (1):
hostap: Allocate enough tailroom for TKIP

Brian King (1):
[SCSI] ipr: Proper return codes for eh_dev_reset for SATA devices

Brice Goglin (1):
myri10ge: report link up/down in standard ethtool way

Bryan Wu (1):
a bug in ramfs_nommu_resize function, passing old size to vmtruncate

Catalin Marinas (1):
[ARM] 4394/1: ARMv7: Add the TLB range operations

Chris Wright (1):
x86: fix oprofile double free

Christian Rothlaender (1):
[ALSA] hda-codec - Add support for ASUS A8J modem

Christoph Hellwig (6):
[SCSI] NCR53C9x: correct spelling mistake in deprecation notice
[SCSI] pluto: Use wait_for_completion_timeout.
[SPARC64]: Kill unused DIE_PAGE_FAULT enum value.
[S390] dasd_eer: use mutex instead of semaphore
[S390] arch/s390/kernel/debug.c: use mutex instead of semaphore
[S390] raw3270: use mutex instead of semaphore

Christoph Lameter (3):
SLUB: More documentation
SLUB: Fix NUMA / SYSFS bootstrap issue
SLUB: fix locking for hotplug callbacks

Cornelia Huck (1):
[S390] cio: Use device_schedule_callback() for removing disconnected devices.

Dan Williams (1):
msi: fix ARM compile

Daniel Drake (1):
[ALSA] hda-intel: fix ASUS M2V detection

Daniel T Chen (1):
[ALSA] hda-codec - Add quirk for Supermicro PDSBA to alc883_cfg_tbl[]

Darrick J. Wong (1):
[SCSI] aic94xx: asd_clear_nexus should fail if the cleared task does not complete

Dave Airlie (1):
drm/radeon: add more IGP chipset pci ids

Dave Jones (1):
[SCSI] fusion: Fix |/|| confusion

Dave Kleikamp (1):
EXT4: Fix whitespace

David Brownell (1):
SPI dynamic busid generation bugfix

David Chinner (1):
[XFS] Write at EOF may not update filesize correctly.

David Gibson (2):
[POWERPC] Fix compiler/assembler flags for Ebony platform boot files
[POWERPC] Don't use HOSTCFLAGS in BOOTCFLAGS

David Howells (1):
[MTD] generalise the handling of MTD-specific superblocks

David Rientjes (1):
[ARM] use __used attribute

David S. Miller (25):
[SPARC64]: Don't be picky about virtual-dma values on sun4v.
[SPARC64]: PCI device scan is way too verbose by default.
[SCSI] ESP: Kill SCSI_ESP_CORE and link directly just like jazz_esp
[SPARC64]: Fix typo in sun4v_hvapi_register error handling.
[SPARC64]: Report proper system soft state to the hypervisor.
[SPARC64]: Negotiate hypervisor API for PCI services.
[SPARC64]: Use machine description and OBP properly for cpu probing.
[SPARC64]: Eliminate NR_CPUS limitations.
[SPARC64]: Fix _PAGE_EXEC_4U check in sun4u I-TLB miss handler.
[SPARC64]: Fix two bugs wrt. kernel 4MB TSB.
[SPARC64]: Fill holes in hypervisor APIs and fix KTSB registry.
[CASSINI]: Fix printk message typo.
[XFRM]: Allow XFRM_ACQ_EXPIRES to be tunable via sysctl.
[XFRM]: xfrm_larval_drop sysctl should be __read_mostly.
[IPV4]: Kill references to bogus non-existent CONFIG_IP_NOSIOCRT
[AF_PACKET]: Kill bogus CONFIG_PACKET_MULTICAST
[IPV6]: Fix build warning.
[AF_PACKET]: Kill CONFIG_PACKET_SOCKET.
[IPSEC]: Add xfrm_sysctl.txt.
[SPARC64]: Add missing NCS and SVC hypervisor interfaces.
[AF_UNIX]: Make socket locking much less confusing.
[AF_UNIX]: Fix datagram connect race causing an OOPS.
[VIDEO]: XVR500 and XVR2500 require FB=y
[ATA]: Don't allow to enable this for SPARC64 without PCI.
[SCSI] JAZZ ESP and SUN ESP need SPI_ATTRS

David Woodhouse (4):
[JFFS2] Fix BUG() caused by failing to discard xattrs on deleted files.
[JFFS2] Fix potential memory leak of dead xattrs on unmount.
[MTD] Fix error checking after get_mtd_device() in get_sb_mtd functions
[JFFS2] Fix obsoletion of metadata nodes in jffs2_add_tn_to_tree()

Dmitry Torokhov (3):
Input: db9 - do not ignore dev2 module parameter
Input: ucb1x00 - do not access input_dev->private directly
Input: reduce raciness when input handlers disconnect

Domen Puncer (1):
[POWERPC] 52xx: unbreak lite5200 dts (_pic vs. -pic)

Ed Lin (4):
[SCSI] stex: fix id mapping issue
[SCSI] stex: extend hard reset wait time
[SCSI] stex: fix reset recovery for console device
[SCSI] stex: minor cleanup and version update

Enrico Scholz (1):
[ARM] 4403/1: Make the PXA-I2C driver work with lockdep validator

Eric W. Biederman (2):
msi: fix the ordering of msix irqs
msi: mask the msix vector before we unmap it

Evgeniy Polyakov (1):
sh: trivial build cleanups.

FUJITA Tomonori (1):
[SCSI] tgt: fix a rdma indirect transfer error bug

Geert Uytterhoeven (1):
fbdev: Move declaration of fb_class to <linux/fb.h>

Haavard Skinnemoen (1):
atmel_spi dma address bugfix

Heiko Carstens (1):
[S390] Fix section annotations.

Henrique de Moraes Holschuh (1):
ACPI: thinkpad-acpi: do not use named sysfs groups

Herbert Xu (3):
e1000: restore netif_poll_enable call but make sure IRQs are off
[NET] napi: Call __netif_rx_complete in netif_rx_complete
[NET] gso: Fix GSO feature mask in sk_setup_caps

Hong Liu (1):
mac80211: fix memory leak when defrag fragments

Horst H. von Brand (2):
[SPARC64]: arch/sparc64/time.c doesn't compile on Ultra 1 (no PCI)
[SPARC]: Missing #include <linux/mm.h> in drivers/sbus/char/flash.c

Hugh Dickins (1):
[SCSI] fix CONFIG_SCSI_WAIT_SCAN=m

Ilpo Järvinen (1):
[TCP]: Fix GSO ignorance of pkts_acked arg (cong.cntrl modules)

Ingo Molnar (1):
timer stats: speedups

Ivan Kokshaysky (1):
PCI: i386: fixup for Siemens Nixdorf AG FSC Multiprocessor Interrupt Controllers

James Bottomley (2):
[SCSI] aacraid: fix panic on short Inquiry
[SCSI] aic7xxx: fix aicasm build failure with gcc-3.4.6

Jan Altenberg (1):
Add select PHYLIB to the UCC_GETH Kconfig option

Jan Engelhardt (1):
[SPARC]: Linux always started with 9600 8N1

Jan Kara (1):
Fix possible UDF data corruption

Jaroslav Kysela (1):
[ALSA] version 1.0.14

Jason Gaston (1):
pci_ids: update patch for Intel ICH9M

Jay Cliburn (1):
PCI: quirk disable MSI on via vt3351

Jay Estabrook (3):
ALPHA: support graphics on non-zero PCI domains
ALPHA: correct low-level I/O routines for sable-lynx
ALPHA: misc fixes

Jean Delvare (4):
hwmon/ds1621: Fix swapped temperature limits
hwmon/w83627hf: Be quiet when no chip is found
hwmon-vid: Don't spam the logs when VRM version is missing
hwmon/applesmc: Simplify dependencies

Jeff Garzik (4):
firewire: fix return code
[libata] Add drive to NCQ blacklist
misc/tifm_7xx1: replace deprecated irq flag
[ATM]: Fix warning.

Jerome Borsboom (1):
[NET]: parse ip:port strings correctly in in4_pton

Johannes Berg (1):
[POWERPC] Don't allow PMAC_APM_EMU for 64-bit

John W. Linville (1):
mac80211: avoid null ptr deref in ieee80211_ibss_add_sta

Karsten Keil (1):
Fix broken CLIR in isdn driver

Kazunori MIYAZAWA (2):
[IPSEC]: Fix IPv6 AH calculation in outbound
[IPSEC]: Fix panic when using inter address familiy IPsec on loopback.

Kristian Høgsberg (4):
firewire: Add missing byteswapping for receive DMA programs.
firewire: Implement suspend/resume PCI driver hooks.
firewire: Change struct fw_cdev_iso_packet to not use bitfields.
firewire: Install firewire-constants.h and firewire-cdev.h for userspace.

Krzysztof Helt (4):
[SPARC32]: Removes mismatch section warnigs in sparc time.c file
[SERIAL] sunzilog: section mismatch fix
skeletonfb: fix of xxxfb_setup ifdef
pm3fb: switching between X and fb fix

Kumar Gala (5):
[PPC] Fix COMMON symbol warnings
[PPC] Remove duplicate export of __div64_32.
[POWERPC] Fix modpost warning
[PPC] Fix modpost warning
[POWERPC] Fix Kconfig warning

Kyle McMartin (2):
[SPARC]: Emulate cmpxchg like parisc
fix possible null ptr deref in kallsyms_lookup

Len Brown (4):
ACPICA: allow Load(OEMx) tables
ACPI: extend "acpi_osi=" boot option
ACPI: Make _OSI(Linux) a special case
ACPI: add __init to acpi_initialize_subsystem()

Li Yang (2):
[POWERPC] Fix Section mismatch warnings
NET: add MAINTAINERS entry for ucc_geth driver

Linus Torvalds (2):
smpboot: fix cachesize comparison in smp_tune_scheduling()
Linux 2.6.22-rc4

Luis R. Rodriguez (1):
prism54: MAINTAINERS update

Maciej W. Rozycki (2):
defxx: Fix the handling of ioremap() failures
serial_core.h: include <linux/sysrq.h>

Manuel Lauss (2):
sh: Trivial fix for dma-api compile failure.
sh: Fix vsyscall build failure.

Marc Pignat (1):
at91: fix enable/disable_irq_wake symmetry in pcmcia driver

Mariusz Kozlowski (1):
m68k: parenthesis balance

Mark Glines (1):
[TCP]: Use default 32768-61000 outgoing port range in all cases.

Martin Habets (1):
[SPARC]: Mark as emulating cmpxchg, add appropriate depends for DRM.

Martin Peschke (1):
[SCSI] zfcp: avoid clutter in erp_dbf

Matt Mackall (2):
random: fix error in entropy extraction
random: fix seeding with zero entropy

Matthew Garrett (1):
RTC: use fallback IRQ if PNP tables don't provide one

Michael Chan (1):
[TG3]: Fix link problem on Dell's onboard 5906.

Michael Ellerman (2):
[POWERPC] Update documentation for of_find_node_by_type()
[POWERPC] Compare irq numbers with NO_IRQ not IRQ_NONE

Michael Holzheu (1):
[S390] Add exception handler for diagnose 224

Michael Loehr (1):
[SCSI] zfcp: IO stall after deleting and path checker changes after reenabling zfcp devices

Michael Neuling (1):
[POWERPC] Fix compile warning in pseries xics code

Michael S. Tsirkin (2):
IB/mthca: Fix handling of send CQE with error for QPs connected to SRQ
IPoIB/cm: Fix performance regression on Mellanox

Michael Schmitz (1):
[SCSI] atari_NCR5380: update_timeout removal

Michael Wu (1):
mac80211: always set carrier status on open

Michael-Luke Jones (4):
[ARM] 4404/1: Trivial IXP42x Kconfig cleanup
[ARM] 4405/1: NSLU2, DSM-G600 frequency fixup code
[ARM] 4406/1: Trivial NSLU2 / NAS-100D header & setup code cleanup
[ARM] 4410/1: Remove extern declarations in coyote/ixdpg425-pci.c

Michel Dänzer (1):
drm: make sure the drawable code doesn't call malloc(0).

Mike Frysinger (1):
[MTD] [MAPS] don't force uclinux mtd map to be root dev

Milton Miller (1):
[POWERPC] Fix zImage.coff generation for 32-bit pmac

Mingming Cao (1):
Remove unnecessary exported symbols.

Mithlesh Thukral (1):
NetXen: Removal of extra free_irq call

Nicolas Boichat (1):
hwmon/applesmc: Handle name file creation error and deletion

Oleg Nesterov (1):
tty: fix leakage of -ERESTARTSYS to userland

Ondrej Zajicek (1):
vt8623fb: arkfb: null pointer dereference fix

Patrick McHardy (1):
[ICMP]: Fix icmp_errors_use_inbound_ifaddr sysctl

Paul Mundt (2):
sh: section mismatch fixes for system timer.
smc91x: sh solution engine fixes.

Pavel Emelianov (1):
[TCP]: Consolidate checking for tcp orphan count being too big.

Pekka Enberg (1):
Documentation: How to use GDB to decode OOPSes

Peter Oberparleiter (1):
[S390] cio: deregister ccw device when pgid disband failed

Petr Vandrovec (1):
ieee1394: raw1394: Fix async send

Rafael J. Wysocki (1):
[CRYPTO] cryptd: Fix problem with cryptd and the freezer

Richard Henderson (2):
alpha: cleanup in bitops.h
alpha: support new syscalls

Robert P. J. Day (1):
[ARM] remove unused header file: arch/arm/mach-s3c2410/bast.h

Roland Dreier (1):
IB/mlx4: Fix last allocated object tracking in bitmap allocator

Roman Zippel (2):
m68k: runtime patching infrastructure
m68k: discontinuous memory support

Rudolf Marek (1):
hwmon/coretemp: Add more safety checks

Russell King (3):
[ARM] Fix stacktrace FP range checking
[ARM] oprofile: avoid lockdep warnings on mpcore oprofile init
[ARM] Fix some section mismatch warnings

Salyzyn, Mark (3):
[SCSI] aacraid: Correct sa platform support. (Was: [Bug 8469] Bad EIP value on pentium3 SMP kernel-2.6.21.1)
[SCSI] aacraid: apply commit config for reset_devices flag
[SCSI] aacraid: fix shutdown handler to also disable interrupts.

Sam Ravnborg (6):
sparc64: fix alignment bug in linker definition script
microcode: fix section mismatch warning
isdn: fix section mismatch warnings
acpi: fix section mismatch warning in asus + toshiba
kvm: fix section mismatch warning in kvm-intel.o
net/hp100: fix section mismatch warning

Sangtae Ha (1):
[TCP] tcp_probe: a trivial fix for mismatched number of printl arguments.

Satoru Takeuchi (1):
Better documentation for ERESTARTSYS

Sean Hefty (1):
IB/cm: Fix stale connection detection

Stefan Richter (9):
ieee1394: sbp2: include workqueue.h
ieee1394: eth1394: remove bogus netif_wake_queue
ieee1394: eth1394: handle tlabel exhaustion
ieee1394: eth1394: bring back a parent device
firewire: prefix modules with firewire- instead of fw-
ieee1394: fix calculation of sysfs attribute "address"
ieee1394: sbp2: offer SAM-conforming target port ID in sysfs
firewire: fw-sbp2: implement sysfs ieee1394_id
firewire: add to MAINTAINERS

Stefan Roese (1):
[POWERPC] Fix compile breakage for IBM/AMCC 4xx arch/ppc platforms

Stephen Hemminger (12):
sky2: dont set bogus bit in PHY register
sky2: checksum offload plus vlan bug
sky2: program proper register for fiber PHY
sky2: enable IRQ on duplex renegotiation
[TCP] tcp_probe: use GCC printf attribute
[BRIDGE]: Round off STP perodic timers.
qla3xxx: device doesnt do hardware checksumming.
VLAN: kill_vid is only useful for VLAN filtering devices
sky2: Fix VLAN unregistration
8139cp: fix VLAN unregistration
atl1: eliminate unneeded kill_vid code
network drivers: eliminate unneeded kill_vid code

Stephen Rothwell (3):
[POWERPC] ps3/interrupt.c uses get_hard_smp_processor_id
[POWERPC] pasemi idle uses hard_smp_processor_id
[POWERPC] Create a zImage for legacy iSeries

Steve Longerbeam (1):
[ALSA] hda-codec - Fix pin configs for Gateway MX6453

Sumant Patro (2):
[SCSI] MegaRAID: Update MAINTAINERS email-id
[SCSI] megaraid_sas: intercept cmd timeout and throttle io

Takashi Iwai (3):
[ALSA] ali5451 - Fix possible NULL dereference
[ALSA] hda-codec - Fix input with STAC92xx
[ALSA] hda-codec - Fix STAC922x capture boost level

Takashi YOSHII (1):
sh: Fix pcrel too far for in_nmi label.

Tejun Heo (2):
pci-quirks: fix MSI disabling on RS400-200 and RS480
libata: always use polling SETXFER

Theodore Ts'o (1):
Define/reserve new ext4 superblock fields

Thomas Bogendoerfer (1):
[SCSI] jazz_esp: Converted to use esp_core.

Thomas Gleixner (3):
drm: Spinlock initializer cleanup
NOHZ: prevent multiplication overflow - stop timer for huge timeouts
[ARM] 4417/1: Serial: Fix AMBA drivers locking

Thomas Klein (1):
ehea: Fixed multi queue RX bug

Thomas Renninger (1):
ACPI: thermal: Replace pointer with name in trip_points

Tilman Schmidt (1):
ISDN4Linux: fix maturity label

Tim Gardner (1):
Work around Dell E520 BIOS reboot bug

Timur Tabi (1):
[POWERPC] QE: fix Kconfig 'select' warning with UCC_FAST

Tobin Davis (4):
[ALSA] HDA: Add support for Gateway NX860
[ALSA] HDA: Add more systems to Sigmatel codec
[ALSA] HDA: Fix headphone mute issue on non-eapd Conexant systems
[ALSA] hda-codec - Add support for MSI K9N Ultra

Tony Luck (1):
ACPI: Section mismatch ... acpi_map_pxm_to_node

Trond Myklebust (1):
NFS: Fix a refcount leakage in O_DIRECT

Ulrich Drepper (1):
fix compat futex code for private futexes

Uwe Bugla (1):
Input: logips2pp - fix typo in Kconfig

Valerie Henson (1):
Update tulip maintainer email address

Venkatesh Pallipadi (1):
[NET]: Make net watchdog timers 1 sec jiffy aligned.

Wei Dong (1):
[IPV4]: Fix "ipOutNoRoutes" counter error for TCP and UDP

YOSHIFUJI Hideaki (2):
[IPV6] ROUTE: No longer handle ::/0 specially.
[IPV6] ADDRCONF: Fix conflicts in DEVCONF_xxx constant.

Yasunori Goto (1):
memory hotplug: fix unnecessary calling of init_currenty_empty_zone()

Yoann Padioleau (2):
potential parse error in ifdef
parse errors in ifdefs

Yoshinori Sato (1):
h8300 trival patches

Zhu Yi (1):
mac80211: fail back to use associate from reassociate

Zoltan Devai (1):
[ALSA] Fix ASoC s3c24xx-pcm spinlock bug

Zou Nan hai (1):
x86_64: allocate sparsemem memmap above 4G


2007-06-05 09:29:54

by David Greaves

Subject: Re: Linux 2.6.22-rc4

Linus Torvalds wrote:
> So -rc4 is out there now, hopefully shrinking the regression list further.
>
> The diffstat (for those that look at those kinds of things) tells the
> story: lots of small stuff to random files. I think the single biggest
> file change was the patch-checking script, along with some sparc64 fixes.
> But the bulk of it all is just a lot of small random things.
>
> Shortlog appended to give kind of an overview, nothing really stands out
> there. Mostly driver fixes, with some architecture updates.
>
> I'd ask that people involved with the known regressions please test
> whether they got fixed, and if you wrote a patch and it's still pending,
> please make sure to push it upstream..
>
> Linus

Compile warnings and a new regression: hang on boot during sata_promise
detection... :(

I have to go out now, I'll get more details on my return.

make mrproper
cp ../linux-2.6.21.1/.config .
make oldconfig (accept defaults)
make

scripts/kconfig/conf -s arch/i386/Kconfig
drivers/input/keyboard/Kconfig:170:warning: 'select' used by config symbol
'KEYBOARD_ATARI' refers to undefined symbol 'ATARI_KBD_CORE'
drivers/input/mouse/Kconfig:182:warning: 'select' used by config symbol
'MOUSE_ATARI' refers to undefined symbol 'ATARI_KBD_CORE'
CHK include/linux/version.h

[And I gave my Amiga away a year ago :) ]

CC kernel/power/pm.o
kernel/power/pm.c:205: warning: `pm_register' is deprecated (declared at
kernel/power/pm.c:64)
kernel/power/pm.c:205: warning: `pm_register' is deprecated (declared at
kernel/power/pm.c:64)
kernel/power/pm.c:206: warning: `pm_send_all' is deprecated (declared at
kernel/power/pm.c:180)
kernel/power/pm.c:206: warning: `pm_send_all' is deprecated (declared at
kernel/power/pm.c:180)

CC fs/xfs/linux-2.6/xfs_lrw.o
fs/xfs/linux-2.6/xfs_lrw.c: In function `xfs_iozero':
fs/xfs/linux-2.6/xfs_lrw.c:162: warning: `memclear_highpage_flush' is deprecated
(declared at include/linux/highmem.h:115)

CC drivers/base/dd.o
drivers/base/dd.c:211: warning: `device_probe_drivers' defined but not used

CC drivers/pci/search.o
drivers/pci/search.c: In function `pci_find_slot':
drivers/pci/search.c:99: warning: `pci_find_device' is deprecated (declared at
include/linux/pci.h:477)
drivers/pci/search.c: At top level:
drivers/pci/search.c:434: warning: `pci_find_device' is deprecated (declared at
drivers/pci/search.c:241)
drivers/pci/search.c:434: warning: `pci_find_device' is deprecated (declared at
drivers/pci/search.c:241)

LD vmlinux
SYSMAP System.map
SYSMAP .tmp_System.map
MODPOST vmlinux
WARNING: arch/i386/kernel/built-in.o(.text+0x968f): Section mismatch: reference
to .init.text: (between 'mtrr_bp_init' and 'mtrr_ap_init')
WARNING: arch/i386/kernel/built-in.o(.text+0x9781): Section mismatch: reference
to .init.text: (between 'mtrr_bp_init' and 'mtrr_ap_init')
WARNING: arch/i386/kernel/built-in.o(.text+0x9786): Section mismatch: reference
to .init.text: (between 'mtrr_bp_init' and 'mtrr_ap_init')
WARNING: arch/i386/kernel/built-in.o(.text+0xa25c): Section mismatch: reference
to .init.text: (between 'get_mtrr_state' and 'mtrr_wrmsr')
WARNING: arch/i386/kernel/built-in.o(.text+0xa303): Section mismatch: reference
to .init.text: (between 'get_mtrr_state' and 'mtrr_wrmsr')
WARNING: arch/i386/kernel/built-in.o(.text+0xa31b): Section mismatch: reference
to .init.text: (between 'get_mtrr_state' and 'mtrr_wrmsr')
WARNING: arch/i386/kernel/built-in.o(.text+0xa344): Section mismatch: reference
to .init.text: (between 'get_mtrr_state' and 'mtrr_wrmsr')
WARNING: arch/i386/kernel/built-in.o(.exit.text+0x19): Section mismatch:
reference to .init.text: (between 'cache_remove_dev' and 'powernow_k6_exit')
WARNING: arch/i386/kernel/built-in.o(.data+0x2160): Section mismatch: reference
to .init.text: (between 'thermal_throttle_cpu_notifier' and 'mce_work')
WARNING: kernel/built-in.o(.text+0x14482): Section mismatch: reference to
.init.text: (between 'kthreadd' and 'init_waitqueue_head')
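
The modpost warnings above repeat the same few locations several times. A minimal sketch of how they could be grouped, assuming the output is available as plain text and was wrapped by the mail client as shown; the function name tally_section_mismatches and the regular expression are illustrative and not part of any kernel tooling:

```python
import re
from collections import Counter

# Copied from the modpost output above. The mail client wrapped each warning
# across two lines, so the text is re-joined before matching.
SAMPLE = """\
WARNING: arch/i386/kernel/built-in.o(.text+0x968f): Section mismatch: reference
to .init.text: (between 'mtrr_bp_init' and 'mtrr_ap_init')
WARNING: arch/i386/kernel/built-in.o(.text+0x9781): Section mismatch: reference
to .init.text: (between 'mtrr_bp_init' and 'mtrr_ap_init')
WARNING: arch/i386/kernel/built-in.o(.text+0x9786): Section mismatch: reference
to .init.text: (between 'mtrr_bp_init' and 'mtrr_ap_init')
WARNING: arch/i386/kernel/built-in.o(.text+0xa25c): Section mismatch: reference
to .init.text: (between 'get_mtrr_state' and 'mtrr_wrmsr')
WARNING: arch/i386/kernel/built-in.o(.text+0xa303): Section mismatch: reference
to .init.text: (between 'get_mtrr_state' and 'mtrr_wrmsr')
WARNING: arch/i386/kernel/built-in.o(.text+0xa31b): Section mismatch: reference
to .init.text: (between 'get_mtrr_state' and 'mtrr_wrmsr')
WARNING: arch/i386/kernel/built-in.o(.text+0xa344): Section mismatch: reference
to .init.text: (between 'get_mtrr_state' and 'mtrr_wrmsr')
WARNING: arch/i386/kernel/built-in.o(.exit.text+0x19): Section mismatch:
reference to .init.text: (between 'cache_remove_dev' and 'powernow_k6_exit')
WARNING: arch/i386/kernel/built-in.o(.data+0x2160): Section mismatch: reference
to .init.text: (between 'thermal_throttle_cpu_notifier' and 'mce_work')
WARNING: kernel/built-in.o(.text+0x14482): Section mismatch: reference to
.init.text: (between 'kthreadd' and 'init_waitqueue_head')
"""

# Illustrative pattern for the warning format seen in this thread only.
WARNING_RE = re.compile(
    r"WARNING: (?P<obj>\S+?)\((?P<section>[.\w]+)\+0x[0-9a-f]+\): "
    r"Section mismatch: reference to (?P<target>\S+): "
    r"\(between '(?P<lo>\w+)' and '(?P<hi>\w+)'\)"
)


def tally_section_mismatches(text):
    """Count warnings per (object file, lower symbol, upper symbol)."""
    # Collapse every run of whitespace, including the mail-wrap newlines.
    flat = " ".join(text.split())
    counts = Counter()
    for match in WARNING_RE.finditer(flat):
        key = (match.group("obj"), match.group("lo"), match.group("hi"))
        counts[key] += 1
    return counts


if __name__ == "__main__":
    for (obj, lo, hi), n in tally_section_mismatches(SAMPLE).most_common():
        print(f"{n:2d}  {obj}: between '{lo}' and '{hi}'")
```

Grouped this way, the ten warnings reduce to five distinct locations, most of them in the i386 MTRR code.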


David

2007-06-05 10:42:32

by Mikael Pettersson

Subject: Re: Linux 2.6.22-rc4

On Tue, 05 Jun 2007 10:29:43 +0100, David Greaves wrote:
> Linus Torvalds wrote:
> > [Linus' 2.6.22-rc4 announcement]
>
> Compile warnings and a new regression: hang on boot during sata_promise
> detection... :(

Please give us some details about your sata_promise problem:
- describe your hardware (Promise chip version, mainboard, chipset, etc)
- which was the last kernel version prior to 2.6.22-rc4 that worked
- the kernel messages up to the hang, if you can capture them

2007-06-05 15:52:11

by Jeff Garzik

Subject: Re: Linux 2.6.22-rc4

On Tue, Jun 05, 2007 at 12:41:44PM +0200, Mikael Pettersson wrote:
> On Tue, 05 Jun 2007 10:29:43 +0100, David Greaves wrote:
> > Linus Torvalds wrote:
> > > [Linus' 2.6.22-rc4 announcement]
> >
> > Compile warnings and a new regression: hang on boot during sata_promise
> > detection... :(
>
> Please give us some details about your sata_promise problem:
> - describe your hardware (Promise chip version, mainboard, chipset, etc)
> - which was the last kernel version prior to 2.6.22-rc4 that worked
> - the kernel messages up to the hang, if you can capture them

There is a small chance it might be that sata_promise typo fix that went
in, which will definitely change driver behavior for the chips served by
that table entry...

Jeff



2007-06-05 16:15:00

by David Greaves

Subject: Re: Linux 2.6.22-rc4 - sata_promise regression since -rc3

Linus Torvalds wrote:
> So -rc4 is out there now, hopefully shrinking the regression list further.

> I'd ask that people involved with the known regressions please test
> whether they got fixed, and if you wrote a patch and it's still pending,
> please make sure to push it upstream..

[Tejun, Jeff, added you since the bisect points to your patch.]

Sorry, mail glitch means I lost a couple of emails...

I said:
Compile warnings and a new regression: hang on boot during sata_promise
detection...

It turns out that the hang times out; it does boot after a while. It's missing 4
of my SATA disks though.
[in turn this means I can't test the hibernate regression against -rc4. But
testing that regression against a862b5c8cd5d847779a049a5fc8cf5b1e6f5fa07 shows
it is still there. Do I get a bonus for finding 2 regressions?]

I also bisected and got:
Bisecting: 0 revisions left to test after this
[464cf177df7727efcc5506322fc5d0c8b896f545] libata: always use polling SETXFER
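
For readers unfamiliar with bisection, the idea behind the result above can be sketched as a binary search between a known-good and a known-bad revision. This is a simplified illustration that assumes a linear history and a reliable test; real git bisect works on the commit graph, and the names first_bad, is_bad and the commit labels below are hypothetical:

```python
def first_bad(commits, is_bad):
    """Return (first bad commit, number of commits tested).

    Assumes commits[0] is known good, commits[-1] is known bad, and every
    commit after the first bad one is also bad.
    """
    good, bad = 0, len(commits) - 1
    tests = 0
    while bad - good > 1:
        mid = (good + bad) // 2
        tests += 1
        if is_bad(commits[mid]):
            bad = mid   # the culprit is at mid or earlier
        else:
            good = mid  # the culprit is after mid
    return commits[bad], tests


if __name__ == "__main__":
    # Hypothetical linear history: c0 is the good end, c8 the bad end,
    # and c5 is the commit that introduced the problem.
    history = [f"c{i}" for i in range(9)]
    culprit, tests = first_bad(history, lambda c: int(c[1:]) >= 5)
    print(culprit, "found after", tests, "test builds")
```

Each test halves the remaining range, which is why a bisect between two release candidates needs only a handful of builds.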

According to marc, Mikael said:
Please give us some details about your sata_promise problem:
- describe your hardware (Promise chip version, mainboard, chipset, etc)
I have a Promise TX-4 and onboard via-sata.
0000:00:00.0 Host bridge: VIA Technologies, Inc. VT8377 [KT400/KT600 AGP] Host
Bridge (rev 80)
0000:00:01.0 PCI bridge: VIA Technologies, Inc. VT8237 PCI Bridge
0000:00:0d.0 Unknown mass storage controller: Promise Technology, Inc. PDC20318
(SATA150 TX4) (rev 02)
0000:00:0f.1 IDE interface: VIA Technologies, Inc.
VT82C586A/B/VT82C686/A/B/VT823x/A/C PIPC Bus Master IDE (rev 06)

- which was the last kernel version prior to 2.6.22-rc4 that worked
2.6.22-rc3
- the kernel messages up to the hang, if you can capture them
Easier once I learned patience...

sata_promise 0000:00:0d.0: version 2.07
ACPI: PCI Interrupt 0000:00:0d.0[A] -> GSI 16 (level, low) -> IRQ 17
scsi0 : sata_promise
scsi1 : sata_promise
scsi2 : sata_promise
scsi3 : sata_promise
ata1: SATA max UDMA/133 cmd 0xf880a200 ctl 0xf880a238 bmdma 0x00000000 irq 0
ata2: SATA max UDMA/133 cmd 0xf880a280 ctl 0xf880a2b8 bmdma 0x00000000 irq 0
ata3: SATA max UDMA/133 cmd 0xf880a300 ctl 0xf880a338 bmdma 0x00000000 irq 0
ata4: SATA max UDMA/133 cmd 0xf880a380 ctl 0xf880a3b8 bmdma 0x00000000 irq 0
ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata1.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata1.00: ATA-7: Maxtor 6B250S0, BANC19J0, max UDMA/133
ata1.00: 490234752 sectors, multi 0: LBA48 NCQ (depth 0/32)
ata1.00: qc timeout (cmd 0xef)
ata1.00: failed to set xfermode (err_mask=0x4)
ata1: failed to recover some devices, retrying in 5 secs
ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata1.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata1.00: qc timeout (cmd 0xef)
ata1.00: failed to set xfermode (err_mask=0x4)
ata1.00: limiting speed to UDMA/133:PIO3
ata1: failed to recover some devices, retrying in 5 secs
ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata1.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata1.00: qc timeout (cmd 0xef)
ata1.00: failed to set xfermode (err_mask=0x4)
ata1.00: disabled
ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata2.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata2.00: ATA-7: Maxtor 7Y250M0, YAR51EW0, max UDMA/133
ata2.00: 490234752 sectors, multi 0: LBA48
ata2.00: qc timeout (cmd 0xef)
ata2.00: failed to set xfermode (err_mask=0x4)
ata2: failed to recover some devices, retrying in 5 secs
ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata2.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata2.00: qc timeout (cmd 0xef)
ata2.00: failed to set xfermode (err_mask=0x4)
ata2.00: limiting speed to UDMA/133:PIO3
ata2: failed to recover some devices, retrying in 5 secs
ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata2.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata2.00: qc timeout (cmd 0xef)
ata2.00: failed to set xfermode (err_mask=0x4)
ata2.00: disabled
ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata3.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata3.00: ATA-7: Maxtor 7Y250M0, YAR51EW0, max UDMA/133
ata3.00: 490234752 sectors, multi 0: LBA48
ata3.00: qc timeout (cmd 0xef)
ata3.00: failed to set xfermode (err_mask=0x4)
ata3: failed to recover some devices, retrying in 5 secs
ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata3.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata3.00: qc timeout (cmd 0xef)
ata3.00: failed to set xfermode (err_mask=0x4)
ata3.00: limiting speed to UDMA/133:PIO3
ata3: failed to recover some devices, retrying in 5 secs
ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata3.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata3.00: qc timeout (cmd 0xef)
ata3.00: failed to set xfermode (err_mask=0x4)
ata3.00: disabled
ata4: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata4.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata4.00: ATA-7: Maxtor 6B250S0, BANC1980, max UDMA/133
ata4.00: 490234752 sectors, multi 0: LBA48 NCQ (depth 0/32)
ata4.00: qc timeout (cmd 0xef)
ata4.00: failed to set xfermode (err_mask=0x4)
ata4: failed to recover some devices, retrying in 5 secs
ata4: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata4.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata4.00: qc timeout (cmd 0xef)
ata4.00: failed to set xfermode (err_mask=0x4)
ata4.00: limiting speed to UDMA/133:PIO3
ata4: failed to recover some devices, retrying in 5 secs
ata4: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata4.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata4.00: qc timeout (cmd 0xef)
ata4.00: failed to set xfermode (err_mask=0x4)
ata4.00: disabled
sata_via 0000:00:0f.0: version 2.2
ACPI: PCI Interrupt 0000:00:0f.0[B] -> GSI 20 (level, low) -> IRQ 16
sata_via 0000:00:0f.0: routed to hard irq line 0
scsi4 : sata_via
scsi5 : sata_via
ata5: SATA max UDMA/133 cmd 0x0001b000 ctl 0x0001a802 bmdma 0x00019800 irq 0
ata6: SATA max UDMA/133 cmd 0x0001a400 ctl 0x0001a002 bmdma 0x00019808 irq 0
ata5: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ATA: abnormal status 0x7F on port 0x0001b007
ATA: abnormal status 0x7F on port 0x0001b007
ata5.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata5.00: ATA-7: Maxtor 7B250S0, BANC1980, max UDMA/133
ata5.00: 490234752 sectors, multi 16: LBA48 NCQ (depth 0/32)
ata5.00: ata_hpa_resize 1: sectors = 490234752, hpa_sectors = 490234752
ata5.00: configured for UDMA/133
ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ATA: abnormal status 0x7F on port 0x0001a407
ATA: abnormal status 0x7F on port 0x0001a407
ata6.00: ata_hpa_resize 1: sectors = 781422768, hpa_sectors = 781422768
ata6.00: ATA-7: ST3400620AS, 3.AAK, max UDMA/133
ata6.00: 781422768 sectors, multi 16: LBA48 NCQ (depth 0/32)
ata6.00: ata_hpa_resize 1: sectors = 781422768, hpa_sectors = 781422768
ata6.00: configured for UDMA/133
scsi 4:0:0:0: Direct-Access ATA Maxtor 7B250S0 BANC PQ: 0 ANSI: 5
sd 4:0:0:0: [sda] 490234752 512-byte hardware sectors (251000 MB)
sd 4:0:0:0: [sda] Write Protect is off
sd 4:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 4:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO
or FUA
sd 4:0:0:0: [sda] 490234752 512-byte hardware sectors (251000 MB)
sd 4:0:0:0: [sda] Write Protect is off
sd 4:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 4:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO
or FUA
sda: sda1
sd 4:0:0:0: [sda] Attached SCSI disk
scsi 5:0:0:0: Direct-Access ATA ST3400620AS 3.AA PQ: 0 ANSI: 5
sd 5:0:0:0: [sdb] 781422768 512-byte hardware sectors (400088 MB)
sd 5:0:0:0: [sdb] Write Protect is off
sd 5:0:0:0: [sdb] Mode Sense: 00 3a 00 00
sd 5:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sd 5:0:0:0: [sdb] 781422768 512-byte hardware sectors (400088 MB)
sd 5:0:0:0: [sdb] Write Protect is off
sd 5:0:0:0: [sdb] Mode Sense: 00 3a 00 00
sd 5:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sdb: sdb1 sdb2
sd 5:0:0:0: [sdb] Attached SCSI disk




2007-06-05 18:55:13

by Michal Piotrowski

Subject: Re: Linux 2.6.22-rc4

Linus Torvalds wrote:
> So -rc4 is out there now, hopefully shrinking the regression list further.
>

It is too stable for me.

aio_dio_bugs ok
aiostress ok
bash_shared_mapping ok
cpu_hotplug ok
fio ok
fsx ok
interbench ok
iozone ok
isic ok
linus_stress ok
the latest ltp ok


this looks harmless

[ 116.733327] PM: suspend-to-disk mode set to 'shutdown'
[ 116.738849] swsusp: Basic memory bitmaps created
[ 116.745353] Stopping tasks ... WARNING: at /home/devel/linux-git/kernel/lockdep.c:2414 check_flags()
[ 116.754921] [<c0404fbf>] show_trace_log_lvl+0x1a/0x2f
[ 116.754937] [<c0405bd6>] show_trace+0x12/0x14
[ 116.754947] [<c0405c5d>] dump_stack+0x16/0x18
[ 116.754957] [<c043c3e5>] check_flags+0x95/0x143
[ 116.754967] [<c043f158>] lock_acquire+0x29/0x82
[ 116.754977] [<c06313a7>] _spin_lock+0x35/0x42
[ 116.754990] [<c044894a>] refrigerator+0x14/0xc6
[ 116.755002] [<c042d4b3>] get_signal_to_deliver+0x33/0x397
[ 116.755016] [<c0403597>] do_notify_resume+0x94/0x6ed
[ 116.755029] [<c0404099>] work_notifysig+0x13/0x1a
[ 116.755044] =======================
[ 116.755052] irq event stamp: 69
[ 116.755060] hardirqs last enabled at (69): [<c04040f9>] syscall_exit_work+0x11/0x26
[ 116.755084] hardirqs last disabled at (68): [<c0403fdd>] syscall_exit+0x9/0x1a
[ 116.755109] softirqs last enabled at (0): [<c042150c>] copy_process+0x4dd/0x1286
[ 116.755139] softirqs last disabled at (0): [<00000000>] 0x0
[ 116.945776] done.

http://www.stardust.webpages.pl/files/tbf/bitis-gabonica/2.6.22-rc4/git-dmesg
http://www.stardust.webpages.pl/files/tbf/bitis-gabonica/2.6.22-rc4/git-config

Regards,
Michal

--
"What I missed most was your silence."
-- Andrzej Sapkowski "Coś więcej"

2007-06-05 19:27:21

by Linus Torvalds

Subject: Re: Linux 2.6.22-rc4



On Tue, 5 Jun 2007, Michal Piotrowski wrote:
>
> It is too stable for me.

I wish all our testers were you ;)

> this looks harmless
>
> [ 116.733327] PM: suspend-to-disk mode set to 'shutdown'
> [ 116.738849] swsusp: Basic memory bitmaps created
> [ 116.745353] Stopping tasks ... WARNING: at /home/devel/linux-git/kernel/lockdep.c:2414 check_flags()
> [ 116.754921] [<c0404fbf>] show_trace_log_lvl+0x1a/0x2f
> [ 116.754937] [<c0405bd6>] show_trace+0x12/0x14
> [ 116.754947] [<c0405c5d>] dump_stack+0x16/0x18
> [ 116.754957] [<c043c3e5>] check_flags+0x95/0x143
> [ 116.754967] [<c043f158>] lock_acquire+0x29/0x82
> [ 116.754977] [<c06313a7>] _spin_lock+0x35/0x42
> [ 116.754990] [<c044894a>] refrigerator+0x14/0xc6
> [ 116.755002] [<c042d4b3>] get_signal_to_deliver+0x33/0x397
> [ 116.755016] [<c0403597>] do_notify_resume+0x94/0x6ed
> [ 116.755029] [<c0404099>] work_notifysig+0x13/0x1a
> [ 116.755044] =======================
> [ 116.755052] irq event stamp: 69
> [ 116.755060] hardirqs last enabled at (69): [<c04040f9>] syscall_exit_work+0x11/0x26
> [ 116.755084] hardirqs last disabled at (68): [<c0403fdd>] syscall_exit+0x9/0x1a
> [ 116.755109] softirqs last enabled at (0): [<c042150c>] copy_process+0x4dd/0x1286
> [ 116.755139] softirqs last disabled at (0): [<00000000>] 0x0
> [ 116.945776] done.
>
> http://www.stardust.webpages.pl/files/tbf/bitis-gabonica/2.6.22-rc4/git-dmesg
> http://www.stardust.webpages.pl/files/tbf/bitis-gabonica/2.6.22-rc4/git-config

Well, it's harmless in the sense that "yeah, the system still works", but
it does seem to be a real bug. We have hardware interrupts disabled when
we _think_ we should have them on, so our irq tracking is off.

Ingo, do you see what's up? It looks like we got a signal to a process
that just got created, is the setup stuff for "tsk->hardirqs_enabled"
perhaps off a bit?

Linus

2007-06-05 19:37:55

by Ingo Molnar

Subject: Re: Linux 2.6.22-rc4


* Linus Torvalds <[email protected]> wrote:

> > this looks harmless
> >
> > [ 116.733327] PM: suspend-to-disk mode set to 'shutdown'
> > [ 116.738849] swsusp: Basic memory bitmaps created
> > [ 116.745353] Stopping tasks ... WARNING: at /home/devel/linux-git/kernel/lockdep.c:2414 check_flags()

> > [ 116.755052] irq event stamp: 69
> > [ 116.755060] hardirqs last enabled at (69): [<c04040f9>] syscall_exit_work+0x11/0x26
> > [ 116.755084] hardirqs last disabled at (68): [<c0403fdd>] syscall_exit+0x9/0x1a
> > [ 116.755109] softirqs last enabled at (0): [<c042150c>] copy_process+0x4dd/0x1286
> > [ 116.755139] softirqs last disabled at (0): [<00000000>] 0x0
> > [ 116.945776] done.

> Well, it's harmless in the sense that "yeah, the system still works",
> but it does seem to be a real bug. We have hardware interrupts
> disabled when we _think_ we should have them on, so our irq tracking
> is off.
>
> Ingo, do you see what's up? It looks like we got a signal to a process
> that just got created, is the setup stuff for "tsk->hardirqs_enabled"
> perhaps off a bit?

hm. I cannot see the source of the bug at the moment, but here's my
analysis so far:

the last event that irqtrace got was #69, and that was a 'hardirqs on'
in syscall_exit_work. After that we did a 'hardirqs off' without
properly tracking that via irqtrace. Next time we got an irqtrace event
(event 70) the assert caught up with us and turned off lockdep and
backed out of that function. This was in:

> [ 116.754957] [<c043c3e5>] check_flags+0x95/0x143
> [ 116.754967] [<c043f158>] lock_acquire+0x29/0x82
> [ 116.754977] [<c06313a7>] _spin_lock+0x35/0x42
> [ 116.754990] [<c044894a>] refrigerator+0x14/0xc6
> [ 116.755002] [<c042d4b3>] get_signal_to_deliver+0x33/0x397
> [ 116.755016] [<c0403597>] do_notify_resume+0x94/0x6ed
> [ 116.755029] [<c0404099>] work_notifysig+0x13/0x1a

isn't the refrigerator() suspend related? Perhaps suspend disables irqs
somewhere that we forgot to track?

a new thread gets its hardirqs_enabled this way:

#ifdef __ARCH_WANT_INTERRUPTS_ON_CTXSW
p->hardirqs_enabled = 1;
#else
p->hardirqs_enabled = 0;
#endif

on i386 __ARCH_WANT_INTERRUPTS_ON_CTXSW is off so it starts with 0. We
set this up in copy_process() so there's no chance this task can run
without this initialized.

Ingo
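
The analysis above can be sketched as a small userspace model. This is only a toy under stated assumptions, not kernel code: every toy_* name is hypothetical, the starting stamp of 67 is chosen only so that the two traced events land on 68 and 69 as in the report, and a plain boolean stands in for the CPU's interrupt flag. It shows how one untraced "hardirqs off" leaves the tracked state and the real state out of sync until the next check notices.

```c
/* Toy userspace model of irq-state tracking; NOT kernel code.
 * All toy_* names are hypothetical. */
#include <assert.h>
#include <stdbool.h>

struct toy_task {
	bool hardirqs_enabled;    /* what the tracer believes */
	unsigned int irq_events;  /* stamp of the last traced event */
};

bool toy_hw_irqs_on = true;   /* stand-in for the CPU's interrupt flag */

/* Traced transitions update both the "hardware" and the tracked state. */
void toy_traced_irqs_off(struct toy_task *t)
{
	toy_hw_irqs_on = false;
	t->hardirqs_enabled = false;
	t->irq_events++;
}

void toy_traced_irqs_on(struct toy_task *t)
{
	toy_hw_irqs_on = true;
	t->hardirqs_enabled = true;
	t->irq_events++;
}

/* Disables interrupts behind the tracer's back. */
void toy_untraced_irqs_off(void)
{
	toy_hw_irqs_on = false;
}

/* Loosely analogous to the check that warned: do belief and hardware agree? */
bool toy_check_flags(const struct toy_task *t)
{
	return t->hardirqs_enabled == toy_hw_irqs_on;
}

/* Replays the sequence from the report; returns the final check result. */
bool toy_replay(void)
{
	struct toy_task t = { .hardirqs_enabled = true, .irq_events = 67 };

	toy_hw_irqs_on = true;
	toy_traced_irqs_off(&t);      /* event 68: "hardirqs last disabled" */
	toy_traced_irqs_on(&t);       /* event 69: "hardirqs last enabled" */
	assert(t.irq_events == 69 && toy_check_flags(&t));

	toy_untraced_irqs_off();      /* the step the tracer never saw */
	return toy_check_flags(&t);   /* false: state is out of sync */
}
```

Calling toy_replay() returns false, which corresponds to the point where the next lock acquisition trips the warning in the trace.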

2007-06-05 19:50:41

by Linus Torvalds

Subject: Re: Linux 2.6.22-rc4



On Tue, 5 Jun 2007, Ingo Molnar wrote:
>
> isnt the refrigerator() suspend related? Perhaps suspend disables irqs
> somewhere that we forgot to track?

There _is_ something strange there. For that whole sequence to trigger,
the current task has to have the TIF_FREEZE bit set, but I don't see why
it would be during shutdown.

Linus
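
The gate described here, where the refrigerator is only entered if the task's freeze flag is set, can be illustrated with a toy model. It is a sketch under assumptions rather than the kernel's implementation: the toy_* names and the bit number are hypothetical, and the toy refrigerator simply clears the flag and counts the visit instead of blocking until the task is thawed.

```c
/* Toy model of a per-thread freeze flag; NOT kernel code.
 * TOY_TIF_FREEZE's bit number is hypothetical. */
#include <stdbool.h>

#define TOY_TIF_FREEZE 5

struct toy_thread {
	unsigned long flags;   /* per-thread flag word */
	int times_frozen;      /* how often the toy refrigerator ran */
};

/* Something (here: the suspend path) asks the thread to freeze. */
void toy_freeze_request(struct toy_thread *t)
{
	t->flags |= 1UL << TOY_TIF_FREEZE;
}

bool toy_freezing(const struct toy_thread *t)
{
	return (t->flags >> TOY_TIF_FREEZE) & 1UL;
}

/* Toy refrigerator: acknowledge the request and record the visit. */
void toy_refrigerator(struct toy_thread *t)
{
	t->flags &= ~(1UL << TOY_TIF_FREEZE);
	t->times_frozen++;
}

/* Called on the signal-delivery path; enters the refrigerator only
 * when a freeze was requested. Returns true if it did. */
bool toy_try_to_freeze(struct toy_thread *t)
{
	if (!toy_freezing(t))
		return false;
	toy_refrigerator(t);
	return true;
}
```

Without a prior toy_freeze_request(), toy_try_to_freeze() returns false and the refrigerator is never reached, which is why the flag being set matters for the trace above.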

2007-06-05 20:17:17

by Pavel Machek

Subject: Re: Linux 2.6.22-rc4

Hi!

> > isnt the refrigerator() suspend related? Perhaps suspend disables irqs
> > somewhere that we forgot to track?
>
> There _is_ something strange there. For that whole sequence to trigger,
> the current task has to have the TIF_FREEZE bit set, but I don't see why
> it would be during shutdown.

[ 116.733327] PM: suspend-to-disk mode set to 'shutdown'
[ 116.738849] swsusp: Basic memory bitmaps created
[ 116.745353] Stopping tasks ... WARNING: at /home/devel/linux-git/kernel/lockdep.c:2414 check_flags()
[ 116.754921] [<c0404fbf>] show_trace_log_lvl+0x1a/0x2f
[ 116.754937] [<c0405bd6>] show_trace+0x12/0x14

No, he's not shutting down, he's trying to snapshot his
machine. 'shutdown' will be done at end of hibernation, _way_ after
this.

Pavel

--
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

2007-06-05 20:20:34

by Pavel Machek

Subject: Re: Linux 2.6.22-rc4

Hi!

> > > [ 116.733327] PM: suspend-to-disk mode set to 'shutdown'
> > > [ 116.738849] swsusp: Basic memory bitmaps created
> > > [ 116.745353] Stopping tasks ... WARNING: at /home/devel/linux-git/kernel/lockdep.c:2414 check_flags()
>
> > > [ 116.755052] irq event stamp: 69
> > > [ 116.755060] hardirqs last enabled at (69): [<c04040f9>] syscall_exit_work+0x11/0x26
> > > [ 116.755084] hardirqs last disabled at (68): [<c0403fdd>] syscall_exit+0x9/0x1a
> > > [ 116.755109] softirqs last enabled at (0): [<c042150c>] copy_process+0x4dd/0x1286
> > > [ 116.755139] softirqs last disabled at (0): [<00000000>] 0x0
> > > [ 116.945776] done.
>
> > Well, it's harmless in the sense that "yeah, the system still works",
> > but it does seem to be a real bug. We have hardware interrupts
> > disabled when we _think_ we should have them on, so our irq tracking
> > is off.
> >
> > Ingo, do you see what's up? It looks like we got a signal to a process
> > that just got created, is the setup stuff for "tsk->hardirqs_enabled"
> > perhaps off a bit?
>
> hm. I cannot see the source of the bug at the moment, but here's my
> analysis so far:
>
> the last event that irqtrace got was #69, and that was a 'hardirqs on'
> in syscall_exit_work. After that we did a 'hardirqs off' without
> properly tracking that via irqtrace. Next time we got an irqtrace event
> (event 70) the assert caught up with us and turned off lockdep and
> backed out of that function. This was in:
>
> > [ 116.754957] [<c043c3e5>] check_flags+0x95/0x143
> > [ 116.754967] [<c043f158>] lock_acquire+0x29/0x82
> > [ 116.754977] [<c06313a7>] _spin_lock+0x35/0x42
> > [ 116.754990] [<c044894a>] refrigerator+0x14/0xc6
> > [ 116.755002] [<c042d4b3>] get_signal_to_deliver+0x33/0x397
> > [ 116.755016] [<c0403597>] do_notify_resume+0x94/0x6ed
> > [ 116.755029] [<c0404099>] work_notifysig+0x13/0x1a
>
> isnt the refrigerator() suspend related? Perhaps suspend disables irqs
> somewhere that we forgot to track?

refrigerator is suspend related, but I do not think it does any
interrupt magic. We do the magic later in the hibernation process.

This is in kernel/power/process.c, we have spin_lock_irqsave there, but
that's pretty much it AFAICT.
Pavel
--
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

2007-06-05 21:11:21

by Badari Pulavarty

Subject: Re: Linux 2.6.22-rc4

On Mon, 2007-06-04 at 20:50 -0700, Linus Torvalds wrote:
> So -rc4 is out there now, hopefully shrinking the regression list further.
>

Nothing serious, compile warnings ..

mm/sparse.c:244: warning: `__kmalloc_section_memmap' defined but not used
mm/sparse.c:274: warning: `__kfree_section_memmap' defined but not used

Here is the patch.

Thanks,
Badari

__kmalloc_section_memmap(), vaddr_in_vmalloc_area() and
__kfree_section_memmap() are used only for MEMORY_HOTPLUG.
Moved them under CONFIG_MEMORY_HOTPLUG.

Signed-off-by: Badari Pulavarty <[email protected]>

Index: linux-2.6.22-rc4/mm/sparse.c
===================================================================
--- linux-2.6.22-rc4.orig/mm/sparse.c 2007-06-04 17:57:25.000000000 -0700
+++ linux-2.6.22-rc4/mm/sparse.c 2007-06-05 13:56:29.000000000 -0700
@@ -240,6 +240,27 @@ static struct page __init *sparse_early_
return NULL;
}

+/*
+ * Allocate the accumulated non-linear sections, allocate a mem_map
+ * for each and record the physical to section mapping.
+ */
+void __init sparse_init(void)
+{
+ unsigned long pnum;
+ struct page *map;
+
+ for (pnum = 0; pnum < NR_MEM_SECTIONS; pnum++) {
+ if (!valid_section_nr(pnum))
+ continue;
+
+ map = sparse_early_mem_map_alloc(pnum);
+ if (!map)
+ continue;
+ sparse_init_one_section(__nr_to_section(pnum), pnum, map);
+ }
+}
+
+#ifdef CONFIG_MEMORY_HOTPLUG
static struct page *__kmalloc_section_memmap(unsigned long nr_pages)
{
struct page *page, *ret;
@@ -280,27 +301,6 @@ static void __kfree_section_memmap(struc
}

/*
- * Allocate the accumulated non-linear sections, allocate a mem_map
- * for each and record the physical to section mapping.
- */
-void __init sparse_init(void)
-{
- unsigned long pnum;
- struct page *map;
-
- for (pnum = 0; pnum < NR_MEM_SECTIONS; pnum++) {
- if (!valid_section_nr(pnum))
- continue;
-
- map = sparse_early_mem_map_alloc(pnum);
- if (!map)
- continue;
- sparse_init_one_section(__nr_to_section(pnum), pnum, map);
- }
-}
-
-#ifdef CONFIG_MEMORY_HOTPLUG
-/*
* returns the number of sections whose mem_maps were properly
* set. If this is <=0, then that means that the passed-in
* map was not consumed and must be freed.
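
The idea behind the patch, keeping static helpers inside the same preprocessor guard as their only callers so that no "defined but not used" warning appears when the option is off, can be shown with a minimal standalone sketch. TOY_MEMORY_HOTPLUG and all toy_* names are hypothetical stand-ins, not the kernel's symbols, and the 32-byte per-page size is an arbitrary illustration value.

```c
/* Minimal sketch of guarding helpers with their only users; NOT kernel code.
 * TOY_MEMORY_HOTPLUG is a hypothetical stand-in for a config option. */
#include <stddef.h>

#ifdef TOY_MEMORY_HOTPLUG
/* Helper needed only by the hotplug path, so it lives inside the guard. */
static size_t toy_section_memmap_bytes(size_t nr_pages)
{
	return nr_pages * 32;   /* 32 bytes per page entry: arbitrary */
}

/* The only caller of the helper above. */
size_t toy_hotplug_add(size_t nr_pages)
{
	return toy_section_memmap_bytes(nr_pages);
}
#endif /* TOY_MEMORY_HOTPLUG */

/* Always-built code stays outside the guard. */
int toy_hotplug_compiled_in(void)
{
#ifdef TOY_MEMORY_HOTPLUG
	return 1;
#else
	return 0;
#endif
}
```

Built without -DTOY_MEMORY_HOTPLUG, neither the helper nor its caller is compiled, so the compiler has nothing unused to warn about.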




2007-06-05 21:55:46

by Rafael J. Wysocki

Subject: Re: Linux 2.6.22-rc4

On Tuesday, 5 June 2007 22:19, Pavel Machek wrote:
> Hi!
>
> > > > [ 116.733327] PM: suspend-to-disk mode set to 'shutdown'
> > > > [ 116.738849] swsusp: Basic memory bitmaps created
> > > > [ 116.745353] Stopping tasks ... WARNING: at /home/devel/linux-git/kernel/lockdep.c:2414 check_flags()
> >
> > > > [ 116.755052] irq event stamp: 69
> > > > [ 116.755060] hardirqs last enabled at (69): [<c04040f9>] syscall_exit_work+0x11/0x26
> > > > [ 116.755084] hardirqs last disabled at (68): [<c0403fdd>] syscall_exit+0x9/0x1a
> > > > [ 116.755109] softirqs last enabled at (0): [<c042150c>] copy_process+0x4dd/0x1286
> > > > [ 116.755139] softirqs last disabled at (0): [<00000000>] 0x0
> > > > [ 116.945776] done.
> >
> > > Well, it's harmless in the sense that "yeah, the system still works",
> > > but it does seem to be a real bug. We have hardware interrupts
> > > disabled when we _think_ we should have them on, so our irq tracking
> > > is off.
> > >
> > > Ingo, do you see what's up? It looks like we got a signal to a process
> > > that just got created, is the setup stuff for "tsk->hardirqs_enabled"
> > > perhaps off a bit?
> >
> > hm. I cannot see the source of the bug at the moment, but here's my
> > analysis so far:
> >
> > the last event that irqtrace got was #69, and that was a 'hardirqs on'
> > in syscall_exit_work. After that we did a 'hardirqs off' without
> > properly tracking that via irqtrace. Next time we got an irqtrace event
> > (event 70) the assert caught up with us and turned off lockdep and
> > backed out of that function. This was in:
> >
> > > [ 116.754957] [<c043c3e5>] check_flags+0x95/0x143
> > > [ 116.754967] [<c043f158>] lock_acquire+0x29/0x82
> > > [ 116.754977] [<c06313a7>] _spin_lock+0x35/0x42
> > > [ 116.754990] [<c044894a>] refrigerator+0x14/0xc6
> > > [ 116.755002] [<c042d4b3>] get_signal_to_deliver+0x33/0x397
> > > [ 116.755016] [<c0403597>] do_notify_resume+0x94/0x6ed
> > > [ 116.755029] [<c0404099>] work_notifysig+0x13/0x1a
> >
> > isnt the refrigerator() suspend related? Perhaps suspend disables irqs
> > somewhere that we forgot to track?
>
> refrigerator is suspend related, but I do not think it does any
> interrupt magic. We do magic later in hibernation process.
>
> This is in kernel/power/process.c, we have spinlock_irqsave there, but
> that's pretty much it AFAICT.

That's correct. We don't manipulate IRQs directly in the freezer.

Greetings,
Rafael


--
"Premature optimization is the root of all evil." - Donald Knuth

2007-06-07 15:09:05

by Jan Engelhardt

Subject: Re: Linux 2.6.22-rc4


On Jun 5 2007 12:26, Linus Torvalds wrote:
>On Tue, 5 Jun 2007, Michal Piotrowski wrote:
>>
>> It is too stable for me.
>
>I wish all our testers were you ;)

It [2.6.22-rc4] did not die on me so far, unlike that stock FC7 kernel
(same config but without fc patches).



Jan
--

2007-06-07 15:24:52

by Gene Heskett

Subject: Re: Linux 2.6.22-rc4

On Thursday 07 June 2007, Jan Engelhardt wrote:
>On Jun 5 2007 12:26, Linus Torvalds wrote:
>>On Tue, 5 Jun 2007, Michal Piotrowski wrote:
>>> It is too stable for me.
>>
>>I wish all our testers were you ;)
>
>It [2.6.22-rc4] did not die on me so far, unlike that stock FC7 kernel
>(same config but without fc patches).
>
>
>
> Jan

Oh, it's stable all right, but why do I have to turn the monitor off by hand when
I leave? None of that is working, not even the screen blanker.

--
Cheers, Gene
"There are four boxes to be used in defense of liberty:
soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
I'm not an Iranian!! I voted for Dianne Feinstein!!

2007-06-07 15:38:43

by Jan Engelhardt

Subject: Re: Linux 2.6.22-rc4


On Jun 7 2007 11:24, Gene Heskett wrote:
>>>> It is too stable for me.
>>>
>>>I wish all our testers were you ;)
>>
>>It [2.6.22-rc4] did not die on me so far, unlike that stock FC7 kernel
>>(same config but without fc patches).
>>
>Oh its stable allright, but why do I have to turn the monitor off by hand when
>I leave. None of that is working, not even the screen blanker.

VC or X?


Jan
--

2007-06-07 16:06:25

by Gene Heskett

Subject: Re: Linux 2.6.22-rc4

On Thursday 07 June 2007, Jan Engelhardt wrote:
>On Jun 7 2007 11:24, Gene Heskett wrote:
>>>>> It is too stable for me.
>>>>
>>>>I wish all our testers were you ;)
>>>
>>>It [2.6.22-rc4] did not die on me so far, unlike that stock FC7 kernel
>>>(same config but without fc patches).
>>
>>Oh its stable allright, but why do I have to turn the monitor off by hand
>> when I leave. None of that is working, not even the screen blanker.
>
>VC or X?
>
>
> Jan

Regular X, fc6 box, amd xp2800. Using NVIDIA's latest driver, and it works
when booted to a 2.6.21 era kernel.

Just about up to date, seems I have to do the kde stuffs one or two pieces at
a time as yumex isn't handling the deps well otherwise.

--
Cheers, Gene
"There are four boxes to be used in defense of liberty:
soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
The inherent vice of capitalism is the unequal sharing of blessings;
the inherent virtue of socialism is the equal sharing of misery.
-- Churchill

2007-06-07 16:54:46

by Jan Engelhardt

Subject: Re: Linux 2.6.22-rc4


On Jun 7 2007 12:06, Gene Heskett wrote:
>
>On Thursday 07 June 2007, Jan Engelhardt wrote:
>>On Jun 7 2007 11:24, Gene Heskett wrote:
>>>>>> It is too stable for me.
>>>>>
>>>>>I wish all our testers were you ;)
>>>>
>>>>It [2.6.22-rc4] did not die on me so far, unlike that stock FC7 kernel
>>>>(same config but without fc patches).
>>>
>>>Oh its stable allright, but why do I have to turn the monitor off by hand
>>> when I leave. None of that is working, not even the screen blanker.
>>
>>VC or X?
>
>Regular X

Most likely an X incompatibility then?

> fc6 box, amd xp2800. Using NVIDIA's latest driver, and it works
>when booted to a 2.6.21 era kernel.

Well, NVIDIA is your first stop then. What about nv?



Jan
--

2007-06-07 20:21:30

by Gene Heskett

Subject: Re: Linux 2.6.22-rc4

On Thursday 07 June 2007, Jan Engelhardt wrote:
>On Jun 7 2007 12:06, Gene Heskett wrote:
>>On Thursday 07 June 2007, Jan Engelhardt wrote:
>>>On Jun 7 2007 11:24, Gene Heskett wrote:
>>>>>>> It is too stable for me.
>>>>>>
>>>>>>I wish all our testers were you ;)
>>>>>
>>>>>It [2.6.22-rc4] did not die on me so far, unlike that stock FC7 kernel
>>>>>(same config but without fc patches).
>>>>
>>>>Oh its stable allright, but why do I have to turn the monitor off by hand
>>>> when I leave. None of that is working, not even the screen blanker.
>>>
>>>VC or X?
>>
>>Regular X
>
>Most likely an X incompatibility then?
>
>> fc6 box, amd xp2800. Using NVIDIA's latest driver, and it works
>>when booted to a 2.6.21 era kernel.
>
>Well, NVIDIA is your first stop then. What about nv?
>
>
>
> Jan

Don't know, Jan. nv, the last time I tried that bucket of molasses in January,
couldn't drive this card at more than 800x600, and I am used to double that
both ways.

The last time I booted to nv, it took me a week to get all the crap fixed that
it overwrote trying to fit the default screens I use while trying to make
them fit in an 800x600 window.

I run 1600x1200, on either an ati card (but the radeon driver quit supporting
the r280 chipset according to boot messages, the main reason I bought this
nvidia card) or on an older nvidia card that nv was quite happy with. That
card went belly up and took the motherboard with it 2 years ago though.

Did you miss the comment that it works with a 2.6.21ish kernel? Everything
else being equal. I will boot to one of them later tonight and test it to
make sure it's not some update to kde though, and let the list know.

--
Cheers, Gene
"There are four boxes to be used in defense of liberty:
soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
Vote anarchist.

2007-06-07 21:15:07

by Pavel Machek

Subject: Re: Linux 2.6.22-rc4

Hi!

> Don't know Jan. nv, the last time I tried that bucket of molasses in january,
> couldn't drive this card at more than 800x600, and I am used to double that
> both ways.
>
> The last time I booted to nv, it took me a week to get all the crap fixed that
> it overwrote trying to fit the default screens I use while trying to make
> them fit in an 800x600 window.
>
> I run 1600x1200, on either an ati card (but the radeon driver quit supporting
> the r280 chipset according to boot messages, the main reason I bought this
> nvidia card) or on an older nvidia card that nv was quite happy with. That
> card went belly up and took the motherboard with it 2 years ago though.
>
> Did you miss the comment that it works with a 2.6.21ish kernel? Everything
> else being equal. I will boot to one of them later tonight and test it to
> make sure its not some update to kde though, and let the list know.

Well, your kernel is tainted with nvidia crap we are not allowed to
debug. Issue is screen blanking. "It worked before" or not is not
really interesting, sorry.
Pavel
--
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

2007-06-10 06:15:38

by Avuton Olrich

Subject: Re: Linux 2.6.22-rc4

On 6/4/07, Linus Torvalds <[email protected]> wrote:
>
> So -rc4 is out there now, hopefully shrinking the regression list further.

(CCd net device MAINTAINERs, I'm not sure, but nv_alloc_rx is forcedeth)

This server has been up for about a day now and I'm starting to get
some bad looking messages when plenty gets transferred over NFS:

(please excuse the links, without them this message would certainly be
too large)
Linux version 2.6.22-rc4 (root@shapeshifter) (gcc version 4.2.0
(Gentoo 4.2.0)) #6 PREEMPT Fri Jun 8 18:54:00 PDT 2007
(it's actually with the latest rsdl patch)

http://avuton.googlepages.com/config
http://avuton.googlepages.com/lspci-vvv
http://avuton.googlepages.com/ioports
http://avuton.googlepages.com/iomem

=======================
Mem-info:
DMA per-cpu:
CPU 0: Hot: hi: 0, btch: 1 usd: 0 Cold: hi: 0, btch: 1 usd: 0
Normal per-cpu:
CPU 0: Hot: hi: 186, btch: 31 usd: 87 Cold: hi: 62, btch: 15 usd: 57
Active:43224 inactive:168748 dirty:2679 writeback:24182 unstable:0
free:4334 slab:6997 mapped:6927 pagetables:286 bounce:0
DMA free:3520kB min:68kB low:84kB high:100kB active:1064kB inactive:5492kB present:16256kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 873
Normal free:13816kB min:3744kB low:4680kB high:5616kB active:171832kB inactive:669500kB present:894080kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 0
DMA: 38*4kB 1*8kB 0*16kB 3*32kB 1*64kB 1*128kB 2*256kB 1*512kB 0*1024kB 1*2048kB 0*4096kB = 3520kB
Normal: 3278*4kB 0*8kB 0*16kB 0*32kB 1*64kB 1*128kB 0*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 13816kB
Swap cache: add 0, delete 0, find 0/0, race 0+0
Free swap = 498004kB
Total swap = 498004kB
Free swap: 498004kB
229376 pages of RAM
0 pages of HIGHMEM
3350 reserved pages
119197 pages shared
0 pages swap cached
2679 pages dirty
24182 pages writeback
6927 pages mapped
6997 pages slab
286 pages pagetables
swapper: page allocation failure. order:1, mode:0x4020
[<c0149b09>] __alloc_pages+0x239/0x2f0
[<c02f6046>] nv_alloc_rx+0xf6/0x1a0
[<c0161622>] __slab_alloc+0x422/0x500
[<c02f6046>] nv_alloc_rx+0xf6/0x1a0
[<c0163055>] __kmalloc_track_caller+0x65/0x70
[<c02f6046>] nv_alloc_rx+0xf6/0x1a0
[<c03d0435>] __alloc_skb+0x55/0x120
[<c02f6046>] nv_alloc_rx+0xf6/0x1a0
[<c0133f1f>] getnstimeofday+0x2f/0xe0
[<c02f7d78>] nv_nic_irq+0x2a8/0x590
[<c02f7b09>] nv_nic_irq+0x39/0x590
[<c0142005>] handle_IRQ_event+0x25/0x50
[<c014386b>] handle_fasteoi_irq+0x5b/0xe0
[<c010685a>] do_IRQ+0x4a/0x80
[<c03d10a2>] __netdev_alloc_skb+0x22/0x50
[<c03d10a2>] __netdev_alloc_skb+0x22/0x50
[<c0104a53>] common_interrupt+0x23/0x28
[<c03d10a2>] __netdev_alloc_skb+0x22/0x50
[<c016302e>] __kmalloc_track_caller+0x3e/0x70
[<c03d10a2>] __netdev_alloc_skb+0x22/0x50
[<c03d0435>] __alloc_skb+0x55/0x120
[<c03d10a2>] __netdev_alloc_skb+0x22/0x50
[<c02f26f4>] skge_poll+0x2c4/0x600
[<c01250f7>] get_next_timer_interrupt+0x1a7/0x230
[<c02f2558>] skge_poll+0x128/0x600
[<c03d6c21>] net_rx_action+0x61/0x170
[<c01211d2>] __do_softirq+0x42/0x90
[<c0121247>] do_softirq+0x27/0x30
[<c01214b5>] irq_exit+0x65/0x70
[<c010685f>] do_IRQ+0x4f/0x80
[<c010685f>] do_IRQ+0x4f/0x80
[<c0104a53>] common_interrupt+0x23/0x28
[<c0102b30>] default_idle+0x0/0x40
[<c0102b5a>] default_idle+0x2a/0x40
[<c01023f0>] cpu_idle+0x50/0x70
[<c0590b35>] start_kernel+0x215/0x260
[<c0590450>] unknown_bootoption+0x0/0x260
=======================
--
avuton
--
Anyone who quotes me in their sig is an idiot. -- Rusty Russell.
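
The per-order free lists in the Mem-info dump above can be checked with a little arithmetic. The failed request was order:1, meaning two contiguous 4kB pages, and the Normal zone lists 0*8kB blocks with only three larger blocks free. The sketch below is a rough illustration, not kernel code: the toy_* names are hypothetical, the counts are copied from the log, and it does not model the watermark rules that decide whether an allocation may actually take those larger blocks.

```c
/* Toy arithmetic on the per-order free counts from the log; NOT kernel code.
 * All toy_* names are hypothetical. */
#include <stddef.h>

#define TOY_MAX_ORDER 11      /* orders 0..10, i.e. 4kB..4096kB blocks */
#define TOY_PAGE_KB   4UL     /* page size shown in the log */

/* Free block counts per order, copied from the "DMA:" and "Normal:" lines. */
const unsigned long toy_dma_counts[TOY_MAX_ORDER] =
	{ 38, 1, 0, 3, 1, 1, 2, 1, 0, 1, 0 };
const unsigned long toy_normal_counts[TOY_MAX_ORDER] =
	{ 3278, 0, 0, 0, 1, 1, 0, 1, 0, 0, 0 };

/* Total free memory in kB: each order-n block holds 2^n pages. */
unsigned long toy_free_kb(const unsigned long counts[TOY_MAX_ORDER])
{
	unsigned long total = 0;
	size_t order;

	for (order = 0; order < TOY_MAX_ORDER; order++)
		total += counts[order] * (TOY_PAGE_KB << order);
	return total;
}

/* Number of free blocks large enough for a request of the given order. */
unsigned long toy_blocks_at_least(const unsigned long counts[TOY_MAX_ORDER],
				  size_t min_order)
{
	unsigned long blocks = 0;
	size_t order;

	for (order = min_order; order < TOY_MAX_ORDER; order++)
		blocks += counts[order];
	return blocks;
}
```

The sums reproduce the totals printed in the log (3520kB for DMA, 13816kB for Normal), and toy_blocks_at_least(toy_normal_counts, 1) gives 3: nearly all free Normal memory sits in single 4kB pages.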