Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754338AbZGBJ3U (ORCPT ); Thu, 2 Jul 2009 05:29:20 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752064AbZGBJ3M (ORCPT ); Thu, 2 Jul 2009 05:29:12 -0400 Received: from mail-px0-f190.google.com ([209.85.216.190]:38253 "EHLO mail-px0-f190.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751993AbZGBJ3K (ORCPT ); Thu, 2 Jul 2009 05:29:10 -0400 DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=date:from:to:cc:subject:message-id:references:mime-version :content-type:content-disposition:content-transfer-encoding :in-reply-to:user-agent; b=wbcu+RPt0PkSzaJys/x6LVtIA3oiBYOfIgQ5RrXjZCiH9PaTuimIFv+HHCCeF/1r7f Vrh1LOQ6bht5o+P3yk8Yd35XiD7k0E65gBRjN1Xw+UvMZeZVhsnWk48z9KUPL/3VXDoH UdgAIaOUfVtR6WWQ8aFw0WpEnqBptuzb9cb5g= Date: Thu, 2 Jul 2009 17:31:20 +0800 From: Amerigo Wang To: Joao Correia Cc: Amerigo Wang , a.p.zijlstra@chello.nl, linux-kernel@vger.kernel.org Subject: lockdep max numbers (was Re: Crashes during boot on 2.6.30 / 2.6.31-rc, random programs) Message-ID: <20090702093120.GD6372@cr0.nay.redhat.com> References: <2375c9f90906281908k7aa089a3v932dd368d572194b@mail.gmail.com> <20090629091508.GA6308@cr0.nay.redhat.com> <20090630091640.GB5873@cr0.nay.redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: User-Agent: Mutt/1.5.18 (2008-05-17) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 7478 Lines: 151 On Wed, Jul 01, 2009 at 09:30:23PM +0100, Joao Correia wrote: >Hello again > >Looks like this just opened the lid on some other limits. I just hit >another thing, which hadn't shown up before i made the changes, but >looks like its just another limit thats too low. This one is harder to >reproduce tho. > >Jun 30 21:35:03 hightech kernel: BUG: MAX_LOCKDEP_CHAINS too low! >Jun 30 21:35:03 hightech kernel: turning off the locking correctness validator. >Jun 30 21:35:03 hightech kernel: Pid: 9379, comm: qemu-kvm Not tainted Similar problem... Peter? Would like to increase these numbers? >2.6.30-wl #3 >Jun 30 21:35:03 hightech kernel: Call Trace: >Jun 30 21:35:03 hightech kernel: [] ? printk+0x22/0x3b >Jun 30 21:35:03 hightech kernel: [] __lock_acquire+0x583/0xb05 >Jun 30 21:35:03 hightech kernel: [] ? __module_text_address+0x20/0x6d >Jun 30 21:35:03 hightech kernel: [] lock_acquire+0xb7/0xeb >Jun 30 21:35:03 hightech kernel: [] ? get_hash_bucket+0x3a/0x55 >Jun 30 21:35:03 hightech kernel: [] ? get_hash_bucket+0x3a/0x55 >Jun 30 21:35:03 hightech kernel: [] _spin_lock_irqsave+0x45/0x89 >Jun 30 21:35:03 hightech kernel: [] ? get_hash_bucket+0x3a/0x55 >Jun 30 21:35:03 hightech kernel: [] get_hash_bucket+0x3a/0x55 >Jun 30 21:35:03 hightech kernel: [] add_dma_entry+0x1f/0x4f >Jun 30 21:35:03 hightech kernel: [] debug_dma_map_sg+0xe1/0x147 >Jun 30 21:35:03 hightech kernel: [] ata_qc_issue+0x1c0/0x254 >Jun 30 21:35:03 hightech kernel: [] __ata_scsi_queuecmd+0x16a/0x1c8 >Jun 30 21:35:03 hightech kernel: [] ? scsi_done+0x0/0x32 >Jun 30 21:35:03 hightech kernel: [] ? ata_scsi_rw_xlat+0x0/0x207 >Jun 30 21:35:03 hightech kernel: [] ata_scsi_queuecmd+0x55/0x95 >Jun 30 21:35:03 hightech kernel: [] ? scsi_done+0x0/0x32 >Jun 30 21:35:03 hightech kernel: [] scsi_dispatch_cmd+0x187/0x207 >Jun 30 21:35:03 hightech kernel: [] scsi_request_fn+0x33d/0x480 >Jun 30 21:35:03 hightech kernel: [] ? del_timer+0x54/0x6e >Jun 30 21:35:03 hightech kernel: [] __generic_unplug_device+0x3e/0x53 >Jun 30 21:35:03 hightech kernel: [] blk_start_queueing+0x37/0x4a >Jun 30 21:35:03 hightech kernel: [] cfq_insert_request+0x46f/0x498 >Jun 30 21:35:03 hightech kernel: [] ? __rcu_read_unlock+0x0/0x3d >Jun 30 21:35:03 hightech kernel: [] elv_insert+0x12f/0x1d0 >Jun 30 21:35:03 hightech kernel: [] __elv_add_request+0xb5/0xce >Jun 30 21:35:03 hightech kernel: [] ? __rcu_read_unlock+0x2a/0x3d >Jun 30 21:35:03 hightech kernel: [] ? drive_stat_acct+0xbb/0xd3 >Jun 30 21:35:03 hightech kernel: [] __make_request+0x33b/0x3c1 >Jun 30 21:35:03 hightech kernel: [] ? check_valid_pointer+0x2c/0x6c >Jun 30 21:35:03 hightech kernel: [] ? mark_lock+0x29/0x1f3 >Jun 30 21:35:03 hightech kernel: [] ? kmem_cache_alloc+0xc2/0x150 >Jun 30 21:35:03 hightech kernel: [] generic_make_request+0x2fd/0x351 >Jun 30 21:35:03 hightech kernel: [] ? generic_make_request+0x2fd/0x351 >Jun 30 21:35:03 hightech kernel: [] ? mempool_alloc+0x5b/0x105 >Jun 30 21:35:03 hightech kernel: [] ? mark_held_locks+0x52/0x7c >Jun 30 21:35:03 hightech kernel: [] submit_bio+0xc0/0xd9 >Jun 30 21:35:03 hightech kernel: [] ? bio_alloc_bioset+0x35/0xd5 >Jun 30 21:35:03 hightech kernel: [] submit_bh+0xea/0x11b >Jun 30 21:35:03 hightech kernel: [] >__block_write_full_page+0x218/0x2f0 >Jun 30 21:35:03 hightech kernel: [] ? >ext4_normal_get_block_write+0x0/0x76 >Jun 30 21:35:03 hightech kernel: [] ? end_buffer_async_write+0x0/0x140 >Jun 30 21:35:03 hightech kernel: [] ? end_buffer_async_write+0x0/0x140 >Jun 30 21:35:03 hightech kernel: [] >block_write_full_page_endio+0x74/0xda >Jun 30 21:35:03 hightech kernel: [] ? end_buffer_async_write+0x0/0x140 >Jun 30 21:35:03 hightech kernel: [] ? >ext4_normal_get_block_write+0x0/0x76 >Jun 30 21:35:03 hightech kernel: [] block_write_full_page+0x22/0x38 >Jun 30 21:35:03 hightech kernel: [] ? end_buffer_async_write+0x0/0x140 >Jun 30 21:35:03 hightech kernel: [] ext4_da_writepage+0x1aa/0x1da >Jun 30 21:35:03 hightech kernel: [] mpage_da_submit_io+0x9b/0xf0 >Jun 30 21:35:03 hightech kernel: [] ext4_da_writepages+0x2c2/0x445 >Jun 30 21:35:03 hightech kernel: [] do_writepages+0x32/0x58 >Jun 30 21:35:03 hightech kernel: [] >__filemap_fdatawrite_range+0x73/0x8c >Jun 30 21:35:03 hightech kernel: [] filemap_fdatawrite_range+0x2b/0x44 >Jun 30 21:35:03 hightech kernel: [] sync_page_range+0x7f/0xf5 >Jun 30 21:35:03 hightech kernel: [] ? trace_hardirqs_on+0x19/0x2c >Jun 30 21:35:03 hightech kernel: [] generic_file_aio_write+0xbd/0xe1 >Jun 30 21:35:03 hightech kernel: [] ext4_file_write+0xc2/0x149 >Jun 30 21:35:03 hightech kernel: [] do_sync_write+0xbe/0x10a >Jun 30 21:35:03 hightech kernel: [] ? mark_lock+0x29/0x1f3 >Jun 30 21:35:03 hightech kernel: [] ? >autoremove_wake_function+0x0/0x55 >Jun 30 21:35:03 hightech kernel: [] ? __rcu_read_unlock+0x0/0x3d >Jun 30 21:35:03 hightech kernel: [] ? >security_file_permission+0x23/0x36 >Jun 30 21:35:03 hightech kernel: [] ? rw_verify_area+0xa5/0xd8 >Jun 30 21:35:03 hightech kernel: [] ? do_sync_write+0x0/0x10a >Jun 30 21:35:03 hightech kernel: [] vfs_write+0x9f/0x10f >Jun 30 21:35:03 hightech kernel: [] sys_pwrite64+0x6a/0x93 >Jun 30 21:35:03 hightech kernel: [] sysenter_do_call+0x12/0x38 > > >Now the whole call trace is showing up (so the other limits were >raised correctly). > >Im going to go through the source and raise this limit as well, but as >i said above, this one is harder to trigger, so it may take a couple >of days to see it again. > >The first problem was indeed just the tip of the iceberg. Perphaps a >more comprehensive change of all the related limits is in order? (I do >not know if such change has already happened or not, if so, please >point me in the right direction.) > >Again, >Thank you very much for your time, >Joao Correia >Centro de Informatica >Universidade da Beira Interior >Portugal > >On Tue, Jun 30, 2009 at 10:16 AM, Amerigo Wang wrote: >> On Mon, Jun 29, 2009 at 11:30:41AM +0100, Joao Correia wrote: >>>Hello >>> >>>The system seemed to happily ignore all the sysctl.conf changes and >>>all echo VALUE >  /proc/sys/kernel/max_lock_depth >> >> >> Ouch, mistake! max_lock_depth is for rt_mutex... :) >> >>> >>>So i dug a little on the source and changed >>> >>>include/linux/sched.h >>> >>># define MAX_LOCK_DEPTH 48UL >>> >>>to >>> >>># define MAX_LOCK_DEPTH 96UL >>> >>>and im getting no more errors. Of course, now its probably radioactive >>>and about to blow up, but at least it's not complaining anymore :). >> >> Yeah, this is right. >> >> Let's Cc: Peter to see if he would like to change this number... >> >> Peter? >> -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/