Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751311Ab3IIBUW (ORCPT ); Sun, 8 Sep 2013 21:20:22 -0400 Received: from g4t0017.houston.hp.com ([15.201.24.20]:7617 "EHLO g4t0017.houston.hp.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751099Ab3IIBUV (ORCPT ); Sun, 8 Sep 2013 21:20:21 -0400 Message-ID: <1378689618.2103.2.camel@buesod1.americas.hpqcorp.net> Subject: Re: soft lockup in sysvipc code. From: Davidlohr Bueso To: Manfred Spraul Cc: Dave Jones , Linux Kernel , Davidlohr Bueso Date: Sun, 08 Sep 2013 18:20:18 -0700 In-Reply-To: <522AEDBF.2050203@colorfullife.com> References: <20130904215022.GA30606@redhat.com> <522AEDBF.2050203@colorfullife.com> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.4.4 (3.4.4-2.fc17) Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3570 Lines: 68 On Sat, 2013-09-07 at 11:11 +0200, Manfred Spraul wrote: > Hi Dave, > > On 09/04/2013 11:50 PM, Dave Jones wrote: > > Haven't seen this before. > > Tree based on v3.11-3104-gf357a82 > > > > BUG: soft lockup - CPU#0 stuck for 22s! [trinity-child0:25479] > > Modules linked in: sctp snd_seq_dummy fuse dlci rfcomm tun bnep hidp ipt_ULOG nfnetlink can_raw can_bcm scsi_transport_iscsi nfc caif_socket caif af_802154 phonet af_rxrpc bluetooth rfkill can llc2 pppoe pppox ppp_generic slhc irda crc_ccitt rds af_key rose x25 atm netrom appletalk ipx p8023 psnap p8022 llc ax25 xfs snd_hda_codec_realtek libcrc32c snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_page_alloc snd_timer snd soundcore pcspkr usb_debug e1000e ptp pps_core > > irq event stamp: 1143030 > > hardirqs last enabled at (1143029): [] restore_args+0x0/0x30 > > hardirqs last disabled at (1143030): [] apic_timer_interrupt+0x6a/0x80 > > softirqs last enabled at (1143028): [] __do_softirq+0x198/0x460 > > softirqs last disabled at (1143023): [] irq_exit+0x135/0x150 > > CPU: 0 PID: 25479 Comm: trinity-child0 Not tainted 3.11.0+ #44 > > task: ffff88022c013f90 ti: ffff88022bd8c000 task.ti: ffff88022bd8c000 > > RIP: 0010:[] [] idr_find_slowpath+0x9b/0x150 > > RSP: 0018:ffff88022bd8dc88 EFLAGS: 00000206 > > RAX: 0000000000000006 RBX: ffff000a6c0a0000 RCX: 0000000000000008 > > RDX: 0000000000000008 RSI: ffffffff81c41040 RDI: ffff88022c014668 > > RBP: ffff88022bd8dca0 R08: 0000000000000000 R09: 0000000000000000 > > R10: 0000000000000001 R11: 0000000000000001 R12: ffff88023831a290 > > R13: 0000000000000001 R14: ffff88022bd8dbe8 R15: ffff8802449dffff > > FS: 00007fcfcad2c740(0000) GS:ffff880244800000(0000) knlGS:0000000000000000 > > CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > > CR2: 00007fcfc84cb968 CR3: 00000001de93f000 CR4: 00000000001407f0 > > DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > > DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 > > Stack: > > 0000000000000260 0000000000002dba ffffffff81c7e258 ffff88022bd8dcf8 > > ffffffff812b1131 ffff88022c013f90 ffff8801d37174c0 ffff88022bd8dd38 > > ffffffff81c7e2f0 ffff88022bd8dd38 ffff8801e065cec8 ffff880241d86ca8 > > Call Trace: > > [] sysvipc_find_ipc+0x61/0x300 > > [] sysvipc_proc_next+0x46/0xd0 > > [] traverse.isra.7+0xc9/0x260 > > [] ? lock_release_non_nested+0x308/0x350 > > [] seq_read+0x3e1/0x450 > > [] ? proc_reg_write+0x80/0x80 > > [] proc_reg_read+0x3d/0x80 > Do you have any details about load? > > I haven't seen it either - but I don't have a stress test that does > #while true;cat /proc/sysvipc/*>/dev/null;done > in parallel with create/remove/whatever operations. > > Davidlohr: Have you done any stress tests for the /proc interface? > I don't. I tried reproducing the issue by a trivial multithreaded program were each CPU does parallel reads to the different /proc/sysvipc/ files, but no luck triggering any lockups. Dave, are you able to reproduce this on demand? Any chance of bisecting this? Thanks, Davidlohr > -- > Manfred -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/