Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754860AbaJNIBA (ORCPT ); Tue, 14 Oct 2014 04:01:00 -0400 Received: from mx1.redhat.com ([209.132.183.28]:36442 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751160AbaJNIA6 (ORCPT ); Tue, 14 Oct 2014 04:00:58 -0400 Date: Tue, 14 Oct 2014 16:00:50 +0800 From: Dave Young To: Alexander Viro Cc: linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org Subject: stuck at mutex_lock_killable in interate_dir Message-ID: <20141014080050.GA9165@dhcp-16-198.nay.redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, [sent an same email on the calprit machine, but I did not receive it, thus resend it, sorry for duplicate if you got the original one] I got a 'D' state mutt process, it's killable though. I think kernel stuck at below code: linux/fs/readdir.c: iterate_dir() -> mutex_lock_killable(&inode->i_mutex); Any idea about this? known issue? BTW, I can keep the laptop on for testing about 1 day. I'm not sure I can reproduce it after rebooting. [20522.102611] ------------[ cut here ]------------ [20522.102636] WARNING: CPU: 3 PID: 4201 at kernel/watchdog.c:267 watchdog_overflow_callback+0xf1/0x109() [20522.102641] Watchdog detected hard LOCKUP on cpu 3 [20522.102645] Modules linked in: macvtap macvlan kvm_intel ext3 jbd pl2303 usbserial iwldvm i915 snd_hda_codec_hdmi mac80211 e1000e snd_hda_codec_conexant snd_hda_codec_generic iwlwifi btusb bluetooth sdhci_pci snd_hda_intel sdhci mmc_core cfg80211 i2c_algo_bit snd_hda_controller ptp snd_hda_codec snd_hwdep snd_pcm drm_kms_helper drm efivars snd_timer lpc_ich pps_core wmi thinkpad_acpi rfkill video loop [20522.102744] CPU: 3 PID: 4201 Comm: mutt Not tainted 3.17.0-rc6+ #24 [20522.102750] Hardware name: LENOVO 4236NUC/4236NUC, BIOS 83ET76WW (1.46 ) 07/05/2013 [20522.102755] 0000000000000000 ffff88021e2c6c28 ffffffff8149af7b ffff88021e2c6c70 [20522.102767] ffff88021e2c6c60 ffffffff810638c9 ffffffff810d8719 0000000000000003 [20522.102777] 0000000000000000 ffff88021e2c6d98 ffff88021e2c6ef8 ffff88021e2c6cc0 [20522.102788] Call Trace: [20522.102793] [] dump_stack+0x4e/0x7a [20522.102817] [] warn_slowpath_common+0x75/0x8e [20522.102826] [] ? watchdog_overflow_callback+0xf1/0x109 [20522.102836] [] warn_slowpath_fmt+0x47/0x49 [20522.102847] [] ? perf_event_task_disable+0x7f/0x7f [20522.102856] [] watchdog_overflow_callback+0xf1/0x109 [20522.102865] [] __perf_event_overflow+0x131/0x22a [20522.102873] [] ? perf_event_task_disable+0x7f/0x7f [20522.102885] [] ? x86_perf_event_set_period+0x101/0x111 [20522.102894] [] perf_event_overflow+0x14/0x16 [20522.102903] [] intel_pmu_handle_irq+0x33f/0x3c8 [20522.102914] [] perf_event_nmi_handler+0x25/0x3e [20522.102925] [] nmi_handle.isra.2+0x7a/0xdb [20522.102933] [] ? local_touch_nmi+0x1f/0x1f [20522.102946] [] ? validate_chain.isra.27+0x4/0x9ae [20522.102954] [] do_nmi+0xd8/0x34c [20522.102964] [] end_repeat_nmi+0x1e/0x2e [20522.102975] [] ? __this_cpu_preempt_check+0x13/0x16 [20522.102985] [] ? validate_chain.isra.27+0x4/0x9ae [20522.102995] [] ? validate_chain.isra.27+0x4/0x9ae [20522.103004] [] ? validate_chain.isra.27+0x4/0x9ae [20522.103008] <> [] __lock_acquire+0x6ed/0xb24 [20522.103031] [] ? sha_transform+0x246/0x1068 [20522.103041] [] lock_acquire+0xa4/0xc4 [20522.103051] [] ? lock_hrtimer_base.isra.26+0x20/0x43 [20522.103062] [] _raw_spin_lock_irqsave+0x4b/0x5c [20522.103070] [] ? lock_hrtimer_base.isra.26+0x20/0x43 [20522.103079] [] lock_hrtimer_base.isra.26+0x20/0x43 [20522.103087] [] hrtimer_try_to_cancel+0x1d/0x8d [20522.103095] [] hrtimer_cancel+0x10/0x1b [20522.103106] [] tick_nohz_restart+0x12/0x7d [20522.103116] [] __tick_nohz_full_check+0xa1/0xa6 [20522.103126] [] nohz_full_kick_work_func+0x9/0xb [20522.103135] [] irq_work_run_list+0x4a/0x68 [20522.103145] [] ? tick_sched_handle+0x47/0x47 [20522.103153] [] irq_work_run+0x2a/0x4a [20522.103162] [] update_process_times+0x4c/0x60 [20522.103171] [] tick_sched_handle+0x42/0x47 [20522.103180] [] tick_sched_timer+0x35/0x53 [20522.103188] [] __run_hrtimer.isra.28+0x57/0xb0 [20522.103197] [] hrtimer_interrupt+0xeb/0x1d2 [20522.103211] [] ? ext4_follow_link+0x1c/0x1c [20522.103220] [] local_apic_timer_interrupt+0x4e/0x51 [20522.103228] [] smp_apic_timer_interrupt+0x3a/0x4b [20522.103236] [] apic_timer_interrupt+0x72/0x80 [20522.103240] [] ? str2hashbuf_signed+0x2f/0x80 [20522.103257] [] ext4fs_dirhash+0x122/0x218 [20522.103266] [] ? __kmalloc+0x82/0xfe [20522.103276] [] htree_dirblock_to_tree+0xd2/0x135 [20522.103285] [] ext4_htree_fill_tree+0x173/0x21d [20522.103294] [] ? debug_smp_processor_id+0x17/0x19 [20522.103306] [] ext4_readdir+0x27e/0x73f [20522.103316] [] ? mutex_lock_killable_nested+0x33d/0x59a [20522.103325] [] ? iterate_dir+0x38/0xff [20522.103332] [] ? iterate_dir+0x38/0xff [20522.103341] [] iterate_dir+0x70/0xff [20522.103349] [] SyS_getdents+0x7a/0xda [20522.103356] [] ? fillonedir+0xcf/0xcf [20522.103364] [] tracesys+0xe1/0xe6 [20522.103371] ---[ end trace 949cb4fa0993f833 ]--- [20553.806845] INFO: rcu_preempt detected stalls on CPUs/tasks: { 3} (detected by 0, t=18002 jiffies, g=181109, c=181108, q=0) [20553.806872] Task dump for CPU 3: [20553.806879] mutt R running task 12768 4201 1825 0x00080008 [20553.806897] ffff8802133ae000 0000000000000006 0000000000000007 ffffea000849f540 [20553.806909] 0000000000000046 ffff8802133ae000 ffffffff81498748 ffff8802150019c0 [20553.806921] ffff880206707ba8 ffffffff8123dab1 ffff880206707bc8 ffffffff81091ce9 [20553.806933] Call Trace: [20553.806953] [] ? __slab_alloc.isra.63.constprop.70+0x3c7/0x3e5 [20553.806964] [] ? __this_cpu_preempt_check+0x13/0x16 [20553.806977] [] ? trace_hardirqs_on_caller+0x1b7/0x1d7 [20553.806986] [] ? trace_hardirqs_on+0xd/0xf [20553.806999] [] ? ext4_htree_store_dirent+0x2d/0xeb [20553.807011] [] ? __getblk+0x1f/0x2a6 [20553.807024] [] ? str2hashbuf_signed+0x2f/0x80 [20553.807032] [] ? ext4fs_dirhash+0x122/0x218 [20553.807041] [] ? __kmalloc+0x82/0xfe [20553.807051] [] ? htree_dirblock_to_tree+0xd2/0x135 [20553.807061] [] ? ext4_htree_fill_tree+0x173/0x21d [20553.807069] [] ? debug_smp_processor_id+0x17/0x19 [20553.807080] [] ? ext4_readdir+0x27e/0x73f [20553.807092] [] ? mutex_lock_killable_nested+0x33d/0x59a [20553.807101] [] ? iterate_dir+0x38/0xff [20553.807108] [] ? iterate_dir+0x38/0xff [20553.807116] [] ? iterate_dir+0x70/0xff [20553.807125] [] ? SyS_getdents+0x7a/0xda [20553.807132] [] ? fillonedir+0xcf/0xcf [20553.807141] [] ? tracesys+0xe1/0xe6 Thanks Dave -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/