Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751403AbdFZGg2 (ORCPT ); Mon, 26 Jun 2017 02:36:28 -0400 Received: from mail1.bemta8.messagelabs.com ([216.82.243.194]:48676 "EHLO mail1.bemta8.messagelabs.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751092AbdFZGgT (ORCPT ); Mon, 26 Jun 2017 02:36:19 -0400 X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFprHKsWRWlGSWpSXmKPExsWS8eIhk27o6oB IgwX9JhZr3txisbi8aw6bxemJPcwW+zoeMFl8WNrLZHFz/VI2BzaPDx/jPFr23WL32LVtJ5PH u3Pn2D0+b5ILYI1izcxLyq9IYM1ounqGuaDrCGPFlk/3mRsYjxxk7GLk4hASeMIocePhGyYIZ yGjxP25F9m7GDk52AR0JKaf2MXWxcjBISKQJLH4vylIDbPAOUaJRasPs4A4wgJtjBIXTy1kA3 FEQJzfy78zQXToSZz7UQEyiEVAVWL33wtsIDavgI9E+/FGJhCbUUBWYtqj+2A2s4C4xNxps1h BbAkBAYkle84zQ9iiEi8f/2MFGSkhIC+xZZYgiMksoCmxfpc+RKeixJTuh+wQ0wUlTs58wjKB UWgWkqGzEDpmIemYhaRjASPLKkaN4tSistQiXSMjvaSizPSMktzEzBxdQwMLvdzU4uLE9NScx KRiveT83E2MwPipZ2Bg3MG4pTnqEKMkB5OSKG+jv3+kEF9SfkplRmJxRnxRaU5q8SFGGQ4OJQ le11UBkUKCRanpqRVpmTnASIZJS3DwKInw2qwASvMWFyTmFmemQ6ROMVpybFi9/gsTx52+DUC yYwKQFGLJy89LlRLn/bQSqEEApCGjNA9uHCzZXGKUlRLmZWRgYBDiKUgtys0sQZV/xSjOwagk zHsN5CqezLwSuK2vgA5iAjqIZR7YQSWJCCmpBkYmq4Wvf/28n6BuJMerl5e+P/Afd4FVceL+t kk3gxOCnpk7NP0OSkkyMi1bdPlaW1nVtgO5U2N8dNnyE1ZMXK/kFyjGZWvw88RC8XMLXx62Cq hv0XZ+02J5M8WynGW+uMOtb59d1Sp+Gdn85Y5uFxBp51m3mMP61ja/i5bLZ3dbfXq/QFN3qRJ LcUaioRZzUXEiAHnhGXUxAwAA X-Env-Sender: liufeng24@lenovo.com X-Msg-Ref: server-6.tower-218.messagelabs.com!1498458965!20002974!1 X-Originating-IP: [104.232.225.2] X-StarScan-Received: X-StarScan-Version: 9.4.19; banners=-,-,- X-VirusChecked: Checked From: Feng Feng24 Liu To: Sebastian Andrzej Siewior , "Mike Galbraith" CC: "linux-kernel@vger.kernel.org" , "linux-rt-users@vger.kernel.org" , "rostedt@goodmis.org" , "tmac@hp.com" Subject: BUG: unable to handle kernel NULL pointer dereference at 0000000000000038 !//RE: kernel BUG at kernel/locking/rtmutex.c:1027 Thread-Topic: unable to handle kernel NULL pointer dereference at 0000000000000038 !//RE: kernel BUG at kernel/locking/rtmutex.c:1027 Thread-Index: AdLuQ56eUD35kh7GScSpVqAQXpco4Q== Date: Mon, 26 Jun 2017 06:33:29 +0000 Message-ID: <2B18E8E1DDAE074A82D1060396451DAE263CBBD6@CNMAILEX04.lenovo.com> Accept-Language: zh-CN, en-US Content-Language: zh-CN X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.96.19.89] Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from base64 to 8bit by mail.home.local id v5Q6aWQd018552 Content-Length: 18382 Lines: 197 Hi, dear RT experts Thanks a lot! I update our kernel to 4.4.70-rt83 as your suggestion. The incorrect deadlock detection problem has been fixed in this version. But I found there is another BUG in 4.4.70-rt83, which can cause the system hang-up The BUG is: "BUG: unable to handle kernel NULL pointer dereference at 0000000000000038" Following is the kernel log ------------------------------------------------------------------------------------------------------------------------------- <4>Jun 23 21:54:53 node-1 kernel: [ 1377.160768] handler405 (21385) used greatest stack depth: 11336 bytes left <4>Jun 23 21:54:53 node-1 kernel: [ 1377.161073] handler403 (21383) used greatest stack depth: 11000 bytes left <1>Jun 24 10:01:19 node-1 kernel: [44959.446196] BUG: unable to handle kernel NULL pointer dereference at 0000000000000038 <1>Jun 24 10:01:19 node-1 kernel: [44959.446203] IP: [] __try_to_take_rt_mutex+0x34/0x160 <4>Jun 24 10:01:19 node-1 kernel: [44959.446205] PGD 1ea8056067 PUD 1e71e4a067 PMD 0 <4>Jun 24 10:01:19 node-1 kernel: [44959.446206] Oops: 0000 [#1] PREEMPT SMP <4>Jun 24 10:01:19 node-1 kernel: [44959.446230] Modules linked in: xt_nat xt_REDIRECT nf_nat_redirect xt_mark ip6table_raw ip6table_mangle ip6table_filter ip6_tables xt_CHECKSUM xt_connmar k iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat veth 8021q garp mrp xt_tcpudp xt_conntrack iptable_raw xt_CT xt_comment iptable_filter xt_multiport ope nvswitch intel_rapl iosf_mbi intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 glue_helper lrw ablk_helper cryptd in put_leds led_class sb_edac edac_core lpc_ich mfd_core mei_me ioatdma mei dca shpchp ipmi_devintf ipmi_si ipmi_msghandler mxm_wmi wmi acpi_pad acpi_power_meter tpm_tis nf_conntrack_ipv6 nf_d efrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 ip_tables x_tables raid1 megaraid_sas <4>Jun 24 10:01:19 node-1 kernel: [44959.446233] CPU: 17 PID: 1738811 Comm: ip Not tainted 4.4.70-thinkcloud-nfv #1 <4>Jun 24 10:01:19 node-1 kernel: [44959.446234] Hardware name: LENOVO System x3650 M5: -[8871AC1]-/01GR174, BIOS -[TCE124M-2.10]- 06/23/2016 <4>Jun 24 10:01:19 node-1 kernel: [44959.446235] task: ffff881cda2c27c0 ti: ffff881ea0538000 task.ti: ffff881ea0538000 <4>Jun 24 10:01:19 node-1 kernel: [44959.446236] RIP: 0010:[] [] __try_to_take_rt_mutex+0x34/0x160 <4>Jun 24 10:01:19 node-1 kernel: [44959.446237] RSP: 0018:ffff881ea053bb50 EFLAGS: 00010082 <4>Jun 24 10:01:19 node-1 kernel: [44959.446238] RAX: 0000000000000000 RBX: ffff881f805416a8 RCX: 0000000000000000 <4>Jun 24 10:01:19 node-1 kernel: [44959.446238] RDX: ffff881ea053bb98 RSI: ffff881cda2c27c0 RDI: ffff881f805416a8 <4>Jun 24 10:01:19 node-1 kernel: [44959.446239] RBP: ffff881ea053bb60 R08: 0000000000000001 R09: 0000000000000002 <4>Jun 24 10:01:19 node-1 kernel: [44959.446239] R10: 0000000000000a01 R11: 0000000000000001 R12: ffff881cda2c27c0 <4>Jun 24 10:01:19 node-1 kernel: [44959.446240] R13: ffff881cda2c27c0 R14: 0000000000000202 R15: ffff881f6b0c27c0 <4>Jun 24 10:01:19 node-1 kernel: [44959.446240] FS: 00007f28be315740(0000) GS:ffff88205f8c0000(0000) knlGS:0000000000000000 <4>Jun 24 10:01:19 node-1 kernel: [44959.446241] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>Jun 24 10:01:19 node-1 kernel: [44959.446241] CR2: 0000000000000038 CR3: 0000001e9e479000 CR4: 00000000003406e0 <4>Jun 24 10:01:19 node-1 kernel: [44959.446242] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 <4>Jun 24 10:01:19 node-1 kernel: [44959.446242] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 <4>Jun 24 10:01:19 node-1 kernel: [44959.446243] Stack: <4>Jun 24 10:01:19 node-1 kernel: [44959.446244] ffff881f805416a8 ffff881ea053bb98 ffff881ea053bc28 ffffffff81a8f03d <4>Jun 24 10:01:19 node-1 kernel: [44959.446245] ffff881ea053c000 01ff881ea053bb90 ffff881cda2c27c0 ffff881f6b0c27c1 <4>Jun 24 10:01:19 node-1 kernel: [44959.446246] ffff881cda2c2eb0 0000000000000001 0000000000000000 0000000000000000 <4>Jun 24 10:01:19 node-1 kernel: [44959.446246] Call Trace: <4>Jun 24 10:01:19 node-1 kernel: [44959.446252] [] rt_spin_lock_slowlock+0x13d/0x390 <4>Jun 24 10:01:19 node-1 kernel: [44959.446255] [] rt_spin_lock+0x1f/0x30 <4>Jun 24 10:01:19 node-1 kernel: [44959.446260] [] lockref_get_not_dead+0xf/0x50 <4>Jun 24 10:01:19 node-1 kernel: [44959.446263] [] ns_get_path+0x61/0x1d0 <4>Jun 24 10:01:19 node-1 kernel: [44959.446268] [] proc_ns_follow_link+0x89/0xa0 <4>Jun 24 10:01:19 node-1 kernel: [44959.446273] [] ? touch_atime+0x23/0xa0 <4>Jun 24 10:01:19 node-1 kernel: [44959.446277] [] trailing_symlink+0x208/0x270 <4>Jun 24 10:01:19 node-1 kernel: [44959.446279] [] path_openat+0x2b7/0x12b0 <4>Jun 24 10:01:19 node-1 kernel: [44959.446286] [] ? mem_cgroup_end_page_stat+0x25/0x50 <4>Jun 24 10:01:19 node-1 kernel: [44959.446287] [] do_filp_open+0x7e/0xd0 <4>Jun 24 10:01:19 node-1 kernel: [44959.446288] [] ? rt_spin_unlock+0x13/0x20 <4>Jun 24 10:01:19 node-1 kernel: [44959.446290] [] ? __alloc_fd+0xc5/0x180 <4>Jun 24 10:01:19 node-1 kernel: [44959.446292] [] do_sys_open+0x128/0x210 <4>Jun 24 10:01:19 node-1 kernel: [44959.446296] [] ? __context_tracking_enter+0x8a/0x160 <4>Jun 24 10:01:19 node-1 kernel: [44959.446297] [] SyS_open+0x1e/0x20 <4>Jun 24 10:01:19 node-1 kernel: [44959.446298] [] entry_SYSCALL_64_fastpath+0x12/0x71 <4>Jun 24 10:01:19 node-1 kernel: [44959.446308] Code: 89 e5 41 54 53 48 83 4f 18 01 48 89 fb 4c 8b 47 18 49 f7 c0 fe ff ff ff 74 05 5b 41 5c 5d c3 48 85 d2 49 89 f4 74 1d 48 8b 4f 10 <48> 3b 79 38 0f 85 0c 01 00 00 48 39 ca 75 e0 48 89 d6 e8 75 fd <1>Jun 24 10:01:19 node-1 kernel: [44959.446309] RIP [] __try_to_take_rt_mutex+0x34/0x160 <4>Jun 24 10:01:19 node-1 kernel: [44959.446310] RSP <4>Jun 24 10:01:19 node-1 kernel: [44959.446310] CR2: 0000000000000038 <4>Jun 24 10:01:19 node-1 kernel: [44963.688055] ---[ end trace 0000000000000002 ]--- <3>Jun 24 10:12:32 node-1 kernel: [45615.758301] INFO: rcu_preempt detected stalls on CPUs/tasks: <3>Jun 24 10:12:32 node-1 kernel: [45615.758308] 4-...: (1 GPs behind) idle=c77/140000000000000/0 softirq=0/0 fqs=200736 <3>Jun 24 10:12:32 node-1 kernel: [45615.758311] (detected by 27, t=651052 jiffies, g=9051323, c=9051322, q=514190) <4>Jun 24 10:12:32 node-1 kernel: [45615.758320] ffff881e9b82bce8 ffff881e9b82bd08 ffffffff810856e3 ffff881e9b078a38 <4>Jun 24 10:12:32 node-1 kernel: [45615.758321] ffff881e9b82bd20 ffffffff810856e3 ffff881f805416a8 ffff881e9b82bd40 <4>Jun 24 10:12:32 node-1 kernel: [45615.758322] ffffffff81a900bb ffff881f805416a8 ffff881e9b82bd78 ffff881e9b82be08 <4>Jun 24 10:12:32 node-1 kernel: [45615.758322] Call Trace: <4>Jun 24 10:12:32 node-1 kernel: [45615.758332] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:12:32 node-1 kernel: [45615.758333] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:12:32 node-1 kernel: [45615.758338] [] ? _raw_spin_lock_irqsave+0x4b/0x50 <4>Jun 24 10:12:32 node-1 kernel: [45615.758339] [] ? rt_spin_lock_slowlock+0x5f/0x390 <4>Jun 24 10:12:32 node-1 kernel: [45615.758341] [] ? rt_spin_lock+0x1f/0x30 <4>Jun 24 10:12:32 node-1 kernel: [45615.758347] [] ? dput+0xce/0x270 <4>Jun 24 10:12:32 node-1 kernel: [45615.758349] [] ? __fput+0x16a/0x1e0 <4>Jun 24 10:12:32 node-1 kernel: [45615.758350] [] ? ____fput+0xe/0x10 <4>Jun 24 10:12:32 node-1 kernel: [45615.758352] [] ? task_work_run+0x86/0xb0 <4>Jun 24 10:12:32 node-1 kernel: [45615.758355] [] ? exit_to_usermode_loop+0xa2/0xd7 <4>Jun 24 10:12:32 node-1 kernel: [45615.758358] [] ? syscall_return_slowpath+0x8a/0xb0 <4>Jun 24 10:12:32 node-1 kernel: [45615.758359] [] ? int_ret_from_sys_call+0x25/0x8f <5>Jun 24 10:12:32 node-1 kernel: [45633.758592] megaraid_sas 0000:10:00.0: [ 0]waiting for 2 commands to complete for scsi0 <3>Jun 24 10:12:32 node-1 kernel: [45593.916014] INFO: rcu_sched detected stalls on CPUs/tasks: <3>Jun 24 10:12:32 node-1 kernel: [45593.916018] 4-...: (1 GPs behind) idle=c77/140000000000000/0 softirq=0/0 fqs=181186 <3>Jun 24 10:12:32 node-1 kernel: [45593.916021] (detected by 21, t=588047 jiffies, g=10293, c=10292, q=1) <4>Jun 24 10:12:32 node-1 kernel: [45593.916030] ffff881e9b82bce8 ffff881e9b82bd08 ffffffff810856e3 ffff881e9b078a38 <4>Jun 24 10:12:32 node-1 kernel: [45593.916031] ffff881e9b82bd20 ffffffff810856e3 ffff881f805416a8 ffff881e9b82bd40 <4>Jun 24 10:12:32 node-1 kernel: [45593.916032] ffffffff81a900bb ffff881f805416a8 ffff881e9b82bd78 ffff881e9b82be08 <4>Jun 24 10:12:32 node-1 kernel: [45593.916033] Call Trace: <4>Jun 24 10:12:32 node-1 kernel: [45593.916043] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:12:32 node-1 kernel: [45593.916044] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:12:32 node-1 kernel: [45593.916048] [] ? _raw_spin_lock_irqsave+0x4b/0x50 <4>Jun 24 10:12:32 node-1 kernel: [45593.916049] [] ? rt_spin_lock_slowlock+0x5f/0x390 <4>Jun 24 10:12:32 node-1 kernel: [45593.916051] [] ? rt_spin_lock+0x1f/0x30 <4>Jun 24 10:12:32 node-1 kernel: [45593.916054] [] ? dput+0xce/0x270 <4>Jun 24 10:12:32 node-1 kernel: [45593.916056] [] ? __fput+0x16a/0x1e0 <4>Jun 24 10:12:32 node-1 kernel: [45593.916057] [] ? ____fput+0xe/0x10 <4>Jun 24 10:12:32 node-1 kernel: [45593.916059] [] ? task_work_run+0x86/0xb0 <4>Jun 24 10:12:32 node-1 kernel: [45593.916063] [] ? exit_to_usermode_loop+0xa2/0xd7 <4>Jun 24 10:12:32 node-1 kernel: [45593.916066] [] ? syscall_return_slowpath+0x8a/0xb0 <4>Jun 24 10:12:32 node-1 kernel: [45593.916067] [] ? int_ret_from_sys_call+0x25/0x8f <4>Jun 24 10:16:58 node-1 kernel: [45678.764118] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:16:58 node-1 kernel: [45678.764119] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:16:58 node-1 kernel: [45678.764123] [] ? _raw_spin_lock_irqsave+0x4b/0x50 <4>Jun 24 10:16:58 node-1 kernel: [45678.764124] [] ? rt_spin_lock_slowlock+0x5f/0x390 <4>Jun 24 10:16:58 node-1 kernel: [45678.764125] [] ? rt_spin_lock+0x1f/0x30 <4>Jun 24 10:16:58 node-1 kernel: [45678.764128] [] ? dput+0xce/0x270 <4>Jun 24 10:16:58 node-1 kernel: [45678.764130] [] ? __fput+0x16a/0x1e0 <4>Jun 24 10:16:58 node-1 kernel: [45678.764131] [] ? ____fput+0xe/0x10 <4>Jun 24 10:16:58 node-1 kernel: [45678.764133] [] ? task_work_run+0x86/0xb0 <4>Jun 24 10:16:58 node-1 kernel: [45678.764137] [] ? exit_to_usermode_loop+0xa2/0xd7 <4>Jun 24 10:16:58 node-1 kernel: [45678.764139] [] ? syscall_return_slowpath+0x8a/0xb0 <4>Jun 24 10:16:58 node-1 kernel: [45678.764140] [] ? int_ret_from_sys_call+0x25/0x8f <3>Jun 24 10:16:58 node-1 kernel: [45719.927666] INFO: rcu_sched detected stalls on CPUs/tasks: <3>Jun 24 10:16:58 node-1 kernel: [45719.927676] 4-...: (1 GPs behind) idle=c77/140000000000000/0 softirq=0/0 fqs=219912 <3>Jun 24 10:16:58 node-1 kernel: [45719.927678] (detected by 7, t=714057 jiffies, g=10293, c=10292, q=1) <4>Jun 24 10:16:58 node-1 kernel: [45719.927687] ffff881e9b82bce8 ffff881e9b82bd08 ffffffff810856e3 ffff881e9b078a38 <4>Jun 24 10:16:58 node-1 kernel: [45719.927688] ffff881e9b82bd20 ffffffff810856e3 ffff881f805416a8 ffff881e9b82bd40 <4>Jun 24 10:16:58 node-1 kernel: [45719.927689] ffffffff81a900bb ffff881f805416a8 ffff881e9b82bd78 ffff881e9b82be08 <4>Jun 24 10:16:58 node-1 kernel: [45719.927690] Call Trace: <4>Jun 24 10:16:58 node-1 kernel: [45719.927698] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:16:58 node-1 kernel: [45719.927699] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:16:58 node-1 kernel: [45719.927703] [] ? _raw_spin_lock_irqsave+0x4b/0x50 <4>Jun 24 10:16:58 node-1 kernel: [45719.927704] [] ? rt_spin_lock_slowlock+0x5f/0x390 <4>Jun 24 10:16:58 node-1 kernel: [45719.927706] [] ? rt_spin_lock+0x1f/0x30 <4>Jun 24 10:16:58 node-1 kernel: [45719.927708] [] ? dput+0xce/0x270 <4>Jun 24 10:16:58 node-1 kernel: [45719.927710] [] ? __fput+0x16a/0x1e0 <4>Jun 24 10:16:58 node-1 kernel: [45719.927711] [] ? ____fput+0xe/0x10 <4>Jun 24 10:16:58 node-1 kernel: [45719.927713] [] ? task_work_run+0x86/0xb0 <4>Jun 24 10:16:58 node-1 kernel: [45719.927716] [] ? exit_to_usermode_loop+0xa2/0xd7 <4>Jun 24 10:16:58 node-1 kernel: [45719.927719] [] ? syscall_return_slowpath+0x8a/0xb0 <4>Jun 24 10:16:58 node-1 kernel: [45719.927720] [] ? int_ret_from_sys_call+0x25/0x8f <3>Jun 24 10:16:58 node-1 kernel: [45741.769952] INFO: rcu_preempt detected stalls on CPUs/tasks: <3>Jun 24 10:16:58 node-1 kernel: [45741.769956] 4-...: (1 GPs behind) idle=c77/140000000000000/0 softirq=0/0 fqs=239687 <3>Jun 24 10:16:58 node-1 kernel: [45741.769958] (detected by 7, t=777062 jiffies, g=9051323, c=9051322, q=603969) <4>Jun 24 10:16:58 node-1 kernel: [45741.769967] ffff881e9b82bce8 ffff881e9b82bd08 ffffffff810856e3 ffff881e9b078a38 <4>Jun 24 10:16:58 node-1 kernel: [45741.769968] ffff881e9b82bd20 ffffffff810856e3 ffff881f805416a8 ffff881e9b82bd40 <4>Jun 24 10:16:58 node-1 kernel: [45741.769969] ffffffff81a900bb ffff881f805416a8 ffff881e9b82bd78 ffff881e9b82be08 <4>Jun 24 10:16:58 node-1 kernel: [45741.769970] Call Trace: <4>Jun 24 10:16:58 node-1 kernel: [45741.769978] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:16:58 node-1 kernel: [45741.769979] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:16:58 node-1 kernel: [45741.769984] [] ? _raw_spin_lock_irqsave+0x4b/0x50 <4>Jun 24 10:16:58 node-1 kernel: [45741.769985] [] ? rt_spin_lock_slowlock+0x5f/0x390 <4>Jun 24 10:16:58 node-1 kernel: [45741.769987] [] ? rt_spin_lock+0x1f/0x30 <4>Jun 24 10:16:58 node-1 kernel: [45741.769992] [] ? dput+0xce/0x270 <4>Jun 24 10:16:58 node-1 kernel: [45741.769994] [] ? __fput+0x16a/0x1e0 <4>Jun 24 10:16:58 node-1 kernel: [45741.769995] [] ? ____fput+0xe/0x10 <4>Jun 24 10:16:58 node-1 kernel: [45741.769997] [] ? task_work_run+0x86/0xb0 <4>Jun 24 10:16:58 node-1 kernel: [45741.770001] [] ? exit_to_usermode_loop+0xa2/0xd7 <4>Jun 24 10:16:58 node-1 kernel: [45741.770004] [] ? syscall_return_slowpath+0x8a/0xb0 <4>Jun 24 10:16:58 node-1 kernel: [45741.770006] [] ? int_ret_from_sys_call+0x25/0x8f <3>Jun 24 10:16:58 node-1 kernel: [45782.933496] INFO: rcu_sched detected stalls on CPUs/tasks: <3>Jun 24 10:16:58 node-1 kernel: [45782.933499] 4-...: (1 GPs behind) idle=c77/140000000000000/0 softirq=0/0 fqs=239710 <3>Jun 24 10:16:58 node-1 kernel: [45782.933502] (detected by 15, t=777062 jiffies, g=10293, c=10292, q=1) <4>Jun 24 10:16:58 node-1 kernel: [45782.933511] ffff881e9b82bce8 ffff881e9b82bd08 ffffffff810856e3 ffff881e9b078a38 <4>Jun 24 10:16:58 node-1 kernel: [45782.933512] ffff881e9b82bd20 ffffffff810856e3 ffff881f805416a8 ffff881e9b82bd40 <4>Jun 24 10:16:58 node-1 kernel: [45782.933513] ffffffff81a900bb ffff881f805416a8 ffff881e9b82bd78 ffff881e9b82be08 <4>Jun 24 10:16:58 node-1 kernel: [45782.933514] Call Trace: <4>Jun 24 10:16:58 node-1 kernel: [45782.933523] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:16:58 node-1 kernel: [45782.933524] [] ? preempt_count_add+0xa3/0xc0 <4>Jun 24 10:16:58 node-1 kernel: [45782.933528] [] ? _raw_spin_lock_irqsave+0x4b/0x50 <4>Jun 24 10:16:58 node-1 kernel: [45782.933529] [] ? rt_spin_lock_slowlock+0x5f/0x390 <4>Jun 24 10:16:58 node-1 kernel: [45782.933531] [] ? rt_spin_lock+0x1f/0x30 <4>Jun 24 10:16:58 node-1 kernel: [45782.933535] [] ? dput+0xce/0x270 <4>Jun 24 10:16:58 node-1 kernel: [45782.933536] [] ? __fput+0x16a/0x1e0 <4>Jun 24 10:16:58 node-1 kernel: [45782.933537] [] ? ____fput+0xe/0x10 <4>Jun 24 10:16:58 node-1 kernel: [45782.933539] [] ? task_work_run+0x86/0xb0 <4>Jun 24 10:16:58 node-1 kernel: [45782.933543] [] ? exit_to_usermode_loop+0xa2/0xd7 <4>Jun 24 10:16:58 node-1 kernel: [45782.933546] [] ? syscall_return_slowpath+0x8a/0xb0 <4>Jun 24 10:16:58 node-1 kernel: [45782.933547] [] ? int_ret_from_sys_call+0x25/0x8f <3>Jun 24 10:16:58 node-1 kernel: [45804.775777] INFO: rcu_preempt detected stalls on CPUs/tasks: ------------------------------------------------------------------------------------------------------------------------------- Thanks Feng >-----Original Message----- >From: Sebastian Andrzej Siewior [mailto:sebastian.siewior@linutronix.de] >Sent: Thursday, June 08, 2017 4:03 PM >To: Mike Galbraith >Cc: Feng Feng24 Liu; linux-kernel@vger.kernel.org; linux-rt-users@vger.kernel.org; >rostedt@goodmis.org; tmac@hp.com; Tong Tong3 Li >Subject: Re: kernel BUG at kernel/locking/rtmutex.c:1027 > >On 2017-06-08 09:31:39 [+0200], Mike Galbraith wrote: >> On Thu, 2017-06-08 at 07:01 +0000, Feng Feng24 Liu wrote: >> > >> > Our kernel version is: kernel4.4.6-rt14 >> > >> >> Latest 4.4-rt is 4.4.70-rt83... > >Exactly. Please test it with the latest v4.4 RT tree. > https://www.kernel.org/pub/linux/kernel/projects/rt/4.4/ > >Sebastian