From: "=?utf-8?B?546L6b6Z?=" <wanglong@laoqinren.net>
To: "=?utf-8?B?cm9zdGVkdA==?=" <rostedt@goodmis.org>,
        "=?utf-8?B?amtvc2luYQ==?=" <jkosina@suse.cz>,
        "=?utf-8?B?cGF1bG1jaw==?=" <paulmck@linux.vnet.ibm.com>,
        "=?utf-8?B?cG1sYWRlaw==?=" <pmladek@suse.cz>,
        "=?utf-8?B?ZHppY2t1cw==?=" <dzickus@redhat.com>
Cc: "=?utf-8?B?am9oYW5uZXM=?=" <johannes@sipsolutions.net>,
        "=?utf-8?B?a29jdDlp?=" <koct9i@gmail.com>,
        "=?utf-8?B?dGdseA==?=" <tglx@linutronix.de>,
        "=?utf-8?B?bWluZ28=?=" <mingo@redhat.com>,
        "=?utf-8?B?aHBh?=" <hpa@zytor.com>,
        "=?utf-8?B?eDg2?=" <x86@kernel.org>,
        "=?utf-8?B?YXRvbWxpbg==?=" <atomlin@redhat.com>,
        "=?utf-8?B?YWtwbQ==?=" <akpm@linux-foundation.org>,
        "=?utf-8?B?c2FzaGEubGV2aW4=?=" <sasha.levin@oracle.com>,
        "=?utf-8?B?bGludXgta2VybmVs?=" <linux-kernel@vger.kernel.org>,
        "=?utf-8?B?cGVpZmVpeXVl?=" <peifeiyue@huawei.com>,
        "=?utf-8?B?bG9uZy53YW5nbG9uZw==?=" <long.wanglong@huawei.com>,
        "=?utf-8?B?bW9yZ2FuLndhbmc=?=" <morgan.wang@huawei.com>
Subject: [RFC] how to perform a safe NMI stack trace on all CPUs on x86?
Mime-Version: 1.0
Content-Type: text/plain;
	charset="utf-8"
Date: Wed, 13 May 2015 22:14:54 +0800
Message-ID: <tencent_2DEC6ECC6194905D15D8E6D5@qq.com>
Sender: linux-kernel-owner@vger.kernel.org
Content-Transfer-Encoding: 8bit
Content-Length: 991
Lines: 18

Hi all,

In kernels before 3.19, when trigger_all_cpu_backtrace() is called on x86,
it triggers an NMI on each CPU and calls show_regs(). But this can lead
to a hard lockup if the NMI interrupts a printk() that is already in progress.

Commit a9edc88093287183ac934be44f295f183b2c62dd (x86/nmi: Perform a safe
NMI stack trace on all CPUs) fixes this problem in mainline. When the NMI
triggers, it switches the printk routine for that CPU to an NMI-safe printk
function that records the output in a per-CPU seq_buf descriptor. After all
NMIs have finished recording their data, the seq_bufs are printed from a safe
context. But how do we fix this problem in older kernels (e.g. 3.10 stable)?
The 3.10 stable tree has neither the "switch printk routine" nor the "seq_buf"
infrastructure.
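
For reference, the mainline idea described above can be sketched in plain
user-space C roughly as follows. This is only an illustration under my own
assumptions, not the kernel code: NR_CPUS, BUF_SIZE, struct cpu_buf,
nmi_safe_printf() and flush_nmi_bufs() are made-up names, and the fixed
array stands in for the per-CPU seq_buf.

```c
#include <stdarg.h>
#include <stdio.h>
#include <string.h>

#define NR_CPUS  4    /* hypothetical CPU count for this simulation */
#define BUF_SIZE 256  /* hypothetical per-CPU buffer size */

/* Stand-in for a per-CPU seq_buf: a fixed buffer plus its fill level. */
struct cpu_buf {
	char   data[BUF_SIZE];
	size_t len;
};

static struct cpu_buf nmi_bufs[NR_CPUS];

/*
 * What the "NMI" path would call instead of the normal printk:
 * it only appends to the calling CPU's own buffer and takes no locks.
 * Returns the number of characters stored.
 */
static int nmi_safe_printf(int cpu, const char *fmt, ...)
{
	struct cpu_buf *b = &nmi_bufs[cpu];
	size_t room = BUF_SIZE - b->len;
	va_list ap;
	int n;

	if (room <= 1)
		return 0;		/* buffer full: drop the message */

	va_start(ap, fmt);
	n = vsnprintf(b->data + b->len, room, fmt, ap);
	va_end(ap);

	if (n < 0)
		return 0;
	if ((size_t)n >= room)
		n = (int)(room - 1);	/* output was truncated to fit */
	b->len += (size_t)n;
	return n;
}

/*
 * Called later from a safe (non-NMI) context: write out every
 * non-empty buffer and reset it. Returns the number of bytes written.
 */
static size_t flush_nmi_bufs(FILE *out)
{
	size_t total = 0;
	int cpu;

	for (cpu = 0; cpu < NR_CPUS; cpu++) {
		struct cpu_buf *b = &nmi_bufs[cpu];

		if (b->len == 0)
			continue;
		total += fwrite(b->data, 1, b->len, out);
		b->len = 0;
	}
	return total;
}
```

The point of the split is that the recording step never touches shared
printk state, so it cannot deadlock against an interrupted printk(); only
the later flush goes through the normal output path.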

Could anyone give me some ideas?

Best Regards
Wang Long