Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S934179AbcLTJjY (ORCPT ); Tue, 20 Dec 2016 04:39:24 -0500 Received: from mx2.suse.de ([195.135.220.15]:38189 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933498AbcLTJjV (ORCPT ); Tue, 20 Dec 2016 04:39:21 -0500 Date: Tue, 20 Dec 2016 10:39:16 +0100 From: Petr Mladek To: Josh Poimboeuf Cc: Miroslav Benes , Jessica Yu , Jiri Kosina , linux-kernel@vger.kernel.org, live-patching@vger.kernel.org, Michael Ellerman , Heiko Carstens , x86@kernel.org, linuxppc-dev@lists.ozlabs.org, linux-s390@vger.kernel.org, Vojtech Pavlik , Jiri Slaby , Chris J Arges , Andy Lutomirski , Ingo Molnar , Peter Zijlstra Subject: Re: [PATCH v3 01/15] stacktrace/x86: add function for detecting reliable stack traces Message-ID: <20161220093916.GA14894@pathway.suse.cz> References: <0315b36c08c104d56a4b43537fb300d200418996.1481220077.git.jpoimboe@redhat.com> <20161219172549.mjm4c2midvkumqxb@treble> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20161219172549.mjm4c2midvkumqxb@treble> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2204 Lines: 52 On Mon 2016-12-19 11:25:49, Josh Poimboeuf wrote: > On Mon, Dec 19, 2016 at 05:25:19PM +0100, Miroslav Benes wrote: > > On Thu, 8 Dec 2016, Josh Poimboeuf wrote: > > > > > diff --git a/arch/x86/Kconfig b/arch/x86/Kconfig > > > index 215612c..b4a6663 100644 > > > --- a/arch/x86/Kconfig > > > +++ b/arch/x86/Kconfig > > > @@ -155,6 +155,7 @@ config X86 > > > select HAVE_PERF_REGS > > > select HAVE_PERF_USER_STACK_DUMP > > > select HAVE_REGS_AND_STACK_ACCESS_API > > > + select HAVE_RELIABLE_STACKTRACE if X86_64 && FRAME_POINTER && STACK_VALIDATION > > > > Tests to measure possible performance penalty of frame pointers were done > > by Mel Gorman. The outcome was quite clear. There IS a measurable > > impact. The percentage depends on the workflow but I think it is safe to > > say that FP usually takes 5-10 percents. > > > > If my understanding is correct there is no single culprit. Register > > pressure is definitely not a problem. We ran simple benchmarks while > > taking a register away from GCC (RBP or a common one). The impact is a > > combination of more cacheline pressure, more memory accesses and the fact > > that the kernel contains a lot of small functions. > > > > Thus, I think that DWARF should be the way to go here. > > > > Other than that the patch looks good to me. > > I agree that DWARF is generally a good idea, and I'm working toward it. > However there's still quite a bit of work to get there. > > For this consistency model to work with DWARF on x86, we would need: > > 1) a reliable x86 DWARF unwinder with Linus's blessing > 2) objtool DWARF support (I'm working on this at the moment) > 3) probably some kind of runtime NMI stack checking feature to > complement objtool, along with a lot of burn time to ensure there are > no issues, particularly in entry code Could you please provide more details about this NMI stack checking? What is it supposed to protect that objtool could not? Will it run regularly or will it be just a random check? The other points are obvious. But I do not know what to think about this NMI thing. And so I am curious :-) > 4) port save_stack_trace_tsk_reliable() to work with DWARF Best Regards, Petr