Date: Thu, 31 Mar 2022 10:19:21 +0100
From: Mark Rutland
To: andrey.konovalov@linux.dev
Cc: Marco Elver, Alexander Potapenko, Catalin Marinas, Will Deacon, Andrew Morton, Andrey Konovalov, Dmitry Vyukov, Andrey Ryabinin, kasan-dev@googlegroups.com, Vincenzo Frascino, Sami Tolvanen, Peter Collingbourne, Evgenii Stepanov, Florian Mayer, linux-mm@kvack.org, linux-kernel@vger.kernel.org, Andrey Konovalov
Subject: Re: [PATCH v2 1/4] stacktrace: add interface based on shadow call stack
Message-ID:
References:
<21e3e20ea58e242e3c82c19abbfe65b579e0e4b8.1648049113.git.andreyknvl@google.com>
In-Reply-To: <21e3e20ea58e242e3c82c19abbfe65b579e0e4b8.1648049113.git.andreyknvl@google.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Precedence: bulk
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Mar 23, 2022 at 04:32:52PM +0100, andrey.konovalov@linux.dev wrote:
> From: Andrey Konovalov
>
> Add a new interface stack_trace_save_shadow() for collecting stack traces
> by copying frames from the Shadow Call Stack.
>
> Collecting stack traces this way is significantly faster: boot time
> of a defconfig build with KASAN enabled gets decreased by ~30%.

Hmm... just to check, do you know if that's just because of the linear copy,
or because we're skipping other work we have to do in the regular stacktrace?

> The few patches following this one add an implementation of
> stack_trace_save_shadow() for arm64.
>
> The implementation of the added interface is not meant to use
> stack_trace_consume_fn to avoid making a function call for each
> collected frame to further improve performance.

... because we could easily provide an inline-optimized stack copy *without*
having to write a distinct unwinder, and I'd *really* like to avoid having a
bunch of distinct unwinders for arm64, as it really hinders maintenance. We're
working on fixing/improving the arm64 unwinder for things like
RELIABLE_STACKTRACE, and I know that some of that work is non-trivial to make
work with an SCS-based unwind rather than an FP-based unwind, and/or will
undermine the saving anyway.
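As a rough illustration of the "inline-optimized stack copy" idea, here is a minimal user-space sketch, not kernel code. It assumes a single walker that takes a consume callback in the style of stack_trace_consume_fn, with both the walker and the callback marked always_inline so that an optimizing compiler can fold the callback into the loop instead of making an indirect call per frame. All `demo_*` names are hypothetical, and the "stack" is just an array of return addresses standing in for whatever the real unwinder would walk.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical callback type, modelled on the shape of stack_trace_consume_fn. */
typedef bool (*demo_consume_fn)(void *cookie, unsigned long pc);

/* Hypothetical cookie: destination buffer plus current fill level. */
struct demo_trace {
	unsigned long *store;
	unsigned int size;
	unsigned int len;
};

/* Store one entry; return false once the buffer is full to stop the walk. */
static inline __attribute__((always_inline))
bool demo_store_entry(void *cookie, unsigned long pc)
{
	struct demo_trace *t = cookie;

	if (t->len >= t->size)
		return false;
	t->store[t->len++] = pc;
	return t->len < t->size;
}

/*
 * Stand-in for the one shared unwinder. Because it is always_inline and the
 * callback is a compile-time constant at the call site below, an optimizing
 * compiler can turn the per-frame call into straight-line code in the loop.
 */
static inline __attribute__((always_inline))
void demo_walk(const unsigned long *frames, size_t nr,
	       demo_consume_fn consume, void *cookie)
{
	for (size_t i = 0; i < nr; i++) {
		if (!consume(cookie, frames[i]))
			break;
	}
}

/* Out-of-line entry point: callers pay for one call, not one per frame. */
unsigned int demo_trace_save(const unsigned long *frames, size_t nr,
			     unsigned long *store, unsigned int size)
{
	struct demo_trace t = { .store = store, .size = size, .len = 0 };

	demo_walk(frames, nr, demo_store_entry, &t);
	return t.len;
}
```

Whether this recovers the reported saving depends on how much of the cost is the per-frame call rather than the unwind itself, which is the question asked above.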
>
> Signed-off-by: Andrey Konovalov
> ---
>  arch/Kconfig               |  6 ++++++
>  include/linux/stacktrace.h | 15 +++++++++++++++
>  kernel/stacktrace.c        | 21 +++++++++++++++++++++
>  3 files changed, 42 insertions(+)
>
> diff --git a/arch/Kconfig b/arch/Kconfig
> index e12a4268c01d..207c1679c53a 100644
> --- a/arch/Kconfig
> +++ b/arch/Kconfig
> @@ -1041,6 +1041,12 @@ config HAVE_RELIABLE_STACKTRACE
>  	  arch_stack_walk_reliable() function which only returns a stack trace
>  	  if it can guarantee the trace is reliable.
>
> +config HAVE_SHADOW_STACKTRACE
> +	bool
> +	help
> +	  If this is set, the architecture provides the arch_stack_walk_shadow()
> +	  function, which collects the stack trace from the shadow call stack.
> +
>  config HAVE_ARCH_HASH
>  	bool
>  	default n
> diff --git a/include/linux/stacktrace.h b/include/linux/stacktrace.h
> index 97455880ac41..b74d1e42e157 100644
> --- a/include/linux/stacktrace.h
> +++ b/include/linux/stacktrace.h
> @@ -60,6 +60,9 @@ int arch_stack_walk_reliable(stack_trace_consume_fn consume_entry, void *cookie,
>
>  void arch_stack_walk_user(stack_trace_consume_fn consume_entry, void *cookie,
>  			  const struct pt_regs *regs);
> +
> +int arch_stack_walk_shadow(unsigned long *store, unsigned int size,
> +			   unsigned int skipnr);
>  #endif /* CONFIG_ARCH_STACKWALK */
>
>  #ifdef CONFIG_STACKTRACE
> @@ -108,4 +111,16 @@ static inline int stack_trace_save_tsk_reliable(struct task_struct *tsk,
>  }
>  #endif
>
> +#if defined(CONFIG_STACKTRACE) && defined(CONFIG_HAVE_SHADOW_STACKTRACE)
> +int stack_trace_save_shadow(unsigned long *store, unsigned int size,
> +			    unsigned int skipnr);
> +#else
> +static inline int stack_trace_save_shadow(unsigned long *store,
> +					  unsigned int size,
> +					  unsigned int skipnr)
> +{
> +	return -ENOSYS;
> +}
> +#endif
> +
>  #endif /* __LINUX_STACKTRACE_H */
> diff --git a/kernel/stacktrace.c b/kernel/stacktrace.c
> index 9ed5ce989415..fe305861fd55 100644
> --- a/kernel/stacktrace.c
> +++ b/kernel/stacktrace.c
> @@ -237,6 +237,27 @@ unsigned int stack_trace_save_user(unsigned long *store, unsigned int size)
>  }
>  #endif
>
> +#ifdef CONFIG_HAVE_SHADOW_STACKTRACE
> +/**
> + * stack_trace_save_shadow - Save a stack trace based on shadow call stack
> + * @store: Pointer to the storage array
> + * @size: Size of the storage array
> + * @skipnr: Number of entries to skip at the start of the stack trace
> + *
> + * Return: Number of trace entries stored.
> + */
> +int stack_trace_save_shadow(unsigned long *store, unsigned int size,
> +			    unsigned int skipnr)
> +{
> +	/*
> +	 * Do not use stack_trace_consume_fn to avoid making a function
> +	 * call for each collected frame to improve performance.
> +	 * Skip + 1 frame to skip stack_trace_save_shadow.
> +	 */
> +	return arch_stack_walk_shadow(store, size, skipnr + 1);
> +}
> +#endif

If we really need this, can we make it an __always_inline in a header so that
we can avoid the skip? Generally the skipping is problematic due to
inlining/outlining and LTO, and I'd like to avoid adding more of it
unnecessarily.

Thanks,
Mark.

> +
>  #else /* CONFIG_ARCH_STACKWALK */
>
>  /*
> --
> 2.25.1
>
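
For reference, the __always_inline suggestion could look roughly like the user-space sketch below. It is only a model: `demo_scs`, `demo_last_skipnr`, `demo_arch_walk_shadow()` and `demo_trace_save_shadow()` are hypothetical stand-ins (the real arch_stack_walk_shadow() would read the shadow call stack), and the GCC/Clang attribute is spelled out in place of the kernel's __always_inline macro. The point is that an always-inlined wrapper adds no call frame of its own, so skipnr can be passed through without the "+ 1".

```c
#include <stddef.h>

/* Hypothetical stand-in for the shadow call stack contents, newest first. */
static const unsigned long demo_scs[] = { 0x1000, 0x2000, 0x3000 };

/* Records the skipnr the walker last received, so a caller can inspect it. */
static unsigned int demo_last_skipnr;

/* Hypothetical stand-in for arch_stack_walk_shadow(): a plain linear copy. */
static int demo_arch_walk_shadow(unsigned long *store, unsigned int size,
				 unsigned int skipnr)
{
	const unsigned int nr = sizeof(demo_scs) / sizeof(demo_scs[0]);
	unsigned int len = 0;

	demo_last_skipnr = skipnr;
	for (unsigned int i = skipnr; i < nr && len < size; i++)
		store[len++] = demo_scs[i];
	return (int)len;
}

/*
 * Header-style wrapper. Being always_inline, it does not introduce a frame
 * for itself, so skipnr is forwarded unchanged rather than as skipnr + 1.
 */
static inline __attribute__((always_inline))
int demo_trace_save_shadow(unsigned long *store, unsigned int size,
			   unsigned int skipnr)
{
	return demo_arch_walk_shadow(store, size, skipnr);
}
```

With the wrapper inlined, the number of entries to skip no longer depends on whether the compiler or LTO decides to inline or outline the wrapper, which is the fragility raised above.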