Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp1628998pxb; Thu, 4 Mar 2021 16:55:27 -0800 (PST) X-Google-Smtp-Source: ABdhPJzvfSTWPVwZrDI1pfqNmfzPzzA/OmF6oA5dnJZWwb6fUlg5LsYbPik+c27WEvrfY/LzhIh8 X-Received: by 2002:a92:db08:: with SMTP id b8mr6631942iln.172.1614905727249; Thu, 04 Mar 2021 16:55:27 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1614905727; cv=none; d=google.com; s=arc-20160816; b=OSvA5/93N5A1bxCalORRAw/WBg5KbprPiytt4nHbJdz7K/gPSWFmlSu3OVVClDsN4V EpLjLIwqHczS2SYRHpM/gjhFlhzj1tungcDF3I6YuBtkoxvhbcj7APGE6auvD5/RSnoW OLpZW+BqyXeluRvsg3AxM8V8m5D1ln2uWhSV34AzQjzz79Bu1AUu8yp7gbDouTcX3SbW B8vX0rnLsFvnvn5bnwBfyhaAkbgcMrhTfAh3ZNlWH5mQA4VTx+s8viR6pdVsrCbw2LUz GOugxdq0fBweQw94BTBHHz0XbwUGu8OA1XUkcXlsZ7A8MVo4Qc7FBOyu2obL/JhNBvsb kqgw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=0aoBG9bt7K2Nwml5Y4UALv6dR/n8jqT8XlWBeYmu5Iw=; b=Bus3sZgvcsvoXuvaGaOwaxsWVhkIGd6y5pEqj4YXvaBnt+Ui140DRAZMBoDlD/VkzD XKhqk6PFxlQcA+LjmPK+U30mpKAp/00RxNqFvZBmDFzh+I2UOUljwkROEnzlE+TO8xjf hzCIjsw4pcQaVwIEcs+WAkQML3vruf4bb4GI3Z8+IvSepCo4AoNZZdXG/s01vR85SEKf n/oNntE5HYJs7Y4KtBR3KilJgaXa1ZXtyLChOBZbBryzMVuYez62+gQFKvooXzpdmL75 WcG2L44xsT/PXvKOeR69s8Vpnrsuym4ZoL8R2ebI266HVeWld4VWic4A/QzctN6qe2Qz CiMA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=NVSvNizI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id y1si819871iow.43.2021.03.04.16.55.14; Thu, 04 Mar 2021 16:55:27 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=NVSvNizI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232549AbhCDTCn (ORCPT + 99 others); Thu, 4 Mar 2021 14:02:43 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:36780 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233959AbhCDTCX (ORCPT ); Thu, 4 Mar 2021 14:02:23 -0500 Received: from mail-ot1-x333.google.com (mail-ot1-x333.google.com [IPv6:2607:f8b0:4864:20::333]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B94A4C061756 for ; Thu, 4 Mar 2021 11:01:43 -0800 (PST) Received: by mail-ot1-x333.google.com with SMTP id g8so24667191otk.4 for ; Thu, 04 Mar 2021 11:01:43 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=0aoBG9bt7K2Nwml5Y4UALv6dR/n8jqT8XlWBeYmu5Iw=; b=NVSvNizI4KZN1UKsLkx5qjZ7v0EVfLOog8osH31hbuPP18B77Ed/U1DkJ+wnX8LNIc 97OleKF0shu4p7MuRnpkbjzD1oxfat5ge/CSKtqW2mE8Awgcd7VEs3CAGVs4G78siGJT VxNaFgsVdSOgBMq6eoBPf5c+taZPck9ZcssIyBepMk7JVIgtw9ZVsaNLM4rwF3/E9/jO 7KnOda3cN/yi9ANa5UMNvvDMJZ+tdOhmzWvhltmFxYatTuNZbktwwnvs7LYwUQ2eUNSz Chta7Uzah3jv3eflKadZzSdrUkCwcyRduia8ZdQmyWjr7NeHBkxUZzNxTSZvxJc+2Wvz 4Wcw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=0aoBG9bt7K2Nwml5Y4UALv6dR/n8jqT8XlWBeYmu5Iw=; b=VUDN+SMYYv09D5dm1Exh1XzT1TMLODEYWLS2OiZfKEg4MGeLxRK3cKx0kcyMBhn1xk tnhgP/NTqTe4nK4AJ+73T6FKecvuc3/vMZXSwbs2jo/feyk3bO0oRRiJoAtlGbX2DkEQ Z+F+KbooxUoNMrzJ3gIDGcSQCzucsBy6egCQ+pyBwYVuGySvgPiOG+Q7u2sFPS9wzALs EMMc5SZjoXdItaSGfIC8zn1ukZTdLUu0AvBqzvS/0r+m8Ini88/NyRtJ6BV4Xpia5yZP 4RLEqulLxWVPVEMJd4HSwNfFPck0H91lYvb6o6as12Ka5CpbpL506NP5qWNH2T/8Sn3u T4yQ== X-Gm-Message-State: AOAM531g/gk1nNnqdqxcTh1bD52uYEm4AJUoPWlSytWaGSZHJVt9MctS 5XNnC+/ozUUjRQhri4JhmPnEME83y3U1VIiFiFyeIA== X-Received: by 2002:a9d:7f11:: with SMTP id j17mr4694384otq.251.1614884502710; Thu, 04 Mar 2021 11:01:42 -0800 (PST) MIME-Version: 1.0 References: <1802be3e-dc1a-52e0-1754-a40f0ea39658@csgroup.eu> <20210304145730.GC54534@C02TD0UTHF1T.local> <20210304165923.GA60457@C02TD0UTHF1T.local> <20210304180154.GD60457@C02TD0UTHF1T.local> <20210304185148.GE60457@C02TD0UTHF1T.local> In-Reply-To: <20210304185148.GE60457@C02TD0UTHF1T.local> From: Marco Elver Date: Thu, 4 Mar 2021 20:01:29 +0100 Message-ID: Subject: Re: [PATCH v1] powerpc: Include running function as first entry in save_stack_trace() and friends To: Mark Rutland Cc: Christophe Leroy , Benjamin Herrenschmidt , Paul Mackerras , Michael Ellerman , LKML , linuxppc-dev@lists.ozlabs.org, kasan-dev , Catalin Marinas , Will Deacon , Linux ARM , broonie@kernel.org, linux-toolchains@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 4 Mar 2021 at 19:51, Mark Rutland wrote: > On Thu, Mar 04, 2021 at 07:22:53PM +0100, Marco Elver wrote: > > On Thu, 4 Mar 2021 at 19:02, Mark Rutland wrote: > > > On Thu, Mar 04, 2021 at 06:25:33PM +0100, Marco Elver wrote: > > > > On Thu, Mar 04, 2021 at 04:59PM +0000, Mark Rutland wrote: > > > > > On Thu, Mar 04, 2021 at 04:30:34PM +0100, Marco Elver wrote: > > > > > > On Thu, 4 Mar 2021 at 15:57, Mark Rutland wrote: > > > > > > > [adding Mark Brown] > > > > > > > > > > > > > > The bigger problem here is that skipping is dodgy to begin with, and > > > > > > > this is still liable to break in some cases. One big concern is that > > > > > > > (especially with LTO) we cannot guarantee the compiler will not inline > > > > > > > or outline functions, causing the skipp value to be too large or too > > > > > > > small. That's liable to happen to callers, and in theory (though > > > > > > > unlikely in practice), portions of arch_stack_walk() or > > > > > > > stack_trace_save() could get outlined too. > > > > > > > > > > > > > > Unless we can get some strong guarantees from compiler folk such that we > > > > > > > can guarantee a specific function acts boundary for unwinding (and > > > > > > > doesn't itself get split, etc), the only reliable way I can think to > > > > > > > solve this requires an assembly trampoline. Whatever we do is liable to > > > > > > > need some invasive rework. > > > > > > > > > > > > Will LTO and friends respect 'noinline'? > > > > > > > > > > I hope so (and suspect we'd have more problems otherwise), but I don't > > > > > know whether they actually so. > > > > > > > > > > I suspect even with 'noinline' the compiler is permitted to outline > > > > > portions of a function if it wanted to (and IIUC it could still make > > > > > specialized copies in the absence of 'noclone'). > > > > > > > > > > > One thing I also noticed is that tail calls would also cause the stack > > > > > > trace to appear somewhat incomplete (for some of my tests I've > > > > > > disabled tail call optimizations). > > > > > > > > > > I assume you mean for a chain A->B->C where B tail-calls C, you get a > > > > > trace A->C? ... or is A going missing too? > > > > > > > > Correct, it's just the A->C outcome. > > > > > > I'd assumed that those cases were benign, e.g. for livepatching what > > > matters is what can be returned to, so B disappearing from the trace > > > isn't a problem there. > > > > > > Is the concern debugability, or is there a functional issue you have in > > > mind? > > > > For me, it's just been debuggability, and reliable test cases. > > > > > > > > Is there a way to also mark a function non-tail-callable? > > > > > > > > > > I think this can be bodged using __attribute__((optimize("$OPTIONS"))) > > > > > on a caller to inhibit TCO (though IIRC GCC doesn't reliably support > > > > > function-local optimization options), but I don't expect there's any way > > > > > to mark a callee as not being tail-callable. > > > > > > > > I don't think this is reliable. It'd be > > > > __attribute__((optimize("-fno-optimize-sibling-calls"))), but doesn't > > > > work if applied to the function we do not want to tail-call-optimize, > > > > but would have to be applied to the function that does the tail-calling. > > > > > > Yup; that's what I meant then I said you could do that on the caller but > > > not the callee. > > > > > > I don't follow why you'd want to put this on the callee, though, so I > > > think I'm missing something. Considering a set of functions in different > > > compilation units: > > > > > > A->B->C->D->E->F->G->H->I->J->K > > > > I was having this problem with KCSAN, where the compiler would > > tail-call-optimize __tsan_X instrumentation. > > Those are compiler-generated calls, right? When those are generated the > compilation unit (and whatever it has included) might not have provided > a prototype anyway, and the compiler has special knowledge of the > functions, so it feels like the compiler would need to inhibit TCO here > for this to be robust. For their intended usage subjecting them to TCO > doesn't seem to make sense AFAICT. > > I suspect that compilers have some way of handling that; otherwise I'd > expect to have heard stories of mcount/fentry calls getting TCO'd and > causing problems. So maybe there's an easy fix there? I agree, the compiler builtins should be handled by the compiler directly, perhaps that was a bad example. But we also have "explicit instrumentation", e.g. everything that's in . > > This would mean that KCSAN runtime functions ended up in the trace, > > but the function where the access happened would not. However, I don't > > care about the runtime functions, and instead want to see the function > > where the access happened. In that case, I'd like to just mark > > __tsan_X and any other kcsan instrumentation functions as > > do-not-tail-call-optimize, which would solve the problem. > > I understand why we don't want to TCO these calls, but given the calls > are implicitly generated, I strongly suspect it's better to fix the > implicit call generation to not be TCO'd to begin with. > > > The solution today is that when you compile a kernel with KCSAN, every > > instrumented TU is compiled with -fno-optimize-sibling-calls. The > > better solution would be to just mark KCSAN runtime functions somehow, > > but permit tail calling other things. Although, I probably still want > > to see the full trace, and would decide that having > > -fno-optimize-sibling-calls is a small price to pay in a > > debug-only-kernel to get complete traces. > > > > > ... if K were marked in this way, and J was compiled with visibility of > > > this, J would stick around, but J's callers might not, and so the a > > > trace might see: > > > > > > A->J->K > > > > > > ... do you just care about the final caller, i.e. you just need > > > certainty that J will be in the trace? > > > > Yes. But maybe it's a special problem that only sanitizers have. > > I reckon for basically any instrumentation we don't want calls to be > TCO'd, though I'm not immediately sure of cases beyond sanitizers and > mcount/fentry. Thinking about this more, I think it's all debugging tools. E.g. lockdep, if you lock/unlock at the end of a function, you might tail call into lockdep. If the compiler applies TCO, and lockdep determines there's a bug and then shows a trace, you'll have no idea where the actual bug is. The kernel has lots of debugging facilities that add instrumentation in this way. So perhaps it's a general debugging-tool problem (rather than just sanitizers). Thanks, -- Marco