Received: by 2002:ac0:a679:0:0:0:0:0 with SMTP id p54csp378383imp; Wed, 20 Feb 2019 01:48:33 -0800 (PST) X-Google-Smtp-Source: AHgI3IaJB3HFrKq0qJnJORMmakQ7nHIjaFMtvhfuEFOT7LVztlP4ZM2KuI4emikrmppD05ExnR1t X-Received: by 2002:a62:931a:: with SMTP id b26mr35238954pfe.65.1550656113357; Wed, 20 Feb 2019 01:48:33 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1550656113; cv=none; d=google.com; s=arc-20160816; b=zBe1Q6N8LPrUTG5vycVkUJ0gbw2cHji2mZPkqttCNegkXiesfzXQ9V2CdBaIRye4ym rtbd0vlxugUlOiNDqTFFbdF5Etk6MWWvdXqhJ6gYrVi4xlASMWf4Y2SPyMoNPpsFna+W dB3n3ALxmmXobp6jkYZ7VRHqxo2qaiJjL0qAZ5HMgrWqClDRERcZ7zz7jpsZIGjhSTvY cP2HcCuDqNHxvKlz7wq1OhO6STWDCBdRuK6dkpEuRoIIJBDTDf2VKrKbMIHWB/te5CJF RUYxM80CQTN61yb3cgNdmgpNdrzWlAbMr93COVnIY1PYjofj2H4OW+B6Y7MJsJGg2AAN WxVQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:references :in-reply-to:subject:cc:to:from; bh=B/0wvgxFkM0sQPM7rxcZg0IMvrnksXx+uRb6kbcz7zY=; b=053bz02n+9pwDTEfpOHn04hr00j7cGcftHKNALaRjDk5judQQ0VJ6rZYw3vY7cJkh1 gKFvJpsZPFXra9v6RLbkilawAu+K2+mDD3PQNFRO1b1Ebh5Z3ASmImkwg6IWhjmzh91K xH+2qsEj0kLLqU8jHTjZYiauyxlWxV3D6/oWJEBeaeD7f1ZvNiVoBMny3NOEkI+MdflE Vo0WvaAhb0t36/XhCla0+uOpynaWv62tqdgTzwKq5PYCrnIECfOyu0wxyLStw72Z0npa sI4CZB/jko7Mw0Yf4aXLlmKvYqYlAeb4kSN3Porw6Qg27edkaNhbKrRe/q/ymMWgdleO oksg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w15si20034538plk.357.2019.02.20.01.48.17; Wed, 20 Feb 2019 01:48:33 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727042AbfBTJre (ORCPT + 99 others); Wed, 20 Feb 2019 04:47:34 -0500 Received: from ozlabs.org ([203.11.71.1]:55371 "EHLO ozlabs.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725816AbfBTJre (ORCPT ); Wed, 20 Feb 2019 04:47:34 -0500 Received: from authenticated.ozlabs.org (localhost [127.0.0.1]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ozlabs.org (Postfix) with ESMTPSA id 444CSc24jVz9s70; Wed, 20 Feb 2019 20:47:25 +1100 (AEDT) From: Michael Ellerman To: Andrea Parri , Petr Mladek Cc: linuxppc-dev@ozlabs.org, akpm@linux-foundation.org, tj@kernel.org, linux-kernel@vger.kernel.org, linux-arch@vger.kernel.org, dyoung@redhat.com, sergey.senozhatsky@gmail.com, Steven Rostedt Subject: Re: [PATCH v3 1/7] dump_stack: Support adding to the dump stack arch description In-Reply-To: <20190219233925.GA5648@andrea> References: <20190207124635.3885-1-mpe@ellerman.id.au> <20190211125035.GA1562@andrea> <20190211143859.dd2lkccxod3f2fwn@pathway.suse.cz> <20190219233925.GA5648@andrea> Date: Wed, 20 Feb 2019 20:47:25 +1100 Message-ID: <87va1e7pw2.fsf@concordia.ellerman.id.au> MIME-Version: 1.0 Content-Type: text/plain Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Andrea Parri writes: > On Mon, Feb 11, 2019 at 03:38:59PM +0100, Petr Mladek wrote: >> On Mon 2019-02-11 13:50:35, Andrea Parri wrote: >> > On Thu, Feb 07, 2019 at 11:46:29PM +1100, Michael Ellerman wrote: >> > > Arch code can set a "dump stack arch description string" which is >> > > displayed with oops output to describe the hardware platform. >> > > >> > > It is useful to initialise this as early as possible, so that an early >> > > oops will have the hardware description. >> > > >> > > However in practice we discover the hardware platform in stages, so it >> > > would be useful to be able to incrementally fill in the hardware >> > > description as we discover it. >> > > >> > > This patch adds that ability, by creating dump_stack_add_arch_desc(). >> > > >> > > If there is no existing string it behaves exactly like >> > > dump_stack_set_arch_desc(). However if there is an existing string it >> > > appends to it, with a leading space. >> > > >> > > This makes it easy to call it multiple times from different parts of the >> > > code and get a reasonable looking result. >> > > >> > > Signed-off-by: Michael Ellerman >> > > --- >> > > include/linux/printk.h | 5 ++++ >> > > lib/dump_stack.c | 58 ++++++++++++++++++++++++++++++++++++++++++ >> > > 2 files changed, 63 insertions(+) >> > > >> > > v3: No change, just widened Cc list. >> > > >> > > v2: Add a smp_wmb() and comment. >> > > >> > > v1 is here for reference https://lore.kernel.org/lkml/1430824337-15339-1-git-send-email-mpe@ellerman.id.au/ >> > > >> > > I'll take this series via the powerpc tree if no one minds? >> > > >> > > >> > > diff --git a/include/linux/printk.h b/include/linux/printk.h >> > > index 77740a506ebb..d5fb4f960271 100644 >> > > --- a/include/linux/printk.h >> > > +++ b/include/linux/printk.h >> > > @@ -198,6 +198,7 @@ u32 log_buf_len_get(void); >> > > void log_buf_vmcoreinfo_setup(void); >> > > void __init setup_log_buf(int early); >> > > __printf(1, 2) void dump_stack_set_arch_desc(const char *fmt, ...); >> > > +__printf(1, 2) void dump_stack_add_arch_desc(const char *fmt, ...); >> > > void dump_stack_print_info(const char *log_lvl); >> > > void show_regs_print_info(const char *log_lvl); >> > > extern asmlinkage void dump_stack(void) __cold; >> > > @@ -256,6 +257,10 @@ static inline __printf(1, 2) void dump_stack_set_arch_desc(const char *fmt, ...) >> > > { >> > > } >> > > >> > > +static inline __printf(1, 2) void dump_stack_add_arch_desc(const char *fmt, ...) >> > > +{ >> > > +} >> > > + >> > > static inline void dump_stack_print_info(const char *log_lvl) >> > > { >> > > } >> > > diff --git a/lib/dump_stack.c b/lib/dump_stack.c >> > > index 5cff72f18c4a..69b710ff92b5 100644 >> > > --- a/lib/dump_stack.c >> > > +++ b/lib/dump_stack.c >> > > @@ -35,6 +35,64 @@ void __init dump_stack_set_arch_desc(const char *fmt, ...) >> > > va_end(args); >> > > } >> > > >> > > +/** >> > > + * dump_stack_add_arch_desc - add arch-specific info to show with task dumps >> > > + * @fmt: printf-style format string >> > > + * @...: arguments for the format string >> > > + * >> > > + * See dump_stack_set_arch_desc() for why you'd want to use this. >> > > + * >> > > + * This version adds to any existing string already created with either >> > > + * dump_stack_set_arch_desc() or dump_stack_add_arch_desc(). If there is an >> > > + * existing string a space will be prepended to the passed string. >> > > + */ >> > > +void __init dump_stack_add_arch_desc(const char *fmt, ...) >> > > +{ >> > > + va_list args; >> > > + int pos, len; >> > > + char *p; >> > > + >> > > + /* >> > > + * If there's an existing string we snprintf() past the end of it, and >> > > + * then turn the terminating NULL of the existing string into a space >> > > + * to create one string separated by a space. >> > > + * >> > > + * If there's no existing string we just snprintf() to the buffer, like >> > > + * dump_stack_set_arch_desc(), but without calling it because we'd need >> > > + * a varargs version. >> > > + */ >> > > + len = strnlen(dump_stack_arch_desc_str, sizeof(dump_stack_arch_desc_str)); >> > > + pos = len; >> > > + >> > > + if (len) >> > > + pos++; >> > > + >> > > + if (pos >= sizeof(dump_stack_arch_desc_str)) >> > > + return; /* Ran out of space */ >> > > + >> > > + p = &dump_stack_arch_desc_str[pos]; >> > > + >> > > + va_start(args, fmt); >> > > + vsnprintf(p, sizeof(dump_stack_arch_desc_str) - pos, fmt, args); >> > > + va_end(args); >> > > + >> > > + if (len) { >> > > + /* >> > > + * Order the stores above in vsnprintf() vs the store of the >> > > + * space below which joins the two strings. Note this doesn't >> > > + * make the code truly race free because there is no barrier on >> > > + * the read side. ie. Another CPU might load the uninitialised >> > > + * tail of the buffer first and then the space below (rather >> > > + * than the NULL that was there previously), and so print the >> > > + * uninitialised tail. But the whole string lives in BSS so in >> > > + * practice it should just see NULLs. >> > >> > The comment doesn't say _why_ we need to order these stores: IOW, what >> > will or can go wrong without this order? This isn't clear to me. >> > >> > Another good practice when adding smp_*-constructs (as discussed, e.g., >> > at KS'18) is to indicate the matching construct/synch. mechanism. >> >> Yes, one barrier without a counter-part is suspicious. > > As is this silence..., > > Michael, what happened to this patch? did you submit a new version? No, I'm just busy, it's the merge window next week :) I thought the comment was pretty clear, if the stores are observed out of order we might print the uninitialised tail. And the barrier on the read side would need to be in printk somewhere, which is obviously unpleasant. >> If the parallel access is really needed then we could define the >> current length as atomic_t and use: >> >> + atomic_cmpxchg() to reserve the space for the string >> + %*s to limit the printed length >> >> In the worst case, we would print an incomplete string. >> See below for a sample code. > > Seems worth exploring, IMO; but I'd like to first hear _clear about > the _intended semantics (before digging into alternatives)... It is not my intention to support concurrent updates of the string. The idea is you setup the string early in boot. The concern with a concurrent reader is simply that the string is dumped in the panic path, and you never really know when you're going to panic. Even if you only write to the string before doing SMP bringup you might still have another CPU go rogue and panic before then. But I probably should have just not added the barrier, it's over paranoid and will almost certainly never matter in practice. cheers