Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp2475756pxa; Mon, 17 Aug 2020 10:29:47 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzuzjpm9X5YEOIuHcprPBMh7BkiKSGilJK60jhIhiEoAG/PEK6eMRlOJgzBsFmH4Dn1XYm6 X-Received: by 2002:a05:6402:1c07:: with SMTP id ck7mr16271239edb.84.1597685387632; Mon, 17 Aug 2020 10:29:47 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1597685387; cv=none; d=google.com; s=arc-20160816; b=hOGE/0PIQpCy/N48KGK1liBvBz5ib6I5y1Ry0D32cDISEPOcYqAxnnyPBAjIOYzhpJ oDBaJMX/egdbsaZBe3eyIWdCHJNrD/yrco75PQ305wIJ1omr8X+TtZb5kqCSalgBWe2E qMc00D6ZQRLcLu8ye7yhROKGFvYbsEP/mAa1bgo4p4qJIBGyVmnLN0SFnmp+P0V1OCKg szjsc29WKSsxtDzF5Eo2SV1UICvmJMsCpepkjAUPLPfJv6ecowPjipYf2v7X/9S2d9tQ 8nVIX5w1JbA3FmRu0/g9ZU06D7X99r16cOa6oZDtzuK3bvPY9z7+WIiKUQvpLhnH3Xxx ngCQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=YX0oVXg///eNfopG9sTf6T12UxVkGpMS/IwW3JYhA54=; b=A6/oFWuw52UI/icnHoxrtWw0Bv7VaM6m4+ZCvdhEXIvYr/vliPd2aN6QeHB9+GLH5d Xm4RuvDBasXc4r0SjTtNfzWOItdicBP3D9hVRnlL7MTHSSDEeKsF/4V0gY6KzKNBtBBm 9zu0XSXabfRHRF5CDQ5v7T7POywwVR1BQ0P8OgyMlyHVOZ+fjMEkHvo3ned4Gr+bUkmv z92laX8oqC6VjnTJKcKtSjo0lJHJ1igQaI7RpKt7X7W5cFXGlWzx+iKBECKg4wYDwYD9 vZapWImLkxr9sT5EhRbHg7uDPMQUFRgesbVWVGKOMq3Dpd1TbBfYk1we7Aidb6GUVMNM tA5A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@chromium.org header.s=google header.b="N1d/ga9O"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=chromium.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id e10si11807868ejk.32.2020.08.17.10.29.24; Mon, 17 Aug 2020 10:29:47 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@chromium.org header.s=google header.b="N1d/ga9O"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=chromium.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729355AbgHQR1g (ORCPT + 99 others); Mon, 17 Aug 2020 13:27:36 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50856 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1731166AbgHQRTX (ORCPT ); Mon, 17 Aug 2020 13:19:23 -0400 Received: from mail-vs1-xe41.google.com (mail-vs1-xe41.google.com [IPv6:2607:f8b0:4864:20::e41]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5F023C061389 for ; Mon, 17 Aug 2020 10:19:23 -0700 (PDT) Received: by mail-vs1-xe41.google.com with SMTP id r7so8668396vsq.5 for ; Mon, 17 Aug 2020 10:19:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=chromium.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=YX0oVXg///eNfopG9sTf6T12UxVkGpMS/IwW3JYhA54=; b=N1d/ga9OT+0QanElTMxw400OpuXd9TpqWN6PJ/UdsHFwIEFvOY7IKKpk4nYCjMszPJ oEJ08DMSjR+H8MFxGxmpacTJaYF+mEz7YmRPiPM+FAKHnUwiapBm2pOIJXxM2xyLMeZh VsGx3MHCDg8neQey2owQc+5WwYzWE8IQo5A2o= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=YX0oVXg///eNfopG9sTf6T12UxVkGpMS/IwW3JYhA54=; b=Od7JD2s7GB2EpKdD4kzwCG4lRtalWwzdBtwi6xdBVPrj4yDiysEVVNZb6MxiYNS9c1 b07lkC74gMTLfwoQxFy1+9YvOMpD0SkDXORPoav3IM7jmjb9OYBcHo6rZst+IBuCuyZR pwTPK8yqjYXmBDyZzvkOrkqjaH23NZG+u0PDibEMTF+7rOzT+GAni7i8qgwRi8bjfnMf Zy+EBJ5E96x6Ip4Wc8RAmzd4Ap3qtfawW4VKaQKGiFrrtTBofgwoWIgIl+vDyIgDDAz9 qLdk7S46uT0PrideAwI873IRnoJFT57jwxaMt/64AceXacN2RWT2tgWwMTlFAels8VF9 SuQA== X-Gm-Message-State: AOAM533v7WlzVogHnrkcSPf5MqL1EsMQ0axuVaYlZ3ocHs/86tMJAlOf t7VW2PH2H/CXkIaqmBE50ShXrlKC7I7CZg== X-Received: by 2002:a05:6102:1ca:: with SMTP id s10mr7893495vsq.14.1597684761983; Mon, 17 Aug 2020 10:19:21 -0700 (PDT) Received: from mail-vs1-f49.google.com (mail-vs1-f49.google.com. [209.85.217.49]) by smtp.gmail.com with ESMTPSA id j195sm3582368vke.18.2020.08.17.10.19.21 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 17 Aug 2020 10:19:21 -0700 (PDT) Received: by mail-vs1-f49.google.com with SMTP id o184so8671953vsc.0 for ; Mon, 17 Aug 2020 10:19:21 -0700 (PDT) X-Received: by 2002:a67:f44f:: with SMTP id r15mr8546380vsn.42.1597684760494; Mon, 17 Aug 2020 10:19:20 -0700 (PDT) MIME-Version: 1.0 References: <1595333413-30052-1-git-send-email-sumit.garg@linaro.org> <1595333413-30052-2-git-send-email-sumit.garg@linaro.org> In-Reply-To: From: Doug Anderson Date: Mon, 17 Aug 2020 10:19:08 -0700 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [RFC 1/5] tty/sysrq: Make sysrq handler NMI aware To: Sumit Garg Cc: Peter Zijlstra , Greg Kroah-Hartman , Daniel Thompson , linux-serial@vger.kernel.org, kgdb-bugreport@lists.sourceforge.net, Jiri Slaby , Russell King - ARM Linux , Jason Wessel , LKML , Linux ARM Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, On Mon, Aug 17, 2020 at 7:08 AM Sumit Garg wrote: > > On Fri, 14 Aug 2020 at 20:27, Doug Anderson wrote: > > > > Hi, > > > > On Fri, Aug 14, 2020 at 12:24 AM Sumit Garg wrote: > > > > > > + Peter (author of irq_work.c) > > > > > > On Thu, 13 Aug 2020 at 05:30, Doug Anderson wrote: > > > > > > > > Hi, > > > > > > > > On Tue, Jul 21, 2020 at 5:10 AM Sumit Garg wrote: > > > > > > > > > > In a future patch we will add support to the serial core to make it > > > > > possible to trigger a magic sysrq from an NMI context. Prepare for this > > > > > by marking some sysrq actions as NMI safe. Safe actions will be allowed > > > > > to run from NMI context whilst that cannot run from an NMI will be queued > > > > > as irq_work for later processing. > > > > > > > > > > A particular sysrq handler is only marked as NMI safe in case the handler > > > > > isn't contending for any synchronization primitives as in NMI context > > > > > they are expected to cause deadlocks. Note that the debug sysrq do not > > > > > contend for any synchronization primitives. It does call kgdb_breakpoint() > > > > > to provoke a trap but that trap handler should be NMI safe on > > > > > architectures that implement an NMI. > > > > > > > > > > Signed-off-by: Sumit Garg > > > > > --- > > > > > drivers/tty/sysrq.c | 33 ++++++++++++++++++++++++++++++++- > > > > > include/linux/sysrq.h | 1 + > > > > > kernel/debug/debug_core.c | 1 + > > > > > 3 files changed, 34 insertions(+), 1 deletion(-) > > > > > > > > > > diff --git a/drivers/tty/sysrq.c b/drivers/tty/sysrq.c > > > > > index 7c95afa9..8017e33 100644 > > > > > --- a/drivers/tty/sysrq.c > > > > > +++ b/drivers/tty/sysrq.c > > > > > @@ -50,6 +50,8 @@ > > > > > #include > > > > > #include > > > > > #include > > > > > +#include > > > > > +#include > > > > > > > > > > #include > > > > > #include > > > > > @@ -111,6 +113,7 @@ static const struct sysrq_key_op sysrq_loglevel_op = { > > > > > .help_msg = "loglevel(0-9)", > > > > > .action_msg = "Changing Loglevel", > > > > > .enable_mask = SYSRQ_ENABLE_LOG, > > > > > + .nmi_safe = true, > > > > > }; > > > > > > > > > > #ifdef CONFIG_VT > > > > > @@ -157,6 +160,7 @@ static const struct sysrq_key_op sysrq_crash_op = { > > > > > .help_msg = "crash(c)", > > > > > .action_msg = "Trigger a crash", > > > > > .enable_mask = SYSRQ_ENABLE_DUMP, > > > > > + .nmi_safe = true, > > > > > }; > > > > > > > > > > static void sysrq_handle_reboot(int key) > > > > > @@ -170,6 +174,7 @@ static const struct sysrq_key_op sysrq_reboot_op = { > > > > > .help_msg = "reboot(b)", > > > > > .action_msg = "Resetting", > > > > > .enable_mask = SYSRQ_ENABLE_BOOT, > > > > > + .nmi_safe = true, > > > > > }; > > > > > > > > > > const struct sysrq_key_op *__sysrq_reboot_op = &sysrq_reboot_op; > > > > > @@ -217,6 +222,7 @@ static const struct sysrq_key_op sysrq_showlocks_op = { > > > > > .handler = sysrq_handle_showlocks, > > > > > .help_msg = "show-all-locks(d)", > > > > > .action_msg = "Show Locks Held", > > > > > + .nmi_safe = true, > > > > > }; > > > > > #else > > > > > #define sysrq_showlocks_op (*(const struct sysrq_key_op *)NULL) > > > > > @@ -289,6 +295,7 @@ static const struct sysrq_key_op sysrq_showregs_op = { > > > > > .help_msg = "show-registers(p)", > > > > > .action_msg = "Show Regs", > > > > > .enable_mask = SYSRQ_ENABLE_DUMP, > > > > > + .nmi_safe = true, > > > > > }; > > > > > > > > > > static void sysrq_handle_showstate(int key) > > > > > @@ -326,6 +333,7 @@ static const struct sysrq_key_op sysrq_ftrace_dump_op = { > > > > > .help_msg = "dump-ftrace-buffer(z)", > > > > > .action_msg = "Dump ftrace buffer", > > > > > .enable_mask = SYSRQ_ENABLE_DUMP, > > > > > + .nmi_safe = true, > > > > > }; > > > > > #else > > > > > #define sysrq_ftrace_dump_op (*(const struct sysrq_key_op *)NULL) > > > > > @@ -538,6 +546,23 @@ static void __sysrq_put_key_op(int key, const struct sysrq_key_op *op_p) > > > > > sysrq_key_table[i] = op_p; > > > > > } > > > > > > > > > > +#define SYSRQ_NMI_FIFO_SIZE 64 > > > > > +static DEFINE_KFIFO(sysrq_nmi_fifo, int, SYSRQ_NMI_FIFO_SIZE); > > > > > > > > A 64-entry FIFO seems excessive. Quite honestly even a FIFO seems a > > > > bit excessive and it feels like if two sysrqs were received in super > > > > quick succession that it would be OK to just process the first one. I > > > > guess if it simplifies the processing to have a FIFO then it shouldn't > > > > hurt, but no need for 64 entries. > > > > > > > > > > Okay, would a 2-entry FIFO work here? As here we need a FIFO to pass > > > on the key parameter. > > > > ...or even a 1-entry FIFO if that makes sense? > > > > Yes it would make sense but unfortunately not supported by kfifo > (size: power of 2). Typically 1 is considered to be a power of 2 since 2^0 = 1. ...ah, but it appears that size < 2 is not allowed. Oh well. > > > > > +static void sysrq_do_nmi_work(struct irq_work *work) > > > > > +{ > > > > > + const struct sysrq_key_op *op_p; > > > > > + int key; > > > > > + > > > > > + while (kfifo_out(&sysrq_nmi_fifo, &key, 1)) { > > > > > + op_p = __sysrq_get_key_op(key); > > > > > + if (op_p) > > > > > + op_p->handler(key); > > > > > + } > > > > > > > > Do you need to manage "suppress_printk" in this function? Do you need > > > > to call rcu_sysrq_start() and rcu_read_lock()? > > > > > > Ah I missed those. Will add them here instead. > > > > > > > > > > > If so, how do you prevent racing between the mucking we're doing with > > > > these things and the mucking that the NMI does with them? > > > > > > IIUC, here you meant to highlight the race while scheduled sysrq is > > > executing in IRQ context and we receive a new sysrq in NMI context, > > > correct? If yes, this seems to be a trickier situation. I think the > > > appropriate way to handle it would be to deny any further sysrq > > > handling until the prior sysrq handling is complete, your views? > > > > The problem is that in some cases you're running NMIs directly at FIQ > > time and other cases you're running them at IRQ time. So you > > definitely can't just move it to NMI. > > > > Skipping looking for other SYSRQs until the old one is complete sounds > > good to me. Again my ignorance will make me sound like a fool, > > probably, but can you use the kfifo as a form of mutual exclusion? If > > you have a 1-entry kfifo, maybe: > > > > 1. First try to add to the "FIFO". If it fails (out of space) then a > > sysrq is in progress. Ignore this one. > > 2. Decide if you're NMI-safe or not. > > 3. If NMI safe, modify "suppress_printk", call rcu functions, then > > call the handler. Restore suppress_printk and then dequeue from FIFO. > > 4. If not-NMI safe, the irq worker would "peek" into the FIFO, do its > > work (wrapped with "suppress_printk" and the like), and not dequeue > > until it's done. > > > > In the above you'd use the FIFO as a locking mechanism. I don't know > > if that's a valid use of it or if there is a better NMI-safe mechanism > > for this. I think the kfifo docs talk about only one reader and one > > writer and here we have two readers, so maybe it's illegal. It also > > seems weird to have a 1-entry "FIFO" and feels like there's probably a > > better data structure for this. > > Thanks for your suggestions. Have a look at below implementation, I > have used 2-entry fifo but only single entry used for locking > mechanism: > > @@ -538,6 +546,39 @@ static void __sysrq_put_key_op(int key, const > struct sysrq_key_op *op_p) > sysrq_key_table[i] = op_p; > } > > +#define SYSRQ_NMI_FIFO_SIZE 2 > +static DEFINE_KFIFO(sysrq_nmi_fifo, int, SYSRQ_NMI_FIFO_SIZE); > + > +static void sysrq_do_nmi_work(struct irq_work *work) > +{ > + const struct sysrq_key_op *op_p; > + int orig_suppress_printk; > + int key; > + > + orig_suppress_printk = suppress_printk; > + suppress_printk = 0; > + > + rcu_sysrq_start(); > + rcu_read_lock(); > + > + if (kfifo_peek(&sysrq_nmi_fifo, &key)) { > + op_p = __sysrq_get_key_op(key); > + if (op_p) > + op_p->handler(key); > + } > + > + rcu_read_unlock(); > + rcu_sysrq_end(); > + > + suppress_printk = orig_suppress_printk; > + > + /* Pop contents from fifo if any */ > + while (kfifo_get(&sysrq_nmi_fifo, &key)) > + ; I think you can use kfifo_reset_out(). > +} > + > +static DEFINE_IRQ_WORK(sysrq_nmi_work, sysrq_do_nmi_work); > + > void __handle_sysrq(int key, bool check_mask) > { > const struct sysrq_key_op *op_p; > +} > + > +static DEFINE_IRQ_WORK(sysrq_nmi_work, sysrq_do_nmi_work); > + > void __handle_sysrq(int key, bool check_mask) > { > const struct sysrq_key_op *op_p; > @@ -545,6 +586,10 @@ void __handle_sysrq(int key, bool check_mask) > int orig_suppress_printk; > int i; > > + /* Skip sysrq handling if one already in progress */ > + if (!kfifo_is_empty(&sysrq_nmi_fifo)) > + return; This _seems_ OK to me since I'd imagine kfifo_is_empty() is as safe for the writer to do as kfifo_is_full() is and kfifo_is_full() is part of kfifo_put(). I guess there's no better synchronism mechanism that we can use? > + > orig_suppress_printk = suppress_printk; > suppress_printk = 0; > > @@ -568,7 +613,13 @@ void __handle_sysrq(int key, bool check_mask) > if (!check_mask || sysrq_on_mask(op_p->enable_mask)) { > pr_info("%s\n", op_p->action_msg); > console_loglevel = orig_log_level; > - op_p->handler(key); > + > + if (in_nmi() && !op_p->nmi_safe) { > + kfifo_put(&sysrq_nmi_fifo, key); > + irq_work_queue(&sysrq_nmi_work); > + } else { > + op_p->handler(key); > + } > } else { > pr_info("This sysrq operation is disabled.\n"); > console_loglevel = orig_log_level; > > -Sumit