Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp177203pxa; Fri, 14 Aug 2020 00:26:12 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwe5p4qJZEPyM+ZQLRZ02YH3W3eNcLowiS4NLL43PZkbqLTB2vOfVldXijI5IUscM0T6dFJ X-Received: by 2002:a17:906:138b:: with SMTP id f11mr1150312ejc.310.1597389972559; Fri, 14 Aug 2020 00:26:12 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1597389972; cv=none; d=google.com; s=arc-20160816; b=lPbICZ6P/fv3qsSv4IEEbQy+FZlo83oozniTzHRgGu8rA2v6QUUjkCPCx0pm0HPf5J j09rDQ1yweZwpgu3emPI1jC5KRDRtpfH2i2wdNmLD49fknDQwAOGbD+x+rByTTwSG51o tfJHdS6TG+jGX/nVS/GuMjReAuCp4jTELet0Vzci/AxtRdCSUDEGHitzp2XvbqBQszCd /CqarJQi6zjfYKsLB6QM9WSE6VvMSuXNJ/YaSCcm32Q8R9NRKoFXGxs87SM1+/iWgUU5 B0GvhgHw/y1H2lklbGMT562S/EXGMUNcYgbckTze6ru5kU+KbVc5pkcAW8AqQ9Sl/Opw 89tg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=4fPs0JDtfGboYKwrCWztqbdTj5WJ03dHcVbV7Wd/c5s=; b=S0ycqGUub1ized+vJumveiyHNnz8ztjwrBXlTaDoQJzRxxEssWbLTndsoFTTwsio/Z nuNwaUhWLkYMLeXVaZhAKVZnxq+6rKNCOi3SSohd1IrgU86RnglyUX3QYF4vsvys/DuM ZWf+jZFL7gCFrz4PNfWp+uZKH7KRBx5CHRmYIA59JJqrr8SG3rGxicCXiWKt0b5csrkq REBpdfHQRNdx5UMelUjd6/I8jFQcjnFAXOj5m2O2SSW3zHPI0TZTFDwFzEFjFXquho1b VBOvqh9TrJPs2zNRkdAMXeaRD0cnaFMnOUG8C7TuZKyMAzk0vFM/9TQ0oKVGNy7K9+PJ mCaA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=NgjWmY2F; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id g13si4766915edv.263.2020.08.14.00.25.48; Fri, 14 Aug 2020 00:26:12 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=NgjWmY2F; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726285AbgHNHYv (ORCPT + 99 others); Fri, 14 Aug 2020 03:24:51 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:57062 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726227AbgHNHYv (ORCPT ); Fri, 14 Aug 2020 03:24:51 -0400 Received: from mail-lj1-x241.google.com (mail-lj1-x241.google.com [IPv6:2a00:1450:4864:20::241]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id DDE12C061757 for ; Fri, 14 Aug 2020 00:24:49 -0700 (PDT) Received: by mail-lj1-x241.google.com with SMTP id v4so8958110ljd.0 for ; Fri, 14 Aug 2020 00:24:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=4fPs0JDtfGboYKwrCWztqbdTj5WJ03dHcVbV7Wd/c5s=; b=NgjWmY2FdzolwiHelbqczqVP1Dzk6b2a8gAHTL9y7oqv3vU1781Zy84P473xdCsaKD l8whA5FioTs41uuk/TgAImUqOVUnTVJtYNxF19dRT5rALy3lY+fv0gg8raUnbwXb42Or omtHOg+lbiujjdW5BmRdVE4N7H0NfAdJsrPDGDyCY4Le8CYTe2hauCtOlnvpEF8s6ege VqzDkllVROgr5o574nGBLTmExHXkXybOgEYXlp4Su6S15GCuYZoGXIVTO0K5hRXHppb2 7J0uSad3EZPbkkguB+6Xc7W7KQ5qot7VK53RB9+TD1UloCZOKezI2UpAb90W5y/D8JFT qM1g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=4fPs0JDtfGboYKwrCWztqbdTj5WJ03dHcVbV7Wd/c5s=; b=Hkt1bbKjfp1/G/lVSrYFuFd0Ni1qE0/fvVBSMMfAKnuex+w4oW2MPAzcnZBCGyF+Ls 5q8AfjU7tzxMw7Wo6WKoJi4G9doPCo7C/+beRM94MnbApGdC7QKFRnA1J4dm6zo7eBLK JgbC4d+tyZXIg7AQuZmxIbj53kECcsew6V6Ec42uSO9HpuF7BjoPLoyBjsyEd7yCqPDK RS0ueG3BbSl2/Cx6QdXkRUg4tavWfs1AJ/FksdMt1DQ501yD5CGyZXOEztMSEcFBA9aT D/ocG0i3lxFb6jjQBm319IcPkFUWKyYKmflceF1yqx0XbMaMs1Ic93tYleV7PQZ2F0fX oI1g== X-Gm-Message-State: AOAM5314WPqgPr/9hO1oPzWnz2yiF/KdcNxD7tRn/zrbE7ekdchyKMuR lfBuI0KlfHHq39Z+diD7hQxJbQ4fxVsSp+LNaEVefQ== X-Received: by 2002:a2e:b4f4:: with SMTP id s20mr651759ljm.339.1597389886799; Fri, 14 Aug 2020 00:24:46 -0700 (PDT) MIME-Version: 1.0 References: <1595333413-30052-1-git-send-email-sumit.garg@linaro.org> <1595333413-30052-2-git-send-email-sumit.garg@linaro.org> In-Reply-To: From: Sumit Garg Date: Fri, 14 Aug 2020 12:54:35 +0530 Message-ID: Subject: Re: [RFC 1/5] tty/sysrq: Make sysrq handler NMI aware To: Doug Anderson , Peter Zijlstra Cc: Greg Kroah-Hartman , Daniel Thompson , linux-serial@vger.kernel.org, kgdb-bugreport@lists.sourceforge.net, Jiri Slaby , Russell King - ARM Linux , Jason Wessel , LKML , Linux ARM Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org + Peter (author of irq_work.c) On Thu, 13 Aug 2020 at 05:30, Doug Anderson wrote: > > Hi, > > On Tue, Jul 21, 2020 at 5:10 AM Sumit Garg wrote: > > > > In a future patch we will add support to the serial core to make it > > possible to trigger a magic sysrq from an NMI context. Prepare for this > > by marking some sysrq actions as NMI safe. Safe actions will be allowed > > to run from NMI context whilst that cannot run from an NMI will be queued > > as irq_work for later processing. > > > > A particular sysrq handler is only marked as NMI safe in case the handler > > isn't contending for any synchronization primitives as in NMI context > > they are expected to cause deadlocks. Note that the debug sysrq do not > > contend for any synchronization primitives. It does call kgdb_breakpoint() > > to provoke a trap but that trap handler should be NMI safe on > > architectures that implement an NMI. > > > > Signed-off-by: Sumit Garg > > --- > > drivers/tty/sysrq.c | 33 ++++++++++++++++++++++++++++++++- > > include/linux/sysrq.h | 1 + > > kernel/debug/debug_core.c | 1 + > > 3 files changed, 34 insertions(+), 1 deletion(-) > > > > diff --git a/drivers/tty/sysrq.c b/drivers/tty/sysrq.c > > index 7c95afa9..8017e33 100644 > > --- a/drivers/tty/sysrq.c > > +++ b/drivers/tty/sysrq.c > > @@ -50,6 +50,8 @@ > > #include > > #include > > #include > > +#include > > +#include > > > > #include > > #include > > @@ -111,6 +113,7 @@ static const struct sysrq_key_op sysrq_loglevel_op = { > > .help_msg = "loglevel(0-9)", > > .action_msg = "Changing Loglevel", > > .enable_mask = SYSRQ_ENABLE_LOG, > > + .nmi_safe = true, > > }; > > > > #ifdef CONFIG_VT > > @@ -157,6 +160,7 @@ static const struct sysrq_key_op sysrq_crash_op = { > > .help_msg = "crash(c)", > > .action_msg = "Trigger a crash", > > .enable_mask = SYSRQ_ENABLE_DUMP, > > + .nmi_safe = true, > > }; > > > > static void sysrq_handle_reboot(int key) > > @@ -170,6 +174,7 @@ static const struct sysrq_key_op sysrq_reboot_op = { > > .help_msg = "reboot(b)", > > .action_msg = "Resetting", > > .enable_mask = SYSRQ_ENABLE_BOOT, > > + .nmi_safe = true, > > }; > > > > const struct sysrq_key_op *__sysrq_reboot_op = &sysrq_reboot_op; > > @@ -217,6 +222,7 @@ static const struct sysrq_key_op sysrq_showlocks_op = { > > .handler = sysrq_handle_showlocks, > > .help_msg = "show-all-locks(d)", > > .action_msg = "Show Locks Held", > > + .nmi_safe = true, > > }; > > #else > > #define sysrq_showlocks_op (*(const struct sysrq_key_op *)NULL) > > @@ -289,6 +295,7 @@ static const struct sysrq_key_op sysrq_showregs_op = { > > .help_msg = "show-registers(p)", > > .action_msg = "Show Regs", > > .enable_mask = SYSRQ_ENABLE_DUMP, > > + .nmi_safe = true, > > }; > > > > static void sysrq_handle_showstate(int key) > > @@ -326,6 +333,7 @@ static const struct sysrq_key_op sysrq_ftrace_dump_op = { > > .help_msg = "dump-ftrace-buffer(z)", > > .action_msg = "Dump ftrace buffer", > > .enable_mask = SYSRQ_ENABLE_DUMP, > > + .nmi_safe = true, > > }; > > #else > > #define sysrq_ftrace_dump_op (*(const struct sysrq_key_op *)NULL) > > @@ -538,6 +546,23 @@ static void __sysrq_put_key_op(int key, const struct sysrq_key_op *op_p) > > sysrq_key_table[i] = op_p; > > } > > > > +#define SYSRQ_NMI_FIFO_SIZE 64 > > +static DEFINE_KFIFO(sysrq_nmi_fifo, int, SYSRQ_NMI_FIFO_SIZE); > > A 64-entry FIFO seems excessive. Quite honestly even a FIFO seems a > bit excessive and it feels like if two sysrqs were received in super > quick succession that it would be OK to just process the first one. I > guess if it simplifies the processing to have a FIFO then it shouldn't > hurt, but no need for 64 entries. > Okay, would a 2-entry FIFO work here? As here we need a FIFO to pass on the key parameter. > > > +static void sysrq_do_nmi_work(struct irq_work *work) > > +{ > > + const struct sysrq_key_op *op_p; > > + int key; > > + > > + while (kfifo_out(&sysrq_nmi_fifo, &key, 1)) { > > + op_p = __sysrq_get_key_op(key); > > + if (op_p) > > + op_p->handler(key); > > + } > > Do you need to manage "suppress_printk" in this function? Do you need > to call rcu_sysrq_start() and rcu_read_lock()? Ah I missed those. Will add them here instead. > > If so, how do you prevent racing between the mucking we're doing with > these things and the mucking that the NMI does with them? IIUC, here you meant to highlight the race while scheduled sysrq is executing in IRQ context and we receive a new sysrq in NMI context, correct? If yes, this seems to be a trickier situation. I think the appropriate way to handle it would be to deny any further sysrq handling until the prior sysrq handling is complete, your views? > > > > +} > > + > > +static DEFINE_IRQ_WORK(sysrq_nmi_work, sysrq_do_nmi_work); > > + > > void __handle_sysrq(int key, bool check_mask) > > { > > const struct sysrq_key_op *op_p; > > @@ -568,7 +593,13 @@ void __handle_sysrq(int key, bool check_mask) > > if (!check_mask || sysrq_on_mask(op_p->enable_mask)) { > > pr_info("%s\n", op_p->action_msg); > > console_loglevel = orig_log_level; > > - op_p->handler(key); > > + > > + if (in_nmi() && !op_p->nmi_safe) { > > + kfifo_in(&sysrq_nmi_fifo, &key, 1); > > Rather than kfifo_in() and kfifo_out(), I think you can use > kfifo_put() and kfifo_get(). As I understand it those just get/put > one element which is what you want. Okay, will use kfifo_put() and kfifo_get() here instead. > > > > + irq_work_queue(&sysrq_nmi_work); > > Wishful thinking, but (as far as I can tell) irq_work_queue() only > queues work on the CPU running the NMI. I don't have lots of NMI > experience, but any chance there is a variant that will queue work on > any CPU? Then sysrq handlers that aren't NMI aware will be more > likely to work. > Unfortunately, queuing work on other CPUs isn't safe in NMI context, see this warning [1]. The comment mentions: /* Arch remote IPI send/receive backend aren't NMI safe */ Peter, Can you throw some light here as to why it isn't considered NMI-safe to send remote IPI in NMI context? Is it an arch specific limitation? [1] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/kernel/irq_work.c#n103 -Sumit > > > > > + } else { > > + op_p->handler(key); > > + } > > } else { > > pr_info("This sysrq operation is disabled.\n"); > > console_loglevel = orig_log_level; > > diff --git a/include/linux/sysrq.h b/include/linux/sysrq.h > > index 3a582ec..630b5b9 100644 > > --- a/include/linux/sysrq.h > > +++ b/include/linux/sysrq.h > > @@ -34,6 +34,7 @@ struct sysrq_key_op { > > const char * const help_msg; > > const char * const action_msg; > > const int enable_mask; > > + const bool nmi_safe; > > }; > > > > #ifdef CONFIG_MAGIC_SYSRQ > > diff --git a/kernel/debug/debug_core.c b/kernel/debug/debug_core.c > > index 9e59347..2b51173 100644 > > --- a/kernel/debug/debug_core.c > > +++ b/kernel/debug/debug_core.c > > @@ -943,6 +943,7 @@ static const struct sysrq_key_op sysrq_dbg_op = { > > .handler = sysrq_handle_dbg, > > .help_msg = "debug(g)", > > .action_msg = "DEBUG", > > + .nmi_safe = true, > > }; > > #endif > > > > -- > > 2.7.4 > >