Received: by 2002:ad5:4acb:0:0:0:0:0 with SMTP id n11csp4612920imw; Tue, 19 Jul 2022 09:48:07 -0700 (PDT) X-Google-Smtp-Source: AGRyM1twkRqwu0aAWSWqtcy6jWpaUrAplOA6ITLZE5iSYpdN6xZs3+YvzT+DWBiZUie5/6TxHDx0 X-Received: by 2002:a17:907:3e05:b0:71c:2ba5:1ab with SMTP id hp5-20020a1709073e0500b0071c2ba501abmr30586334ejc.93.1658249287662; Tue, 19 Jul 2022 09:48:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1658249287; cv=none; d=google.com; s=arc-20160816; b=HsE4PkLc/jGXWt8inpbE4USCyW+RgOUegKiSGAmxzc+Fd/9HrbO66ZA6kh7u2dFmOM rW2WJn35Yx8QQS2TXbx1jkajD+MFqieuUE0Q5Uaby4RXrSVkF0VuDqmTgvqiFeN+KI8w +BdfYAyYLMp1iwPJx5O7nXWj6G4MUhDHcPBp84ESVkGbTXn6zErBItYBDPEL5Jpal+8p TaMqpkhkxIMoolIQ6cijokOv+W7pQxzLKSAs4+K6XQbYDvOrUI16fktfE3BXZanw7ICR OpQDIUX0tSwBPhYFBWp7igrVKWbdkwqLb/3hgAyUb7n/SUP4iyQBfqKW7uZpoMxPC8A7 nT6Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:user-agent:references:in-reply-to :subject:cc:to:from:message-id:date:dkim-signature; bh=wOcUYZt4SsGk4sWRJGNetOGUl5OpMjY6hibPdGavMrc=; b=v3n5d1fQeicQ1a4D183c+aKQpZEZN9PKUpMOiy/UVim8UCyEH1wpqCin6nwgL6RwQ6 78oemheCuuwd609al4P7tEy0C111TKxqQbzzVs9/MGz4nyDkA9b6rwGNmVhqoOYxhgZa CCtHt0lhmJHPtqd1E7TcNLs79s+cLWrvb4JKgcOTHZ0AoPzUrOJRn5/v3ztP2dZtMgXq bTVAoKxQqyM1e8odSu9clz01OWKY/4olHLCaSvDSpz6LD+4jgb15SzoQtkleVt2Rtp8h N+bsAljMQWs+KgjoBA2hsPiaZzyJ4vw04My3S/N/s503dCTOh48wt34rs7IXYKfoNpzw EFfg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=HQyebdNj; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id gb15-20020a170907960f00b00711d9021212si17187044ejc.566.2022.07.19.09.47.41; Tue, 19 Jul 2022 09:48:07 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=HQyebdNj; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229531AbiGSQUR (ORCPT + 99 others); Tue, 19 Jul 2022 12:20:17 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54692 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238271AbiGSQUO (ORCPT ); Tue, 19 Jul 2022 12:20:14 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [IPv6:2604:1380:4601:e00::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3291B11C23 for ; Tue, 19 Jul 2022 09:20:09 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 04991B81BD0 for ; Tue, 19 Jul 2022 16:20:08 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 8DAFAC341CA; Tue, 19 Jul 2022 16:20:06 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1658247606; bh=+bR78aF429H1hmw0bA0ANi4dgyVHv+HSsYm48n9qNNg=; h=Date:From:To:Cc:Subject:In-Reply-To:References:From; b=HQyebdNjXTBgd68v4dheHL8TIDaZI1tFIofu9qyrhxGWFMK50FMZWi+uT287i7rC4 GtRwVBy3TumBsLpA8WvXfrha5+0xHn+0GWkLqvZfiZKBMKt6Gs9vlS09hdQr7CWd56 xHrRREEZwoHQr17yG2kBMdMn9FJewaG08BbaHstgwtlM+HtRhPchwbdR80RlgKrytA CZytj/K4A2gLI9fYfB+EKYaGXflMGMBvWsUBnptu9AnqUZN8jcxnyX8PyNTfJOBxe5 jUnNwooxlMXW/HPmjrSSlvbwZLQXIjBuu0yDP6cJ2BRS7QitI1oxXze2L+Zbu+AoCa G7hJKOieiY7qQ== Received: from sofa.misterjones.org ([185.219.108.64] helo=why.misterjones.org) by disco-boy.misterjones.org with esmtpsa (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.95) (envelope-from ) id 1oDpx6-008Zbs-6i; Tue, 19 Jul 2022 17:20:04 +0100 Date: Tue, 19 Jul 2022 17:20:03 +0100 Message-ID: <87bktlyszw.wl-maz@kernel.org> From: Marc Zyngier To: Anup Patel Cc: Palmer Dabbelt , Paul Walmsley , Thomas Gleixner , Daniel Lezcano , Atish Patra , Alistair Francis , Anup Patel , linux-riscv@lists.infradead.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH v6 3/7] genirq: Add mechanism to multiplex a single HW IPI In-Reply-To: <20220418105305.1196665-4-apatel@ventanamicro.com> References: <20220418105305.1196665-1-apatel@ventanamicro.com> <20220418105305.1196665-4-apatel@ventanamicro.com> User-Agent: Wanderlust/2.15.9 (Almost Unreal) SEMI-EPG/1.14.7 (Harue) FLIM-LB/1.14.9 (=?UTF-8?B?R29qxY0=?=) APEL-LB/10.8 EasyPG/1.0.0 Emacs/27.1 (x86_64-pc-linux-gnu) MULE/6.0 (HANACHIRUSATO) MIME-Version: 1.0 (generated by SEMI-EPG 1.14.7 - "Harue") Content-Type: text/plain; charset=US-ASCII X-SA-Exim-Connect-IP: 185.219.108.64 X-SA-Exim-Rcpt-To: apatel@ventanamicro.com, palmer@dabbelt.com, paul.walmsley@sifive.com, tglx@linutronix.de, daniel.lezcano@linaro.org, atishp@atishpatra.org, Alistair.Francis@wdc.com, anup@brainfault.org, linux-riscv@lists.infradead.org, linux-kernel@vger.kernel.org X-SA-Exim-Mail-From: maz@kernel.org X-SA-Exim-Scanned: No (on disco-boy.misterjones.org); SAEximRunCond expanded to false X-Spam-Status: No, score=-7.8 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 18 Apr 2022 11:53:01 +0100, Anup Patel wrote: > > All RISC-V platforms have a single HW IPI provided by the INTC local > interrupt controller. The HW method to trigger INTC IPI can be through > external irqchip (e.g. RISC-V AIA), through platform specific device > (e.g. SiFive CLINT timer), or through firmware (e.g. SBI IPI call). > > To support multiple IPIs on RISC-V, we add a generic IPI multiplexing > mechanism which help us create multiple virtual IPIs using a single > HW IPI. This generic IPI multiplexing is shared among various RISC-V > irqchip drivers. > > Signed-off-by: Anup Patel > --- > include/linux/irq.h | 11 +++ > kernel/irq/Kconfig | 4 + > kernel/irq/Makefile | 1 + > kernel/irq/ipi-mux.c | 197 +++++++++++++++++++++++++++++++++++++++++++ > 4 files changed, 213 insertions(+) > create mode 100644 kernel/irq/ipi-mux.c > > diff --git a/include/linux/irq.h b/include/linux/irq.h > index f92788ccdba2..5bb4e2db63d7 100644 > --- a/include/linux/irq.h > +++ b/include/linux/irq.h > @@ -1247,6 +1247,17 @@ int __ipi_send_mask(struct irq_desc *desc, const struct cpumask *dest); > int ipi_send_single(unsigned int virq, unsigned int cpu); > int ipi_send_mask(unsigned int virq, const struct cpumask *dest); > > +#define IPI_MUX_NR_IRQS BITS_PER_LONG > + > +struct ipi_mux_ops { > + void (*ipi_mux_clear)(unsigned int parent_virq); > + void (*ipi_mux_send)(unsigned int parent_virq, > + const struct cpumask *mask); You really cannot just dump this like this. This requires documentation so that architecture maintainers can move over to this. I appreciate that this area is pretty poorly documented, but we need to start somewhere. > +}; > + > +void ipi_mux_process(void); > +int ipi_mux_create(unsigned int parent_virq, const struct ipi_mux_ops *ops); > + > #ifdef CONFIG_GENERIC_IRQ_MULTI_HANDLER > /* > * Registers a generic IRQ handling function as the top-level IRQ handler in > diff --git a/kernel/irq/Kconfig b/kernel/irq/Kconfig > index 10929eda9825..2388e7d40ed3 100644 > --- a/kernel/irq/Kconfig > +++ b/kernel/irq/Kconfig > @@ -84,6 +84,10 @@ config GENERIC_IRQ_IPI > bool > select IRQ_DOMAIN_HIERARCHY > > +# Generic IRQ IPI Mux support > +config GENERIC_IRQ_IPI_MUX > + bool > + > # Generic MSI interrupt support > config GENERIC_MSI_IRQ > bool > diff --git a/kernel/irq/Makefile b/kernel/irq/Makefile > index b4f53717d143..f19d3080bf11 100644 > --- a/kernel/irq/Makefile > +++ b/kernel/irq/Makefile > @@ -15,6 +15,7 @@ obj-$(CONFIG_GENERIC_IRQ_MIGRATION) += cpuhotplug.o > obj-$(CONFIG_PM_SLEEP) += pm.o > obj-$(CONFIG_GENERIC_MSI_IRQ) += msi.o > obj-$(CONFIG_GENERIC_IRQ_IPI) += ipi.o > +obj-$(CONFIG_GENERIC_IRQ_IPI_MUX) += ipi-mux.o > obj-$(CONFIG_SMP) += affinity.o > obj-$(CONFIG_GENERIC_IRQ_DEBUGFS) += debugfs.o > obj-$(CONFIG_GENERIC_IRQ_MATRIX_ALLOCATOR) += matrix.o > diff --git a/kernel/irq/ipi-mux.c b/kernel/irq/ipi-mux.c > new file mode 100644 > index 000000000000..1a1fcfe3ac54 > --- /dev/null > +++ b/kernel/irq/ipi-mux.c > @@ -0,0 +1,197 @@ > +// SPDX-License-Identifier: GPL-2.0-only > +/* > + * Multiplex several virtual IPIs over a single HW IPI. > + * > + * Copyright (c) 2022 Ventana Micro Systems Inc. > + */ > + > +#define pr_fmt(fmt) "ipi-mux: " fmt > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +static unsigned int ipi_mux_parent_virq; > +static struct irq_domain *ipi_mux_domain; > +static const struct ipi_mux_ops *ipi_mux_ops; > +static DEFINE_PER_CPU(unsigned long, ipi_mux_bits); > + > +static void ipi_mux_dummy(struct irq_data *d) > +{ > +} > + > +static void ipi_mux_send_mask(struct irq_data *d, const struct cpumask *mask) > +{ > + int cpu; > + > + /* Barrier before doing atomic bit update to IPI bits */ > + smp_mb__before_atomic(); > + > + for_each_cpu(cpu, mask) > + set_bit(d->hwirq, per_cpu_ptr(&ipi_mux_bits, cpu)); > + > + /* Barrier after doing atomic bit update to IPI bits */ > + smp_mb__after_atomic(); > + > + /* Trigger the parent IPI */ > + ipi_mux_ops->ipi_mux_send(ipi_mux_parent_virq, mask); > +} > + > +static struct irq_chip ipi_mux_chip = { const, please. > + .name = "IPI Mux", > + .irq_mask = ipi_mux_dummy, > + .irq_unmask = ipi_mux_dummy, Maybe we should consider a flow that does not require this dummy callbacks. > + .ipi_send_mask = ipi_mux_send_mask, > +}; > + > +static int ipi_mux_domain_map(struct irq_domain *d, unsigned int irq, > + irq_hw_number_t hwirq) > +{ > + irq_set_percpu_devid(irq); > + irq_domain_set_info(d, irq, hwirq, &ipi_mux_chip, d->host_data, > + handle_percpu_devid_irq, NULL, NULL); > + > + return 0; > +} > + > +static int ipi_mux_domain_alloc(struct irq_domain *d, unsigned int virq, > + unsigned int nr_irqs, void *arg) > +{ > + unsigned int type = IRQ_TYPE_NONE; Really, this should be EDGE. > + struct irq_fwspec *fwspec = arg; > + irq_hw_number_t hwirq; > + int i, ret; > + > + ret = irq_domain_translate_onecell(d, fwspec, &hwirq, &type); > + if (ret) > + return ret; > + > + for (i = 0; i < nr_irqs; i++) { > + ret = ipi_mux_domain_map(d, virq + i, hwirq + i); > + if (ret) > + return ret; > + } > + > + return 0; > +} > + > +static const struct irq_domain_ops ipi_mux_domain_ops = { > + .translate = irq_domain_translate_onecell, What is the purpose of this callback? Firmware shouldn't be involved in IPIs at all, and since you only have one cell, you can use the default path. This is also a dependency on CONFIG_IRQ_DOMAIN_HIERARCHY. > + .alloc = ipi_mux_domain_alloc, > + .free = irq_domain_free_irqs_top, > +}; > + > +/** > + * ipi_mux_process - Process multiplexed virtual IPIs > + */ > +void ipi_mux_process(void) > +{ > + unsigned long irqs, *bits = this_cpu_ptr(&ipi_mux_bits); > + irq_hw_number_t hwirq; > + int err; > + > + /* Clear the parent IPI */ > + if (ipi_mux_ops->ipi_mux_clear) > + ipi_mux_ops->ipi_mux_clear(ipi_mux_parent_virq); > + > + /* > + * Barrier for IPI bits paired with smp_mb__xyz_atomic() xyz??? > + * in ipi_mux_send_mask() > + */ > + smp_mb(); > + > + irqs = xchg(bits, 0); > + if (!irqs) > + return; > + > + for_each_set_bit(hwirq, &irqs, IPI_MUX_NR_IRQS) { > + err = generic_handle_domain_irq(ipi_mux_domain, > + hwirq); > + if (unlikely(err)) > + pr_warn_ratelimited( > + "can't find mapping for hwirq %lu\n", > + hwirq); > + } > +} > + > +static void ipi_mux_handler(struct irq_desc *desc) > +{ > + struct irq_chip *chip = irq_desc_get_chip(desc); > + > + chained_irq_enter(chip, desc); > + ipi_mux_process(); > + chained_irq_exit(chip, desc); > +} > + > +static int ipi_mux_dying_cpu(unsigned int cpu) > +{ > + disable_percpu_irq(ipi_mux_parent_virq); > + return 0; > +} > + > +static int ipi_mux_starting_cpu(unsigned int cpu) > +{ > + enable_percpu_irq(ipi_mux_parent_virq, > + irq_get_trigger_type(ipi_mux_parent_virq)); > + return 0; > +} > + > +/** > + * ipi_mux_create - Create virtual IPIs (total IPI_MUX_NR_IRQS) multiplexed > + * on top of a single parent IPI. > + * @parent_virq: virq of the parent IPI > + * @ops: multiplexing operations for the parent IPI > + * > + * If the parent IPI > 0 then ipi_mux_process() will be automatically > + * called via chained handler. > + * > + * If the parent IPI <= 0 then it is responsiblity of irqchip drivers > + * to explicitly call ipi_mux_process() for processing muxed IPIs. > + * > + * Returns first virq of the newly created virutal IPIs upon success > + * or <=0 upon failure > + */ > +int ipi_mux_create(unsigned int parent_virq, const struct ipi_mux_ops *ops) > +{ Why should be parent_virq be unique? I also see nothing that checks that this is a per-CPU interrupt. If anything, this needs documenting. > + struct irq_domain *domain; > + struct irq_fwspec ipi; > + int virq; > + > + if (ipi_mux_domain || !ops || !ops->ipi_mux_send) > + return 0; > + > + domain = irq_domain_add_linear(NULL, IPI_MUX_NR_IRQS, Urgh. For a start, please use the irq_domain_create_* version, as this shouldn't be DT specific. Then, don't use a NULL fwnode, as this results in a "default domain", which nobody sane should ever use anymore. Also, defaulting to BITS_PER_LONG is a lot of interrupts for not much (most archs use only a handful). You may want to consider this being a parameter, and cap it at BITS_PER_LONG. > + &ipi_mux_domain_ops, NULL); > + if (!domain) { > + pr_err("unable to add IPI Mux domain\n"); > + return 0; > + } > + > + ipi.fwnode = domain->fwnode; Which is NULL (see above). > + ipi.param_count = 1; > + ipi.param[0] = 0; > + virq = __irq_domain_alloc_irqs(domain, -1, IPI_MUX_NR_IRQS, > + NUMA_NO_NODE, &ipi, false, NULL); > + if (virq <= 0) { > + pr_err("unable to alloc IRQs from IPI Mux domain\n"); > + irq_domain_remove(domain); > + return virq; > + } > + > + ipi_mux_domain = domain; > + ipi_mux_parent_virq = parent_virq; > + ipi_mux_ops = ops; > + > + if (parent_virq > 0) { > + irq_set_chained_handler(parent_virq, ipi_mux_handler); > + > + cpuhp_setup_state(CPUHP_AP_ONLINE_DYN, > + "irqchip/ipi-mux:starting", > + ipi_mux_starting_cpu, ipi_mux_dying_cpu); > + } > + > + return virq; > +} Thanks, M. -- Without deviation from the norm, progress is not possible.