Received: by 2002:a6b:fb09:0:0:0:0:0 with SMTP id h9csp2552766iog; Sun, 26 Jun 2022 19:55:01 -0700 (PDT) X-Google-Smtp-Source: AGRyM1vuSQlllWV/AF+BZe5MY7xAk1R9LOVRNWFZZ0IMVgop/ste4iNza5gPExpUOWwIfckkOH6p X-Received: by 2002:a05:6402:3818:b0:436:f2e1:64b3 with SMTP id es24-20020a056402381800b00436f2e164b3mr14108201edb.111.1656298501291; Sun, 26 Jun 2022 19:55:01 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1656298501; cv=none; d=google.com; s=arc-20160816; b=x0VWPf8WEgYESx1RaWwrJzzKL3CzAeyW993jdwdabvBWmsXJ5Tg4yYkQmN6474Y7pp alGq0Tddp7KLouFLoilqMMT3uCdwnqLsaYhci8wb4J8XH4kFUCpk2VW7uELaRgEgQ0VX od+pKhcQJkCL7AoCfikZ/lbjdvT7qXg7d3B5yL7IypYM0J0uRY1AIR1NS+gKfVGsMsOl 1jVF/nM+zjLUIlKAsRGXKBAf6zlQTSwlTZgKDxMKBzYd1l0fh2eXmSHooqa8U0Hfb7BK iMfoPZlBw/j4vbZEqZu4Rny3ja+sOfKlGlB37vozBCkVtrotbQPg1OXqpOERptUOVLvL NXqQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=biN1ia1/BQslLYkymJMJ0pnjqHIpxR4UA9N5HO9QFXs=; b=hkRSFr/gnm6FKiHHebkWRcKRRyNPSnOZ0i6SlWiA23W2yxOfiEqSp22BJdLmsoYf/h 0Vm9FC1sTvAFXVzKGTf777LZ7LZAZuaw8uMFjZMGOcKjLOM13aEU8GnOtUlBGHBrU7qe W7URTMAOP9yEhLHVm1n1rGrj9ySDGyXV3Jc9RSAHiprpLFJn0geQh5ngB6ww1vS31MXp ujcd1tc5tNNkrnX0S54PRu8hdpOLDH3VOvqMUm2ntRCqO4XjQl1jwbanxKZClKBZnQoX qs+Bc8oKhHVlj38KdbK2DIAHV8Q4KX8Z4I625DYJVI7j4OL+VdFSRK8Yplly1ukqKMZ9 hdVA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=dmAfLFQ7; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id di4-20020a170906730400b00718bdf7794esi12524394ejc.487.2022.06.26.19.54.37; Sun, 26 Jun 2022 19:55:01 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=dmAfLFQ7; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229538AbiF0Cug (ORCPT + 99 others); Sun, 26 Jun 2022 22:50:36 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:58468 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231593AbiF0Cue (ORCPT ); Sun, 26 Jun 2022 22:50:34 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id E2BCD2DCB for ; Sun, 26 Jun 2022 19:50:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1656298232; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=biN1ia1/BQslLYkymJMJ0pnjqHIpxR4UA9N5HO9QFXs=; b=dmAfLFQ7jwfZ1yMDaSzIvJBwCZi/zH3HTz8fOYRBUQFkr5CIQj5lFlrZwYpbZzspp/QXkF tGc6qmgnO8+ephCxEO7+NzTpJZs4u8YDd44wZ+q/NU23bA9w1UnIMVAAabM8RrH1fAkJ07 0gCz9p2JdPnYuL2LyrqEOLydLPdEm4o= Received: from mail-lf1-f69.google.com (mail-lf1-f69.google.com [209.85.167.69]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-308-bCGYn4XFNOC8F6h5OaX1AQ-1; Sun, 26 Jun 2022 22:50:30 -0400 X-MC-Unique: bCGYn4XFNOC8F6h5OaX1AQ-1 Received: by mail-lf1-f69.google.com with SMTP id bq4-20020a056512150400b0047f7f36efc6so3990154lfb.9 for ; Sun, 26 Jun 2022 19:50:30 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=biN1ia1/BQslLYkymJMJ0pnjqHIpxR4UA9N5HO9QFXs=; b=VFTDkpdFkErOz85rzieudtRwmkwdfp23uxs2f3HUXTQSWJU14SHkyOsk1750jCfzwg BXBs7sadesf0wcpkHgUfTV6e0d7ON9od5vlSySplr7nnRO3DU7p/042+WikNHgUQoqf+ PFFgHTRkR2jSODVf7ss+5PClgW2U5IoBOyz9kszXscrBb3I7QD+nwtjTLtmJFmpLMiU4 p5hU0xw5wAxlRWB27E9pqxallnAW0T0zH68Yx/8/zQAov698gP5r6ZpRQUmnkR3jhyQs aMsxvfOcoPg7R1uY9vSZCGRS0nmu56PemyhaGuwMn5oURG3YNfVM2w572PgmlNL7y071 qpig== X-Gm-Message-State: AJIora9n0pL+8+aCkMMjjc1MYLnlxPvoVsBCXDrMSqyw+h24FZGt/fep 3bzeCBC2yzDcponQ7pK1uK+9EG0B0xi0l7XxSzqMfHXKuELNZk7uTF1+zH2dY93kgutQ126Tm0z 2VmFr9DtWd+X0a3qIQBmkTF7rrbtwz/LkiQ4aPxmx X-Received: by 2002:a05:6512:158d:b0:47f:718c:28b5 with SMTP id bp13-20020a056512158d00b0047f718c28b5mr7251006lfb.397.1656298228869; Sun, 26 Jun 2022 19:50:28 -0700 (PDT) X-Received: by 2002:a05:6512:158d:b0:47f:718c:28b5 with SMTP id bp13-20020a056512158d00b0047f718c28b5mr7250996lfb.397.1656298228615; Sun, 26 Jun 2022 19:50:28 -0700 (PDT) MIME-Version: 1.0 References: <20220622012940.21441-1-jasowang@redhat.com> <20220622025047-mutt-send-email-mst@kernel.org> <20220624022622-mutt-send-email-mst@kernel.org> In-Reply-To: <20220624022622-mutt-send-email-mst@kernel.org> From: Jason Wang Date: Mon, 27 Jun 2022 10:50:17 +0800 Message-ID: Subject: Re: [PATCH V3] virtio: disable notification hardening by default To: "Michael S. Tsirkin" Cc: Cornelia Huck , Halil Pasic , Heiko Carstens , Vasily Gorbik , Christian Borntraeger , Alexander Gordeev , linux-s390@vger.kernel.org, virtualization , kvm , linux-kernel , Ben Hutchings , David Hildenbrand Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-2.5 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Jun 24, 2022 at 2:31 PM Michael S. Tsirkin wrote: > > On Wed, Jun 22, 2022 at 03:09:31PM +0800, Jason Wang wrote: > > On Wed, Jun 22, 2022 at 3:03 PM Michael S. Tsirkin wrote: > > > > > > On Wed, Jun 22, 2022 at 09:29:40AM +0800, Jason Wang wrote: > > > > We try to harden virtio device notifications in 8b4ec69d7e09 ("virtio: > > > > harden vring IRQ"). It works with the assumption that the driver or > > > > core can properly call virtio_device_ready() at the right > > > > place. Unfortunately, this seems to be not true and uncover various > > > > bugs of the existing drivers, mainly the issue of using > > > > virtio_device_ready() incorrectly. > > > > > > > > So let's having a Kconfig option and disable it by default. It gives > > > > us a breath to fix the drivers and then we can consider to enable it > > > > by default. > > > > > > > > Signed-off-by: Jason Wang > > > > > > > > > OK I will queue, but I think the problem is fundamental. > > > > If I understand correctly, you want some core IRQ work? > > Yes. > > > As discussed > > before, it doesn't solve all the problems, we still need to do per > > driver audit. > > > > Thanks > > Maybe, but we don't need to tie things to device_ready then. > We can do > > - disable irqs > - device ready > - setup everything > - enable irqs > > > and this works for most things, the only issue is > this deadlocks if "setup everything" waits for interrupts. > > > With the current approach there's really no good time: > 1.- setup everything > - device ready > > can cause kicks before device is ready > > 2.- device ready > - setup everything > > can cause callbacks before setup. > > So I prefer the 1. and fix the hardening in the core. So my question is: 1) do similar hardening like config interrupt or 2) per transport notification work (e.g for PCI core IRQ work) 1) seems easier and universal, but we pay little overhead which could be eliminated by the config option. 2) seems require more work in the IRQ core and it can not work for all transports (e.g vDPA would be kind of difficult) Thanks > > > > > > > > > > > > --- > > > > Changes since V2: > > > > - Tweak the Kconfig help > > > > - Add comment for the read_lock() pairing in virtio_ccw > > > > --- > > > > drivers/s390/virtio/virtio_ccw.c | 9 ++++++++- > > > > drivers/virtio/Kconfig | 13 +++++++++++++ > > > > drivers/virtio/virtio.c | 2 ++ > > > > drivers/virtio/virtio_ring.c | 12 ++++++++++++ > > > > include/linux/virtio_config.h | 2 ++ > > > > 5 files changed, 37 insertions(+), 1 deletion(-) > > > > > > > > diff --git a/drivers/s390/virtio/virtio_ccw.c b/drivers/s390/virtio/virtio_ccw.c > > > > index 97e51c34e6cf..1f6a358f65f0 100644 > > > > --- a/drivers/s390/virtio/virtio_ccw.c > > > > +++ b/drivers/s390/virtio/virtio_ccw.c > > > > @@ -1136,8 +1136,13 @@ static void virtio_ccw_int_handler(struct ccw_device *cdev, > > > > vcdev->err = -EIO; > > > > } > > > > virtio_ccw_check_activity(vcdev, activity); > > > > - /* Interrupts are disabled here */ > > > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION > > > > + /* > > > > + * Paried with virtio_ccw_synchronize_cbs() and interrupts are > > > > + * disabled here. > > > > + */ > > > > read_lock(&vcdev->irq_lock); > > > > +#endif > > > > for_each_set_bit(i, indicators(vcdev), > > > > sizeof(*indicators(vcdev)) * BITS_PER_BYTE) { > > > > /* The bit clear must happen before the vring kick. */ > > > > @@ -1146,7 +1151,9 @@ static void virtio_ccw_int_handler(struct ccw_device *cdev, > > > > vq = virtio_ccw_vq_by_ind(vcdev, i); > > > > vring_interrupt(0, vq); > > > > } > > > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION > > > > read_unlock(&vcdev->irq_lock); > > > > +#endif > > > > if (test_bit(0, indicators2(vcdev))) { > > > > virtio_config_changed(&vcdev->vdev); > > > > clear_bit(0, indicators2(vcdev)); > > > > diff --git a/drivers/virtio/Kconfig b/drivers/virtio/Kconfig > > > > index b5adf6abd241..c04f370a1e5c 100644 > > > > --- a/drivers/virtio/Kconfig > > > > +++ b/drivers/virtio/Kconfig > > > > @@ -35,6 +35,19 @@ menuconfig VIRTIO_MENU > > > > > > > > if VIRTIO_MENU > > > > > > > > +config VIRTIO_HARDEN_NOTIFICATION > > > > + bool "Harden virtio notification" > > > > + help > > > > + Enable this to harden the device notifications and suppress > > > > + those that happen at a time where notifications are illegal. > > > > + > > > > + Experimental: Note that several drivers still have bugs that > > > > + may cause crashes or hangs when correct handling of > > > > + notifications is enforced; depending on the subset of > > > > + drivers and devices you use, this may or may not work. > > > > + > > > > + If unsure, say N. > > > > + > > > > config VIRTIO_PCI > > > > tristate "PCI driver for virtio devices" > > > > depends on PCI > > > > diff --git a/drivers/virtio/virtio.c b/drivers/virtio/virtio.c > > > > index ef04a96942bf..21dc08d2f32d 100644 > > > > --- a/drivers/virtio/virtio.c > > > > +++ b/drivers/virtio/virtio.c > > > > @@ -220,6 +220,7 @@ static int virtio_features_ok(struct virtio_device *dev) > > > > * */ > > > > void virtio_reset_device(struct virtio_device *dev) > > > > { > > > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION > > > > /* > > > > * The below virtio_synchronize_cbs() guarantees that any > > > > * interrupt for this line arriving after > > > > @@ -228,6 +229,7 @@ void virtio_reset_device(struct virtio_device *dev) > > > > */ > > > > virtio_break_device(dev); > > > > virtio_synchronize_cbs(dev); > > > > +#endif > > > > > > > > dev->config->reset(dev); > > > > } > > > > diff --git a/drivers/virtio/virtio_ring.c b/drivers/virtio/virtio_ring.c > > > > index 13a7348cedff..d9d3b6e201fb 100644 > > > > --- a/drivers/virtio/virtio_ring.c > > > > +++ b/drivers/virtio/virtio_ring.c > > > > @@ -1688,7 +1688,11 @@ static struct virtqueue *vring_create_virtqueue_packed( > > > > vq->we_own_ring = true; > > > > vq->notify = notify; > > > > vq->weak_barriers = weak_barriers; > > > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION > > > > vq->broken = true; > > > > +#else > > > > + vq->broken = false; > > > > +#endif > > > > vq->last_used_idx = 0; > > > > vq->event_triggered = false; > > > > vq->num_added = 0; > > > > @@ -2135,9 +2139,13 @@ irqreturn_t vring_interrupt(int irq, void *_vq) > > > > } > > > > > > > > if (unlikely(vq->broken)) { > > > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION > > > > dev_warn_once(&vq->vq.vdev->dev, > > > > "virtio vring IRQ raised before DRIVER_OK"); > > > > return IRQ_NONE; > > > > +#else > > > > + return IRQ_HANDLED; > > > > +#endif > > > > } > > > > > > > > /* Just a hint for performance: so it's ok that this can be racy! */ > > > > @@ -2180,7 +2188,11 @@ struct virtqueue *__vring_new_virtqueue(unsigned int index, > > > > vq->we_own_ring = false; > > > > vq->notify = notify; > > > > vq->weak_barriers = weak_barriers; > > > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION > > > > vq->broken = true; > > > > +#else > > > > + vq->broken = false; > > > > +#endif > > > > vq->last_used_idx = 0; > > > > vq->event_triggered = false; > > > > vq->num_added = 0; > > > > diff --git a/include/linux/virtio_config.h b/include/linux/virtio_config.h > > > > index 9a36051ceb76..d15c3cdda2d2 100644 > > > > --- a/include/linux/virtio_config.h > > > > +++ b/include/linux/virtio_config.h > > > > @@ -257,6 +257,7 @@ void virtio_device_ready(struct virtio_device *dev) > > > > > > > > WARN_ON(status & VIRTIO_CONFIG_S_DRIVER_OK); > > > > > > > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION > > > > /* > > > > * The virtio_synchronize_cbs() makes sure vring_interrupt() > > > > * will see the driver specific setup if it sees vq->broken > > > > @@ -264,6 +265,7 @@ void virtio_device_ready(struct virtio_device *dev) > > > > */ > > > > virtio_synchronize_cbs(dev); > > > > __virtio_unbreak_device(dev); > > > > +#endif > > > > /* > > > > * The transport should ensure the visibility of vq->broken > > > > * before setting DRIVER_OK. See the comments for the transport > > > > -- > > > > 2.25.1 > > > >