Date: Fri, 24 Jun 2022 02:31:46 -0400
From: "Michael S. Tsirkin"
To: Jason Wang
Cc: Cornelia Huck, Halil Pasic, Heiko Carstens, Vasily Gorbik,
	Christian Borntraeger, Alexander Gordeev, linux-s390@vger.kernel.org,
	virtualization, kvm, linux-kernel, Ben Hutchings, David Hildenbrand
Subject: Re: [PATCH V3] virtio: disable notification hardening by default
Message-ID: <20220624022622-mutt-send-email-mst@kernel.org>
References: <20220622012940.21441-1-jasowang@redhat.com>
	<20220622025047-mutt-send-email-mst@kernel.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Jun 22, 2022 at 03:09:31PM +0800, Jason Wang wrote:
> On Wed, Jun 22, 2022 at 3:03 PM Michael S. Tsirkin wrote:
> >
> > On Wed, Jun 22, 2022 at 09:29:40AM +0800, Jason Wang wrote:
> > > We try to harden virtio device notifications in 8b4ec69d7e09 ("virtio:
> > > harden vring IRQ"). It works with the assumption that the driver or
> > > core can properly call virtio_device_ready() at the right
> > > place. Unfortunately, this seems to be not true and uncover various
> > > bugs of the existing drivers, mainly the issue of using
> > > virtio_device_ready() incorrectly.
> > >
> > > So let's having a Kconfig option and disable it by default. It gives
> > > us a breath to fix the drivers and then we can consider to enable it
> > > by default.
> > >
> > > Signed-off-by: Jason Wang
> >
> >
> > OK I will queue, but I think the problem is fundamental.
>
> If I understand correctly, you want some core IRQ work?

Yes.

> As discussed
> before, it doesn't solve all the problems, we still need to do per
> driver audit.
>
> Thanks

Maybe, but then we don't need to tie things to device_ready.
We can do

- disable irqs
- device ready
- setup everything
- enable irqs

and this works for most things; the only issue is that
it deadlocks if "setup everything" waits for interrupts.

With the current approach there's really no good time:

1. - setup everything
   - device ready

   can cause kicks before device is ready

2. - device ready
   - setup everything

   can cause callbacks before setup.

So I prefer option 1, with the hardening fixed in the core.

> >
> >
> > > ---
> > > Changes since V2:
> > > - Tweak the Kconfig help
> > > - Add comment for the read_lock() pairing in virtio_ccw
> > > ---
> > >  drivers/s390/virtio/virtio_ccw.c |  9 ++++++++-
> > >  drivers/virtio/Kconfig           | 13 +++++++++++++
> > >  drivers/virtio/virtio.c          |  2 ++
> > >  drivers/virtio/virtio_ring.c     | 12 ++++++++++++
> > >  include/linux/virtio_config.h    |  2 ++
> > >  5 files changed, 37 insertions(+), 1 deletion(-)
> > >
> > > diff --git a/drivers/s390/virtio/virtio_ccw.c b/drivers/s390/virtio/virtio_ccw.c
> > > index 97e51c34e6cf..1f6a358f65f0 100644
> > > --- a/drivers/s390/virtio/virtio_ccw.c
> > > +++ b/drivers/s390/virtio/virtio_ccw.c
> > > @@ -1136,8 +1136,13 @@ static void virtio_ccw_int_handler(struct ccw_device *cdev,
> > >  			vcdev->err = -EIO;
> > >  	}
> > >  	virtio_ccw_check_activity(vcdev, activity);
> > > -	/* Interrupts are disabled here */
> > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION
> > > +	/*
> > > +	 * Paired with virtio_ccw_synchronize_cbs() and interrupts are
> > > +	 * disabled here.
> > > +	 */
> > >  	read_lock(&vcdev->irq_lock);
> > > +#endif
> > >  	for_each_set_bit(i, indicators(vcdev),
> > >  			 sizeof(*indicators(vcdev)) * BITS_PER_BYTE) {
> > >  		/* The bit clear must happen before the vring kick. */
> > > @@ -1146,7 +1151,9 @@ static void virtio_ccw_int_handler(struct ccw_device *cdev,
> > >  		vq = virtio_ccw_vq_by_ind(vcdev, i);
> > >  		vring_interrupt(0, vq);
> > >  	}
> > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION
> > >  	read_unlock(&vcdev->irq_lock);
> > > +#endif
> > >  	if (test_bit(0, indicators2(vcdev))) {
> > >  		virtio_config_changed(&vcdev->vdev);
> > >  		clear_bit(0, indicators2(vcdev));
> > > diff --git a/drivers/virtio/Kconfig b/drivers/virtio/Kconfig
> > > index b5adf6abd241..c04f370a1e5c 100644
> > > --- a/drivers/virtio/Kconfig
> > > +++ b/drivers/virtio/Kconfig
> > > @@ -35,6 +35,19 @@ menuconfig VIRTIO_MENU
> > >  
> > >  if VIRTIO_MENU
> > >  
> > > +config VIRTIO_HARDEN_NOTIFICATION
> > > +	bool "Harden virtio notification"
> > > +	help
> > > +	  Enable this to harden the device notifications and suppress
> > > +	  those that happen at a time where notifications are illegal.
> > > +
> > > +	  Experimental: Note that several drivers still have bugs that
> > > +	  may cause crashes or hangs when correct handling of
> > > +	  notifications is enforced; depending on the subset of
> > > +	  drivers and devices you use, this may or may not work.
> > > +
> > > +	  If unsure, say N.
> > > +
> > >  config VIRTIO_PCI
> > >  	tristate "PCI driver for virtio devices"
> > >  	depends on PCI
> > > diff --git a/drivers/virtio/virtio.c b/drivers/virtio/virtio.c
> > > index ef04a96942bf..21dc08d2f32d 100644
> > > --- a/drivers/virtio/virtio.c
> > > +++ b/drivers/virtio/virtio.c
> > > @@ -220,6 +220,7 @@ static int virtio_features_ok(struct virtio_device *dev)
> > >   * */
> > >  void virtio_reset_device(struct virtio_device *dev)
> > >  {
> > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION
> > >  	/*
> > >  	 * The below virtio_synchronize_cbs() guarantees that any
> > >  	 * interrupt for this line arriving after
> > > @@ -228,6 +229,7 @@ void virtio_reset_device(struct virtio_device *dev)
> > >  	 */
> > >  	virtio_break_device(dev);
> > >  	virtio_synchronize_cbs(dev);
> > > +#endif
> > >  
> > >  	dev->config->reset(dev);
> > >  }
> > > diff --git a/drivers/virtio/virtio_ring.c b/drivers/virtio/virtio_ring.c
> > > index 13a7348cedff..d9d3b6e201fb 100644
> > > --- a/drivers/virtio/virtio_ring.c
> > > +++ b/drivers/virtio/virtio_ring.c
> > > @@ -1688,7 +1688,11 @@ static struct virtqueue *vring_create_virtqueue_packed(
> > >  	vq->we_own_ring = true;
> > >  	vq->notify = notify;
> > >  	vq->weak_barriers = weak_barriers;
> > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION
> > >  	vq->broken = true;
> > > +#else
> > > +	vq->broken = false;
> > > +#endif
> > >  	vq->last_used_idx = 0;
> > >  	vq->event_triggered = false;
> > >  	vq->num_added = 0;
> > > @@ -2135,9 +2139,13 @@ irqreturn_t vring_interrupt(int irq, void *_vq)
> > >  	}
> > >  
> > >  	if (unlikely(vq->broken)) {
> > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION
> > >  		dev_warn_once(&vq->vq.vdev->dev,
> > >  			      "virtio vring IRQ raised before DRIVER_OK");
> > >  		return IRQ_NONE;
> > > +#else
> > > +		return IRQ_HANDLED;
> > > +#endif
> > >  	}
> > >  
> > >  	/* Just a hint for performance: so it's ok that this can be racy! */
> > > @@ -2180,7 +2188,11 @@ struct virtqueue *__vring_new_virtqueue(unsigned int index,
> > >  	vq->we_own_ring = false;
> > >  	vq->notify = notify;
> > >  	vq->weak_barriers = weak_barriers;
> > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION
> > >  	vq->broken = true;
> > > +#else
> > > +	vq->broken = false;
> > > +#endif
> > >  	vq->last_used_idx = 0;
> > >  	vq->event_triggered = false;
> > >  	vq->num_added = 0;
> > > diff --git a/include/linux/virtio_config.h b/include/linux/virtio_config.h
> > > index 9a36051ceb76..d15c3cdda2d2 100644
> > > --- a/include/linux/virtio_config.h
> > > +++ b/include/linux/virtio_config.h
> > > @@ -257,6 +257,7 @@ void virtio_device_ready(struct virtio_device *dev)
> > >  
> > >  	WARN_ON(status & VIRTIO_CONFIG_S_DRIVER_OK);
> > >  
> > > +#ifdef CONFIG_VIRTIO_HARDEN_NOTIFICATION
> > >  	/*
> > >  	 * The virtio_synchronize_cbs() makes sure vring_interrupt()
> > >  	 * will see the driver specific setup if it sees vq->broken
> > > @@ -264,6 +265,7 @@ void virtio_device_ready(struct virtio_device *dev)
> > >  	 */
> > >  	virtio_synchronize_cbs(dev);
> > >  	__virtio_unbreak_device(dev);
> > > +#endif
> > >  	/*
> > >  	 * The transport should ensure the visibility of vq->broken
> > >  	 * before setting DRIVER_OK. See the comments for the transport
> > > -- 
> > > 2.25.1
> >
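
To make the ordering argument from the reply concrete, here is a small
userspace model. It is only a sketch under stated assumptions: the step
names, the hazard flags and check_order() are hypothetical and are not
kernel APIs. It encodes just the three claims made in the reply: setup
before device ready can kick a device that is not ready, device ready
before setup can run callbacks before setup, and masking interrupts
around both deadlocks if setup waits for an interrupt.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical probe steps, named after the list in the reply. */
enum step { DISABLE_IRQS, DEVICE_READY, SETUP_EVERYTHING, ENABLE_IRQS };

/* Hazards an ordering can expose (illustrative flags only). */
enum hazard {
	HAZARD_EARLY_KICK     = 1 << 0, /* setup kicks before device ready */
	HAZARD_EARLY_CALLBACK = 1 << 1, /* callback can run before setup   */
	HAZARD_DEADLOCK       = 1 << 2, /* setup waits for a masked irq    */
};

/*
 * Walk a sequence of probe steps and return the set of hazards that the
 * reply attributes to that ordering. setup_waits_for_irq models a driver
 * whose setup phase blocks until the device interrupts.
 */
unsigned int check_order(const enum step *steps, size_t n,
			 bool setup_waits_for_irq)
{
	bool irqs_masked = false;
	bool ready = false;
	unsigned int hazards = 0;

	assert(steps != NULL);

	for (size_t i = 0; i < n; i++) {
		switch (steps[i]) {
		case DISABLE_IRQS:
			irqs_masked = true;
			break;
		case ENABLE_IRQS:
			irqs_masked = false;
			break;
		case DEVICE_READY:
			ready = true;
			break;
		case SETUP_EVERYTHING:
			if (!ready)
				hazards |= HAZARD_EARLY_KICK;
			else if (!irqs_masked)
				hazards |= HAZARD_EARLY_CALLBACK;
			else if (setup_waits_for_irq)
				hazards |= HAZARD_DEADLOCK;
			break;
		}
	}
	return hazards;
}
```

In this model the masked sequence (disable irqs, device ready, setup
everything, enable irqs) reports no hazard unless setup_waits_for_irq is
true, in which case it reports HAZARD_DEADLOCK; the two unmasked
orderings report the early kick and the early callback respectively.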