Date: Mon, 15 Aug 2022 12:50:52 -0400
From: "Michael S. Tsirkin"
To: Andres Freund
Cc: Xuan Zhuo, Jason Wang, "David S. Miller", Eric Dumazet,
	Jakub Kicinski, Paolo Abeni,
	virtualization@lists.linux-foundation.org, netdev@vger.kernel.org,
	Linus Torvalds, Jens Axboe, James Bottomley, "Martin K. Petersen",
	Guenter Roeck, linux-kernel@vger.kernel.org, Greg KH, c@redhat.com
Subject: Re: upstream kernel crashes
Message-ID: <20220815124748-mutt-send-email-mst@kernel.org>
References: <1c057afa-92df-ee3c-5978-3731d3db9345@kernel.dk>
	<20220815013651.mrm7qgklk6sgpkbb@awork3.anarazel.de>
	<20220815071143.n2t5xsmifnigttq2@awork3.anarazel.de>
	<20220815034532-mutt-send-email-mst@kernel.org>
	<20220815081527.soikyi365azh5qpu@awork3.anarazel.de>
	<20220815042623-mutt-send-email-mst@kernel.org>
	<20220815113729-mutt-send-email-mst@kernel.org>
	<20220815164503.jsoezxcm6q4u2b6j@awork3.anarazel.de>
In-Reply-To: <20220815164503.jsoezxcm6q4u2b6j@awork3.anarazel.de>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, Aug 15, 2022 at 09:45:03AM -0700, Andres Freund wrote:
> Hi,
>
> On 2022-08-15 11:40:59 -0400, Michael S. Tsirkin wrote:
> > OK so this gives us a quick revert as a solution for now.
> > Next, I would appreciate it if you just try this simple hack.
> > If it crashes we either have a long standing problem in virtio
> > code or more likely a gcp bug where it can't handle smaller
> > rings than what device requests.
> > Thanks!
>
> I applied the below and the problem persists.
>
> > diff --git a/drivers/virtio/virtio_pci_modern.c b/drivers/virtio/virtio_pci_modern.c
> > index f7965c5dd36b..bdd5f481570b 100644
> > --- a/drivers/virtio/virtio_pci_modern.c
> > +++ b/drivers/virtio/virtio_pci_modern.c
> > @@ -314,6 +314,9 @@ static struct virtqueue *setup_vq(struct virtio_pci_device *vp_dev,
> >  	if (!size || size > num)
> >  		size = num;
> >  
> > +	if (size > 1024)
> > +		size = 1024;
> > +
> >  	if (size & (size - 1)) {
> >  		dev_warn(&vp_dev->pci_dev->dev, "bad queue size %u", size);
> >  		return ERR_PTR(-EINVAL);
> >
> >
>
> [    1.165162] virtio_net virtio1 enp0s4: renamed from eth0
> [    1.177815] general protection fault, probably for non-canonical address 0xffff000000000400: 0000 [#1] PREEMPT SMP PTI
> [    1.179565] CPU: 1 PID: 125 Comm: systemd-udevd Not tainted 6.0.0-rc1-bisect14-dirty #14
> [    1.180785] Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/29/2022
> [    1.182475] RIP: 0010:__kmalloc_node_track_caller+0x19e/0x380
> [    1.183365] Code: 2b 04 25 28 00 00 00 0f 85 f8 01 00 00 48 83 c4 18 48 89 e8 5b 5d 41 5c 41 5d 41 5e 41 5f c3 cc cc cc cc 8b 4d 28 48 8b 7d 00 <48> 8b 1c 08 48 8d 4a 40 65 48 0f c7 0f 0f 94 c0 84 c0 0f 84 0b ff
> [    1.186208] RSP: 0018:ffff9c470021b860 EFLAGS: 00010246
> [    1.187194] RAX: ffff000000000000 RBX: 00000000000928c0 RCX: 0000000000000400
> [    1.188634] RDX: 0000000000005781 RSI: 00000000000928c0 RDI: 000000000002e0f0
> [    1.190177] RBP: ffff908380042c00 R08: 0000000000000600 R09: ffff908380b665e4
> [    1.191256] R10: 0000000000000003 R11: 0000000000000002 R12: 00000000000928c0
> [    1.192269] R13: 0000000000000740 R14: 00000000ffffffff R15: 0000000000000000
> [    1.193368] FS:  00007f746702a8c0(0000) GS:ffff9084b7d00000(0000) knlGS:0000000000000000
> [    1.194846] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [    1.195661] CR2: 00007ffc010df980 CR3: 0000000103826005 CR4: 00000000003706e0
> [    1.196912] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> [    1.198216] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
> [    1.199367] Call Trace:
> [    1.199815]  <TASK>
> [    1.200138]  ? netlink_trim+0x85/0xb0
> [    1.200754]  pskb_expand_head+0x92/0x340
> [    1.202512]  netlink_trim+0x85/0xb0
> [    1.203069]  netlink_unicast+0x54/0x390
> [    1.203630]  rtnl_getlink+0x366/0x410
> [    1.204155]  ? __d_alloc+0x24/0x1d0
> [    1.204668]  rtnetlink_rcv_msg+0x146/0x3b0
> [    1.205256]  ? _raw_spin_unlock+0xd/0x30
> [    1.205867]  ? __d_add+0xf2/0x1b0
> [    1.206600]  ? rtnl_calcit.isra.0+0x130/0x130
> [    1.207221]  netlink_rcv_skb+0x49/0xf0
> [    1.207904]  netlink_unicast+0x23a/0x390
> [    1.208585]  netlink_sendmsg+0x23b/0x4b0
> [    1.209203]  sock_sendmsg+0x57/0x60
> [    1.210118]  __sys_sendto+0x117/0x170
> [    1.210694]  ? __wake_up_common_lock+0x83/0xc0
> [    1.211420]  __x64_sys_sendto+0x1b/0x30
> [    1.211992]  do_syscall_64+0x37/0x90
> [    1.212497]  entry_SYSCALL_64_after_hwframe+0x63/0xcd
> [    1.213407] RIP: 0033:0x7f74677404e6
> [    1.213973] Code: 69 0e 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b8 0f 1f 00 41 89 ca 64 8b 04 25 18 00 00 00 85 c0 75 11 b8 2c 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 72 c3 90 41 54 48 83 ec 30 44 89 4c 24 2c 4c
> [    1.217098] RSP: 002b:00007ffc010daa78 EFLAGS: 00000246 ORIG_RAX: 000000000000002c
> [    1.219539] RAX: ffffffffffffffda RBX: 000000000011bc98 RCX: 00007f74677404e6
> [    1.220552] RDX: 0000000000000020 RSI: 0000563160679570 RDI: 0000000000000005
> [    1.222378] RBP: 00005631606796b0 R08: 00007ffc010daaf0 R09: 0000000000000080
> [    1.223692] R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000001
> [    1.224793] R13: 0000000000000000 R14: 0000000000000000 R15: 00005631606794b0
> [    1.226228]  </TASK>
> [    1.226775] Modules linked in:
> [    1.227414] ---[ end trace 0000000000000000 ]---
>
> Greetings,
>
> Andres Freund

Okay!
And just to be 100% sure, can you try the following on top of 5.19:


diff --git a/drivers/virtio/virtio_pci_modern.c b/drivers/virtio/virtio_pci_modern.c
index 623906b4996c..6f4e54a618bc 100644
--- a/drivers/virtio/virtio_pci_modern.c
+++ b/drivers/virtio/virtio_pci_modern.c
@@ -208,6 +208,9 @@ static struct virtqueue *setup_vq(struct virtio_pci_device *vp_dev,
 		return ERR_PTR(-EINVAL);
 	}
 
+	if (num > 1024)
+		num = 1024;
+
 	info->msix_vector = msix_vec;
 
 	/* create the vring */

-- 
MST
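
For anyone following along, here is a rough userspace sketch of what both test hacks in this thread do to the ring size: cap it at 1024, then apply the same kind of power-of-two test that setup_vq() shows in the first diff. The helper names and TEST_RING_CAP are made up for illustration and are not kernel API; treat this as an approximation of the logic, not the driver code.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical name for the cap used by the test patches (1024 entries). */
#define TEST_RING_CAP 1024u

/*
 * True when n is a nonzero power of two. The driver's check is the
 * (n & (n - 1)) part; the n != 0 guard is added here so the helper is
 * safe on its own.
 */
static bool ring_size_is_pow2(uint16_t n)
{
	return n != 0 && (n & (n - 1)) == 0;
}

/* Cap the device-reported queue size, as the hack does for num/size. */
static uint16_t clamp_ring_size(uint16_t num)
{
	return num > TEST_RING_CAP ? TEST_RING_CAP : num;
}
```

Since 1024 is itself a power of two, a clamped size still passes the power-of-two test, so the hack only shrinks the ring and does not trip the "bad queue size" warning. Sizes at or below the cap pass through unchanged.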