Subject: Re: [PATCH v2 net] net: sch_generic: aviod concurrent reset and enqueue op for lockless qdisc
From: Yunsheng Lin
To: Vishwanath Pai, Cong Wang
CC: "Hunt, Joshua", Jamal Hadi Salim, Jiri Pirko, David Miller, "Jakub Kicinski", Linux Kernel Network Developers, LKML, "linuxarm@huawei.com", John Fastabend, Eric Dumazet
Date: Thu, 29 Oct 2020 10:37:26 +0800
Message-ID: <93ab7f0f-7b5a-74c3-398d-a572274a4790@huawei.com>
In-Reply-To: <19f888c2-8bc1-ea56-6e19-4cb4841c4da0@akamai.com>
References: <1599562954-87257-1-git-send-email-linyunsheng@huawei.com> <830f85b5-ef29-c68e-c982-de20ac880bd9@huawei.com> <19f888c2-8bc1-ea56-6e19-4cb4841c4da0@akamai.com>

On 2020/10/29 4:04, Vishwanath Pai wrote:
> On 10/28/20 1:47 PM, Cong Wang wrote:
>> On Wed, Oct 28, 2020 at 8:37 AM Pai, Vishwanath wrote:
>>> Hi,
>>>
>>> We noticed some problems when testing the latest 5.4 LTS kernel and traced it
>>> back to this commit using git bisect. When running our tests the machine stops
>>> responding to all traffic and the only way to recover is a reboot. I do not see
>>> a stack trace on the console.
>>
>> Do you mean the machine is still running fine just the network is down?
>>
>> If so, can you dump your tc config with stats when the problem is happening?
>> (You can use `tc -s -d qd show ...`.)
>>
>>>
>>> This can be reproduced using the packetdrill test below, it should be run a
>>> few times or in a loop. You should hit this issue within a few tries but
>>> sometimes might take up to 15-20 tries.
>> ...
>>> I can reproduce the issue easily on v5.4.68, and after reverting this commit it
>>> does not happen anymore.
>>
>> This is odd. The patch in this thread touches netdev reset path, if packetdrill
>> is the only thing you use to trigger the bug (that is netdev is always active),
>> I can not connect them.
>>
>> Thanks.
>
> Hi Cong,
>
>> Do you mean the machine is still running fine just the network is down?
>
> I was able to access the machine via serial console, it looks like it is
> up and running, just that networking is down.
>
>> If so, can you dump your tc config with stats when the problem is happening?
>> (You can use `tc -s -d qd show ...`.)
>
> If I try running tc when the machine is in this state the command never
> returns. It doesn't print anything but doesn't exit either.
>
>> This is odd. The patch in this thread touches netdev reset path, if packetdrill
>> is the only thing you use to trigger the bug (that is netdev is always active),
>> I can not connect them.
>
> I think packetdrill creates a tun0 interface when it starts the
> test and tears it down at the end, so it might be hitting this code path
> during teardown.

Hi,

Is any setup required before running the packetdrill test case above? I ran
it on 5.9-rc4 with this patch applied, without any preparation, and could not
reproduce the issue.

By the way, I am new to packetdrill :), so it would help if you could provide
the detailed setup needed to reproduce it. Thanks.

>
> P.S: My mail server is having connectivity issues with vger.kernel.org
> so messages aren't getting delivered to netdev. It'll hopefully get
> resolved soon.
>
> Thanks,
> Vishwanath