Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp4154726pxf; Tue, 6 Apr 2021 09:09:07 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxoyQChQgRR989QTA/2YPFrWPciF9HILOd/r3zUDztrETZ9THwiTdFvmYz4Gj4NMB1wNbpG X-Received: by 2002:a05:6000:1045:: with SMTP id c5mr5270431wrx.245.1617725347367; Tue, 06 Apr 2021 09:09:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1617725347; cv=none; d=google.com; s=arc-20160816; b=F2AjYFoe08iJYX5DNnXGLKaUX3j5gfE13GLbWcI8yzXzpCWDYo/VILl2n8QNMFpP4w ttOWnE0/DCdSIm/gcrztylNyHduODm1K2NDhYP3exZ6l/mMq8hkHEQvMhzYZhzgiKX/3 UfFlb0L/2Iw7q2wIzqYc8BWMylCZAFcDE0ojUWL/wJlqsM8xcLTaswnMC1Q3dYfW0qhL U7CCtrE8/FYfBV3l3MX8vGI1bd7bUmmaQVFj44zzQMs2L71YhZmD21QE83S5HnBgo02g sr5f6gndvsvzP4pQFTSz9/tkUr3Ct8S5+yCAWoips+jZk5GSmDbcXDKEPb2Jb/rb2WHk c5fg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=RHYpJ9VGT17MkKS5EdZ6n9RX16Pjj9NUFF7c08NF7rY=; b=pxRAI1+NCkIGAwm3tGNuLNOOJ4VBgPyduI74VztV0AhJUpMyk7+e0W2OorFqGQY/od F67I+mlt4T7caPT5z3VwxAZK80KMB/Xijh5sIz8D23ijAI07DcrA/1K3ZNIXV4S6nAb8 9hLJAi2Q07aN2VW7oKi2VU/eeAUEvKkiX0K+jIO4S9GhqnlEdd/EWUh4q/h5sFYmNkjK DTUSQT3IDH19f5ZdE4PpZ8yMuyvWmyCjaFnaTePfOjRlGfeeBDncOZomIt2YODKmt3pZ 7GL5ZTrYs4c7rWUF7V5jc7uBbhh3QP8nHLgLVRdWmHu/UXHfG6tvDLgVK7HspnknXFUb Oyxg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id n18si16030161ejg.224.2021.04.06.09.08.43; Tue, 06 Apr 2021 09:09:07 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S244142AbhDFHHK (ORCPT + 99 others); Tue, 6 Apr 2021 03:07:10 -0400 Received: from mx2.suse.de ([195.135.220.15]:60650 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S244136AbhDFHHI (ORCPT ); Tue, 6 Apr 2021 03:07:08 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id B3F3FAFAA; Tue, 6 Apr 2021 07:06:59 +0000 (UTC) Received: by lion.mk-sys.cz (Postfix, from userid 1000) id 1809D60441; Tue, 6 Apr 2021 09:06:59 +0200 (CEST) Date: Tue, 6 Apr 2021 09:06:59 +0200 From: Michal Kubecek To: Yunsheng Lin Cc: Jiri Kosina , Hillf Danton , John Fastabend , Cong Wang , Paolo Abeni , Kehuan Feng , Jike Song , Jonas Bonn , Michael Zhivich , David Miller , LKML , Netdev , Josh Hunt Subject: Re: Packet gets stuck in NOLOCK pfifo_fast qdisc Message-ID: <20210406070659.t6csfgkskbmmxtx5@lion.mk-sys.cz> References: <20200827125747.5816-1-hdanton@sina.com> <5f51cbad3cc2_3eceb208fc@john-XPS-13-9370.notmuch> <20210403003537.2032-1-hdanton@sina.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Apr 06, 2021 at 08:55:41AM +0800, Yunsheng Lin wrote: > > Hi, Jiri > Do you have a reproducer that can be shared here? > With reproducer, I can debug and test it myself too. I'm afraid we are not aware of a simple reproducer. As mentioned in the original discussion, the race window is extremely small and the other thread has to do quite a lot in the meantime which is probably why, as far as I know, this was never observed on real hardware, only in virtualization environments. NFS may also be important as, IIUC, it can often issue an RPC request from a different CPU right after a data transfer. Perhaps you could cheat a bit and insert a random delay between the empty queue check and releasing q->seqlock to make it more likely to happen. Other than that, it's rather just "run this complex software in a xen VM and wait". Michal