References: <5a7f6bbf4cf2038634a572f42ad80e95a8d0ae9c.1600686204.git.bcodding@redhat.com> <8DB79D4D-6986-4114-B031-43157089C2B5@redhat.com>
From: Anna Schumaker
Date: Tue, 22 Sep 2020 11:53:46 -0400
Subject: Re: [PATCH 1/3] NFSv4: Fix a livelock when CLOSE pre-emptively bumps state sequence
To: Benjamin Coddington
Cc: Trond Myklebust, Linux NFS Mailing List
X-Mailing-List: linux-nfs@vger.kernel.org

On Tue, Sep 22, 2020 at 11:49 AM Benjamin Coddington wrote:
>
> On 22 Sep 2020, at 10:43, Anna Schumaker wrote:
>
> > On Tue, Sep 22, 2020 at 10:31 AM Anna Schumaker wrote:
> >>
> >> On Tue, Sep 22, 2020 at 10:22 AM Benjamin Coddington wrote:
> >>>
> >>> On 22 Sep 2020, at 10:03, Anna Schumaker wrote:
> >>>> Hi Ben,
> >>>>
> >>>> Once I apply this patch I have trouble with generic/478 doing lock
> >>>> reclaim:
> >>>>
> >>>> [ 937.460505] run fstests generic/478 at 2020-09-22 09:59:14
> >>>> [ 937.607990] NFS: __nfs4_reclaim_open_state: Lock reclaim failed!
> >>>>
> >>>> And the test just hangs until I kill it.
> >>>>
> >>>> Just thought you should know!
> >>>
> >>> Yes, thanks! I'm not seeing that.. I've tested these based on v5.8.4, I'll
> >>> rebase and check again. I see a wirecap of generic/478 is only 515K on my
> >>> system, would you be willing to share a capture of your test failing?
> >>
> >> I have it based on v5.9-rc6 (plus the patches I have queued up for
> >> v5.10), so there definitely could be a difference there! I'm using a
> >> stock kernel on my server, though :)
> >>
> >> I can definitely get you a packet trace once I re-apply the patch and
> >> rerun the test.
> >
> > Here's the packet trace, I reran the test with just this patch applied
> > on top of v5.9-rc6 so it's not interacting with something else in my
> > tree. Looks like it's ending up in an NFS4ERR_OLD_STATEID loop.
>
> Thanks very much!
>
> Did you see this failure with all three patches applied, or just with the
> first patch?

I saw it with the first patch applied, and with the first and third
applied. I initially hit it as I was wrapping up for the day yesterday,
but I left out #2 since I saw your retraction.

> I see the client get two OPEN responses, but then is sending TEST_STATEID
> with the first seqid. Seems like seqid 2 is getting lost. I wonder if
> we're making a bad assumption that NFS_OPEN_STATE can only be toggled under
> the so_lock.
>
> Ben