Received: by 2002:a05:6a10:1d13:0:0:0:0 with SMTP id pp19csp1273876pxb; Fri, 20 Aug 2021 01:50:20 -0700 (PDT) X-Google-Smtp-Source: ABdhPJx5B0ac4krFiQhkZASoyr4ttD0REvpF5OVGMypyP2lXF2vIYzGvaoA/oLg9HblpwDCEUV2p X-Received: by 2002:a5e:df0d:: with SMTP id f13mr15042801ioq.108.1629449420428; Fri, 20 Aug 2021 01:50:20 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1629449420; cv=none; d=google.com; s=arc-20160816; b=NNCt9CPu98BSNz117+kdJH+4vR2YC3ItgTdbSA4HMdJ1ocMEqZKoh+Y7Ezmw+RIk5V iTIvUsGarD8yXsKqhAaD6o6RrwSWEBlQJ3d5Yh8QfWRLWzaKiQdDTMBPGp1t1AfaUBJc ybGhDDM2yhSOJK93/5mUgXX/MeSG3mtFvuagC33lhae2ygYSdLokbbn9EaokCf9ppSEW DsS3g3yLYzQe6Rvu+sdF9GouxlFRyE+FnF2x3zLoUNQgXuywdm+4X/hQjBHcxDTPizJR GHk7DU9PwjAGGOE56zDlfbi1XHnviSxFPCGgKHS+Ehgwx8mH7Qu2qT3M2KG+9j3i5u/f KLig== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature :dkim-signature; bh=VCaN0M5amMyZRXL8tzm/AX/hJ8arauZM7tvxoIPOVXY=; b=KI3ycFq0CMPzFR8aFOHkWNdBHYDgl+h36D2PYK5dGTTP6Ov5GzO77MvlcE1Am35XKG wcyr5MfdYMjN/wz/L5laZ40LggjWrrtlM/WhZtCvBmi8kpCHMBEurxI3fG8PaHuLKwDJ 0e8FehFiocZfLLvKMfkg56+d1FuDhFV24zYbNpii2OI1jh32ObWN8jx9LhPEbESBDZHz AV3K18kGOplEwgIrmIemCrWil9Djf6gCeEH1AhFV7l045ZLHAnobYbTF0bUaftfOSOR7 nJeJxKHUu10k6v8En23gvszj8hkc3GEhhIqEHKQzqV8+9kQT4+N0LQTu/Ya36LNSBhCy MQig== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.de header.s=susede2_rsa header.b=J6uwkYNy; dkim=neutral (no key) header.i=@suse.de header.s=susede2_ed25519 header.b="3RlKmB/K"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=suse.de Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id l13si5408240jaq.11.2021.08.20.01.50.09; Fri, 20 Aug 2021 01:50:20 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.de header.s=susede2_rsa header.b=J6uwkYNy; dkim=neutral (no key) header.i=@suse.de header.s=susede2_ed25519 header.b="3RlKmB/K"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=suse.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231992AbhHTItL (ORCPT + 99 others); Fri, 20 Aug 2021 04:49:11 -0400 Received: from smtp-out1.suse.de ([195.135.220.28]:41396 "EHLO smtp-out1.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230450AbhHTItL (ORCPT ); Fri, 20 Aug 2021 04:49:11 -0400 Received: from imap1.suse-dmz.suse.de (imap1.suse-dmz.suse.de [192.168.254.73]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by smtp-out1.suse.de (Postfix) with ESMTPS id BDA2F22137; Fri, 20 Aug 2021 08:48:32 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_rsa; t=1629449312; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=VCaN0M5amMyZRXL8tzm/AX/hJ8arauZM7tvxoIPOVXY=; b=J6uwkYNyjhSkdRI9DQp/ktI23jMY43QGD4lspGzN7ETfV32kT89QCV/g6oDNNPY9BoI3ZI ewlvw+eL6vtEEqLBBCTfiEZj8vEqyDWw8bRKnxjQ45Lk6Ynt+mvkL5NGMET9PigBi4QDQ3 w3cNKXgQ2FEkKo468F9hAyAXjQvqD7s= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_ed25519; t=1629449312; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=VCaN0M5amMyZRXL8tzm/AX/hJ8arauZM7tvxoIPOVXY=; b=3RlKmB/KCREuzvOCADE6fhAT5sFxtWEWxg4Mo7SDzTGdpLPioigWW9+mzLBwAvA4pQR/fa 063pT9bYeXA3A1BQ== Received: from imap1.suse-dmz.suse.de (imap1.suse-dmz.suse.de [192.168.254.73]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by imap1.suse-dmz.suse.de (Postfix) with ESMTPS id AD6881333E; Fri, 20 Aug 2021 08:48:32 +0000 (UTC) Received: from dovecot-director2.suse.de ([192.168.254.65]) by imap1.suse-dmz.suse.de with ESMTPSA id kc8wKmBsH2FwTwAAGKfGzw (envelope-from ); Fri, 20 Aug 2021 08:48:32 +0000 Date: Fri, 20 Aug 2021 10:48:32 +0200 From: Daniel Wagner To: linux-nvme@lists.infradead.org Cc: linux-kernel@vger.kernel.org, James Smart , Keith Busch , Ming Lei , Sagi Grimberg , Hannes Reinecke , Wen Xiong , Himanshu Madhani Subject: Re: [PATCH v5 0/3] Handle update hardware queues and queue freeze more carefully Message-ID: <20210820084832.nlsbiztn26fv3b73@carbon.lan> References: <20210818120530.130501-1-dwagner@suse.de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20210818120530.130501-1-dwagner@suse.de> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Aug 18, 2021 at 02:05:27PM +0200, Daniel Wagner wrote: > I've dropped all non FC patches as they were bogus. I've retested this > version with all combinations and all looks good now. Also I gave > nvme-tcp a spin and again all is good. I forgot to mention I also dropped the first three patches from v4. Which seems to break her testing again. Wendy reported all her tests pass with Ming's V7 of 'blk-mq: fix blk_mq_alloc_request_hctx' and this series *only* if 'nvme-fc: Update hardware queues before using them' from previous version is also used. After starring at it once more, I think I finally understood the problem. So when we do ret = nvme_fc_create_hw_io_queues(ctrl, ctrl->ctrl.sqsize + 1); if (ret) goto out_free_io_queues; ret = nvme_fc_connect_io_queues(ctrl, ctrl->ctrl.sqsize + 1); if (ret) goto out_delete_hw_queues; and the number of queues has changed, the connect call will fail: nvme2: NVME-FC{2}: create association : host wwpn 0x100000109b5a4dfa rport wwpn 0x50050768101935e5: NQN "nqn.1986-03.com.ibm:nvme:2145.0000020420006CEA" nvme2: Connect command failed, error wo/DNR bit: -16389 and we stop the current reconnect attempt and reschedule a new reconnect attempt: nvme2: NVME-FC{2}: reset: Reconnect attempt failed (-5) nvme2: NVME-FC{2}: Reconnect attempt in 2 seconds Then we try to do the same thing again which fails, thus we never make progress. So clearly we need to update number of queues at one point. What would be the right thing to do here? As I understood we need to be careful with frozen requests. Can we abort them (is this even possible in this state?) and requeue them before we update the queue numbers? Daniel