Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp983271imu; Fri, 7 Dec 2018 12:06:32 -0800 (PST) X-Google-Smtp-Source: AFSGD/VQl9rS/O+BNwrNKk9ccHkaFh0g22a6jgWAuokJh/N5kkVEG0lOJ3wSkWZU5P15yMNEhmDO X-Received: by 2002:a63:91c1:: with SMTP id l184mr3149637pge.29.1544213192635; Fri, 07 Dec 2018 12:06:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1544213192; cv=none; d=google.com; s=arc-20160816; b=WPxnZsK1Xgzo6I5iVq0MQFfONlHie+MUAPOTnnw+XmUygorqv3chML1sRls/zVtKXU cqb3VJgwtLe5oO9cZKLpqwgrchwTTQs9kUy7CZJUo/GNm/28my2B80HsaOSR/9TukQGw 2A+FwPYEAXDDXiBK4PS4ctZNeOCTdJdHDLL1EIYF7jfT6LSJwO4yfOwgSPdiISZ9qoU8 8EVRsbsnJHC2XBzlw8AaJIZGCOOdyiFjKdQSdqoij2GDqlBAo6VjmGsBoYMxUDDAmKqK eQCn7FieN+1WqkDPPNqPlG+jdCvFySWUdrTKMhYrU+F/LUZNjYXAvDWyUffzPRle0+Bi JSSQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=SrD5V7i3XzpVTB6vFwreQ6ci8wRoV2qPngTUFYcqUCM=; b=PA4YpEu8TNwSGmtI++9cdZEuwkmKUeM13TMJCvn4M94uUotEnaIHnDBl113a2aZzl9 JbupEfzo4pQPLueklWfCDtCIzI9jDkTaJiG0qCn8EwZUCjSQDRFBINfVmzvN5V0q4Hoc LPGztqfo8Mt5VdL+eOagD5VwK0O71bYcJ2gmf5X+QA32OLX0obvWY+yz5m2eaRfvxet/ 8qAstpFq4gQ+Q/jcDZznKlLLtHtHBq4mibffyxRSkhFLW0f1RFFhboXzFlnc8Hiis2Xm 8L19MlCwGu5+HQ/YglUUsMxVmspaj+5BqBHcovPaR7yb8p1qnhMjR0ypmXxawu116lAB JFCg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 123si3987561pfx.109.2018.12.07.12.06.17; Fri, 07 Dec 2018 12:06:32 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726114AbeLGUFm (ORCPT + 99 others); Fri, 7 Dec 2018 15:05:42 -0500 Received: from mail-oi1-f195.google.com ([209.85.167.195]:44534 "EHLO mail-oi1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726041AbeLGUFl (ORCPT ); Fri, 7 Dec 2018 15:05:41 -0500 Received: by mail-oi1-f195.google.com with SMTP id m6so4329730oig.11 for ; Fri, 07 Dec 2018 12:05:41 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=SrD5V7i3XzpVTB6vFwreQ6ci8wRoV2qPngTUFYcqUCM=; b=g2puUEPR8Z3myC1eSqGJUqzzfhK4DxWSTGNfYtDWEpZVomLoXnD9dJCk9XWbikqLzP v76yCbn5T3/xu6H9N2ZQfdsymtpdauzliLFIm6umG+nI5U0iBv0krTuPJDGpSE0rdyHq 0/TzcAABB1UYG7glhRLHAEgGsiKqFAtem58wkiHVxHxFhhh2WkKdf367W7ha1taQ1LWd p+fERR0VP6inH8jn9LHrmR7PyQ7ONkFvZTU1Hc6AEkmwqnJhAxNeQJaqG2C2flPKd0va JciITRjU3dk3am68zcSD75AajQyeDdb6AvW9NmGlG7KkZqOsgy0jE8cwi8oXsi7a1KNS txXA== X-Gm-Message-State: AA+aEWaKKgVGiKV871NNOalBFWYSZjia+ad+MU7FPZy6cZx1LYrtNI5b 000eEpamtqEW3UXdxtBOHGk= X-Received: by 2002:aca:53cd:: with SMTP id h196mr2129759oib.355.1544213140831; Fri, 07 Dec 2018 12:05:40 -0800 (PST) Received: from ?IPv6:2600:1700:65a0:78e0:514:7862:1503:8e4d? ([2600:1700:65a0:78e0:514:7862:1503:8e4d]) by smtp.gmail.com with ESMTPSA id d10sm1831665otl.62.2018.12.07.12.05.38 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 07 Dec 2018 12:05:40 -0800 (PST) Subject: Re: [PATCH] nvme-rdma: complete requests from ->timeout To: Jaesoo Lee Cc: keith.busch@intel.com, axboe@fb.com, hch@lst.de, linux-nvme@lists.infradead.org, linux-kernel@vger.kernel.org, Prabhath Sajeepa , Roland Dreier , Ashish Karkare References: <1543535954-28073-1-git-send-email-jalee@purestorage.com> From: Sagi Grimberg Message-ID: <2055d5b5-2c27-b5a2-e3a0-75146c7bd227@grimberg.me> Date: Fri, 7 Dec 2018 12:05:37 -0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.2.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > Could you please take a look at this bug and code review? > > We are seeing more instances of this bug and found that reconnect_work > could hang as well, as can be seen from below stacktrace. > > Workqueue: nvme-wq nvme_rdma_reconnect_ctrl_work [nvme_rdma] > Call Trace: > __schedule+0x2ab/0x880 > schedule+0x36/0x80 > schedule_timeout+0x161/0x300 > ? __next_timer_interrupt+0xe0/0xe0 > io_schedule_timeout+0x1e/0x50 > wait_for_completion_io_timeout+0x130/0x1a0 > ? wake_up_q+0x80/0x80 > blk_execute_rq+0x6e/0xa0 > __nvme_submit_sync_cmd+0x6e/0xe0 > nvmf_connect_admin_queue+0x128/0x190 [nvme_fabrics] > ? wait_for_completion_interruptible_timeout+0x157/0x1b0 > nvme_rdma_start_queue+0x5e/0x90 [nvme_rdma] > nvme_rdma_setup_ctrl+0x1b4/0x730 [nvme_rdma] > nvme_rdma_reconnect_ctrl_work+0x27/0x70 [nvme_rdma] > process_one_work+0x179/0x390 > worker_thread+0x4f/0x3e0 > kthread+0x105/0x140 > ? max_active_store+0x80/0x80 > ? kthread_bind+0x20/0x20 > > This bug is produced by setting MTU of RoCE interface to '568' for > test while running I/O traffics. I think that with the latest changes from Keith we can no longer rely on blk-mq to barrier racing completions. We will probably need to barrier ourselves in nvme-rdma...