Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp721621ybl; Fri, 24 Jan 2020 08:22:01 -0800 (PST) X-Google-Smtp-Source: APXvYqwRNmg8ZEq+MkphOpiJuDGQlIJvirZL0CvfcbDj26piRLHdr2Du+tvK81MWdzNDSloMqMl4 X-Received: by 2002:aca:c786:: with SMTP id x128mr24490oif.2.1579882921491; Fri, 24 Jan 2020 08:22:01 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1579882921; cv=none; d=google.com; s=arc-20160816; b=oellTfkVjqeOolo5D3vTrxBeK4/GGIZPj1otZ0lkB2nkRmvUzmG+pj2a4Ln1QQdFPw s7x70734GuIicXxVBR4GMtKcRDV4M20vol4ArHDU7oHS/PVKQWFKyijDqsAcb1JsjPeJ IJpAYyn073gOhxUg+i5qcjBJc9XiG6RJXj/RYIZoFpNLnAsA75bbKChspfa2rbhBU7TF AR+xL49RlPUXkc+ohGueibyT6e5A7bnr7Qu5dQaB2vJyHB8g+xB20+f/6+bEhkpyPNPo 3VWCKynOmcj9NFATHXQ4xKHoUtpEJ5ReeAQs6gSpNvB2Q4ZVHVJfeqyFj1gY2j3Qh85+ iHIQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=cX0vr5FzySq8J7szX10WRFwjagyP60N5WTjlK9oyDvk=; b=nP6s+DQrgOttfxUwgSaeEP3hP7/PeKe5K+wbH97IzrnFdED+r1a84+qHU8vnHWSRW1 4I1+J0bBCpKjVoVjhUrOpzurt5/UQ+q1YCzv1RinFgv29GmMIZRDcHZAm5rRdkhe4e/d fHaqt4c+pSZ2amCRZHDIrwsYUwEWa5rPpmdj3AcUbwf8ESSQyWZJTfsF3xhvqZiFfPLl r7EbblaiuzXxCyj4YozL0Fnb56vLHGjKO92KHKImUX6eZGbBN09FBlEyODvEI/CToCFT +kLbeiix5cX6z965PW2BjSonY97+NojOCUMw8DAGoCgpbyKhjBlGSSQCrgZYzdMh5QQo hyZQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=0gdYHNxV; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g17si32203otr.261.2020.01.24.08.21.49; Fri, 24 Jan 2020 08:22:01 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=0gdYHNxV; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2389392AbgAXLHD (ORCPT + 99 others); Fri, 24 Jan 2020 06:07:03 -0500 Received: from mail.kernel.org ([198.145.29.99]:42010 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2389024AbgAXLHA (ORCPT ); Fri, 24 Jan 2020 06:07:00 -0500 Received: from localhost (ip-213-127-102-57.ip.prioritytelecom.net [213.127.102.57]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 599202077C; Fri, 24 Jan 2020 11:06:59 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1579864019; bh=VJaTf0bIBHAPHo5d81+P3os0Sa1rObqPVzeEvZ77Iho=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=0gdYHNxV6bcksWEA99sIUQ98Scs0HU2y4eeLsQMVGvaGtJ8ygnBA1cynIzfdNuWRi 3zJVkz3NHUJgrLtxEqjLuL1LEDxduz6pYBeQLKjdaje/O7hGcfGj8809xXTGR4Jlkw hb9BrkBr5Xd7Y8AJVFUjM1LV3XHb+VH+To2sPMDM= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Moni Shoua , Leon Romanovsky , Saeed Mahameed , Sasha Levin Subject: [PATCH 4.19 149/639] net/mlx5: Take lock with IRQs disabled to avoid deadlock Date: Fri, 24 Jan 2020 10:25:19 +0100 Message-Id: <20200124093105.887220381@linuxfoundation.org> X-Mailer: git-send-email 2.25.0 In-Reply-To: <20200124093047.008739095@linuxfoundation.org> References: <20200124093047.008739095@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Moni Shoua [ Upstream commit 33814e5d127e21f53b52e17b0722c1b57d4f4d29 ] The lock in qp_table might be taken from process context or from interrupt context. This may lead to a deadlock unless it is taken with IRQs disabled. Discovered by lockdep ================================ WARNING: inconsistent lock state 4.20.0-rc6 -------------------------------- inconsistent {HARDIRQ-ON-W} -> {IN-HARDIRQ-W} python/12572 [HC1[1]:SC0[0]:HE0:SE1] takes: 00000000052a4df4 (&(&table->lock)->rlock#2){?.+.}, /0x50 [mlx5_core] {HARDIRQ-ON-W} state was registered at: _raw_spin_lock+0x33/0x70 mlx5_get_rsc+0x1a/0x50 [mlx5_core] mlx5_ib_eqe_pf_action+0x493/0x1be0 [mlx5_ib] process_one_work+0x90c/0x1820 worker_thread+0x87/0xbb0 kthread+0x320/0x3e0 ret_from_fork+0x24/0x30 irq event stamp: 103928 hardirqs last enabled at (103927): [] nk+0x1a/0x1c hardirqs last disabled at (103928): [] unk+0x1a/0x1c softirqs last enabled at (103924): [] tcp_sendmsg+0x31/0x40 softirqs last disabled at (103922): [] 80 other info that might help us debug this: Possible unsafe locking scenario: CPU0 ---- lock(&(&table->lock)->rlock#2); lock(&(&table->lock)->rlock#2); *** DEADLOCK *** Fixes: 032080ab43ac ("IB/mlx5: Lock QP during page fault handling") Signed-off-by: Moni Shoua Reviewed-by: Leon Romanovsky Signed-off-by: Saeed Mahameed Signed-off-by: Sasha Levin --- drivers/net/ethernet/mellanox/mlx5/core/qp.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/drivers/net/ethernet/mellanox/mlx5/core/qp.c b/drivers/net/ethernet/mellanox/mlx5/core/qp.c index f33707ce8b6b0..479ac21cdbc69 100644 --- a/drivers/net/ethernet/mellanox/mlx5/core/qp.c +++ b/drivers/net/ethernet/mellanox/mlx5/core/qp.c @@ -44,14 +44,15 @@ static struct mlx5_core_rsc_common *mlx5_get_rsc(struct mlx5_core_dev *dev, { struct mlx5_qp_table *table = &dev->priv.qp_table; struct mlx5_core_rsc_common *common; + unsigned long flags; - spin_lock(&table->lock); + spin_lock_irqsave(&table->lock, flags); common = radix_tree_lookup(&table->tree, rsn); if (common) atomic_inc(&common->refcount); - spin_unlock(&table->lock); + spin_unlock_irqrestore(&table->lock, flags); if (!common) { mlx5_core_warn(dev, "Async event for bogus resource 0x%x\n", -- 2.20.1