Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp6207430ybv; Tue, 18 Feb 2020 12:02:54 -0800 (PST) X-Google-Smtp-Source: APXvYqzRlGroQa/ScAhKm1/OiDMLki1a5Kd2BDkgg5WmYVemBGIAgeIZTvZRPVRcpTSXLRocLb+T X-Received: by 2002:aca:2118:: with SMTP id 24mr2392384oiz.28.1582056174174; Tue, 18 Feb 2020 12:02:54 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1582056174; cv=none; d=google.com; s=arc-20160816; b=GjzWVmaB7RAvttteREM4/z4qUwuK2fFJIs20KE25VcsTj38D6dvMjRjOxJRTExA3H6 b5rnee7r5mNXxKHCmnPCVFjMwsXrmuDRkzGlKWwqiusKOcj0bO1po+HY+XP3qQMOi3Zw xvvxrqaIQYzTb4ZtacAdIK2MnBig5IKUCB81DcC796PYLSmRaQsjthnlVKn2VTZODIrQ 0UvLwOEKbuLzhS73Ugi88FRUIBWfUKE1A4S3AD2r0FKMZDBcLtIxbamH34iFZIDHdmi9 eD8wXw3fvR7KHJqa5JGO5QFarrpZ4s9j4qWEfHK+haNezncnT+gUG3bq7m4BCYoWRNqY Hbzw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=VNgpwvi2R4GHX8bSqIZ7ndWyEvf3rKM55Me+AIUhaN0=; b=arS6gPjak2YTC4RqFXWEiDJp3qOBLLy42yZuSGTAdyqXOmGatMkScoVqQ/eqVtdV4n yJ8HQ6tG1nSC/BYrNSH1NgIt3N/0zoa4sHx0wjoozukr+O0gQb/r0LLGtJfGvlnhpJYo AQCirlw5A2e7tHy5wMPpQA4GEVA1wO0+vIWQHOofoEY4t76e+b+DW1yjjIHuxxAYeJsy lDwrZmDe6Z1FOaxsCJESJN8jYsI4yzuj8v/Mh04yIC/kacbQoE62cQVa+/si8e1vYNip ALXUC9Ziz/f1lMv5vMkN73YqbiIPS/TLQYkJ2p3iIhXgI3QzsOb1yVIsU96uYgNtHV6F wDsw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=JBPj6XnX; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k9si8606806oiw.262.2020.02.18.12.02.42; Tue, 18 Feb 2020 12:02:54 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=JBPj6XnX; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728979AbgBRUCh (ORCPT + 99 others); Tue, 18 Feb 2020 15:02:37 -0500 Received: from mail.kernel.org ([198.145.29.99]:43122 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728970AbgBRUCg (ORCPT ); Tue, 18 Feb 2020 15:02:36 -0500 Received: from localhost (83-86-89-107.cable.dynamic.v4.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id B9B2E21D56; Tue, 18 Feb 2020 20:02:35 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1582056156; bh=r3Wt9Fqia8ELqoZcBvGRbikshi1Mn5+KDL7u1kFTmjA=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=JBPj6XnXlmUHvT/28JZ3EOJ4YPCTZPWo5+zTbTHYod89lMH902tRVwdDsUqFPdtX+ teI2YF1MtXxac96W+C86o22BDHCM16R3RGUb5Ce1DURMpEgkE5KmscUjbm0yHy/Ivr JNvsELKM7kpp8ssZfVqkfJ8MyGSouIvsJQC/asto= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Zhu Yanjun , Leon Romanovsky , Jason Gunthorpe Subject: [PATCH 5.5 56/80] RDMA/rxe: Fix soft lockup problem due to using tasklets in softirq Date: Tue, 18 Feb 2020 20:55:17 +0100 Message-Id: <20200218190437.492485814@linuxfoundation.org> X-Mailer: git-send-email 2.25.1 In-Reply-To: <20200218190432.043414522@linuxfoundation.org> References: <20200218190432.043414522@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Zhu Yanjun commit 8ac0e6641c7ca14833a2a8c6f13d8e0a435e535c upstream. When run stress tests with RXE, the following Call Traces often occur watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [swapper/2:0] ... Call Trace: create_object+0x3f/0x3b0 kmem_cache_alloc_node_trace+0x129/0x2d0 __kmalloc_reserve.isra.52+0x2e/0x80 __alloc_skb+0x83/0x270 rxe_init_packet+0x99/0x150 [rdma_rxe] rxe_requester+0x34e/0x11a0 [rdma_rxe] rxe_do_task+0x85/0xf0 [rdma_rxe] tasklet_action_common.isra.21+0xeb/0x100 __do_softirq+0xd0/0x298 irq_exit+0xc5/0xd0 smp_apic_timer_interrupt+0x68/0x120 apic_timer_interrupt+0xf/0x20 ... The root cause is that tasklet is actually a softirq. In a tasklet handler, another softirq handler is triggered. Usually these softirq handlers run on the same cpu core. So this will cause "soft lockup Bug". Fixes: 8700e3e7c485 ("Soft RoCE driver") Link: https://lore.kernel.org/r/20200212072635.682689-8-leon@kernel.org Signed-off-by: Zhu Yanjun Signed-off-by: Leon Romanovsky Signed-off-by: Jason Gunthorpe Signed-off-by: Greg Kroah-Hartman --- drivers/infiniband/sw/rxe/rxe_comp.c | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) --- a/drivers/infiniband/sw/rxe/rxe_comp.c +++ b/drivers/infiniband/sw/rxe/rxe_comp.c @@ -329,7 +329,7 @@ static inline enum comp_state check_ack( qp->comp.psn = pkt->psn; if (qp->req.wait_psn) { qp->req.wait_psn = 0; - rxe_run_task(&qp->req.task, 1); + rxe_run_task(&qp->req.task, 0); } } return COMPST_ERROR_RETRY; @@ -463,7 +463,7 @@ static void do_complete(struct rxe_qp *q */ if (qp->req.wait_fence) { qp->req.wait_fence = 0; - rxe_run_task(&qp->req.task, 1); + rxe_run_task(&qp->req.task, 0); } } @@ -479,7 +479,7 @@ static inline enum comp_state complete_a if (qp->req.need_rd_atomic) { qp->comp.timeout_retry = 0; qp->req.need_rd_atomic = 0; - rxe_run_task(&qp->req.task, 1); + rxe_run_task(&qp->req.task, 0); } } @@ -725,7 +725,7 @@ int rxe_completer(void *arg) RXE_CNT_COMP_RETRY); qp->req.need_retry = 1; qp->comp.started_retry = 1; - rxe_run_task(&qp->req.task, 1); + rxe_run_task(&qp->req.task, 0); } if (pkt) {