Received: by 2002:a05:6a10:1d13:0:0:0:0 with SMTP id pp19csp1553639pxb; Fri, 20 Aug 2021 08:12:37 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzEbyYzHlJvq0uV2r9QVj2MIZkAKfiEfgeR+t57lDMRbmVEdvliJ9Mbrp0lOFG2PZHBJfn3 X-Received: by 2002:a2e:a90b:: with SMTP id j11mr16329151ljq.475.1629472357468; Fri, 20 Aug 2021 08:12:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1629472357; cv=none; d=google.com; s=arc-20160816; b=RGviHknTxbSZyZGysRtmngKOakCHiYkIOCxDeGOIeNjwViuYV12ifL5q1w39vAOYDP jQTmdkYVO5yGE4bjgHgBBF0dVCFo0Fyjnemr5Lgb6NW8mq3PUHmPie9xQ/p3W7O0ZBUq qWTZ53ghhvDT6CJpbPrSfCgZ536wVwo/lDFWN0NKL942O5W9utsQ/nQWvbAvRFxr5Fbv 3cf2rPEx4TOWVUf7wEd5IhZoE1j2sccX+8x3U9P/Vzh8w7eYyKIAyaHYR/Td1ac1lO7v XlvfXqOrByMJcBEhdPnmXLuDg9EkjqRpcleN89fKzCVYOe3zEhMJCdHkvyLTJZFumTae QVjg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:references:in-reply-to:message-id:date:subject :to:from:dkim-signature; bh=8oKKyxGUMHtBxQa8BQEOvJV2xTYMQ1bHs44R25ou5rE=; b=g0jWsQ+GEcQutO1fZzAXfX8lv+hNnF2p5TrY5RaKLZZ1YQxTM2CAzbddAb3nGgvHBv pHdp8V0cCymGhMrQb/7JelSvwCetrD7+SlhVEscOhX4zsTLawnjVyB7zGiXC8lcsp9Td quRHYP+DdM4pHLRyh1tGhPucekhFkiUDGp4d3NxRpWgM6IfGL9pZU22c7iC7gp9Hbw5E d0GadcDJ7SY0WdeLMUD2zpD8pM6iD84PDrGIaiuwqkJVPxWJlmo0L+CTLDG4/WHCjW9/ OkNpYr8HniUSDGowYSNHBfCQJ39yrSdsDrmptMZcWdRFtQutbm0nf0T9yPeD7/7TDifX x+wQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=OBjnP7Lu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id m12si6839047ejg.274.2021.08.20.08.12.05; Fri, 20 Aug 2021 08:12:37 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=OBjnP7Lu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240967AbhHTPIH (ORCPT + 99 others); Fri, 20 Aug 2021 11:08:07 -0400 Received: from mail.kernel.org ([198.145.29.99]:50360 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240956AbhHTPIG (ORCPT ); Fri, 20 Aug 2021 11:08:06 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 4D18D60FDA for ; Fri, 20 Aug 2021 15:07:28 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1629472048; bh=tWufcxFkKtxB7AT4OiAkVQ5Q/bP625jdgo4IPuIhtJA=; h=From:To:Subject:Date:In-Reply-To:References:From; b=OBjnP7LuhmV64IZcPMGG04g7VUPDl1VbuV7uz+aedQv3j8x7qIXPSOtRxAz1i1Zd5 0lCWMozyWnIF2/FoygycH6z0gZ0GmMFpnfCOGPIanJoYARu6OVsUtA/QSY1TvH05PB GgTvNFLz45n6cCWHwyd62F9MGjIjVt+iZw1AojZx/OAxdu6VpRNCO08WevD3tvIfBM hUWFiXRLzu+KfctrktC2coKh00652kvPqRHsgBe7qSeNTLv5fD6nhRXKQ+a2PjZKOO h4l8xDh0NktYy1DXwNAyfyJq9QP6CWWvZzAVPP5c3kqp7xws0HlBZP+GONRexX9ZiX X7sFalt4+ZN5A== From: Oded Gabbay To: linux-kernel@vger.kernel.org Subject: [PATCH 2/4] habanalabs: disable IRQ in user interrupts spinlock Date: Fri, 20 Aug 2021 18:07:17 +0300 Message-Id: <20210820150719.67934-2-ogabbay@kernel.org> X-Mailer: git-send-email 2.17.1 In-Reply-To: <20210820150719.67934-1-ogabbay@kernel.org> References: <20210820150719.67934-1-ogabbay@kernel.org> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Because this spinlock is taken in an interrupt handler, we must use the spin_lock_irqsave/irqrestore version to disable the interrupts on the local CPU. Otherwise, we can have a potential deadlock (if the interrupt handler is scheduled to run on the same cpu that the code who took the lock was running on). Signed-off-by: Oded Gabbay --- .../habanalabs/common/command_submission.c | 25 ++++++++++--------- 1 file changed, 13 insertions(+), 12 deletions(-) diff --git a/drivers/misc/habanalabs/common/command_submission.c b/drivers/misc/habanalabs/common/command_submission.c index 3a67265312ee..8a2f75de6df8 100644 --- a/drivers/misc/habanalabs/common/command_submission.c +++ b/drivers/misc/habanalabs/common/command_submission.c @@ -923,13 +923,14 @@ static void wake_pending_user_interrupt_threads(struct hl_user_interrupt *interrupt) { struct hl_user_pending_interrupt *pend; + unsigned long flags; - spin_lock(&interrupt->wait_list_lock); + spin_lock_irqsave(&interrupt->wait_list_lock, flags); list_for_each_entry(pend, &interrupt->wait_list_head, wait_list_node) { pend->fence.error = -EIO; complete_all(&pend->fence.completion); } - spin_unlock(&interrupt->wait_list_lock); + spin_unlock_irqrestore(&interrupt->wait_list_lock, flags); } void hl_release_pending_user_interrupts(struct hl_device *hdev) @@ -2714,9 +2715,9 @@ static int _hl_interrupt_wait_ioctl(struct hl_device *hdev, struct hl_ctx *ctx, { struct hl_user_pending_interrupt *pend; struct hl_user_interrupt *interrupt; - unsigned long timeout; - long completion_rc; + unsigned long timeout, flags; u32 completion_value; + long completion_rc; int rc = 0; if (timeout_us == U32_MAX) @@ -2739,7 +2740,7 @@ static int _hl_interrupt_wait_ioctl(struct hl_device *hdev, struct hl_ctx *ctx, else interrupt = &hdev->user_interrupt[interrupt_offset]; - spin_lock(&interrupt->wait_list_lock); + spin_lock_irqsave(&interrupt->wait_list_lock, flags); if (!hl_device_operational(hdev, NULL)) { rc = -EPERM; goto unlock_and_free_fence; @@ -2765,7 +2766,7 @@ static int _hl_interrupt_wait_ioctl(struct hl_device *hdev, struct hl_ctx *ctx, * handler to monitor */ list_add_tail(&pend->wait_list_node, &interrupt->wait_list_head); - spin_unlock(&interrupt->wait_list_lock); + spin_unlock_irqrestore(&interrupt->wait_list_lock, flags); wait_again: /* Wait for interrupt handler to signal completion */ @@ -2777,12 +2778,12 @@ static int _hl_interrupt_wait_ioctl(struct hl_device *hdev, struct hl_ctx *ctx, * If comparison fails, keep waiting until timeout expires */ if (completion_rc > 0) { - spin_lock(&interrupt->wait_list_lock); + spin_lock_irqsave(&interrupt->wait_list_lock, flags); if (copy_from_user(&completion_value, u64_to_user_ptr(user_address), 4)) { - spin_unlock(&interrupt->wait_list_lock); + spin_unlock_irqrestore(&interrupt->wait_list_lock, flags); dev_err(hdev->dev, "Failed to copy completion value from user\n"); @@ -2792,13 +2793,13 @@ static int _hl_interrupt_wait_ioctl(struct hl_device *hdev, struct hl_ctx *ctx, } if (completion_value >= target_value) { - spin_unlock(&interrupt->wait_list_lock); + spin_unlock_irqrestore(&interrupt->wait_list_lock, flags); *status = CS_WAIT_STATUS_COMPLETED; } else { reinit_completion(&pend->fence.completion); timeout = completion_rc; - spin_unlock(&interrupt->wait_list_lock); + spin_unlock_irqrestore(&interrupt->wait_list_lock, flags); goto wait_again; } } else if (completion_rc == -ERESTARTSYS) { @@ -2812,11 +2813,11 @@ static int _hl_interrupt_wait_ioctl(struct hl_device *hdev, struct hl_ctx *ctx, } remove_pending_user_interrupt: - spin_lock(&interrupt->wait_list_lock); + spin_lock_irqsave(&interrupt->wait_list_lock, flags); list_del(&pend->wait_list_node); unlock_and_free_fence: - spin_unlock(&interrupt->wait_list_lock); + spin_unlock_irqrestore(&interrupt->wait_list_lock, flags); kfree(pend); hl_ctx_put(ctx); -- 2.17.1