Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp1776827ybl; Tue, 3 Dec 2019 12:28:59 -0800 (PST) X-Google-Smtp-Source: APXvYqzbphfShC8Y/2TPOGjtQk4KiaXFhLcEZKibWtFXxseDwBQyg3L3/0gfKRtmBwVonH0E0fn3 X-Received: by 2002:aca:c7cb:: with SMTP id x194mr5536869oif.157.1575404939323; Tue, 03 Dec 2019 12:28:59 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1575404939; cv=none; d=google.com; s=arc-20160816; b=XPfkSumdr2PRrtG+1WFdif2oozY321aSHFQYFfUDN5guQ7kj3tX4hDjmFnpLNq7kY5 qLwPHkBBV78WpDsIIm7fktYUxevmwUM8ps66MHrf/xuCIb7nEwCXaHRxsJpB2zMkpx3T uS0MowZWj7AbmowtH/o4t8K+xAkJ38bD1utA6l2GU/7r2IlwwLmdU0y/Ygh/yby/LRig zSaSPWewDyT8aesjpxJ/oLKUQLtNpgQpGuA7RmKUEax26S0AojOJLppoHbjYUuFcWM4Y CApevfCPJ2ad1uEa+QElxbMO/my/BDj8oJvimyUpapZDMHh+ac4Lkzb9ONR8rHI1FqE3 k/Jw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:cc:to:from :dkim-signature; bh=QA6N7ZZcbmK8/a59EgCuBTRkEVqdTx5F9l4vgf5SIAE=; b=V+9/uMXZfU8trG35/0rGPiMioQyNYz4kb/ok79gJP7NYf0jTkP/KLoPEKmenMVwRoR UHfTwtMAYgDyktDVjGUmCfGywg6LxFoo6wCs6W9P4WGc9PL81rVu1Bp3jq55LUHtZaQz MKVwV6deE5hysBv9V4YSDybRm1xFc/QLzD/rkpHWj6O4pECjcPg5q+1cBbPF7eP53u6p 77+mqBEbpDrZVQzsOlfYni1f7qgyg0/bLtVZKSjhJjw8ak10boWReGyMa7G1T4pByQbL Kd+P21reyJiWVzRhqZZjpC+5CVV0ZEubJlIi1QjC0Gh1fSGr1iJ214V72RiRTECYxxkF 4YbA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=iqbReJki; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 7si1884677oir.183.2019.12.03.12.28.44; Tue, 03 Dec 2019 12:28:59 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=iqbReJki; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727234AbfLCU14 (ORCPT + 99 others); Tue, 3 Dec 2019 15:27:56 -0500 Received: from mail-wr1-f65.google.com ([209.85.221.65]:34929 "EHLO mail-wr1-f65.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726567AbfLCU14 (ORCPT ); Tue, 3 Dec 2019 15:27:56 -0500 Received: by mail-wr1-f65.google.com with SMTP id g17so5486113wro.2 for ; Tue, 03 Dec 2019 12:27:54 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id; bh=QA6N7ZZcbmK8/a59EgCuBTRkEVqdTx5F9l4vgf5SIAE=; b=iqbReJkif4rmiO17LjWyIiQlsRR+YKdoKyUcsImbQoY1qxYlcR/xn7t6ZMP8ZTX5XP YzBl0yEeIeZB5QjWGDiJZiklZ4+CR7rVGQzTX283K/rKsY12MW/sKz00Y5GZG8OrMwyw k/0aMiBkQJt8O17PZ9WcDSw2evEXbqRocbh0o2J2DYASPtpGqWOtMJfT2jf6+bnwPzGM lRUbXZ8NxgzyiOw/i5+brLzjLb/XYkXHMnt5/+Rmn/sWLiF0eywLhHyCm/OpOmuXnoFW L45+wjH7EvtAS58E9x3eZfGNAsiwVvZMtzDO48aj/iz2etcqi1/ezemrHtLb4UNi68o4 ts2g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id; bh=QA6N7ZZcbmK8/a59EgCuBTRkEVqdTx5F9l4vgf5SIAE=; b=ifX8gmVMGge+hYof8/bWWSdeS7KhXzQ/BzufMd4/E457EAdW35JuY4mIVNsPCjcrnL HVXDi4xDISjXKz1yM/lzaR/A9Z9H36iSQKBFj8vcohBMi6pJMN4anP4Vfhslc9NHN06M ZPG4I0nbQxRBHay5sHDUAOhaVi/haP9Q+aFtpZvku5JWM9Bdhv7XdvY2F84h6biI76u0 9SKYvBpYq8JWAfxAsG7heHf7xqsiIgCq+voZhZDASbq3vJJpvFzBETqGkjmaQAG7wcDE Dq69zbqZwjeyoJGHqXkQ3KQ2HNf5Qk+aMInzsEYygSOXZHnS2p3CGesmSR0LXYxeEZPQ KFiQ== X-Gm-Message-State: APjAAAVGTq/o3qjSDzj1Ze3C6P9R0ew8NLYN8ai6V6qQXJ1bdUU+5UJd +STE7mVxzpbGvGP4TpwU0WnDNoUtTtg= X-Received: by 2002:adf:e6cb:: with SMTP id y11mr7396518wrm.345.1575404873554; Tue, 03 Dec 2019 12:27:53 -0800 (PST) Received: from ogabbay-VM.habana-labs.com ([31.154.190.6]) by smtp.gmail.com with ESMTPSA id 5sm5000176wrh.5.2019.12.03.12.27.52 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 03 Dec 2019 12:27:52 -0800 (PST) From: Oded Gabbay To: linux-kernel@vger.kernel.org, oshpigelman@habana.ai, ttayar@habana.ai Cc: gregkh@linuxfoundation.org Subject: [PATCH] habanalabs: rate limit error msg on waiting for CS Date: Tue, 3 Dec 2019 22:27:50 +0200 Message-Id: <20191203202750.9498-1-oded.gabbay@gmail.com> X-Mailer: git-send-email 2.17.1 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org In case a user submits a CS, and the submission fails, and the user doesn't check the return value and instead use the error return value as a valid sequence number of a CS and ask to wait on it, the driver will print an error and return an error code for that wait. The real problem happens if now the user ignores the error of the wait, and try to wait again and again. This can lead to a flood of error messages from the driver and even soft lockup event. Signed-off-by: Oded Gabbay --- drivers/misc/habanalabs/command_submission.c | 5 +++-- drivers/misc/habanalabs/context.c | 2 +- 2 files changed, 4 insertions(+), 3 deletions(-) diff --git a/drivers/misc/habanalabs/command_submission.c b/drivers/misc/habanalabs/command_submission.c index 8850f475a413..0bf08678431b 100644 --- a/drivers/misc/habanalabs/command_submission.c +++ b/drivers/misc/habanalabs/command_submission.c @@ -824,8 +824,9 @@ int hl_cs_wait_ioctl(struct hl_fpriv *hpriv, void *data) memset(args, 0, sizeof(*args)); if (rc < 0) { - dev_err(hdev->dev, "Error %ld on waiting for CS handle %llu\n", - rc, seq); + dev_err_ratelimited(hdev->dev, + "Error %ld on waiting for CS handle %llu\n", + rc, seq); if (rc == -ERESTARTSYS) { args->out.status = HL_WAIT_CS_STATUS_INTERRUPTED; rc = -EINTR; diff --git a/drivers/misc/habanalabs/context.c b/drivers/misc/habanalabs/context.c index 17db7b3dfb4c..2df6fb87e7ff 100644 --- a/drivers/misc/habanalabs/context.c +++ b/drivers/misc/habanalabs/context.c @@ -176,7 +176,7 @@ struct dma_fence *hl_ctx_get_fence(struct hl_ctx *ctx, u64 seq) spin_lock(&ctx->cs_lock); if (seq >= ctx->cs_sequence) { - dev_notice(hdev->dev, + dev_notice_ratelimited(hdev->dev, "Can't wait on seq %llu because current CS is at seq %llu\n", seq, ctx->cs_sequence); spin_unlock(&ctx->cs_lock); -- 2.17.1