From: Oded Gabbay
To: dri-devel@lists.freedesktop.org, linux-kernel@vger.kernel.org
Cc: Tomer Tayar
Subject: [PATCH 5/5] accel/habanalabs/gaudi2: avoid overriding existing undefined opcode data
Date: Thu, 7 Dec 2023 14:24:44 +0200
Message-Id: <20231207122444.50512-5-ogabbay@kernel.org>
In-Reply-To: <20231207122444.50512-1-ogabbay@kernel.org>
References: <20231207122444.50512-1-ogabbay@kernel.org>

From: Tomer Tayar

Part of the undefined opcode data is updated in
gaudi2_handle_qman_err_generic() and part in
handle_lower_qman_data_on_err().
However, the 'write_enable' flag is checked only in
gaudi2_handle_qman_err_generic(), so information from more than a
single error can be mixed there.
Moreover, handle_lower_qman_data_on_err() is called only for the lower
QMAN, so for an error in the upper QMAN only partial info is captured.

Move all the data updates to a single place, protected by the
'write_enable' flag.
As mainly the lower QMAN's info is interesting, avoid saving the
partial info for the upper QMAN.

Signed-off-by: Tomer Tayar
Reviewed-by: Oded Gabbay
Signed-off-by: Oded Gabbay
---
 drivers/accel/habanalabs/gaudi2/gaudi2.c | 40 +++++++++++-------------
 1 file changed, 19 insertions(+), 21 deletions(-)

diff --git a/drivers/accel/habanalabs/gaudi2/gaudi2.c b/drivers/accel/habanalabs/gaudi2/gaudi2.c
index f81b57649b00..e0e5615ef9b0 100644
--- a/drivers/accel/habanalabs/gaudi2/gaudi2.c
+++ b/drivers/accel/habanalabs/gaudi2/gaudi2.c
@@ -7858,10 +7858,11 @@ static bool gaudi2_handle_ecc_event(struct hl_device *hdev, u16 event_type,
 	return !!ecc_data->is_critical;
 }
 
-static void handle_lower_qman_data_on_err(struct hl_device *hdev, u64 qman_base, u64 event_mask)
+static void handle_lower_qman_data_on_err(struct hl_device *hdev, u64 qman_base, u32 engine_id)
 {
-	u32 lo, hi, cq_ptr_size, cp_sts;
+	struct undefined_opcode_info *undef_opcode = &hdev->captured_err_info.undef_opcode;
 	u64 cq_ptr, cp_current_inst;
+	u32 lo, hi, cq_size, cp_sts;
 	bool is_arc_cq;
 
 	cp_sts = RREG32(qman_base + QM_CP_STS_4_OFFSET);
@@ -7871,12 +7872,12 @@ static void handle_lower_qman_data_on_err(struct hl_device *hdev, u64 qman_base,
 		lo = RREG32(qman_base + QM_ARC_CQ_PTR_LO_STS_OFFSET);
 		hi = RREG32(qman_base + QM_ARC_CQ_PTR_HI_STS_OFFSET);
 		cq_ptr = ((u64) hi) << 32 | lo;
-		cq_ptr_size = RREG32(qman_base + QM_ARC_CQ_TSIZE_STS_OFFSET);
+		cq_size = RREG32(qman_base + QM_ARC_CQ_TSIZE_STS_OFFSET);
 	} else {
 		lo = RREG32(qman_base + QM_CQ_PTR_LO_STS_4_OFFSET);
 		hi = RREG32(qman_base + QM_CQ_PTR_HI_STS_4_OFFSET);
 		cq_ptr = ((u64) hi) << 32 | lo;
-		cq_ptr_size = RREG32(qman_base + QM_CQ_TSIZE_STS_4_OFFSET);
+		cq_size = RREG32(qman_base + QM_CQ_TSIZE_STS_4_OFFSET);
 	}
 
 	lo = RREG32(qman_base + QM_CP_CURRENT_INST_LO_4_OFFSET);
@@ -7885,12 +7886,16 @@ static void handle_lower_qman_data_on_err(struct hl_device *hdev, u64 qman_base,
 
 	dev_info(hdev->dev,
 		"LowerQM. %sCQ: {ptr %#llx, size %u}, CP: {instruction %#018llx}\n",
-		is_arc_cq ? "ARC_" : "", cq_ptr, cq_ptr_size, cp_current_inst);
+		is_arc_cq ? "ARC_" : "", cq_ptr, cq_size, cp_current_inst);
 
-	if (event_mask & HL_NOTIFIER_EVENT_UNDEFINED_OPCODE) {
-		hdev->captured_err_info.undef_opcode.cq_addr = cq_ptr;
-		hdev->captured_err_info.undef_opcode.cq_size = cq_ptr_size;
-		hdev->captured_err_info.undef_opcode.stream_id = QMAN_STREAMS;
+	if (undef_opcode->write_enable) {
+		memset(undef_opcode, 0, sizeof(*undef_opcode));
+		undef_opcode->timestamp = ktime_get();
+		undef_opcode->cq_addr = cq_ptr;
+		undef_opcode->cq_size = cq_size;
+		undef_opcode->engine_id = engine_id;
+		undef_opcode->stream_id = QMAN_STREAMS;
+		undef_opcode->write_enable = 0;
 	}
 }
 
@@ -7929,19 +7934,12 @@ static int gaudi2_handle_qman_err_generic(struct hl_device *hdev, u16 event_type
 			error_count++;
 		}
 
-		/* check for undefined opcode */
-		if (glbl_sts_val & PDMA0_QM_GLBL_ERR_STS_CP_UNDEF_CMD_ERR_MASK) {
+		/* Check for undefined opcode error in lower QM */
+		if ((i == QMAN_STREAMS) &&
+				(glbl_sts_val & PDMA0_QM_GLBL_ERR_STS_CP_UNDEF_CMD_ERR_MASK)) {
+			handle_lower_qman_data_on_err(hdev, qman_base,
+							gaudi2_queue_id_to_engine_id[qid_base]);
 			*event_mask |= HL_NOTIFIER_EVENT_UNDEFINED_OPCODE;
-			if (hdev->captured_err_info.undef_opcode.write_enable) {
-				memset(&hdev->captured_err_info.undef_opcode, 0,
-						sizeof(hdev->captured_err_info.undef_opcode));
-				hdev->captured_err_info.undef_opcode.timestamp = ktime_get();
-				hdev->captured_err_info.undef_opcode.engine_id =
-							gaudi2_queue_id_to_engine_id[qid_base];
-			}
-
-			if (i == QMAN_STREAMS)
-				handle_lower_qman_data_on_err(hdev, qman_base, *event_mask);
 		}
 	}
 
-- 
2.34.1
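
For readers outside the driver, the capture pattern this patch consolidates can be sketched in plain userspace C. This is only an illustration under stated assumptions, not the driver code: struct undef_opcode_capture and capture_undef_opcode() are hypothetical names, time() stands in for ktime_get(), and the field set mirrors only the fields touched in the diff. The point it shows is that the whole record is written in one place, only while 'write_enable' is set, and that the flag is cleared afterwards so a later error cannot overwrite or mix into the first one.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>
#include <time.h>

/* Hypothetical stand-in for the driver's undefined-opcode capture record. */
struct undef_opcode_capture {
	time_t timestamp;	/* stand-in for the ktime_get() timestamp */
	uint64_t cq_addr;
	uint32_t cq_size;
	uint32_t engine_id;
	uint32_t stream_id;
	bool write_enable;	/* set while the record may still be written */
};

/*
 * Record one error if, and only if, capture is still enabled.
 * All fields are written together, then the flag is cleared.
 * Returns true when the record was written, false when an earlier
 * capture is still being held and this error was ignored.
 */
static bool capture_undef_opcode(struct undef_opcode_capture *cap,
				 uint64_t cq_addr, uint32_t cq_size,
				 uint32_t engine_id, uint32_t stream_id)
{
	if (!cap->write_enable)
		return false;

	/* Clearing the record also clears write_enable. */
	memset(cap, 0, sizeof(*cap));
	cap->timestamp = time(NULL);
	cap->cq_addr = cq_addr;
	cap->cq_size = cq_size;
	cap->engine_id = engine_id;
	cap->stream_id = stream_id;
	cap->write_enable = false;

	return true;
}
```

With a record initialized as { .write_enable = true }, a first call stores its arguments and returns true; a second call returns false and leaves the stored values untouched until some other code sets the flag again.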