Received: by 2002:a05:6a10:9afc:0:0:0:0 with SMTP id t28csp2140248pxm; Fri, 4 Mar 2022 10:06:23 -0800 (PST) X-Google-Smtp-Source: ABdhPJwK9e7xEwuUC6XrvVBaLQEnYK2aHXBL9V9/oqNqP0CTVUSx18gqIoLCSNB3sQwdTbjLR9Yv X-Received: by 2002:a17:907:d92:b0:6da:7ac4:533c with SMTP id go18-20020a1709070d9200b006da7ac4533cmr10310988ejc.234.1646417181731; Fri, 04 Mar 2022 10:06:21 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1646417181; cv=none; d=google.com; s=arc-20160816; b=rhXcghvzUieugzovbD5pIsZxJ+oVEMe1qSRqVfSXJovdbe5wVDCwcVLJKKgxCY79jV NHBjZvNlgS81J1shEu4WmrmNI2or/RMwmgN+QSLpE3ngz8JRsWB8lQ+AHE6oIdxos37y 1BNsPXZuEwuzVK+V3IQG5NLfFeoUkhH++qgdd1tdfisydudkAtQjLo8pz2Uv/wT9I5Vh /m0vjVZT3R0tarmiOzczBkd/+Ntp8pf6LdgWsl0h92/5ouYwxsn3ekrpW+l5PkkaKoNB qwvEExX2VUB/ewAmQVw7eku1gaRUcNUPseeE04H7ebiBxqLC1HOVO9qgZADb1M0atCK2 1WKQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=v0Ga8lHVNMbrstzQK9wZGlExIBnj23jhu/B+lbFgQzE=; b=nszEMMKPy7n7oER1dLaL5ZAYd/ikrY1Wwu2mvoeXid8OWtptjZ4DyXhhmHIV0H407K mTiprCDMFKWC8OrN4Kd2kJGtmBpuB1XPCZzcuHEKLTaKlw6p4TlcE7GL9ZSpD8/D8HXa rCp2xP8gtlDuCqJySGWrKoCliVM3MvnlQw1XFRaU15FynQkFQWwdfK3bXgqrNBckbuqt rKojCh3di6jxyDQ/mMZxSwkWCSRf/Gs3DOa/cJnc3ux7+F27KWIFye35+npTdZgJh+JU pkmoKZtAx2ZFffuPG5CT3EH970QelRJqstxR9js0iiK1yRJE1ZzV5I7x8MGk94Nexdha n+MA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=ikUHPzFO; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id t17-20020a170906269100b006d14287d5bbsi3103185ejc.663.2022.03.04.10.05.58; Fri, 04 Mar 2022 10:06:21 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=ikUHPzFO; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234248AbiCDRYz (ORCPT + 99 others); Fri, 4 Mar 2022 12:24:55 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49758 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230213AbiCDRYy (ORCPT ); Fri, 4 Mar 2022 12:24:54 -0500 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4DBBB156C75; Fri, 4 Mar 2022 09:24:06 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id DD90661E83; Fri, 4 Mar 2022 17:24:05 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id BFA25C340E9; Fri, 4 Mar 2022 17:24:04 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1646414645; bh=sKIE43FuoKzxA9J2rZA6k8NGfDYTGF0hkk3YnB5jMCU=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=ikUHPzFOoq7I8BAqGOMMDNa+1gY2GWMxOJ51nBFsoAqpNrF/ugKMuiVgwXpg4W8B/ G0heglFOjOLGy/9ZK+l3i09N6/YFgYq9wcu0vBJhNyYJv3BwLU5rKzQYJvtPmlbfsU QwKr97+NIIxKjKF0BFiS25gaQZVeo7PNdqMHlNS7W5jSkG3qaMZJeQkytWiO0C3qaS gLwvOByVokgAhQ4wEVk1XozLbqzawjsvK95iRaKy8ulB1PTJ+kRxSxceDqjL1sNcJz wK0QeccwOjKAIhzVxNug45rrbojerJRVhKwtRF+BYFxRiGBu84AFMW+4g48g0qLQ94 Z7zjTqm+9/tpw== Date: Fri, 4 Mar 2022 19:24:01 +0200 From: Leon Romanovsky To: Haakon Bugge Cc: Jason Gunthorpe , OFED mailing list , "linux-kernel@vger.kernel.org" Subject: Re: [PATCH for-next] Revert "IB/mlx5: Don't return errors from poll_cq" Message-ID: References: <1646315417-25549-1-git-send-email-haakon.bugge@oracle.com> <726B8D27-B7C7-4779-A56B-3AE9266BC208@oracle.com> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <726B8D27-B7C7-4779-A56B-3AE9266BC208@oracle.com> X-Spam-Status: No, score=-7.5 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Mar 04, 2022 at 10:53:34AM +0000, Haakon Bugge wrote: > > > > On 3 Mar 2022, at 20:09, Leon Romanovsky wrote: > > > > On Thu, Mar 03, 2022 at 02:50:17PM +0100, H?kon Bugge wrote: > >> This reverts commit dbdf7d4e7f911f79ceb08365a756bbf6eecac81c. > >> > >> Commit dbdf7d4e7f91 ("IB/mlx5: Don't return errors from poll_cq") is > >> needed, when driver/fw communication gets wedged. > >> > >> With a large fleet of systems equipped with CX-5, we have observed the > >> following mlx5 error message: > >> > >> wait_func:945:(pid xxx): ACCESS_REG(0x805) timeout. Will cause a > >> leak of a command resource > > > > It is arguably FW issue. Please contact your Nvidia support representative. > > The RC for the whacked driver/fw communication has been raised with Nvidia support. This commit is to avoid the kernel to crash when this situation arises. And inevitable, it may happen. I'm confident that support team will find best possible solution to the raised issue. Thanks > > > Thxs, H?kon