Received: by 2002:a05:7412:8d10:b0:f3:1519:9f41 with SMTP id bj16csp745996rdb; Tue, 5 Dec 2023 20:55:08 -0800 (PST) X-Google-Smtp-Source: AGHT+IHeBZC59Dz1h/1DgeQJxAQsItMpkLJH2QLKn+oXKkVyqdAv0t/zB8wDovEsYJxQ5gDz7Ej4 X-Received: by 2002:a17:902:ef89:b0:1d0:6ffd:ae0c with SMTP id iz9-20020a170902ef8900b001d06ffdae0cmr120960plb.115.1701838508529; Tue, 05 Dec 2023 20:55:08 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1701838508; cv=none; d=google.com; s=arc-20160816; b=ZcD5RpN31BPO0elJv6lnF8h2N3cuBKxi6JPDQwhvj0SLH0P9B0jGtBnIPGQ71DH4O0 YwXoozUTZd5zSJHBmkq89hQt2Ri1QehyXZx2f2vM9vzEa0qZMkmG/I7wbakmYM42+ntT TegYfUSxaLXjwKxK2ZKfIDyQYiHIHd2jkCqsu/NnUa4vzwfak0JhojAlRpS2Im4jVowE UHUfAa2Zl42Sy0gJqBtuA03BafrjPgpQqSDfCYcOSjBNSKj3k/yKKn1xEJKteHsdOJg/ iBmiZKglgW+s2GyKRvTW64DPSn/ZbGaIzz92JkFJhI90I33Jyh1xpAt72LPqTAJjeALd TAoQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=RxXsT0yeOSizQW7Lk4I+1Lgcb1LSFz0cbGVjxKQp1MM=; fh=izAHtxAH7LZo30ylz7VgPsszaMVYPFGNIYR6BnuPXiA=; b=HDHROzZ0rkqFbOeemXBWg7mqDPaR326Y1Ceh6KYdR/jP5OvmbiafOGpvVyM+uhsJ/t muUFZ8YxGbLQByVTXGrAaQK7eeTb62eLxocZArCYSkCKnKdSqfHsVD2UZjAxHFTAmmvE oEIjeQOnl59W9sN24SuTKZjdGs3wSIeulfiB1h7CWWRaW096U0ngkPb7GLNiMou4tQIG QHXv/AgXsNOccquiOA6ttsw6ySCEtCZCcahNJksAv0GDKo3miq60atRpl75n70QPR8I0 OhmcIKG0QWn6PEhTjA5iX2cqgq3AeKA/kzOpgED/avF88Syc3Y4sTM93LG4DFP9tuqwk 8JEA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=rPLjrLQk; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:4 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from howler.vger.email (howler.vger.email. [2620:137:e000::3:4]) by mx.google.com with ESMTPS id n6-20020a170902e54600b001d1ccbe983asi495806plf.383.2023.12.05.20.55.08 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 05 Dec 2023 20:55:08 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:4 as permitted sender) client-ip=2620:137:e000::3:4; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=rPLjrLQk; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:4 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by howler.vger.email (Postfix) with ESMTP id F1BC382B92A3; Tue, 5 Dec 2023 20:55:05 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.11 at howler.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1376641AbjLFEsw (ORCPT + 99 others); Tue, 5 Dec 2023 23:48:52 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37834 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229493AbjLFEsv (ORCPT ); Tue, 5 Dec 2023 23:48:51 -0500 Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3A856A5 for ; Tue, 5 Dec 2023 20:48:58 -0800 (PST) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 1695EC433C7; Wed, 6 Dec 2023 04:48:57 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1701838137; bh=m+CegGpcxRyo/raoNH696719o8PPhaXae7JopGc2FhQ=; h=Date:From:To:Cc:Subject:In-Reply-To:References:From; b=rPLjrLQk+3kKLIQvdb+37zG5m/hX+g/MoFu7JFtBKiJXzDG0RJLhvM4U6Jkd2K0T7 +m/7kY3FKFU4lOlzH+f9hd1m4iDOKIipNneln2zfDPnj1G/4TCiaVY7izmmNUz+32s J+GZZpKWh4y1vxQysMI9tHWwH5cRJDbHB76qblvVdEx5/X0w1yDRZrg9daOpFbnk2u Xz0pfQdjX1eAH+mJmnmr7eepktoWfOxbRynp2muQag611UkwGW9yZdBzIMB+GEejZ/ xqrNleabKQvgmOtAWYX0KvNE9rFZYTrEpf6ewmYh3EVJL7Yyi1/gOHNEGBr021qL6S S24PhyYlhvILQ== Date: Tue, 5 Dec 2023 20:48:55 -0800 From: Jakub Kicinski To: Aron Silverton Cc: Greg Kroah-Hartman , Saeed Mahameed , Jason Gunthorpe , David Ahern , Arnd Bergmann , Leon Romanovsky , Jiri Pirko , Leonid Bloch , Itay Avraham , linux-kernel@vger.kernel.org, Saeed Mahameed Subject: Re: [PATCH V3 2/5] misc: mlx5ctl: Add mlx5ctl misc driver Message-ID: <20231205204855.52fa5cc1@kernel.org> In-Reply-To: References: <20231128044628.GA8901@u2004-local> <20231128065321.53d4d5bb@kernel.org> <20231128162413.GP436702@nvidia.com> <20231128084421.6321b9b2@kernel.org> <20231128175224.GR436702@nvidia.com> <20231128103304.25c2c642@kernel.org> <2023112922-lyricist-unclip-8e78@gregkh> <20231204185210.030a72ca@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-1.2 required=5.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,MAILING_LIST_MULTI, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on howler.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (howler.vger.email [0.0.0.0]); Tue, 05 Dec 2023 20:55:06 -0800 (PST) On Tue, 5 Dec 2023 11:11:00 -0600 Aron Silverton wrote: > 1. As mentioned already, we recently faced a complex problem with RDMA > in KVM and were getting nowhere trying to debug using the usual methods. > Mellanox support was able to use this debug interface to see what was > happening on the PCI bus and prove that the issue was caused by > corrupted PCIe transactions. This finally put the investigation on the > correct path. The debug interface was used consistently and extensively > to test theories about what was happening in the system and, ultimately, > allowed the problem to be solved. You hit on an important point, and what is also my experience working at Meta. I may have even mentioned it in this thread already. If there is a serious issue with a complex device, there are two ways you can get support - dump all you can and send the dump to the vendor or get on a live debugging session with their engineers. Users' ability to debug those devices is practically non-existent. The idea that we need access to FW internals is predicated on the assumption that we have an ability to make sense of those internals. Once you're on a support call with the vendor - just load a custom kernel, module, whatever, it's already extremely expensive manual labor. > 2. We've faced RDMA issues related to lost EQ doorbells, requiring > complex debug, and ultimately root-caused as a defective CPU. Without > interactive access to the device allowing us to test theories like, > "what if we manually restart the EQ", we could not have proven this > definitively. I'm not familiar with the RDMA debugging capabilities. Perhaps there are some gaps there. The more proprietary the implementation the harder it is to debug. An answer to that would be "try to keep as much as possible open".. and interfaces which let closed user space talk to closed FW take us in the opposite direction. FWIW good netdevice drivers have a selftest which tests IRQ generation and EQ handling. I think that'd cover the case you're describing? IDK if mlx5 has them, but if it doesn't definitely worth adding. And I recommend running those on suspicious machines (ethtool -t, devlink has some selftests, too) > Firstly, We believe in working upstream and all of the advantages that > that brings to all the distros as well as to us and our customers. > > Secondly, Our cloud business offers many types of machine instances, > some with bare metal/vfio mlx5 devices, that require customer driven > debug and we want our customers to have the freedom to choose which OS > they want to use. I understand that having everything packaged and shipped together makes life easier. If the point of the kernel at this stage of its evolution is to collect incompatible bits of vendor software, make sure they build cleanly and ship them to distros - someone should tell me, and I will relent.