Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp36579679rwd; Tue, 11 Jul 2023 02:58:49 -0700 (PDT) X-Google-Smtp-Source: APBJJlG/N+8bk1pRPse+af8QSuySy8cNFRT+zNOCI7BqCHORLgOQBS0GjPGCRtF+Oe+RWSE/lIvF X-Received: by 2002:a05:6a20:139a:b0:130:1563:7502 with SMTP id hn26-20020a056a20139a00b0013015637502mr10653630pzc.7.1689069529676; Tue, 11 Jul 2023 02:58:49 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689069529; cv=none; d=google.com; s=arc-20160816; b=NDRAR3L6iSxvFUqky2Vbpr+lE+Ejcjq0S43dIv9olSFkTQGxcHGwM/upGLEmarceYS DqUrT7q9RnR7P46z0PSo+Tbu6HnbrHwZ+7TWE/SwbfthKbHSRqszztjugeEhtmmc5YUI 20fRN+JGy4nJzcYxkpQVOXXgL6sAKJvYQomzu3uB0opABlrZ39U3ayjswA1zM7NfvOvY D297wBOtz+J9MOPKnAcYKWuBqzafHVbkTHa7ognwTzavFpAi4kqqoCHC2j85eRq9SBmI nTlvJh1hWn2qW9IXP4YVVWDG491HPtnhG8UFUeyTP/3uICgwp/9dAZnp+yUyqmorv1Kn LPKA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:reply-to :from:references:cc:to:content-language:subject:user-agent :mime-version:date:message-id; bh=m+nahZG4ag4WYByQONVIafiPqzg8BBLi6TIrO5RhHRg=; fh=0YG8iRI0E4A6zWz9pQenhy5Nmw0rer1QKOMaTh4RNPQ=; b=Z79c559E4VrM3Ns0Skgu2GdBz7yQvOOidM+QicVXxFvyYLUSStTc56g1+w36nIvLR2 CEjAetSmkdWgyn38HlMVMmEy+PLAVac3jmgXqgR1hOnIVLiP0OpndBosP7HtPRq0CL4U JwOBV1T5GsPP9wn6foKwKY2oZR7hInc3vhH3/R/nGgbSrCmHppnd6owPOSq0yptLDHdw mPiOg83mE439A5NFz3szhg+1llK7+A8UMQsV6pNHrcDXm9JrXWpPhFWI5WlX9O7rpkcA hMlzS4h+wLbxvHuKy+RRMrkf1lUqjD/izhtlkp1QlhezJ59uOhb63+JL8jGGsg8HC30V 9cCg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id eb26-20020a056a004c9a00b00668718e54c6si1246156pfb.202.2023.07.11.02.58.37; Tue, 11 Jul 2023 02:58:49 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230494AbjGKJjr (ORCPT + 99 others); Tue, 11 Jul 2023 05:39:47 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59398 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230513AbjGKJje (ORCPT ); Tue, 11 Jul 2023 05:39:34 -0400 Received: from wp530.webpack.hosteurope.de (wp530.webpack.hosteurope.de [80.237.130.52]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A8712136 for ; Tue, 11 Jul 2023 02:39:22 -0700 (PDT) Received: from [2a02:8108:8980:2478:8cde:aa2c:f324:937e]; authenticated by wp530.webpack.hosteurope.de running ExIM with esmtpsa (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) id 1qJ9pw-00047e-HN; Tue, 11 Jul 2023 11:39:12 +0200 Message-ID: <462e0e1e-98ea-0f3c-4aaa-8d44f0a8e664@leemhuis.info> Date: Tue, 11 Jul 2023 11:39:11 +0200 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Subject: Re: Fwd: Need NVME QUIRK BOGUS for SAMSUNG MZ1WV480HCGL-000MV (Samsung SM-953 Datacenter SSD) Content-Language: en-US, de-DE To: Pankaj Raghav , Keith Busch , Linux regressions mailing list Cc: Bagas Sanjaya , Jens Axboe , Christoph Hellwig , Sagi Grimberg , "Clemens S." , Martin Belanger , Chaitanya Kulkarni , John Meneghini , Hannes Reinecke , Linux Kernel Mailing List , Linux NVMe , Kanchan Joshi , Javier Gonzalez , =?UTF-8?B?67CV7KeE7ZmY?= , Linus Torvalds References: <6f333133-2cc4-406a-d6c2-642ac6ccabca@leemhuis.info> From: "Linux regression tracking (Thorsten Leemhuis)" Reply-To: Linux regressions mailing list In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-bounce-key: webpack.hosteurope.de;regressions@leemhuis.info;1689068362;c61ff5f5; X-HE-SMSGID: 1qJ9pw-00047e-HN X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org [CCing Linus for the "whack a mole" aspect in the second half] On 11.07.23 08:54, Pankaj Raghav wrote: >>> I understand that, but I think we need middlemen for that, as I or Bagas >>> don't have the contacts -- and it's IMHO also a bit much too ask us for >>> in general, as regression tracking is hard enough already. At least >>> unless this becomes something that happen regularly, then a list of >>> persons we could contact would be fine I guess. But we simply can't deal >>> with too many subsystem specific special cases. >> >> I'm not asking the Linux regression trackers to fill that role, though. Well, during our work we often encounter those bugs -- often from people that are no regular developers that already had a hard time understanding the issue and reporting it to us somehow. Asking those to... >> I'm asking people who experience these issues report it to their vendor ...find the right destination and format to report their Linux problems to the vendors is unlikely to fly I suspect. And I'm not sure if that is in our interest, as then it might take a lot longer to get those quirk entries into the kernel source. But whatever, the main reason why I write this mail is different: >> directly because these device makers apparently have zero clue that >> their spec non-compliance is causing painful experiences for their >> customers and annoyance for maintainers. They keep pumping out more and >> more devices with the same breakage. >> >> This particular vendor has been great at engaging with Linux, but that's >> not necessarily normal among all device makers, and I don't have >> contacts with the majority of the vendors we've had to quirk for this >> issue. >> >> We did complain to the NVMe spec workgroup that their complaince cert >> suite is not testing for this. There was a little initial interest in >> fixing that gap, but it fizzled out... Preface: this is not my area of expertise, and maybe I should keep my mouth shut. But whatever. Well, that "They keep pumping out more and more devices with the same breakage" and the "new device" comment from Pankaj below bear the question: should we stop trying to play "whack a mole" with all those quirk entries and handle devices with duplicate ids just like Windows does? That would "make things just work"(tm). And yes, I suspect there are good reasons why we went down the "quirk" route or why abandoning it might be hard. But maybe it's time to reconsider that path, as from my outside point of view things sound a lot like they are somewhat similar to the ACPI problems we dealt with ~15 years ago: we learned that we have to deal with broken ACPI implementations and somehow use them in a way similar to how Windows uses them, as that's the OS the machine was designed for and tested with. Ciao, Thorsten >>> Another request came in today, even with a pseudo-patch: >>> https://bugzilla.kernel.org/show_bug.cgi?id=217649 >>> >>> To quote: >>> ``` >>> As with numerous NVMe controllers these days, Samsung's >>> MZAL41T0HBLB-00BL2, which Lenovo builds into their 16ARP8 also suffers >>> from invalid IDs, breaking suspend and hibernate also on the latest >>> kernel 6.4.2. > [...] >> Panjaj, okay with this one too? > > This looks a like a new device that might have a firmware update. I will ping > internally first. > > As you mentioned, the recent addition of globally unique ID check > is breaking a lot of devices because of non-compliant firmware. I will try to create > some awareness about this issue internally as well.