Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp1173486rwd; Thu, 1 Jun 2023 11:27:00 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ68f1IDcNfOBqUl5enjJfl0XAadjN+cyIJ1UzAuFco3nGPjr1oiGwdsOhTunss/EvdlDwFo X-Received: by 2002:a05:6a00:98f:b0:645:c730:f826 with SMTP id u15-20020a056a00098f00b00645c730f826mr8791486pfg.24.1685644020614; Thu, 01 Jun 2023 11:27:00 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1685644020; cv=none; d=google.com; s=arc-20160816; b=yV74xtJYrYKIZ+5cywquc5lfaEuXGx/HjQG8+zdeCj4AjdkqlgeeJx/QqSAfpv2VFu 3PFOA30w9/rrxC1UpAyiMFHLZgB5dWyi4SSBPeooiei1a/uXNu7RvEAq8eww5/rpxHmI l5oob3m8v8IRah/A/TTHI+mOVTYO3uy46bHPrcukZ4vbbLihv+Q7NOr7tCs2VX5j3a73 +ZAhb3haHr/EkNfprih8nvSze8S/Uml0oRB+PliEGDWRophgOLTCQKhbSqJZPRbNOWa5 06CBVDHFErZzM4ao1atc9poxku/7UlIjAHD2p3wFFfDoImzTI+eLCxcuGaSHwMSY5OBT fK2g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=r2dczyxnMSoVxG4Sl+UFzUV7DqIebX1V1UnCasZfupU=; b=VdUPncc3rZDUDj1GbVs67qPJMBMpsC4ftbYmDKASnWatm2VIM6aCrSruAZy2d9CkUu 0l8JH9bQ1czpU2AahgQNV1lgLF4rhrkkggX1qx2BrhvVPT+aoBm5PXDSmopaRzuwkpY5 XCAXYVwkXbO1ytQtehvoNuUL1x0VPLJPr+bObvse4A/afu9k9NygQtPIZd4XqotaOhB0 h5A9/4wHt3+WZJj+rsPhmEFsmmbJeqAP6cZ4kUeQpF/N5zanWhJUbrZYXCiUpDq1X3HJ yY7Tgb2HlTZZ/BS4ZXa/QTGoR9G9kZT0XpHa5fjWuAYbHqpX6BhAF+pjbqe+oHLAbd00 4sFw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=TwXEiuMu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id x1-20020aa79ac1000000b006293f8330fcsi578591pfp.322.2023.06.01.11.26.48; Thu, 01 Jun 2023 11:27:00 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=TwXEiuMu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231480AbjFASLb (ORCPT + 99 others); Thu, 1 Jun 2023 14:11:31 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50230 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231223AbjFASL3 (ORCPT ); Thu, 1 Jun 2023 14:11:29 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id BF510193 for ; Thu, 1 Jun 2023 11:10:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1685643040; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=r2dczyxnMSoVxG4Sl+UFzUV7DqIebX1V1UnCasZfupU=; b=TwXEiuMumC/j3sqSSUxi2KbAZdiO7wuRBhLWf28g5YXEC95tKdNEq3tilYUfovpkk1x/wc t54cgoTTvEQLwgjJMYyl8TTuBf+FgQKQBUQJ7/ublqb0qpKHBSPZDnKe3vlovH2nfu7efS qhJoesa4CJDCXtq92WhnAoyT/D6ZlwM= Received: from mail-lj1-f200.google.com (mail-lj1-f200.google.com [209.85.208.200]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-208-R_0gaFJQPVi5Uhnt2j727A-1; Thu, 01 Jun 2023 14:10:39 -0400 X-MC-Unique: R_0gaFJQPVi5Uhnt2j727A-1 Received: by mail-lj1-f200.google.com with SMTP id 38308e7fff4ca-2b04d5ed394so2031471fa.1 for ; Thu, 01 Jun 2023 11:10:39 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1685643038; x=1688235038; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=r2dczyxnMSoVxG4Sl+UFzUV7DqIebX1V1UnCasZfupU=; b=MNQrgFuZKLACKNRy9lMH0yqfWsOx/ui/FckS5U4YDqiIcOptfprJbow349GnfNxvLB jripncnGLd+i0pm9YAC5sck/m5RCNEaiObFNFtW3IG0jDxGQNhzUehmxRfAPCDOyK3KQ s0Bq+31lF5RlbRso6ArQXQrUzcUmKqVTwnCSSZJ1poaSDaSbhtPXsy+Y1KgT8maKVbAd KNXwqDNldmM9JBkvEhNWGo/Tk8umRxQtQc8pYQti2H9vsYpft1uuWk1jp8Pqhe+CeHIl TgGEAnUvblgAzRu7slYjPYG1JX4oDLwG94IWoGcwN1SlzEwAH3wfiy4VpBBM5QPbCH6C FoyA== X-Gm-Message-State: AC+VfDwsvy2AGF7TSC0Xc0I9CHMC7DHkyP0HHeYzB9ubeOn9j232udMQ J/qR9Au8Bj9T3hGP0ee9Pmut1nMM80y/pnoIG+05IwXGqD1OIx9e8J3N/5ydWd2Ufq3tjZGcyL4 Yvk9OQ0JzwZJQkY95DOsOn2BiXYdM+DA4cFk6gE8p X-Received: by 2002:a2e:a366:0:b0:2b1:a667:dbca with SMTP id i6-20020a2ea366000000b002b1a667dbcamr1176773ljn.2.1685643038242; Thu, 01 Jun 2023 11:10:38 -0700 (PDT) X-Received: by 2002:a2e:a366:0:b0:2b1:a667:dbca with SMTP id i6-20020a2ea366000000b002b1a667dbcamr1176762ljn.2.1685643037963; Thu, 01 Jun 2023 11:10:37 -0700 (PDT) MIME-Version: 1.0 References: <168471337231.1913606.15905047692536779158.reportbug@xps> <09e24386-de63-e9e9-9e7f-5d04bad62d83@amd.com> <8537d965-ddf4-7f45-6459-d5acf520376e@amd.com> In-Reply-To: From: Karol Herbst Date: Thu, 1 Jun 2023 20:10:26 +0200 Message-ID: Subject: Re: Regression from "ACPI: OSI: Remove Linux-Dell-Video _OSI string"? (was: Re: Bug#1036530: linux-signed-amd64: Hard lock up of system) To: "Limonciello, Mario" Cc: Nick Hastings , Lyude Paul , Lukas Wunner , Salvatore Bonaccorso , "1036530@bugs.debian.org" <1036530@bugs.debian.org>, "Rafael J. Wysocki" , Len Brown , "linux-acpi@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "regressions@lists.linux.dev" Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Spam-Status: No, score=-2.3 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jun 1, 2023 at 7:21=E2=80=AFPM Limonciello, Mario wrote: > > [AMD Official Use Only - General] > > > -----Original Message----- > > From: Karol Herbst > > Sent: Thursday, June 1, 2023 12:19 PM > > To: Limonciello, Mario > > Cc: Nick Hastings ; Lyude Paul > > ; Lukas Wunner ; Salvatore > > Bonaccorso ; 1036530@bugs.debian.org; Rafael J. > > Wysocki ; Len Brown ; linux- > > acpi@vger.kernel.org; linux-kernel@vger.kernel.org; > > regressions@lists.linux.dev > > Subject: Re: Regression from "ACPI: OSI: Remove Linux-Dell-Video _OSI > > string"? (was: Re: Bug#1036530: linux-signed-amd64: Hard lock up of sys= tem) > > > > On Thu, Jun 1, 2023 at 6:54=E2=80=AFPM Limonciello, Mario > > wrote: > > > > > > [AMD Official Use Only - General] > > > > > > > -----Original Message----- > > > > From: Karol Herbst > > > > Sent: Thursday, June 1, 2023 11:33 AM > > > > To: Limonciello, Mario > > > > Cc: Nick Hastings ; Lyude Paul > > > > ; Lukas Wunner ; Salvatore > > > > Bonaccorso ; 1036530@bugs.debian.org; Rafael J. > > > > Wysocki ; Len Brown ; linux- > > > > acpi@vger.kernel.org; linux-kernel@vger.kernel.org; > > > > regressions@lists.linux.dev > > > > Subject: Re: Regression from "ACPI: OSI: Remove Linux-Dell-Video _O= SI > > > > string"? (was: Re: Bug#1036530: linux-signed-amd64: Hard lock up of > > system) > > > > > > > > On Thu, Jun 1, 2023 at 6:18=E2=80=AFPM Limonciello, Mario > > > > wrote: > > > > > > > > > > +Lyude, Lukas, Karol > > > > > > > > > > On 5/31/2023 6:40 PM, Nick Hastings wrote: > > > > > > Hi, > > > > > > > > > > > > * Nick Hastings [230530 16:01]: > > > > > >> * Mario Limonciello [230530 13:00]= : > > > > > > > > > > > >>> As you're actually loading nouveau, can you please try > > > > nouveau.runpm=3D0 on > > > > > >>> the kernel command line? > > > > > >> I'm not intentionally loading it. This machine also has intel = graphics > > > > > >> which is what I prefer. Checking my > > > > > >> /etc/modprobe.d/blacklist-nvidia-nouveau.conf > > > > > >> I see: > > > > > >> > > > > > >> blacklist nvidia > > > > > >> blacklist nvidia-drm > > > > > >> blacklist nvidia-modeset > > > > > >> blacklist nvidia-uvm > > > > > >> blacklist ipmi_msghandler > > > > > >> blacklist ipmi_devintf > > > > > >> > > > > > >> So I thought I had blacklisted it but it seems I did not. Sinc= e I do not > > > > > >> want to use it maybe it is better to check if the lock up occu= rs with > > > > > >> nouveau blacklisted. I will try that now. > > > > > > I blacklisted nouveau and booted into a 6.1 kernel: > > > > > > % uname -a > > > > > > Linux xps 6.1.0-9-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.27-1 > > > > (2023-05-08) x86_64 GNU/Linux > > > > > > > > > > > > It has been running without problems for nearly two days now: > > > > > > % uptime > > > > > > 08:34:48 up 1 day, 16:22, 2 users, load average: 1.33, 1.26= , 1.27 > > > > > > > > > > > > Regards, > > > > > > > > > > > > Nick. > > > > > > > > > > Thanks, that makes a lot more sense now. > > > > > > > > > > Nick, Can you please test if nouveau works with runtime PM in the > > > > > latest 6.4-rc? > > > > > > > > > > If it works in 6.4-rc, there are probably nouveau commits that ne= ed > > > > > to be backported to 6.1 LTS. > > > > > > > > > > If it's still broken in 6.4-rc, I believe you should file a bug: > > > > > > > > > > https://gitlab.freedesktop.org/drm/nouveau/ > > > > > > > > > > > > > > > Lyude, Lukas, Karol > > > > > > > > > > This thread is in relation to this commit: > > > > > > > > > > 24867516f06d ("ACPI: OSI: Remove Linux-Dell-Video _OSI string") > > > > > > > > > > Nick has found that runtime PM is *not* working for nouveau. > > > > > > > > > > > > > keep in mind we have a list of PCIe controllers where we apply a > > > > workaround: > > > > > > https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree= /drivers > > > > /gpu/drm/nouveau/nouveau_drm.c?h=3Dv6.4-rc4#n682 > > > > > > > > And I suspect there might be one or two more IDs we'll have to add > > > > there. Do we have any logs? > > > > > > There's some archived onto the distro bug. Search this page for > > "journalctl.log.gz" > > > https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D1036530 > > > > > > > interesting.. It seems to be the same controller used here. I wonder > > if the pci topology is different or if the workaround is applied at > > all. > > I didn't see the message in the log about the workaround being applied > in that log, so I guess PCI topology difference is a likely suspect. > yeah, but I also couldn't see a log with the usual nouveau messages, so it's kinda weird. Anyway, the output of `lspci -tvnn` would help > > > > But yeah, I'd kinda love for somebody with better knowledge on all of > > this to figure out what exactly is going wrong, but everytime this > > gets investigated Intel says "our hardware has no bugs", the ACPI > > folks dig for months and find nothing and I end up figuring out some > > weirdo workaround I don't understand. And apparently also nobody is > > able to hand out docs explaining in detail how that runtime > > suspend/resume stuff is supposed to work. > > > > I have a Dell XPS 9560 where the added workaround in nouveau fixed the > > problem and I know it's fixed on a bunch of other systems. So if > > anybody is willing to publish docs and/or actually debug it with > > domain knowledge, please go ahead. > > > > > > And could anybody test if adding the > > > > controller in play here does resolve the problem? > > > > > > > > > If you recall we did 24867516f06d because 5775b843a619 was > > > > > supposed to have fixed it. > > > > > > > > >