Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp1142679rwd; Thu, 1 Jun 2023 11:02:07 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ5VdXqtdojQPOY5s/2W7dcMMvsHuhrpV7qWFo/HtIhNRRjZI1Kbu3U3yxeBeUcwwqTWlxmr X-Received: by 2002:a17:902:c950:b0:1ac:750e:33f4 with SMTP id i16-20020a170902c95000b001ac750e33f4mr99198pla.32.1685642527350; Thu, 01 Jun 2023 11:02:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1685642527; cv=none; d=google.com; s=arc-20160816; b=fAmh04YQwlEG2av7lFEGg45XXRcntZ7zrAXgqfOVTVg+eh18E5+FJ9y8Xe/1ce1/YB d129fnw1OLFGFfA6RT6xeeFQqb+GhxvKak6GUmfeNy6jiVY06+hjDiGROPiF+AgQb7lJ HTcSvXQVOC+Cf2Bi+KGxSd038D57WdPr0h7PgNmzcxcVm81x1vel+Mp0QyYs3SYJVtjT Mj1xM0LRC0+vKOshcf79PC7EKOtDr5csj1MU1T0wI6bCkgK466SvpF8B5p5c2zln16HH oqEsUSSy2TH7vCl106JJhecNqkZ4SWoNmDRkxo49VpE8LBm7xd3p1q3Gkgkk/qIVdyRj jkWA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=N7Mnlp0aSiXHT6Q09OS24+TgfPuSnXNbcRqOsgVtEPw=; b=KfkEyYwEa1QO5WFRaCEtGQnEF7K9i8qtoy3Dh41qqCDmNUFbHfOsfN9RfbRSUkNX04 x/wJFRbZTZuU3gqoLYLz2PYhNtSHQFxixrPy146OOmi+/ac2YoXon7p4cLAJC3u5Pdti SIAGlkhB9AZpCYQWz2TdhV5i/PdvvcrizwS1Q6RF9w824lyyGwLWwhSzPM3L6HhGcdML JkqwBT7nSuqFp62ohGoabkwPnh9Nm2lO1hdomiIbZMPGcY+skOvQ48pVBqYu+Qfc7g+H YpWmed//1eZNnNROvL17hk3Y/67XRjO09+/hMl2zKP3ExQ4DL1KW/p/eNd/BXYivPUUd 1ZFw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=Q+INa2sM; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id p16-20020a170902a41000b001ae014d8d03si3001565plq.432.2023.06.01.11.01.52; Thu, 01 Jun 2023 11:02:07 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=Q+INa2sM; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232763AbjFARTu (ORCPT + 99 others); Thu, 1 Jun 2023 13:19:50 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55342 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229459AbjFARTs (ORCPT ); Thu, 1 Jun 2023 13:19:48 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 15F32D1 for ; Thu, 1 Jun 2023 10:19:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1685639942; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=N7Mnlp0aSiXHT6Q09OS24+TgfPuSnXNbcRqOsgVtEPw=; b=Q+INa2sM34bfV+eCoGfw/XNLUJ0E9L0opS6NVo176wSK/ndS0P8HzTPeE4Tj0eYCSBlVyc SLsdvZnIjWOPsKMN0gofCTVzKDcHUCSUSEDufbYSwWvsf7jwG0BTW7ltdCntIzYESJ27eP V2XaIgjvL+HHIjApVvlB/cp+VmLDBPU= Received: from mail-lj1-f200.google.com (mail-lj1-f200.google.com [209.85.208.200]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-520-hJB1tOBSPoyyTtnFPiAGgQ-1; Thu, 01 Jun 2023 13:19:01 -0400 X-MC-Unique: hJB1tOBSPoyyTtnFPiAGgQ-1 Received: by mail-lj1-f200.google.com with SMTP id 38308e7fff4ca-2b04d5ed394so1930921fa.1 for ; Thu, 01 Jun 2023 10:19:00 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1685639939; x=1688231939; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=N7Mnlp0aSiXHT6Q09OS24+TgfPuSnXNbcRqOsgVtEPw=; b=kHYMYVM6YcBjwuHnU6lKFytNmu4kQ0tM+OQ1E2pkG4m49XGuUT/6MMEC3Sci11UgMi dgcX1xHadC105E+5124ZEonY+thY7NPbhcRdVRrMJitobg+6mneC70VVdCTmZrdSPiYc bfs8aPX4liqTg9OQWOFoj/Dk0/mbp+rFpNhpuIkKv/U7Y5DWPjAMenQdtU9YV2mPLc1d L6FSSj2Nnn40TjZyZPP3wo1W7aM89NZ+Vmm2JN3Z6WBiB6f/g39Wb15enwN6IQ9h68xY 5P/D65StoGBv8xMyOJjvAW79M3XiEdpwTquk7j1dXiK8hxdo0lh6H+0ddGMzvvqgNe0T 9whw== X-Gm-Message-State: AC+VfDynGqOZzUMKPMCu0fDxc3+ZelCCbYS2p/HSectbGj8BF7n8OyxA Dbf7gTkKF4w7C32XLS741qIOArTbjJXNgWW9sGoJhoILnDy09a7oAusHzF52l9o68Z6keSrSPU5 aOEUGHv+H5YTwa/8KFn6DR6xfPjUqYnwKU7rw9RTM X-Received: by 2002:a2e:8e71:0:b0:2b1:a85a:6921 with SMTP id t17-20020a2e8e71000000b002b1a85a6921mr403200ljk.0.1685639939548; Thu, 01 Jun 2023 10:18:59 -0700 (PDT) X-Received: by 2002:a2e:8e71:0:b0:2b1:a85a:6921 with SMTP id t17-20020a2e8e71000000b002b1a85a6921mr403186ljk.0.1685639939234; Thu, 01 Jun 2023 10:18:59 -0700 (PDT) MIME-Version: 1.0 References: <168471337231.1913606.15905047692536779158.reportbug@xps> <09e24386-de63-e9e9-9e7f-5d04bad62d83@amd.com> <8537d965-ddf4-7f45-6459-d5acf520376e@amd.com> In-Reply-To: From: Karol Herbst Date: Thu, 1 Jun 2023 19:18:48 +0200 Message-ID: Subject: Re: Regression from "ACPI: OSI: Remove Linux-Dell-Video _OSI string"? (was: Re: Bug#1036530: linux-signed-amd64: Hard lock up of system) To: "Limonciello, Mario" Cc: Nick Hastings , Lyude Paul , Lukas Wunner , Salvatore Bonaccorso , "1036530@bugs.debian.org" <1036530@bugs.debian.org>, "Rafael J. Wysocki" , Len Brown , "linux-acpi@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "regressions@lists.linux.dev" Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Spam-Status: No, score=-2.3 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jun 1, 2023 at 6:54=E2=80=AFPM Limonciello, Mario wrote: > > [AMD Official Use Only - General] > > > -----Original Message----- > > From: Karol Herbst > > Sent: Thursday, June 1, 2023 11:33 AM > > To: Limonciello, Mario > > Cc: Nick Hastings ; Lyude Paul > > ; Lukas Wunner ; Salvatore > > Bonaccorso ; 1036530@bugs.debian.org; Rafael J. > > Wysocki ; Len Brown ; linux- > > acpi@vger.kernel.org; linux-kernel@vger.kernel.org; > > regressions@lists.linux.dev > > Subject: Re: Regression from "ACPI: OSI: Remove Linux-Dell-Video _OSI > > string"? (was: Re: Bug#1036530: linux-signed-amd64: Hard lock up of sys= tem) > > > > On Thu, Jun 1, 2023 at 6:18=E2=80=AFPM Limonciello, Mario > > wrote: > > > > > > +Lyude, Lukas, Karol > > > > > > On 5/31/2023 6:40 PM, Nick Hastings wrote: > > > > Hi, > > > > > > > > * Nick Hastings [230530 16:01]: > > > >> * Mario Limonciello [230530 13:00]: > > > > > > > >>> As you're actually loading nouveau, can you please try > > nouveau.runpm=3D0 on > > > >>> the kernel command line? > > > >> I'm not intentionally loading it. This machine also has intel grap= hics > > > >> which is what I prefer. Checking my > > > >> /etc/modprobe.d/blacklist-nvidia-nouveau.conf > > > >> I see: > > > >> > > > >> blacklist nvidia > > > >> blacklist nvidia-drm > > > >> blacklist nvidia-modeset > > > >> blacklist nvidia-uvm > > > >> blacklist ipmi_msghandler > > > >> blacklist ipmi_devintf > > > >> > > > >> So I thought I had blacklisted it but it seems I did not. Since I = do not > > > >> want to use it maybe it is better to check if the lock up occurs w= ith > > > >> nouveau blacklisted. I will try that now. > > > > I blacklisted nouveau and booted into a 6.1 kernel: > > > > % uname -a > > > > Linux xps 6.1.0-9-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.27-1 > > (2023-05-08) x86_64 GNU/Linux > > > > > > > > It has been running without problems for nearly two days now: > > > > % uptime > > > > 08:34:48 up 1 day, 16:22, 2 users, load average: 1.33, 1.26, 1.= 27 > > > > > > > > Regards, > > > > > > > > Nick. > > > > > > Thanks, that makes a lot more sense now. > > > > > > Nick, Can you please test if nouveau works with runtime PM in the > > > latest 6.4-rc? > > > > > > If it works in 6.4-rc, there are probably nouveau commits that need > > > to be backported to 6.1 LTS. > > > > > > If it's still broken in 6.4-rc, I believe you should file a bug: > > > > > > https://gitlab.freedesktop.org/drm/nouveau/ > > > > > > > > > Lyude, Lukas, Karol > > > > > > This thread is in relation to this commit: > > > > > > 24867516f06d ("ACPI: OSI: Remove Linux-Dell-Video _OSI string") > > > > > > Nick has found that runtime PM is *not* working for nouveau. > > > > > > > keep in mind we have a list of PCIe controllers where we apply a > > workaround: > > https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree= /drivers > > /gpu/drm/nouveau/nouveau_drm.c?h=3Dv6.4-rc4#n682 > > > > And I suspect there might be one or two more IDs we'll have to add > > there. Do we have any logs? > > There's some archived onto the distro bug. Search this page for "journal= ctl.log.gz" > https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D1036530 > interesting.. It seems to be the same controller used here. I wonder if the pci topology is different or if the workaround is applied at all. But yeah, I'd kinda love for somebody with better knowledge on all of this to figure out what exactly is going wrong, but everytime this gets investigated Intel says "our hardware has no bugs", the ACPI folks dig for months and find nothing and I end up figuring out some weirdo workaround I don't understand. And apparently also nobody is able to hand out docs explaining in detail how that runtime suspend/resume stuff is supposed to work. I have a Dell XPS 9560 where the added workaround in nouveau fixed the problem and I know it's fixed on a bunch of other systems. So if anybody is willing to publish docs and/or actually debug it with domain knowledge, please go ahead. > > And could anybody test if adding the > > controller in play here does resolve the problem? > > > > > If you recall we did 24867516f06d because 5775b843a619 was > > > supposed to have fixed it. > > > >