Received: by 2002:a05:6358:5282:b0:b5:90e7:25cb with SMTP id g2csp3725084rwa; Tue, 23 Aug 2022 09:06:11 -0700 (PDT) X-Google-Smtp-Source: AA6agR5k8PLTBZwTqTvXTmRqcl9QH7VCsys+05K9bYGK38l+0YcyL9jPZWmBF8QqntOUnRvBLhYc X-Received: by 2002:a05:6402:5ca:b0:43b:6e01:482c with SMTP id n10-20020a05640205ca00b0043b6e01482cmr4433781edx.189.1661270770997; Tue, 23 Aug 2022 09:06:10 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1661270770; cv=none; d=google.com; s=arc-20160816; b=S6iYhC7Kq8pys78pv6RwtdphFWUujzaInD51vXYCr9weTkhu3MYermqq+kEMtsYDCI qo0me7AAczDcn2EWpUiZ6nHfax+4XK1lWbnZLFNDXtleiajR1whLVD2mrvnpQtmMWSxA stSNFZoKIzx0rfQBb8u72OUFZQHNpcPAvsOHNCkmnvYNQcCjgP7y9ASs4MFVT78c2y20 RW6muacDnV2T8hud9yPYv81cdVhmY2L13doVWR8cmHKByyJoYmXB1nXycxIj7otwE/8q W4abgi2xs61M6s2dUoDVYaIdxazt9IlMzSVyCs5GmzU8fhbXg0MhSK6CSos5QXqb3slC gglg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:user-agent:references:in-reply-to :subject:cc:to:from:message-id:date:dkim-signature:dkim-signature; bh=9o5iSdVTRwCUBgwkGJPpuGhtLCLbWfRwPX9fK6q7WKU=; b=Qcy76IefuHvQTscB6t7H1KZrv4+L7oExk8Z8ioCROSiF70cv1LPTsj1U2+29rntaT2 U4WBt+dFaPZUwT3wy+7kwlYTuVnCErmzzBJ4Sj1LeUrvMDNKsm3mo2PiYqLMPbpuKT51 kF//DgDkvPv0vvFVvs8PeDVZRuLzKOYT8zOJSLCXTSZHGMKJUk9ES55xjkfdsx+Uogcx 0iHUJWiJR7GwPERjIl43NuAXAWplC4JraY9RXwXGXEXfNA/i/2CNVX2xKjVuHOx96rF6 2qvuz7ViGiMAaLPfCt+eL7G41VEWlxTaqAKcAeLo1XXt5qe+xLbzGO7AUGBP+hRBWFmd C4uQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.de header.s=susede2_rsa header.b=oQtudiJd; dkim=neutral (no key) header.i=@suse.de header.s=susede2_ed25519; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=suse.de Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id jz2-20020a170906bb0200b0072b11a2a2afsi101824ejb.57.2022.08.23.09.05.44; Tue, 23 Aug 2022 09:06:10 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.de header.s=susede2_rsa header.b=oQtudiJd; dkim=neutral (no key) header.i=@suse.de header.s=susede2_ed25519; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=suse.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S243807AbiHWPrH (ORCPT + 99 others); Tue, 23 Aug 2022 11:47:07 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43238 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S243751AbiHWPqv (ORCPT ); Tue, 23 Aug 2022 11:46:51 -0400 Received: from smtp-out1.suse.de (smtp-out1.suse.de [IPv6:2001:67c:2178:6::1c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 71CE02AB3C2 for ; Tue, 23 Aug 2022 04:47:08 -0700 (PDT) Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by smtp-out1.suse.de (Postfix) with ESMTPS id F3E5A2249F; Tue, 23 Aug 2022 11:46:36 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_rsa; t=1661255197; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=9o5iSdVTRwCUBgwkGJPpuGhtLCLbWfRwPX9fK6q7WKU=; b=oQtudiJdms3SmnDi5sa5DnPCU57m9UeixY3Xdmh1Mlfw9rwYTdx/Y8QmKWs3oKC2JvuWoT v4zGdfNnP4hGJSv0n8/1oiWEN/6i74OzjzaMI0EMmx4XlhxIjoqET9pLqPB2WWfet49QZ2 J4iy10FMb+nwaopI+K2MZjLYls5WSHo= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_ed25519; t=1661255197; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=9o5iSdVTRwCUBgwkGJPpuGhtLCLbWfRwPX9fK6q7WKU=; b=e/lU9SYExuOuyeB4j/bQaW92vLIhHMDHPPo1CUTPDe64WM2ACKDI0pSDprkbVkVQB80aZ/ U4Ku1BAYIl5+jlCw== Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by imap2.suse-dmz.suse.de (Postfix) with ESMTPS id BA6D713AB7; Tue, 23 Aug 2022 11:46:36 +0000 (UTC) Received: from dovecot-director2.suse.de ([192.168.254.65]) by imap2.suse-dmz.suse.de with ESMTPSA id ClWrLBy+BGOlMwAAMHmgww (envelope-from ); Tue, 23 Aug 2022 11:46:36 +0000 Date: Tue, 23 Aug 2022 13:46:36 +0200 Message-ID: <87ilmjqj1f.wl-tiwai@suse.de> From: Takashi Iwai To: Jason Gunthorpe Cc: Lu Baolu , Joerg Roedel , Greg Kroah-Hartman , Bjorn Helgaas , Robin Murphy , Eric Auger , regressions@lists.linux.dev, linux-kernel@vger.kernel.org Subject: Re: [REGRESSION 5.19.x] AMD HD-audio devices missing on 5.19 In-Reply-To: <87h723sdde.wl-tiwai@suse.de> References: <874jy4cqok.wl-tiwai@suse.de> <20220823010021.GA5967@nvidia.com> <87h723sdde.wl-tiwai@suse.de> User-Agent: Wanderlust/2.15.9 (Almost Unreal) Emacs/27.2 Mule/6.0 MIME-Version: 1.0 (generated by SEMI-EPG 1.14.7 - "Harue") Content-Type: text/plain; charset=US-ASCII X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 23 Aug 2022 08:06:05 +0200, Takashi Iwai wrote: > > On Tue, 23 Aug 2022 03:00:21 +0200, > Jason Gunthorpe wrote: > > > > On Mon, Aug 22, 2022 at 04:12:59PM +0200, Takashi Iwai wrote: > > > Hi, > > > > > > we've received regression reports about the missing HD-audio devices > > > on AMD platforms, and this turned out to be caused by the commit > > > 512881eacfa72c2136b27b9934b7b27504a9efc2 > > > bus: platform,amba,fsl-mc,PCI: Add device DMA ownership management > > > > > > The details are found in openSUSE bugzilla: > > > https://bugzilla.suse.com/show_bug.cgi?id=1202492 > > > > > > The problem seems to be that HD-audio (both onboard analog and HDMI) > > > PCI devices are assigned to the same IOMMU group as AMD graphics PCI > > > device, and once after the AMDGPU is initialized beforehand, those > > > audio devices can't be probed since iommu_device_use_default_domain() > > > returns -EBUSY. > > > > Can you describe exactly what drivers are involved in this? If it is > > the above commit then several devices are sharing an iommu group and > > one of them (well, the only one already attached, I suppose) has made > > the group unsharable. > > > > With grep I don't see an obvious place where the AMDGPU driver would > > mess with the iommu configuration, so I have no guess. > > I have also no concrete clue, either :) > At least, drivers/gpu/drm/amd/amdkfd/kfd_iommu.c calls > amd_iommu_init_device(), and this invokes iommu_attach_group(), which > may change group->domain. But it was just my wild guess, and it might > be others, indeed. > > > It would be good to have some debugging to confirm if it is > > group->owner (should be impossible, suggests memory corruption if it > > is) or group->domain != group->default_domain. > > > > Most likely it is the later, but I can't see how that could happen on > > a system like this.. There is no obvious manipulation in AMDGPU, for > > instance. > > > > So debugging to find the backtrace for exactly when > > group->domain != group->default_domain > > Occurs for the troubled group would be necessary. > > OK, will try to build a test kernel with some debug prints and ask the > reporters. It may take some time. It was tested now and confirmed that the call path is via AMDGPU, as expected: amdgpu_pci_probe -> amdgpu_driver_load_kms -> amdgpu_device_init -> amdgpu_amdkfd_device_init -> kgd2kfd_device_init -> kgd2kfd_resume_iommu -> kfd_iommu_resume -> amd_iommu_init_device -> iommu_attach_group -> __iommu_attach_group At first AMDGPU driver is probed, and the iommu_attach_group() call above changes the assigned group->domain. Afterwards, when HD-audio devices are probed, it fails because: - Both HD-audio PCI devices belong to the very same IOMMU group as the AMD graphics PCI device - PCI core calls iommu_device_use_default_domain() and the check fails there because group->domain != group->default_domain Takashi