Message-ID: <7a5123b0-6370-59dc-f0c2-8be5b370d9ba@molgen.mpg.de>
Date: Mon, 25 Oct 2021 12:25:45 +0200
From: Paul Menzel
Subject: I got an IOMMU IO page fault. What to do now?
To: Jörg Rödel, Suravee Suthikulpanit
Cc: iommu@lists.linux-foundation.org, Alex Deucher, Christian König,
    Xinhui Pan, amd-gfx@lists.freedesktop.org, Thomas Gleixner,
    Ingo Molnar, Borislav Petkov, x86@kernel.org, LKML,
    it+linux-iommu@molgen.mpg.de
X-Mailing-List: linux-kernel@vger.kernel.org

Dear Linux folks,


On a Dell OptiPlex 5055, Linux 5.10.24 logged the IOMMU messages below.
(The GPU hang in amdgpu issue #1762 [1] might be related.)

    $ lspci -nn -s 05:00.0
    05:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Oland [Radeon HD 8570 / R7 240/340 OEM] [1002:6611] (rev 87)
    $ dmesg
    […]
    [6318399.745242] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xfffffff0c0 flags=0x0020]
    [6318399.757283] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xfffffff7c0 flags=0x0020]
    [6318399.769154] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xffffffe0c0 flags=0x0020]
    [6318399.780913] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xfffffffec0 flags=0x0020]
    [6318399.792734] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xffffffe5c0 flags=0x0020]
    [6318399.804309] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xffffffd0c0 flags=0x0020]
    [6318399.816091] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xffffffecc0 flags=0x0020]
    [6318399.827407] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xffffffd3c0 flags=0x0020]
    [6318399.838708] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xffffffc0c0 flags=0x0020]
    [6318399.850029] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000c address=0xffffffdac0 flags=0x0020]
    [6318399.861311] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffffc1c0 flags=0x0020]
    [6318399.872044] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffffc8c0 flags=0x0020]
    [6318399.882797] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffffb0c0 flags=0x0020]
    [6318399.893655] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffffcfc0 flags=0x0020]
    [6318399.904445] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffffb6c0 flags=0x0020]
    [6318399.915222] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffffa0c0 flags=0x0020]
    [6318399.925931] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffffbdc0 flags=0x0020]
    [6318399.936691] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffffa4c0 flags=0x0020]
    [6318399.947479] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffff90c0 flags=0x0020]
    [6318399.958270] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=0x000c address=0xffffffabc0 flags=0x0020]

As this is not reproducible, how would debugging go? (The system was
rebooted in the meantime.) What options should be enabled so that the
required information is logged next time, or what commands should I run
while the system is still in that state, so that the bug (driver,
userspace, …) can be pinpointed and fixed?


Kind regards,

Paul


[1]: https://gitlab.freedesktop.org/drm/amd/-/issues/1762
     "Oland [Radeon HD 8570 / R7 240/340 OEM]: GPU hang"
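
For triage, the fault lines in the dmesg excerpt above could be reduced to a short summary before attaching them to a report. What follows is only a minimal Python sketch and not part of the original report: the names `parse_faults` and `summarize` are hypothetical, and the regular expression assumes nothing beyond the two line shapes visible in the excerpt (device printed before "AMD-Vi:", or as a `device=` field inside the brackets). It does not interpret the `flags` value.

```python
import re
from collections import Counter

# Assumption: only the two line shapes from the excerpt are handled.
#   "[ts] amdgpu 0000:05:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=... ]"
#   "[ts] AMD-Vi: Event logged [IO_PAGE_FAULT device=05:00.0 domain=... ]"
FAULT_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+"
    r"(?:\S+\s+(?P<dev_prefix>[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-7]):\s+)?"
    r"AMD-Vi: Event logged \[IO_PAGE_FAULT\s+"
    r"(?:device=(?P<dev_field>[0-9a-f]{2}:[0-9a-f]{2}\.[0-7])\s+)?"
    r"domain=(?P<domain>0x[0-9a-f]+)\s+"
    r"address=(?P<address>0x[0-9a-f]+)\s+"
    r"flags=(?P<flags>0x[0-9a-f]+)\]"
)


def parse_faults(text):
    """Return one dict per IO_PAGE_FAULT event found in text."""
    faults = []
    for m in FAULT_RE.finditer(text):
        dev = m.group("dev_prefix") or m.group("dev_field") or ""
        # Normalize "0000:05:00.0" to the short "05:00.0" form.
        if dev.count(":") == 2:
            dev = dev.split(":", 1)[1]
        faults.append({
            "timestamp": float(m.group("ts")),
            "device": dev,
            "domain": int(m.group("domain"), 16),
            "address": int(m.group("address"), 16),
            "flags": int(m.group("flags"), 16),
        })
    return faults


def summarize(faults):
    """Count events per (device, domain, flags) and report the address range."""
    if not faults:
        return {"count": 0, "groups": Counter(), "min_address": None, "max_address": None}
    groups = Counter((f["device"], f["domain"], f["flags"]) for f in faults)
    addresses = [f["address"] for f in faults]
    return {
        "count": len(faults),
        "groups": groups,
        "min_address": min(addresses),
        "max_address": max(addresses),
    }


if __name__ == "__main__":
    import sys

    summary = summarize(parse_faults(sys.stdin.read()))
    print("events:", summary["count"])
    for (device, domain, flags), n in summary["groups"].items():
        print(f"  device={device} domain={domain:#06x} flags={flags:#06x}: {n}")
    if summary["count"]:
        print(f"address range: {summary['min_address']:#x} .. {summary['max_address']:#x}")
```

Saved under a file name of your choice, the script reads the log from standard input, for example by piping the output of `dmesg` into it.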