From: Mike Lothian
Date: Tue, 19 Jul 2022 09:40:46 +0100
Subject: Re: Command "clinfo" causes BUG: kernel NULL pointer dereference, address: 0000000000000008 on driver amdgpu
To: "Chen, Guchun"
Cc: Mikhail Gavrilov, amd-gfx list, Linux List Kernel Mailing, Christian König
X-Mailing-List: linux-kernel@vger.kernel.org

I was told that this patch replaces the patch you mentioned
https://patchwork.freedesktop.org/series/106078/
and it's the one that'll hopefully land in Linus's tree.

On Tue, 19 Jul 2022 at 03:33, Chen, Guchun wrote:
>
> Patch https://patchwork.freedesktop.org/series/106024/ should fix this.
>
> Regards,
> Guchun
>
> -----Original Message-----
> From: amd-gfx On Behalf Of Mikhail Gavrilov
> Sent: Tuesday, July 19, 2022 7:50 AM
> To: amd-gfx list; Linux List Kernel Mailing; Christian König
> Subject: Command "clinfo" causes BUG: kernel NULL pointer dereference, address: 0000000000000008 on driver amdgpu
>
> Hi guys I continue testing 5.19 rc7 and found the bug.
> Command "clinfo" causes BUG: kernel NULL pointer dereference, address:
> 0000000000000008 on driver amdgpu.
>
> Here is trace:
> [ 1320.203332] BUG: kernel NULL pointer dereference, address: 0000000000000008
> [ 1320.203338] #PF: supervisor read access in kernel mode
> [ 1320.203340] #PF: error_code(0x0000) - not-present page
> [ 1320.203341] PGD 0 P4D 0
> [ 1320.203344] Oops: 0000 [#1] PREEMPT SMP NOPTI
> [ 1320.203346] CPU: 5 PID: 1226 Comm: kworker/5:2 Tainted: G W L -------- --- 5.19.0-0.rc7.53.fc37.x86_64+debug #1
> [ 1320.203348] Hardware name: System manufacturer System Product Name/ROG STRIX X570-I GAMING, BIOS 4403 04/27/2022
> [ 1320.203350] Workqueue: events delayed_fput
> [ 1320.203354] RIP: 0010:dma_resv_add_fence+0x5a/0x2d0
> [ 1320.203358] Code: 85 c0 0f 84 43 02 00 00 8d 50 01 09 c2 0f 88 47 02 00 00 8b 15 73 10 99 01 49 8d 45 70 48 89 44 24 10 85 d2 0f 85 05 02 00 00 <49> 8b 44 24 08 48 3d 80 93 53 97 0f 84 06 01 00 00 48 3d 20 93 53
> [ 1320.203360] RSP: 0018:ffffaf4cc1adfc68 EFLAGS: 00010246
> [ 1320.203362] RAX: ffff976660408208 RBX: ffff975f545f2000 RCX: 0000000000000000
> [ 1320.203363] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff976660408198
> [ 1320.203364] RBP: ffff976806f6e800 R08: 0000000000000000 R09: 0000000000000000
> [ 1320.203366] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000000
> [ 1320.203367] R13: ffff976660408198 R14: ffff975f545f2000 R15: ffff976660408198
> [ 1320.203368] FS: 0000000000000000(0000) GS:ffff976de1200000(0000) knlGS:0000000000000000
> [ 1320.203370] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 1320.203371] CR2: 0000000000000008 CR3: 00000007fb31c000 CR4: 0000000000350ee0
> [ 1320.203372] Call Trace:
> [ 1320.203374]
> [ 1320.203378] amdgpu_amdkfd_gpuvm_destroy_cb+0x5d/0x1e0 [amdgpu]
> [ 1320.203516] amdgpu_vm_fini+0x2f/0x4e0 [amdgpu]
> [ 1320.203625] ? mutex_destroy+0x21/0x50
> [ 1320.203629] amdgpu_driver_postclose_kms+0x1da/0x2b0 [amdgpu]
> [ 1320.203734] drm_file_free.part.0+0x20d/0x260
> [ 1320.203738] drm_release+0x6a/0x120
> [ 1320.203741] __fput+0xab/0x270
> [ 1320.203743] delayed_fput+0x1f/0x30
> [ 1320.203745] process_one_work+0x2a0/0x600
> [ 1320.203749] worker_thread+0x4f/0x3a0
> [ 1320.203751] ? process_one_work+0x600/0x600
> [ 1320.203753] kthread+0xf5/0x120
> [ 1320.203755] ? kthread_complete_and_exit+0x20/0x20
> [ 1320.203758] ret_from_fork+0x22/0x30
> [ 1320.203764]
>
> Full kernel log is here:
> https://pastebin.com/EeKh2LEr
>
> And one hour later after a lot of messages "BUG: workqueue lockup" GPU completely hung.
>
> I will be glad to test patches that fix this bug.
>
> --
> Best Regards,
> Mike Gavrilov.