Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp1405154ybz; Thu, 16 Apr 2020 08:35:48 -0700 (PDT) X-Google-Smtp-Source: APiQypLw4kZ3PACar+ZTPER8Js0TmnixUc7QL+ejuiz5BMJYQxITfx12HwiF/TlaBX2PwUjGbs0N X-Received: by 2002:a17:906:60d4:: with SMTP id f20mr10096340ejk.209.1587051348163; Thu, 16 Apr 2020 08:35:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1587051348; cv=none; d=google.com; s=arc-20160816; b=tbo4TPgcmMdPOtfu9Mu0vurmAamFizTr2p05MORYHqf1zjVnK0hXfN8UTF2zAfXq35 Vteo+Jz/0kGMR058+qJIqXu9BEoKAoCKt4lJ+aUbaBiBEl9b22DVlVtRIiPZu8x8YB5v iB6Jot17TJXbFOsVkUDTx4Nrtf5CK/jbaCnEjCy5lbMRIuFaMsmv36U6pSjeiLvrcZpK uc9oWo6In+2hfkLJAl6ONRPZpwlwmkFbrhELEYyGNKG7CDZ7DiG8vBqwUCwzkWoim8eQ P4FHNcqFkoftke0EJSweX0P5AdrIgzzqdCMi8JLBnB8b+IfEJnbanngZHDYuVIuJvwvM 5fqw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=zeLuMGm0tLcTdibFfwCCxD6lPxPtY/yRBtq80VdTxbQ=; b=Vtxl+lWl/OfCVG8J7lU6taAGnW2JotDXdT7jiYAK2vMEPgMeWlqKTW3XWiBC/B9ax9 ho2C5hwsTIyaYgm0IZ264PBYeBB2bgD/jExLZw2J+qzpLsXCNath3b9X3DcEzGAWBFrj CrlFU5fDf3x09BxY+3tsAbZ/D1rdVoU2ghxCwNFZhGkg/N+nwBuSYXXdCYmN6l9yhmSR /HZgDkhMRe3FNnF6BBWHWWKiDwRS/K0ZROJMrR8sLks693XSZNFbEo1MZNmKgDB/i31U IrLFKQpyJX2Jztx9Jqge5y5vwf4GINL+LpmO2lNOzuoMUzUiFKChlkZlFPxS7qzpAAfv SUyg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=cglOTA1M; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id a15si4693441edk.298.2020.04.16.08.35.23; Thu, 16 Apr 2020 08:35:48 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=cglOTA1M; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2504349AbgDPPbO (ORCPT + 99 others); Thu, 16 Apr 2020 11:31:14 -0400 Received: from mail.kernel.org ([198.145.29.99]:56248 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730143AbgDPNnC (ORCPT ); Thu, 16 Apr 2020 09:43:02 -0400 Received: from localhost (83-86-89-107.cable.dynamic.v4.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id A226D2223C; Thu, 16 Apr 2020 13:43:01 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1587044582; bh=3HsCAiHIhmh9HwXV4C3mMYFFAkRIIvi1FdCduwndbkA=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=cglOTA1MWtUT11ixWZEXacF9cCARhP3tQLsFmur4KWHPliG0R7mYL7jqHGTRjm1Sb qKoo4oLRSkqD8likfKZXRwPh7FIZCtHyy2ODwN53Gt4bDAwQoK1frJu/drk2vBgyAd BAJbClYD+4T2xNeSWTE4mKRKsQmmnc8wjNxYCrEA= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Yintian Tao , =?UTF-8?q?Christian=20K=C3=B6nig?= , Alex Deucher , Sasha Levin Subject: [PATCH 5.4 019/232] drm/scheduler: fix rare NULL ptr race Date: Thu, 16 Apr 2020 15:21:53 +0200 Message-Id: <20200416131318.719891976@linuxfoundation.org> X-Mailer: git-send-email 2.26.1 In-Reply-To: <20200416131316.640996080@linuxfoundation.org> References: <20200416131316.640996080@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Yintian Tao [ Upstream commit 77bb2f204f1f0a53a602a8fd15816d6826212077 ] There is one one corner case at dma_fence_signal_locked which will raise the NULL pointer problem just like below. ->dma_fence_signal ->dma_fence_signal_locked ->test_and_set_bit here trigger dma_fence_release happen due to the zero of fence refcount. ->dma_fence_put ->dma_fence_release ->drm_sched_fence_release_scheduled ->call_rcu here make the union fled “cb_list” at finished fence to NULL because struct rcu_head contains two pointer which is same as struct list_head cb_list Therefore, to hold the reference of finished fence at drm_sched_process_job to prevent the null pointer during finished fence dma_fence_signal [ 732.912867] BUG: kernel NULL pointer dereference, address: 0000000000000008 [ 732.914815] #PF: supervisor write access in kernel mode [ 732.915731] #PF: error_code(0x0002) - not-present page [ 732.916621] PGD 0 P4D 0 [ 732.917072] Oops: 0002 [#1] SMP PTI [ 732.917682] CPU: 7 PID: 0 Comm: swapper/7 Tainted: G OE 5.4.0-rc7 #1 [ 732.918980] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.8.2-0-g33fbe13 by qemu-project.org 04/01/2014 [ 732.920906] RIP: 0010:dma_fence_signal_locked+0x3e/0x100 [ 732.938569] Call Trace: [ 732.939003] [ 732.939364] dma_fence_signal+0x29/0x50 [ 732.940036] drm_sched_fence_finished+0x12/0x20 [gpu_sched] [ 732.940996] drm_sched_process_job+0x34/0xa0 [gpu_sched] [ 732.941910] dma_fence_signal_locked+0x85/0x100 [ 732.942692] dma_fence_signal+0x29/0x50 [ 732.943457] amdgpu_fence_process+0x99/0x120 [amdgpu] [ 732.944393] sdma_v4_0_process_trap_irq+0x81/0xa0 [amdgpu] v2: hold the finished fence at drm_sched_process_job instead of amdgpu_fence_process v3: resume the blank line Signed-off-by: Yintian Tao Reviewed-by: Christian König Signed-off-by: Alex Deucher Signed-off-by: Sasha Levin --- drivers/gpu/drm/scheduler/sched_main.c | 2 ++ 1 file changed, 2 insertions(+) diff --git a/drivers/gpu/drm/scheduler/sched_main.c b/drivers/gpu/drm/scheduler/sched_main.c index 2af64459b3d77..dfb29e6eeff1e 100644 --- a/drivers/gpu/drm/scheduler/sched_main.c +++ b/drivers/gpu/drm/scheduler/sched_main.c @@ -627,7 +627,9 @@ static void drm_sched_process_job(struct dma_fence *f, struct dma_fence_cb *cb) trace_drm_sched_process_job(s_fence); + dma_fence_get(&s_fence->finished); drm_sched_fence_finished(s_fence); + dma_fence_put(&s_fence->finished); wake_up_interruptible(&sched->wake_up_worker); } -- 2.20.1