Received: by 2002:a05:6a10:d5a5:0:0:0:0 with SMTP id gn37csp3793051pxb; Mon, 4 Oct 2021 09:42:36 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxBews09tZkNhVpTyZVTVSXq81ufhoyEdB9G+7aeiVPS6Q9/HYPH7zWjf7IZIunlXDCgFnX X-Received: by 2002:a17:906:7a50:: with SMTP id i16mr7712240ejo.507.1633365756544; Mon, 04 Oct 2021 09:42:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1633365756; cv=none; d=google.com; s=arc-20160816; b=EXqLdbyJpDyWH4qOLBN0UV6i1yKQ4Hnl3HZCakA7lu32Z0cBm+KE4M2vrm6FPI3CqD HPTIObzcTt6/nqMrNCOg7HY8PYSdd7eu2U5mp156N8X3csXYty9PJBVgtNtEctFeKQhJ CHM71mN9soBQ2hsDA9M5ea/UX6FQQho7hZXULdCCblLXwm2GY0sod+ivSd0ULMU/RGJY tSzoNG5a/7rL1/HUhtAN9USmKf0RggZ1Lh+2caGAasjY+wt49vESzotRVklHkLPKkPCr UaP/dcsqZnFSZjllM1oEMJlzUGXWgWhZdPZKyStLC6b7FxBDHI94ttJUMrSrJKnc17KI ENvQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=ZwPHdapDFAJJDOTnSnmGDO5NdGbnIm2NGt0Od2TECpI=; b=U0g5d+2owvWBYcbzgwI/4JjTqXY71Qe12WOr6tGBsm2jX8HlaL6jNmjPZBR+sjx037 oj1kTHdn0tEo1/5mr0q15gPoH2NSu1P6kNkb2E4ryfdfLnZtwpLTNkOQCg7yD7TQmFjE XzjlLY6S2JZg4uwMhyR83LlY5nm4Ea3Ea+nkbTov2kuZ3tzSViby/4tpCoE8IqUB2Yrh GsAodZk5RQkQIAJ8eg8Ye0zeELHsR4+JtPoYJpGSCo3FSFFDQssrXoQe0T55kQI5s5qh jB4LYZWrEj0CYPw6fICFtqDDiWgLNVgQxj/AbIfPyi2EuJ1tLR45W9WEeay0HFXXQpVK WvFQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=a6tw3f3N; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id 21si19698732eji.148.2021.10.04.09.42.08; Mon, 04 Oct 2021 09:42:36 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=a6tw3f3N; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238249AbhJDNhu (ORCPT + 99 others); Mon, 4 Oct 2021 09:37:50 -0400 Received: from mail.kernel.org ([198.145.29.99]:48040 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S237980AbhJDNfo (ORCPT ); Mon, 4 Oct 2021 09:35:44 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 5A0CB619E5; Mon, 4 Oct 2021 13:16:13 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linuxfoundation.org; s=korg; t=1633353373; bh=Y3zaoMuK9r5z71WMDMDg7pYrYJIYFgv8tbIWFfgPS7M=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=a6tw3f3NAERHIokrWocDpI0FZmkxoAfamEe1MCpw+b90gVqUZ/ZzJWxaqLf1Qw5ry 1BBKbfzeA7urqND8Rn8FNmQsLINL/U7h0Edn/9G9pvEtBArnX0DgqHhpdjOew9z3dH +eqFsdVxVFIhREoLfVjanGI9iOHYQN/twVQy2Wqs= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Matthew Auld , Michael Mason , Daniel Vetter , Daniel Vetter , Jani Nikula , Sasha Levin Subject: [PATCH 5.14 104/172] drm/i915/request: fix early tracepoints Date: Mon, 4 Oct 2021 14:52:34 +0200 Message-Id: <20211004125048.339352055@linuxfoundation.org> X-Mailer: git-send-email 2.33.0 In-Reply-To: <20211004125044.945314266@linuxfoundation.org> References: <20211004125044.945314266@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Matthew Auld [ Upstream commit c83ff0186401169eb27ce5057d820b7a863455c3 ] Currently we blow up in trace_dma_fence_init, when calling into get_driver_name or get_timeline_name, since both the engine and context might be NULL(or contain some garbage address) in the case of newly allocated slab objects via the request ctor. Note that we also use SLAB_TYPESAFE_BY_RCU here, which allows requests to be immediately freed, but delay freeing the underlying page by an RCU grace period. With this scheme requests can be re-allocated, at the same time as they are also being read by some lockless RCU lookup mechanism. In the ctor case, which is only called for new slab objects(i.e allocate new page and call the ctor for each object) it's safe to reset the context/engine prior to calling into dma_fence_init, since we can be certain that no one is doing an RCU lookup which might depend on peeking at the engine/context, like in active_engine(), since the object can't yet be externally visible. In the recycled case(which might also be externally visible) the request refcount always transitions from 0->1 after we set the context/engine etc, which should ensure it's valid to dereference the engine for example, when doing an RCU list-walk, so long as we can also increment the refcount first. If the refcount is already zero, then the request is considered complete/released. If it's non-zero, then the request might be in the process of being re-allocated, or potentially still in flight, however after successfully incrementing the refcount, it's possible to carefully inspect the request state, to determine if the request is still what we were looking for. Note that all externally visible requests returned to the cache must have zero refcount. One possible fix then is to move dma_fence_init out from the request ctor. Originally this was how it was done, but it was moved in: commit 855e39e65cfc33a73724f1cc644ffc5754864a20 Author: Chris Wilson Date: Mon Feb 3 09:41:48 2020 +0000 drm/i915: Initialise basic fence before acquiring seqno where it looks like intel_timeline_get_seqno() relied on some of the rq->fence state, but that is no longer the case since: commit 12ca695d2c1ed26b2dcbb528b42813bd0f216cfc Author: Maarten Lankhorst Date: Tue Mar 23 16:49:50 2021 +0100 drm/i915: Do not share hwsp across contexts any more, v8. intel_timeline_get_seqno() could also be cleaned up slightly by dropping the request argument. Moving dma_fence_init back out of the ctor, should ensure we have enough of the request initialised in case of trace_dma_fence_init. Functionally this should be the same, and is effectively what we were already open coding before, except now we also assign the fence->lock and fence->ops, but since these are invariant for recycled requests(which might be externally visible), and will therefore already hold the same value, it shouldn't matter. An alternative fix, since we don't yet have a fully initialised request when in the ctor, is just setting the context/engine as NULL, but this does require adding some extra handling in get_driver_name etc. v2(Daniel): - Try to make the commit message less confusing Fixes: 855e39e65cfc ("drm/i915: Initialise basic fence before acquiring seqno") Signed-off-by: Matthew Auld Cc: Michael Mason Cc: Daniel Vetter Reviewed-by: Daniel Vetter Link: https://patchwork.freedesktop.org/patch/msgid/20210921134202.3803151-1-matthew.auld@intel.com (cherry picked from commit be988eaee1cb208c4445db46bc3ceaf75f586f0b) Signed-off-by: Jani Nikula Signed-off-by: Sasha Levin --- drivers/gpu/drm/i915/i915_request.c | 11 ++--------- 1 file changed, 2 insertions(+), 9 deletions(-) diff --git a/drivers/gpu/drm/i915/i915_request.c b/drivers/gpu/drm/i915/i915_request.c index 37aef1308573..7db972fa7024 100644 --- a/drivers/gpu/drm/i915/i915_request.c +++ b/drivers/gpu/drm/i915/i915_request.c @@ -914,8 +914,6 @@ static void __i915_request_ctor(void *arg) i915_sw_fence_init(&rq->submit, submit_notify); i915_sw_fence_init(&rq->semaphore, semaphore_notify); - dma_fence_init(&rq->fence, &i915_fence_ops, &rq->lock, 0, 0); - rq->capture_list = NULL; init_llist_head(&rq->execute_cb); @@ -978,17 +976,12 @@ __i915_request_create(struct intel_context *ce, gfp_t gfp) rq->ring = ce->ring; rq->execution_mask = ce->engine->mask; - kref_init(&rq->fence.refcount); - rq->fence.flags = 0; - rq->fence.error = 0; - INIT_LIST_HEAD(&rq->fence.cb_list); - ret = intel_timeline_get_seqno(tl, rq, &seqno); if (ret) goto err_free; - rq->fence.context = tl->fence_context; - rq->fence.seqno = seqno; + dma_fence_init(&rq->fence, &i915_fence_ops, &rq->lock, + tl->fence_context, seqno); RCU_INIT_POINTER(rq->timeline, tl); rq->hwsp_seqno = tl->hwsp_seqno; -- 2.33.0