Received: by 2002:ac0:e34a:0:0:0:0:0 with SMTP id g10csp464672imn; Thu, 28 Jul 2022 06:15:50 -0700 (PDT) X-Google-Smtp-Source: AGRyM1tNn82xiuDHrEbFYcOy5gwq4+E+1jPD92B5+U44tk5anr2Sx4z77PW5CPo+uMvlFmucR2WM X-Received: by 2002:a17:90b:278c:b0:1f2:ec37:52ee with SMTP id pw12-20020a17090b278c00b001f2ec3752eemr10329551pjb.236.1659014150466; Thu, 28 Jul 2022 06:15:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1659014150; cv=none; d=google.com; s=arc-20160816; b=YVR7dLkKZK3yamj9XRZ4nXxVgYM7onR1E4HCRbv9DYs7nS9xp3MiJ1UxOgHCk/nZnl rvZRAdvLLdm25sRhP1FTHHeFuZPoMXS52B4AFlr+aRaw1SHkVcaopkSKL7OTn4tOJUz9 q8L5pdNaUTwVkPnAYQJV+DZn98kQWXAk2CWf//Lx19lYEgqqcVx3G+BV4paprh3ZgqW6 Oi38LeDX0wjYBJlQ7PYmBRxVNyZt+v0qIX0mxCL/HbG9NTtrSftbRRDeqi05kPLHFIyC PgTcigPuAG+0sekjW8RZ7+1KHSV3bIQxsE/pdoa9CeAZvlIoFIIzSG05l1AsItRcQy/I bgKg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=C3PDlhexKOdKBeMLwfC5xa/m4pUoO7tj5VOKBTrWNtM=; b=V8wBtwsAQoKNL203sa/4fGRu52iQS0YQ3WVu8mhWcxS5FhNxYlnxZw5Kh6Cm7WDtSH nDZwC9wZgQkdOQRlfemkoOUW8CWrR/hH2JV6U0PqpEv+Y8d8LqLBJCw3WYMAy4oWntxT s1n18QhcIXQF9sZihiAfMnD62oVCxSCRjvt6R4Rjy2QySRVsO+DvaNO0/vMu6WvStfcK G6GRf8jMs82yGlwMUANM4xklGe/CQhppCAFssXOSkjVYJANKqbYDmkbY6GAj1JHx9y3O n7WI2MkESBOu34wXdEIF0EgH9b9xR+iHVoKVPpHB22yAKVxpfA0ZsfUAEqmWJoWBbbJh qoMw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=f5WITDqV; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id l9-20020a056a00140900b005255d5ad6e7si680290pfu.305.2022.07.28.06.15.32; Thu, 28 Jul 2022 06:15:50 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=f5WITDqV; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237765AbiG1MIU (ORCPT + 99 others); Thu, 28 Jul 2022 08:08:20 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44812 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236734AbiG1MIT (ORCPT ); Thu, 28 Jul 2022 08:08:19 -0400 Received: from mga11.intel.com (mga11.intel.com [192.55.52.93]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C8531691CB; Thu, 28 Jul 2022 05:08:17 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1659010097; x=1690546097; h=date:from:to:cc:subject:message-id:references: mime-version:in-reply-to; bh=3jiLtV59Oj63tPBciYjw2M2P/yqHMni/unx6284Merw=; b=f5WITDqV0AphoVnFxLieRBtpb+lg1rPr+nyG+YfygASzSCdOMWlpGqte aFj2TLwJTR/DWdrgu8gVX6NjJlcrlpNgvZ6BvOJDleU243Nb16d5q32Gf ggbhsIBE3GrcxCyJmP+WJwu9DTEJV2NCAnzJKMLtttaHRZJyQQRVyf4RN ORQpVtJDgISVEN+8eAck6itlxEn/Eu6Jk5q/G1jiNFeaig0abeBKTStX/ HwhR7ODZckgm8fNGEr/847YGFtc1ByeKxWATv0VcoxYaiixKZc9mVkhUj ReHj5+iz9Nl2QW6OehjZ/bUKwtXU7WSNgLsO2ZvvgLA2eyKlSuyA7aEt9 g==; X-IronPort-AV: E=McAfee;i="6400,9594,10421"; a="286045354" X-IronPort-AV: E=Sophos;i="5.93,198,1654585200"; d="scan'208";a="286045354" Received: from fmsmga008.fm.intel.com ([10.253.24.58]) by fmsmga102.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 28 Jul 2022 05:08:17 -0700 X-IronPort-AV: E=Sophos;i="5.93,198,1654585200"; d="scan'208";a="659698817" Received: from dgonsal1-mobl1.ger.corp.intel.com (HELO intel.com) ([10.252.46.97]) by fmsmga008-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 28 Jul 2022 05:08:14 -0700 Date: Thu, 28 Jul 2022 14:08:11 +0200 From: Andi Shyti To: Mauro Carvalho Chehab Cc: David Airlie , intel-gfx@lists.freedesktop.org, linux-kernel@vger.kernel.org, dri-devel@lists.freedesktop.org, Sumit Semwal , linaro-mm-sig@lists.linaro.org, Christian =?iso-8859-15?Q?K=F6nig?= , linux-media@vger.kernel.org, Tvrtko Ursulin Subject: Re: [Intel-gfx] [PATCH v3 0/6] drm/i915: reduce TLB performance regressions Message-ID: References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-7.5 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_EF,RCVD_IN_DNSWL_HI,SPF_HELO_NONE, SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Mauro, Pushed in drm-intel-gt-next. Thanks, Andi On Wed, Jul 27, 2022 at 02:29:50PM +0200, Mauro Carvalho Chehab wrote: > Doing TLB invalidation cause performance regressions, like: > [424.370996] i915 0000:00:02.0: [drm] *ERROR* rcs0 TLB invalidation did not complete in 4ms! > > As reported at: > https://gitlab.freedesktop.org/drm/intel/-/issues/6424 > > as this is an expensive operation. So, reduce the need of it by: > - checking if the engine is awake; > - checking if the engine is not wedged; > - batching operations. > > Additionally, add a workaround for a known hardware issue on some GPUs. > > In order to double-check that this series won't be introducing any regressions, > I used this new IGT test: > > https://patchwork.freedesktop.org/patch/495684/?series=106757&rev=1 > > Checking the results for 3 different patchsets, on Broadwell: > > 1) On the top of drm-tip (2022y-07m-14d-08h-35m-36) - e. g. with TLB > invalidation and serialization patches: > > $ sudo build/tests/gem_exec_tlb|grep Subtest > Subtest close-clear: SUCCESS (10.490s) > Subtest madv-clear: SUCCESS (10.484s) > Subtest u-unmap-clear: SUCCESS (10.527s) > Subtest u-shrink-clear: SUCCESS (10.506s) > Subtest close-dumb: SUCCESS (10.165s) > Subtest madv-dumb: SUCCESS (10.177s) > Subtest u-unmap-dumb: SUCCESS (10.172s) > Subtest u-shrink-dumb: SUCCESS (10.172s) > > 2) With the new version of the batch TLB invalidation patches from this series: > > $ sudo build/tests/gem_exec_tlb|grep Subtest > Subtest close-clear: SUCCESS (10.483s) > Subtest madv-clear: SUCCESS (10.495s) > Subtest u-unmap-clear: SUCCESS (10.545s) > Subtest u-shrink-clear: SUCCESS (10.508s) > Subtest close-dumb: SUCCESS (10.172s) > Subtest madv-dumb: SUCCESS (10.169s) > Subtest u-unmap-dumb: SUCCESS (10.174s) > Subtest u-shrink-dumb: SUCCESS (10.176s) > > 3) Changing the TLB invalidation routine to do nothing[1]: > > $ sudo ~/freedesktop-igt/build/tests/gem_exec_tlb|grep Subtest > (gem_exec_tlb:1958) CRITICAL: Test assertion failure function check_bo, file ../tests/i915/gem_exec_tlb.c:384: > (gem_exec_tlb:1958) CRITICAL: Failed assertion: !sq > (gem_exec_tlb:1958) CRITICAL: Found deadbeef in a new (clear) buffer after 3 tries! > (gem_exec_tlb:1956) CRITICAL: Test assertion failure function check_bo, file ../tests/i915/gem_exec_tlb.c:384: > (gem_exec_tlb:1956) CRITICAL: Failed assertion: !sq > (gem_exec_tlb:1956) CRITICAL: Found deadbeef in a new (clear) buffer after 89 tries! > (gem_exec_tlb:1957) CRITICAL: Test assertion failure function check_bo, file ../tests/i915/gem_exec_tlb.c:384: > (gem_exec_tlb:1957) CRITICAL: Failed assertion: !sq > (gem_exec_tlb:1957) CRITICAL: Found deadbeef in a new (clear) buffer after 256 tries! > (gem_exec_tlb:1960) CRITICAL: Test assertion failure function check_bo, file ../tests/i915/gem_exec_tlb.c:384: > (gem_exec_tlb:1960) CRITICAL: Failed assertion: !sq > (gem_exec_tlb:1960) CRITICAL: Found deadbeef in a new (clear) buffer after 845 tries! > (gem_exec_tlb:1961) CRITICAL: Test assertion failure function check_bo, file ../tests/i915/gem_exec_tlb.c:384: > (gem_exec_tlb:1961) CRITICAL: Failed assertion: !sq > (gem_exec_tlb:1961) CRITICAL: Found deadbeef in a new (clear) buffer after 1138 tries! > (gem_exec_tlb:1954) CRITICAL: Test assertion failure function check_bo, file ../tests/i915/gem_exec_tlb.c:384: > (gem_exec_tlb:1954) CRITICAL: Failed assertion: !sq > (gem_exec_tlb:1954) CRITICAL: Found deadbeef in a new (clear) buffer after 1359 tries! > (gem_exec_tlb:1955) CRITICAL: Test assertion failure function check_bo, file ../tests/i915/gem_exec_tlb.c:384: > (gem_exec_tlb:1955) CRITICAL: Failed assertion: !sq > (gem_exec_tlb:1955) CRITICAL: Found deadbeef in a new (clear) buffer after 1794 tries! > (gem_exec_tlb:1959) CRITICAL: Test assertion failure function check_bo, file ../tests/i915/gem_exec_tlb.c:384: > (gem_exec_tlb:1959) CRITICAL: Failed assertion: !sq > (gem_exec_tlb:1959) CRITICAL: Found deadbeef in a new (clear) buffer after 2139 tries! > Dynamic subtest smem0 failed. > **** DEBUG **** > (gem_exec_tlb:1944) DEBUG: 2M hole:200000 contains poison:6b6b6b6b > (gem_exec_tlb:1944) DEBUG: Running writer for 200000 at 300000 on bcs0 > (gem_exec_tlb:1944) DEBUG: Closing hole:200000 on rcs0, sample:deadbeef > (gem_exec_tlb:1944) DEBUG: Rechecking hole:200000, sample:6b6b6b6b > **** END **** > Subtest close-clear: FAIL (10.434s) > Subtest madv-clear: SUCCESS (10.479s) > Subtest u-unmap-clear: SUCCESS (10.512s) > > In summary, the test does properly detect fail when TLB cache invalidation doesn't happen, > as shown at result (3). It also shows that both current drm-tip and drm-tip with this series > applied don't have TLB invalidation cache issues. > > [1] I applied this patch on the top of drm-tip: > > diff --git a/drivers/gpu/drm/i915/gt/intel_gt.c b/drivers/gpu/drm/i915/gt/intel_gt.c > index 68c2b0d8f187..0aefcd7be5e9 100644 > --- a/drivers/gpu/drm/i915/gt/intel_gt.c > +++ b/drivers/gpu/drm/i915/gt/intel_gt.c > @@ -930,0 +931,3 @@ void intel_gt_invalidate_tlbs(struct intel_gt *gt) > + // HACK: don't do TLB invalidations!!! > + return; > + > > Regards, > Mauro > > Chris Wilson (4): > drm/i915/gt: Ignore TLB invalidations on idle engines > drm/i915/gt: Invalidate TLB of the OA unit at TLB invalidations > drm/i915/gt: Skip TLB invalidations once wedged > drm/i915/gt: Batch TLB invalidations > > Mauro Carvalho Chehab (2): > drm/i915/gt: document with_intel_gt_pm_if_awake() > drm/i915/gt: describe the new tlb parameter at i915_vma_resource > > .../gpu/drm/i915/gem/i915_gem_object_types.h | 3 +- > drivers/gpu/drm/i915/gem/i915_gem_pages.c | 25 +++--- > drivers/gpu/drm/i915/gt/intel_gt.c | 77 +++++++++++++++---- > drivers/gpu/drm/i915/gt/intel_gt.h | 12 ++- > drivers/gpu/drm/i915/gt/intel_gt_pm.h | 11 +++ > drivers/gpu/drm/i915/gt/intel_gt_types.h | 18 ++++- > drivers/gpu/drm/i915/gt/intel_ppgtt.c | 8 +- > drivers/gpu/drm/i915/i915_vma.c | 33 ++++++-- > drivers/gpu/drm/i915/i915_vma.h | 1 + > drivers/gpu/drm/i915/i915_vma_resource.c | 9 ++- > drivers/gpu/drm/i915/i915_vma_resource.h | 6 +- > 11 files changed, 163 insertions(+), 40 deletions(-) > > -- > 2.36.1 >