Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp998388iob; Fri, 13 May 2022 19:06:32 -0700 (PDT) X-Google-Smtp-Source: ABdhPJw6Z8R0YgzT/ovSzD+OTGFCEQ1QYtsYqQcOlHWE2YnQBrP7uYXOPJBCCDfjstidQNBh7BfG X-Received: by 2002:a1c:c912:0:b0:37b:e074:dfcb with SMTP id f18-20020a1cc912000000b0037be074dfcbmr16820438wmb.161.1652493992416; Fri, 13 May 2022 19:06:32 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1652493992; cv=none; d=google.com; s=arc-20160816; b=hrTnlmsXB05nPZVxqjgWsNRUaDegGr29lZFBxaO/it7jFsmZ3vOXWRRfjbfa2tTKkX j/zT6YelOfDYOPEeWlxuN41DC0aJ0Ba4w/2wmLsMfM1L9pukmNl+Q5SERYRepYfCJYhT ArBhwrX+rwhSHDM/jJIe0QCJr+eoATFuPTfFGrOd2YJSbxJycUfNBwTvTff97uQA6SLJ Y55tfT4wYf3HCjapBP9RBxS/YEbEIrF/hdTo2gNCdIlEV6niAvOYTn4W+Dpr66mjW7X9 ggoBIDChF/Jpc61Q9kL/NvUvKXRVlSJbsNosiRDLlztSqVTReAoCcj/92pnZGJXtjlYI 4TCg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:mail-followup-to:message-id:subject:cc:to:from:date :dkim-signature; bh=Y5zEOR8a2tEkZy/r6HUczq8om3FfL9i9nYmMHgZoVJY=; b=Dp2F6fuE7nqYyNFzKibBmSW9QRJc31QM4lIstJsQ8rlMEymClEiIRYPpVTODMU61m6 i3sJUuFAhfPQwIPXInKbuo/89Adt1/OvZMc5H2bpZVGXtzGA0k8xrXkYkyEKdpzFwDN1 CQcztad61EgoC7W1pHFu+acakQA4YGGMYgibDm31Xs4HuhNdPze2wk4KT3b+LnQjXG2O f1IklVrrRgLO+6UGAab0zJrQui3+MfwCSmImV1WsZzB6ee8W63BuTsU3ZSWYphoActJk fphXhvs/bVQ77oqnXD+ox9QeNk99Qm5LnmFcO2O5I08yQvcF82tpoEX0RjncKFn6GMMI TIng== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ffwll.ch header.s=google header.b=HprFLMVg; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id b3-20020a056000054300b0020ad2bae479si3970803wrf.114.2022.05.13.19.06.31 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 13 May 2022 19:06:32 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@ffwll.ch header.s=google header.b=HprFLMVg; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id D50D24891EE; Fri, 13 May 2022 17:26:56 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1352871AbiELOCg (ORCPT + 99 others); Thu, 12 May 2022 10:02:36 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:46202 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1352695AbiELOCd (ORCPT ); Thu, 12 May 2022 10:02:33 -0400 Received: from mail-wm1-x336.google.com (mail-wm1-x336.google.com [IPv6:2a00:1450:4864:20::336]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A73295522D for ; Thu, 12 May 2022 07:02:31 -0700 (PDT) Received: by mail-wm1-x336.google.com with SMTP id l38-20020a05600c1d2600b00395b809dfbaso2799913wms.2 for ; Thu, 12 May 2022 07:02:31 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ffwll.ch; s=google; h=date:from:to:cc:subject:message-id:mail-followup-to:references :mime-version:content-disposition:in-reply-to; bh=Y5zEOR8a2tEkZy/r6HUczq8om3FfL9i9nYmMHgZoVJY=; b=HprFLMVg4LbqJ+pbrw6wTtnHQb3AMb89BE3vhNP+EI+RbD0KiLCACPrCAyu1q2fDWw kJHivbngQXjomyStmHAncejy/E/iy4Cr+uGLfgohOAciGYUTD5YZAKiKF/wZABnIH7bQ jr5XdpGMMTm3GBXy5SC9VULk+3EFB9sjBTuCM= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id :mail-followup-to:references:mime-version:content-disposition :in-reply-to; bh=Y5zEOR8a2tEkZy/r6HUczq8om3FfL9i9nYmMHgZoVJY=; b=yzL7XZTWYcRjA9KVIjff3n9vCZR/mfnRp2rpFbEoKPeHULsFLRPIaOKPUNMt0+Sl4/ POf0yLCGBc180Z9HsG9sRUeqObVdXFHqfQ3BCgEk/yH7Lm9AsizGl+e1572JTCWpk+Cn UK9xPYRWRnl/o7Fj1jUN0iAqPgfMuJxpCz6nvXYT3bbtUb1n1R5IjZrgUaGmKEHGWrKD Zs6FWVdtjvb8+R51ZmpmS8qCsiWiMXNLGpRwflh7p3aNC25Qix2vmQx0q6hMQjN/1rh1 3V/3rWClvqaXh5MaKpqn4eyegy9flyTuP+fvrvbpX0eCEZFColCIwT9FYnBW1oaNhvTN go+g== X-Gm-Message-State: AOAM530tO7ZHW6Nh4yybdat4Pi0hJE8yxFijT1G3OG3PuCZFNa8jgGKc EiKvzJOVIJHnBYf1QpbTN8gaow== X-Received: by 2002:a1c:2c6:0:b0:38f:f280:caa2 with SMTP id 189-20020a1c02c6000000b0038ff280caa2mr25599wmc.87.1652364149421; Thu, 12 May 2022 07:02:29 -0700 (PDT) Received: from phenom.ffwll.local ([2a02:168:57f4:0:efd0:b9e5:5ae6:c2fa]) by smtp.gmail.com with ESMTPSA id k5-20020adff285000000b0020c5253d8desm4241643wro.42.2022.05.12.07.02.28 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 12 May 2022 07:02:28 -0700 (PDT) Date: Thu, 12 May 2022 16:02:26 +0200 From: Daniel Vetter To: Tomeu Vizoso Cc: Rob Clark , Daniel Vetter , Jessica Zhang , "open list:DOCUMENTATION" , linux-arm-msm , Maarten Lankhorst , Maxime Ripard , Thomas Zimmermann , David Airlie , Jonathan Corbet , Sean Paul , Abhinav Kumar , Linux Kernel Mailing List , dri-devel , freedreno Subject: Re: [Freedreno] [RFC v2] drm/msm: Add initial ci/ subdirectory Message-ID: Mail-Followup-To: Tomeu Vizoso , Rob Clark , Jessica Zhang , "open list:DOCUMENTATION" , linux-arm-msm , Maarten Lankhorst , Maxime Ripard , Thomas Zimmermann , David Airlie , Jonathan Corbet , Sean Paul , Abhinav Kumar , Linux Kernel Mailing List , dri-devel , freedreno References: <20220510070140.45407-1-tomeu.vizoso@collabora.com> <20220510141329.54414-1-tomeu.vizoso@collabora.com> <79d79110-9fbc-0e96-d17e-68a1f8f2c224@collabora.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <79d79110-9fbc-0e96-d17e-68a1f8f2c224@collabora.com> X-Operating-System: Linux phenom 5.10.0-8-amd64 X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, May 12, 2022 at 03:28:16PM +0200, Tomeu Vizoso wrote: > On 5/11/22 7:46 PM, Rob Clark wrote: > > On Wed, May 11, 2022 at 10:12 AM Daniel Vetter wrote: > > > > > > On Tue, 10 May 2022 at 22:26, Rob Clark wrote: > > > > > > > > On Tue, May 10, 2022 at 12:39 PM Jessica Zhang > > > > wrote: > > > > > > > > > > > > > > > > > > > > On 5/10/2022 7:13 AM, Tomeu Vizoso wrote: > > > > > > And use it to store expectations about what the drm/msm driver is > > > > > > supposed to pass in the IGT test suite. > > > > > > > > > > > > Also include a configuration file that points to the out-of-tree CI > > > > > > scripts. > > > > > > > > > > > > By storing the test expectations along the code we can make sure both > > > > > > stay in sync with each other, and so we can know when a code change > > > > > > breaks those expectations. > > > > > > > > > > > > This will allow all contributors to drm/msm to reuse the infrastructure > > > > > > already in gitlab.freedesktop.org to test the driver on several > > > > > > generations of the hardware. > > > > > > > > > > > > v2: > > > > > > - Fix names of result expectation files to match SoC > > > > > > - Don't execute tests that are going to skip on all boards > > > > > > > > > > > > Signed-off-by: Tomeu Vizoso > > > > > > --- > > > > > > Documentation/gpu/msm_automated_testing.rst | 70 +++++++++ > > > > > > drivers/gpu/drm/msm/ci/gitlab-ci.yml | 11 ++ > > > > > > drivers/gpu/drm/msm/ci/msm.testlist | 148 ++++++++++++++++++ > > > > > > .../gpu/drm/msm/ci/msm_apq8016_results.txt | 140 +++++++++++++++++ > > > > > > .../gpu/drm/msm/ci/msm_apq8096_results.txt | 140 +++++++++++++++++ > > > > > > drivers/gpu/drm/msm/ci/msm_sc7180_results.txt | 141 +++++++++++++++++ > > > > > > drivers/gpu/drm/msm/ci/msm_sdm845_results.txt | 141 +++++++++++++++++ > > > > > > 7 files changed, 791 insertions(+) > > > > > > create mode 100644 Documentation/gpu/msm_automated_testing.rst > > > > > > create mode 100644 drivers/gpu/drm/msm/ci/gitlab-ci.yml > > > > > > create mode 100644 drivers/gpu/drm/msm/ci/msm.testlist > > > > > > create mode 100644 drivers/gpu/drm/msm/ci/msm_apq8016_results.txt > > > > > > create mode 100644 drivers/gpu/drm/msm/ci/msm_apq8096_results.txt > > > > > > create mode 100644 drivers/gpu/drm/msm/ci/msm_sc7180_results.txt > > > > > > create mode 100644 drivers/gpu/drm/msm/ci/msm_sdm845_results.txt > > > > > > > > > > [snip] > > > > > > > > diff --git a/drivers/gpu/drm/msm/ci/msm_sc7180_results.txt b/drivers/gpu/drm/msm/ci/msm_sc7180_results.txt > > > > > > new file mode 100644 > > > > > > index 000000000000..01f7b4b399b5 > > > > > > --- /dev/null > > > > > > +++ b/drivers/gpu/drm/msm/ci/msm_sc7180_results.txt > > > > > > @@ -0,0 +1,141 @@ > > > > > > +igt@core_auth@getclient-simple,dmesg-warn > > > > > > +igt@core_auth@getclient-master-drop,pass > > > > > > +igt@core_auth@basic-auth,pass > > > > > > +igt@core_auth@many-magics,pass > > > > > > +igt@core_getclient,pass > > > > > > +igt@core_getstats,pass > > > > > > +igt@core_getversion,pass > > > > > > +igt@core_setmaster_vs_auth,pass > > > > > > +igt@drm_read@invalid-buffer,pass > > > > > > +igt@drm_read@fault-buffer,pass > > > > > > +igt@drm_read@empty-block,pass > > > > > > +igt@drm_read@empty-nonblock,pass > > > > > > +igt@drm_read@short-buffer-block,pass > > > > > > +igt@drm_read@short-buffer-nonblock,pass > > > > > > +igt@drm_read@short-buffer-wakeup,pass > > > > > > +igt@kms_addfb_basic@unused-handle,pass > > > > > > +igt@kms_addfb_basic@unused-pitches,pass > > > > > > +igt@kms_addfb_basic@unused-offsets,pass > > > > > > +igt@kms_addfb_basic@unused-modifier,pass > > > > > > +igt@kms_addfb_basic@legacy-format,dmesg-warn > > > > > > +igt@kms_addfb_basic@no-handle,pass > > > > > > +igt@kms_addfb_basic@basic,pass > > > > > > +igt@kms_addfb_basic@bad-pitch-0,pass > > > > > > +igt@kms_addfb_basic@bad-pitch-32,pass > > > > > > +igt@kms_addfb_basic@bad-pitch-63,pass > > > > > > +igt@kms_addfb_basic@bad-pitch-128,pass > > > > > > +igt@kms_addfb_basic@bad-pitch-256,pass > > > > > > +igt@kms_addfb_basic@bad-pitch-1024,pass > > > > > > +igt@kms_addfb_basic@bad-pitch-999,pass > > > > > > +igt@kms_addfb_basic@bad-pitch-65536,pass > > > > > > +igt@kms_addfb_basic@size-max,pass > > > > > > +igt@kms_addfb_basic@too-wide,pass > > > > > > +igt@kms_addfb_basic@too-high,dmesg-warn > > > > > > > > > > For test results on Trogdor, is is possible to have them be > > > > > success/fail/skip only? > > > > > > > > > > Results such as dmesg-warn/dmesg-fail are igt_runner specific and > > > > > because there isn't support for igt_runner on ChromeOS, they will be > > > > > difficult to replicate and debug. > > > > > > > > Actually, I wonder if it would be better to just treat > > > > dmesg-warn/dmesg-fail as pass/fail? I'd noticed some flakes on > > > > rockchip which looked just like unrelated dmesg msg which just > > > > happened to show up while the test was running. > > > > > > This is kinda the reason behind standardizing on drm dmesg logging, so > > > that we have some chances at filtering stuff out. Not sure that's a > > > good idea, since when your entire box splats and lockdep is dead, then > > > continuing to run drm tests is still fairly pointless. > > > > I'm not sure if we are using it yet for drm-ci, but for mesa-ci we > > monitor dmesg (over serial port, from the controller) for splats, so > > we already have the tech for restarting or aborting the CI run. We > > don't need igt-runner to tell us. > > Yep, these scripts are currently being used as-is from Mesa, so we got that > functionality for free. > > > > I think this is another reason why trying at least to standardize this > > > stuff over drivers would be pretty good idea. > > > > > > > Additionally, some of the tests, like msm_recovery, are *expected* to > > > > generate some dmesg spam since they are intentionally triggering GPU > > > > hangs to test the recovery mechanism. > > > > > > Uh I don't like that. It just allows userspace to spam dmesg, which > > > doesn't seem like a great idea. That's at least why i915 dumps these > > > at a lower level, and in the past had a special "I'm going to whack > > > the gpu real hard expect hangs" knob in debugfs. > > > > > > Having tests which intentionally spam dmesg above info level isn't > > > really good since then you need endless amounts of test-specific > > > encoding of what is considered a success and what not. Like when a > > > backmerge breaks a testcases which is already at dmesg-fail, is that > > > bad or not? Probably bad, but was the situation before that really > > > good or already kinda on fire? > > > > I guess I could add some debugfs knobs to squelch the dmesg msgs on > > gpu hangs. In the normal case, I'd prefer that gpu hangs are not > > silent.. since that is something we get in feedback reports if a user > > (or dogfooder) reports a bug. > > > > The rockchip case I mentioned was some unrelated dmesg about > > linktraining failing.. presumably because there was no display > > attached? IDK, I didn't look too closely. But my point is we could > > be getting unrelated and asynchronous dmesg spam, even from other > > kernel subsystems. Letting that be part of the test results just > > sounds like asking for flakes. > > I think some drivers are currently a bit too buggy to behave reliably under > CI unless one reduces coverage (rockchip on rk3399, for example). And some > other drivers (in other subsystems as well) could do with a review of what > they print to the console. I guess these are things we could and probably > should fix? I think the best way for these is to still run the tests, but supress the results. At least if the flakiness is only in whether the test passes, and not in the test run outright killing the kernel/machine somehow. But yeah sometimes just having an explicit run list is the way to go, and we have plenty of that in intel ci. But for i915 it's more a testament to how terrible the i915-gem tests in particular have become, largely also because people simply kept adding crap and never removed much at all (and sometimes even removed important tests, oh well). -Daniel -- Daniel Vetter Software Engineer, Intel Corporation http://blog.ffwll.ch