Received: by 2002:a05:7412:31a9:b0:e2:908c:2ebd with SMTP id et41csp2723703rdb; Tue, 12 Sep 2023 10:05:20 -0700 (PDT) X-Google-Smtp-Source: AGHT+IFzhEX8+65m0ZudliD2kcE7ljtIogiUo644+uQvnC9y1ZQhx3+8h5YC4pjjXQZcPc+16ByA X-Received: by 2002:a17:902:ea84:b0:1c1:e52e:49e3 with SMTP id x4-20020a170902ea8400b001c1e52e49e3mr390792plb.36.1694538319720; Tue, 12 Sep 2023 10:05:19 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1694538319; cv=none; d=google.com; s=arc-20160816; b=oaOOz6LVYme34QScNsvuvxfXQSX6dUBheZR6TWtmXaFkMeAWv10rOLkWkIBBCWYD0j kaF1tDzASXFgQvRFrH8GA0wPXvrbsVOYLR4vBWjPrApvw5XFs7WUw1zMgar/MT5Haiqn oX8MeI7J616ibFzaaYXeAY0cCS6yxLpF92o6UuEjJcbuWLltOFEraDewCo4htJmvtFOM WvAMf8MRk+/mLdNRN7dZtGKysL9etkKAsqvDXf3KDHkUE/lHl0slPUYZVXkanNp9pnNw 0ziwBdgPNhJGvMDh0nsUpuQAhWE5sY7+6pXZus5cyhOLsW0cTUnGmcMDuS6WrBs3TZ35 +lQA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=74/lXDjAicbAQVpv6HCI5jeqajIVNGavg/gPM8Y5/Jo=; fh=8L6yf48fC6wKC7G+Mo1FP1v9re1i6HyYfoVeDgoXVXI=; b=f3z3X1Jj3yEqyE6kq/Q4Gg3M80gUnCIPQqixZgJo8lYEFWUZzCGr3RgFPsy2halpo9 t4fqYqK+t+roqeRCKRW5uNnMnVAa9+xvKcSDg3EyhfqAxtmjsr6XOnvk9WscBfc4fTW4 nimA6xEuST8HRW0eflNRl9idgaC8+sCwQfSz7j2Xj4Bh7vWk+DFHUr+xwpExhZJrJuQ8 WhRhrGdvMX21vcLMLooGGLxpxwyF3zFRC95/k/0nYKyQJrC0L62P9qpZGRfcRMXwyIjr YQy9T8R5IhHgq4SYj7yDO+2uacG2aAPI1bicF7U06jB2Xd//x9f6K8to0JxadP1DTWHi uo2Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@fooishbar-org.20230601.gappssmtp.com header.s=20230601 header.b=hRnJK+wF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:5 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from groat.vger.email (groat.vger.email. [2620:137:e000::3:5]) by mx.google.com with ESMTPS id km14-20020a17090327ce00b001bf1973eafcsi8095358plb.571.2023.09.12.10.05.14 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 12 Sep 2023 10:05:19 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:5 as permitted sender) client-ip=2620:137:e000::3:5; Authentication-Results: mx.google.com; dkim=pass header.i=@fooishbar-org.20230601.gappssmtp.com header.s=20230601 header.b=hRnJK+wF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:5 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by groat.vger.email (Postfix) with ESMTP id 88C4C812A60D; Tue, 12 Sep 2023 06:17:14 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.8 at groat.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235320AbjILNRK (ORCPT + 99 others); Tue, 12 Sep 2023 09:17:10 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:33422 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233517AbjILNRH (ORCPT ); Tue, 12 Sep 2023 09:17:07 -0400 Received: from mail-pj1-x1030.google.com (mail-pj1-x1030.google.com [IPv6:2607:f8b0:4864:20::1030]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 515D510CA for ; Tue, 12 Sep 2023 06:17:03 -0700 (PDT) Received: by mail-pj1-x1030.google.com with SMTP id 98e67ed59e1d1-273ca7ab3f5so3840371a91.2 for ; Tue, 12 Sep 2023 06:17:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fooishbar-org.20230601.gappssmtp.com; s=20230601; t=1694524622; x=1695129422; darn=vger.kernel.org; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=74/lXDjAicbAQVpv6HCI5jeqajIVNGavg/gPM8Y5/Jo=; b=hRnJK+wFBqJIcfswqBqLspQkjytkccrhGA+4sABGreFmE+9URx+icD78lIE1g/8TsW 09efceDe0UcHxCSV96PRm+19shG08rXNwgoaPsj0GeOnJWrokumx8Fl7hATbgRvvNRlq UIQNC4iaLpuulfDvyf9BRM/DvTsr9ANOJxGvNmp0J6jur9CkJXPCrReIgZUmnJl12vPM GH6GdftVagDt3GhRObsgNxqLkA/VYTPix/lfsj5bttOg8b9Bv2WH/kNruFpQEP6rWzAm Ts3hidBk0J8QXKLE3fQWwzQeT63AdmsT+FQMmd/m/v4ZB/GT3j1teR1PspcdVeSXuf4j IMzQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1694524622; x=1695129422; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=74/lXDjAicbAQVpv6HCI5jeqajIVNGavg/gPM8Y5/Jo=; b=fjoXxKG/bwUgKA9anVctm2zEd1w5oILWkvu+jN+A82LPLxyahF1TxtVzvzxcDtXkEe l4QwvJFIuxj/oLTBikXm0H/84qempdcoZp7nPEHDOKpkd7nzW0yPzKRuqjeV/sS8sEDn /29lhgroXtbSV43boMzFmc5ph+pcgP6yMzV4oCIBMYx8AW4AD2ObrHLlo+rxulbJM2cb smXJqeaa2pdo2YMBK2+rLdN3NJWqtm6yeJYW8q/ZE8LDd6AoQAAqJIa/XCbl4Iu8Md9R wv4jcbdfgwqnHGibXHPsZ+qjIj2ZKiYjaLdLWm0beAURopxkMeFjpH/jGzPl5i7K3beY AmjA== X-Gm-Message-State: AOJu0Ywx2IDZMp8RJMzOrItz+4yAaHvkqrbhMiPT/dTh+Ce4jssfCxdI RTYlk2odrA+AnFCRH+DP9PnA+OeJitAF4r9o+ySQuA== X-Received: by 2002:a17:90a:aa85:b0:268:1e95:4e25 with SMTP id l5-20020a17090aaa8500b002681e954e25mr10043598pjq.17.1694524622460; Tue, 12 Sep 2023 06:17:02 -0700 (PDT) MIME-Version: 1.0 References: <4rpsqk4tgrdcxtxtfoum6o4oyglwkirmkh3jj4y5tays2ivb5p@uwqdf3snshkv> <25df6189-7b0a-b13d-e93d-c2a388fd45e3@collabora.com> <5fdf9d29-3f8d-0ee0-027f-57ff3a5cecb8@collabora.com> <9a2b1ad8-4359-4f12-b4f9-c1de477bc440@collabora.com> <5ejq3hjpoy3gxft2jbmoa5m656usetuxcs7g3ezyyiitj67rav@r5jhdz27foat> <550454b8-2e2c-c947-92c5-37f0367661c2@mailbox.org> In-Reply-To: From: Daniel Stone Date: Tue, 12 Sep 2023 14:16:41 +0100 Message-ID: Subject: Re: [PATCH v11] drm: Add initial ci/ subdirectory To: Maxime Ripard Cc: =?UTF-8?Q?Michel_D=C3=A4nzer?= , emma@anholt.net, linux-doc@vger.kernel.org, vignesh.raman@collabora.com, dri-devel@lists.freedesktop.org, alyssa@rosenzweig.io, jbrunet@baylibre.com, robdclark@google.com, corbet@lwn.net, khilman@baylibre.com, sergi.blanch.torne@collabora.com, david.heidelberg@collabora.com, linux-rockchip@lists.infradead.org, Daniel Stone , martin.blumenstingl@googlemail.com, robclark@freedesktop.org, Helen Koike , anholt@google.com, linux-mediatek@lists.infradead.org, matthias.bgg@gmail.com, linux-amlogic@lists.infradead.org, gustavo.padovan@collabora.com, linux-arm-kernel@lists.infradead.org, angelogioacchino.delregno@collabora.com, neil.armstrong@linaro.org, guilherme.gallo@collabora.com, linux-kernel@vger.kernel.org, tzimmermann@suse.de Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (groat.vger.email [0.0.0.0]); Tue, 12 Sep 2023 06:17:14 -0700 (PDT) X-Spam-Status: No, score=-0.8 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE, SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on groat.vger.email Hi Maxime, Hopefully less mangled formatting this time: turns out Thunderbird + plain text is utterly unreadable, so that's one less MUA that is actually usable to send email to kernel lists without getting shouted at. On Mon, 11 Sept 2023 at 15:46, Maxime Ripard wrote: > On Mon, Sep 11, 2023 at 03:30:55PM +0200, Michel D=C3=A4nzer wrote: > > > There's in 6.6-rc1 around 240 reported flaky tests. None of them have > > > any context. That new series hads a few dozens too, without any conte= xt > > > either. And there's no mention about that being a plan, or a patch > > > adding a new policy for all tests going forward. > > > > That does sound bad, would need to be raised in review. > > > > > Any concern I raised were met with a giant "it worked on Mesa" handwa= ve > > > > Lessons learned from years of experience with big real-world CI > > systems like this are hardly "handwaving". > > Your (and others) experience certainly isn't. It is valuable, welcome, > and very much appreciated. > > However, my questions and concerns being ignored time and time again > about things like what is the process is going to be like, what is going > to be tested, who is going to be maintaining that test list, how that > interacts with stable, how we can possibly audit the flaky tests list, > etc. have felt like they were being handwaived away. Sorry it ended up coming across like that. It wasn't the intent. > I'm not saying that because I disagree, I still do on some, but that's > fine to some extent. However, most of these issues are not so much an > infrastructure issue, but a community issue. And I don't even expect a > perfect solution right now, unlike what you seem to think. But I do > expect some kind of plan instead of just ignoring that problem. > > Like, I had to ask the MT8173 question 3 times in order to get an > answer, and I'm still not sure what is going to be done to address that > particular issue. > > So, I'm sorry, but I certainly feel like it here. I don't quite see the same picture from your side though. For example, my reading of what you've said is that flaky tests are utterly unacceptable, as are partial runs, and we shouldn't pretend otherwise. With your concrete example (which is really helpful, so thanks), what happens to the MT8173 hdmi-inject test? Do we skip all MT8173 testing until it's perfect, or does MT8173 testing always fail because that test does? Both have their downsides. Not doing any testing has the obvious downside, and means that the driver can get worse until it gets perfect. Always marking the test as failed means that the test results are useless: if failure is expected, then red is good. I mean, say you're contributing a patch to fix some documentation or add a helper to common code which only v3d uses. The test results come back, and your branch is failing tests on MT8173, specifically the hdmi-inject@4k test. What then? Either as a senior contributor you 'know' that's the case, or as a casual contributor you get told 'oh yeah, don't worry about the test results, they always fail'. Both lead to the same outcome, which is that no-one pays any attention to the results, and they get worse. What we do agree on is that yes, those tests should absolutely be fixed, and not just swept under the rug. Part of this is having maintainers actually meaningfully own their test results. For example, I'm looking at the expectation lists for the Intel gen in my laptop, and I'm seeing a lot of breakage in blending tests, as well as dual-display fails which include the resolution of my external display. I'd expect the Intel driver maintainers to look at them, get them fixed, and gradually prune those xfails/flakes down towards zero. If the maintainers don't own it though, then it's not going to get fixed. And we are exactly where we are today: broken plane blending and 1440p on KBL, broken EDID injection on MT8173, and broken atomic commits on stoney. Without stronger action from the maintainers (e.g. throwing i915 out of the tree until it has 100% pass 100% of the time), adding testing isn't making the situation better or worse in and of itself. What it _is_ doing though, is giving really clear documentation of the status of each driver, and backing that up by verifying it. Only maintainers can actually fix the drivers (or the tests tbf). But doing the testing does let us be really clear to everyone what the actual state is, and that way people can make informed decisions too. And the only way we're going to drive the test rate down is by the subsystem maintainers enforcing it. Does that make sense on where I'm (and I think a lot of others are) coming = from? To answer the other question about 'where are the logs?': some of them have the failure data in them, others don't. They all should going forward at least though. Cheers, Daniel