Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp8047714rwd; Tue, 20 Jun 2023 09:26:17 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ5Yeh+AZfk+2NetQXtKiVjdo2KDFjkVYmqzOxJm78fkAWvUZv8mgZKMFWK0hM4KPpwcWaGO X-Received: by 2002:a05:6a00:10c2:b0:668:6ce8:48b6 with SMTP id d2-20020a056a0010c200b006686ce848b6mr10172190pfu.31.1687278377294; Tue, 20 Jun 2023 09:26:17 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1687278377; cv=none; d=google.com; s=arc-20160816; b=w8TQOJG9PK0SvzX61UONmJGNnFrZnLBKyjt8X9WNraacr25mq9Bs3F7cQfNzCePHkZ ZV4Yh8g4c1Pwy2Sr4xN6K4pscPnzIEyMGFZEcwdDAaF7Smffs33R3hJXLSOYjRZ+CGF3 AglF8Qa7DuAkRw2j0kuCD1TNU+JLUoANvbdGmD0263aXeTJ13DgfJ0mIR7w6FS3uzRFL mQ13s8l0MRP6MM+mSaHQGFS9UklJmmBLZ5lnz1cZGh6ULh1YCrJADNICBpSrJVW6X1at PvlQpiR1CToaaxY1nzhLlp9PIUfliXog5Gmiwp4v+9ZHQ8OY/K+jjL9xqVxAS9zu0G/A uZZQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=X9rGoGWVOauSZgYxBllgmItkvr+L3osLYEQmmi6Q+FE=; b=QbI1sDi2/xE61bvURkzRRY3odhSl9ocoI/rm4LsDUkcjM6QWCyZeSTtmrcqFIrqIyn lQaQ44ecrxWACXd/WOk2VLLMBfQ0LvFr+f4vc15y53+Z//z7lJrOsIe4fjBA+pRLcUsi C3cqCCzQ6j9IYRFRLWSvgaRZBNUDGJ5f5eD8OSQ3lxPuuVreSvoBLe8ncaoQ/KGi5MCQ /3dgTcbmQKcnp7t+J+FLOARpVQk9tVPT/R1EiHC6ZGQlLtp0NsrrbKvsAGkeoBEIq+0G NMeyyuVQqPWaGqH6W7VdRphBnj3EutLMmVbQWua+nHNPxVnugP0B4lxTRmqIobJytH6v nuMQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=kkZSjN+R; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id o24-20020aa79798000000b0066883879b57si2031208pfp.51.2023.06.20.09.26.02; Tue, 20 Jun 2023 09:26:17 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=kkZSjN+R; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232868AbjFTPzw (ORCPT + 99 others); Tue, 20 Jun 2023 11:55:52 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41806 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229954AbjFTPzv (ORCPT ); Tue, 20 Jun 2023 11:55:51 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 447F0E6E; Tue, 20 Jun 2023 08:55:50 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id C2226612FF; Tue, 20 Jun 2023 15:55:49 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 2AFB9C433C8; Tue, 20 Jun 2023 15:55:48 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1687276549; bh=ZKPHxEnSu+t4Ow41QVyBNgZ7aWHN6jZsT83hXkppoig=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=kkZSjN+RB5K4RrVgDbyYdR0uGWYdE+0CEqmZd3AFdHO0rOZyJYWI1w5rKZOqsMS+r jPfpBGLLGg6plRC5o8b0Y7C48hXJkXtXyHyggalNHYeIg9D/EHEDcovKPliZ3Y1L2W ozq9z7ert0fEM6zU4jhrBxC/ctp9GKrMMeLbebWTQZ6MmOt0spZm6bi1XWDhRMDODj vZi613LD4+xdZNlIowKvPWteH6STHkGnCylKpMPSreNNeOfqQkz3CW9eVVcVkwwfu9 iPiyxT0mGWaPf5cHmcYZJJCRVv6FXasjpLhrBldgpEktQOHB7vDgZLCCNO5d2jsf0e BOBFZwzgh5DcA== Date: Tue, 20 Jun 2023 08:59:02 -0700 From: Bjorn Andersson To: Doug Anderson Cc: Krzysztof Kozlowski , Linux regressions mailing list , Amit Pundir , Mark Brown , Andy Gross , Rob Herring , Konrad Dybcio , Krzysztof Kozlowski , Caleb Connolly , Conor Dooley , linux-arm-msm , dt , lkml Subject: Re: [PATCH] arm64: dts: qcom: sdm845-db845c: Move LVS regulator nodes up Message-ID: <20230620155902.clspxncyvpodixft@ripper> References: <20230602161246.1855448-1-amit.pundir@linaro.org> <358c69ad-fa8a-7386-fe75-92369883ee48@leemhuis.info> <0f6c9dcb-b7f6-fff9-6bed-f4585ea8e487@linaro.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: X-Spam-Status: No, score=-4.4 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jun 14, 2023 at 12:44:15PM -0700, Doug Anderson wrote: > Hi, > > On Wed, Jun 14, 2023 at 11:47 AM Krzysztof Kozlowski > wrote: > > > > On 14/06/2023 20:18, Linux regression tracking (Thorsten Leemhuis) wrote: > > > On 02.06.23 18:12, Amit Pundir wrote: > > >> Move lvs1 and lvs2 regulator nodes up in the rpmh-regulators > > >> list to workaround a boot regression uncovered by the upstream > > >> commit ad44ac082fdf ("regulator: qcom-rpmh: Revert "regulator: > > >> qcom-rpmh: Use PROBE_FORCE_SYNCHRONOUS""). > > >> > > >> Without this fix DB845c fail to boot at times because one of the > > >> lvs1 or lvs2 regulators fail to turn ON in time. > > > > > > /me waves friendly > > > > > > FWIW, as it's not obvious: this... > > > > > >> Link: https://lore.kernel.org/all/CAMi1Hd1avQDcDQf137m2auz2znov4XL8YGrLZsw5edb-NtRJRw@mail.gmail.com/ > > > > > > ...is a report about a regression. One that we could still solve before > > > 6.4 is out. One I'll likely will point Linus to, unless a fix comes into > > > sight. > > > > > > When I noticed the reluctant replies to this patch I earlier today asked > > > in the thread with the report what the plan forward was: > > > https://lore.kernel.org/all/CAD%3DFV%3DV-h4EUKHCM9UivsFHRsJPY5sAiwXV3a1hUX9DUMkkxdg@mail.gmail.com/ > > > > > > Dough there replied: > > > > > > ``` > > > Of the two proposals made (the revert vs. the reordering of the dts), > > > the reordering of the dts seems better. It only affects the one buggy > > > board (rather than preventing us to move to async probe for everyone) > > > and it also has a chance of actually fixing something (changing the > > > order that regulators probe in rpmh-regulator might legitimately work > > > around the problem). That being said, just like the revert the dts > > > reordering is still just papering over the problem and is fragile / > > > not guaranteed to work forever. > > > ``` > > > > > > Papering over obviously is not good, but has anyone a better idea to fix > > > this? Or is "not fixing" for some reason an viable option here? > > > > > > > I understand there is a regression, although kernel is not mainline > > (hash df7443a96851 is unknown) and the only solutions were papering the > > problem. Reverting commit is a temporary workaround. Moving nodes in DTS > > is not acceptable because it hides actual problem and only solves this > > one particular observed problem, while actual issue is still there. It > > would be nice to be able to reproduce it on real mainline with normal > > operating system (not AOSP) - with ramdiks/without/whatever. So far no > > one did it, right? > > The worry I have about the revert here is that it will never be able > to be undone and that doesn't seem great long term. I'm all for a > temporary revert to fix a problem while the root cause is understood, > but in this case I have a hard time believing that we'll make more > progress towards a root cause once the revert lands. All the > investigation we've done so far seems to indicate that the revert only > fixes the problem by luck... > > I completely agree that moving the nodes in the DTS is a hack and just > hides the problem. However, it also at least limits the workaround to > the one board showing the problem and doesn't mean we're stuck with > synchronous probe for rpmh-regulator for all eternity because nobody > can understand this timing issue on db845c. > I agree that we shouldn't hide this by reverting the regulator change. And as has been stated a few times already, the symptom indicates that we have a misconfigured system. Before accepting a patch just shuffling the bricks, I'd like to see some more analysis of what happens wrt the rpmh right before the timeout. Perhaps the landing team can assist here? Regards, Bjorn