Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp2296754rwd; Fri, 16 Jun 2023 01:53:48 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ5o6EYt6NhwjeFhRZSxJC9iC5Yap+u82mMprqwIg4WhM67/lO24KbebKCUbnZfHpMIXmvdn X-Received: by 2002:a05:6a00:198f:b0:662:56b4:c2f2 with SMTP id d15-20020a056a00198f00b0066256b4c2f2mr1888101pfl.25.1686905628587; Fri, 16 Jun 2023 01:53:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1686905628; cv=none; d=google.com; s=arc-20160816; b=gcoI8u7KBAyoQZP/YUmV/TrBx8GsuTAymUsMEymHa/aXNfI0GDsN5vGP7i6Ad1rvtm tV3uIZELUoWG6mwkEvdAvjzjFhwxX6EHcx2PtiOxvc7Kxkb2fkCRMy+a/uj7BKcSu5Di q/7o2SskCej7MfsBnBMQBAgud4U96cmXUteEiATxPZyD0ZOK5a0q7BODyb9+51lQE9Oq 7utDJ9yL65C722AyEtx1EBdKfWPtp4gqKjtKI+Z4wauqOpB/qmJepABcStKpDOQs5I9Q cQhb4wNDGiY9HHO7/nsohXMlIoSqA9EbXPHOxEJd/m8YTM8kAKtbWBlVBarbE69oxj1u M2qQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=SalSihqMLPtzIyuuTKaEAb4CYXiE1PWXJIIS5x5bFLo=; b=CKj1atO5HEcQK3EnXJH/hQGxJxIKIY5yfGSZY9ooGy14xUh3xTEldFpSaVe94rkhU9 zpcOwJ5pBuF8rvQHFYRVVOa+xi5q3ofhC9IqkcwZeNAnTDu2hJsdOin9GtWVAhfXHJid Zcik6XwQtEyNBG1Qqwivl+dAtjI7vkp8BsPl+5T64d9gueCsMDcPK6RBgGN/tYYR6GK7 mXUWOvhT+PTAip439eN1pByO6dG7lfhdjYdUkKzkbpy/eB+0j0o3UGBmU137NoKipVBE fPXum8BsLc+ee1w6Kr9QxxH6a/XhWGxOHPmZ0PkcZwjqulWYA7cT1M88BwE+zChF2MQI 2uXQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=aFnBTsMN; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id z26-20020aa79f9a000000b00666ad5bcfa0si1941726pfr.285.2023.06.16.01.53.34; Fri, 16 Jun 2023 01:53:48 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=aFnBTsMN; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S242794AbjFPI1V (ORCPT + 99 others); Fri, 16 Jun 2023 04:27:21 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59166 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234224AbjFPI1S (ORCPT ); Fri, 16 Jun 2023 04:27:18 -0400 Received: from mail-ej1-x62a.google.com (mail-ej1-x62a.google.com [IPv6:2a00:1450:4864:20::62a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 419B3171F for ; Fri, 16 Jun 2023 01:27:16 -0700 (PDT) Received: by mail-ej1-x62a.google.com with SMTP id a640c23a62f3a-982a0232bdcso60430166b.1 for ; Fri, 16 Jun 2023 01:27:16 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; t=1686904035; x=1689496035; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :from:to:cc:subject:date:message-id:reply-to; bh=SalSihqMLPtzIyuuTKaEAb4CYXiE1PWXJIIS5x5bFLo=; b=aFnBTsMNSDWDL+s8/PLotVo2dqsRSG/88z+9GTQYzBiP7zsdP5zuVpL2Mk+6lPUM0X 57x/QHQPe5RUoHHX54t3/h8IaGrFeYLm2MqCSBZKaWdsHW1CsNth4KS1RLDPEfIY42kd XbLFgSn6xilTA8L6apDX1PJ3/Hznbf0gMu+TeQrHlvLtzD3nznd7DehEX4mrY8hZiDzo wv8WWtOSKPpUMG7YwiCnn8h1SwKcW7aKIVaLonc/EEj7ZDsUEzpBh7E+/SllpiDqSqaS 6c6xDGjepYG5XgrGtszVcnfXAmFfGBRfDvIoOw0fgHXvT9XDTI6rDRB0ioqYa6qe9y4f J5QQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1686904035; x=1689496035; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=SalSihqMLPtzIyuuTKaEAb4CYXiE1PWXJIIS5x5bFLo=; b=RtdPkxt325LUon5PNwnMY3pyxmgherB5Ij0ERHTThX0nYPg5Lu4uqjC4kOFyYA+4c/ rhEeJuxT+REVKZ33PrGut8HRzUDnHjIuZ7MyJ9b8qCTsWdDI4JNvpa3ikfhaknavwUg4 zXJtqbj+5pTdiZPolB+oqVPRdULaA337S4/ORKonFpQEsLdnVPcjwxizth23aA+fQH23 06Riv+l7xnFKaFhPNU3rjpDwxtuBLvP6Llymoe/Dp7hqK9Go8XITwkZuajsNLEhz7wHm LEXyZiXlEpkXkmRmd9vF465g4S3OAdHGzcug9eAG3sHrUUb7ESeN8h9cp9v2PftKbguq BSrw== X-Gm-Message-State: AC+VfDzifxErvEcHgEKn9hQd76LDpT5xTqK+aOsadhq35cLF4hTcmahZ qIcFE8KLm6abIKmhY0HfjmxdIA== X-Received: by 2002:a17:907:2d28:b0:982:8bd5:4224 with SMTP id gs40-20020a1709072d2800b009828bd54224mr1434332ejc.45.1686904034627; Fri, 16 Jun 2023 01:27:14 -0700 (PDT) Received: from [192.168.1.20] ([178.197.219.26]) by smtp.gmail.com with ESMTPSA id q18-20020a1709066b1200b0096a6be0b66dsm10381979ejr.208.2023.06.16.01.27.12 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 16 Jun 2023 01:27:14 -0700 (PDT) Message-ID: <12d6b687-5e5a-bd7c-ff5c-007a74753edb@linaro.org> Date: Fri, 16 Jun 2023 10:27:11 +0200 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Subject: Re: [PATCH] arm64: dts: qcom: sdm845-db845c: Move LVS regulator nodes up Content-Language: en-US To: Amit Pundir Cc: Linux regressions mailing list , Mark Brown , Doug Anderson , Bjorn Andersson , Andy Gross , Rob Herring , Konrad Dybcio , Krzysztof Kozlowski , Caleb Connolly , Conor Dooley , linux-arm-msm , dt , lkml References: <20230602161246.1855448-1-amit.pundir@linaro.org> <358c69ad-fa8a-7386-fe75-92369883ee48@leemhuis.info> <0f6c9dcb-b7f6-fff9-6bed-f4585ea8e487@linaro.org> <0ec6c988-d678-c96c-a7a2-af38e6701404@linaro.org> From: Krzysztof Kozlowski In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-2.2 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE,URIBL_BLOCKED autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 15/06/2023 18:09, Amit Pundir wrote: > On Thu, 15 Jun 2023 at 20:33, Krzysztof Kozlowski > wrote: >> >> On 15/06/2023 15:47, Amit Pundir wrote: >>> On Thu, 15 Jun 2023 at 00:38, Amit Pundir wrote: >>>> >>>> On Thu, 15 Jun 2023 at 00:17, Krzysztof Kozlowski >>>> wrote: >>>>> >>>>> On 14/06/2023 20:18, Linux regression tracking (Thorsten Leemhuis) wrote: >>>>>> On 02.06.23 18:12, Amit Pundir wrote: >>>>>>> Move lvs1 and lvs2 regulator nodes up in the rpmh-regulators >>>>>>> list to workaround a boot regression uncovered by the upstream >>>>>>> commit ad44ac082fdf ("regulator: qcom-rpmh: Revert "regulator: >>>>>>> qcom-rpmh: Use PROBE_FORCE_SYNCHRONOUS""). >>>>>>> >>>>>>> Without this fix DB845c fail to boot at times because one of the >>>>>>> lvs1 or lvs2 regulators fail to turn ON in time. >>>>>> >>>>>> /me waves friendly >>>>>> >>>>>> FWIW, as it's not obvious: this... >>>>>> >>>>>>> Link: https://lore.kernel.org/all/CAMi1Hd1avQDcDQf137m2auz2znov4XL8YGrLZsw5edb-NtRJRw@mail.gmail.com/ >>>>>> >>>>>> ...is a report about a regression. One that we could still solve before >>>>>> 6.4 is out. One I'll likely will point Linus to, unless a fix comes into >>>>>> sight. >>>>>> >>>>>> When I noticed the reluctant replies to this patch I earlier today asked >>>>>> in the thread with the report what the plan forward was: >>>>>> https://lore.kernel.org/all/CAD%3DFV%3DV-h4EUKHCM9UivsFHRsJPY5sAiwXV3a1hUX9DUMkkxdg@mail.gmail.com/ >>>>>> >>>>>> Dough there replied: >>>>>> >>>>>> ``` >>>>>> Of the two proposals made (the revert vs. the reordering of the dts), >>>>>> the reordering of the dts seems better. It only affects the one buggy >>>>>> board (rather than preventing us to move to async probe for everyone) >>>>>> and it also has a chance of actually fixing something (changing the >>>>>> order that regulators probe in rpmh-regulator might legitimately work >>>>>> around the problem). That being said, just like the revert the dts >>>>>> reordering is still just papering over the problem and is fragile / >>>>>> not guaranteed to work forever. >>>>>> ``` >>>>>> >>>>>> Papering over obviously is not good, but has anyone a better idea to fix >>>>>> this? Or is "not fixing" for some reason an viable option here? >>>>>> >>>>> >>>>> I understand there is a regression, although kernel is not mainline >>>>> (hash df7443a96851 is unknown) and the only solutions were papering the >>>>> problem. Reverting commit is a temporary workaround. Moving nodes in DTS >>>>> is not acceptable because it hides actual problem and only solves this >>>>> one particular observed problem, while actual issue is still there. It >>>>> would be nice to be able to reproduce it on real mainline with normal >>>>> operating system (not AOSP) - with ramdiks/without/whatever. So far no >>>>> one did it, right? >>>> >>>> No, I did not try non-AOSP system yet. I'll try it tomorrow, if that >>>> helps. With mainline hash. >>> >>> Hi, here is the crash report on db845c running vanilla v6.4-rc6 with a >>> debian build https://bugs.linaro.org/attachment.cgi?id=1142 >>> >>> And fwiw here is the db845c crash log with AOSP running vanilla >>> v6.4-rc6 https://bugs.linaro.org/attachment.cgi?id=1141 >>> >>> Regards, >>> Amit Pundir >>> >>> PS: rootfs in this bug report doesn't matter much because I'm loading >>> all the kernel modules from a ramdisk and in the case of a crash the >>> UFS doesn't probe anyway. >> >> I just tried current next with defconfig (I could not find your config, >> neither here, nor in your previous mail thread nor in bugzilla). Also >> with REGULATOR_QCOM_RPMH as module. >> >> I tried also v6.4-rc6 - also defconfig with default and module >> REGULATOR_QCOM_RPMH. >> >> All the cases work on my RB3 - no warnings reported. >> >> If you do not use defconfig, then in all reports please mention the >> differences (the best) or at least attach it. > > Argh.. Sorry about that. Big mistake from my side. I did want to > upload my defconfig but forgot. Defconfig plays a key role because, as > I mentioned in one of my previous email, it is a timing/race bug and > if I do any much changes in my defconfig (i.e. enable ftrace for > example or as little as add printk in qcom_rpmh_regulator code) then I > can't reproduce this bug. So needless to say that I can't reproduce > this bug with default arm64 defconfig. > > Please find my custom (but upstream) defconfig here > https://bugs.linaro.org/attachment.cgi?id=1143 and prebuilt binaries > here https://people.linaro.org/~amit.pundir/db845c-userdebug/rpmh_bug/. > "fastboot flash boot ./boot.img-6.4-rc6 reboot" and/or a few (<5) > reboots should be enough to trigger the crash. > > I have downloaded the initrd from here > https://snapshots.linaro.org/96boards/dragonboard845c/linaro/debian/569/initrd.img-5.15.0-qcomlt-arm64 > but edited ramdisk/init to run "load_module" function early in the > boot and ramdisk/conf/initramfs.conf has "MODULES=list" instead of > "MODULES=most", where all the kernel modules are listed at > /etc/initramfs-tools/modules. So you have interconnect as module - this is not a supported setup. It might work with if all the modules are loaded very early or might not. Pinctrl is another driver which should be built-in. With your defconfig I see regular issue - console and system dies because of lack of interconnects, most likely. I don't see your WARNs - I just see usual hang. See: https://lore.kernel.org/all/20221021032702.1340963-1-krzysztof.kozlowski@linaro.org/ If you want them to really be modules, then you need to fix all the dependencies (SOFTDEP?), probe ordering glitches. It's not a problem of DTS. Just because something can be built as module, does not mean it will work. We don't test it, we don't work with them as modules. It's kind of the same as here: https://lore.kernel.org/all/ac328b6a-a8e2-873d-4015-814cb4f5588e@canonical.com/ I understand that we might have here regression, if these were working as modules, but I don't think we ever really committed to it. We can as well make it non-module to solve the regression. Best regards, Krzysztof