Received: by 2002:a05:6358:e9c4:b0:b2:91dc:71ab with SMTP id hc4csp4569598rwb; Mon, 8 Aug 2022 03:31:17 -0700 (PDT) X-Google-Smtp-Source: AA6agR6PeLBM4ZGVajeOL3i+lT1/MXZtfM6KJTxb5YsnY0QwYG4Q836XvfbtE56VRQkGmYCtSTYI X-Received: by 2002:a17:907:96a1:b0:730:e0bd:1420 with SMTP id hd33-20020a17090796a100b00730e0bd1420mr12183764ejc.83.1659954677321; Mon, 08 Aug 2022 03:31:17 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1659954677; cv=none; d=google.com; s=arc-20160816; b=K2asNzq68zaqX7oS9NLlnsGJyv2fRD+r/5ZJFmQ4fHR1KeKrl07Grb2keu7OqF7yi6 1x9d/VugzM1Y4JslUfZjROgTD/cXrIPTZT2nWXWnjmywjxuVPHxGzmWn5w1XKHgvUPLs b+L6cgy7tdrVcNyKLIxBcYinwwFV29Hdke9zbSuCkTrxg05zWkJ+xObaSJ//8MuPhyJy XZX31G/Bh62Q1QW9teX1uXNYbcojlOJwcrB8ntj7TrkIFpZpF7ip49uL2JinwUX3Pwnt 4oUQx7Kshy6cEQLMjQlwsethUbKhjAf0xGeV5w7Kg7Wpro8bHqha7elhQHZ4BDiB5Yi3 UWVQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:sender:dkim-signature; bh=EXiXJN8MYkJGng5lNfVERdVuIaPMTXfVA4hJ6cFOEMk=; b=l2M5YAlnhKdp22qSaFnEUU2fo6WMGcWN6Y5hqKqNljdsdGDmA8g8QKyY7g2YKtDYJa TNfis2mTDYXPoJQ8/hRJPeSEc+zl1rooM6mz8XSsYCxlNGQFjt2Y7xZvMbyqqQraIfWX lrtZmoyfGz97FZ74KmV1iDy1JegdQFVXFQxcX67tveU1iUYE7aUuSlHqnyNmnF3ppKcS Z2YRAc0Xcb8RbXfX+ulSueBPUy6WnQrIdUi2PoVNl3Ip8bU8oOljDwppOXABDm7dSFfj sYpMWPlM8C9Ny/paQDXtv+blv1VezG6K5tjVX+6Pf6ADIAKnQxHzqXOM/N/49Z5GI8sR LUUg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=aLhQ9Rvq; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id hd37-20020a17090796a500b00730b61df902si12815341ejc.811.2022.08.08.03.30.52; Mon, 08 Aug 2022 03:31:17 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=aLhQ9Rvq; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237452AbiHHK0R (ORCPT + 99 others); Mon, 8 Aug 2022 06:26:17 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:47118 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232740AbiHHK0O (ORCPT ); Mon, 8 Aug 2022 06:26:14 -0400 Received: from mail-pl1-x636.google.com (mail-pl1-x636.google.com [IPv6:2607:f8b0:4864:20::636]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 59A72EAF; Mon, 8 Aug 2022 03:26:13 -0700 (PDT) Received: by mail-pl1-x636.google.com with SMTP id p18so8130039plr.8; Mon, 08 Aug 2022 03:26:13 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:sender:from:to:cc; bh=EXiXJN8MYkJGng5lNfVERdVuIaPMTXfVA4hJ6cFOEMk=; b=aLhQ9RvqFygerD+CeJ4X5lxejhEF8/NQ8TOThb4q+E8/7SBY2lgeY/xmVjdZfE9xjN GQswa/57mZeRfipXhzaVGV0A2981YTeAlgRAACinebEliuxxT0HtLM63rBwxcMos0mq6 KmikJ+KHutRLaar7qYkCe6ktRoTjB5Yoxiq3zu2xpwd5LqS8sg9rX18hQWsu89O+3KTy PPaTC5q/GCw2VcOPzdzKVc2JxFq6FNnrBboTU/EeuGP7s7zc/ZQXt8dgDguA2AvKOPAv lTnBswm/5kvDsPyF92L6wJBAKrwMqWCmbJ2Vi1iOIvQFQg+uKcGj1owxGWwemGBvjnnD Cnkg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:sender:x-gm-message-state:from:to:cc; bh=EXiXJN8MYkJGng5lNfVERdVuIaPMTXfVA4hJ6cFOEMk=; b=3hXapC9Rb9M7+KZ3/+4JvVho9Vs7o8AAfIcOv4TtjGmWumqI/3N+eGn4ZKSkh/k4MO /jCGiTFu5nusdfLM9kaZhER6r+4jCO74jGcwkFDevtXy82psJLtlHz60cmT7GdrbZkgZ KFNJxBYm5HMoZYM9KULTEgBqtuUK+WBT57eHQZpa/l6WH+mYLsCeZJRXCp9oiZ0rOuYV oCboiULJkZ0hJ1Z/DYySufueyos2nPTw37ut/31FQTwUrwq3uU3a1Q7Y8dvlCLoZQccO T2EMGA9DIX8nOdUuY8BseMxy1Zqw46xpaqKcAFayESb9cH3EKQ5PJMfNQUGtG2jMbEEW bUXw== X-Gm-Message-State: ACgBeo0xcFaGb/H/H56D7FjwjUpl6zIIF47yuBHEMAkMJlFOkWRC0/Xc u3soxWjh4obJfo9XaVZw/No= X-Received: by 2002:a17:902:b607:b0:170:c7fc:388a with SMTP id b7-20020a170902b60700b00170c7fc388amr1922821pls.29.1659954372772; Mon, 08 Aug 2022 03:26:12 -0700 (PDT) Received: from server.roeck-us.net ([2600:1700:e321:62f0:329c:23ff:fee3:9d7c]) by smtp.gmail.com with ESMTPSA id q19-20020a170902f35300b0016909be39e5sm8248889ple.177.2022.08.08.03.26.10 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 08 Aug 2022 03:26:11 -0700 (PDT) Sender: Guenter Roeck Date: Mon, 8 Aug 2022 03:26:10 -0700 From: Guenter Roeck To: Michael Walle Cc: daniel.lezcano@linexp.org, abailon@baylibre.com, anarsoul@gmail.com, baolin.wang7@gmail.com, bjorn.andersson@linaro.org, broonie@kernel.org, damien.lemoal@opensource.wdc.com, daniel.lezcano@linaro.org, digetx@gmail.com, f.fainelli@gmail.com, glaroque@baylibre.com, hayashi.kunihiko@socionext.com, heiko@sntech.de, j-keerthy@ti.com, jonathanh@nvidia.com, khilman@baylibre.com, linux-kernel@vger.kernel.org, linux-pm@vger.kernel.org, lukasz.luba@arm.com, matthias.bgg@gmail.com, mcoquelin.stm32@gmail.com, mhiramat@kernel.org, miquel.raynal@bootlin.com, niklas.soderlund@ragnatech.se, rafael@kernel.org, rui.zhang@intel.com, shawnguo@kernel.org, talel@amazon.com, thierry.reding@gmail.com, tiny.windzz@gmail.com Subject: Re: [PATCH v5 00/33] New thermal OF code Message-ID: <20220808102610.GA1969424@roeck-us.net> References: <20220804224349.1926752-1-daniel.lezcano@linexp.org> <20220808094216.928018-1-michael@walle.cc> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20220808094216.928018-1-michael@walle.cc> X-Spam-Status: No, score=-1.3 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_EF,FREEMAIL_ENVFROM_END_DIGIT, FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM,HEADER_FROM_DIFFERENT_DOMAINS, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Aug 08, 2022 at 11:42:16AM +0200, Michael Walle wrote: > Hi, > > > The following changes are depending on: > > > > - 20220722200007.1839356-1-daniel.lezcano@linexp.org > > > > which are present in the thermal/linux-next branch: > > > > https://git.kernel.org/pub/scm/linux/kernel/git/thermal/linux.git/log/?h=thermal/linux-next > > > > The series introduces a new thermal OF code. The patch description gives > > a detailed explanation of the changes. Basically we write new OF parsing > > functions, we migrate all the users of the old thermal OF API to the new > > one and then we finish by removing the old OF code. > > > > That is the second step to rework the thermal OF code. More patches will > > come after that to remove the duplication of the trip definitions in the > > different drivers which will result in more code duplication removed and > > consolidation of the core thermal framework. > > > > Thanks for those who tested the series on their platform and > > investigated the regression with the disabled by default thermal zones. > > I haven't looked closely yet, but this series is breaking two of my > boards. > > There seems to be one mistake within the new thermal code: > > [ 2.030452] thermal_sys: Failed to find 'trips' node > [ 2.033664] usb 1-1: new high-speed USB device number 2 using xhci-hcd > [ 2.035434] thermal_sys: Failed to find trip points for tmu id=2 > [ 2.048010] qoriq_thermal 1f80000.tmu: Failed to register sensors > [ 2.054128] qoriq_thermal: probe of 1f80000.tmu failed with error -22 > [ 2.060607] devm_thermal_of_zone_release:707 res=ffff002002377180 > [ 2.067044] Unable to handle kernel paging request at virtual address 01adadadadadad88 > [ 2.075003] Mem abort info: > [ 2.077805] ESR = 0x0000000096000004 > [ 2.081562] EC = 0x25: DABT (current EL), IL = 32 bits > [ 2.086893] SET = 0, FnV = 0 > [ 2.089955] EA = 0, S1PTW = 0 > [ 2.093100] FSC = 0x04: level 0 translation fault > [ 2.097993] Data abort info: > [ 2.100876] ISV = 0, ISS = 0x00000004 > [ 2.104724] CM = 0, WnR = 0 > [ 2.107698] [01adadadadadad88] address between user and kernel address ranges > [ 2.114863] Internal error: Oops: 96000004 [#1] SMP > [ 2.119754] Modules linked in: > [ 2.122815] CPU: 1 PID: 1 Comm: swapper/0 Not tainted 5.19.0-next-20220808-00078-ga957a15f74fc-dirty #1694 > [ 2.132504] Hardware name: Kontron KBox A-230-LS (DT) > [ 2.137568] pstate: 20000005 (nzCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--) > [ 2.144554] pc : kfree+0x5c/0x3c0 > [ 2.147885] lr : thermal_of_zone_unregister+0x34/0x54 > [ 2.152954] sp : ffff80000a22bab0 > [ 2.156274] x29: ffff80000a22bab0 x28: 0000000000000000 x27: ffff800009960464 > [ 2.163438] x26: ffff800009a16960 x25: 0000000000000006 x24: ffff800009f09a40 > [ 2.170601] x23: ffff800009ab9008 x22: ffff800008d0d684 x21: 01adadadadadad80 > [ 2.177763] x20: 6b6b6b6b6b6b6b6b x19: ffff002002335000 x18: 00000000fffffffb > [ 2.184925] x17: ffff800008d0d67c x16: ffff800008d072b4 x15: ffff800008d0c6c4 > [ 2.192087] x14: ffff800008d0c34c x13: ffff8000088d5034 x12: ffff8000088d46d4 > [ 2.199248] x11: ffff8000088d4624 x10: 0000000000000000 x9 : ffff800008d0d684 > [ 2.206410] x8 : ffff002000b1a158 x7 : bbbbbbbbbbbbbbbb x6 : ffff80000a0f53b8 > [ 2.213572] x5 : ffff80000a22b940 x4 : 0000000000000000 x3 : 0000000000000000 > [ 2.220733] x2 : fffffc0000000000 x1 : ffff002000838040 x0 : 01adb1adadadad80 > [ 2.227895] Call trace: > [ 2.230342] kfree+0x5c/0x3c0 > [ 2.233318] thermal_of_zone_unregister+0x34/0x54 > [ 2.238036] devm_thermal_of_zone_release+0x44/0x54 > [ 2.242931] release_nodes+0x64/0xd0 > [ 2.246516] devres_release_all+0xbc/0x350 > [ 2.250623] device_unbind_cleanup+0x20/0x70 > [ 2.254905] really_probe+0x1a0/0x2e4 > [ 2.258577] __driver_probe_device+0x80/0xec > [ 2.262859] driver_probe_device+0x44/0x130 > [ 2.267055] __driver_attach+0x104/0x1b4 > [ 2.270989] bus_for_each_dev+0x7c/0xe0 > [ 2.274834] driver_attach+0x30/0x40 > [ 2.278418] bus_add_driver+0x160/0x210 > [ 2.281900] hub 1-1:1.0: USB hub found > [ 2.282264] driver_register+0x84/0x140 > [ 2.286109] hub 1-1:1.0: 7 ports detected > [ 2.289859] __platform_driver_register+0x34/0x40 > [ 2.289867] qoriq_tmu_init+0x28/0x34 > [ 2.302258] do_one_initcall+0x50/0x250 > [ 2.306104] kernel_init_freeable+0x278/0x31c > [ 2.310474] kernel_init+0x30/0x140 > [ 2.313972] ret_from_fork+0x10/0x20 > [ 2.317559] Code: b25657e2 d34cfc00 d37ae400 8b020015 (f94006a1) > [ 2.323672] ---[ end trace 0000000000000000 ]--- > [ 2.328317] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b > [ 2.335999] SMP: stopping secondary CPUs > [ 2.339932] Kernel Offset: disabled > [ 2.343425] CPU features: 0x2000,0800f021,00001086 > [ 2.348229] Memory Limit: none > [ 2.351289] ---[ end Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b ]--- > > This was seen a sl28 board > (arch/arm64/boot/dts/freescale/fsl-ls1028a-kontron-kbox-a-230-ls.dts). > The same board in the KernelCI also have some more information: > https://lavalab.kontron.com/scheduler/job/151900#L1162 > > But I guess even if that is fixed, the driver will not probe due to the > missing trip points? Are they now mandatory? Does it mean we'd need to > update our device trees? But that will then mean older devices trees > don't work anymore. It would also mean that all hwmon drivers registering a thermal zone sensor would fail to register unless such a thermal zone actually exists. This would make the whole concept of having the hwmon core register thermal zone sensors impossible. I have no idea how this is expected to work now, but there is an apparent flaw in the logic. That means I withdraw my Acked-by: for the hwmon patches in this series until it is guaranteed that hwmon registration does not fail as above if there is no thermal zone associated with a sensor. > > On my second board > (arch/arm/boot/dts/lan966x-kontron-kswitch-d10-mmt-6g-2gs.dts). I get the > following error: > > [ 6.292819] thermal_sys: Unable to find thermal zones description > [ 6.298872] thermal_sys: Failed to find thermal zone for hwmon id=0 > [ 6.305375] lan966x-hwmon e2010180.hwmon: error -EINVAL: failed to register hwmon device > [ 6.313508] lan966x-hwmon: probe of e2010180.hwmon failed with error -22 > > Again, is there seems to be something missing in the device tree. For this > board a device tree change should be easily doable, as it is still in > development. > That would work for this board, but not for all other boards where a sensor tries to register with the thermal subsystem but there is no thermal zone defined for it. Guenter