Received: by 2002:a05:6a10:5bc5:0:0:0:0 with SMTP id os5csp882163pxb; Thu, 21 Oct 2021 11:15:20 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzI5bMe1ws/ShYCAu21eAfEbT4eoxymW13pT8pzbApJlc38RDJD+sPuEfuQzch4TCTs1UnS X-Received: by 2002:a17:902:ab8c:b0:13a:22d1:88d with SMTP id f12-20020a170902ab8c00b0013a22d1088dmr6558752plr.33.1634840119962; Thu, 21 Oct 2021 11:15:19 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1634840119; cv=none; d=google.com; s=arc-20160816; b=DbvPKKmj77kami2kWHl/+inDSrz7/8E7b/1vYry0jfb3JdnWTNlfjlKJxSQacodqo4 aQW6V1aU4ccXF0T8xEKphKRqiSTKbp05WFmGGa7Uu8pJ2i6saCTOpqyBBqcIudmaH9pG oxmSbEwTCzCEaETdI1SrxmxHJ1TnNkBNRjinKjjFHqfyw6YXVZ94PxO21cjchQzx1jJd XeC/PKM9M3NNGFq353WBtXwV+uD47afpH5tvJVJXcvix56K2q7PRYgF2nCPY7OiGEfDi 7gEU/GsvZ7cLEq7IPmirVKI25u4T2QJBKOrI/qKHpc6CoA+kkcFodoG6/WaMPQa1vtC3 Up5w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=oN/Uf7uD0kL4zJiPTmnms646OPjKKyTZQN5Y3qRKbp8=; b=DJMtS75GSgMUyAaD9RbsOYlVoHqsIa0Bt6tvUG8rNlQbBoXcAC5/OqRNr/SeT47Pzi lcE5IpHkpXRTif5dMwJLG9eXaUE/tprARGg4BrtP0Q4hUvg1CZnSGehWhXKXg8SsWa0x NgvEU623+r26DncF6ZAkVnBQ7MtNybGDzueQ+iXJ5+nT4aoxT86gUgpWmFiG3sLh+PYa gckyfNmrbgtSf9vFHsEE/ODpB18BBhMY5KncEwd7vi9T5bmcCOsexokZAR/r6ZMXUovk rxGxRclr13K8Ea4oO5ZZ9FC9y6obhVGYX19l+YcUCqpwiS1YW8fP4v2g09zGYEDifg6j qsYw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=WG3tf2Wl; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id c2si6589388plg.157.2021.10.21.11.15.03; Thu, 21 Oct 2021 11:15:19 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=WG3tf2Wl; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231220AbhJUSOu (ORCPT + 99 others); Thu, 21 Oct 2021 14:14:50 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35776 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229914AbhJUSOu (ORCPT ); Thu, 21 Oct 2021 14:14:50 -0400 Received: from mail-lf1-x132.google.com (mail-lf1-x132.google.com [IPv6:2a00:1450:4864:20::132]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6DBDAC0613B9 for ; Thu, 21 Oct 2021 11:12:33 -0700 (PDT) Received: by mail-lf1-x132.google.com with SMTP id bq11so970919lfb.10 for ; Thu, 21 Oct 2021 11:12:33 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=oN/Uf7uD0kL4zJiPTmnms646OPjKKyTZQN5Y3qRKbp8=; b=WG3tf2Wl0FmD0p0MGvijfVd6sk2zuOw6qF/alXzGPLB72Tt502Iro25HNSyyqawmSP nZjlpMQ+RFhBR2JTJImFmXnSwnztPF++UhjS2+FUq8bVenZfRIa/vwgmT290rHBhAJlZ FSLIxzD8XiRGQklJldzl4MiEivw5mBdjSvASbrrbB8/3h2KI30PSenEprdILUSej0pbv PGGJHU07lEW0FNy3QCgReS5x/EMJVJlRb5P2Xh0QqgNhsRV556WkjlHwGxaCAtM7XdY/ TeVbDMQlZ9WzbRZGWqWK7DME1uNHaY9GlnGs1UVCr65ZnV81RKBOAsgNSyhghsknIXGn kX2w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=oN/Uf7uD0kL4zJiPTmnms646OPjKKyTZQN5Y3qRKbp8=; b=IYMS5OJfTY84P9KDHLvroOdGYv1BuA7RkJCs/x1XDsK3z/vOzBnZUNVslzNzv4G73o 6XPVB6ThQfIenydgmGRog/d4Ft2TyS5AHWSaEHscbgMqHCaDDyjPvyMtY5TQKEsm8aYy Qc4bodynS6x8SATpXTwc6aJ9HPL09D0uH1zQD5LT7O98c9wrY6NHglgcA0ZFdDIvNj3f r8uanhKK1njc3HVMQBE27+nJSxaF5c+2r84HXjWzloKTQ+yWCG3iFyCoZYvJma1SS42i gHvNru2/3v0Z4IaKnrcQEKgtBU9JGQGrkOMrTzlZWpZkBFAsq/Zm2Y/ObVDNZgN2oF+/ rBBg== X-Gm-Message-State: AOAM530UzTDGtCVvorHxA9rrMnofbIn/CU7qc+hEq4XDDRkVQyoYUmgl 1WB0nP7wO2yJOn4kvAAdUuAsfNz6DMY+RnjSI+qA+g== X-Received: by 2002:a05:6512:1515:: with SMTP id bq21mr6903554lfb.71.1634839951614; Thu, 21 Oct 2021 11:12:31 -0700 (PDT) MIME-Version: 1.0 References: <20210929144451.113334-1-ulf.hansson@linaro.org> <20210929144451.113334-3-ulf.hansson@linaro.org> In-Reply-To: From: Ulf Hansson Date: Thu, 21 Oct 2021 20:11:55 +0200 Message-ID: Subject: Re: [PATCH 2/2] PM: sleep: Fix runtime PM based cpuidle support To: "Rafael J. Wysocki" Cc: Daniel Lezcano , Linux PM , Maulik Shah , Peter Zijlstra , Vincent Guittot , Len Brown , Bjorn Andersson , Linux ARM , Linux Kernel Mailing List Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 21 Oct 2021 at 18:33, Rafael J. Wysocki wrote: > > On Thu, Oct 21, 2021 at 6:17 PM Ulf Hansson wrote: > > > > On Thu, 21 Oct 2021 at 17:09, Rafael J. Wysocki wrote: > > > > > > On Thu, Oct 21, 2021 at 4:05 PM Ulf Hansson wrote: > > > > > > > > On Thu, 21 Oct 2021 at 15:45, Rafael J. Wysocki wrote: > > > > > > > > > > On Thu, Oct 21, 2021 at 1:49 PM Ulf Hansson wrote: > > > > > > > > > > > > On Wed, 20 Oct 2021 at 20:18, Rafael J. Wysocki wrote: > > > > > > > > > > > > > > On Wed, Sep 29, 2021 at 4:44 PM Ulf Hansson wrote: > > > > > > > > > > > > > > > > In the cpuidle-psci case, runtime PM in combination with the generic PM > > > > > > > > domain (genpd), may be used when entering/exiting an idlestate. More > > > > > > > > precisely, genpd relies on runtime PM to be enabled for the attached device > > > > > > > > (in this case it belongs to a CPU), to properly manage the reference > > > > > > > > counting of its PM domain. > > > > > > > > > > > > > > > > This works fine most of the time, but during system suspend in the > > > > > > > > dpm_suspend_late() phase, the PM core disables runtime PM for all devices. > > > > > > > > Beyond this point and until runtime PM becomes re-enabled in the > > > > > > > > dpm_resume_early() phase, calls to pm_runtime_get|put*() will fail. > > > > > > > > > > > > > > > > To make sure the reference counting in genpd becomes correct, we need to > > > > > > > > prevent cpuidle-psci from using runtime PM when it has been disabled for > > > > > > > > the device. Therefore, let's move the call to cpuidle_pause() from > > > > > > > > dpm_suspend_noirq() to dpm_suspend_late() - and cpuidle_resume() from > > > > > > > > dpm_resume_noirq() into dpm_resume_early(). > > > > > > > > > > > > > > > > Diagnosed-by: Maulik Shah > > > > > > > > Suggested-by: Maulik Shah > > > > > > > > Signed-off-by: Ulf Hansson > > > > > > > > --- > > > > > > > > drivers/base/power/main.c | 6 ++---- > > > > > > > > 1 file changed, 2 insertions(+), 4 deletions(-) > > > > > > > > > > > > > > > > diff --git a/drivers/base/power/main.c b/drivers/base/power/main.c > > > > > > > > index cbea78e79f3d..1c753b651272 100644 > > > > > > > > --- a/drivers/base/power/main.c > > > > > > > > +++ b/drivers/base/power/main.c > > > > > > > > @@ -747,8 +747,6 @@ void dpm_resume_noirq(pm_message_t state) > > > > > > > > > > > > > > > > resume_device_irqs(); > > > > > > > > device_wakeup_disarm_wake_irqs(); > > > > > > > > - > > > > > > > > - cpuidle_resume(); > > > > > > > > } > > > > > > > > > > > > > > > > /** > > > > > > > > @@ -870,6 +868,7 @@ void dpm_resume_early(pm_message_t state) > > > > > > > > } > > > > > > > > mutex_unlock(&dpm_list_mtx); > > > > > > > > async_synchronize_full(); > > > > > > > > + cpuidle_resume(); > > > > > > > > dpm_show_time(starttime, state, 0, "early"); > > > > > > > > trace_suspend_resume(TPS("dpm_resume_early"), state.event, false); > > > > > > > > } > > > > > > > > @@ -1336,8 +1335,6 @@ int dpm_suspend_noirq(pm_message_t state) > > > > > > > > { > > > > > > > > int ret; > > > > > > > > > > > > > > > > - cpuidle_pause(); > > > > > > > > - > > > > > > > > device_wakeup_arm_wake_irqs(); > > > > > > > > suspend_device_irqs(); > > > > > > > > > > > > > > > > @@ -1467,6 +1464,7 @@ int dpm_suspend_late(pm_message_t state) > > > > > > > > int error = 0; > > > > > > > > > > > > > > > > trace_suspend_resume(TPS("dpm_suspend_late"), state.event, true); > > > > > > > > + cpuidle_pause(); > > > > > > > > mutex_lock(&dpm_list_mtx); > > > > > > > > pm_transition = state; > > > > > > > > async_error = 0; > > > > > > > > -- > > > > > > > > > > > > > > Well, this is somewhat heavy-handed and it affects even the systems > > > > > > > that don't really need to pause cpuidle at all in the suspend path. > > > > > > > > > > > > Yes, I agree. > > > > > > > > > > > > Although, I am not really changing the behaviour in regards to this. > > > > > > cpuidle_pause() is already being called in dpm_suspend_noirq(), for > > > > > > everybody today. > > > > > > > > > > Yes, it is, but pausing it earlier will cause more energy to be spent, > > > > > potentially. > > > > > > > > > > That said, there are not too many users of suspend_late callbacks in > > > > > the tree, so it may not matter too much. > > > > > > > > > > > > > > > > > > > Also, IIUC you don't need to pause cpuidle completely, but make it > > > > > > > temporarily avoid idle states potentially affected by this issue. An > > > > > > > additional CPUIDLE_STATE_DISABLED_ flag could be used for that I > > > > > > > suppose and it could be set via cpuidle_suspend() called from the core > > > > > > > next to cpufreq_suspend(). > > > > > > > > > > > > cpuidle_suspend() would then need to go and fetch the cpuidle driver > > > > > > instance, which in some cases is one driver per CPU. Doesn't that get > > > > > > rather messy? > > > > > > > > > > Per-CPU variables are used for that, so it is quite straightforward. > > > > > > > > > > > Additionally, since find_deepest_state() is being called for > > > > > > cpuidle_enter_s2idle() too, we would need to treat the new > > > > > > CPUIDLE_STATE_DISABLED_ flag in a special way, right? > > > > > > > > > > No, it already checks "disabled". > > > > > > > > Yes, but that would be wrong. > > > > > > Hmmm. > > > > > > > The use case I want to support, for cpuidle-psci, is to allow all idle > > > > states in suspend-to-idle, > > > > > > So does PM-runtime work in suspend-to-idle? How? > > > > No it doesn't. See below. > > > > > > > > > but prevent those that rely on runtime PM > > > > (after it has been disabled) for the regular idle path. > > > > > > Do you have a special suspend-to-idle handling of those states that > > > doesn't require PM-runtime? > > > > Yes. Feel free to have a look in __psci_enter_domain_idle_state(). > > So in theory you could check the pm_runtime_put_sync_suspend() return > value and fall back to something like WFI if that's an error code. I have already tried that, but it simply got too complicated. The main issue was that runtime PM could become disabled for the device in the middle of executing the ->enter() callback. For example, if pm_runtime_get_sync() fails, I still need to make sure the reference counting in genpd becomes correct - and I can't do that using dev_pm_genpd_resume(). That's because it's not designed to be called in this "unknown" suspend phase, but should be called after the noirq phase and be properly balanced with dev_pm_genpd_suspend(). In other words, the error path didn't work out for me. [...] Kind regards Uffe