Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 74D9DC433EF for ; Wed, 1 Dec 2021 13:50:37 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1349855AbhLANx4 (ORCPT ); Wed, 1 Dec 2021 08:53:56 -0500 Received: from mail-oi1-f182.google.com ([209.85.167.182]:38904 "EHLO mail-oi1-f182.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1349645AbhLANwy (ORCPT ); Wed, 1 Dec 2021 08:52:54 -0500 Received: by mail-oi1-f182.google.com with SMTP id r26so48559256oiw.5; Wed, 01 Dec 2021 05:49:24 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=Bl/qxNDcbAoYnKBSdlmHVQ2e+mMpGHn+N5FT/PL5d34=; b=p+W3pXOqdSQpVfBI/hqLOYusLW2hV//anyl+ZkN/tH9hrsU2/sldUuDoyb6wmT8sAy BTCDE7m8oXSUrQZ8+Xf3xx3Oex2Yx17Vz+DQOnURHi8QjWBLavkmdLwMC4BDsKNKZ4Rp rTKiVq4my359e4yh33t6Et+xmcgpSHR35+yDj5w8itRKU4RogvzrdGlOjABcy7u2ZjHK XiUx0yE4mIs9VZve0GkYbkwYXLmlF04r+fnRxviIgG6HUcInJavkRz93oz7ooOWo9OeD kqY6BYNkiOxIDix/cIRXKmmZOc2gGMcLmiaMjoR7Be+GFiadPzOZM6V4S78MtRPfXyvS XUKQ== X-Gm-Message-State: AOAM5317LI8J2kSdIWe7ICDIJ3vAIUVIOHg/TTTAURBt//i2MkWmOShv FAkwWEkbz6wCkf7O5KcmCS51HQEFy9rX+Lpbie/g797f9vM= X-Google-Smtp-Source: ABdhPJzRMv+sVtovE/Ta1R5gX16yr63j3RHPcx648rnm+0OruXRZLrUiw8aaQ6z3mY8TEcHbIHYC1ABfZzRs+ojLARs= X-Received: by 2002:a05:6808:14c2:: with SMTP id f2mr6118724oiw.154.1638366563606; Wed, 01 Dec 2021 05:49:23 -0800 (PST) MIME-Version: 1.0 References: <20211026222626.39222-1-ulf.hansson@linaro.org> <4380690.LvFx2qVVIh@kreacher> In-Reply-To: From: "Rafael J. Wysocki" Date: Wed, 1 Dec 2021 14:49:12 +0100 Message-ID: Subject: Re: [PATCH] PM: runtime: Allow rpm_resume() to succeed when runtime PM is disabled To: Ulf Hansson Cc: "Rafael J. Wysocki" , "Rafael J. Wysocki" , Alan Stern , Linux PM , Kevin Hilman , Maulik Shah , Linux ARM , Linux Kernel Mailing List Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Dec 1, 2021 at 10:02 AM Ulf Hansson wrote: > > On Tue, 30 Nov 2021 at 18:26, Rafael J. Wysocki wrote: > > > > On Tue, Nov 30, 2021 at 5:41 PM Ulf Hansson wrote: > > > > > > On Tue, 30 Nov 2021 at 14:02, Rafael J. Wysocki wrote: > > > > > > > > On Tue, Nov 30, 2021 at 12:58 PM Ulf Hansson wrote: > > > > > > > > > > [...] > > > > > > > > > > > > > > > > > > > > > > > > > > > Am I thinking correctly that this is mostly about working around the > > > > > > > > > > > limitations of pm_runtime_force_suspend()? > > > > > > > > > > > > > > > > > > > > No, this isn't related at all. > > > > > > > > > > > > > > > > > > > > The cpuidle-psci driver doesn't have PM callbacks, thus using > > > > > > > > > > pm_runtime_force_suspend() would not work here. > > > > > > > > > > > > > > > > > > Just wanted to send a ping on this to see if we can come to a > > > > > > > > > conclusion. Or maybe we did? :-) > > > > > > > > > > > > > > > > > > I think in the end, what slightly bothers me, is that the behavior is > > > > > > > > > a bit inconsistent. Although, maybe it's the best we can do. > > > > > > > > > > > > > > > > I've been thinking about this and it looks like we can do better, but > > > > > > > > instead of talking about this I'd rather send a patch. > > > > > > > > > > > > > > Alright. > > > > > > > > > > > > > > I was thinking along the lines of make similar changes for > > > > > > > rpm_idle|suspend(). That would make the behaviour even more > > > > > > > consistent, I think. > > > > > > > > > > > > > > Perhaps that's what you have in mind? :-) > > > > > > > > > > > > Well, not exactly. > > > > > > > > > > > > The idea is to add another counter (called restrain_depth in the patch) > > > > > > to prevent rpm_resume() from running the callback when that is potentially > > > > > > problematic. With that, it is possible to actually distinguish devices > > > > > > with PM-runtime enabled and it allows the PM-runtime status to be checked > > > > > > when it is still known to be meaningful. > > > > > > > > > > Hmm, I don't quite understand the benefit of introducing a new flag > > > > > for this. rpm_resume() already checks the disable_depth to understand > > > > > when it's safe to invoke the callback. Maybe there is a reason why > > > > > that isn't sufficient? > > > > > > > > The problem is that disable_depth > 0 may very well mean that runtime > > > > PM has not been enabled at all for the given device which IMO is a > > > > problem. > > > > > > > > As it stands, it is necessary to make assumptions, like disable_depth > > > > == 1 meaning that runtime PM is really enabled, but the PM core has > > > > disabled it temporarily, which is somewhat questionable. > > > > > > > > Another problem with disabling is that it causes rpm_resume() to fail > > > > even if the status is RPM_ACTIVE and it has to do that exactly because > > > > it cannot know why runtime PM has been disabled. If it has never been > > > > enabled, rpm_resume() must fail, but if it has been disabled > > > > temporarily, rpm_resume() may return 1 when the status is RPM_ACTIVE. > > > > > > > > The new count allows the "enabled in general, but temporarily disabled > > > > at the moment" to be handled cleanly. > > > > > > My overall comment is that I fail to understand why we need to > > > distinguish between these two cases. To me, it shouldn't really > > > matter, *why* runtime PM is (or have been) disabled for the device. > > > > It matters if you want to trust the status, because "disabled" means > > "the status doesn't matter". > > Well, that doesn't really match how the runtime PM interface is being > used today. Well, I clearly disagree. > For example, we have a whole bunch of helper functions, allowing us to > update and check the runtime PM state of the device, even when the > disable_depth > 0. Some functions, like pm_runtime_set_active() for > example, even take parents and device-links into account. That's true, but that's for a purpose. If runtime PM becomes enabled after using pm_runtime_set_active(), the status should better be consistent with the settings of the parent etc. > > > > If you want the status to stay meaningful, but prevent callbacks from > > running, you need something else. > > > > > The important point is that the default state for a device is > > > RPM_SUSPENDED and someone has moved into RPM_ACTIVE, for whatever > > > reason. That should be sufficient to allow rpm_resume() to return '1' > > > when disable_depth > 0, shouldn't it? > > > > No, because there is no rule by which the status of devices with > > PM-runtime disabled must be RPM_SUSPENDED. > > That's not what I was trying to say. > > The initial/default runtime PM state for a device is RPM_SUSPENDED, > which is being set in pm_runtime_init(). Although, I agree that it > can't be trusted that this state actually reflects the state of the > HW, it's still a valid state for the device from a runtime PM point of > view. No, it is not. It's just the default. > However, and more importantly, if the state has moved to RPM_ACTIVE, > someone must have deliberately moved the device into that state. Sure, but it cannot be regarded as an indication on whether or not runtime PM is supported and has ever been enabled for the given device. Again, there is no rule regarding the status value for devices with runtime PM disabled, either way. > For this reason, I believe it seems reasonable to trust it, both from HW > point of view, but definitely also from a runtime PM point of view. If > not, then what should we do? Trust it only when runtime PM is enabled, ie. dev->power.disable_depth == 0. That's exactly the reason why pm_runtime_suspended() returns false if runtime PM is disabled for the target device and if pm_runtime_suspended() is called during system-wide suspend and resume, it is not clear how to interpret its return value. If it returns true outside the system suspend-resume code path, that means "runtime PM has been enabled and the device has been runtime-suspended" and I want it to mean exactly the same thing during system-wide suspend and resume, so people don't need to wonder about the context in which the code is running.