Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp696027iob; Wed, 18 May 2022 10:50:19 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxma638KTMIdK6E94UEWojabriAJn4NFg1mvIRUXDoqu+cFMsVE7B3DmS5MwH2iIRlQomcp X-Received: by 2002:a17:902:f605:b0:14d:9e11:c864 with SMTP id n5-20020a170902f60500b0014d9e11c864mr734055plg.54.1652896219470; Wed, 18 May 2022 10:50:19 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1652896219; cv=none; d=google.com; s=arc-20160816; b=uhP2QFCBoPkh/BOedhNCv6RVZE0DMYgsFRvlohBNXp0QAOpogbPP2Q30M3jqyn9n4a DehOvunX3Ujecfunou58jspJ+CE5CFwhgz6V2bcpujJ45g8Zu6mljxYGhip0KQ7PaxI6 85dBRPxHezinXrCgmQBOxTADjfCaC6oxHhUtKd0YTj5JZMqjRgOLnRVqn8sdV4DdvxOa dxhLrNdEqONl3x8j8q4yH7G2P1Xx1d0j0qZQjv7qGC+ApMJpOAP3L6p7AFntjXgdy92a tcshKp+5fvmPwpB81Ct725od9sC8vsis+y6DYUUvmlhFelwTLsFe9XqNmYj4V2+htPJ1 exzQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :message-id:subject:cc:to:from:date:dkim-signature; bh=zHyTSca72soLQMxXiXrBDFtnfJ09EdX+xrvyQ1zZwdY=; b=JPgI94jqINICXOCJaJOhsqWuif/xj8ZV/PZrXAVOvOkNLLeXGWNQ/Hsnxck70N8Q6Y aDCcLsNz9agAlqNe8HlPg5dsINwJEZEeODdvfFEJcQiVAkce59y5EzLcTjpi5YJNaE15 /OoMU6Iinh73JRC3qdesIRQjuU4CbAh62xE4YX4dDL09Ot0WKwd8PhCyuXr18jMZ+5PL 0sJjsd5b+1uG7W6APHgjgJOhOd8ni6OAPE1q8J2pIAOH9Cih+qceqqSsYaSfu25xoQwR v5YW0y83BFWTiz2Kui8HSsSJ+opEhJRzbZxW6OJXJfCB4zNLlbn3NzAXWPkD1+G4vDYE RS0g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=UO5zkNXw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id i12-20020a170902c94c00b001617b60e734si3709627pla.139.2022.05.18.10.50.18 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 18 May 2022 10:50:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=UO5zkNXw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 6A675205F27; Wed, 18 May 2022 10:50:16 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S241094AbiERRuJ (ORCPT + 99 others); Wed, 18 May 2022 13:50:09 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:57350 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S241006AbiERRuG (ORCPT ); Wed, 18 May 2022 13:50:06 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 0FEAF1FC2D1; Wed, 18 May 2022 10:50:06 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 8F7A9617E9; Wed, 18 May 2022 17:50:05 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id A84DAC385A5; Wed, 18 May 2022 17:50:04 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1652896204; bh=K00bz992tBRd/BGSj33PgLYmtpNF5xkjClI/j3Rqjp0=; h=Date:From:To:Cc:Subject:In-Reply-To:From; b=UO5zkNXwKzGnoXZRI4EtNATHQzvLFfmmB+A/AsDHoJKcv6TFgNbjhUKzQ0E0M0ZU/ 9Kk7eOD986APQOe4jIDBbF+7te2kCTjZp3TYiGpdElzi80MCnZAj3azMzpKqceOpqw y+edCtDN4vbY7taSWRAsiq6FHOBVlKJtWRJZjYhsse3Pr5yz/NwdZcjeJ03btgwIJt 3qFszD6QokNG5/YtaN+trjj/ugPxIIkGRcFfmgpWYxOeQAZGbR3bvGGXjGeEAY3IJs MoYmp7atofOEguS/DqkzFYfb8zm+pSYyeLTE84KxgFW8/ciKzpgI9P4BMKNwJp++qF b1Hne3A/md3uw== Date: Wed, 18 May 2022 12:50:02 -0500 From: Bjorn Helgaas To: "Rafael J. Wysocki" Cc: Tanjore Suresh , Greg Kroah-Hartman , Christoph Hellwig , Sagi Grimberg , Bjorn Helgaas , Linux Kernel Mailing List , linux-nvme , Linux PCI Subject: Re: [PATCH v3 1/3] driver core: Support asynchronous driver shutdown Message-ID: <20220518175002.GA1148748@bhelgaas> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,MAILING_LIST_MULTI, RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, May 18, 2022 at 01:38:49PM +0200, Rafael J. Wysocki wrote: > On Wed, May 18, 2022 at 12:08 AM Tanjore Suresh wrote: > > > > This changes the bus driver interface with additional entry points > > to enable devices to implement asynchronous shutdown. The existing > > synchronous interface to shutdown is unmodified and retained for > > backward compatibility. > > > > This changes the common device shutdown code to enable devices to > > participate in asynchronous shutdown implementation. > > > > Signed-off-by: Tanjore Suresh > > --- > > drivers/base/core.c | 38 +++++++++++++++++++++++++++++++++++++- > > include/linux/device/bus.h | 12 ++++++++++++ > > 2 files changed, 49 insertions(+), 1 deletion(-) > > > > diff --git a/drivers/base/core.c b/drivers/base/core.c > > index 3d6430eb0c6a..ba267ae70a22 100644 > > --- a/drivers/base/core.c > > +++ b/drivers/base/core.c > > @@ -4479,6 +4479,7 @@ EXPORT_SYMBOL_GPL(device_change_owner); > > void device_shutdown(void) > > { > > struct device *dev, *parent; > > + LIST_HEAD(async_shutdown_list); > > > > wait_for_device_probe(); > > device_block_probing(); > > @@ -4523,7 +4524,13 @@ void device_shutdown(void) > > dev_info(dev, "shutdown_pre\n"); > > dev->class->shutdown_pre(dev); > > } > > - if (dev->bus && dev->bus->shutdown) { > > + if (dev->bus && dev->bus->async_shutdown_start) { > > + if (initcall_debug) > > + dev_info(dev, "async_shutdown_start\n"); > > + dev->bus->async_shutdown_start(dev); > > + list_add_tail(&dev->kobj.entry, > > + &async_shutdown_list); > > + } else if (dev->bus && dev->bus->shutdown) { > > if (initcall_debug) > > dev_info(dev, "shutdown\n"); > > dev->bus->shutdown(dev); > > @@ -4543,6 +4550,35 @@ void device_shutdown(void) > > spin_lock(&devices_kset->list_lock); > > } > > spin_unlock(&devices_kset->list_lock); > > + > > + /* > > + * Second pass spin for only devices, that have configured > > + * Asynchronous shutdown. > > + */ > > + while (!list_empty(&async_shutdown_list)) { > > + dev = list_entry(async_shutdown_list.next, struct device, > > + kobj.entry); > > + parent = get_device(dev->parent); > > + get_device(dev); > > + /* > > + * Make sure the device is off the list > > + */ > > + list_del_init(&dev->kobj.entry); > > + if (parent) > > + device_lock(parent); > > + device_lock(dev); > > + if (dev->bus && dev->bus->async_shutdown_end) { > > + if (initcall_debug) > > + dev_info(dev, > > + "async_shutdown_end called\n"); > > + dev->bus->async_shutdown_end(dev); > > + } > > + device_unlock(dev); > > + if (parent) > > + device_unlock(parent); > > + put_device(dev); > > + put_device(parent); > > + } > > } > > > > /* > > diff --git a/include/linux/device/bus.h b/include/linux/device/bus.h > > index a039ab809753..f582c9d21515 100644 > > --- a/include/linux/device/bus.h > > +++ b/include/linux/device/bus.h > > @@ -49,6 +49,16 @@ struct fwnode_handle; > > * will never get called until they do. > > * @remove: Called when a device removed from this bus. > > * @shutdown: Called at shut-down time to quiesce the device. > > + * @async_shutdown_start: Called at the shutdown-time to start > > + * the shutdown process on the device. > > + * This entry point will be called only > > + * when the bus driver has indicated it would > > + * like to participate in asynchronous shutdown > > + * completion. > > + * @async_shutdown_end: Called at shutdown-time to complete the shutdown > > + * process of the device. This entry point will be called > > + * only when the bus drive has indicated it would like to > > + * participate in the asynchronous shutdown completion. > > I'm going to repeat my point here, but only once. > > I see no reason to do async shutdown this way, instead of adding a > flag for drivers to opt in for calling their existing shutdown > callbacks asynchronously, in analogy with the async suspend and resume > implementation. There's a lot of code here that mere mortals like myself don't understand very well, so here's my meager understanding of how async suspend works and what you're suggesting to make this a little more concrete. Devices have this async_suspend bit: struct device { struct dev_pm_info { unsigned int async_suspend:1; Drivers call device_enable_async_suspend() to set async_suspend if they want it. The system suspend path is something like this: suspend_enter dpm_suspend_noirq(PMSG_SUSPEND) dpm_noirq_suspend_devices(PMSG_SUSPEND) pm_transition = PMSG_SUSPEND while (!list_empty(&dpm_late_early_list)) device_suspend_noirq(dev) dpm_async_fn(dev, async_suspend_noirq) if (is_async(dev)) async_schedule_dev(async_suspend_noirq) # async path async_suspend_noirq # called asynchronously __device_suspend_noirq(dev, PMSG_SUSPEND, true) callback = pm_noirq_op(PMSG_SUSPEND) # .suspend_noirq() dpm_run_callback(callback) # async call __device_suspend_noirq(dev, pm_transition, false) # sync path callback = pm_noirq_op(PMSG_SUSPEND) # .suspend_noirq() dpm_run_callback(callback) # sync call async_synchronize_full # wait If a driver has called device_enable_async_suspend(), we'll use the async_schedule_dev() path to schedule the appropriate .suspend_noirq() method. After scheduling it via the async path or directly calling it via the sync path, the async_synchronize_full() waits for completion of all the async methods. I assume your suggestion is to do something like this: struct device { struct dev_pm_info { unsigned int async_suspend:1; + unsigned int async_shutdown:1; + void device_enable_async_shutdown(struct device *dev) + dev->power.async_shutdown = true; device_shutdown while (!list_empty(&devices_kset->list)) - dev->...->shutdown() + if (is_async_shutdown(dev)) + async_schedule_dev(async_shutdown) # async path + + async_shutdown # called asynchronously + dev->...->shutdown() + + else + dev->...->shutdown() # sync path + + async_synchronize_full # wait