Date: Wed, 14 Jun 2023 10:26:20 -0400
From: Alan Stern
To: Damien Le Moal
Cc: Hannes Reinecke, Joe Breuer, Bart Van Assche, Bagas Sanjaya,
    Pavel Machek, "Rafael J. Wysocki", Len Brown, Greg Kroah-Hartman,
    Kees Cook, Tony Luck, "Guilherme G. Piccoli", Thorsten Leemhuis,
    "James E.J. Bottomley", "Martin K. Petersen", Phillip Potter,
    Linux Power Management, Linux Kernel Mailing List, Linux Hardening,
    Linux Regressions, Linux SCSI, Dan Williams, Hannes Reinecke,
    Adrian Hunter, Martin Kepplinger, Kai-Heng Feng
Subject: Re: Fwd: Waking up from resume locks up on sr device
Message-ID: <859f0eda-4984-4489-9851-c9f6ec454a88@rowland.harvard.edu>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Jun 14, 2023 at 04:35:50PM +0900, Damien Le Moal wrote:
> On 6/14/23 15:57, Hannes Reinecke wrote:
> > On 6/14/23 06:49, Damien Le Moal wrote:
> >> On 6/11/23 18:05, Joe Breuer wrote:
> >>> I'm the reporter of this issue.
> >>>
> >>> I just tried this patch against 6.3.4, and it completely fixes my
> >>> suspend/resume issue.
> >>>
> >>> The optical drive stays usable after resume, even suspending/resuming
> >>> during playback of CDDA content works flawlessly and playback resumes
> >>> seamlessly after system resume.
> >>>
> >>> So, from my perspective: Good one!
> >>
> >> In place of Bart's fix, could you please try this patch ?
> >>
> >> diff --git a/drivers/ata/libata-eh.c b/drivers/ata/libata-eh.c
> >> index b80e68000dd3..a81eb4f882ab 100644
> >> --- a/drivers/ata/libata-eh.c
> >> +++ b/drivers/ata/libata-eh.c
> >> @@ -4006,9 +4006,32 @@ static void ata_eh_handle_port_resume(struct ata_port *ap)
> >>          /* tell ACPI that we're resuming */
> >>          ata_acpi_on_resume(ap);
> >>
> >> -       /* update the flags */
> >>          spin_lock_irqsave(ap->lock, flags);
> >> +
> >> +       /* Update the flags */
> >>          ap->pflags &= ~(ATA_PFLAG_PM_PENDING | ATA_PFLAG_SUSPENDED);
> >> +
> >> +       /*
> >> +        * Resuming the port will trigger a rescan of the ATA device(s)
> >> +        * connected to it. Before scheduling the rescan, make sure that
> >> +        * the associated scsi device(s) are fully resumed as well.
> >> +        */
> >> +       ata_for_each_link(link, ap, HOST_FIRST) {
> >> +               ata_for_each_dev(dev, link, ENABLED) {
> >> +                       struct scsi_device *sdev = dev->sdev;
> >> +
> >> +                       if (!sdev)
> >> +                               continue;
> >> +                       if (scsi_device_get(sdev))
> >> +                               continue;
> >> +
> >> +                       spin_unlock_irqrestore(ap->lock, flags);
> >> +                       device_pm_wait_for_dev(&ap->tdev,
> >> +                                              &sdev->sdev_gendev);
> >> +                       scsi_device_put(sdev);
> >> +                       spin_lock_irqsave(ap->lock, flags);
> >> +               }
> >> +       }
> >>          spin_unlock_irqrestore(ap->lock, flags);
> >>   }
> >>   #endif /* CONFIG_PM */
> >>
> >> Thanks !
> >>
> > Well; not sure if that'll work out.
> > The whole reason why we initiate a rescan is that we need to check if the
> > ports are still connected, and whether the devices react.
> > So we can't iterate the ports here as this is the very thing which gets
> > checked during EH.
>
> Hmmm... Right. So we need to move that loop into ata_scsi_dev_rescan(),
> which itself already loops over the port devices anyway.
>
> > We really should claim resume to be finished as soon as we can talk with
> > the HBA, and kick off EH asynchronously to let it finish the job after
> > resume has completed.
>
> That is what's done already:
>
> static int ata_port_pm_resume(struct device *dev)
> {
>         ata_port_resume_async(to_ata_port(dev), PMSG_RESUME);
>         pm_runtime_disable(dev);
>         pm_runtime_set_active(dev);
>         pm_runtime_enable(dev);
>         return 0;
> }
>
> EH is kicked by ata_port_resume_async() -> ata_port_request_pm() and it
> is async. There is no synchronization in EH with the PM side though. We
> probably should have EH check that the port resume is done first, which
> can be done in ata_eh_handle_port_resume() since that is the first thing
> done when entering EH.
>
> The problem remains though that we *must* wait for the scsi device
> resume to be done before calling scsi_rescan_device(), which is done
> asynchronously from EH, as a different work. So that one needs to wait
> for the scsi side resume to be done.
>
> I also thought of triggering the rescan from the scsi side, but since
> the resume may be asynchronous, we could end up triggering it with the
> ata side not yet resumed... That would only turn the problem around
> instead of solving it.

The order in which devices get resumed isn't arbitrary.  If the system is
set up not to use async suspends/resumes then the order is always the
same as the order in which the devices were originally registered (for
resume, that is -- suspend obviously takes place in the reverse order).

So if you're trying to perform an action that requires two devices to be
active, you must not do it in the resume handler for the device that was
registered first.  I don't know how the ATA and SCSI pieces interact
here, but regardless, this is a pretty strict requirement.

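To make the ordering rule above concrete, here is a small userspace
model.  It is only a sketch: struct model_dev, needs_both(), resume_all()
and run_model() are hypothetical names invented for this illustration,
not kernel interfaces, and the model covers only the non-async case in
which devices resume strictly in registration order.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Userspace model only: none of these names are kernel APIs. */
struct model_dev {
	const char *name;
	bool active;
	/* Optional action run at the end of this device's resume. */
	bool (*resume_action)(const struct model_dev *devs, size_t ndevs);
	bool action_ok;
};

/* Hypothetical action that needs devices 0 and 1 to both be active. */
static bool needs_both(const struct model_dev *devs, size_t ndevs)
{
	return ndevs >= 2 && devs[0].active && devs[1].active;
}

/* Synchronous resume: walk the devices in registration order. */
static void resume_all(struct model_dev *devs, size_t ndevs)
{
	for (size_t i = 0; i < ndevs; i++) {
		devs[i].active = true;
		if (devs[i].resume_action)
			devs[i].action_ok = devs[i].resume_action(devs, ndevs);
	}
}

/*
 * Attach the action to the resume handler of device hook_idx and
 * report whether both devices were active when it ran.
 */
static bool run_model(int hook_idx)
{
	struct model_dev devs[2] = {
		{ .name = "registered-first" },
		{ .name = "registered-second" },
	};

	assert(hook_idx == 0 || hook_idx == 1);
	devs[hook_idx].resume_action = needs_both;
	resume_all(devs, 2);
	return devs[hook_idx].action_ok;
}
```

In this model run_model(0) reports failure, because the second device
has not been resumed yet when the first device's handler runs, while
run_model(1) reports success.
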
It should be okay to perform the action in the resume handler for the
device that was registered second.  But if the two devices aren't in an
ancestor-descendant relationship then you also have to call
device_pm_wait_for_dev() (or use device links as Rafael mentioned) to
handle the async case properly.

> Or... Why the heck scsi_rescan_device() is calling device_lock() ? This
> is the only place in scsi code I can see that takes this lock. I suspect
> this is to serialize either rescans, or serialize with resume, or both.
> For serializing rescans, we can use another lock. For serializing with
> PM, we should wait for PM transitions...
> Something is not right here.

Here's what commit e27829dc92e5 ("scsi: serialize ->rescan against
->remove", written by Christoph Hellwig) says:

    Lock the device embedded in the scsi_device to protect against
    concurrent calls to ->remove.

That's the commit which added the device_lock() call.

Alan Stern