Subject: Re: [PATCH v4 06/10] scsi: ufs: Remove host_sem used in suspend/resume
From: Bart Van Assche
To: Can Guo
Cc: Adrian Hunter, asutoshd@codeaurora.org, nguyenb@codeaurora.org,
    hongwus@codeaurora.org, ziqichen@codeaurora.org,
    linux-scsi@vger.kernel.org, kernel-team@android.com, Alim Akhtar,
    Avri Altman, "James E.J. Bottomley", "Martin K. Petersen",
    Stanley Chu, Bean Huo, Jaegeuk Kim, open list
Date: Tue, 29 Jun 2021 11:01:52 -0700
In-Reply-To: <60a5496863100976b74d8c376c9e9cb0@codeaurora.org>

On 6/28/21 11:23 PM, Can Guo wrote:
> On 2021-06-29 01:31, Bart Van Assche wrote:
>> On 6/28/21 1:17 AM, Can Guo wrote:
>>> On 2021-06-25 01:11, Bart Van Assche wrote:
>>>> On 6/23/21 11:31 PM, Can Guo wrote:
>>>>> Using back host_sem in suspend_prepare()/resume_complete()
>>>>> won't have this problem of deadlock, right?
>>>>
>>>> Although that would solve the deadlock discussed in this email
>>>> thread, it wouldn't solve the issue of potential adverse
>>>> interactions of the UFS error handler and the SCSI error
>>>> handler running concurrently.
>>>
>>> I think I've explained it before, paste it here -
>>>
>>> ufshcd_eh_host_reset_handler() invokes ufshcd_err_handler() and
>>> flushes it, so SCSI error handler and UFS error handler can
>>> safely run together.
>>
>> That code path is the exception. Do you agree that the following
>> three functions all invoke the ufshcd_err_handler() function
>> asynchronously?
>> * ufshcd_uic_pwr_ctrl()
>> * ufshcd_check_errors()
>> * ufshcd_abort()
>
> I agree, but I don't see what's wrong with that. Any context can
> invoke ufs error handler asynchronously and ufs error handler prepare
> makes sure error handler can work safely, i.e., stopping PM
> ops/gating/scaling in error handler prepare makes sure no one shall
> call ufshcd_uic_pwr_ctrl() ever again. And ufshcd_check_errors() and
> ufshcd_abort() are OK to run concurrently with UFS error handler.

The current UFS error handling approach requires the following code in
ufshcd_queuecommand():

	if (hba->pm_op_in_progress) {
		hba->force_reset = true;
		set_host_byte(cmd, DID_BAD_TARGET);
		cmd->scsi_done(cmd);
		goto out;
	}

Removing that code is not possible with the current error handling
approach. My patch makes it possible to remove that code.

> Sorry that I missed the change of scsi_transport_template() in your
> previous message. I can understand that you want to invoke UFS error
> hander by invoking SCSI error handler, but I didn't go that far
> because I saw you changed pm_runtime_get_sync() to
> pm_runtime_get_noresume() in ufs error handler prepare. How can that
> change make sure that the device is not suspending or resuming while
> error handler is running?

UFS power state transitions happen by submitting a SCSI command to a
WLUN. The SCSI error handler is only activated after all outstanding
SCSI commands for a SCSI host have failed or completed. I think this
guarantees for the UFS driver that eh_strategy_handler is not invoked
while a command submitted to a WLUN is changing the power state of the
UFS device.
The following code from scsi_error.c only wakes up the error handler if
(shost->host_failed || shost->host_eh_scheduled) &&
shost->host_failed == scsi_host_busy(shost):

	if ((shost->host_failed == 0 && shost->host_eh_scheduled == 0) ||
	    shost->host_failed != scsi_host_busy(shost)) {
		schedule();
		continue;
	}

	/* Handle SCSI errors */

Thanks,

Bart.
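
For readers who want to check the wake-up condition quoted above without reading scsi_error.c, here is a minimal user-space sketch of the same predicate. It is only an illustration: struct eh_model and eh_model_should_run() are hypothetical names that do not exist in the kernel, and the host_busy field merely stands in for the value scsi_host_busy() would return.

```c
#include <stdbool.h>

/*
 * Hypothetical user-space stand-in for the three values the quoted
 * check reads. This is not the kernel's struct Scsi_Host.
 */
struct eh_model {
	unsigned int host_failed;       /* commands that have failed */
	unsigned int host_eh_scheduled; /* explicit requests to run the handler */
	unsigned int host_busy;         /* models the scsi_host_busy() result */
};

/*
 * Negation of the quoted sleep condition: returns true when the error
 * handler thread would stop sleeping and start handling errors.
 */
static bool eh_model_should_run(const struct eh_model *m)
{
	bool work_pending = m->host_failed != 0 || m->host_eh_scheduled != 0;
	bool all_accounted = m->host_failed == m->host_busy;

	return work_pending && all_accounted;
}
```

With host_failed = 1 and host_busy = 3 the function returns false, which models the case where two commands are still outstanding, so the handler keeps sleeping. It returns true only once every busy command is counted as failed, or once a handler run has been scheduled and nothing is busy.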