Received: by 2002:a05:6a10:1d13:0:0:0:0 with SMTP id pp19csp2086372pxb; Mon, 23 Aug 2021 11:45:33 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzyYEs0CZR0/Q+3c7ooi2mvuMbDzjE68opYdN3XX3KnUHljjGfEgZxCn0WFKM0FqB9wyvpT X-Received: by 2002:a5d:8b04:: with SMTP id k4mr19443133ion.58.1629744333263; Mon, 23 Aug 2021 11:45:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1629744333; cv=none; d=google.com; s=arc-20160816; b=L/fY+RX5PENXEiyBBi1uU3hzwBjS3deEZMQOYOxSo2D1Glj5pRLk2ATLJ1WVfEoF30 RYlcmO6Yg5KPblY5p+NpkHYYX7cLi/aqdlcXPM9O6x3voNWXFCApz4gmJdbpQp8RImD5 ppUveZdsCgdIfcCa+BsbepX40RCosdqqMk8W6GTjUfRkzLSlH2IC42mOfLHB5muIzMm0 uyAulcVXEc6XfFAJp/QRggHSiDmNwUSWi2BTLt9X7wEUk/GbAhh/9l30dPyAtzq2kEHe zmWzkxEned5q/cXFR1p2sua0QQ0SaBLhqZF8IZhd1sRKBga5Gv7PZslG6tCgHDFPtx3H pZxQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:dkim-signature; bh=+t65oQl7xIh0+ZYaiXd695eCAW8Gdp2vyyBE6uSk+fM=; b=b8jXmIHc9dq7vsn+QKyeJV18QMq3Qwr622d8pfY2mJdlEb+9ikevAtjN0cB3Ijb3IV fX4HcJCANZuemRYES6ubvu0YIOkvnuTNVEkNTnGXvipmLD+DYF3hw4MgowNZNI/8FTBr rxUiMeD6Zto+FEU+haiv3w7cK8AaDYllMvaflsQCRGv3LQ+f3058xJ2X+aNv9NHZ/GtJ Wz3Xj/qmTdxFQdz/9wT/6HeySIgeslToU30rP8fvolnD1pM26jyOIWHdDICckX5f1R21 QLQdwxyDLPlfOCBFU/Q00gttazm69fcrIZcQmXoidkulZKd6aTbXbxoadvBJ8ni+hhA7 OM8g== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@quicinc.com header.s=qcdkim header.b=RWVf9gj1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id h14si14734782ile.129.2021.08.23.11.45.21; Mon, 23 Aug 2021 11:45:33 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=fail header.i=@quicinc.com header.s=qcdkim header.b=RWVf9gj1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230332AbhHWSoS (ORCPT + 99 others); Mon, 23 Aug 2021 14:44:18 -0400 Received: from alexa-out.qualcomm.com ([129.46.98.28]:62565 "EHLO alexa-out.qualcomm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229962AbhHWSoR (ORCPT ); Mon, 23 Aug 2021 14:44:17 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=quicinc.com; i=@quicinc.com; q=dns/txt; s=qcdkim; t=1629744215; x=1661280215; h=subject:to:cc:references:from:message-id:date: mime-version:in-reply-to:content-transfer-encoding; bh=l3BuDyALSNXMVLCHv9r8ynYhlNyHsnWaTDrJJMRqkE0=; b=RWVf9gj1dN7RHb23tFPygRUIeSdJDoseISUrWguX4Pp5OgPmnlUDkDa6 W+PFmBLj6tbiSnS2vJlPSgfmKcXV8i4c452EYwjeVvfZBje7JPTkdNDAx 6SuOzvv4iOEVEp+TlK8g/TOG1fueOH30dQtP65v9rPekiEPFuu0RTJone g=; Received: from ironmsg07-lv.qualcomm.com ([10.47.202.151]) by alexa-out.qualcomm.com with ESMTP; 23 Aug 2021 11:43:33 -0700 X-QCInternal: smtphost Received: from nalasex01a.na.qualcomm.com ([10.47.209.196]) by ironmsg07-lv.qualcomm.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 23 Aug 2021 11:43:33 -0700 Received: from [10.226.59.216] (10.80.80.8) by nalasex01a.na.qualcomm.com (10.47.209.196) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384) id 15.2.922.7; Mon, 23 Aug 2021 11:43:31 -0700 Subject: Re: [PATCH v6 3/4] bus: mhi: core: Process execution environment changes serially To: Bhaumik Bhatt , CC: , , , , , References: <1614208985-20851-1-git-send-email-bbhatt@codeaurora.org> <1614208985-20851-4-git-send-email-bbhatt@codeaurora.org> From: Jeffrey Hugo Message-ID: <899da888-321a-c228-8537-b72821700dc7@quicinc.com> Date: Mon, 23 Aug 2021 12:43:31 -0600 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.11.0 MIME-Version: 1.0 In-Reply-To: <1614208985-20851-4-git-send-email-bbhatt@codeaurora.org> Content-Type: text/plain; charset="utf-8"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.80.80.8] X-ClientProxiedBy: nasanexm03f.na.qualcomm.com (10.85.0.47) To nalasex01a.na.qualcomm.com (10.47.209.196) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2/24/2021 4:23 PM, Bhaumik Bhatt wrote: > In current design, whenever the BHI interrupt is fired, the > execution environment is updated. This can cause race conditions > and impede ongoing power up/down processing. For example, if a > power down is in progress, MHI host updates to a local "disabled" > execution environment. If a BHI interrupt fires later, that value > gets replaced with one from the BHI EE register. This impacts the > controller as it does not expect multiple RDDM execution > environment change status callbacks as an example. Another issue > would be that the device can enter mission mode and the execution > environment is updated, while device creation for SBL channels is > still going on due to slower PM state worker thread run, leading > to multiple attempts at opening the same channel. > > Ensure that EE changes are handled only from appropriate places > and occur one after another and handle only PBL modes or RDDM EE > changes as critical events directly from the interrupt handler. > Simplify handling by waiting for SYS ERROR before handling RDDM. > This also makes sure that we use the correct execution environment > to notify the controller driver when the device resets to one of > the PBL execution environments. > > Signed-off-by: Bhaumik Bhatt > @@ -452,27 +451,30 @@ irqreturn_t mhi_intvec_threaded_handler(int irq_number, void *priv) > } > write_unlock_irq(&mhi_cntrl->pm_lock); > > - /* If device supports RDDM don't bother processing SYS error */ > - if (mhi_cntrl->rddm_image) { > - /* host may be performing a device power down already */ > - if (!mhi_is_active(mhi_cntrl)) > - goto exit_intvec; > + if (pm_state != MHI_PM_SYS_ERR_DETECT || ee == mhi_cntrl->ee) > + goto exit_intvec; > > - if (mhi_cntrl->ee == MHI_EE_RDDM && mhi_cntrl->ee != ee) { > + switch (ee) { > + case MHI_EE_RDDM: > + /* proceed if power down is not already in progress */ > + if (mhi_cntrl->rddm_image && mhi_is_active(mhi_cntrl)) { > mhi_cntrl->status_cb(mhi_cntrl, MHI_CB_EE_RDDM); > + mhi_cntrl->ee = ee; > wake_up_all(&mhi_cntrl->state_event); > } > - goto exit_intvec; > - } > - > - if (pm_state == MHI_PM_SYS_ERR_DETECT) { > + break; > + case MHI_EE_PBL: > + case MHI_EE_EDL: > + case MHI_EE_PTHRU: > + mhi_cntrl->status_cb(mhi_cntrl, MHI_CB_FATAL_ERROR); > + mhi_cntrl->ee = ee; > wake_up_all(&mhi_cntrl->state_event); > - > - /* For fatal errors, we let controller decide next step */ > - if (MHI_IN_PBL(ee)) > - mhi_cntrl->status_cb(mhi_cntrl, MHI_CB_FATAL_ERROR); > - else > - mhi_pm_sys_err_handler(mhi_cntrl); > + mhi_pm_sys_err_handler(mhi_cntrl); > + break; > + default: > + wake_up_all(&mhi_cntrl->state_event); > + mhi_pm_sys_err_handler(mhi_cntrl); > + break; > } Bhaumik, can you explain the above change? Before this patch (which is now committed), if there was a fatal error, the controller was notified (MHI_CB_FATAL_ERROR) and it decided all action. After this patch, the controller is notified, but also the core attempts to handle the syserr. This is a change in behavior, and seems to make a mess of the controller, and possibly the core fighting each other. Specifically, I'm rebasing the AIC100 driver onto 5.13, which has this change, and I'm seeing a serious regression. I'm thinking that for the PBL/EDL/PTHRU case, mhi_pm_sys_err_handler() should not be called. Thoughts?