Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp1638383ybb; Thu, 26 Mar 2020 04:42:33 -0700 (PDT) X-Google-Smtp-Source: ADFU+vusm3QtkgE8ZNzJok77EIz94G3IgtnL+lUoFXLs4pz+9cUVulaFHL7CmpKiB0iLwrowH4xW X-Received: by 2002:aca:5444:: with SMTP id i65mr1351224oib.101.1585222953530; Thu, 26 Mar 2020 04:42:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1585222953; cv=none; d=google.com; s=arc-20160816; b=fs5dM4G1kzHgYfvvw3wF87nj99fl7o3z8Ckt0pTgEgFCMm6FYCdEk3xB1mH6wiHn7c 80XiJanwpMkZf0UlZi8hvDGWE/MVRy1Lb3aDasUNhmbkH/f1jGjbJvNIcSHFw1De2Xk2 leBFS9IY2tzhtlJ6bvHobafOdEekPzDQAJGE3DaAYLsVydtlAHJ4hJceDnem0TxV8Z7h DQPqK+ALte0qHvhIu9nmiIWTxGL7AE+XumkwyNr87sPv2u61ilmgNhL/JeeyW92epKB+ Ko2IwodEtpNpFV2zMMVTS0SXXgpVkYGsJb0Ohuo6erViaGcYaGjgKpCwnSkVNYGb12Tb eywg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=rTPMYZldMUBKMU5MuQH6COKbNKcnltyKYs/kin3qjqU=; b=lS/dlcWESDJncyAVarqfSWUrlkLuAH41DYFj58lp4UrIYFvIyd2qxm/jVwJtPa4V4l W6b0ZeMevOOSniwUQBEOHhEj+ucVgLmOZeqY5oYYa/4wACbSuPnzv55P6+5HIWAQtwcQ T4GX8TKG4El5cpvnGDf9LQdmkOPuBJvrTpXMRfgPdHOafG6Lo0ehbQj/NfnXLyTlzzqF vjzCwg0hrqCEtffL9wndDBrel0+v2V6whDAhQkQGPSYX5B9BtyvAMeYCdN/IafJYYGHn Ug1yiRYtZlkq9wiOEnSJr6iNGLAgvmZvOmTtNfDTtyGyPBxP0TwY4qa+Qy+p8+AM0UUW TG7Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id b20si921352otq.263.2020.03.26.04.42.20; Thu, 26 Mar 2020 04:42:33 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728150AbgCZLlj (ORCPT + 99 others); Thu, 26 Mar 2020 07:41:39 -0400 Received: from mx3.molgen.mpg.de ([141.14.17.11]:43797 "EHLO mx1.molgen.mpg.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1727994AbgCZLlj (ORCPT ); Thu, 26 Mar 2020 07:41:39 -0400 Received: from [192.168.0.2] (ip5f5af065.dynamic.kabel-deutschland.de [95.90.240.101]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) (Authenticated sender: pmenzel) by mx.molgen.mpg.de (Postfix) with ESMTPSA id 8FCEC206442F6; Thu, 26 Mar 2020 12:41:36 +0100 (CET) Subject: Re: [Intel-wired-lan] [PATCH] e1000e: bump up timeout to wait when ME un-configure ULP mode To: Kai-Heng Feng Cc: netdev@vger.kernel.org, linux-kernel@vger.kernel.org, intel-wired-lan@lists.osuosl.org, David Miller , Rex Tsai References: <20200323191639.48826-1-aaron.ma@canonical.com> <2c765c59-556e-266b-4d0d-a4602db94476@intel.com> <899895bc-fb88-a97d-a629-b514ceda296d@canonical.com> <750ad0ad-816a-5896-de2f-7e034d2a2508@intel.com> <60A8493D-811B-4AD5-A8D3-82054B562A8C@canonical.com> From: Paul Menzel Message-ID: <3895cb4d-cc9d-238a-0776-a1fd3a490664@molgen.mpg.de> Date: Thu, 26 Mar 2020 12:41:36 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.6.0 MIME-Version: 1.0 In-Reply-To: <60A8493D-811B-4AD5-A8D3-82054B562A8C@canonical.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Dear Kai-Heng, Am 26.03.20 um 12:29 schrieb Kai-Heng Feng: >> On Mar 25, 2020, at 23:49, Paul Menzel wrote: >> Am 25.03.20 um 14:58 schrieb Neftin, Sasha: >>> On 3/25/2020 08:43, Aaron Ma wrote: >> >>>> On 3/25/20 2:36 PM, Neftin, Sasha wrote: >>>>> On 3/25/2020 06:17, Kai-Heng Feng wrote: >> >>>>>>> On Mar 24, 2020, at 03:16, Aaron Ma wrote: >>>>>>> >>>>>>> ME takes 2+ seconds to un-configure ULP mode done after resume >>>>>>> from s2idle on some ThinkPad laptops. >>>>>>> Without enough wait, reset and re-init will fail with error. >>>>>> >>>>>> Thanks, this patch solves the issue. We can drop the DMI quirk in >>>>>> favor of this patch. >>>>>> >>>>>>> Fixes: f15bb6dde738cc8fa0 ("e1000e: Add support for S0ix") >>>>>>> BugLink: https://bugs.launchpad.net/bugs/1865570 >>>>>>> Signed-off-by: Aaron Ma >>>>>>> --- >>>>>>> drivers/net/ethernet/intel/e1000e/ich8lan.c | 4 ++-- >>>>>>> 1 file changed, 2 insertions(+), 2 deletions(-) >>>>>>> >>>>>>> diff --git a/drivers/net/ethernet/intel/e1000e/ich8lan.c >>>>>>> b/drivers/net/ethernet/intel/e1000e/ich8lan.c >>>>>>> index b4135c50e905..147b15a2f8b3 100644 >>>>>>> --- a/drivers/net/ethernet/intel/e1000e/ich8lan.c >>>>>>> +++ b/drivers/net/ethernet/intel/e1000e/ich8lan.c >>>>>>> @@ -1240,9 +1240,9 @@ static s32 e1000_disable_ulp_lpt_lp(struct >>>>>>> e1000_hw *hw, bool force) >>>>>>> ew32(H2ME, mac_reg); >>>>>>> } >>>>>>> >>>>>>> - /* Poll up to 300msec for ME to clear ULP_CFG_DONE. */ >>>>>>> + /* Poll up to 2.5sec for ME to clear ULP_CFG_DONE. */ >>>>>>> while (er32(FWSM) & E1000_FWSM_ULP_CFG_DONE) { >>>>>>> - if (i++ == 30) { >>>>>>> + if (i++ == 250) { >>>>>>> ret_val = -E1000_ERR_PHY; >>>>>>> goto out; >>>>>>> } >>>>>> >>>>>> The return value was not caught by the caller, so the error ends up >>>>>> unnoticed. >>>>>> Maybe let the caller check the return value of >>>>>> e1000_disable_ulp_lpt_lp()? >> >>>>> I a bit confused. In our previous conversation you told ME not running. >>>>> let me shimming in. Increasing delay won't be solve the problem and just >>>>> mask it. We need to understand why ME take too much time. What is >>>>> problem with this specific system? >>>>> So, basically no ME system should works for you. >>>> >>>> Some laptops ME work that's why only reproduce issue on some laptops. >>>> In this issue i219 is controlled by ME. >>> >>> Who can explain - why ME required too much time on this system? >>> Probably need work with ME/BIOS vendor and understand it. >>> Delay will just mask the real problem - we need to find root cause. >>>> Quirk is only for 1 model type. But issue is reproduced by more models. >>>> So it won't work. >> >> (Where is Aaron’s reply? It wasn’t delivered to me yet.) >> >> As this is clearly a regression, please revert the commit for now, >> and then find a way to correctly implement S0ix support. Linux’ >> regression policy demands that as no fix has been found since >> v5.5-rc1. Changing Intel ME settings, even if it would work around >> the issue, is not an acceptable solution. Delaying the resume time >> is also not a solution. > > The s0ix patch can reduce power consumption, finally makes S2idle an > acceptable sleep method. So I'd say it's a fix, albeit a regression > was introduced. > >> Regarding Intel Management Engine, only Intel knows what it does >> and what the error is, as the ME firmware is proprietary and >> closed. >> >> Lastly, there is no way to fully disable the Intel Management >> Engine. The HAP stuff claims to stop the Intel ME execution, but >> nobody knows, if it’s successful. >> >> Aaron, Kai-Hang, can you send the revert? > > I consider that as an important fix for s2idle, I don't think > reverting is appropriate. If there is a fix with no other regression, I agree. But there has not been one, so please revert. It doesn’t matter if the commit introducing the regression fixes something else. It gets too complicated to decide, which regression is worth it or not. The Linux kernel guarantees its users, they can update any time without breaking user space (in this case suspend/resume). Please read Linus’ several messages about that. His message [1] exactly addresses your arguments. > Yeah, HELL NO! > > Guess what? You're wrong. YOU ARE MISSING THE #1 KERNEL RULE. > > We do not regress, and we do not regress exactly because your are 100% wrong. > > And the reason you state for your opinion is in fact exactly *WHY* you > are wrong. > > Your "good reasons" are pure and utter garbage. > > The whole point of "we do not regress" is so that people can upgrade > the kernel and never have to worry about it. > >> Kernel had a bug which has been fixed > > That is *ENTIRELY* immaterial. > > Guys, whether something was buggy or not DOES NOT MATTER. So, please (also Intel developers), please adhere to this rule, and revert the commit. Kind regards, Paul [1]: https://lkml.org/lkml/2018/8/3/621