Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp522832iob; Wed, 18 May 2022 07:18:30 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzEAwTlsAMxpHdCO0e/VpdXUccCXkJHuF7fUPF9Uaynq4mAYjjCJd2tkwHzLPLuMWiBGioM X-Received: by 2002:a63:c22:0:b0:3f2:638f:c203 with SMTP id b34-20020a630c22000000b003f2638fc203mr14670767pgl.215.1652883510538; Wed, 18 May 2022 07:18:30 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1652883510; cv=none; d=google.com; s=arc-20160816; b=l8JR1Q3oOpWvcioGUTffEr2+fUPAG/wG3y8gEX+Jy9I3HNJ3T2Zk2GxIWDCJl9coVf vxSDm8pYNyey1mMhivPzMi0yQv9BjR/d6J5QL++DvyccMByqHEvZrqzJHNODC46cnLn8 sIZQ/koASXpBPm+P0v03MMV1YjD0AdbJUMHSrUaN1QLPRXeDu8eeu4AycgqkG0NFCjdz FGqyw2H8F7MHm5wzMZTBBTjHSjGyBL7erotXvkwX7dLvUEFtYMleILH0h0tcwZtcR2qL NOUlXi/3aaU4CC5jbBN2QiixIgs9WkZW5PYOCjlmMLL/I6W8wTtZ52gcKTOFsz5Xykl4 XhSg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:date:cc:to:from:subject:message-id :dkim-signature; bh=0a/RDuNOknjNIwmQoqSxBeD/S3QhXnWttTsKefUznjo=; b=NcVnR7OGTasbYfHYz7PBIXzsN0keWPw0iWkRcrRm0EGlIe3j3U9fzb9Wb22p4brm32 4ACNg6dfhaw7cqq3FuT+4ZUAxYAsT9A7FGCBbFkCYSSyGhSfbajbfI0oIQDJLVZJ2aXz zCELdMUZLhCF02CHhyQrWzU6ZM5jO0SN9X7NpFqq6TLH8HPtGXbHqNJmUjtmizfnQZgO YMIUbqGq8kXh94STZYEJeQHsfDzlaKhSyzwekQ0w8HGlXAVL5jg2/xOgjb0/NER3CS81 MlBGxV3SCjObQOEUsFwxAdC2Lu/IL4FpxFxtqTmZnzcx3NvjKFAJNmHe2TvNeIeqEI9c XA7Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=n514XNfY; spf=pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-wireless-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id g1-20020a056a001a0100b0050d2da2e380si3469573pfv.47.2022.05.18.07.18.29 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 18 May 2022 07:18:30 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=n514XNfY; spf=pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-wireless-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 41DEE1A490C; Wed, 18 May 2022 07:11:34 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238523AbiEROLb (ORCPT + 71 others); Wed, 18 May 2022 10:11:31 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:33662 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238571AbiEROLN (ORCPT ); Wed, 18 May 2022 10:11:13 -0400 Received: from mga07.intel.com (mga07.intel.com [134.134.136.100]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3210DBBE; Wed, 18 May 2022 07:11:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1652883072; x=1684419072; h=message-id:subject:from:to:cc:date:in-reply-to: references:mime-version:content-transfer-encoding; bh=M9I9fQ/4b004eaPqCWILxpXuWE4PMN1HRcRIEnUtL7Q=; b=n514XNfYivaWbi+UJ89rna5c1o82JIWysC5uMZJVpkK5OwE05ngw2hfa sbW5mHvaEtdjKEn+3MZLa8YrzOXrXgNkbBJggwB9LhD7T2Cy+SrjhKJti QFOcWV/ml9PS0SjdGrJxW8D/++8U53lwJde969fku66YWvLjSM09hRYyN Kq4veATBf/barhwA0NVtfN8QzpD22aQj8WBN6tuWgAjOTYhTrOww4lQJ3 LE7jerr+OCg7hfXNn5GRtyK0BUQpSvUGkJQhbwobqs0DmoUpEoJAeOYtK VF8ZdJlMnzCB/tTRpdUI6iDVd1r/6FBHsrHrkaAt85tQqYpZnh9QFo5CE g==; X-IronPort-AV: E=McAfee;i="6400,9594,10350"; a="334735484" X-IronPort-AV: E=Sophos;i="5.91,235,1647327600"; d="scan'208";a="334735484" Received: from orsmga007.jf.intel.com ([10.7.209.58]) by orsmga105.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 18 May 2022 07:11:11 -0700 X-IronPort-AV: E=Sophos;i="5.91,235,1647327600"; d="scan'208";a="569487380" Received: from zhenyan1-mobl1.ccr.corp.intel.com ([10.249.171.228]) by orsmga007-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 18 May 2022 07:11:08 -0700 Message-ID: Subject: Re: [PATCH 0/7] PM: Solution for S0ix failure caused by PCH overheating From: Zhang Rui To: "Rafael J. Wysocki" Cc: "Rafael J. Wysocki" , kvalo@kernel.org, Alexandre Belloni , Linux PM , ACPI Devel Maling List , linux-rtc@vger.kernel.org, "open list:NETWORKING DRIVERS (WIRELESS)" , Daniel Lezcano , merez@codeaurora.org, mat.jonczyk@o2.pl, Sumeet Pawnikar , Len Brown Date: Wed, 18 May 2022 22:11:05 +0800 In-Reply-To: References: <20220505015814.3727692-1-rui.zhang@intel.com> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.28.5-0ubuntu0.18.04.1 Mime-Version: 1.0 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-2.3 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-wireless@vger.kernel.org Hi, Rafael, On Tue, 2022-05-17 at 17:11 +0200, Rafael J. Wysocki wrote: > On Thu, May 5, 2022 at 3:58 AM Zhang Rui wrote: > > > > On some Intel client platforms like SKL/KBL/CNL/CML, there is a > > PCH thermal sensor that monitors the PCH temperature and blocks the > > system > > from entering S0ix in case it overheats. > > > > Commit ef63b043ac86 ("thermal: intel: pch: fix S0ix failure due to > > PCH > > temperature above threshold") introduces a delay loop to cool the > > temperature down for this purpose. > > > > However, in practice, we found that the time it takes to cool the > > PCH down > > below threshold highly depends on the initial PCH temperature when > > the > > delay starts, as well as the ambient temperature. > > > > For example, on a Dell XPS 9360 laptop, the problem can be > > triggered > > 1. when it is suspended with heavy workload running. > > or > > 2. when it is moved from New Hampshire to Florida. > > > > In these cases, the 1 second delay is not sufficient. As a result, > > the > > system stays in a shallower power state like PCx instead of S0ix, > > and > > drains the battery power, without user' notice. > > > > In this patch series, we first fix the problem in patch 1/7 ~ 3/7, > > by > > 1. expand the default overall cooling delay timeout to 60 seconds. > > 2. make sure the temperature is below threshold rather than equal > > to it. > > 3. move the delay to .suspend_noirq phase instead, in order to > > a) do the cooling when the system is in a more quiescent state > > b) be aware of wakeup events during the long delay, because some > > wakeup > > events (ACPI Power button Press, USB mouse, etc) become valid > > only > > in .suspend_noirq phase and later. > > > > However, this potential long delay introduces a problem to our > > suspend > > stress automation test, because the delay makes it hard to predict > > how > > much time it takes to suspend the system. > > As we want to do as much suspend iterations as possible in limited > > time, > > setting a 60+ seconds rtc alarm for suspend which usually takes > > shorter > > than 1 second is far beyond overkill. > > > > Thus, in patch 4/7 ~ 7/7, a rtc driver hook is introduced, which > > cancels > > the armed rtc alarm in the beginning of suspend and then rearm the > > rtc > > alarm with a short interval (say, 2 second) right before system > > suspended. > > > > By running > > # echo 2 > /sys/module/rtc_cmos/parameters/rtc_wake_override_sec > > before suspend, the system can be resumed by RTC alarm right after > > it is > > suspended, no matter how much time the suspend really takes. > > > > This patch series has been tested on the same Dell XPS 9360 laptop > > and > > S0ix is 100% achieved across 1000+ s2idle iterations. > > Overall, the first three patches in the series can go in without the > rest, so let's put them into a separate series. > > Patch [4/7] doesn't depend on the first three ones, so it can go in > by itself. > > Patch [5/7] is to be dropped anyway as per the earlier discussion. > > Patch [6/7] is only needed to apply patch [7/7] which is > controversial. > > I think that we can drop or defer patches [6-7/7] for now. This all sounds reasonable to me. I will resend them separately. -rui