Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp204815pxj; Fri, 28 May 2021 01:50:06 -0700 (PDT) X-Google-Smtp-Source: ABdhPJx8D133OzYThkPwkzhlf996n15cH+KXFqtXd+ifAf37oB4O9gEJgrAT4/lrsoqChK1mhZMF X-Received: by 2002:a17:906:6dd0:: with SMTP id j16mr8086898ejt.208.1622191806052; Fri, 28 May 2021 01:50:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1622191806; cv=none; d=google.com; s=arc-20160816; b=tjbgVqoRFBsJvtv2YeIqFmSEfZrDKbw+rsG1b3L0X5IhGZ2gxUQ2+fEQycuAdDS9gO 84HN0d1SuJiL/LReOQ7/aRdEJCQbXqDT7PRxEgRQW5vrFWTSCNfrKWWu2kTh+iblARpf U7QK+AS8z/9fs4cVjJHDQDMma/duk2Ct6MjlmC36HoQ7+yTV+CUERlsF54g/7y2b7X8I rrEzA6d+/hOzTkb0t6iyat22n21H2ttOxE3Ssso3RyOBDZa7KPixNYqfSGVuJ10wVx4+ Y9YATfEuABMY3rRp79Q4YEMQVSXlrumQtV2sjzQx1fZ4bYjumxJQALIm5xLZszvNysmE B5tg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:ironport-sdr:ironport-sdr; bh=aSEnawRkRJDathk1EiG4JA+vVvF31GpAQqEc8Gv5xic=; b=WG/giSFlYia4RQEuAx5G0wTLuT9/6SQDtq0h16qRpXCFfES6E46hfKiVSot17N0HKL byiF0Xj6UYUbWgg1FFcLYxNyR8RNeRB4cRUQRK6hlZauMfGndc+ycBX2rk65PAEAPAEe YPJOUDY2iPn4WabGYn9spmWqAx5H3gvhZCEb1FnlWiFJjZ4CGqqPecGS9ExPjrRuq7X3 025MLgLxjNCKE1oXH6q5eCyx0NSa0J+BLjxVeU+gdr4o3n/VIpZsDHzcr/fV1qzfr980 E5IeMLKNHfXkRLLE5zB2W4guSEJFi2qeXsCw63D3yOF2UKudK3UvgRm89JY+OoaIkjBM OQeQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id bo20si4540497edb.524.2021.05.28.01.49.43; Fri, 28 May 2021 01:50:06 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236305AbhE1DRi (ORCPT + 99 others); Thu, 27 May 2021 23:17:38 -0400 Received: from mga03.intel.com ([134.134.136.65]:17457 "EHLO mga03.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236023AbhE1DRh (ORCPT ); Thu, 27 May 2021 23:17:37 -0400 IronPort-SDR: 1uJ5F7lVWGag/1RLKtoapE8wfXrusWc2xGJPTO79sjWmgXgAWtwfSry9vPJ6YXqu1o66uvRVR4 dQDXIIY/ZuAg== X-IronPort-AV: E=McAfee;i="6200,9189,9997"; a="202919618" X-IronPort-AV: E=Sophos;i="5.83,228,1616482800"; d="scan'208";a="202919618" Received: from orsmga002.jf.intel.com ([10.7.209.21]) by orsmga103.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 27 May 2021 20:16:02 -0700 IronPort-SDR: k6k4oW7BOPAJL29GdrGA0ZY1pJfaCwU0lhxI3HU5CnHbksoAbL/i9IVjxtV7O6vwH9Q1eRDj2N p/Rzc6PdANBw== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.83,228,1616482800"; d="scan'208";a="415155080" Received: from chenyu-desktop.sh.intel.com ([10.239.158.131]) by orsmga002.jf.intel.com with ESMTP; 27 May 2021 20:15:56 -0700 From: Chen Yu To: linux-pm@vger.kernel.org Cc: "Rafael J. Wysocki" , Len Brown , Artem Bityutskiy , Zhang Rui , linux-kernel@vger.kernel.org, Chen Yu Subject: [PATCH][v2] intel_idle: Adjust the SKX C6 latency and residency if PC6 is disabled Date: Fri, 28 May 2021 11:20:54 +0800 Message-Id: <20210528032054.7572-1-yu.c.chen@intel.com> X-Mailer: git-send-email 2.25.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Currently cpuidle assumes worst-case C-state parameters, and so C6 is described with PC6 parameters, which is worst case for requesting CC6. When PC6 is enabled, this is appropriate. But if PC6 is disabled in BIOS, the exit latency and target_residency should be adjusted accordingly. Exit latency: Previously the C6 exit latency was measured when woken up from CC6/PC6. With PC6 disabled, the C6 exit latency should be CC6/PC0. Target residency: With PC6 disabled, idle duration within [CC6, PC6) would make the idle governor choose C1E over C6. This would cause low energy-efficiency. We should lower the bar to request C6 when PC6 is disabled. To fill this gap, check if PC6 is disabled in the BIOS in the MSR_PKG_CST_CONFIG_CONTROL(0xe2). If so, use CC6/PC0 parameters as the new exit latency. Meanwhile, update target_residency to 3 times of the new exit latency. This is consistent with how intel_idle driver uses _CST to calculate the target_residency. The consequence is that, the OS would be more offen to choose C6 over C1E when PC6 is disabled. This is reasonable because if the user is using C6, it implies that the user cares about energy, so choosing C6 more frequently is in accordance with user requirement. The new exit latency of CC6/PC0 92us was from wult[1] result on SKX, which was measured via NIC wakeup from 99.99th latency. Besides SKX, the CLX and CPX both have the same CPU model number. And since they have similar CC6 exit latency to SKX, 96us and 89us respectively, reuse the value of SKX. There is concern that if we should introduce a more generic solution rather than optimizing on each platforms. However consider the code complexity and different PC6 bit interpretation on different platforms, tune the code per platform seems to be an acceptable trade-off. [1] https://intel.github.io/wult/ Suggested-by: Len Brown Signed-off-by: Chen Yu --- v2: Simplify the commit log to not mention C3/PC3. (Artem) Confirm the exit latency on CLX and CPX.(Artem) --- drivers/idle/intel_idle.c | 33 +++++++++++++++++++++++++++++++++ 1 file changed, 33 insertions(+) diff --git a/drivers/idle/intel_idle.c b/drivers/idle/intel_idle.c index ec1b9d306ba6..e6c543b5ee1d 100644 --- a/drivers/idle/intel_idle.c +++ b/drivers/idle/intel_idle.c @@ -1484,6 +1484,36 @@ static void __init sklh_idle_state_table_update(void) skl_cstates[6].flags |= CPUIDLE_FLAG_UNUSABLE; /* C9-SKL */ } +/** + * skx_idle_state_table_update - Adjust the Sky Lake/Cascade Lake + * idle states table. + */ +static void __init skx_idle_state_table_update(void) +{ + unsigned long long msr; + + rdmsrl(MSR_PKG_CST_CONFIG_CONTROL, msr); + + /* + * 000b: C0/C1 (no package C-state support) + * 001b: C2 + * 010b: C6 (non-retention) + * 011b: C6 (retention) + * 111b: No Package C state limits. + */ + if ((msr & 0x7) < 2) { + /* + * Uses the CC6 + PC0 latency and 3 times of + * latency for target_residency if the PC6 + * is disabled in BIOS. This is consistent + * with how intel_idle driver uses _CST + * to set the target_residency. + */ + skx_cstates[2].exit_latency = 92; + skx_cstates[2].target_residency = 276; + } +} + static bool __init intel_idle_verify_cstate(unsigned int mwait_hint) { unsigned int mwait_cstate = MWAIT_HINT2CSTATE(mwait_hint) + 1; @@ -1515,6 +1545,9 @@ static void __init intel_idle_init_cstates_icpu(struct cpuidle_driver *drv) case INTEL_FAM6_SKYLAKE: sklh_idle_state_table_update(); break; + case INTEL_FAM6_SKYLAKE_X: + skx_idle_state_table_update(); + break; } for (cstate = 0; cstate < CPUIDLE_STATE_MAX; ++cstate) { -- 2.25.1