Message-ID: <754cbb153e6c215db9771b05a735afaf7eb1120f.camel@linux.intel.com>
Subject: Re: [PATCH v3] cpufreq: intel_pstate: Reporting reasons why driver prematurely exit
From: Srinivas Pandruvada
To: Erwan Velu
Cc: e.velu@criteo.com, Liam.Howlett@oracle.com, Len Brown, "Rafael J. Wysocki", Viresh Kumar, "open list:INTEL PSTATE DRIVER", open list
Date: Mon, 11 Feb 2019 15:17:20 -0800
References: <20190211093140.23608-1-e.velu@criteo.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, 2019-02-11 at 22:34 +0100, Erwan Velu wrote:
> I understand your concern but I'd like to defend my use case ;)
>
> I was in a case where I didn't notice that intel_pstate did engage
> after a kernel upgrade while it didn't before.
> But there is strictly no informative message about why the driver
> took the decision not to load (aka considering there is already a
> power management system engaged).
>
> That had a major impact on the system performance without _any_
> notice.
>
> So I would have found it helpful to have an informative message about
> the reason why it didn't load.
> As intel_pstate is usually set (and by default) to Y, this prevents
> debugging it live and forces a kernel reboot, which was an issue in
> my production case.

To know whether intel_pstate is in control, you can look at:

# cat /sys/devices/system/cpu/cpufreq/policy0/scaling_driver

If it is not loaded on a processor model that Intel intends to support
with intel_pstate, then the OEM's platform_power_management policy can
override it. So we can add one pr_info to show that the driver can't be
loaded because intel_pstate_platform_pwr_mgmt_exists() returned true.
If HWP is used we already have a pr_info. If HWP is present it will
always be used unless the user overrides it.

In the cases where a memory allocation fails you will see other
warnings in the system, so we don't need to add one in the driver.

Also, if someone is explicitly using the kernel command line to either
disable or control features, the user knows what he is doing.

So there is no need for pr_warn or pr_info except in the one case of
platform power management.
The others are debug messages only.

Thanks,
Srinivas

> So I would agree to put some details in pr_debug() but keep the
> important return paths explicit, to help the user understand
> (without enabling debug) why it behaved like that, giving serious
> hints (kernel, BIOS, fw update).
>
> Would you agree if I update the patch this way?
> Thx!
> Erwan,
>
> On Mon, 11 Feb 2019 at 22:25, Srinivas Pandruvada wrote:
> >
> > On Mon, 2019-02-11 at 10:31 +0100, Erwan Velu wrote:
> > > The init code path has several exceptions where the module can
> > > decide not to load.
> > > As CONFIG_X86_INTEL_PSTATE is generally set to Y, the return code
> > > is not reachable.
> > > The initialisation code is also not verbose about the reason why
> > > it chose to exit prematurely.
> > >
> > > This leads to a situation where it is difficult for a user to
> > > determine, on a given platform, why the driver didn't load
> > > properly.
> > >
> > > This patch is about reporting to the user the reason/context why
> > > the driver failed to load.
> > > That is a precious hint when debugging a platform.
> >
> > pr_info and pr_warn are too strong for this debug use case. Any time
> > someone sees an error in dmesg, we have additional work to explain
> > to an average user that it is not a problem. So this is a
> > maintenance overhead for the purpose of debugging. Only the few
> > users who are really trying to debug will care about these errors.
> > For them pr_debug will be enough; we have other pr_debugs which
> > will help to root-cause the issue, even if the driver loads.
> >
> > Also, when a user has disabled intel_pstate via the kernel command
> > line, an additional debug message is not useful.
> >
> > Thanks,
> > Srinivas
> >
> > >
> > > Signed-off-by: Erwan Velu
> > > ---
> > >  drivers/cpufreq/intel_pstate.c | 36 ++++++++++++++++++++++++++--------
> > >  1 file changed, 28 insertions(+), 8 deletions(-)
> > >
> > > diff --git a/drivers/cpufreq/intel_pstate.c b/drivers/cpufreq/intel_pstate.c
> > > index dd66decf2087..ba2e2aee6c20 100644
> > > --- a/drivers/cpufreq/intel_pstate.c
> > > +++ b/drivers/cpufreq/intel_pstate.c
> > > @@ -2475,6 +2475,7 @@ static bool __init intel_pstate_no_acpi_pss(void)
> > >                  kfree(pss);
> > >          }
> > >
> > > +        pr_info("Cannot detect ACPI PSS");
> > >          return true;
> > >  }
> > >
> > > @@ -2484,10 +2485,16 @@ static bool __init intel_pstate_no_acpi_pcch(void)
> > >          acpi_handle handle;
> > >
> > >          status = acpi_get_handle(NULL, "\\_SB", &handle);
> > > -        if (ACPI_FAILURE(status))
> > > +        if (ACPI_FAILURE(status)) {
> > > +                pr_info("Cannot detect ACPI SB");
> > >                  return true;
> > > +        }
> > >
> > > -        return !acpi_has_method(handle, "PCCH");
> > > +        status = acpi_has_method(handle, "PCCH");
> > > +        if (!status) {
> > > +                pr_info("Cannot detect ACPI PCCH");
> > > +        }
> > > +        return !status;
> > >  }
> > >
> > >  static bool __init intel_pstate_has_acpi_ppc(void)
> > > @@ -2502,6 +2509,7 @@ static bool __init intel_pstate_has_acpi_ppc(void)
> > >                  if (acpi_has_method(pr->handle, "_PPC"))
> > >                          return true;
> > >          }
> > > +        pr_info("Cannot detect ACPI PPC");
> > >          return false;
> > >  }
> > >
> > > @@ -2592,8 +2600,10 @@ static int __init intel_pstate_init(void)
> > >          const struct x86_cpu_id *id;
> > >          int rc;
> > >
> > > -        if (no_load)
> > > +        if (no_load) {
> > > +                pr_info("disabling as per user-request\n");
> > >                  return -ENODEV;
> > > +        }
> > >
> > >          id = x86_match_cpu(hwp_support_ids);
> > >          if (id) {
> > > @@ -2606,31 +2616,41 @@ static int __init intel_pstate_init(void)
> > >                  }
> > >          } else {
> > >                  id = x86_match_cpu(intel_pstate_cpu_ids);
> > > -                if (!id)
> > > +                if (!id) {
> > > +                        pr_warn("CPU ID is not in the list of supported devices\n");
> > >                          return -ENODEV;
> > > +                }
> > >
> > >                  copy_cpu_funcs((struct pstate_funcs *)id->driver_data);
> > >          }
> > >
> > > -        if (intel_pstate_msrs_not_valid())
> > > +        if (intel_pstate_msrs_not_valid()) {
> > > +                pr_warn("Cannot enable driver as per invalid MSRs\n");
> > >                  return -ENODEV;
> > > +        }
> > >
> > >  hwp_cpu_matched:
> > >          /*
> > >           * The Intel pstate driver will be ignored if the platform
> > >           * firmware has its own power management modes.
> > >           */
> > > -        if (intel_pstate_platform_pwr_mgmt_exists())
> > > +        if (intel_pstate_platform_pwr_mgmt_exists()) {
> > > +                pr_warn("Platform already taking care of power management\n");
> > >                  return -ENODEV;
> > > +        }
> > >
> > > -        if (!hwp_active && hwp_only)
> > > +        if (!hwp_active && hwp_only) {
> > > +                pr_warn("HWP not present\n");
> > >                  return -ENOTSUPP;
> > > +        }
> > >
> > >          pr_info("Intel P-state driver initializing\n");
> > >
> > >          all_cpu_data = vzalloc(array_size(sizeof(void *), num_possible_cpus()));
> > > -        if (!all_cpu_data)
> > > +        if (!all_cpu_data) {
> > > +                pr_warn("Cannot allocate memory\n");
> > >                  return -ENOMEM;
> > > +        }
> > >
> > >          intel_pstate_request_control_from_smm();
> > >
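
The scaling_driver check mentioned in the reply can also be done from a
small userspace program, which may help when the check has to run on
many machines without a reboot. The following is only a minimal sketch,
not part of the patch: the helper names read_scaling_driver() and
is_intel_pstate() are hypothetical, and policy0 is assumed to exist on
the system being inspected.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Path quoted in the reply; assumes policy0 is present. */
#define SCALING_DRIVER_PATH \
	"/sys/devices/system/cpu/cpufreq/policy0/scaling_driver"

/*
 * Hypothetical helper: read the first line of `path` into `buf` and
 * strip the trailing newline. Returns 0 on success, -1 if the file
 * cannot be opened or read (e.g. no cpufreq driver is registered).
 */
static int read_scaling_driver(const char *path, char *buf, size_t len)
{
	FILE *f;

	assert(path != NULL && buf != NULL);
	if (len == 0)
		return -1;

	f = fopen(path, "r");
	if (!f)
		return -1;

	if (!fgets(buf, (int)len, f)) {
		fclose(f);
		return -1;
	}
	fclose(f);

	buf[strcspn(buf, "\n")] = '\0';
	return 0;
}

/*
 * Hypothetical helper: nonzero if `name` is exactly "intel_pstate".
 * Assumption: the driver may report a different name when it runs in
 * passive mode, so a mismatch alone does not prove it failed to load.
 */
static int is_intel_pstate(const char *name)
{
	return strcmp(name, "intel_pstate") == 0;
}
```

A caller would pass SCALING_DRIVER_PATH and a small buffer to
read_scaling_driver(), then test the result with is_intel_pstate(). A
return value of -1 only says the file could not be read; it does not
say why the driver is absent, which is the gap the patch discussion is
about.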