Received: by 2002:a05:6a10:eb17:0:0:0:0 with SMTP id hx23csp2998579pxb; Mon, 6 Sep 2021 09:56:37 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwC2z8Sh7xqBQ0jJ5506Y+s7/MDOd2l9e5j59EMCuN7l8L6zt/kzkZaTNHeb1WZ0PG6vcGG X-Received: by 2002:a92:2e0d:: with SMTP id v13mr9026026ile.111.1630947396935; Mon, 06 Sep 2021 09:56:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1630947396; cv=none; d=google.com; s=arc-20160816; b=zAlqcFs6TT0uN3xL/Jwc0cOQYjdOM0iqfSjkCnF18crNRSbUSqJUszMF5dIOfsMSrl hV8zMPzMVSsZ5oejwpJQyyhSVvvns0JJsOeJCdhXBzTshSZ4wpURbRdQ9d17slPrC6aF P0HSkavJim+5PFukeLfc5ukfMZFYvEoUMGPIJxzAr9kEXFhL34lJntJijaIEK/ACzr07 tWSLWqmX5vDgd7yBbNLM4z0YqUiJwnfAdjpD0aCehF8NjEcpxetrUiGE1lSWGl19KVOA tB1U9JO1AYm+oBapgzlGa2ymzTrU8KO671ksMoBIsErMCP24h1xfS4pk8hzyzc0gcp8U fmwA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:date:cc:to:from:subject :message-id; bh=PcIQVLchssvLigvi50RxCIm6v7fQuvZP4QSEZ9pqM4o=; b=C19eEA1VDnLul9xjS/ZwHBwZ4eOBcgwybzm0IvWFy4m0fp8CofsBqoAZH7xzpCCl3F Ou8HxKaLZGLDfpOBVmnkLEZyjz0yKp2NhlAtvk3n2hHosN3qWINcKLRtWjYrB+IMx2OG jEgxWz0Vu02wisJyEY7+spcdv8Zqotjj0BUAlopoFYwEuJTC/ydmlnKdY9xVDhC1yMsQ n5l4VUwciMyx4rVlJHLoNGnts3xnLaFMbtWy4LNr16iwnYDQmlLpCxc2+SIKkPga2GAs kgIfyC4trlq50HOjI1vuzzHUIvOT9Ct5cChaFxp6om5sB3fKJwEGI1jtDCDqfn9yLXyN 4Dwg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id r2si7292302ila.178.2021.09.06.09.56.24; Mon, 06 Sep 2021 09:56:36 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234230AbhIFQ4F (ORCPT + 99 others); Mon, 6 Sep 2021 12:56:05 -0400 Received: from mga12.intel.com ([192.55.52.136]:43273 "EHLO mga12.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230491AbhIFQ4E (ORCPT ); Mon, 6 Sep 2021 12:56:04 -0400 X-IronPort-AV: E=McAfee;i="6200,9189,10099"; a="199539372" X-IronPort-AV: E=Sophos;i="5.85,272,1624345200"; d="scan'208";a="199539372" Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by fmsmga106.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 06 Sep 2021 09:54:59 -0700 X-IronPort-AV: E=Sophos;i="5.85,272,1624345200"; d="scan'208";a="546293949" Received: from ibelagox-mobl1.gar.corp.intel.com ([10.213.76.130]) by fmsmga002-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 06 Sep 2021 09:54:56 -0700 Message-ID: Subject: Re: [PATCH] cpufreq: intel_pstate: Fix for HWP interrupt before driver is ready From: Srinivas Pandruvada To: Jens Axboe , "Rafael J. Wysocki" Cc: "Rafael J. Wysocki" , Len Brown , Viresh Kumar , Linux PM , Linux Kernel Mailing List Date: Mon, 06 Sep 2021 09:54:52 -0700 In-Reply-To: <926ac4b9-1bb5-e96e-ded3-6461f7a215b7@kernel.dk> References: <20210904053703.581297-1-srinivas.pandruvada@linux.intel.com> <926ac4b9-1bb5-e96e-ded3-6461f7a215b7@kernel.dk> Content-Type: text/plain; charset="UTF-8" User-Agent: Evolution 3.40.0-1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 2021-09-06 at 10:43 -0600, Jens Axboe wrote: > On 9/6/21 10:17 AM, Rafael J. Wysocki wrote: > > On Sat, Sep 4, 2021 at 7:37 AM Srinivas Pandruvada > > wrote: > > > > > > In Lenovo X1 gen9 laptop, HWP interrupts arrive before driver is > > > ready > > > to handle on that CPU. Basically didn't even allocated memory for > > > per > > > cpu data structure and not even started interrupt enable process > > > on that > > > CPU. So interrupt handler observes a NULL pointer to schedule > > > work. > > > > > > This interrupt was probably for SMM, but since it is redirected > > > to > > > OS by OSC call, OS receives it, but not ready to handle. That > > > redirection > > > of interrupt to OS was also done to solve one SMM crash on Yoga > > > 260 for > > > HWP interrupt a while back. > > > > > > To solve this the HWP interrupt handler should ignore such > > > request if the > > > driver is not ready. This will require some flag to wait till the > > > driver > > > setup a workqueue to handle on a CPU. We can't simply assume > > > cpudata to > > > be NULL and avoid processing as it may not be NULL but data > > > structure is > > > not in consistent state. > > > > > > So created a cpumask which sets the CPU on which interrupt was > > > setup. If > > > not setup, simply clear the interrupt status and return. Since > > > the > > > similar issue can happen during S3 resume, clear the bit during > > > offline. > > > > > > Since interrupt timing may be before HWP is enabled, use safe MSR > > > read > > > writes as before the change for HWP interrupt. > > > > > > Fixes: d0e936adbd22 ("cpufreq: intel_pstate: Process HWP > > > Guaranteed change notification") > > > Reported-and-tested-by: Jens Axboe > > > Signed-off-by: Srinivas Pandruvada < > > > srinivas.pandruvada@linux.intel.com> > > > --- > > >  drivers/cpufreq/intel_pstate.c | 23 ++++++++++++++++++++++- > > >  1 file changed, 22 insertions(+), 1 deletion(-) > > > > > > diff --git a/drivers/cpufreq/intel_pstate.c > > > b/drivers/cpufreq/intel_pstate.c > > > index b4ffe6c8a0d0..5ac86bfa1080 100644 > > > --- a/drivers/cpufreq/intel_pstate.c > > > +++ b/drivers/cpufreq/intel_pstate.c > > > @@ -298,6 +298,8 @@ static bool hwp_boost __read_mostly; > > > > > >  static struct cpufreq_driver *intel_pstate_driver __read_mostly; > > > > > > +static cpumask_t hwp_intr_enable_mask; > > > + > > >  #ifdef CONFIG_ACPI > > >  static bool acpi_ppc; > > >  #endif > > > @@ -1067,11 +1069,15 @@ static void intel_pstate_hwp_set(unsigned > > > int cpu) > > >         wrmsrl_on_cpu(cpu, MSR_HWP_REQUEST, value); > > >  } > > > > > > +static void intel_pstate_disable_hwp_interrupt(struct cpudata > > > *cpudata); > > > + > > >  static void intel_pstate_hwp_offline(struct cpudata *cpu) > > >  { > > >         u64 value = READ_ONCE(cpu->hwp_req_cached); > > >         int min_perf; > > > > > > +       intel_pstate_disable_hwp_interrupt(cpu); > > > + > > >         if (boot_cpu_has(X86_FEATURE_HWP_EPP)) { > > >                 /* > > >                  * In case the EPP has been set to "performance" > > > by the > > > @@ -1645,20 +1651,35 @@ void notify_hwp_interrupt(void) > > >         if (!hwp_active || !boot_cpu_has(X86_FEATURE_HWP_NOTIFY)) > > >                 return; > > > > > > -       rdmsrl(MSR_HWP_STATUS, value); > > > +       rdmsrl_safe(MSR_HWP_STATUS, &value); > > >         if (!(value & 0x01)) > > >                 return; > > > > > > +       if (!cpumask_test_cpu(this_cpu, &hwp_intr_enable_mask)) { > > > +               wrmsrl_safe(MSR_HWP_STATUS, 0); > > > +               return; > > > +       } > > > > Without additional locking, there is a race between this and > > intel_pstate_disable_hwp_interrupt(). > > > > 1. notify_hwp_interrupt() checks hwp_intr_enable_mask() and the > > target > > CPU is in there, so it will go for scheduling the delayed work. > > 2. intel_pstate_disable_hwp_interrupt() runs between the check and > > the > > cpudata load below. > > 3. hwp_notify_work is scheduled on the CPU that isn't there in the > > mask any more. > > I noticed that too, not clear to me how much of an issue that is in > practice. But there's definitely a race there. Glad to see how this is possible from code running in ISR context. Thanks, Srinivas > > Can we just revert d0e936adbd22 for now so this can get solved in due > time? I doubt I'm the only one that got bit by this, but I might be > the > only one that bothered to actually figured out what caused my laptop > to > crash with a black screen on boot. >