From: "Luck, Tony" <tony.luck@intel.com>
To: Thomas Gleixner <tglx@linutronix.de>,
        Vikas Shivappa <vikas.shivappa@linux.intel.com>
CC: "Shivappa, Vikas" <vikas.shivappa@intel.com>,
        "x86@kernel.org" <x86@kernel.org>,
        "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
        "hpa@zytor.com" <hpa@zytor.com>,
        "peterz@infradead.org" <peterz@infradead.org>,
        "Shankar, Ravi V" <ravi.v.shankar@intel.com>,
        "Yu, Fenghua" <fenghua.yu@intel.com>,
        "ak@linux.intel.com" <ak@linux.intel.com>,
        "eranian@google.com" <eranian@google.com>,
        "davidcc@google.com" <davidcc@google.com>
Subject: RE: [PATCH 1/2] x86/intel_rdt/mbm: Fix MBM overflow handler during
 hot cpu
Thread-Topic: [PATCH 1/2] x86/intel_rdt/mbm: Fix MBM overflow handler during
 hot cpu
Thread-Index: AQHTFirRxF/MG2eKTU24+fmVpmJpbaKHKpyA///mSVA=
Date: Wed, 16 Aug 2017 14:53:52 +0000
Message-ID: <3908561D78D1C84285E8C5FCA982C28F61340139@ORSMSX114.amr.corp.intel.com>
References: <1502845243-20454-1-git-send-email-vikas.shivappa@linux.intel.com>
 <1502845243-20454-2-git-send-email-vikas.shivappa@linux.intel.com>
 <alpine.DEB.2.20.1708161117450.1987@nanos>
In-Reply-To: <alpine.DEB.2.20.1708161117450.1987@nanos>
Accept-Language: en-US
Content-Language: en-US
dlp-product: dlpe-windows
dlp-version: 10.0.102.7
dlp-reaction: no-action
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 8BIT
MIME-Version: 1.0
Sender: linux-kernel-owner@vger.kernel.org
Content-Length: 672
Lines: 16

> You could alternatively use flush and make the worker code schedule the
> work on a still online CPU in the domain instead of blindly rescheduling it
> on the same CPU.

We looked at that when you suggested flush. The problem is that we have
already deleted the current cpu from the bitmask for the domain, so the
worker code can't tell which domain it is running on and therefore can't
pick another CPU in that domain.

If we try to do the flush before dropping the cpu from the bitmask, then
the worker code doesn't have any reason to pick a different CPU.
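
To make the two orderings concrete, here is a rough userspace model. It
is only a sketch, not the intel_rdt code: struct domain, find_domain(),
pick_next_cpu() and NO_CPU are made-up names, and a domain is reduced to
a 64-bit cpu mask.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical userspace model of the problem, not kernel code. */
#define NO_CPU (-1)

struct domain {
	uint64_t cpu_mask;	/* bit n set => CPU n is still in the domain */
};

/* Return the domain whose mask contains @cpu (0..63), or NULL if none. */
static struct domain *find_domain(struct domain *doms, int ndoms, int cpu)
{
	for (int i = 0; i < ndoms; i++)
		if (doms[i].cpu_mask & (UINT64_C(1) << cpu))
			return &doms[i];
	return NULL;
}

/* Model of the worker deciding where to schedule its next run. */
static int pick_next_cpu(struct domain *doms, int ndoms, int this_cpu)
{
	struct domain *d = find_domain(doms, ndoms, this_cpu);

	if (!d)
		return NO_CPU;	/* bit already cleared: domain unknown */

	/* bit still set: nothing tells the worker it should move */
	return this_cpu;
}
```

Clearing the bit before the flush hits the NO_CPU case; flushing first
hits the second case and the work simply lands on the same CPU again.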

Is there some cheap "I'm running on a CPU that is in the process of going
offline" test that we could make in the worker code?

-Tony