Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752879AbbHaVYk (ORCPT ); Mon, 31 Aug 2015 17:24:40 -0400 Received: from bh-25.webhostbox.net ([208.91.199.152]:37395 "EHLO bh-25.webhostbox.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752173AbbHaVYO (ORCPT ); Mon, 31 Aug 2015 17:24:14 -0400 Message-ID: <55E4C5F8.6050303@roeck-us.net> Date: Mon, 31 Aug 2015 14:24:08 -0700 From: Guenter Roeck User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.8.0 MIME-Version: 1.0 To: Peter Zijlstra CC: Borislav Petkov , Ingo Molnar , Huang Rui , Borislav Petkov , Jean Delvare , Andy Lutomirski , Andreas Herrmann , Thomas Gleixner , "Rafael J. Wysocki" , Len Brown , John Stultz , =?windows-1252?Q?Fr=E9d=E9ric_Weisbecker?= , lm-sensors@lm-sensors.org, linux-kernel@vger.kernel.org, x86@kernel.org, Andreas Herrmann , Aravind Gopalakrishnan , Fengguang Wu , Aaron Lu , Tony Li Subject: Re: [PATCH 09/15] x86, amd: add accessor for number of cores per compute unit References: <1440662866-28716-1-git-send-email-ray.huang@amd.com> <1440662866-28716-10-git-send-email-ray.huang@amd.com> <20150828064814.GB20153@nazgul.tnic> <20150828080418.GB13309@gmail.com> <20150829091914.GA8878@gmail.com> <20150830155322.GA26370@nazgul.tnic> <20150831083821.GE16853@twins.programming.kicks-ass.net> <20150831160622.GA29830@nazgul.tnic> <55E47E88.7080202@roeck-us.net> <20150831204426.GI16853@twins.programming.kicks-ass.net> In-Reply-To: <20150831204426.GI16853@twins.programming.kicks-ass.net> Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 7bit X-Authenticated_sender: linux@roeck-us.net X-OutGoing-Spam-Status: No, score=0.0 X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - bh-25.webhostbox.net X-AntiAbuse: Original Domain - vger.kernel.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - roeck-us.net X-Get-Message-Sender-Via: bh-25.webhostbox.net: authenticated_id: linux@roeck-us.net X-Source: X-Source-Args: X-Source-Dir: Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2602 Lines: 63 On 08/31/2015 01:44 PM, Peter Zijlstra wrote: > On Mon, Aug 31, 2015 at 09:19:20AM -0700, Guenter Roeck wrote: >> On 08/31/2015 09:06 AM, Borislav Petkov wrote: > >>> That's a good point - I missed that during previous review. Rui, please >>> put the rdmsrl_safe_on_cpu() accesses in a separate function which you >>> run on a particular CPU, for your next version. >>> >> ... and maybe work with Peter to address the other hotplug related issues. >> >> It might also be worthwhile thinking about per-CU attributes, if that >> provides any value (Peter's comments suggested that this might be the case). > > Yeah, so it would allow measuring the power of a subset of compute > units. Typically only useful if you've partitioned your workload. But > since the hardware trivially supports it, its a waste to not expose it. > > (Note that its not per-cpu, its per compute unit. What we do with perf > is export a cpumask) > > My biggest problem is that all this is user readable and unthrottled. It > basically allows DoS (perf does not typically allow user access to CPU > wide resources). > > Imagine joe user doing: > > for ((i=0; i<1000; i++)); do > (while :; do cat /sys/foo/file > /dev/null ; done) & > done > > Even when contained to a subset of CPUs, that will cause an IPI storm on > all (/2) CPUs, even if you've tried really hard to keep users away from > some of them (see the above partitioning) because you're running some > important RT workload or whatnot. > That is a matter of driver implementation. Very commonly hwmon drivers implement value caching, where data from the device is only read at minimum intervals, and the cached values are reported if the information is polled too rapidly. For the most part, that is used with i2c temperature sensors, which tend to update their readings only a few times per second anyway, but the same mechanism can (and possibly should) be used in situations like this. At some point I would like to move the caching mechanism into the hwmon core, but that is going to take a while. > > As to hotplug, if you unplug any of the even numbered CPUs the whole > thing bails and returns 0, even if the corresponding odd CPU of the > compute unit it still online and perfectly capable of accessing the MSR. > Ah yes, I recall there was a similar problem in the coretemp driver some time ago. Guenter -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/