Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751267AbbLKRuD (ORCPT ); Fri, 11 Dec 2015 12:50:03 -0500 Received: from mezzanine.sirena.org.uk ([106.187.55.193]:43060 "EHLO mezzanine.sirena.org.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750748AbbLKRuA (ORCPT ); Fri, 11 Dec 2015 12:50:00 -0500 Date: Fri, 11 Dec 2015 17:49:40 +0000 From: Mark Brown To: Juri Lelli Cc: Rob Herring , linux-kernel@vger.kernel.org, linux-pm@vger.kernel.org, linux-arm-kernel@lists.infradead.org, devicetree@vger.kernel.org, peterz@infradead.org, vincent.guittot@linaro.org, mark.rutland@arm.com, linux@arm.linux.org.uk, sudeep.holla@arm.com, lorenzo.pieralisi@arm.com, catalin.marinas@arm.com, will.deacon@arm.com, morten.rasmussen@arm.com, dietmar.eggemann@arm.com, Pawel Moll , Ian Campbell , Kumar Gala , Maxime Ripard , Olof Johansson , Gregory CLEMENT , Paul Walmsley , Linus Walleij , Chen-Yu Tsai , Thomas Petazzoni Message-ID: <20151211174940.GQ5727@sirena.org.uk> References: <1448288921-30307-1-git-send-email-juri.lelli@arm.com> <1448288921-30307-3-git-send-email-juri.lelli@arm.com> <20151124020631.GA15165@rob-hp-laptop> <20151210153004.GA26758@sirena.org.uk> <20151210175820.GE14571@e106622-lin> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="TuLZpJsoCmPfu6RO" Content-Disposition: inline In-Reply-To: <20151210175820.GE14571@e106622-lin> X-Cookie: revolutionary, adj.: User-Agent: Mutt/1.5.24 (2015-08-30) X-SA-Exim-Connect-IP: 94.175.94.161 X-SA-Exim-Mail-From: broonie@sirena.org.uk Subject: Re: [RFC PATCH 2/8] Documentation: arm: define DT cpu capacity bindings X-SA-Exim-Version: 4.2.1 (built Mon, 26 Dec 2011 16:24:06 +0000) X-SA-Exim-Scanned: Yes (on mezzanine.sirena.org.uk) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 6726 Lines: 135 --TuLZpJsoCmPfu6RO Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Thu, Dec 10, 2015 at 05:58:20PM +0000, Juri Lelli wrote: > On 10/12/15 15:30, Mark Brown wrote: > > On Mon, Nov 23, 2015 at 08:06:31PM -0600, Rob Herring wrote: > > > In other words, I want to see these numbers have a defined method=20 > > > of determining them and don't want to see random values from every=20 > > > vendor. ARM, Ltd. says core X has a value of Y would be good enough f= or=20 > > > me. Vendor X's A57 having a value of 2 and Vendor Y's A57 having a=20 > > > value of 1024 is not what I want to see. Of course things like cache= =20 > > > sizes can vary the performance, but is a baseline value good enough?= =20 > > > However, no vendor will want to publish their values if these are=20 > > > absolute values relative to other vendors. > > > If you expect these to need frequent tuning, then don't put them in D= T. > > I agree strongly. Putting what are essentially tuning numbers for the > > system into the ABI is going to lead us into a mess long term since if > > we change anything related to the performance of the system the numbers > > may become invalid and we've no real way of recovering sensible > > information. > I'm not entirely getting here why you consider capacity values to be > tunables. As part of the EAS effort, we are proposing ways in which users The purpose of the capacity values is to influence the scheduler behaviour and hence performance. Without a concrete definition they're just magic numbers which have meaining only in terms of their effect on the performance of the system. That is a sufficiently complex outcome to ensure that there will be an element of taste in what the desired outcomes are. Sounds like tuneables to me. > should be able to fine tune their system as needed, when required > (don't know if you had a chance to have a look at the SchedTune posting > back in August for example [1]). This patch tries to only standardize > where do we get default values from and how we specify them. I'm not > seeing them changing much after an initial benchmarking phase has been > done. Tuning should happen using different methods, not by changing > these values, IMHO. If you are saying people should use other, more sensible, ways of specifying the final values that actually get used in production then why take the defaults from direct numbers DT in the first place? If you are saying that people should tune and then put the values in here then that's problematic for the reasons I outlined. > > It would be better to have the DT describe concrete physical properties > > of the system which we can then map onto numbers we like, that way if we > > get better information in future or just decide that completely > > different metrics are appropriate for tuning we can just do that without > > having to worry about translating the old metrics into new ones. We can > > then expose the tuning knobs to userspace for override if that's needed. > > If doing system specific tuning on vertically integrated systems really > > is terribly important it's not going to matter too much where the tuning > > is but we also have to consider more general purpose systems. > As replied to Rob, I'm not sure it is so easy to find any physical > property that expresses what we essentially need (without maybe relying > on a complex mix of hardware details and a model to extract numbers from > them). Instead, we propose to have reasonable, per SoC, default numbers; > and then let users fine tune their platform afterwards, without changing > those default values. If users are supposed to do fine tuning elsewhere after the fact why bother with this initial callibration? Something that's ballpark good enough like just knowing the core used and perhaps some important options on it should give an adequate starting point and not have the issues with having the tuning numbers present as magic numbers. Perhaps we might also feed cache information in at some point. If in future we're able to improve those default numbers (or just adapt at runtime) then even better. It also seems a bit strange to expect people to do some tuning in one place initially and then additional tuning somewhere else later, from a user point of view I'd expect to always do my tuning in the same place. > > We're not going to get out of having to pick numbers at some point, > > pushing them into DT doesn't get us out of that but it does make the > > situation harder to manage long term and makes the performance for the > > general user less relaible. It's also just more work all round, > > everyone doing the DT for a SoC is going to have to do some combination > > of cargo culting or repeating the callibration. > I'm most probably a bit naive here, but I see the calibration phase > happening only once, after the platform is stable. You get default > capacity values by running a pretty simple benchmark on a fixed > configuration; and you put them somewhere (DTs still seem to be a > sensible place to me). Then you'll be able to suit tuning needs using > different interfaces. My point is that everyone making any kind of SoC with asymmetries is expected to go and do some kind of callibration based on some unclear criteria, if these are just ballpark accurate starting points that seems like wasted effort - the kernel should be making a reasonable effort to do something sensible without this information which is going to be less effort all round. It doesn't need to wait for real silicon (this seems like the sort of core bit of DT which will be being written pre-tapeout) and doesn't have marketing implications. Doing that and then switching to some other interface for real tuning seems especially odd and I'm not sure that's something that users are going to expect or understand. --TuLZpJsoCmPfu6RO Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 iQEcBAEBCAAGBQJWawyzAAoJECTWi3JdVIfQxdYH/2akLo+F2UEntQFjlRZmUkK4 54Mdb9h5dgVrUtDPs+M2uzskF3ngUAnsEllypc03ewp5xbjNeRtVOvpB1xBI6dFy g6+li9ULgPhStUQDZUVJVk/mTwOkNC1iUAGX03UBtq+y+XWgNEATbyJuOgaLsHRD jwEO9Ll+CixQZBafnjwFG6O1cbVp5gf4dLNOnEVHsVx8W+bOJo6Ozv8BjIZmi3PG MMWBiK0f8iGpBUt/Id7EIz3nzEA3gdbBV2SuAeXAVGhBcbLX1yyLgTB0KV1SGRC9 2f564S4gXArSpUey+/4ZIdXrKeOWzLEjrz2RV8dD6PFN4Q5uyvqahKaTWmFNVe0= =IwPZ -----END PGP SIGNATURE----- --TuLZpJsoCmPfu6RO-- -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/