Date: Thu, 19 Nov 2015 10:36:45 +0000
From: Russell King - ARM Linux
To: Thomas Gleixner
Cc: Marc Gonzalez, Daniel Lezcano, LKML, Mans Rullgard, Viresh Kumar,
    Nicolas Pitre, Tony Lindgren, Sebastian Frias
Subject: Re: [PATCH] clocksource: Store reg field within struct clocksource
Message-ID: <20151119103645.GV8644@n2100.arm.linux.org.uk>
References: <564C8086.80804@sigmadesigns.com> <20151118172114.GS8644@n2100.arm.linux.org.uk>

On Thu, Nov 19, 2015 at 11:33:47AM +0100, Thomas Gleixner wrote:
> Russell,
>
> On Wed, 18 Nov 2015, Russell King - ARM Linux wrote:
>
> > On Wed, Nov 18, 2015 at 02:43:34PM +0100, Marc Gonzalez wrote:
> > > Since 'struct clocksource' is ____cacheline_aligned, gcc must insert
> > > a lot of padding between reg and clksrc in 'struct clocksource_mmio'
> > > (for example, L1_CACHE_BYTES = 64 on ARMv7).
> > >
> > > Storing reg within 'struct clocksource' removes unnecessary padding,
> > > and reg can then be grouped with other hot data. A nice side-effect
> > > of this patch is making container_of() unnecessary, which makes the
> > > code a bit simpler.
> > >
> > > On 32-bit platforms, reg fits in the padding between read and mask,
> > > meaning no downside from storing it there.
> >
> > Just swap the order of 'reg' and 'clksrc'.
>
> That might reduce the memory footprint, but it does not bring the
> iomem pointer closer to the other hotpath clocksource data. So we
> still need to touch at minimum two cache lines for reading the time.
>
> With Marcs change we have all hotpath data in a single cacheline.

Right, and what it's doing is polluting struct clocksource with lots of
ifdefs which determine how much data is contained in there.  Seems to
me to be totally insane.

The basic cause of this problem is the ____cacheline_aligned annotation
which effectively prevents wrapping struct clocksource to provide
implementation specific data.

Maybe your idea is that struct clocksource should be bloated with all
implementation specific data in the long term?

-- 
FTTC broadband for 0.8mile line: currently at 9.6Mbps down 400kbps up
according to speedtest.net.
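
For readers following the layout argument, the three arrangements discussed
in this thread can be sketched in userspace C. This is an illustration only,
not the kernel code: struct mock_clocksource, struct mock_clocksource_with_reg,
the two wrapper structs, cache_line_of() and print_layouts() are all
hypothetical names, the field list is a cut-down guess, and the 64-byte line
size is simply the ARMv7 figure quoted above. The alignment attribute is
GCC/Clang syntax.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Assumption: 64-byte cache lines (the ARMv7 L1_CACHE_BYTES value from the thread). */
#define CACHE_LINE 64

/* Hypothetical, cut-down stand-in for a cacheline-aligned struct clocksource. */
struct mock_clocksource {
	unsigned long long (*read)(struct mock_clocksource *cs);
	unsigned long long mask;
	unsigned int mult;
	unsigned int shift;
} __attribute__((aligned(CACHE_LINE)));

/* Wrapper with reg ahead of the aligned struct. */
struct mmio_reg_first {
	void *reg;
	struct mock_clocksource clksrc;
};

/* Wrapper with the two members swapped. */
struct mmio_reg_last {
	struct mock_clocksource clksrc;
	void *reg;
};

/* Variant with reg stored inside the aligned struct, next to read. */
struct mock_clocksource_with_reg {
	unsigned long long (*read)(struct mock_clocksource_with_reg *cs);
	void *reg;
	unsigned long long mask;
	unsigned int mult;
	unsigned int shift;
} __attribute__((aligned(CACHE_LINE)));

/* An aligned struct always has a size that is a multiple of its alignment. */
_Static_assert(sizeof(struct mock_clocksource) % CACHE_LINE == 0,
	       "aligned struct size must be a multiple of the line size");

/* Index of the cache line that a byte offset falls into. */
size_t cache_line_of(size_t offset)
{
	return offset / CACHE_LINE;
}

/* Print where reg and the clocksource data land in each arrangement. */
void print_layouts(void)
{
	printf("reg first:  reg@%zu clksrc@%zu sizeof=%zu\n",
	       offsetof(struct mmio_reg_first, reg),
	       offsetof(struct mmio_reg_first, clksrc),
	       sizeof(struct mmio_reg_first));
	printf("reg last:   clksrc@%zu reg@%zu sizeof=%zu\n",
	       offsetof(struct mmio_reg_last, clksrc),
	       offsetof(struct mmio_reg_last, reg),
	       sizeof(struct mmio_reg_last));
	printf("reg inside: read@%zu reg@%zu sizeof=%zu\n",
	       offsetof(struct mock_clocksource_with_reg, read),
	       offsetof(struct mock_clocksource_with_reg, reg),
	       sizeof(struct mock_clocksource_with_reg));
}
```

Calling print_layouts() from a small main() shows the offsets for whatever
ABI it is built on. In this mock, reg lands in a different 64-byte line from
read in both wrapper orderings, and shares line 0 with read only when it is
stored inside the aligned struct. The real struct clocksource has more
fields, so the exact numbers there will differ.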