Received: by 2002:a05:6a10:6006:0:0:0:0 with SMTP id w6csp1374533pxa; Fri, 28 Aug 2020 10:51:16 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyY6KK0OaRd6mh5XxRC31peD4WZIV8zSguZWLYB1iAVoVuEkWggw48AD20c8du1eRSt9PRJ X-Received: by 2002:a17:906:b09a:: with SMTP id x26mr3001808ejy.162.1598637076737; Fri, 28 Aug 2020 10:51:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1598637076; cv=none; d=google.com; s=arc-20160816; b=FEhs/bTTyGG+gUx0pB+fSzgvQatC7AqZJvD3yT2y5VujLbXuBQZXffSOFKuNLMDjMe GixZbl4NGK9P84Tr14IoN+wZrpKi39cXdPb7oppHRZIRkh+UaaYEOkH6gV5EMi82PUyN Y1gVOLAsYkav6PX5lqe/YruJatdPffp35Ne4Hll5E2djCnY/GlT7K4Rsl+EVXLAfs1NN edzyWpyZwbXLisCQAnG1sXALWk2fcnCqrkrKPVSSz52nOjJkuxhDCBHctlRTEEC/Wv5G hCGoXdGw9PXfPh1ukih6OgvWU4kv8zT5xxiE2ABN5f0n4CEsxjRy8v7rcxascmOfO5YW XypA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=v6F4nVL9wBETL+ZN7yPpn90OWPQHQz6pjoHFx9z42lk=; b=i5OvdxEazUh/qYbEYnUr0s4sGZYG0Ii75WMo1b34ZkNmIOpk72LYi7iGKsCzdC8Wmt SPds0lOc8/8lkQkwHJBePXrm0G3r4yAkX3LnK5uAGlNLXAq/iCUcrPIt0M6Xt9Kmveuf 7IytURxY9cTRpUxp2DQeQz2eR2oJjj3Y9rlwiaFo8+YEZONLlnI3QHxWWRa6A4gRLxZr YRSk05wBZQGj26Dx/THvRBMwnye22iLVT4Qop4I5w6VB7eN6v0OizhV9aZ0qbJC3FBWE eHtxILAhlr3oePsB+5iWN6nYUXMTemQv9+BHZSONFzoPjTOKOEq9WAMRQP409HFKOiF8 c8YA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id ck7si117675ejb.489.2020.08.28.10.50.54; Fri, 28 Aug 2020 10:51:16 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727056AbgH1Rsr (ORCPT + 99 others); Fri, 28 Aug 2020 13:48:47 -0400 Received: from mx2.suse.de ([195.135.220.15]:54708 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725814AbgH1Rsq (ORCPT ); Fri, 28 Aug 2020 13:48:46 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id AFEBBAF19; Fri, 28 Aug 2020 17:49:17 +0000 (UTC) Date: Fri, 28 Aug 2020 19:48:39 +0200 From: Borislav Petkov To: Feng Tang Cc: "Luck, Tony" , kernel test robot , LKML , lkp@lists.01.org, Mel Gorman Subject: Re: [LKP] Re: [x86/mce] 1de08dccd3: will-it-scale.per_process_ops -14.1% regression Message-ID: <20200828174839.GD19448@zn.tnic> References: <20200425114414.GU26573@shao2-debian> <20200425130136.GA28245@zn.tnic> <20200818082943.GA65567@shbuild999.sh.intel.com> <20200818200654.GA21494@agluck-desk2.amr.corp.intel.com> <20200819020437.GA2605@shbuild999.sh.intel.com> <20200821020259.GA90000@shbuild999.sh.intel.com> <20200824151425.GF4794@zn.tnic> <20200824153300.GA56944@shbuild999.sh.intel.com> <20200824161238.GI4794@zn.tnic> <20200825062305.GA83850@shbuild999.sh.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20200825062305.GA83850@shbuild999.sh.intel.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Aug 25, 2020 at 02:23:05PM +0800, Feng Tang wrote: > Also one good news is, we seem to identify the 2 key percpu variables > out of the list mentioned in previous email: > 'arch_freq_scale' > 'tsc_adjust' > > These 2 variables are accessed in 2 hot call stacks (for this 288 CPU > Xeon Phi platform): > > - arch_freq_scale is accessed in scheduler tick > arch_scale_freq_tick+0xaf/0xc0 > scheduler_tick+0x39/0x100 > update_process_times+0x3c/0x50 > tick_sched_handle+0x22/0x60 > tick_sched_timer+0x37/0x70 > __hrtimer_run_queues+0xfc/0x2a0 > hrtimer_interrupt+0x122/0x270 > smp_apic_timer_interrupt+0x6a/0x150 > apic_timer_interrupt+0xf/0x20 > > - tsc_adjust is accessed in idle entrance > tsc_verify_tsc_adjust+0xeb/0xf0 > arch_cpu_idle_enter+0xc/0x20 > do_idle+0x91/0x280 > cpu_startup_entry+0x19/0x20 > start_kernel+0x4f4/0x516 > secondary_startup_64+0xb6/0xc0 > > From systemmap file, for bad kernel these 2 sit in one cache line, while > for good kernel they sit in 2 separate cache lines. > > It also explains why it turns from a regression to an improvement with > updated gcc/kconfig, as the cache line sharing situation is reversed. > > The direct patch I can think of is to make 'tsc_adjust' cache aligned > to separate these 2 'hot' variables. How do you think? > > --- a/arch/x86/kernel/tsc_sync.c > +++ b/arch/x86/kernel/tsc_sync.c > @@ -29,7 +29,7 @@ struct tsc_adjust { > bool warned; > }; > > -static DEFINE_PER_CPU(struct tsc_adjust, tsc_adjust); > +static DEFINE_PER_CPU_ALIGNED(struct tsc_adjust, tsc_adjust); So why don't you define both variables with DEFINE_PER_CPU_ALIGNED and check if all your bad measurements go away this way? You'd also need to check whether there's no detrimental effect from this change on other, i.e., !KNL platforms, and I think there won't be because both variables will be in separate cachelines then and all should be good. Hmm? -- Regards/Gruss, Boris. SUSE Software Solutions Germany GmbH, GF: Felix Imendörffer, HRB 36809, AG Nürnberg