Subject: Re: [REGRESSION][v4.14.y][v4.15] x86/intel_rdt/cqm: Improve limbo list processing
To: Thomas Gleixner, "Yu, Fenghua"
Cc: "Shankar, Ravi V", "vikas.shivappa@linux.intel.com",
 "stable@vger.kernel.org", "linux-kernel@vger.kernel.org", "Luck, Tony",
 "peterz@infradead.org", "eranian@google.com", "ak@linux.intel.com",
 "davidcc@google.com", "mingo@redhat.com", "hpa@zytor.com", "x86@kernel.org",
 "1733662@bugs.launchpad.net" <1733662@bugs.launchpad.net>,
 "Roderick W. Smith"
References: <84b8d891-6217-f56d-8ec0-313f7eb317c9@canonical.com>
 <159B72D0-06FE-4925-A11A-1F8A7741BF70@intel.com>
 <3E5A0FA7E9CA944F9D5414FEC6C7122075908855@FMSMSX153.amr.corp.intel.com>
From: Joseph Salisbury
Message-ID: <7d3e17d9-b7f1-0829-1b2c-83a72deb281b@canonical.com>
Date: Wed, 17 Jan 2018 15:35:19 -0500
X-Mailing-List: linux-kernel@vger.kernel.org

On 01/16/2018 01:59 PM, Thomas Gleixner wrote:
> On Tue, 16 Jan 2018, Yu, Fenghua wrote:
>>> From: Thomas Gleixner [mailto:tglx@linutronix.de]
>> Is this a Haswell specific issue?
>>
>> I run the following test forever without issue on Broadwell and 4.15.0-rc6 with rdt mounted:
>> for ((;;)) do
>> for ((i=1;i<88;i++)) do
>> echo 0 >/sys/devices/system/cpu/cpu$i/online
>> done
>> echo "online cpus:"
>> grep processor /proc/cpuinfo |wc
>> for ((i=1;i<88;i++)) do
>> echo 1 >/sys/devices/system/cpu/cpu$i/online
>> done
>> echo "online cpus:"
>> grep processor /proc/cpuinfo|wc
>> done
>>
>> I'm finding a Haswell to reproduce the issue.
> Come on. This is crystal clear from the KASAN trace. And the fix is simple enough.
>
> You simply do not run into it because on your machine
>
> is_llc_occupancy_enabled() is false...
>
> Thanks,
>
> 	tglx
>
> 8<--------------------
>
> diff --git a/arch/x86/kernel/cpu/intel_rdt.c b/arch/x86/kernel/cpu/intel_rdt.c
> index 88dcf8479013..99442370de40 100644
> --- a/arch/x86/kernel/cpu/intel_rdt.c
> +++ b/arch/x86/kernel/cpu/intel_rdt.c
> @@ -525,10 +525,6 @@ static void domain_remove_cpu(int cpu, struct rdt_resource *r)
>  		 */
>  		if (static_branch_unlikely(&rdt_mon_enable_key))
>  			rmdir_mondata_subdir_allrdtgrp(r, d->id);
> -		kfree(d->ctrl_val);
> -		kfree(d->rmid_busy_llc);
> -		kfree(d->mbm_total);
> -		kfree(d->mbm_local);
>  		list_del(&d->list);
>  		if (is_mbm_enabled())
>  			cancel_delayed_work(&d->mbm_over);
> @@ -545,6 +541,10 @@ static void domain_remove_cpu(int cpu, struct rdt_resource *r)
>  			cancel_delayed_work(&d->cqm_limbo);
>  		}
>
> +		kfree(d->ctrl_val);
> +		kfree(d->rmid_busy_llc);
> +		kfree(d->mbm_total);
> +		kfree(d->mbm_local);
>  		kfree(d);
>  		return;
>  	}

Thanks, Thomas.  I'll build some test kernels and have your patch tested out.

Thanks,

Joe
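
For reference, the ordering that the patch above establishes can be sketched in plain user-space C. This is only an illustration, not kernel code: the struct layout, `nbits`, `count_busy()`, `domain_alloc()` and `domain_mark_busy()` are hypothetical names invented here, and the assumption that a late teardown step still reads `rmid_busy_llc` is mine, made to show why the member frees belong after the last user of those buffers.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-in for a per-domain structure. Only the member name
 * rmid_busy_llc comes from the patch; everything else is invented. */
struct domain {
	unsigned long *rmid_busy_llc;	/* models the busy-RMID bitmap */
	unsigned int nbits;		/* hypothetical: number of tracked RMIDs */
};

#define BITS_PER_WORD (8 * sizeof(unsigned long))

/* Hypothetical helper: models a teardown step that still reads the bitmap. */
static unsigned int count_busy(const struct domain *d)
{
	unsigned int i, busy = 0;

	for (i = 0; i < d->nbits; i++)
		if (d->rmid_busy_llc[i / BITS_PER_WORD] & (1UL << (i % BITS_PER_WORD)))
			busy++;
	return busy;
}

/* Allocate a domain with a zeroed bitmap; returns NULL on failure. */
struct domain *domain_alloc(unsigned int nbits)
{
	struct domain *d = calloc(1, sizeof(*d));

	if (!d)
		return NULL;
	d->rmid_busy_llc = calloc((nbits + BITS_PER_WORD - 1) / BITS_PER_WORD,
				  sizeof(unsigned long));
	if (!d->rmid_busy_llc) {
		free(d);
		return NULL;
	}
	d->nbits = nbits;
	return d;
}

void domain_mark_busy(struct domain *d, unsigned int rmid)
{
	assert(rmid < d->nbits);
	d->rmid_busy_llc[rmid / BITS_PER_WORD] |= 1UL << (rmid % BITS_PER_WORD);
}

/* Teardown in the patched order: run every step that touches the member
 * buffers first, then free the members, then free the container. */
unsigned int domain_remove(struct domain *d)
{
	unsigned int busy = count_busy(d);	/* last reader of the bitmap */

	free(d->rmid_busy_llc);			/* members only after the last use */
	free(d);				/* container last */
	return busy;
}
```

Moving `free(d->rmid_busy_llc)` above the `count_busy()` call would reproduce the kind of use-after-free that a tool such as KASAN (or AddressSanitizer in user space) reports. If the late step never runs on a given machine, the wrong ordering goes unnoticed there, which matches the remark about is_llc_occupancy_enabled() being false.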