Received: by 2002:ac0:a5b6:0:0:0:0:0 with SMTP id m51-v6csp609263imm; Wed, 6 Jun 2018 03:07:21 -0700 (PDT) X-Google-Smtp-Source: ADUXVKJV1Nu7rZ+/7fxPCmbysSjTI9iaRsrEekuytnARZMD+ulPSL/rXZbTLvq4VRyzO9PugbcnS X-Received: by 2002:a17:902:a5c7:: with SMTP id t7-v6mr2604745plq.360.1528279641271; Wed, 06 Jun 2018 03:07:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1528279641; cv=none; d=google.com; s=arc-20160816; b=dTC3yrPLA6PF4W+5w21uWObfaJSB6gvBLFmlnRXZln85I6zio3ReWlMASYCIcLm7Im u0hbcWOOlVb3EmD7nkZwZSCGaRyY17OJ4EiekDamSdfH+ZrdEZj3NK3HwbT7N/jR7ofH leuoeFzAidxaVVp8fPfjlLf6X7Dvi+W2CzfS+7HGDv9ow1QZPJM8EwbjHLShF+ppstzn P/zn3B1Pc1N1CaedCPXvq4CNqzI/PEPng0au2xmpKsTRevIb34FKJVSBGsobTiIBldsZ h1nwuf3ZDwOPTW0x3eP+qnS3TMD7kjlMdbHfCEvMgdO0AelkTiwwIUDlqgLokVHzstqd w7FA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:organization:from:references:to:subject:cc :arc-authentication-results; bh=R4TSaF2sFmaKJXAl5rCH2gs9zNudEiqZ6Vl6JKCmF34=; b=RIdqBCSuFVRfMS1qYxNJld9BcD3jHeHRk/LYqp2nNs3Fz9yhFkr5DrCsqoBw7mwMiO QC6kR54r+zpg+CAbG+XXtTa/Zfo/LPOyi0HOu/H2jXfqHbGmqrwsKGAVE2wlZYmLl/BD UVDNt81CTtgxfY9dyE8dZwOOum9FAvfHAh3R8xyh3Oa+T62k7ATwEjnxtOd2WchaFWWr zihs2MqCgF95sqKRKd4Zl5mrKLqCQT1OaI46J02/DzlsG6HeeSA059nCTg8zb4EHEg5f 7UakavW2XsOGmeYPgdVmh6avHGDFo7Hl0+K7n9Mg3XzURXLXkLY85+1s+7wgytbKI3ow aKKg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f186-v6si15049165pfg.194.2018.06.06.03.07.06; Wed, 06 Jun 2018 03:07:21 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752415AbeFFKFp (ORCPT + 99 others); Wed, 6 Jun 2018 06:05:45 -0400 Received: from usa-sjc-mx-foss1.foss.arm.com ([217.140.101.70]:38890 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752343AbeFFKFn (ORCPT ); Wed, 6 Jun 2018 06:05:43 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 6C5DC15AB; Wed, 6 Jun 2018 03:05:43 -0700 (PDT) Received: from [10.1.210.28] (e107155-lin.cambridge.arm.com [10.1.210.28]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 6ADB83F557; Wed, 6 Jun 2018 03:05:41 -0700 (PDT) Cc: Sudeep Holla , Will.Deacon@arm.com, Catalin.Marinas@arm.com, Robin.Murphy@arm.com, Morten.Rasmussen@arm.com, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, geert@linux-m68k.org, linux-acpi@vger.kernel.org, ard.biesheuvel@linaro.org Subject: Re: [PATCH] arm64: topology: Avoid checking numa mask for scheduler MC selection To: Jeremy Linton References: <20180605190837.493505-1-jeremy.linton@arm.com> From: Sudeep Holla Organization: ARM Message-ID: <3c8a541d-78a1-389a-112c-494f7c1a543d@arm.com> Date: Wed, 6 Jun 2018 11:05:40 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.8.0 MIME-Version: 1.0 In-Reply-To: <20180605190837.493505-1-jeremy.linton@arm.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 05/06/18 20:08, Jeremy Linton wrote: > The numa mask subset check has problems if !CONFIG_NUMA, over hotplug > operations or during early boot. Lets disable the NUMA siblings checks > for the time being, as NUMA in socket machines have LLC's that will > assure that the scheduler topology isn't "borken". > ^ broken ? (not sure if usage of borken is intentional :)) > Futher, as a defensive mechanism during hotplug, lets assure that the ^ Further > LLC siblings are also masked. > Also add the symptoms of the issue we say as Geert suggested me. Something like: " This often leads to system hang or crash during CPU hotplug and system suspend operation. This is mostly observed on HMP systems where the CPU compute capacities are different and ends up in different scheduler domains. Since cpumask_of_node is returned instead core_sibling, the scheduler is confused with incorrect cpumasks(e.g. one CPU in two different sched domains at the same time) on CPU hotplug." You can add Reported-by: Geert... ? > Signed-off-by: Jeremy Linton > --- > arch/arm64/kernel/topology.c | 11 ++++------- > 1 file changed, 4 insertions(+), 7 deletions(-) > > diff --git a/arch/arm64/kernel/topology.c b/arch/arm64/kernel/topology.c > index 7415c166281f..f845a8617812 100644 > --- a/arch/arm64/kernel/topology.c > +++ b/arch/arm64/kernel/topology.c > @@ -215,13 +215,8 @@ EXPORT_SYMBOL_GPL(cpu_topology); > > const struct cpumask *cpu_coregroup_mask(int cpu) > { > - const cpumask_t *core_mask = cpumask_of_node(cpu_to_node(cpu)); > + const cpumask_t *core_mask = &cpu_topology[cpu].core_sibling; > > - /* Find the smaller of NUMA, core or LLC siblings */ > - if (cpumask_subset(&cpu_topology[cpu].core_sibling, core_mask)) { > - /* not numa in package, lets use the package siblings */ > - core_mask = &cpu_topology[cpu].core_sibling; > - } > if (cpu_topology[cpu].llc_id != -1) { > if (cpumask_subset(&cpu_topology[cpu].llc_siblings, core_mask)) > core_mask = &cpu_topology[cpu].llc_siblings; > @@ -239,8 +234,10 @@ static void update_siblings_masks(unsigned int cpuid) > for_each_possible_cpu(cpu) { > cpu_topo = &cpu_topology[cpu]; > > - if (cpuid_topo->llc_id == cpu_topo->llc_id) > + if (cpuid_topo->llc_id == cpu_topo->llc_id) { > cpumask_set_cpu(cpu, &cpuid_topo->llc_siblings); > + cpumask_set_cpu(cpuid, &cpu_topo->llc_siblings); > + } > > if (cpuid_topo->package_id != cpu_topo->package_id) > continue; > Looks good to me for now. I might need to tweek it a bit when I add the support to update topology on hotplug. But that's for latter. For now, Reviewed-by: Sudeep Holla -- Regards, Sudeep