Received: by 2002:ab2:69cc:0:b0:1f4:be93:e15a with SMTP id n12csp1294307lqp; Mon, 15 Apr 2024 01:46:10 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCV/bfeYejXrHxD3ajqlt+EcpFYj2QdkvJAPpaZhIL7j2C4caqm95wa7HFvpoXiSTtCMNNtvPnArQC+qYMWxeCGOwgYL5BOFejTqTwJUQQ== X-Google-Smtp-Source: AGHT+IGtJt4llyEQpxmAlZl03VCjuAzGFngy7kex/ZzyUGgEDhjZSn2EdeVCFkQo5nZqxwf9BFXd X-Received: by 2002:a05:6a21:7897:b0:1a8:e79a:2b0a with SMTP id bf23-20020a056a21789700b001a8e79a2b0amr13698438pzc.0.1713170770150; Mon, 15 Apr 2024 01:46:10 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1713170770; cv=pass; d=google.com; s=arc-20160816; b=CkpZvg7YQWXa4jTH6wRiZA1jXcOY6Ln49IcZxC0neoCdllnQhF2tjKIRVncCSmAy7w UfIK2PI0KsSCP1PU1nSaz2NEdnhKDlk8mPDpSH0AOholVtbFseyUYM5yoHB1oHP0P53D mVY811ys/fCDvZvmHKC13DzyZha/Rdv3Y2i1pKouHKcmauCLTi1vc1RYgAycBVPWVd3O s1n5RpAKKhhSqeTy9Z7XeZk5skGFrrUBKWjWZ9TFb/HBJ6jL9ValmVPNyT8YeeKL+MoT QAzsto7KJy5R7/7pTu7A0Us7IgzeZrzJ2Aanyejh5N7ZWbV7ubeYMunvlfX8m4WgE5RR CsDw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:organization:references :in-reply-to:message-id:subject:cc:to:from:date; bh=EdfktfGGPKO/Li4NcEvvo4UMh2m2RiaO6ZVQftZGRLU=; fh=yx1pbZzinqLJz2Sprdv9oNsT+o3mYEcze3S17uuIrw8=; b=dgSzIDz0g4wmFoLYa7U6qgnvhNBJMgIfzZOje3BY+Bqdhg5NTR1VOnHuzbJLq8Cxlb 5WWMnGT/grUTmMvtBXIjRU3aQ3quG8jCE5nxn8t1mKy6GChrAq5DZ/MbEhNbc6V6TDAl G0Q2vY2raKa+dH31p0oHa/EE6opaXszUl75QP8Fw9w7hPtZrRxDoIJ1CeMTJ2OAAVy0u 9fbh19Y5yXSAgixfBisgaiOHBcPyifGMtMT/Ts5etDHkb0qyYGF1RXXvRL0Dpqk79lHv UivLEKfY7RUeaQSpq0T6txBjwZWCmwTt6ewjVnJdxASb7TrF3rTkILOWAabjo3aNvGFf n6+A==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; arc=pass (i=1 spf=pass spfdomain=huawei.com dmarc=pass fromdomain=huawei.com); spf=pass (google.com: domain of linux-kernel+bounces-144787-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45e3:2400::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-144787-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Return-Path: Received: from sv.mirrors.kernel.org (sv.mirrors.kernel.org. [2604:1380:45e3:2400::1]) by mx.google.com with ESMTPS id q3-20020a17090a2dc300b002a2da57aa5bsi7504574pjm.44.2024.04.15.01.46.09 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 15 Apr 2024 01:46:10 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-144787-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45e3:2400::1 as permitted sender) client-ip=2604:1380:45e3:2400::1; Authentication-Results: mx.google.com; arc=pass (i=1 spf=pass spfdomain=huawei.com dmarc=pass fromdomain=huawei.com); spf=pass (google.com: domain of linux-kernel+bounces-144787-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45e3:2400::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-144787-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sv.mirrors.kernel.org (Postfix) with ESMTPS id C6248281084 for ; Mon, 15 Apr 2024 08:46:09 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 5AA4C22EEF; Mon, 15 Apr 2024 08:46:00 +0000 (UTC) Received: from frasgout.his.huawei.com (frasgout.his.huawei.com [185.176.79.56]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id C84F43839C; Mon, 15 Apr 2024 08:45:56 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=185.176.79.56 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1713170759; cv=none; b=gBaPdjHwuc3+zG2LBuzkrDd8dyrcrZ8qWMeHj40jrnFgy49uT4jZp3IPYnQ2l+Vvg2SX+HkEBATiD3G69frfZodsQQdCJOJxSskcZsvXJEn32+a80l3b25xUJDV6ZDeBSa1gYjzUFwQ8QDkykurxEHkjXuYADl3tP5YfDiWfdqU= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1713170759; c=relaxed/simple; bh=OgrKO1lLfj0QwPpIfKi+OF6oT15+1u+32KvRkSoVxV8=; h=Date:From:To:CC:Subject:Message-ID:In-Reply-To:References: MIME-Version:Content-Type; b=egMj9IBPLe6HDI25x1d9k08CJljXiWYjre9TQbwxASgFTSj5jJSRrdARsJ78IP1zwcQaKEkrAeE7x+418fhzbJlt2WpskK2AFNr6KHRvwola2FS9ThMGdVEmjL3YBxuBAV0o5UuNB7XeOb8wcK/1bvzj2dFKqth/hgmoIUhcWKc= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=Huawei.com; spf=pass smtp.mailfrom=huawei.com; arc=none smtp.client-ip=185.176.79.56 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=Huawei.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=huawei.com Received: from mail.maildlp.com (unknown [172.18.186.231]) by frasgout.his.huawei.com (SkyGuard) with ESMTP id 4VJ11F19yNz6K60r; Mon, 15 Apr 2024 16:44:01 +0800 (CST) Received: from lhrpeml500005.china.huawei.com (unknown [7.191.163.240]) by mail.maildlp.com (Postfix) with ESMTPS id 423101400E7; Mon, 15 Apr 2024 16:45:54 +0800 (CST) Received: from localhost (10.202.227.76) by lhrpeml500005.china.huawei.com (7.191.163.240) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.1.2507.35; Mon, 15 Apr 2024 09:45:53 +0100 Date: Mon, 15 Apr 2024 09:45:52 +0100 From: Jonathan Cameron To: Thomas Gleixner CC: "Russell King (Oracle)" , "Rafael J. Wysocki" , , , , , , , , , Miguel Luis , James Morse , Salil Mehta , Jean-Philippe Brucker , Catalin Marinas , Will Deacon , , , Subject: Re: [PATCH v5 03/18] ACPI: processor: Register deferred CPUs from acpi_processor_get_info() Message-ID: <20240415094552.000008d7@Huawei.com> In-Reply-To: <878r1iyxkr.ffs@tglx> References: <20240412143719.11398-1-Jonathan.Cameron@huawei.com> <20240412143719.11398-4-Jonathan.Cameron@huawei.com> <87bk6ez4hj.ffs@tglx> <878r1iyxkr.ffs@tglx> Organization: Huawei Technologies Research and Development (UK) Ltd. X-Mailer: Claws Mail 4.1.0 (GTK 3.24.33; x86_64-w64-mingw32) Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-ClientProxiedBy: lhrpeml100002.china.huawei.com (7.191.160.241) To lhrpeml500005.china.huawei.com (7.191.163.240) On Sat, 13 Apr 2024 01:23:48 +0200 Thomas Gleixner wrote: > Russell! > > On Fri, Apr 12 2024 at 22:52, Russell King (Oracle) wrote: > > On Fri, Apr 12, 2024 at 10:54:32PM +0200, Thomas Gleixner wrote: > >> > As for the cpu locking, I couldn't find anything in arch_register_cpu() > >> > that depends on the cpu_maps_update stuff nor needs the cpus_write_lock > >> > being taken - so I've no idea why the "make_present" case takes these > >> > locks. > >> > >> Anything which updates a CPU mask, e.g. cpu_present_mask, after early > >> boot must hold the appropriate write locks. Otherwise it would be > >> possible to online a CPU which just got marked present, but the > >> registration has not completed yet. > > > > Yes. As far as I've been able to determine, arch_register_cpu() > > doesn't manipulate any of the CPU masks. All it seems to be doing > > is initialising the struct cpu, registering the embedded struct > > device, and setting up the sysfs links to its NUMA node. > > > > There is nothing obvious in there which manipulates any CPU masks, and > > this is rather my fundamental point when I said "I couldn't find > > anything in arch_register_cpu() that depends on ...". > > > > If there is something, then comments in the code would be a useful aid > > because it's highly non-obvious where such a manipulation is located, > > and hence why the locks are necessary. > > acpi_processor_hotadd_init() > ... > acpi_map_cpu(pr->handle, pr->phys_id, pr->acpi_id, &pr->id); > > That ends up in fiddling with cpu_present_mask. > > I grant you that arch_register_cpu() is not, but it might rely on the > external locking too. I could not be bothered to figure that out. > > >> Define "real hotplug" :) > >> > >> Real physical hotplug does not really exist. That's at least true for > >> x86, where the physical hotplug support was chased for a while, but > >> never ended up in production. > >> > >> Though virtualization happily jumped on it to hot add/remove CPUs > >> to/from a guest. > >> > >> There are limitations to this and we learned it the hard way on X86. At > >> the end we came up with the following restrictions: > >> > >> 1) All possible CPUs have to be advertised at boot time via firmware > >> (ACPI/DT/whatever) independent of them being present at boot time > >> or not. > >> > >> That guarantees proper sizing and ensures that associations > >> between hardware entities and software representations and the > >> resulting topology are stable for the lifetime of a system. > >> > >> It is really required to know the full topology of the system at > >> boot time especially with hybrid CPUs where some of the cores > >> have hyperthreading and the others do not. > >> > >> > >> 2) Hot add can only mark an already registered (possible) CPU > >> present. Adding non-registered CPUs after boot is not possible. > >> > >> The CPU must have been registered in #1 already to ensure that > >> the system topology does not suddenly change in an incompatible > >> way at run-time. > >> > >> The same restriction would apply to real physical hotplug. I don't think > >> that's any different for ARM64 or any other architecture. > > > > This makes me wonder whether the Arm64 has been barking up the wrong > > tree then, and whether the whole "present" vs "enabled" thing comes > > from a misunderstanding as far as a CPU goes. > > > > However, there is a big difference between the two. On x86, a processor > > is just a processor. On Arm64, a "processor" is a slice of the system > > (includes the interrupt controller, PMUs etc) and we must enumerate > > those even when the processor itself is not enabled. This is the whole > > reason there's a difference between "present" and "enabled" and why > > there's a difference between x86 cpu hotplug and arm64 cpu hotplug. > > The processor never actually goes away in arm64, it's just prevented > > from being used. > > It's the same on X86 at least in the physical world. There were public calls on this via the Linaro Open Discussions group, so I can talk a little about how we ended up here. Note that (in my opinion) there is zero chance of this changing - it took us well over a year to get to this conclusion. So if we ever want ARM vCPU HP we need to work within these constraints. The ARM architecture folk (the ones defining the ARM ARM, relevant ACPI specs etc, not the kernel maintainers) are determined that they want to retain the option to do real physical CPU hotplug in the future with all the necessary work around dynamic interrupt controller initialization, debug and many other messy corners. Thus anything defined had to be structured in a way that was 'different' from that. I don't mind the proposed flattening of the 2 paths if the ARM kernel maintainers are fine with it but it will remove the distinctions and we will need to be very careful with the CPU masks - we can't handle them the same as x86 does. I'll get on with doing that, but do need input from Will / Catalin / James. There are some quirks that need calling out as it's not quite a simple as it appears from a high level. Another part of that long discussion established that there is userspace (Android IIRC) in which the CPU present mask must include all CPUs at boot. To change that would be userspace ABI breakage so we can't do that. Hence the dance around adding yet another mask to allow the OS to understand which CPUs are 'present' but not possible to online. Flattening the two paths removes any distinction between calls that are for real hotplug and those that are for this online capable path. As a side note, the indicating bit for these flows is defined in ACPI for x86 from ACPI 6.3 as a flag in Processor Local APIC (the ARM64 definition is a cut and paste of that text). So someone is interested in this distinction on x86. I can't say who but if you have a mantis account you can easily follow the history and it might be instructive to not everyone considering the current x86 flow the right way to do it. Jonathan > > Thanks, > > tglx >