Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp3740351ybb; Mon, 23 Mar 2020 06:52:04 -0700 (PDT) X-Google-Smtp-Source: ADFU+vuW8s/OEVUEFJM1T0CMS0vtqSuoiDcqusK9p+1jWs+B4Foy5badU7SEqqUH8ThzPNjKqnpc X-Received: by 2002:a9d:708a:: with SMTP id l10mr18879705otj.152.1584971524667; Mon, 23 Mar 2020 06:52:04 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1584971524; cv=none; d=google.com; s=arc-20160816; b=abGhfmd8y9BdsC9uRNlylqwU9zQ2gdKT+5acF1o+yHJfbmLduv3gyN+Bfgm+U93vX0 mkrE8Ip9eN8KSAfuOA6V1xUrpAPVTKeMkyKb83qMj+MRmU77jAf5/uIXngRn0OVky6Lp A/uwt9dx8q9xh++/C3UlS0Cj3lsv4KIJfZul4LI7oUHi+nLkwwF3JVAqPlWJgZGxQMiR oQcNFXLgCeZSuYDBGtZKGEShRCSXtMJThrlv8AJOUI/bgXIodGgeaGHUZ40mgG6J9wtK Y+vvF4O5wAhH0J6loV22+7qx1U2yGdTd+StrzkUBPjx3w8aqe7iNc/BWhel1SfTSm5mw J5qA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:cc:to:from; bh=xTO7LRD7QlyH9vATJKyX5Wj0tbcAuKeLGozsQKM39P0=; b=r/cLKWS9dD0MMCJjWGlKl89bSlUZ+/evwnEt1dYJdIWHztWeBop3jgIKpRgMNS1Zc9 ioqdzucN7Zyvg8nG8s6hIPi4VssLESRiJ4A0oRpWWi9XgTrPvHs6vhPJA3d6txqiPKYT JQMdu+pPzaw6F5mErmZUjmkIxnCnZR/0t2aq96fWIYBM+7yDEqIo9jCTc2R0cxlFhRGf KO/t6Nv36quJ8sg4MzvXtr1acF7GeF81fhuB+wwmpnShikl8kpoWZLNQpDp/rRVebVrt IlhQIR9zMZn4YJJYkNncZ/cbnS/PMU8oEksMf66rjUecq1ChXrpScbPzIYOVUFDdS9xS Vl8Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t28si220856ooc.16.2020.03.23.06.51.51; Mon, 23 Mar 2020 06:52:04 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728544AbgCWNvS (ORCPT + 99 others); Mon, 23 Mar 2020 09:51:18 -0400 Received: from foss.arm.com ([217.140.110.172]:49402 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728423AbgCWNvS (ORCPT ); Mon, 23 Mar 2020 09:51:18 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 2962E1FB; Mon, 23 Mar 2020 06:51:17 -0700 (PDT) Received: from e107158-lin.cambridge.arm.com (e107158-lin.cambridge.arm.com [10.1.195.21]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 82C293F52E; Mon, 23 Mar 2020 06:51:14 -0700 (PDT) From: Qais Yousef To: Thomas Gleixner Cc: linux-kernel@vger.kernel.org, Qais Yousef , Tony Luck , Fenghua Yu , Russell King , Catalin Marinas , Michael Ellerman , "David S. Miller" , Helge Deller , Juergen Gross , Mark Rutland , Lorenzo Pieralisi , "Paul E. McKenney" , Greg Kroah-Hartman , xen-devel@lists.xenproject.org, linux-parisc@vger.kernel.org, sparclinux@vger.kernel.org, linuxppc-dev@lists.ozlabs.org, x86@kernel.org, linux-arm-kernel@lists.infradead.org, linux-ia64@vger.kernel.org Subject: [PATCH v4 00/17] Convert cpu_up/down to device_online/offline Date: Mon, 23 Mar 2020 13:50:53 +0000 Message-Id: <20200323135110.30522-1-qais.yousef@arm.com> X-Mailer: git-send-email 2.17.1 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org ============= Changes in v4 ============= * Split arm and arm64 patches so that the change to use reboot_cpu goes into its own separate patch (Russell) * Collected new Acked-by * Rebased on top of v5.6-rc6 * Trimmed the CC list on the cover letter as lists were rejecting it git clone git://linux-arm.org/linux-qy.git -b cpu-hp-cleanup-v4 Older post can be found here ---------------------------- https://lore.kernel.org/lkml/20200223192942.18420-2-qais.yousef@arm.com/ ============= Test Coverage ============= All tests ran with LOCKDEP enabled. Platform: Juno-r2: arm64 ------------------------ * Overnight rcutorture * Overnight locktorture * kexec -f Image --command="$(cat /proc/cmdline) reboot=s[0-5]" * Hibernate to disk (using suspend option) * Userspace hotplug via sysfs * PSCI firemware checker Notes: * Couldn't convince Juno to hibernate using [reboot] or [shutdown] options. Platform: qemu (8 vCPUs) and VM (2 vCPUs): x86_64 ------------------------------------------------- * Overnight rcutorture * Overnight locktorture * Userspace hotplug via sysfs * echo mmiotrace > /sys/kernel/debug/tracing/current_tracer && echo nop > /sys/kernel/debug/tracing/current_tracer * Ran with CONFIG_DEBUG_HOTPLUG_CPU0 and CONFIG_BOOTPARAM_HOTPLUG_CPU0 Notes: * qemu failed to bring cpu0 after offlining. Same behavior observed on vanilla v5.6-rc6. Worked fine on the VM. * mmiotrace successfully brought down all cpus when enabled, then back online again when disabled. Including when cpu0 was offline. * My xen shenanigans are too 'humble' too create environment to test the change in xen yet.. ===================== Original Cover Letter ===================== Using cpu_up/down directly to bring cpus online/offline loses synchronization with sysfs and could suffer from a race similar to what is described in commit a6717c01ddc2 ("powerpc/rtas: use device model APIs and serialization during LPM"). cpu_up/down seem to be more of a internal implementation detail for the cpu subsystem to use to boot up cpus, perform suspend/resume and low level hotplug operations. Users outside of the cpu subsystem would be better using the device core API to bring a cpu online/offline which is the interface used to hotplug memory and other system devices. Several users have already migrated to use the device core API, this series converts the remaining users and hides cpu_up/down from internal users at the end. I noticed this problem while working on a hack to disable offlining a particular CPU but noticed that setting the offline_disabled attribute in the device struct isn't enough because users can easily bypass the device core. While my hack isn't a valid use case but it did highlight the inconsistency in the way cpus are being onlined/offlined and this attempt hopefully improves on this. The first patch introduces new API to {add,remove}_cpu() using device_{online, offline}() with correct locks held and export it. The following 10 patches fix arch users. The remaining 6 patches fix generic code users. Particularly creating a new special exported API for the device core to use instead of cpu_up/down. The last patch removes cpu_up/down from cpu.h and unexport the functions. In some cases where the use of cpu_up/down seemed legitimate, I encapsulated the logic in a higher level - special purposed function; and converted the code to use that instead. CC: Thomas Gleixner CC: Tony Luck CC: Fenghua Yu CC: Russell King CC: Catalin Marinas CC: Michael Ellerman CC: "David S. Miller" CC: Helge Deller CC: Juergen Gross CC: Mark Rutland CC: Lorenzo Pieralisi CC: "Paul E. McKenney" CC: Greg Kroah-Hartman CC: xen-devel@lists.xenproject.org CC: linux-parisc@vger.kernel.org CC: sparclinux@vger.kernel.org CC: linuxppc-dev@lists.ozlabs.org CC: x86@kernel.org CC: linux-arm-kernel@lists.infradead.org CC: linux-ia64@vger.kernel.org CC: linux-kernel@vger.kernel.org Qais Yousef (17): cpu: Add new {add,remove}_cpu() functions smp: Create a new function to shutdown nonboot cpus ia64: Replace cpu_down with smp_shutdown_nonboot_cpus() arm: Don't use disable_nonboot_cpus() arm: Use reboot_cpu instead of hardcoding it to 0 arm64: Don't use disable_nonboot_cpus() arm64: Use reboot_cpu instead of hardconding it to 0 arm64: hibernate.c: Create a new function to handle cpu_up(sleep_cpu) x86: Replace cpu_up/down with add/remove_cpu powerpc: Replace cpu_up/down with add/remove_cpu sparc: Replace cpu_up/down with add/remove_cpu parisc: Replace cpu_up/down with add/remove_cpu driver: xen: Replace cpu_up/down with device_online/offline firmware: psci: Replace cpu_up/down with add/remove_cpu torture: Replace cpu_up/down with add/remove_cpu smp: Create a new function to bringup nonboot cpus online cpu: Hide cpu_up/down arch/arm/kernel/reboot.c | 4 +- arch/arm64/kernel/hibernate.c | 13 +-- arch/arm64/kernel/process.c | 4 +- arch/ia64/kernel/process.c | 8 +- arch/parisc/kernel/processor.c | 2 +- arch/powerpc/kexec/core_64.c | 2 +- arch/sparc/kernel/ds.c | 4 +- arch/x86/kernel/topology.c | 22 ++--- arch/x86/mm/mmio-mod.c | 4 +- arch/x86/xen/smp.c | 2 +- drivers/base/cpu.c | 4 +- drivers/firmware/psci/psci_checker.c | 4 +- drivers/xen/cpu_hotplug.c | 2 +- include/linux/cpu.h | 10 +- kernel/cpu.c | 134 ++++++++++++++++++++++++++- kernel/smp.c | 9 +- kernel/torture.c | 9 +- 17 files changed, 172 insertions(+), 65 deletions(-) -- 2.17.1