Received: by 2002:a5b:505:0:0:0:0:0 with SMTP id o5csp3400599ybp; Sun, 6 Oct 2019 11:05:58 -0700 (PDT) X-Google-Smtp-Source: APXvYqyebP02AJOzI1EFMbrNHc+Utd2t6heFK+5Ap05HZacmSomSdZb8LJyOCCOg1MmO5WL/mSaN X-Received: by 2002:a50:f703:: with SMTP id g3mr25426709edn.43.1570385157958; Sun, 06 Oct 2019 11:05:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1570385157; cv=none; d=google.com; s=arc-20160816; b=GATBMtn0iOhLpjy/JBzU83oQbqcel85VzR2MfD26oIBt3bYP32r2eBzKqWu6iBN9a2 2Rh+qVRzJUwzNfCaE4K65iI8Bibf9a91gn2Ep4hHfrkn0g3YEzmHxSe3jaj/eIShTHWh 2T+d7xgNCDjnhlcP+/7Q+sGoNy5veJRREIyLeOz8WFuGJRPLKTWDOmXwHNVgiCu1pKJr LIAdm4HrOLC6eLEuN6TwhxK/43M0LhBtl9Qyk4qfUoAL/7JPC/kG/zTbUwHgfh7CZFKv 8i12pvnJtScpX2Y9DX0wfTG/n4Lzw++yb+Dok6k4l6x2f7m/I3mwLlpFwVsCtwUsIFmo DKrA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=qDULerQksexUuT3X7cbRDJF2m4ytwtp/kggeOwhyA9E=; b=f6tNfmpl6TFUc0Y8M+FnaG344OrVnpmYDk8TLSVrMMTTdpvy2sM0DQf6ets26wBIU0 8Ucxwj1i+G15rpw2hwHZi0zOqqA7WEiO9jAKV2nDUuWw5bHC927ELEwGXfkznK5n6f8p ZbZdaaRRk5ysdgwvadDPE8SHUEC4QbWIJEYpKl4uO/XAegFK/cnGUYcZr10oMxKotW+o MIhPWLeHYLdnCMdJLZOh+gIuOskVADq9iGKKuzcmAEpX7Dp37CmrLC51Ka3TIV2rpRzv +g9MzBxC9vmBZ0pXyd7+XCfik5Ov0BOnFx+8JrQxSMPnyKY6srLHPL/16ur/Pcobsd+h nh/w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=KkE226Aj; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id l26si7127818edb.258.2019.10.06.11.05.34; Sun, 06 Oct 2019 11:05:57 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=KkE226Aj; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727355AbfJFRZb (ORCPT + 99 others); Sun, 6 Oct 2019 13:25:31 -0400 Received: from mail.kernel.org ([198.145.29.99]:50430 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728083AbfJFRZX (ORCPT ); Sun, 6 Oct 2019 13:25:23 -0400 Received: from localhost (83-86-89-107.cable.dynamic.v4.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id D80BD20867; Sun, 6 Oct 2019 17:25:21 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1570382722; bh=Y1mU3yamgEEqPOt5jB7Oj42CunCeU8jAvCNAfpFIBAk=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=KkE226AjfpCJsdQFjeK2bdtROAGMXqPnD/rd+xvDGLi9FRmdrGqGREyDAK0XHEYcu 394XmRbVeiZUBojgYqVk4fUVPygvvha3QMGmIoUyx8vBqF6ICDWffocYA9Vtdzg3R4 SUz6IqLLloFI6FlzsVmThw6GwlDRu6tnxVLw4fvw= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Nathan Lynch , "Gautham R. Shenoy" , Michael Ellerman , Sasha Levin Subject: [PATCH 4.14 16/68] powerpc/rtas: use device model APIs and serialization during LPM Date: Sun, 6 Oct 2019 19:20:52 +0200 Message-Id: <20191006171116.099196571@linuxfoundation.org> X-Mailer: git-send-email 2.23.0 In-Reply-To: <20191006171108.150129403@linuxfoundation.org> References: <20191006171108.150129403@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Nathan Lynch [ Upstream commit a6717c01ddc259f6f73364779df058e2c67309f8 ] The LPAR migration implementation and userspace-initiated cpu hotplug can interleave their executions like so: 1. Set cpu 7 offline via sysfs. 2. Begin a partition migration, whose implementation requires the OS to ensure all present cpus are online; cpu 7 is onlined: rtas_ibm_suspend_me -> rtas_online_cpus_mask -> cpu_up This sets cpu 7 online in all respects except for the cpu's corresponding struct device; dev->offline remains true. 3. Set cpu 7 online via sysfs. _cpu_up() determines that cpu 7 is already online and returns success. The driver core (device_online) sets dev->offline = false. 4. The migration completes and restores cpu 7 to offline state: rtas_ibm_suspend_me -> rtas_offline_cpus_mask -> cpu_down This leaves cpu7 in a state where the driver core considers the cpu device online, but in all other respects it is offline and unused. Attempts to online the cpu via sysfs appear to succeed but the driver core actually does not pass the request to the lower-level cpuhp support code. This makes the cpu unusable until the cpu device is manually set offline and then online again via sysfs. Instead of directly calling cpu_up/cpu_down, the migration code should use the higher-level device core APIs to maintain consistent state and serialize operations. Fixes: 120496ac2d2d ("powerpc: Bring all threads online prior to migration/hibernation") Signed-off-by: Nathan Lynch Reviewed-by: Gautham R. Shenoy Signed-off-by: Michael Ellerman Link: https://lore.kernel.org/r/20190802192926.19277-2-nathanl@linux.ibm.com Signed-off-by: Sasha Levin --- arch/powerpc/kernel/rtas.c | 11 ++++++++--- 1 file changed, 8 insertions(+), 3 deletions(-) diff --git a/arch/powerpc/kernel/rtas.c b/arch/powerpc/kernel/rtas.c index 1643e9e536557..141d192c69538 100644 --- a/arch/powerpc/kernel/rtas.c +++ b/arch/powerpc/kernel/rtas.c @@ -874,15 +874,17 @@ static int rtas_cpu_state_change_mask(enum rtas_cpu_state state, return 0; for_each_cpu(cpu, cpus) { + struct device *dev = get_cpu_device(cpu); + switch (state) { case DOWN: - cpuret = cpu_down(cpu); + cpuret = device_offline(dev); break; case UP: - cpuret = cpu_up(cpu); + cpuret = device_online(dev); break; } - if (cpuret) { + if (cpuret < 0) { pr_debug("%s: cpu_%s for cpu#%d returned %d.\n", __func__, ((state == UP) ? "up" : "down"), @@ -971,6 +973,8 @@ int rtas_ibm_suspend_me(u64 handle) data.token = rtas_token("ibm,suspend-me"); data.complete = &done; + lock_device_hotplug(); + /* All present CPUs must be online */ cpumask_andnot(offline_mask, cpu_present_mask, cpu_online_mask); cpuret = rtas_online_cpus_mask(offline_mask); @@ -1002,6 +1006,7 @@ int rtas_ibm_suspend_me(u64 handle) __func__); out: + unlock_device_hotplug(); free_cpumask_var(offline_mask); return atomic_read(&data.error); } -- 2.20.1