Received: by 10.223.185.116 with SMTP id b49csp3726998wrg; Tue, 13 Feb 2018 06:54:06 -0800 (PST) X-Google-Smtp-Source: AH8x2260fvadX5xej1hZOTzTBTsftJCjzG8xxi7+Eq6Ap0Br2Z0aDeRN4QNjh7gWjmS70qDLZyZC X-Received: by 10.98.71.3 with SMTP id u3mr1204990pfa.219.1518533646278; Tue, 13 Feb 2018 06:54:06 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1518533646; cv=none; d=google.com; s=arc-20160816; b=BFohZEDIBh6NHLyxRF3qgKfJgE2fMvD5xw07M2GLH1ppAsh5ab0EP+R0fR+NoB+6Jn XFyzoG9h6cIqOwQiC7hu8U+4NjhBTbgOpXc+HIpSkUKB8Pyir1Z7ug2pwTjkcr9DhMQT Gfr2Zx/Qd/GK8R80sx5RNh+f5MAtsuylFuW36ntno+z/F2GdMm45cmOwQTSiPPvkwyKg RB8LL1uItjYRxQJ70siPK4QsKyGuYcuh0K8idFyB4jyFNNNyvuM0Bl2m427SeLjDiWtn yZqVPU6+xEi36B6wrJZvOmC4poA+CrSrRIY/GHN1fFBeyFGLrf3MUDf5iWnT19G7SNq+ k5pg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dkim-signature :arc-authentication-results; bh=pAWd7osPpctnqVI0gmYnxIBpw2kTHRSssyxWnJiGCWQ=; b=v8s/9reMRwPBuNAvds29soRT1xUrXCsPQ6RDCf16yo0kjR+tVjtkhVRvivcVf1mruf Rr6Khic3eOJFPL6kF4DG/iBo2gUucNMec0NC6TNfI+rCDdKRf90iH8u37vfXqUHMy6rV BwmeWaDf01HhL+6nxIB6zrmcRC+SAweZkLe0oEJvOS6clv8mL6GSc1HTuY4ovYPgRrVa BboYUPA6X67lRt7Gma+p+whlUWv1vVD7f4nIY4RgLKTLz/bvRVkt7834cXQsxNz6y96m XIfvc9FogMyArkbrLT5GVyiIxDKdI4P4DY8mbJQI3W3mmh04ECDFUX3UT3sTgBGcEWNo fo9A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ursulin-net.20150623.gappssmtp.com header.s=20150623 header.b=CjDASzsq; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 97-v6si1397566ple.430.2018.02.13.06.53.50; Tue, 13 Feb 2018 06:54:06 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@ursulin-net.20150623.gappssmtp.com header.s=20150623 header.b=CjDASzsq; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S935508AbeBMOwA (ORCPT + 99 others); Tue, 13 Feb 2018 09:52:00 -0500 Received: from mail-wm0-f47.google.com ([74.125.82.47]:51042 "EHLO mail-wm0-f47.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S935258AbeBMOv7 (ORCPT ); Tue, 13 Feb 2018 09:51:59 -0500 Received: by mail-wm0-f47.google.com with SMTP id f71so16533228wmf.0 for ; Tue, 13 Feb 2018 06:51:59 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ursulin-net.20150623.gappssmtp.com; s=20150623; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=pAWd7osPpctnqVI0gmYnxIBpw2kTHRSssyxWnJiGCWQ=; b=CjDASzsqcxt3Pv34cg+SbLaQxZ7gkhZMDBCIe7sv4dZinaakqPM2kzqD/yaCjz2XVC 2XsCi0wxJoz830IMI3ToRIpMjyt9PYbexUbjBxPHnuuPLCywndRt9Oo9Z2nyEW2/TftR wWKx58MA9qu+hJ5bgG9Wp9Fxf+4C0PM/A6TwfcVgyOL214wbe1OSi62GdxuZPwO7/93b lS+r7jMM6uiySjxymeOhkPCiNkD4jgW8xpB/iIW3MmFRQxMHLKfPPbkd7qPlVS/0px+F vGTVKEFRp0u2wt85Zian3tTygw87WT+Jh51Or0V19b+x5PpEHGzoRpXTwVnEagX1hOHY 3LGA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=pAWd7osPpctnqVI0gmYnxIBpw2kTHRSssyxWnJiGCWQ=; b=Kq5m7jT7ky7E4jMEKW17+VkLKo4xUU5RlJpjaPCz3/X5omEDU64gnrs/xOTHn3uWu/ AYXOvcqt2cMzinDLCupxHp/i7Lio4/IR8EMVLhZab4c/FUUxhGzBCq34IsOwoRsCyA08 VypJMCrJ3Nlu8i5SjYbNBUrvan4Ic8dX55pYb0g7pSBv1uF7SiH+H8/2u74iA6ttnMS3 Fyy3r4HsRN3oSZQaVnZHAOYXz2mH8qW6vFUCXqGUJvOdtCxXeoFA8FoIrWrivHdQO2vm BGAh5aIBG1gfeS1ewA8Cn2pHm8T/WHXyt23Mjptt2f41D/LTuTh0w32oAo52lriVcC3L pjcg== X-Gm-Message-State: APf1xPDn2Wqq793t0TxkqE7N6+bE91e/ZjZ8gV9tsCS4eTKuMNbE0GaS 3FBHtS66UDu9TAlnnUnsoPMW9ukh X-Received: by 10.28.154.67 with SMTP id c64mr1628446wme.125.1518533518228; Tue, 13 Feb 2018 06:51:58 -0800 (PST) Received: from [192.168.0.153] ([95.146.144.186]) by smtp.googlemail.com with ESMTPSA id k34sm2330006wrk.95.2018.02.13.06.51.57 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 13 Feb 2018 06:51:57 -0800 (PST) Subject: Re: smpboot: do_boot_cpu failed(-1) to wakeup CPU#0 To: Thomas Gleixner Cc: Ingo Molnar , x86@kernel.org, linux-kernel@vger.kernel.org References: <1696e2c6-8d0a-e954-1205-439d70a81f77@ursulin.net> From: Tvrtko Ursulin Message-ID: Date: Tue, 13 Feb 2018 14:51:57 +0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, On 13/02/18 14:39, Thomas Gleixner wrote: > On Tue, 13 Feb 2018, Tvrtko Ursulin wrote: >> On 07/02/18 12:48, Tvrtko Ursulin wrote: >>> We are seeing failures to online the CPU0 on Apollo Lake in the form of: >>> >>> <6>[ 126.508783] smpboot: CPU 0 is now offline >>> <6>[ 127.520746] smpboot: Booting Node 0 Processor 0 APIC 0x0 >>> <3>[ 137.521036] smpboot: do_boot_cpu failed(-1) to wakeup CPU#0 >>> >>> I unfortunately cannot say with which kernel version this started since >>> we added a test which does this only recently. I also have no local >>> access to this machine. (It is part of a test farm for i915 driver >>> development testing.) But we recently added a test which off-lines, and >>> on-lines back, CPUs and started seeing this. Small reproducer looks like >>> this (without boilerplate): >> >> Any hints on how to debug this? Could it be firwmare? Try some boot options or >> something? > > There are issues with CPU0 hotplug on commodity hardware. I have systems > where it does not work, but TBH I never bothered to investigate it. Some > years ago we had issues with suspend/resume when it was not running on > CPU0. These were related to firmware assumptions about CPU0. So I wouldn't > be too surprised if there are general issues with unplugging CPU0. > > CPU0 unplug is really only relevant for systems which support physical > hotplug, so testing it on commodity hardware does not have much > value. Testing in VMs for increasing the test coverage works well enough. Thanks, that explains it. We actually use CPU hotplug just to test if the PMU event migration and accounting works as expected in i915 PMU. And since, luckily, the issue with CPU0 hotplug manifests only on one of the test systems, I think we will just skip this test on that machine. Thanks again! Tvrtko