Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp6581237ybi; Mon, 8 Jul 2019 05:23:10 -0700 (PDT) X-Google-Smtp-Source: APXvYqw9IRSfOrRqO1IzOqoO4+GW6J/1jc63fBmSAK9c2WmrtuVRySBkCqJbsrfi/hS1FL4nUKNQ X-Received: by 2002:a17:902:722:: with SMTP id 31mr24021748pli.163.1562588589952; Mon, 08 Jul 2019 05:23:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1562588589; cv=none; d=google.com; s=arc-20160816; b=Rc04VnkRnPd1ZN119m2DAapzk8smKXYMJc82Mps9Z5mo4c+CTDrzXo78pLhZbscOkg mzFFMJyxcFImERazDo0OCtPjYqsUTUp/9OllG13OYWB5yD0acFJeGXpkd0x5zYNR3teE 72iQtcXpH0axMT1+M9PXGqOyAs5dQ8rhkxro0hU8k4TptryMNuaTJxt+dGf6JHc6Zaop RqjqsUsuMAPWIXBUPNhZ/yJUlSLxdSn12sfF4zcTcpdPjCXrdI3fB91NOGuSHouzN4Ek Wnbzj3iiLwwfZqZvt6L41VjlGscOFOEY48O7HT4x1+jcntDe6O9Tab4bCnvcbQvDGUpu bw9g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=5KXyzOyddJPSYupMaPtJc3n1+Qs9tx3Gcb2Q7zVo0M8=; b=OuUkNgN6sauu14bCnam4j3owBf09oEs4hARNC/1qn2C7YuhBCo66hZjDEWBkhSm/Hk JCaSn2oAhJtYql35JP/bXlA26eH/Y2rsuddLsNW18cnwJvhVFHW/Sci86m7PL3FCUj5C jVE+9c1aQwhbMHEv2MicJxSYEPT10DdoqWweFi11FDmpicRP67J8t9cuQQcmToYG/pLx r29JIEjEB6h0pyvYMfzLt1S5pkonyjIVVu0OyK5s7+Gs7I/VxcdbB4QSnbMxMt0hjrjt yR2Hdx27JfnAqWYdMIeEVk6D5oqCWmr9JRfU9OjDQh1f/AeJQ4fscDUQuEWuDnxa2Nod 2iZg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=cOMsmIgo; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id q23si18759062pff.103.2019.07.08.05.22.55; Mon, 08 Jul 2019 05:23:09 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=cOMsmIgo; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729397AbfGHIgg (ORCPT + 99 others); Mon, 8 Jul 2019 04:36:36 -0400 Received: from mail-io1-f68.google.com ([209.85.166.68]:40705 "EHLO mail-io1-f68.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729250AbfGHIgg (ORCPT ); Mon, 8 Jul 2019 04:36:36 -0400 Received: by mail-io1-f68.google.com with SMTP id h6so25112304iom.7 for ; Mon, 08 Jul 2019 01:36:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=5KXyzOyddJPSYupMaPtJc3n1+Qs9tx3Gcb2Q7zVo0M8=; b=cOMsmIgo9no2UIxYQtl0IEhs9cOfaY/6hH+TSNYWkEVjYrtyDyiSG+Pawi7rst84iL aYMFDkhVgQ713ZWq/WNdEtQUY3srXzZzw6MniZc45HgtCzqilWOYA4acWuiVnXpt6yCn dYF7IAD/K7goEhoSST/5CMrxkxRbHqybP9YsubSqGJ+1GueL5Ta2hGjNUlxXwFCc1BEO WZ1+QrX+fsuoq8p3EohEENriAsyZrotv0Lh+a/HQbO5y1KeHvr9lhZOUp8qSZZ5AaeL4 HN3Qosc6+tKtmoIL0erNdz9FKdlU/HmzbGtDN9Tbcxr+J5fBiL5lJMyfjeWH2F/YDbOz ajKA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=5KXyzOyddJPSYupMaPtJc3n1+Qs9tx3Gcb2Q7zVo0M8=; b=UiAj3z2gw+2WCH8ux/rSxqHqqEy6IeDQW0pftyII0fw/1uWtLZyJ+jjxWZFiLeADXd 4sZqSaOwRWF/AnVuTHPn09f6MHY3S3Q6RVvpIRiB2Zm3JR2Wubx2wd+jfmqEo1nCVc7P Ajrk5gZ+fYIZBCt7FMiPnzRtfijmWspNbZxWkZn/rkz5f3WMYUiqYsuwzWj+nT/d1NP5 0bKnrscJq3YXDBBjMz4r2bWfGOihAQMAnyU4Jy3+Jkys53Wfmh2iHoau/51zkpqlnyN2 /r+e/JwtR4ZQM3FW3ZJSs5epBVRzUK9sKaRHtrv0Am/+5YYYtWum/QWW7nZFck54g2vU 4C+w== X-Gm-Message-State: APjAAAV8atoKC+/yfNmbjEPVJPHsHNRq2Q67t4gxD+wqaiKEELS7J+U6 hRWFSKlhDTSfAyviIzGGKKvNF0KmtNniTlSL1A== X-Received: by 2002:a6b:6f06:: with SMTP id k6mr13682792ioc.32.1562574994974; Mon, 08 Jul 2019 01:36:34 -0700 (PDT) MIME-Version: 1.0 References: <1562300143-11671-1-git-send-email-kernelfans@gmail.com> <1562300143-11671-2-git-send-email-kernelfans@gmail.com> In-Reply-To: From: Pingfan Liu Date: Mon, 8 Jul 2019 16:36:23 +0800 Message-ID: Subject: Re: [PATCH 2/2] x86/numa: instance all parsed numa node To: Thomas Gleixner Cc: x86@kernel.org, Michal Hocko , Dave Hansen , Mike Rapoport , Tony Luck , Andy Lutomirski , Peter Zijlstra , Ingo Molnar , Borislav Petkov , "H. Peter Anvin" , Andrew Morton , Vlastimil Babka , Oscar Salvador , Pavel Tatashin , Mel Gorman , Benjamin Herrenschmidt , Michael Ellerman , Stephen Rothwell , Qian Cai , Barret Rhoden , Bjorn Helgaas , David Rientjes , linux-mm@kvack.org, LKML Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Jul 8, 2019 at 3:44 AM Thomas Gleixner wrote: > > On Fri, 5 Jul 2019, Pingfan Liu wrote: > > > I hit a bug on an AMD machine, with kexec -l nr_cpus=4 option. nr_cpus option > > is used to speed up kdump process, so it is not a rare case. > > But fundamentally wrong, really. > > The rest of the CPUs are in a half baken state and any broadcast event, > e.g. MCE or a stray IPI, will result in a undiagnosable crash. Very appreciate if you can pay more word on it? I tried to figure out your point, but fail. For "a half baked state", I think you concern about LAPIC state, and I expand this point like the following: For IPI: when capture kernel BSP is up, the rest cpus are still loop inside crash_nmi_callback(), so there is no way to eject new IPI from these cpu. Also we disable_local_APIC(), which effectively prevent the LAPIC from responding to IPI, except NMI/INIT/SIPI, which will not occur in crash case. For MCE, I am not sure whether it can broadcast or not between cpus, but as my understanding, it can not. Then is it a problem? From another view point, is there any difference between nr_cpus=1 and nr_cpus> 1 in crashing case? If stray IPI raises issue to nr_cpus>1, it does for nr_cpus=1. Thanks, Pingfan