Received: by 2002:a05:6a10:17d3:0:0:0:0 with SMTP id hz19csp233296pxb; Fri, 16 Apr 2021 04:25:14 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyi4MgMmD2spXWhHUcguwAW/a/Oqgg3HE8Wkl86mcSUT9CqD3ukH8z7VZtxiJ9stdGoa5Pi X-Received: by 2002:a63:f801:: with SMTP id n1mr6930586pgh.231.1618572314804; Fri, 16 Apr 2021 04:25:14 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1618572314; cv=none; d=google.com; s=arc-20160816; b=LmtscekuOICSCl090VkA9nNtVQ3FA3SKtKPHkdP2OXpmdMTIKiuQP78e3yMxs/vz4E 0WHKU2h85JH/RO7N7MXQgTZgzgP67bW8bSNRHlgt/xgJzt0yXqLhRJ5pxG1KAXzF7jJP W7AFC/B5T1bF9Npd9tj4oOYNVf1IfKA+7LfRAeifrQZrzCj/yBQW18knOrX+ttl3tixE M2kqOsq2aqJl7vhDHjJ5faab8/EyqG6xsU3XXc9W8ajrVaDx6u4D9LRr7E68ZCTOth3Z klv0XbGm4J/+KZc51pToV4LmgjtYGVKu7+KtiOoKqEZ10FMbM2+xKhmujV1MxTg57R6q owiQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:dkim-signature; bh=WK9ZHnjIfRIuS8OH5gbd+xot7IxDbuuJKa54sA4gQv0=; b=UT9VFTCOGwHmq9ujux4YMkizkwaROv8sDjQnxiAK68IIatbZprH3vADIHKJ8X/pi93 5p5o/aAMzFq9U4q3splkYXcEWxcQLjluaYKTd0lJyfMH19XzdfJ0BXnoHlR37UceOh9G pjNw55Y2/ngfc0tlCDh30UJ1GfvOaswsriRnYV5cxO5Yjxn1jP6v+SFI/6RyOOLzb64a QErAYOQbEVraJJLpRZdLfJZP9+lpjdkBBAHB3A750pzh9QEF+HinaOzxGxnt3TPMUoXZ 17Ee1pzvP9l16q39PgW/SVdsYjMDikRz2sh4Lw7UWdpcLIoh2OLcVwpVAKAv00YH3jn0 rfuQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ibm.com header.s=pp1 header.b=fl4Lq67L; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id a24si6025641pjs.157.2021.04.16.04.25.00; Fri, 16 Apr 2021 04:25:14 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@ibm.com header.s=pp1 header.b=fl4Lq67L; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S241020AbhDPJeT (ORCPT + 99 others); Fri, 16 Apr 2021 05:34:19 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:27844 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S240911AbhDPJeR (ORCPT ); Fri, 16 Apr 2021 05:34:17 -0400 Received: from pps.filterd (m0098416.ppops.net [127.0.0.1]) by mx0b-001b2d01.pphosted.com (8.16.0.43/8.16.0.43) with SMTP id 13G941Qt099018; Fri, 16 Apr 2021 05:33:48 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ibm.com; h=subject : to : cc : references : from : message-id : date : mime-version : in-reply-to : content-type : content-transfer-encoding; s=pp1; bh=WK9ZHnjIfRIuS8OH5gbd+xot7IxDbuuJKa54sA4gQv0=; b=fl4Lq67LkU6GCGLweF+x1bojrxO9r+WumF2LRwAp/mnnN3JzOc3CTq263MI43WP8TqWK /LqQSVMtrdGqCG5UtNsKtAiSAPmoTXFIeoTsKeq82WOFDsajP6IXkwqILz2ccHgLDcZe 3FY46DyPewsmk9/Zhg9QBLRU2KUEtsaunXIBzJ+kgo06Uytpc0OPsmZKeM7Eao3vuhCk msOIdBV8sAtr24c517tgXusa651FiQBOtLH8gU2QqSPYbaOQYRE+vjpnRXg1AzIj/6vw uckkhNzkyewl+28mHndz8dAmp+iEDyt4LNYcYNklhMl5aJeddiJQUIIuCXBOHEJfd6en SQ== Received: from ppma06fra.de.ibm.com (48.49.7a9f.ip4.static.sl-reverse.com [159.122.73.72]) by mx0b-001b2d01.pphosted.com with ESMTP id 37xtqa2vm6-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Fri, 16 Apr 2021 05:33:48 -0400 Received: from pps.filterd (ppma06fra.de.ibm.com [127.0.0.1]) by ppma06fra.de.ibm.com (8.16.0.43/8.16.0.43) with SMTP id 13G9RedB028803; Fri, 16 Apr 2021 09:33:46 GMT Received: from b06cxnps4075.portsmouth.uk.ibm.com (d06relay12.portsmouth.uk.ibm.com [9.149.109.197]) by ppma06fra.de.ibm.com with ESMTP id 37u39habvp-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Fri, 16 Apr 2021 09:33:46 +0000 Received: from d06av24.portsmouth.uk.ibm.com (mk.ibm.com [9.149.105.60]) by b06cxnps4075.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id 13G9XhB358917288 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Fri, 16 Apr 2021 09:33:44 GMT Received: from d06av24.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id D2BE542045; Fri, 16 Apr 2021 09:33:43 +0000 (GMT) Received: from d06av24.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 4680042041; Fri, 16 Apr 2021 09:33:41 +0000 (GMT) Received: from [9.163.28.215] (unknown [9.163.28.215]) by d06av24.portsmouth.uk.ibm.com (Postfix) with ESMTP; Fri, 16 Apr 2021 09:33:40 +0000 (GMT) Subject: Re: [PATCH] powerpc/kdump: fix kdump kernel hangup issue with hot add CPUs To: Sourabh Jain , mpe@ellerman.id.au Cc: mahesh@linux.vnet.ibm.com, linux-kernel@vger.kernel.org, linuxppc-dev@ozlabs.org References: <20210416064749.657585-1-sourabhjain@linux.ibm.com> From: Hari Bathini Message-ID: Date: Fri, 16 Apr 2021 15:03:38 +0530 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.8.1 MIME-Version: 1.0 In-Reply-To: <20210416064749.657585-1-sourabhjain@linux.ibm.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-TM-AS-GCONF: 00 X-Proofpoint-GUID: ZyoryQs7cw4Cy4bfSmEWsACKD6B34kHb X-Proofpoint-ORIG-GUID: ZyoryQs7cw4Cy4bfSmEWsACKD6B34kHb X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.391,18.0.761 definitions=2021-04-16_05:2021-04-15,2021-04-16 signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 impostorscore=0 suspectscore=0 mlxlogscore=999 spamscore=0 malwarescore=0 lowpriorityscore=0 priorityscore=1501 phishscore=0 bulkscore=0 adultscore=0 mlxscore=0 clxscore=1011 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2104060000 definitions=main-2104160069 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 16/04/21 12:17 pm, Sourabh Jain wrote: > With the kexec_file_load system call when system crashes on the hot add > CPU the capture kernel hangs and failed to collect the vmcore. > > Kernel panic - not syncing: sysrq triggered crash > CPU: 24 PID: 6065 Comm: echo Kdump: loaded Not tainted 5.12.0-rc5upstream #54 > Call Trace: > [c0000000e590fac0] [c0000000007b2400] dump_stack+0xc4/0x114 (unreliable) > [c0000000e590fb00] [c000000000145290] panic+0x16c/0x41c > [c0000000e590fba0] [c0000000008892e0] sysrq_handle_crash+0x30/0x40 > [c0000000e590fc00] [c000000000889cdc] __handle_sysrq+0xcc/0x1f0 > [c0000000e590fca0] [c00000000088a538] write_sysrq_trigger+0xd8/0x178 > [c0000000e590fce0] [c0000000005e9b7c] proc_reg_write+0x10c/0x1b0 > [c0000000e590fd10] [c0000000004f26d0] vfs_write+0xf0/0x330 > [c0000000e590fd60] [c0000000004f2aec] ksys_write+0x7c/0x140 > [c0000000e590fdb0] [c000000000031ee0] system_call_exception+0x150/0x290 > [c0000000e590fe10] [c00000000000ca5c] system_call_common+0xec/0x278 > --- interrupt: c00 at 0x7fff905b9664 > NIP: 00007fff905b9664 LR: 00007fff905320c4 CTR: 0000000000000000 > REGS: c0000000e590fe80 TRAP: 0c00 Not tainted (5.12.0-rc5upstream) > MSR: 800000000280f033 CR: 28000242 > XER: 00000000 > IRQMASK: 0 > GPR00: 0000000000000004 00007ffff5fedf30 00007fff906a7300 0000000000000001 > GPR04: 000001002a7355b0 0000000000000002 0000000000000001 00007ffff5fef616 > GPR08: 0000000000000001 0000000000000000 0000000000000000 0000000000000000 > GPR12: 0000000000000000 00007fff9073a160 0000000000000000 0000000000000000 > GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 > GPR20: 0000000000000000 00007fff906a4ee0 0000000000000002 0000000000000001 > GPR24: 00007fff906a0898 0000000000000000 0000000000000002 000001002a7355b0 > GPR28: 0000000000000002 00007fff906a1790 000001002a7355b0 0000000000000002 > NIP [00007fff905b9664] 0x7fff905b9664 > LR [00007fff905320c4] 0x7fff905320c4 > --- interrupt: c00 > /** > * setup_new_fdt_ppc64 - Update the flattend device-tree of the kernel > * being loaded. > @@ -1020,6 +1113,13 @@ int setup_new_fdt_ppc64(const struct kimage *image, void *fdt, > } > } > > + /* Update cpus nodes information to account hotplug CPUs. */ > + if (image->type == KEXEC_TYPE_CRASH) { Shouldn't this apply to regular kexec_file_load case as well? Yeah, there won't be a hang in regular kexec_file_load case but for correctness, that kernel should also not see stale CPU info in FDT? Thanks Hari