Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752116AbdIVJ5Q (ORCPT ); Fri, 22 Sep 2017 05:57:16 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:51036 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1752010AbdIVJ5P (ORCPT ); Fri, 22 Sep 2017 05:57:15 -0400 Subject: Re: [linux-next][DLPAR CPU][Oops] Bad kernel stack pointer From: Abdul Haleem To: Michael Ellerman Cc: linuxppc-dev , linux-kernel , linux-next , Stephen Rothwell , Rob Herring , Paul Mackerras Date: Fri, 22 Sep 2017 15:27:04 +0530 In-Reply-To: <878th9lhpe.fsf@concordia.ellerman.id.au> References: <1505729319.6990.5.camel@abdul.in.ibm.com> <878th9lhpe.fsf@concordia.ellerman.id.au> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.10.4-0ubuntu1 Mime-Version: 1.0 Content-Transfer-Encoding: 7bit X-TM-AS-GCONF: 00 x-cbid: 17092209-0008-0000-0000-0000089BD609 X-IBM-SpamModules-Scores: X-IBM-SpamModules-Versions: BY=3.00007777; HX=3.00000241; KW=3.00000007; PH=3.00000004; SC=3.00000231; SDB=6.00920653; UDB=6.00462637; IPR=6.00700874; BA=6.00005601; NDR=6.00000001; ZLA=6.00000005; ZF=6.00000009; ZB=6.00000000; ZP=6.00000000; ZH=6.00000000; ZU=6.00000002; MB=3.00017245; XFM=3.00000015; UTC=2017-09-22 09:57:11 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 17092209-0009-0000-0000-000044135E21 Message-Id: <1506074224.17232.8.camel@abdul.in.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2017-09-22_03:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 spamscore=0 suspectscore=0 malwarescore=0 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1707230000 definitions=main-1709220139 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 922 Lines: 40 On Wed, 2017-09-20 at 21:42 +1000, Michael Ellerman wrote: > Abdul Haleem writes: > > > Hi, > > > > Dynamic CPU remove operation resulted in Kernel Panic on today's > > next-20170915 kernel. > > > > Machine Type: Power 7 PowerVM LPAR > > Kernel : 4.13.0-next-20170915 > > config : attached > > test: DLPAR CPU remove > > > > > > dmesg logs: > > ---------- > > cpu 37 (hwid 37) Ready to die... > > cpu 38 (hwid 38) Ready to die... > > cpu 39 (hwid 39) > > ******* RTAS CReady to die... > > ALL BUFFER CORRUPTION ******* > > Cool. Does that come from RTAS itself? I have never seen that happen > before. Not sure, the var logs does not have any messages captured. This is first time we hit this type of issue. > > Is this easily reproducible? I am unable to reproduce it again. I will keep an eye on our CI runs for few more runs. -- Regard's Abdul Haleem IBM Linux Technology Centre