Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932478AbdCFTiP (ORCPT ); Mon, 6 Mar 2017 14:38:15 -0500 Received: from mx0a-001b2d01.pphosted.com ([148.163.156.1]:38324 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932067AbdCFTiI (ORCPT ); Mon, 6 Mar 2017 14:38:08 -0500 Subject: [next-20170306] Memory off-on resulted in Oops on Power 6 system From: Abdul Haleem To: linuxppc-dev , linux-kernel@vger.kernel.org Cc: "aneesh.kumar" , mpe Date: Tue, 07 Mar 2017 00:31:22 +0530 Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.10.4-0ubuntu1 Mime-Version: 1.0 Content-Transfer-Encoding: 8bit X-TM-AS-MML: disable X-Content-Scanned: Fidelis XPS MAILER x-cbid: 17030619-0008-0000-0000-00000538C560 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 17030619-0009-0000-0000-0000134335E2 Message-Id: <1488826882.3961.8.camel@abdul-ThinkPad-T450> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2017-03-06_17:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 spamscore=0 suspectscore=0 malwarescore=0 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1702020001 definitions=main-1703060153 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4403 Lines: 92 Hi, Today’s next resulted in kernel Oops while running memory-off-on test from kernel selftest. Kernel Version : 4.11.0-rc1-next-20170306 Machine type : Power6 PowerVM LPAR [stdout] make[1]: Entering directory `/var/tmp/avocado_kkNtjt/1-kernel_kselftest.py_kselftest.test/src/linux-4.8.6/tools/testing/selftests/memory-hotplug' [stdout] ./mem-on-off-test.sh -r 2 || echo "selftests: memory-hotplug [FAIL]" [stdout] Test scope: 2% hotplug memory [stdout] online all hotplug memory in offline state [stdout] offline 2% hotplug memory in online state [stdout] online all hotplug memory in offline state [stdout] online-offline 89 ehea: memory is going offline ehea: LPAR memory changed - re-initializing driver ehea: re-initializing driver complete Offlined Pages 1024 [stdout] online-offline 92 ehea: memory is going offline ehea: LPAR memory changed - re-initializing driver ehea: WARNING: sq not flushed completely ehea: WARNING: sq not flushed completely ehea: re-initializing driver complete ehea: memory offlining canceled ehea: memory is going online ehea: LPAR memory changed - re-initializing driver ehea: WARNING: sq not flushed completely ehea: re-initializing driver complete Unable to handle kernel paging request for data at address 0x00000008 Faulting instruction address: 0xc000000000205138 Oops: Kernel access of bad area, sig: 11 [#1] SMP NR_CPUS=1024 NUMA pSeries Modules linked in: xt_CHECKSUM bnep bluetooth nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_REJECT nf_reject_ipv6 nf_conntrack_ipv6 nf_defrag_ipv6 cfg80211 ipt_REJECT nf_reject_ipv4 nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack rfkill libcrc32c ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_mangle iptable_security iptable_raw iptable_filter ip_tables nfsd auth_rpcgss nfs_acl lockd ses osst enclosure st scsi_transport_sas ehea uio_pdrv_genirq uio sunrpc grace ipv6 crc_ccitt autofs4 ext4 mbcache jbd2 sr_mod cdrom sd_mod dm_mirror dm_region_hash dm_log dm_mod CPU: 8 PID: 8993 Comm: mem-on-off-test Not tainted 4.11.0-rc1-next-20170306-autotest #2 task: c0000000fd47d200 task.stack: c0000000fca18000 NIP: c000000000205138 LR: c0000000002870ec CTR: c00000000082b110 REGS: c0000000fca1b3b0 TRAP: 0300 Not tainted (4.11.0-rc1-next-20170306-autotest) MSR: 800000000200b032 CR: 24222882 XER: 00000007 CFAR: c000000000008920 DAR: 0000000000000008 DSISR: 40000000 SOFTE: 0 GPR00: c000000000286ec0 c0000000fca1b630 c000000001025400 c0000001c3623800 GPR04: 0000000000000000 0000000000000002 0000000000000007 0000000000000100 GPR08: c000000001065400 0000000000000000 0000000000000008 c0000000011b0f00 GPR12: 0000000024222882 c00000000edc1800 fffffffffffffff5 00000000000000fe GPR16: 0000000000017400 0000000000000173 0000000000000400 0000000100002358 GPR20: c000000001063b00 0000000000000000 6db6db6db6db6db7 c0000001c3ff6500 GPR24: 0000000000000001 0000000000000008 0000000000000001 0000000000000001 GPR28: c0000000011b0f00 c0000001c3ff6920 c0000001c3ff6500 c0000001c3620000 NIP [c000000000205138] .get_pfnblock_flags_mask+0x38/0x70 LR [c0000000002870ec] .unset_migratetype_isolate+0x28c/0x300 Call Trace: [c0000000fca1b630] [c000000000286ec0] .unset_migratetype_isolate+0x60/0x300 (unreliable) [c0000000fca1b6e0] [c000000000287608] .undo_isolate_page_range+0x148/0x180 [c0000000fca1b790] [c000000000289448] .__offline_pages.constprop.19+0x268/0xad0 [c0000000fca1b8f0] [c000000000547ae8] .memory_subsys_offline+0x58/0xe0 [c0000000fca1b970] [c000000000525478] .device_offline+0xe8/0x130 [c0000000fca1ba00] [c00000000054793c] .store_mem_state+0x15c/0x180 [c0000000fca1ba90] [c000000000520910] .dev_attr_store+0x30/0x60 [c0000000fca1bb00] [c000000000335be0] .sysfs_kf_write+0x60/0xa0 [c0000000fca1bb70] [c0000000003349c4] .kernfs_fop_write+0x184/0x260 [c0000000fca1bc10] [c00000000028d37c] .__vfs_write+0x3c/0x1a0 [c0000000fca1bcf0] [c00000000028eca0] .vfs_write+0xc0/0x230 [c0000000fca1bd90] [c000000000290878] .SyS_write+0x58/0x100 [c0000000fca1be30] [c00000000000b758] system_call+0x38/0xfc Instruction dump: 3928bb00 794a1f24 7d29502a 788ac202 2fa90000 419e000c 794a2428 7d295214 3d020004 78840620 81487564 7c845436 5484103a 7c8807b4 548406ba ---[ end trace 911354ae3120f308 ]--- -- Regard's Abdul Haleem IBM Linux Technology Centre