Date: Wed, 26 Jun 2019 16:15:22 +0800
From: Baoquan He
To: airlied@redhat.com
Cc: kexec@lists.infradead.org, x86@kernel.org, linux-kernel@vger.kernel.org, dyoung@redhat.com
Subject: mgag200 fails kdump kernel booting
Message-ID: <20190626081522.GX24419@MiWiFi-R3L-srv>

Hi Dave,

We hit a kdump kernel boot failure on a Lenovo system. The kdump kernel fails to boot; the machine just resets to firmware and reboots, and nothing is printed. The machine is a big server with 6T of memory and many CPUs, and its graphics driver module is mgag200.

With 'earlyprintk=ttyS0' added to the kernel command line, only one line is printed to the console during kdump kernel boot:

    KASLR disabled: 'nokaslr' on cmdline.

Then the machine resets to firmware and reboots.

Further debugging showed that the failure happens in arch/x86/boot/compressed/misc.c, during the kernel decompression stage, and that it is triggered by VGA printing. In __putstr() in arch/x86/boot/compressed/misc.c, the code checks whether earlyprintk= is specified and, if so, prints to that target. Whether or not earlyprintk= is given, it also prints to VGA, and printing to VGA is what causes the reset to firmware. That is why we see nothing when earlyprintk= is not specified, and only the single 'KASLR disabled' line when it is.

To confirm that VGA printing is the cause, I blacklisted mgag200 by writing it into /etc/modprobe.d/blacklist.conf. With that, the kdump kernel boots successfully. Adding 'nomodeset' also makes it work. So the mgag200 driver or related code must be doing something wrong when the boot code tries to re-init it.

This is the only case of this kind we have ever seen, so we tend to pursue a fix on the mgag200 driver side. Any ideas or suggestions? We have two machines that reproduce it reliably.

Thanks
Baoquan
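
For reference, the __putstr() control flow described above can be modelled in user space roughly as follows. This is a minimal sketch based only on the description in this mail, not the kernel source: model_putstr(), early_serial_enabled, serial_log and vga_log are hypothetical names, and the two buffers merely stand in for the serial port and the VGA text buffer.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-ins for the two output devices. */
char serial_log[256];
char vga_log[256];

/* Assumption: nonzero when earlyprintk= selected a serial target. */
int early_serial_enabled;

/* Append one character to a NUL-terminated log, dropping it if full. */
static void log_append(char *log, size_t cap, char c)
{
	size_t len = strlen(log);

	if (len + 1 < cap) {
		log[len] = c;
		log[len + 1] = '\0';
	}
}

/* Simplified model of the flow described for __putstr(). */
void model_putstr(const char *s)
{
	const char *p;

	/* Serial path: only taken when earlyprintk= was given. */
	if (early_serial_enabled) {
		for (p = s; *p; p++)
			log_append(serial_log, sizeof(serial_log), *p);
	}

	/*
	 * VGA path: taken whether or not earlyprintk= was given.
	 * On the affected machines this is the step that ends in a
	 * reset to firmware, after the serial output is already out.
	 */
	for (p = s; *p; p++)
		log_append(vga_log, sizeof(vga_log), *p);
}
```

In this model the serial write completes before the VGA write starts, which matches what we observe: with earlyprintk=ttyS0 the 'KASLR disabled' line reaches the console before the reset, and without it nothing is seen at all.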