Received: by 2002:a25:f815:0:0:0:0:0 with SMTP id u21csp491280ybd; Wed, 26 Jun 2019 01:31:28 -0700 (PDT) X-Google-Smtp-Source: APXvYqzlCk24aVoeo+57eVNbbIBtWAQfBQfdMK1by3o/fOBs5aH4XnoOesOud4VRCU4INCxgy34z X-Received: by 2002:a65:4085:: with SMTP id t5mr1798468pgp.109.1561537887735; Wed, 26 Jun 2019 01:31:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1561537887; cv=none; d=google.com; s=arc-20160816; b=mpNbxO+4kl51qwohZ+OkSKauSLql77jXrsEQyBjkNaeD1DkjChjAVbQaDevi+ON2K6 YH+n6lSjCeABfJZA1Wx1IXAIEl3qobQZfaspxnn7lOw8VmTNOX9vZcHgqiz8+zaN+bQa 3ZprR7YhipbFrN9dJ2c/97Od5JwFtGCqBWx2td7LVGR9TsSnVnwDSMBFyAE8Bbn1ab0X XZcV5N13xLDAZucqHej+zkXqmfeV5jRYWd4RDeogQnlkgMJ3xOAFtKyuseVKoC7zwVa4 PVXpujsR1o8qXmcnxsgiG4MFXdUh7xHzon3CfosazGYGakccJBIBilBwLOc0RwBnVe17 qDuQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=MGjvXPJu0WkRr5iSbXdq5N2wRxiINhS+kpTeH7DtTnw=; b=AArtWbVBOQw+EQlRzp3XasOPEqmEdkPkTP9CCAbpKTpTUNwXdUAv7cYMcY4fhWBfwu gVV2QvE+v2PP2qvLy/PZQvLZO83paYofTxszqvCoghXA4iWBH6+hWL3DS6oJAfVQckBm Dhgnon92FvZMgp6wpVjW5Gm63x/gyXf8X5nI+ehsWeQQzWO3xhuwFivU2SzveoPOgYhD 0d3qpscWj0SjUvybNUrTBcIA7aI2n4tl7QVsOd1MUGc9lGGAL/kWE125O2jrYxXbN07R dFzaoNiI9CqpdysUyUvZ6BEG/6t6Bgxw3RxJf18LTfK7K59/mbC1fx5V+QCYNsMwPVDI fA7A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id bd11si2702709plb.184.2019.06.26.01.31.11; Wed, 26 Jun 2019 01:31:27 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727079AbfFZI3N (ORCPT + 99 others); Wed, 26 Jun 2019 04:29:13 -0400 Received: from mx1.redhat.com ([209.132.183.28]:59426 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726239AbfFZI3N (ORCPT ); Wed, 26 Jun 2019 04:29:13 -0400 Received: from smtp.corp.redhat.com (int-mx02.intmail.prod.int.phx2.redhat.com [10.5.11.12]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id BEB32D7809; Wed, 26 Jun 2019 08:29:12 +0000 (UTC) Received: from localhost (ovpn-12-135.pek2.redhat.com [10.72.12.135]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 0D7FA60BE5; Wed, 26 Jun 2019 08:29:09 +0000 (UTC) Date: Wed, 26 Jun 2019 16:29:07 +0800 From: Baoquan He To: airlied@redhat.com Cc: kexec@lists.infradead.org, x86@kernel.org, linux-kernel@vger.kernel.org, dyoung@redhat.com Subject: Re: mgag200 fails kdump kernel booting Message-ID: <20190626082907.GY24419@MiWiFi-R3L-srv> References: <20190626081522.GX24419@MiWiFi-R3L-srv> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190626081522.GX24419@MiWiFi-R3L-srv> User-Agent: Mutt/1.10.1 (2018-07-13) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.12 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.38]); Wed, 26 Jun 2019 08:29:12 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 06/26/19 at 04:15pm, Baoquan He wrote: > Hi Dave, > > We met an kdump kernel boot failure on a lenovo system. Kdump kernel > failed to boot, but just reset to firmware to reboot system. And nothing > is printed out. > > The machine is a big server, with 6T memory and many cpu, its graphic > driver module is mgag200. > > When added 'earlyprintk=ttyS0' into kernel command line, it printed > out only one line to console during kdump kernel booting: > KASLR disabled: 'nokaslr' on cmdline. > > Then reset to firmware to reboot system. > > By further code debugging, the failure happened in > arch/x86/boot/compressed/misc.c, during kernel decompressing stage. It's > triggered by the vga printing. As you can see, in __putstr() of > arch/x86/boot/compressed/misc.c, the code checks if earlyprintk= is > specified, and print out to the target. And no matter if earlyprintk= is > added or not, it will print to VGA. And printing to VGA caused it to > reset to firmware. That's why we see nothing when didn't specify > earlyprintk=, but see only one line of printing about the 'KASLR > disabled'. Here I mean: That's why we see nothing when didn't specify earlyprintk=, but see only one line of printing about the 'KASLR disabled' message when earlyprintk=ttyS0 added. > > To confirm it's caused by VGA printing, I blacklist the mgag200 by > writting it into /etc/modprobe.d/blacklist.conf. The kdump kernel can > boot up successfully. And add 'nomodeset' can also make it work. So it's > for sure mgag driver or related code have something wrong when booting > code tries to re-init it. > > This is the only case we ever see, tend to pursuit fix in mgag200 driver > side. Any idea or suggestion? We have two machines to be able to > reproduce it stablly. > > Thanks > Baoquan