Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753655AbbF2JZ0 (ORCPT ); Mon, 29 Jun 2015 05:25:26 -0400 Received: from mail-oi0-f44.google.com ([209.85.218.44]:33496 "EHLO mail-oi0-f44.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753624AbbF2JZS (ORCPT ); Mon, 29 Jun 2015 05:25:18 -0400 MIME-Version: 1.0 X-Originating-IP: [212.51.149.109] In-Reply-To: <20150629080914.GB12383@pd.tnic> References: <1435305314-14337-1-git-send-email-rui.y.wang@intel.com> <3908561D78D1C84285E8C5FCA982C28F32AA062A@ORSMSX114.amr.corp.intel.com> <20150627141237.GE26543@pd.tnic> <20150629080914.GB12383@pd.tnic> Date: Mon, 29 Jun 2015 11:25:17 +0200 Message-ID: Subject: Re: drm/mgag200: doesn't work in panic context From: Daniel Vetter To: Borislav Petkov Cc: "Luck, Tony" , "Wang, Rui Y" , Dave Airlie , "Clark, Rob" , "Roper, Matthew D" , "Chen, Gong" , Linux Kernel Mailing List Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2705 Lines: 63 On Mon, Jun 29, 2015 at 10:09 AM, Borislav Petkov wrote: > On Sat, Jun 27, 2015 at 07:56:19PM +0200, Daniel Vetter wrote: > > > >> Which could all happen very much after the kernel made it's dying >> sigh. Display hw has long stopped being this simple and display >> drivers also. > > Thanks for the explanation. I was fearing that it would go in such > direction though. :\ Oh well. > > So there are two observations to be ventured here: > > * there's still no robust and simple way to save/show oops messages on > laptops. When a laptop crashes and the user has only this one machine, > catching the oops can be a serious PITA. Laptop most likely doesn't have > serial, modeset is hard to do, as you explained, and saving it into > persistent storage (UEFI) might brick it in some cases or whatever the > firmware will do to it. I guess we can keep wishing here for something > better. As long as the display is up and running we should have a fair stab at showing the oops - it's just that no one has seriously bothered with the necessary infastructure, automated testing (it won't work otherwise) and driver work. > * The server aspect might not really need GPU modesetting IMHO because > realistically, servers have serial so the oops can go out there. Which > begs the other question: > > Shouldn't drm_fb_helper_panic() and that path below it simply be > removed? > > I mean, if it needs to do all that hw-reconf on the panic path and if > that reconfiguration is not completely reliable, maybe we shouldn't do > it all? > > This way, we will basically remain as quiet as possible in order not > to muddle the system unreasonably when we're about to die and the only > thing we want to catch is an oops or maybe kexec the crash kernel... > > Oops will go out on serial anyway and server will kexec - GPU/fb stuff > doesn't do anything. Yes, no? drm_fb_helper_panic isn't the only panic handler - fbdev/fbcon have their own. They interfere, and fbdev blissfully assumes that it can call almost any driver hook from hardirq context. Which means you'd also need to consolidate the various hand-rolled (in drivers instead of the fbdev helper) bits to offload fbdev callbacks to workers and make sure we don't go boom in panics in there either. It's really a big mess unfortunately :( -Daniel -- Daniel Vetter Software Engineer, Intel Corporation +41 (0) 79 365 57 48 - http://blog.ffwll.ch -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/