Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp3210736imu; Sat, 24 Nov 2018 00:11:01 -0800 (PST) X-Google-Smtp-Source: AFSGD/Wrc908e97CBJVexaoOnsysEsSqAfQFIkTs/9oENsmPODCRJMlc1aWyd6djdmyPkF1edD61 X-Received: by 2002:a17:902:6bc4:: with SMTP id m4mr4079791plt.93.1543047061208; Sat, 24 Nov 2018 00:11:01 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543047061; cv=none; d=google.com; s=arc-20160816; b=O3O5rtlpu0Y1pJWJQXqxa8pMfu3+VbDn1wvVb0gS/lR49kOQ2twl1lHv0gtfZMGP21 G7Z2y24PyOYIEDxGRLFCsz+VibZDBD/DHb3OZ3poh1XQTYXvYfJyfbRMA/ARDblUYmrH 4ekd+xqWHQ4Srb7k1HMk8fMyRwsIWxs0AK5E6AEJFr15AhlFH6+P2QIf1xuyuk5aNUDR IrJu/4HXYqN5mOXmhRhMyyZsmlLPZ6W8YppPoG0beuqtELqU5pKP221po8jeCuSIPfkQ tpX9mTROQFnW+mrtDz0s/Ft3D0BGS2gwpMIjI1JnRVnPBqx4D7oU/0mKcMgcDytkwZ9w XOLQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:date:subject:user-agent:message-id :references:cc:in-reply-to:organization:from:to :content-transfer-encoding:mime-version; bh=wlfWKUiBRShjZzPz7+Ldr5S5unbAPOlzYgfbNtgSL+s=; b=EIKIJ5CsHr0+zTF2O33c5soEtQlY1VVO7Zq13ALywuivFm1IGZeM4zHjDbaw7wWTdi ppXfYFNontLQRFGAPhycv8QmHcE8X4uerluNPy8sScB/2cNdlrH5sG6nA9mohekwoKq6 ZHbeB081CSFVL27J7vSAA5hPM/WYogNzESrzlRr0QN0oVpXQdhkDXcIHj25JBH7Oqsko UWq/lxJN4BVNLJHuz47ORazz1Ztdxv3A6DMnybXxDM/pvIjd0T/4twDdaVlVx/AmS0KL 7eYBoh/l1YnMd6klVaG/TY0zLwUDv8Ta46pSPI11BVxqtWrhrEeJXvIeMjf3or80GEMf yjJQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id cb2si36796087plb.298.2018.11.24.00.10.47; Sat, 24 Nov 2018 00:11:01 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2408449AbeKWTAu convert rfc822-to-8bit (ORCPT + 99 others); Fri, 23 Nov 2018 14:00:50 -0500 Received: from mga11.intel.com ([192.55.52.93]:5296 "EHLO mga11.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1731828AbeKWTAu (ORCPT ); Fri, 23 Nov 2018 14:00:50 -0500 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga004.fm.intel.com ([10.253.24.48]) by fmsmga102.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 23 Nov 2018 00:17:39 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.56,268,1539673200"; d="scan'208";a="108655586" Received: from jlahtine-desk.ger.corp.intel.com (HELO localhost) ([10.252.2.240]) by fmsmga004.fm.intel.com with ESMTP; 23 Nov 2018 00:17:36 -0800 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8BIT To: Pavel Machek From: Joonas Lahtinen Organization: Intel Finland Oy - BIC 0357606-4 - Westendinkatu 7, 02160 Espoo In-Reply-To: <20181121115449.GA32455@amd> Cc: bp@alien8.de, hpa@zytor.com, kernel list , mingo@redhat.com, tglx@linutronix.de, x86@kernel.org, jani.nikula@linux.intel.com, rodrigo.vivi@intel.com, intel-gfx@lists.freedesktop.org, chris@chris-wilson.co.uk References: <20181108175803.GA10785@amd> <154279919462.20217.14259089584802660420@jlahtine-desk.ger.corp.intel.com> <20181121115449.GA32455@amd> Message-ID: <154296105546.7930.1457928786446716358@jlahtine-desk.ger.corp.intel.com> User-Agent: alot/0.6 Subject: Re: v4.20-rc1: list_del corruption on thinkpad x220 Date: Fri, 23 Nov 2018 10:17:35 +0200 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Quoting Pavel Machek (2018-11-21 13:54:49) > Hi! > > > > My machine locked hard (thinkpad x220). After reboot, I found this in > > > syslog: > > > > > > Sounds like memory corruption..? Does not sound like easy to debug. > > > > Were you doing something GPU intense when you experienced the hard hang? > > > > And if so, have you been able to hit the issue more than once? At this > > point it doesn't look like anything we've hit previously, so would be > > great to have some more insight into how we could reproduce. > > I seen another crash since that, but I don't think it counts at > "easily reproducible". > > I may have been running flightgear at that point. That's fairly GPU intensive. > > > There's one similar for nouveau in Bugzilla, but it seems like a genuine > > memory corruption (1 bit flipped): > > > > https://bugs.freedesktop.org/show_bug.cgi?id=84880 > > > > Any extra information would be of use :) > > > > Regards, Joonas > > > > PS. Could you open a bug to Bugzilla, it'll help to collect the > > information in one consolidated place: > > > > https://01.org/linuxgraphics/documentation/how-report-bugs > > I prefer email... certainly for bugs that can't be reproduced. By adding it to the Bugzilla it may be recognized by somebody else who is experiencing a similar issue. Internet points are not deducted for submitting bugs in good faith, even if they get closed as NOTABUG. It sounds like you've hit the same signature twice, so it may very well be reproducible. Does flightgear have some demo mode where you could leave it running a heavy scene overnight? Were you running 4.19 kernel previously, distro one or vanilla? A full dmesg from a boot would be appreciated (from kernel where you didn't experience issues, and from one where you do). We actually have a well defined process and personnel to look into the Bugzilla entries, so it'd still be helpful to have this logged to Bugzilla. Regards, Joonas > > Best regards, > Pavel > > > > > > ...otoh, it still looks like an addres, so maybe it is "just" race in > > > GPU drivers? > > > > > > Any ideas? > > > Pavel > > > > > > Nov 8 18:35:01 duo CRON[28511]: (root) CMD (command -v debian-sa1 > > > > /dev/null && debian-sa > > > 1 1 1) > > > Nov 8 18:42:57 duo kernel: list_del corruption. prev->next should be > > > ffff8801742b8178, but > > > was ffffc9000192fec8 > > > Nov 8 18:42:57 duo kernel: ------------[ cut here ]------------ > > > Nov 8 18:42:57 duo kernel: kernel BUG at > > > /data/fast/l/k/lib/list_debug.c:53! > > > Nov 8 18:42:57 duo kernel: invalid opcode: 0000 [#1] SMP PTI > > > Nov 8 18:42:57 duo kernel: CPU: 2 PID: 1082 Comm: i915/signal:1 Not > > > tainted 4.20.0-rc1+ #3 > > > Nov 8 18:42:57 duo kernel: Hardware name: LENOVO 42872WU/42872WU, > > > BIOS 8DET74WW (1.44 ) 03 > > > /13/2018 > > > Nov 8 18:42:57 duo kernel: RIP: > > > 0010:__list_del_entry_valid+0x8e/0x90 > > > Nov 8 18:42:57 duo kernel: Code: 66 88 d1 ff 0f 0b 48 89 fe 31 c0 48 > > > c7 c7 90 74 5e 85 e8 > > > 53 88 d1 ff 0f 0b 48 89 fe 31 c0 48 c7 c7 c8 74 5e 85 e8 40 88 d1 ff > > > <0f> 0b 55 48 89 d0 48 > > > 8b 52 08 48 89 e5 48 39 f2 75 19 48 8b 32 48 > > > Nov 8 18:42:57 duo kernel: RSP: 0000:ffffc9000196be78 EFLAGS: > > > 00210086 > > > Nov 8 18:42:57 duo kernel: RAX: 0000000000000054 RBX: > > > ffff8801742b8178 RCX: 00000000000000 > > > 00 > > > Nov 8 18:42:57 duo kernel: RDX: 0000000000000000 RSI: > > > ffff88019e2a53d8 RDI: ffff88019e2a53 > > > d8 > > > Nov 8 18:42:57 duo kernel: RBP: ffffc9000196be78 R08: > > > ffff880196e2cd10 R09: 00000000000000 > > > 00 > > > Nov 8 18:42:57 duo kernel: R10: 00000000e7684eb9 R11: > > > 3863656632393101 R12: ffffc9000196be > > > c8 > > > Nov 8 18:42:57 duo kernel: R13: ffff88019707e000 R14: > > > ffff8801742b8080 R15: ffffc9000192fd > > > d0 > > > Nov 8 18:42:57 duo kernel: FS: 0000000000000000(0000) > > > GS:ffff88019e280000(0000) knlGS:000 > > > 0000000000000 > > > Nov 8 18:42:57 duo kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > > > 0000000080050033 > > > Nov 8 18:42:57 duo kernel: CR2: 00000000ed2bf000 CR3: > > > 000000000581e001 CR4: 00000000000606a0 > > > Nov 8 18:42:57 duo kernel: Call Trace: > > > Nov 8 18:42:57 duo kernel: intel_breadcrumbs_signaler+0x162/0x330 > > > Nov 8 18:42:57 duo kernel: kthread+0x116/0x150 > > > Nov 8 18:42:57 duo kernel: ? intel_engine_wakeup+0x40/0x40 > > > Nov 8 18:42:57 duo kernel: ? kthread_park+0x90/0x90 > > > Nov 8 18:42:57 duo kernel: ret_from_fork+0x35/0x40 > > > Nov 8 18:42:57 duo kernel: Modules linked in: > > > Nov 8 18:42:57 duo kernel: ---[ end trace 2f8da183a56f80f6 ]--- > > > Nov 8 18:42:57 duo kernel: RIP: > > > 0010:__list_del_entry_valid+0x8e/0x90 > > > Nov 8 18:42:57 duo kernel: Code: 66 88 d1 ff 0f 0b 48 89 fe 31 c0 > > > 48 c7 c7 90 74 5e 85 e8 53 88 d1 ff 0f 0b 48 89 fe 31 c0 48 c7 c7 c8 > > > 74 5e 85 e8 40 88 d1 ff <0f> 0b 55 48 89 d0 48 8b 52 08 48 89 e5 48 > > > 39 f2 75 19 48 8b 32 48 > > > Nov 8 18:42:57 duo kernel: RSP: 0000:ffffc9000196be78 EFLAGS: > > > 00210086 > > > Nov 8 18:42:57 duo kernel: RAX: 0000000000000054 RBX: > > > ffff8801742b8178 RCX: 0000000000000000 > > > Nov 8 18:42:57 duo kernel: RDX: 0000000000000000 RSI: > > > ffff88019e2a53d8 RDI: ffff88019e2a53d8 > > > Nov 8 18:42:57 duo kernel: RBP: ffffc9000196be78 R08: > > > ffff880196e2cd10 R09: 0000000000000000 > > > Nov 8 18:42:57 duo kernel: R10: 00000000e7684eb9 R11: > > > 3863656632393101 R12: ffffc9000196bec8 > > > Nov 8 18:42:57 duo kernel: R13: ffff88019707e000 R14: > > > ffff8801742b8080 R15: ffffc9000192fdd0 > > > Nov 8 18:42:57 duo kernel: FS: 0000000000000000(0000) > > > GS:ffff88019e280000(0000) knlGS:0000000000000000 > > > Nov 8 18:42:57 duo kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > > > 0000000080050033 > > > Nov 8 18:42:57 duo kernel: CR2: 00000000ed2bf000 CR3: > > > 000000000581e001 CR4: 00000000000606a0 > > > > > > -- > > > (english) http://www.livejournal.com/~pavelmachek > > > (cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html > > -- > (english) http://www.livejournal.com/~pavelmachek > (cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html