To: linux-kernel@vger.kernel.org
From: Peter Maloney
Subject: amdgpu crash
Message-ID: <3568fc58-aac9-0532-2684-3283bcdd2db3@brockmann-consult.de>
Date: Thu, 6 Oct 2016 11:52:39 +0200

Hi,

I seem to have a crash in amdgpu.
It results in a black screen with the monitors in power save mode, but sysrq still works to reboot. (Is this the right place to report it?)

It never failed this way with kernel 4.5.7, but with kernel 4.7.6 it fails every day after the machine has been idle for a long time.

> Oct 4 19:09:51 peter kernel: INFO: task plasmashell:3200 blocked for more than 120 seconds.
> Oct 4 19:09:51 peter kernel: Not tainted 4.7.6-1-grsec-kvm-host #24
> Oct 4 19:09:51 peter kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> Oct 4 19:09:51 peter kernel: plasmashell D ffffc9000318b890 0 3200 3091 0x00000080
> Oct 4 19:09:51 peter kernel: ffffc9000318b890 ffff880469e56540 ffff88046da5a1c0 dad31f4a396d9287
> Oct 4 19:09:51 peter kernel: ffff880469e56548 ffff88045b4a90e8 ffff88045b4a90e8 0000000000000001
> Oct 4 19:09:51 peter kernel: ffff880450ad4800 ffffc9000318b8a8 ffffffff816be89b ffff880450ad4800
> Oct 4 19:09:51 peter kernel: Call Trace:
> Oct 4 19:09:51 peter kernel: [] schedule+0x3b/0xa0
> Oct 4 19:09:51 peter kernel: [] amd_sched_entity_push_job+0x6b/0xe0 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] ? wake_atomic_t_function+0xc0/0xc0
> Oct 4 19:09:51 peter kernel: [] amdgpu_job_submit+0xaf/0x120 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] amdgpu_vm_bo_update_mapping+0x2e0/0x4f0 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] amdgpu_vm_bo_split_mapping+0x122/0x150 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] amdgpu_vm_bo_update+0x157/0x270 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] amdgpu_gem_va_update_vm+0x1bb/0x1e0 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] ? __list_add+0x11/0x90
> Oct 4 19:09:51 peter kernel: [] amdgpu_gem_va_ioctl+0x242/0x310 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] ? amdgpu_exit+0x188d/0x90fd1 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] drm_ioctl+0x362/0x6c0 [drm]
> Oct 4 19:09:51 peter kernel: [] ? amdgpu_gem_metadata_ioctl+0x1e0/0x1e0 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] amdgpu_drm_ioctl+0x47/0x90 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] do_vfs_ioctl+0xd0/0xb30
> Oct 4 19:09:51 peter kernel: [] ? __fget+0x79/0xb0
> Oct 4 19:09:51 peter kernel: [] sys_ioctl+0x7d/0xa0
> Oct 4 19:09:51 peter kernel: [] do_syscall_64+0x56/0xf0
> Oct 4 19:09:51 peter kernel: [] entry_SYSCALL64_slow_path+0x25/0x25

> root@peter:~ # uname -a
> Linux peter 4.7.6-1-grsec-kvm-host #24 SMP PREEMPT Tue Oct 4 12:43:34 CEST 2016 x86_64 GNU/Linux

> xorg-server 1.18.4-1
> xf86-video-amdgpu 1.1.2-1
> plasma-desktop 5.7.5-1
> plasma-framework 5.26.0-1
> plasma-workspace 5.7.5-1

> root@peter:~ # lspci -k | grep -E "VGA|in use" | grep -A1 "VGA"
> 06:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Bonaire XTX [Radeon R7 260X/360]
> Kernel driver in use: vfio-pci
> --
> 07:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Tobago PRO [Radeon R7 360 / R9 360 OEM] (rev 81)
> Kernel driver in use: amdgpu

(Only the second card is in use here, and it is the primary one [the numbering is in reverse order on this machine]. The other card has vfio-pci bound early via an initcpio hook, and it has not been used by qemu since the last reboot.)
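
For anyone triaging a trace like the one quoted above, the frames can be pulled out of the mail-wrapped text with a short script. This is only a rough sketch and not part of the original report: the function name parse_frames and the regular expression are made up for illustration, and it assumes frames have the form symbol+0xOFFSET/0xSIZE with an optional [module] suffix, as they do in this log. Frames preceded by "?" are ones the kernel's stack unwinder could not confirm, so they are flagged as unreliable.

```python
import re

# Hypothetical helper, not from any kernel tool.
# Matches "symbol+0xOFF/0xSIZE", an optional leading "? " and an
# optional trailing "[module]".
FRAME_RE = re.compile(
    r"(\?\s+)?([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+(?:\s+\[(\w+)\])?"
)


def parse_frames(text):
    """Return (symbol, module, reliable) tuples from a pasted call trace."""
    # Drop mail quote markers and join the lines so that frames which the
    # mail client wrapped onto a second line are whole again.
    flat = " ".join(line.lstrip("> ").strip() for line in text.splitlines())
    return [
        (m.group(2), m.group(3), m.group(1) is None)
        for m in FRAME_RE.finditer(flat)
    ]


sample = """\
> Oct 4 19:09:51 peter kernel: [] schedule+0x3b/0xa0
> Oct 4 19:09:51 peter kernel: []
> amd_sched_entity_push_job+0x6b/0xe0 [amdgpu]
> Oct 4 19:09:51 peter kernel: [] ?
> wake_atomic_t_function+0xc0/0xc0
"""

frames = parse_frames(sample)
for symbol, module, reliable in frames:
    mark = " " if reliable else "?"
    print(mark, symbol, module or "(core kernel)")
```

Filtering the result on module == "amdgpu" and reliable gives the driver's part of the call chain, which in this trace runs from amdgpu_drm_ioctl down to amd_sched_entity_push_job.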