2015-02-03 19:15:58

by Andrey Skvortsov

[permalink] [raw]
Subject: [regression in linux-next] i915: broken graphics on laptop

Hi,

tested next-20150202. System boots, but graphic output is broken (empty black screen).
Booted five times the same kernel, always got the same result. The system works with 3.19-rc7.

This is the first warning in the log:

WARNING: CPU: 0 PID: 855 at drivers/gpu/drm/i915/intel_uncore.c:169 intel_uncore_forcewake_reset+0x188/0x24d [i915]()
WARN_ON(dev_priv->uncore.fw_domains == 0)
Modules linked in: i915(E+) lpc_ich(E) mfd_core(E) snd_hda_controller(E) snd_hda_codec(E) snd_hwdep(E) snd_pcm(E) snd_timer(E) drm_kms_helper(E) drm(E) battery(E) button(E) video(E) ac(E) snd(E) soundcore(E) i2c_algo_bit(E) i2c_core(E) acpi_cpufreq(E) processor(E) fuse(E) parport_pc(E) ppdev(E) lp(E) parport(E) autofs4(E) ext4(E) crc16(E) jbd2(E) mbcache(E) sd_mod(E) ata_generic(E) ahci(E) libahci(E) ata_piix(E) libata(E) scsi_mod(E) sdhci_pci(E) firewire_ohci(E) sdhci(E) b44(E) firewire_core(E) crc_itu_t(E) mii(E) ssb(E) mmc_core(E) libphy(E) ehci_pci(E) thermal(E) thermal_sys(E) uhci_hcd(E) ehci_hcd(E) usbcore(E) usb_common(E)
CPU: 0 PID: 855 Comm: systemd-udevd Tainted: G E 3.19.0-rc6-next-20150202-150201- #4
Hardware name: Dell Inc. Vostro 1500 /0NX907, BIOS A06 04/21/2008
0000000000000000 0000000000000009 ffffffff813e790a ffff8800da083958
ffffffff8104178e ffffffffa0601100 ffffffffa0589bc4 ffff8800da083988
ffff8800da6c00c8 ffff8800da6c00c8 0000000000000000 0000000000000246
Call Trace:
[<ffffffff813e790a>] ? dump_stack+0x4a/0x74
[<ffffffff8104178e>] ? warn_slowpath_common+0x9d/0xb5
[<ffffffffa0589bc4>] ? intel_uncore_forcewake_reset+0x188/0x24d [i915]
[<ffffffff81041851>] ? warn_slowpath_fmt+0x4a/0x4f
[<ffffffffa0589bc4>] ? intel_uncore_forcewake_reset+0x188/0x24d [i915]
[<ffffffffa0589ed2>] ? intel_uncore_init+0x1e4/0x4a8 [i915]
[<ffffffffa05cb960>] ? i915_driver_load+0x58f/0xeda [i915]
[<ffffffff811fd88e>] ? kobject_uevent_env+0x581/0x5d8
[<ffffffff8112d1d5>] ? kfree+0xa4/0x127
[<ffffffff811fd88e>] ? kobject_uevent_env+0x581/0x5d8
[<ffffffff812e705d>] ? devtmpfs_create_node+0x102/0x117
[<ffffffff8105f8ed>] ? preempt_count_sub+0xab/0xca
[<ffffffff8105f8ed>] ? preempt_count_sub+0xab/0xca
[<ffffffffa04418c8>] ? drm_dev_register+0x79/0xec [drm]
[<ffffffffa0443de7>] ? drm_get_pci_dev+0xfc/0x1b7 [drm]
[<ffffffff81230a60>] ? pci_device_probe+0x74/0xd1
[<ffffffff812e23ef>] ? driver_probe_device+0x2ff/0x2ff
[<ffffffff812e220c>] ? driver_probe_device+0x11c/0x2ff
[<ffffffff812e23ef>] ? driver_probe_device+0x2ff/0x2ff
[<ffffffff812e2447>] ? __driver_attach+0x58/0x78
[<ffffffff812e076b>] ? bus_for_each_dev+0x53/0x84
[<ffffffff812e18f8>] ? bus_add_driver+0x113/0x1f8
[<ffffffff812e2a95>] ? driver_register+0x87/0xba
[<ffffffffa0627000>] ? 0xffffffffa0627000
[<ffffffff8100031d>] ? do_one_initcall+0xf7/0x18e
[<ffffffff8112d44b>] ? kmem_cache_alloc_trace+0xd6/0xe8
[<ffffffff8109eb2b>] ? load_module+0x1c81/0x202e
[<ffffffff8109eba3>] ? load_module+0x1cf9/0x202e
[<ffffffff8109b4aa>] ? mod_kobject_put+0x48/0x48
[<ffffffff8109b98d>] ? copy_module_from_fd+0x8c/0xf5
[<ffffffff8109f024>] ? SyS_finit_module+0x82/0x9a
[<ffffffff813ec352>] ? system_call_fastpath+0x12/0x17


Other warnings are below:

[ 19.253096] WARNING: CPU: 1 PID: 746 at drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object+0x134/0x272 [i915]()
[ 19.253098] WARN_ON(obj->frontbuffer_bits)
[ 19.253126] Modules linked in: snd_hda_intel(E+) i2c_i801(E+) i915(E+) lpc_ich(E) mfd_core(E) snd_hda_controller(E) snd_hda_codec(E) snd_hwdep(E) snd_pcm(E) snd_timer(E) drm_kms_helper(E) drm(E) battery(E) button(E) video(E) ac(E) snd(E) soundcore(E) i2c_algo_bit(E) i2c_core(E) acpi_cpufreq(E) processor(E) fuse(E) parport_pc(E) ppdev(E) lp(E) parport(E) autofs4(E) ext4(E) crc16(E) jbd2(E) mbcache(E) sd_mod(E) ata_generic(E) ahci(E) libahci(E) ata_piix(E) libata(E) scsi_mod(E) sdhci_pci(E) firewire_ohci(E) sdhci(E) b44(E) firewire_core(E) crc_itu_t(E) mii(E) ssb(E) mmc_core(E) libphy(E) ehci_pci(E) thermal(E) thermal_sys(E) uhci_hcd(E) ehci_hcd(E) usbcore(E) usb_common(E)
[ 19.253129] CPU: 1 PID: 746 Comm: kworker/u4:5 Tainted: G W E 3.19.0-rc6-next-20150202-150201- #4
[ 19.253130] Hardware name: Dell Inc. Vostro 1500 /0NX907, BIOS A06 04/21/2008
[ 19.253135] Workqueue: events_unbound async_run_entry_fn
[ 19.253137] 0000000000000000 0000000000000009 ffffffff813e790a ffff880037a9b9e8
[ 19.253139] ffffffff8104178e ffff8800da6c0000 ffffffffa056e22b ffff880196dda800
[ 19.253140] ffff880196e54000 ffff8800da6c0000 ffff880196e54040 ffff880196e54040
[ 19.253141] Call Trace:
[ 19.253144] [<ffffffff813e790a>] ? dump_stack+0x4a/0x74
[ 19.253147] [<ffffffff8104178e>] ? warn_slowpath_common+0x9d/0xb5
[ 19.253173] [<ffffffffa056e22b>] ? i915_gem_free_object+0x134/0x272 [i915]
[ 19.253176] [<ffffffff81041851>] ? warn_slowpath_fmt+0x4a/0x4f
[ 19.253202] [<ffffffffa056e0bb>] ? i915_vma_unbind+0x18f/0x1cb [i915]
[ 19.253228] [<ffffffffa056e22b>] ? i915_gem_free_object+0x134/0x272 [i915]
[ 19.253246] [<ffffffffa043d7d7>] ? drm_gem_object_release+0x3b/0x3b [drm]
[ 19.253277] [<ffffffffa05932c3>] ? kref_sub.constprop.59+0x2f/0x38 [i915]
[ 19.253308] [<ffffffffa059333d>] ? intel_user_framebuffer_destroy+0x62/0x75 [i915]
[ 19.253321] [<ffffffffa0446344>] ? drm_framebuffer_unregister_private+0x37/0x37 [drm]
[ 19.253333] [<ffffffffa04466af>] ? kref_sub.constprop.33+0x2f/0x38 [drm]
[ 19.253346] [<ffffffffa04469b7>] ? drm_mode_set_config_internal+0xa6/0xd7 [drm]
[ 19.253355] [<ffffffffa04ae3df>] ? restore_fbdev_mode+0xad/0xc8 [drm_kms_helper]
[ 19.253361] [<ffffffffa04afa80>] ? drm_fb_helper_restore_fbdev_mode_unlocked+0x24/0x5a [drm_kms_helper]
[ 19.253367] [<ffffffffa04afaea>] ? drm_fb_helper_set_par+0x34/0x39 [drm_kms_helper]
[ 19.253398] [<ffffffffa05acba9>] ? intel_fbdev_set_par+0x16/0x5a [i915]
[ 19.253401] [<ffffffff8124b474>] ? fbcon_init+0x304/0x40d
[ 19.253404] [<ffffffff812b7828>] ? visual_init+0xb4/0x107
[ 19.253406] [<ffffffff812b8fc3>] ? do_bind_con_driver+0x176/0x28b
[ 19.253408] [<ffffffff812b920f>] ? do_take_over_console+0x137/0x167
[ 19.253410] [<ffffffff81247cda>] ? do_fbcon_takeover+0x59/0x9c
[ 19.253413] [<ffffffff81058e24>] ? notifier_call_chain+0x33/0x5e
[ 19.253415] [<ffffffff81058f46>] ? __blocking_notifier_call_chain+0x48/0x62
[ 19.253417] [<ffffffff81251de7>] ? register_framebuffer+0x23f/0x29b
[ 19.253424] [<ffffffffa04afd7c>] ? drm_fb_helper_initial_config+0x28d/0x33f [drm_kms_helper]
[ 19.253427] [<ffffffff8100166b>] ? __switch_to+0x1fb/0x47d
[ 19.253428] [<ffffffff8105a4fa>] ? async_run_entry_fn+0x6b/0x131
[ 19.253430] [<ffffffff810543c6>] ? process_one_work+0x171/0x28e
[ 19.253432] [<ffffffff810546b2>] ? worker_thread+0x1a5/0x272
[ 19.253434] [<ffffffff8105450d>] ? process_scheduled_works+0x2a/0x2a
[ 19.253436] [<ffffffff8105812d>] ? kthread+0x9e/0xa6
[ 19.253438] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c
[ 19.253440] [<ffffffff813ec2ac>] ? ret_from_fork+0x7c/0xb0
[ 19.253442] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c

....

[ 29.097165] WARNING: CPU: 1 PID: 19 at include/linux/kref.h:47 drm_framebuffer_reference+0x5b/0x64 [drm]()
Feb 2 21:14:15 crion86 kernel: [ 29.097166] Modules linked in: cfg80211(E) bnep(E) cpufreq_stats(E) cpufreq_powersave(E) cpufreq_userspace(E) cpufreq_conservative(E) nfsd(E) auth_rpcgss(E) nfs_acl(E) lockd(E) grace(E) sunrpc(E) joydev(E) coretemp(E) snd_hda_codec_generic(E) kvm_intel(E) kvm(E) i8k(E) evdev(E) cdc_ether(E) btusb(E) bluetooth(E) psmouse(E) rfkill(E) serio_raw(E) snd_pcsp(E) usbnet(E) xhci_pci(E) xhci_hcd(E) cdc_wdm(E) cdc_acm(E) snd_hda_intel(E) i2c_i801(E) i915(E) lpc_ich(E) mfd_core(E) snd_hda_controller(E) snd_hda_codec(E) snd_hwdep(E) snd_pcm(E) snd_timer(E) drm_kms_helper(E) drm(E) battery(E) button(E) video(E) ac(E) snd(E) soundcore(E) i2c_algo_bit(E) i2c_core(E) acpi_cpufreq(E) processor(E) fuse(E) parport_pc(E) ppdev(E) lp(E) parport(E) autofs4(E) ext4(E) crc16(E) jbd2(E) mbcache(E) sd_mod(E) ata_generic(E) ahci(E) libahci(E) ata_piix(E) libata(E) scsi_mod(E) sdhci_pci(E) firewire_ohci(E) sdhci(E) b44(E) firewire_core(E) crc_itu_t(E) mii(E) ssb(E) mmc_core(E) libphy(E) ehci_pci(E) thermal(E) thermal_sys(E) uhci_hcd(E) ehci_hcd(E) usbcore(E) usb_common(E)
[ 29.097233] CPU: 1 PID: 19 Comm: kworker/1:1 Tainted: G W E 3.19.0-rc6-next-20150202-150201- #4
[ 29.097235] Hardware name: Dell Inc. Vostro 1500 /0NX907, BIOS A06 04/21/2008
[ 29.097244] Workqueue: events output_poll_execute [drm_kms_helper]
[ 29.097246] 0000000000000000 0000000000000009 ffffffff813e790a 0000000000000000
[ 29.097249] ffffffff8104178e ffff880037bae3c0 ffffffffa044656d 0000000000000000
[ 29.097252] ffff88019718f840 0000000000000500 0000000000000400 0000000000000000
[ 29.097256] Call Trace:
[ 29.097261] [<ffffffff813e790a>] ? dump_stack+0x4a/0x74
[ 29.097266] [<ffffffff8104178e>] ? warn_slowpath_common+0x9d/0xb5
[ 29.097280] [<ffffffffa044656d>] ? drm_framebuffer_reference+0x5b/0x64 [drm]
[ 29.097294] [<ffffffffa044656d>] ? drm_framebuffer_reference+0x5b/0x64 [drm]
[ 29.097333] [<ffffffffa05b01fc>] ? intel_plane_duplicate_state+0x4d/0x69 [i915]
[ 29.097340] [<ffffffffa04a7b22>] ? drm_plane_helper_update+0x61/0xff [drm_kms_helper]
[ 29.097365] [<ffffffffa059af84>] ? __intel_set_mode+0x796/0x89a [i915]
[ 29.097390] [<ffffffffa059fc43>] ? intel_set_mode+0x6e/0x8f [i915]
[ 29.097414] [<ffffffffa05a00b7>] ? intel_get_load_detect_pipe+0x382/0x420 [i915]
[ 29.097443] [<ffffffffa05caa8c>] ? intel_tv_detect+0x116/0x43d [i915]
[ 29.097447] [<ffffffff81088f64>] ? del_timer_sync+0x25/0x42
[ 29.097455] [<ffffffffa04a6f5b>] ? drm_helper_probe_single_connector_modes_merge_bits+0xc7/0x392 [drm_kms_helper]
[ 29.097463] [<ffffffffa04adfa2>] ? drm_fb_helper_probe_connector_modes+0x3d/0x51 [drm_kms_helper]
[ 29.097471] [<ffffffffa04afa25>] ? drm_fb_helper_hotplug_event+0x75/0xac [drm_kms_helper]
[ 29.097477] [<ffffffffa04a6d85>] ? output_poll_execute+0x125/0x154 [drm_kms_helper]
[ 29.097480] [<ffffffff810543c6>] ? process_one_work+0x171/0x28e
[ 29.097483] [<ffffffff810546b2>] ? worker_thread+0x1a5/0x272
[ 29.097486] [<ffffffff8105450d>] ? process_scheduled_works+0x2a/0x2a
[ 29.097489] [<ffffffff8105812d>] ? kthread+0x9e/0xa6
[ 29.097492] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c
[ 29.097496] [<ffffffff813ec2ac>] ? ret_from_fork+0x7c/0xb0
[ 29.097499] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c


--
Best regards,
Andrey Skvortsov

PGP Key ID: 0x57A3AEAD


2015-02-03 20:22:05

by Chris Wilson

[permalink] [raw]
Subject: Re: [Intel-gfx] [regression in linux-next] i915: broken graphics on laptop

On Tue, Feb 03, 2015 at 10:15:47PM +0300, Andrey Skvortsov wrote:
> Hi,
>
> tested next-20150202. System boots, but graphic output is broken (empty black screen).
> Booted five times the same kernel, always got the same result. The system works with 3.19-rc7.

Those two warnings are more or less symptoms of the black screen (well
the first is just overzealous). More important would be the drm.debug=6
dmesg from boot along with the gdm.log (or equivalent) aned Xorg.0.log
as my guess is that X (or the display server) is crashing.
-Chris

--
Chris Wilson, Intel Open Source Technology Centre

2015-02-04 18:26:46

by Andrey Skvortsov

[permalink] [raw]
Subject: Re: [Intel-gfx] [regression in linux-next] i915: broken graphics on laptop

On Tue, Feb 03, 2015 at 08:21:52PM +0000, Chris Wilson wrote:
> On Tue, Feb 03, 2015 at 10:15:47PM +0300, Andrey Skvortsov wrote:
> > Hi,
> >
> > tested next-20150202. System boots, but graphic output is broken (empty black screen).
> > Booted five times the same kernel, always got the same result. The system works with 3.19-rc7.
>
> Those two warnings are more or less symptoms of the black screen (well
> the first is just overzealous). More important would be the drm.debug=6
> dmesg from boot along with the gdm.log (or equivalent) aned Xorg.0.log
> as my guess is that X (or the display server) is crashing.

Requested logs with drm.debug=6 are attached. lightdm was running after WARN_ON, but I couldn't restart it.
The command hanged.

As I booted next-20150202 system crashed several times with a lot of drm_ calls in the backtrace, but I couldn't catch kernel logs,
because I have not serial port on the laptop.

If you need to get other information or to test patches, I would be glad to help.

--
Best regards,
Andrey Skvortsov

PGP Key ID: 0x57A3AEAD


Attachments:
(No filename) (1.04 kB)
dmesg.log (177.71 kB)
lightdm.log (1.98 kB)
Xorg.0.log (6.92 kB)
Download all attachments

2015-02-04 18:42:28

by Chris Wilson

[permalink] [raw]
Subject: Re: [Intel-gfx] [regression in linux-next] i915: broken graphics on laptop

On Wed, Feb 04, 2015 at 09:26:27PM +0300, Andrey Skvortsov wrote:
> On Tue, Feb 03, 2015 at 08:21:52PM +0000, Chris Wilson wrote:
> > On Tue, Feb 03, 2015 at 10:15:47PM +0300, Andrey Skvortsov wrote:
> > > Hi,
> > >
> > > tested next-20150202. System boots, but graphic output is broken (empty black screen).
> > > Booted five times the same kernel, always got the same result. The system works with 3.19-rc7.
> >
> > Those two warnings are more or less symptoms of the black screen (well
> > the first is just overzealous). More important would be the drm.debug=6
> > dmesg from boot along with the gdm.log (or equivalent) aned Xorg.0.log
> > as my guess is that X (or the display server) is crashing.
>
> Requested logs with drm.debug=6 are attached. lightdm was running after WARN_ON, but I couldn't restart it.
> The command hanged.
>
> As I booted next-20150202 system crashed several times with a lot of drm_ calls in the backtrace, but I couldn't catch kernel logs,
> because I have not serial port on the laptop.
>
> If you need to get other information or to test patches, I would be glad to help.

Right, here it looks like it freezing in intel_get_load_detect_pipe()
during the initial configuration probe of X. Given the other crashes,
we're back to worring about memory corruption.

> [ 29.292333] [drm:intel_tv_detect] [CONNECTOR:33:SVIDEO-1] force=1
> [ 29.292336] [drm:intel_get_load_detect_pipe] [CONNECTOR:33:SVIDEO-1], [ENCODER:34:TV-34]
> [ 29.292339] [drm:intel_get_load_detect_pipe] creating tmp fb for load-detection
> [ 29.292396] [drm:intel_modeset_affected_pipes] set mode pipe masks: modeset: 1, prepare: 1, disable: 0
> [ 29.292408] [drm:connected_sink_compute_bpp] [CONNECTOR:33:SVIDEO-1] checking for sink bpp constrains
> [ 29.292413] [drm:intel_tv_compute_config] forcing bpc to 8 for TV
> [ 29.292416] [drm:intel_modeset_pipe_config] plane bpp: 24, pipe bpp: 24, dithering: 0
> [ 29.292418] [drm:intel_dump_pipe_config] [CRTC:20][modeset] config for pipe A
> [ 29.292419] [drm:intel_dump_pipe_config] cpu_transcoder: A
> [ 29.292421] [drm:intel_dump_pipe_config] pipe bpp: 24, dithering: 0
> [ 29.292423] [drm:intel_dump_pipe_config] fdi/pch: 0, lanes: 0, gmch_m: 0, gmch_n: 0, link_m: 0, link_n: 0, tu: 0
> [ 29.292425] [drm:intel_dump_pipe_config] dp: 0, gmch_m: 0, gmch_n: 0, link_m: 0, link_n: 0, tu: 0
> [ 29.292428] [drm:intel_dump_pipe_config] dp: 0, gmch_m2: 0, gmch_n2: 0, link_m2: 0, link_n2: 0, tu2: 0
> [ 29.292429] [drm:intel_dump_pipe_config] audio: 0, infoframes: 0
> [ 29.292431] [drm:intel_dump_pipe_config] requested mode:
> [ 29.292433] [drm:drm_mode_debug_printmodeline] Modeline 0:"NTSC 480i" 0 107520 1280 1368 1496 1712 1024 1027 1034 1104 0x40 0x0
> [ 29.292435] [drm:intel_dump_pipe_config] adjusted mode:
> [ 29.292438] [drm:drm_mode_debug_printmodeline] Modeline 0:"NTSC 480i" 0 107520 1280 1368 1496 1712 1024 1027 1034 1104 0x40 0x0
> [ 29.292440] [drm:intel_dump_crtc_timings] crtc timings: 108000 1280 1368 1496 1712 1024 1027 1034 1104, type: 0x40 flags: 0x0
> [ 29.292442] [drm:intel_dump_pipe_config] port clock: 108000
> [ 29.292444] [drm:intel_dump_pipe_config] pipe src size: 1280x1024
> [ 29.292446] [drm:intel_dump_pipe_config] gmch pfit: control: 0x00000000, ratios: 0x00000000, lvds border: 0x00000000
> [ 29.292447] [drm:intel_dump_pipe_config] pch pfit: pos: 0x00000000, size: 0x00000000, disabled
> [ 29.292449] [drm:intel_dump_pipe_config] ips: 0
> [ 29.292451] [drm:intel_dump_pipe_config] double wide: 0
> [ 29.292565] ------------[ cut here ]------------
> [ 29.293785] WARNING: CPU: 0 PID: 53 at include/linux/kref.h:47 drm_framebuffer_reference+0x5b/0x64 [drm]()
> [ 29.295032] Modules linked in: bnep(E) cfg80211(E) cpufreq_stats(E) cpufreq_powersave(E) cpufreq_userspace(E) cpufreq_conservative(E) nfsd(E) auth_rpcgss(E) nfs_acl(E) lockd(E) grace(E) sunrpc(E) cdc_wdm(E) cdc_acm(E) cdc_ether(E) usbnet(E) joydev(E) coretemp(E) kvm_intel(E) kvm(E) i8k(E) btusb(E) psmouse(E) snd_pcsp(E) i915(E) evdev(E) bluetooth(E) i2c_i801(E) snd_hda_codec_generic(E) lpc_ich(E) mfd_core(E) xhci_pci(E) xhci_hcd(E) serio_raw(E) rfkill(E) drm_kms_helper(E) drm(E) i2c_algo_bit(E) i2c_core(E) snd_hda_intel(E) snd_hda_controller(E) snd_hda_codec(E) button(E) snd_hwdep(E) battery(E) snd_pcm(E) snd_timer(E) snd(E) soundcore(E) video(E) ac(E) acpi_cpufreq(E) processor(E) fuse(E) parport_pc(E) ppdev(E) lp(E) parport(E) autofs4(E) ext4(E) crc16(E) jbd2(E) mbcache(E) sd_mod(E) ata_generic(E)
> [ 29.295080] ahci(E) libahci(E) ata_piix(E) libata(E) scsi_mod(E) b44(E) firewire_ohci(E) sdhci_pci(E) sdhci(E) firewire_core(E) crc_itu_t(E) mii(E) ssb(E) mmc_core(E) libphy(E) uhci_hcd(E) ehci_pci(E) ehci_hcd(E) thermal(E) thermal_sys(E) usbcore(E) usb_common(E)
> [ 29.296301] CPU: 0 PID: 53 Comm: kworker/0:3 Tainted: G W E 3.19.0-rc6-next-20150202-150201- #4
> [ 29.296303] Hardware name: Dell Inc. Vostro 1500 /0NX907, BIOS A06 04/21/2008
> [ 29.296314] Workqueue: events output_poll_execute [drm_kms_helper]
> [ 29.296316] 0000000000000000 0000000000000009 ffffffff813e790a 0000000000000000
> [ 29.296319] ffffffff8104178e ffff880197a38e40 ffffffffa04eb56d 0000000000000000
> [ 29.296323] ffff880195844d80 0000000000000500 0000000000000400 0000000000000000
> [ 29.296326] Call Trace:
> [ 29.296332] [<ffffffff813e790a>] ? dump_stack+0x4a/0x74
> [ 29.296337] [<ffffffff8104178e>] ? warn_slowpath_common+0x9d/0xb5
> [ 29.296354] [<ffffffffa04eb56d>] ? drm_framebuffer_reference+0x5b/0x64 [drm]
> [ 29.296368] [<ffffffffa04eb56d>] ? drm_framebuffer_reference+0x5b/0x64 [drm]
> [ 29.296408] [<ffffffffa07e01fc>] ? intel_plane_duplicate_state+0x4d/0x69 [i915]
> [ 29.296415] [<ffffffffa054cb22>] ? drm_plane_helper_update+0x61/0xff [drm_kms_helper]
> [ 29.296439] [<ffffffffa07caf84>] ? __intel_set_mode+0x796/0x89a [i915]
> [ 29.296464] [<ffffffffa07cfc43>] ? intel_set_mode+0x6e/0x8f [i915]
> [ 29.296489] [<ffffffffa07d00b7>] ? intel_get_load_detect_pipe+0x382/0x420 [i915]
> [ 29.296517] [<ffffffffa07faa8c>] ? intel_tv_detect+0x116/0x43d [i915]
> [ 29.296522] [<ffffffff8105f8ed>] ? preempt_count_sub+0xab/0xca
> [ 29.296529] [<ffffffffa054bf5b>] ? drm_helper_probe_single_connector_modes_merge_bits+0xc7/0x392 [drm_kms_helper]
> [ 29.296538] [<ffffffffa0552fa2>] ? drm_fb_helper_probe_connector_modes+0x3d/0x51 [drm_kms_helper]
> [ 29.296545] [<ffffffffa0554a25>] ? drm_fb_helper_hotplug_event+0x75/0xac [drm_kms_helper]
> [ 29.296551] [<ffffffffa054bd85>] ? output_poll_execute+0x125/0x154 [drm_kms_helper]
> [ 29.296555] [<ffffffff810543c6>] ? process_one_work+0x171/0x28e
> [ 29.296558] [<ffffffff810546b2>] ? worker_thread+0x1a5/0x272
> [ 29.296560] [<ffffffff8105450d>] ? process_scheduled_works+0x2a/0x2a
> [ 29.296564] [<ffffffff8105812d>] ? kthread+0x9e/0xa6
> [ 29.296567] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c
> [ 29.296571] [<ffffffff813ec2ac>] ? ret_from_fork+0x7c/0xb0
> [ 29.296574] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c
> [ 29.296576] ---[ end trace 4742dbfffee243fc ]---

Which makes me wonder whether this is not the more significant warning?
-Chris

--
Chris Wilson, Intel Open Source Technology Centre

2015-02-23 20:10:59

by Andrey Skvortsov

[permalink] [raw]
Subject: [Intel-gfx] [Regression] WARNING: drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object


> > > Those two warnings are more or less symptoms of the black screen (well
> > > the first is just overzealous). More important would be the drm.debug=6
> > > dmesg from boot along with the gdm.log (or equivalent) aned Xorg.0.log
> > > as my guess is that X (or the display server) is crashing.
> >
> > Requested logs with drm.debug=6 are attached. lightdm was running after WARN_ON, but I couldn't restart it.
> > The command hanged.
> >
> > As I booted next-20150202 system crashed several times with a lot of drm_ calls in the backtrace, but I couldn't catch kernel logs,
> > because I have not serial port on the laptop.
> >
> > If you need to get other information or to test patches, I would be glad to help.
>
> Right, here it looks like it freezing in intel_get_load_detect_pipe()
> during the initial configuration probe of X. Given the other crashes,
> we're back to worring about memory corruption.
>
> > [ 29.292333] [drm:intel_tv_detect] [CONNECTOR:33:SVIDEO-1] force=1
> > [ 29.292336] [drm:intel_get_load_detect_pipe] [CONNECTOR:33:SVIDEO-1], [ENCODER:34:TV-34]
> > [ 29.292339] [drm:intel_get_load_detect_pipe] creating tmp fb for load-detection
> > [ 29.292396] [drm:intel_modeset_affected_pipes] set mode pipe masks: modeset: 1, prepare: 1, disable: 0
> > [ 29.292408] [drm:connected_sink_compute_bpp] [CONNECTOR:33:SVIDEO-1] checking for sink bpp constrains
> > [ 29.292413] [drm:intel_tv_compute_config] forcing bpc to 8 for TV
> > [ 29.292416] [drm:intel_modeset_pipe_config] plane bpp: 24, pipe bpp: 24, dithering: 0
> > [ 29.292418] [drm:intel_dump_pipe_config] [CRTC:20][modeset] config for pipe A
> > [ 29.292419] [drm:intel_dump_pipe_config] cpu_transcoder: A
> > [ 29.292421] [drm:intel_dump_pipe_config] pipe bpp: 24, dithering: 0
> > [ 29.292423] [drm:intel_dump_pipe_config] fdi/pch: 0, lanes: 0, gmch_m: 0, gmch_n: 0, link_m: 0, link_n: 0, tu: 0
> > [ 29.292425] [drm:intel_dump_pipe_config] dp: 0, gmch_m: 0, gmch_n: 0, link_m: 0, link_n: 0, tu: 0
> > [ 29.292428] [drm:intel_dump_pipe_config] dp: 0, gmch_m2: 0, gmch_n2: 0, link_m2: 0, link_n2: 0, tu2: 0
> > [ 29.292429] [drm:intel_dump_pipe_config] audio: 0, infoframes: 0
> > [ 29.292431] [drm:intel_dump_pipe_config] requested mode:
> > [ 29.292433] [drm:drm_mode_debug_printmodeline] Modeline 0:"NTSC 480i" 0 107520 1280 1368 1496 1712 1024 1027 1034 1104 0x40 0x0
> > [ 29.292435] [drm:intel_dump_pipe_config] adjusted mode:
> > [ 29.292438] [drm:drm_mode_debug_printmodeline] Modeline 0:"NTSC 480i" 0 107520 1280 1368 1496 1712 1024 1027 1034 1104 0x40 0x0
> > [ 29.292440] [drm:intel_dump_crtc_timings] crtc timings: 108000 1280 1368 1496 1712 1024 1027 1034 1104, type: 0x40 flags: 0x0
> > [ 29.292442] [drm:intel_dump_pipe_config] port clock: 108000
> > [ 29.292444] [drm:intel_dump_pipe_config] pipe src size: 1280x1024
> > [ 29.292446] [drm:intel_dump_pipe_config] gmch pfit: control: 0x00000000, ratios: 0x00000000, lvds border: 0x00000000
> > [ 29.292447] [drm:intel_dump_pipe_config] pch pfit: pos: 0x00000000, size: 0x00000000, disabled
> > [ 29.292449] [drm:intel_dump_pipe_config] ips: 0
> > [ 29.292451] [drm:intel_dump_pipe_config] double wide: 0
> > [ 29.292565] ------------[ cut here ]------------
> > [ 29.293785] WARNING: CPU: 0 PID: 53 at include/linux/kref.h:47 drm_framebuffer_reference+0x5b/0x64 [drm]()
> > [ 29.295032] Modules linked in: bnep(E) cfg80211(E) cpufreq_stats(E) cpufreq_powersave(E) cpufreq_userspace(E) cpufreq_conservative(E) nfsd(E) auth_rpcgss(E) nfs_acl(E) lockd(E) grace(E) sunrpc(E) cdc_wdm(E) cdc_acm(E) cdc_ether(E) usbnet(E) joydev(E) coretemp(E) kvm_intel(E) kvm(E) i8k(E) btusb(E) psmouse(E) snd_pcsp(E) i915(E) evdev(E) bluetooth(E) i2c_i801(E) snd_hda_codec_generic(E) lpc_ich(E) mfd_core(E) xhci_pci(E) xhci_hcd(E) serio_raw(E) rfkill(E) drm_kms_helper(E) drm(E) i2c_algo_bit(E) i2c_core(E) snd_hda_intel(E) snd_hda_controller(E) snd_hda_codec(E) button(E) snd_hwdep(E) battery(E) snd_pcm(E) snd_timer(E) snd(E) soundcore(E) video(E) ac(E) acpi_cpufreq(E) processor(E) fuse(E) parport_pc(E) ppdev(E) lp(E) parport(E) autofs4(E) ext4(E) crc16(E) jbd2(E) mbcache(E) sd_mod(E) ata_generic(E)
> > [ 29.295080] ahci(E) libahci(E) ata_piix(E) libata(E) scsi_mod(E) b44(E) firewire_ohci(E) sdhci_pci(E) sdhci(E) firewire_core(E) crc_itu_t(E) mii(E) ssb(E) mmc_core(E) libphy(E) uhci_hcd(E) ehci_pci(E) ehci_hcd(E) thermal(E) thermal_sys(E) usbcore(E) usb_common(E)
> > [ 29.296301] CPU: 0 PID: 53 Comm: kworker/0:3 Tainted: G W E 3.19.0-rc6-next-20150202-150201- #4
> > [ 29.296303] Hardware name: Dell Inc. Vostro 1500 /0NX907, BIOS A06 04/21/2008
> > [ 29.296314] Workqueue: events output_poll_execute [drm_kms_helper]
> > [ 29.296316] 0000000000000000 0000000000000009 ffffffff813e790a 0000000000000000
> > [ 29.296319] ffffffff8104178e ffff880197a38e40 ffffffffa04eb56d 0000000000000000
> > [ 29.296323] ffff880195844d80 0000000000000500 0000000000000400 0000000000000000
> > [ 29.296326] Call Trace:
> > [ 29.296332] [<ffffffff813e790a>] ? dump_stack+0x4a/0x74
> > [ 29.296337] [<ffffffff8104178e>] ? warn_slowpath_common+0x9d/0xb5
> > [ 29.296354] [<ffffffffa04eb56d>] ? drm_framebuffer_reference+0x5b/0x64 [drm]
> > [ 29.296368] [<ffffffffa04eb56d>] ? drm_framebuffer_reference+0x5b/0x64 [drm]
> > [ 29.296408] [<ffffffffa07e01fc>] ? intel_plane_duplicate_state+0x4d/0x69 [i915]
> > [ 29.296415] [<ffffffffa054cb22>] ? drm_plane_helper_update+0x61/0xff [drm_kms_helper]
> > [ 29.296439] [<ffffffffa07caf84>] ? __intel_set_mode+0x796/0x89a [i915]
> > [ 29.296464] [<ffffffffa07cfc43>] ? intel_set_mode+0x6e/0x8f [i915]
> > [ 29.296489] [<ffffffffa07d00b7>] ? intel_get_load_detect_pipe+0x382/0x420 [i915]
> > [ 29.296517] [<ffffffffa07faa8c>] ? intel_tv_detect+0x116/0x43d [i915]
> > [ 29.296522] [<ffffffff8105f8ed>] ? preempt_count_sub+0xab/0xca
> > [ 29.296529] [<ffffffffa054bf5b>] ? drm_helper_probe_single_connector_modes_merge_bits+0xc7/0x392 [drm_kms_helper]
> > [ 29.296538] [<ffffffffa0552fa2>] ? drm_fb_helper_probe_connector_modes+0x3d/0x51 [drm_kms_helper]
> > [ 29.296545] [<ffffffffa0554a25>] ? drm_fb_helper_hotplug_event+0x75/0xac [drm_kms_helper]
> > [ 29.296551] [<ffffffffa054bd85>] ? output_poll_execute+0x125/0x154 [drm_kms_helper]
> > [ 29.296555] [<ffffffff810543c6>] ? process_one_work+0x171/0x28e
> > [ 29.296558] [<ffffffff810546b2>] ? worker_thread+0x1a5/0x272
> > [ 29.296560] [<ffffffff8105450d>] ? process_scheduled_works+0x2a/0x2a
> > [ 29.296564] [<ffffffff8105812d>] ? kthread+0x9e/0xa6
> > [ 29.296567] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c
> > [ 29.296571] [<ffffffff813ec2ac>] ? ret_from_fork+0x7c/0xb0
> > [ 29.296574] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c
> > [ 29.296576] ---[ end trace 4742dbfffee243fc ]---
>
> Which makes me wonder whether this is not the more significant warning?
> -Chris

Hi,

This warning is moved from linux-next to v4.0-rc1 now. After system boot is just a black screen.
I ssh'ed into the machine and saved the log. I attached updated dmesg.log with drm.debug=6. Hopefully it helps.
If you need any other debug information, traces, core dump or something else. Feel free to ask.

--
Best regards,
Andrey Skvortsov

PGP Key ID: 0x57A3AEAD


Attachments:
(No filename) (7.18 kB)
signature.asc (819.00 B)
Digital signature
Download all attachments

2015-02-23 20:12:58

by Andrey Skvortsov

[permalink] [raw]
Subject: [Intel-gfx] [Regression] WARNING: drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object


> > > Those two warnings are more or less symptoms of the black screen (well
> > > the first is just overzealous). More important would be the drm.debug=6
> > > dmesg from boot along with the gdm.log (or equivalent) aned Xorg.0.log
> > > as my guess is that X (or the display server) is crashing.
> >
> > Requested logs with drm.debug=6 are attached. lightdm was running after WARN_ON, but I couldn't restart it.
> > The command hanged.
> >
> > As I booted next-20150202 system crashed several times with a lot of drm_ calls in the backtrace, but I couldn't catch kernel logs,
> > because I have not serial port on the laptop.
> >
> > If you need to get other information or to test patches, I would be glad to help.
>
> Right, here it looks like it freezing in intel_get_load_detect_pipe()
> during the initial configuration probe of X. Given the other crashes,
> we're back to worring about memory corruption.
>
> > [ 29.292333] [drm:intel_tv_detect] [CONNECTOR:33:SVIDEO-1] force=1
> > [ 29.292336] [drm:intel_get_load_detect_pipe] [CONNECTOR:33:SVIDEO-1], [ENCODER:34:TV-34]
> > [ 29.292339] [drm:intel_get_load_detect_pipe] creating tmp fb for load-detection
> > [ 29.292396] [drm:intel_modeset_affected_pipes] set mode pipe masks: modeset: 1, prepare: 1, disable: 0
> > [ 29.292408] [drm:connected_sink_compute_bpp] [CONNECTOR:33:SVIDEO-1] checking for sink bpp constrains
> > [ 29.292413] [drm:intel_tv_compute_config] forcing bpc to 8 for TV
> > [ 29.292416] [drm:intel_modeset_pipe_config] plane bpp: 24, pipe bpp: 24, dithering: 0
> > [ 29.292418] [drm:intel_dump_pipe_config] [CRTC:20][modeset] config for pipe A
> > [ 29.292419] [drm:intel_dump_pipe_config] cpu_transcoder: A
> > [ 29.292421] [drm:intel_dump_pipe_config] pipe bpp: 24, dithering: 0
> > [ 29.292423] [drm:intel_dump_pipe_config] fdi/pch: 0, lanes: 0, gmch_m: 0, gmch_n: 0, link_m: 0, link_n: 0, tu: 0
> > [ 29.292425] [drm:intel_dump_pipe_config] dp: 0, gmch_m: 0, gmch_n: 0, link_m: 0, link_n: 0, tu: 0
> > [ 29.292428] [drm:intel_dump_pipe_config] dp: 0, gmch_m2: 0, gmch_n2: 0, link_m2: 0, link_n2: 0, tu2: 0
> > [ 29.292429] [drm:intel_dump_pipe_config] audio: 0, infoframes: 0
> > [ 29.292431] [drm:intel_dump_pipe_config] requested mode:
> > [ 29.292433] [drm:drm_mode_debug_printmodeline] Modeline 0:"NTSC 480i" 0 107520 1280 1368 1496 1712 1024 1027 1034 1104 0x40 0x0
> > [ 29.292435] [drm:intel_dump_pipe_config] adjusted mode:
> > [ 29.292438] [drm:drm_mode_debug_printmodeline] Modeline 0:"NTSC 480i" 0 107520 1280 1368 1496 1712 1024 1027 1034 1104 0x40 0x0
> > [ 29.292440] [drm:intel_dump_crtc_timings] crtc timings: 108000 1280 1368 1496 1712 1024 1027 1034 1104, type: 0x40 flags: 0x0
> > [ 29.292442] [drm:intel_dump_pipe_config] port clock: 108000
> > [ 29.292444] [drm:intel_dump_pipe_config] pipe src size: 1280x1024
> > [ 29.292446] [drm:intel_dump_pipe_config] gmch pfit: control: 0x00000000, ratios: 0x00000000, lvds border: 0x00000000
> > [ 29.292447] [drm:intel_dump_pipe_config] pch pfit: pos: 0x00000000, size: 0x00000000, disabled
> > [ 29.292449] [drm:intel_dump_pipe_config] ips: 0
> > [ 29.292451] [drm:intel_dump_pipe_config] double wide: 0
> > [ 29.292565] ------------[ cut here ]------------
> > [ 29.293785] WARNING: CPU: 0 PID: 53 at include/linux/kref.h:47 drm_framebuffer_reference+0x5b/0x64 [drm]()
> > [ 29.295032] Modules linked in: bnep(E) cfg80211(E) cpufreq_stats(E) cpufreq_powersave(E) cpufreq_userspace(E) cpufreq_conservative(E) nfsd(E) auth_rpcgss(E) nfs_acl(E) lockd(E) grace(E) sunrpc(E) cdc_wdm(E) cdc_acm(E) cdc_ether(E) usbnet(E) joydev(E) coretemp(E) kvm_intel(E) kvm(E) i8k(E) btusb(E) psmouse(E) snd_pcsp(E) i915(E) evdev(E) bluetooth(E) i2c_i801(E) snd_hda_codec_generic(E) lpc_ich(E) mfd_core(E) xhci_pci(E) xhci_hcd(E) serio_raw(E) rfkill(E) drm_kms_helper(E) drm(E) i2c_algo_bit(E) i2c_core(E) snd_hda_intel(E) snd_hda_controller(E) snd_hda_codec(E) button(E) snd_hwdep(E) battery(E) snd_pcm(E) snd_timer(E) snd(E) soundcore(E) video(E) ac(E) acpi_cpufreq(E) processor(E) fuse(E) parport_pc(E) ppdev(E) lp(E) parport(E) autofs4(E) ext4(E) crc16(E) jbd2(E) mbcache(E) sd_mod(E) ata_generic(E)
> > [ 29.295080] ahci(E) libahci(E) ata_piix(E) libata(E) scsi_mod(E) b44(E) firewire_ohci(E) sdhci_pci(E) sdhci(E) firewire_core(E) crc_itu_t(E) mii(E) ssb(E) mmc_core(E) libphy(E) uhci_hcd(E) ehci_pci(E) ehci_hcd(E) thermal(E) thermal_sys(E) usbcore(E) usb_common(E)
> > [ 29.296301] CPU: 0 PID: 53 Comm: kworker/0:3 Tainted: G W E 3.19.0-rc6-next-20150202-150201- #4
> > [ 29.296303] Hardware name: Dell Inc. Vostro 1500 /0NX907, BIOS A06 04/21/2008
> > [ 29.296314] Workqueue: events output_poll_execute [drm_kms_helper]
> > [ 29.296316] 0000000000000000 0000000000000009 ffffffff813e790a 0000000000000000
> > [ 29.296319] ffffffff8104178e ffff880197a38e40 ffffffffa04eb56d 0000000000000000
> > [ 29.296323] ffff880195844d80 0000000000000500 0000000000000400 0000000000000000
> > [ 29.296326] Call Trace:
> > [ 29.296332] [<ffffffff813e790a>] ? dump_stack+0x4a/0x74
> > [ 29.296337] [<ffffffff8104178e>] ? warn_slowpath_common+0x9d/0xb5
> > [ 29.296354] [<ffffffffa04eb56d>] ? drm_framebuffer_reference+0x5b/0x64 [drm]
> > [ 29.296368] [<ffffffffa04eb56d>] ? drm_framebuffer_reference+0x5b/0x64 [drm]
> > [ 29.296408] [<ffffffffa07e01fc>] ? intel_plane_duplicate_state+0x4d/0x69 [i915]
> > [ 29.296415] [<ffffffffa054cb22>] ? drm_plane_helper_update+0x61/0xff [drm_kms_helper]
> > [ 29.296439] [<ffffffffa07caf84>] ? __intel_set_mode+0x796/0x89a [i915]
> > [ 29.296464] [<ffffffffa07cfc43>] ? intel_set_mode+0x6e/0x8f [i915]
> > [ 29.296489] [<ffffffffa07d00b7>] ? intel_get_load_detect_pipe+0x382/0x420 [i915]
> > [ 29.296517] [<ffffffffa07faa8c>] ? intel_tv_detect+0x116/0x43d [i915]
> > [ 29.296522] [<ffffffff8105f8ed>] ? preempt_count_sub+0xab/0xca
> > [ 29.296529] [<ffffffffa054bf5b>] ? drm_helper_probe_single_connector_modes_merge_bits+0xc7/0x392 [drm_kms_helper]
> > [ 29.296538] [<ffffffffa0552fa2>] ? drm_fb_helper_probe_connector_modes+0x3d/0x51 [drm_kms_helper]
> > [ 29.296545] [<ffffffffa0554a25>] ? drm_fb_helper_hotplug_event+0x75/0xac [drm_kms_helper]
> > [ 29.296551] [<ffffffffa054bd85>] ? output_poll_execute+0x125/0x154 [drm_kms_helper]
> > [ 29.296555] [<ffffffff810543c6>] ? process_one_work+0x171/0x28e
> > [ 29.296558] [<ffffffff810546b2>] ? worker_thread+0x1a5/0x272
> > [ 29.296560] [<ffffffff8105450d>] ? process_scheduled_works+0x2a/0x2a
> > [ 29.296564] [<ffffffff8105812d>] ? kthread+0x9e/0xa6
> > [ 29.296567] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c
> > [ 29.296571] [<ffffffff813ec2ac>] ? ret_from_fork+0x7c/0xb0
> > [ 29.296574] [<ffffffff8105808f>] ? __kthread_parkme+0x5c/0x5c
> > [ 29.296576] ---[ end trace 4742dbfffee243fc ]---
>
> Which makes me wonder whether this is not the more significant warning?
> -Chris

Hi,

This warning is moved from linux-next to v4.0-rc1 now. After system boot is just a black screen.
I ssh'ed into the machine and saved the log. I attached updated dmesg.log with drm.debug=6. Hopefully it helps.
If you need any other debug information, traces, core dump or something else. Feel free to ask.

Sorry, forgot to attach log to the previous mail.

--
Best regards,
Andrey Skvortsov

PGP Key ID: 0x57A3AEAD


Attachments:
(No filename) (0.00 B)
signature.asc (819.00 B)
Digital signature
Download all attachments

2015-02-23 21:20:45

by Chris Wilson

[permalink] [raw]
Subject: Re: [Intel-gfx] [Regression] WARNING: drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object

On Mon, Feb 23, 2015 at 11:12:39PM +0300, Andrey Skvortsov wrote:
> Hi,
>
> This warning is moved from linux-next to v4.0-rc1 now. After system boot is just a black screen.
> I ssh'ed into the machine and saved the log. I attached updated dmesg.log with drm.debug=6. Hopefully it helps.
> If you need any other debug information, traces, core dump or something else. Feel free to ask.

The warning from free_object is annoying (and quite possibly dangerous),
but the actual hang during boot is:

[ 243.876375] INFO: task Xorg:2422 blocked for more than 120 seconds.
[ 243.876382] Tainted: G W E 4.0.0-rc1-150223- #2
[ 243.876388] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 243.876393] Xorg D ffff88019fc12dc0 0 2422 2180 0x00400000
[ 243.876404] ffff8800dabfe1a0 0000000000000002 ffff880194537fd8 ffff880194537ba0
[ 243.876416] ffff8800dab9e22c ffff8800dabfe1a0 ffff8800dab9e230 00000000ffffffff
[ 243.876426] ffffffff813e2479 ffff8800dab9e228 ffffffff813e26a7 0000000000000000
[ 243.876438] Call Trace:
[ 243.876449] [<ffffffff813e2479>] ? schedule+0x6f/0x7c
[ 243.876459] [<ffffffff813e26a7>] ? schedule_preempt_disabled+0x15/0x21
[ 243.876469] [<ffffffff813e3347>] ? __ww_mutex_lock_slowpath+0xdf/0x1c2
[ 243.876480] [<ffffffff813e3446>] ? __ww_mutex_lock+0x1c/0x93
[ 243.876541] [<ffffffffa050e70d>] ? modeset_lock+0x8f/0xf2 [drm]
[ 243.876632] [<ffffffffa09aa0b9>] ? intel_get_load_detect_pipe+0x80/0x427 [i915]
[ 243.876674] [<ffffffffa04fd42f>] ? drm_ut_debug_printk+0x5e/0x63 [drm]
[ 243.876771] [<ffffffffa09d4661>] ? intel_tv_detect+0x115/0x43a [i915]
[ 243.876783] [<ffffffff810608d9>] ? preempt_count_sub+0xbf/0xca
[ 243.876809] [<ffffffffa05d6f24>] ? drm_helper_probe_single_connector_modes_merge_bits+0xc6/0x38d [drm_kms_helper]
[ 243.876860] [<ffffffffa0505b5d>] ? drm_mode_getconnector+0xf4/0x2ac [drm]
[ 243.876900] [<ffffffffa04fa911>] ? drm_ioctl+0x338/0x3c5 [drm]
[ 243.876949] [<ffffffffa0505a69>] ? drm_mode_getcrtc+0xb3/0xb3 [drm]
[ 243.876961] [<ffffffff81167deb>] ? fsnotify+0x314/0x35d
[ 243.876973] [<ffffffff811487be>] ? do_vfs_ioctl+0x379/0x431
[ 243.876983] [<ffffffff811488cc>] ? SyS_ioctl+0x56/0x7c
[ 243.876994] [<ffffffff813e5152>] ? system_call_fastpath+0x12/0x17

i.e. it is a mutex deadlock inside tv detect. Daniel does that make sense?
-Chris

--
Chris Wilson, Intel Open Source Technology Centre

2015-02-23 23:41:10

by Daniel Vetter

[permalink] [raw]
Subject: Re: [Intel-gfx] [Regression] WARNING: drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object

On Mon, Feb 23, 2015 at 09:20:31PM +0000, Chris Wilson wrote:
> On Mon, Feb 23, 2015 at 11:12:39PM +0300, Andrey Skvortsov wrote:
> > Hi,
> >
> > This warning is moved from linux-next to v4.0-rc1 now. After system boot is just a black screen.
> > I ssh'ed into the machine and saved the log. I attached updated dmesg.log with drm.debug=6. Hopefully it helps.
> > If you need any other debug information, traces, core dump or something else. Feel free to ask.
>
> The warning from free_object is annoying (and quite possibly dangerous),
> but the actual hang during boot is:
>
> [ 243.876375] INFO: task Xorg:2422 blocked for more than 120 seconds.
> [ 243.876382] Tainted: G W E 4.0.0-rc1-150223- #2
> [ 243.876388] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [ 243.876393] Xorg D ffff88019fc12dc0 0 2422 2180 0x00400000
> [ 243.876404] ffff8800dabfe1a0 0000000000000002 ffff880194537fd8 ffff880194537ba0
> [ 243.876416] ffff8800dab9e22c ffff8800dabfe1a0 ffff8800dab9e230 00000000ffffffff
> [ 243.876426] ffffffff813e2479 ffff8800dab9e228 ffffffff813e26a7 0000000000000000
> [ 243.876438] Call Trace:
> [ 243.876449] [<ffffffff813e2479>] ? schedule+0x6f/0x7c
> [ 243.876459] [<ffffffff813e26a7>] ? schedule_preempt_disabled+0x15/0x21
> [ 243.876469] [<ffffffff813e3347>] ? __ww_mutex_lock_slowpath+0xdf/0x1c2
> [ 243.876480] [<ffffffff813e3446>] ? __ww_mutex_lock+0x1c/0x93
> [ 243.876541] [<ffffffffa050e70d>] ? modeset_lock+0x8f/0xf2 [drm]
> [ 243.876632] [<ffffffffa09aa0b9>] ? intel_get_load_detect_pipe+0x80/0x427 [i915]
> [ 243.876674] [<ffffffffa04fd42f>] ? drm_ut_debug_printk+0x5e/0x63 [drm]
> [ 243.876771] [<ffffffffa09d4661>] ? intel_tv_detect+0x115/0x43a [i915]
> [ 243.876783] [<ffffffff810608d9>] ? preempt_count_sub+0xbf/0xca
> [ 243.876809] [<ffffffffa05d6f24>] ? drm_helper_probe_single_connector_modes_merge_bits+0xc6/0x38d [drm_kms_helper]
> [ 243.876860] [<ffffffffa0505b5d>] ? drm_mode_getconnector+0xf4/0x2ac [drm]
> [ 243.876900] [<ffffffffa04fa911>] ? drm_ioctl+0x338/0x3c5 [drm]
> [ 243.876949] [<ffffffffa0505a69>] ? drm_mode_getcrtc+0xb3/0xb3 [drm]
> [ 243.876961] [<ffffffff81167deb>] ? fsnotify+0x314/0x35d
> [ 243.876973] [<ffffffff811487be>] ? do_vfs_ioctl+0x379/0x431
> [ 243.876983] [<ffffffff811488cc>] ? SyS_ioctl+0x56/0x7c
> [ 243.876994] [<ffffffff813e5152>] ? system_call_fastpath+0x12/0x17
>
> i.e. it is a mutex deadlock inside tv detect. Daniel does that make sense?

Botch locking rework for atomic. Fix is

https://patchwork.kernel.org/patch/5861631/

and will land as soon as an affected user has provided a tested-by.
Andrey, can you pls give this a spin?

Thanks, Daniel
--
Daniel Vetter
Software Engineer, Intel Corporation
+41 (0) 79 365 57 48 - http://blog.ffwll.ch

2015-02-28 12:20:52

by Andrey Skvortsov

[permalink] [raw]
Subject: Re: [Intel-gfx] [Regression] WARNING: drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object

On 24 Feb, Daniel Vetter wrote:
> On Mon, Feb 23, 2015 at 09:20:31PM +0000, Chris Wilson wrote:
> > On Mon, Feb 23, 2015 at 11:12:39PM +0300, Andrey Skvortsov wrote:
> > > Hi,
> > >
> > > This warning is moved from linux-next to v4.0-rc1 now. After system boot is just a black screen.
> > > I ssh'ed into the machine and saved the log. I attached updated dmesg.log with drm.debug=6. Hopefully it helps.
> > > If you need any other debug information, traces, core dump or something else. Feel free to ask.
> >
> > The warning from free_object is annoying (and quite possibly dangerous),
> > but the actual hang during boot is:
> >
> > [ 243.876375] INFO: task Xorg:2422 blocked for more than 120 seconds.
> > [ 243.876382] Tainted: G W E 4.0.0-rc1-150223- #2
> > [ 243.876388] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> > [ 243.876393] Xorg D ffff88019fc12dc0 0 2422 2180 0x00400000
> > [ 243.876404] ffff8800dabfe1a0 0000000000000002 ffff880194537fd8 ffff880194537ba0
> > [ 243.876416] ffff8800dab9e22c ffff8800dabfe1a0 ffff8800dab9e230 00000000ffffffff
> > [ 243.876426] ffffffff813e2479 ffff8800dab9e228 ffffffff813e26a7 0000000000000000
> > [ 243.876438] Call Trace:
> > [ 243.876449] [<ffffffff813e2479>] ? schedule+0x6f/0x7c
> > [ 243.876459] [<ffffffff813e26a7>] ? schedule_preempt_disabled+0x15/0x21
> > [ 243.876469] [<ffffffff813e3347>] ? __ww_mutex_lock_slowpath+0xdf/0x1c2
> > [ 243.876480] [<ffffffff813e3446>] ? __ww_mutex_lock+0x1c/0x93
> > [ 243.876541] [<ffffffffa050e70d>] ? modeset_lock+0x8f/0xf2 [drm]
> > [ 243.876632] [<ffffffffa09aa0b9>] ? intel_get_load_detect_pipe+0x80/0x427 [i915]
> > [ 243.876674] [<ffffffffa04fd42f>] ? drm_ut_debug_printk+0x5e/0x63 [drm]
> > [ 243.876771] [<ffffffffa09d4661>] ? intel_tv_detect+0x115/0x43a [i915]
> > [ 243.876783] [<ffffffff810608d9>] ? preempt_count_sub+0xbf/0xca
> > [ 243.876809] [<ffffffffa05d6f24>] ? drm_helper_probe_single_connector_modes_merge_bits+0xc6/0x38d [drm_kms_helper]
> > [ 243.876860] [<ffffffffa0505b5d>] ? drm_mode_getconnector+0xf4/0x2ac [drm]
> > [ 243.876900] [<ffffffffa04fa911>] ? drm_ioctl+0x338/0x3c5 [drm]
> > [ 243.876949] [<ffffffffa0505a69>] ? drm_mode_getcrtc+0xb3/0xb3 [drm]
> > [ 243.876961] [<ffffffff81167deb>] ? fsnotify+0x314/0x35d
> > [ 243.876973] [<ffffffff811487be>] ? do_vfs_ioctl+0x379/0x431
> > [ 243.876983] [<ffffffff811488cc>] ? SyS_ioctl+0x56/0x7c
> > [ 243.876994] [<ffffffff813e5152>] ? system_call_fastpath+0x12/0x17
> >
> > i.e. it is a mutex deadlock inside tv detect. Daniel does that make sense?
>
> Botch locking rework for atomic. Fix is
>
> https://patchwork.kernel.org/patch/5861631/
>
> and will land as soon as an affected user has provided a tested-by.
> Andrey, can you pls give this a spin?

Hi,

Tested-by: Andrey Skvortsov <[email protected]>

The patch certainly fixes deadlock and Xorg is running again.


Unfortunately this is not the last bug, that breaks i915/drm working
on my laptop. Sometimes system successfully loads with couple warnings mentioned in
previous mail:

[ 26.922953] WARNING: CPU: 1 PID: 767 at drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object+0x13f/0x288 [i915]()
[ 26.922954] WARN_ON(obj->frontbuffer_bits)

and

[ 36.794045] WARNING: CPU: 0 PID: 18 at include/linux/kref.h:47 drm_framebuffer_reference+0x60/0x6b [drm]()

but pretty often kernel crashes during a boot. I caught the kernel log
over netconsole.

[ 36.519781] BUG: unable to handle kernel NULL pointer dereference at 00000000000002ec
[ 36.520752] IP: [<ffffffff8145e13a>] mutex_lock+0xe/0x29
[ 36.520752] PGD 1952fb067 PUD 193c64067 PMD 0
[ 36.520752] Oops: 0002 [#1] PREEMPT SMP
[ 36.520752] Modules linked in: cfg80211(E) bnep(E) cpufreq_stats(E) cpufreq_powersave(E) cpufreq_userspace(E) cpufreq_conservative(E) nfsd(E) auth_rpcgss(E) nfs_acl(E) lockd(E) grace(E) sunrpc(E) cdc_ether(E) usbnet(E) coretemp(E) cdc_wdm(E) cdc_acm(E) kvm_intel(E) joydev(E) kvm(E) i8k(E) i915(E) btusb(E) snd_pcsp(E) bluetooth(E) psmouse(E) evdev(E) snd_hda_codec_generic(E) rfkill(E) lpc_ich(E) mfd_core(E) i2c_i801(E) serio_raw(E) snd_hda_intel(E) drm_kms_helper(E) snd_hda_controller(E) snd_hda_codec(E) drm(E) snd_hwdep(E) snd_pcm(E) i2c_algo_bit(E) i2c_core(E) battery(E) button(E) video(E) ac(E) snd_timer(E) snd(E) soundcore(E) acpi_cpufreq(E) processor(E) fuse(E) parport_pc(E) ppdev(E) lp(E) parport(E) autofs4(E) ext4(E) crc16(E) jbd2(E) mbcache(E) dm_mod(E) sd_mod(E) ata_generic(E) ahci(E) libahci(E) firewire_ohci(E) ata_piix(E) libata(E) sdhci_pci(E) firewire_core(E) scsi_mod(E) sdhci(E) crc_itu_t(E) mmc_core(E) thermal(E) thermal_sys(E)
[ 36.520752] CPU: 1 PID: 19 Comm: kworker/1:1 Tainted: G W E 4.0.0-rc1-150225--00001-gb802a6b #10
[ 36.520752] Hardware name: Dell Inc. Vostro 1500 /0NX907, BIOS A06 04/21/2008
[ 36.520752] Workqueue: events output_poll_execute [drm_kms_helper]
[ 36.520752] task: ffff880197f69aa0 ti: ffff880197d1c000 task.ti: ffff880197d1c000
[ 36.520752] RIP: 0010:[<ffffffff8145e13a>] [<ffffffff8145e13a>] mutex_lock+0xe/0x29
[ 36.520752] RSP: 0018:ffff880197d1f838 EFLAGS: 00010246
[ 36.520752] RAX: ffff8801974c80c0 RBX: 00000000000002ec RCX: 0000000080000000
[ 36.520752] RDX: ffff88019fd00000 RSI: ffffffffa03fda6e RDI: 00000000000002ec
[ 36.520752] RBP: ffff880197d1f848 R08: 0000000000000001 R09: ffffffff81ea9154
[ 36.520752] R10: ffffffff81ea9154 R11: ffff88019fd0d300 R12: 00000000000002ec
[ 36.520752] R13: 0000000000000004 R14: ffff880197744d80 R15: ffff88019547e000
[ 36.520752] FS: 0000000000000000(0000) GS:ffff88019fd00000(0000) knlGS:0000000000000000
[ 36.520752] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 36.520752] CR2: 00000000000002ec CR3: 00000000d71b9000 CR4: 00000000000006e0
[ 36.520752] Stack:
[ 36.520752] ffff88019fd00000 ffff880197744d88 ffff880197d1f878 ffffffffa03fda98
[ 36.520752] ffffffffa03fda6e ffff880197744d88 0000000000000000 ffff880196c2e400
[ 36.520752] ffff880197d1f898 ffffffffa03fde15 ffff880197744d80 ffff8801974c80c0
[ 36.520752] Call Trace:
[ 36.520752] [<ffffffffa03fda98>] drm_framebuffer_free+0x2a/0x56 [drm]
[ 36.520752] [<ffffffffa03fda6e>] ? drm_framebuffer_unregister_private+0x43/0x43 [drm]
[ 36.520752] [<ffffffffa03fde15>] kref_sub.constprop.33+0x34/0x3e [drm]
[ 36.520752] [<ffffffffa03fe098>] drm_framebuffer_unreference+0x47/0x4b [drm]
[ 36.520752] [<ffffffffa040b86c>] drm_atomic_set_fb_for_plane+0x20/0x7f [drm]
[ 36.520752] [<ffffffffa049bcc6>] drm_plane_helper_update+0x74/0xca [drm_kms_helper]
[ 36.520752] [<ffffffffa074c88c>] __intel_set_mode+0x767/0x86c [i915]
[ 36.520752] [<ffffffffa075167e>] intel_set_mode+0x6d/0x8e [i915]
[ 36.520752] [<ffffffffa0751b2f>] intel_get_load_detect_pipe+0x3cc/0x46f [i915]
[ 36.520752] [<ffffffffa077d4c4>] intel_tv_detect+0x117/0x459 [i915]
[ 36.520752] [<ffffffff8107eea6>] ? vprintk_default+0x1d/0x1f
[ 36.520752] [<ffffffff8107ee47>] ? vprintk_emit+0x3f6/0x438
[ 36.520752] [<ffffffff8107ee57>] ? vprintk_emit+0x406/0x438
[ 36.520752] [<ffffffffa049b0d3>] drm_helper_probe_single_connector_modes_merge_bits+0xcd/0x3a1 [drm_kms_helper]
[ 36.520752] [<ffffffffa049b3cc>] drm_helper_probe_single_connector_modes+0x13/0x15 [drm_kms_helper]
[ 36.520752] [<ffffffffa04a2494>] drm_fb_helper_probe_connector_modes+0x43/0x5b [drm_kms_helper]
[ 36.520752] [<ffffffffa04a3fa4>] drm_fb_helper_hotplug_event+0x7a/0xb2 [drm_kms_helper]
[ 36.520752] [<ffffffffa075f9a4>] intel_fbdev_output_poll_changed+0x1e/0x20 [i915]
[ 36.520752] [<ffffffffa049adcd>] drm_kms_helper_hotplug_event+0x28/0x2c [drm_kms_helper]
[ 36.520752] [<ffffffffa049aefe>] output_poll_execute+0x12d/0x14e [drm_kms_helper]
[ 36.520752] [<ffffffff81057b52>] process_one_work+0x16e/0x294
[ 36.520752] [<ffffffff81057e58>] worker_thread+0x1b1/0x288
[ 36.520752] [<ffffffff81057ca7>] ? process_scheduled_works+0x2f/0x2f
[ 36.520752] [<ffffffff8105bb92>] kthread+0xa5/0xad
[ 36.520752] [<ffffffff8105baed>] ? __kthread_parkme+0x61/0x61
[ 36.520752] [<ffffffff8145fd6c>] ret_from_fork+0x7c/0xb0
[ 36.520752] [<ffffffff8105baed>] ? __kthread_parkme+0x61/0x61
[ 36.520752] Code: 05 bc c8 ba 7e 85 c0 75 05 e8 d2 b6 db ff 48 83 c4 28 5b 41 5c 41 5d 41 5e 41 5f 5d c3 66 66 66 66 90 55 48 89 e5 53 48 89 fb 52 <f0> ff 0f 79 05 e8 b1 fe ff ff 65 48 8b 04 25 00 aa 00 00 48 89
[ 36.520752] RIP [<ffffffff8145e13a>] mutex_lock+0xe/0x29
[ 36.520752] RSP <ffff880197d1f838>
[ 36.520752] CR2: 00000000000002ec
[ 36.520752] ---[ end trace df8a9d2a655f33b0 ]---


According to the backtrace this looks like a drm regression. The full
kernel log with drm.debug=6 is attached. It was taken from v4.0-rc1
with a patch mentioned above on top of that. The same is for clean v4.0-rc1.



--
Best regards,
Andrey Skvortsov

Secure e-mail with gnupg: See http://www.gnupg.org/
PGP Key ID: 0x57A3AEAD



Attachments:
(No filename) (0.00 B)
signature.asc (819.00 B)
Digital signature
Download all attachments

2015-02-28 21:10:25

by Chris Wilson

[permalink] [raw]
Subject: Re: [Intel-gfx] [Regression] WARNING: drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object

On Sat, Feb 28, 2015 at 03:20:37PM +0300, Andrey Skvortsov wrote:
> Unfortunately this is not the last bug, that breaks i915/drm working
> on my laptop. Sometimes system successfully loads with couple warnings mentioned in
> previous mail:
>
> [ 26.922953] WARNING: CPU: 1 PID: 767 at drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object+0x13f/0x288 [i915]()
> [ 26.922954] WARN_ON(obj->frontbuffer_bits)

That's inocuous, but for the serious hang, you may want to try
video=SVIDEO-1:d on the kernel commandline to workaround the hang. (Check
/sys/class/drm/card/ for the actual name of the connector for TV). I
think Ville mentioned he was looking/looked at the atomic-vs-load_detect
changes that is at the heart of the issue here. (Admittedly he did say
it was worthy of a drink or two.)
-Chris

--
Chris Wilson, Intel Open Source Technology Centre

2015-02-28 21:57:08

by Andrey Skvortsov

[permalink] [raw]
Subject: Re: [Intel-gfx] [Regression] BUG: unable to handle kernel NULL pointer dereference

On 28 Feb, Chris Wilson wrote:
> On Sat, Feb 28, 2015 at 03:20:37PM +0300, Andrey Skvortsov wrote:
> > Unfortunately this is not the last bug, that breaks i915/drm working
> > on my laptop. Sometimes system successfully loads with couple warnings mentioned in
> > previous mail:
> >
> > [ 26.922953] WARNING: CPU: 1 PID: 767 at drivers/gpu/drm/i915/i915_gem.c:4525 i915_gem_free_object+0x13f/0x288 [i915]()
> > [ 26.922954] WARN_ON(obj->frontbuffer_bits)
>
> That's inocuous, but for the serious hang, you may want to try
> video=SVIDEO-1:d on the kernel commandline to workaround the hang. (Check
> /sys/class/drm/card/ for the actual name of the connector for TV). I
> think Ville mentioned he was looking/looked at the atomic-vs-load_detect
> changes that is at the heart of the issue here. (Admittedly he did say
> it was worthy of a drink or two.)
> -Chris

Thank you for the help. I've tried to add to the kernel command line
'video=card0-SVIDEO-1:d video=card0-VGA-1:d'. But despite of that
kernel crashes with 'BUG: unable to handle kernel NULL pointer
dereference at' occasionally.

--
Best regards,
Andrey Skvortsov




Attachments:
(No filename) (1.11 kB)
signature.asc (819.00 B)
Digital signature
Download all attachments