2011-06-16 16:26:26

by Leandro Lucarella

[permalink] [raw]
Subject: Graphic corruption with i915 (2.6.39.1)

Hi, I've been experiencing sporadic graphic corruption since a few
stable kernel versions and I always thought "maybe is fixed in the next
release", so I upgrade, but I usually experience a different problem
then. This problems are exposed very rarely, maybe I have no problems
like for a month or so.

Now I'm using kernel 2.6.39.1 and I experience one of this problem so
I though it was time to report it. This is what I got in dmesg:

[368693.756008] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[368693.756014] render error detected, EIR: 0x00000010
[368693.756017] IPEIR: 0x00000000
[368693.756019] IPEHR: 0x02000004
[368693.756020] INSTDONE: 0xcc8fffff
[368693.756022] INSTPS: 0x8001e025
[368693.756024] INSTDONE1: 0xbfffffbf
[368693.756026] ACTHD: 0x00715128
[368693.756027] page table error
[368693.756029] PGTBL_ER: 0x00000002
[368693.756038] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 17700193 at 17700192, next 17700196)
[368694.258010] [drm:i915_reset] *ERROR* Failed to reset chip.

Here are 2 images of my desktop, without the graphic corruption first
and with the corruption then: http://imgur.com/a/NX7te

# lspci
00:00.0 Host bridge: Intel Corporation 4 Series Chipset DRAM Controller (rev 03)
00:01.0 PCI bridge: Intel Corporation 4 Series Chipset PCI Express Root Port (rev 03)
00:02.0 VGA compatible controller: Intel Corporation 4 Series Chipset Integrated Graphics Controller (rev 03)
00:1b.0 Audio device: Intel Corporation N10/ICH 7 Family High Definition Audio Controller (rev 01)
00:1c.0 PCI bridge: Intel Corporation N10/ICH 7 Family PCI Express Port 1 (rev 01)
00:1c.1 PCI bridge: Intel Corporation N10/ICH 7 Family PCI Express Port 2 (rev 01)
00:1d.0 USB Controller: Intel Corporation N10/ICH 7 Family USB UHCI Controller #1 (rev 01)
00:1d.1 USB Controller: Intel Corporation N10/ICH 7 Family USB UHCI Controller #2 (rev 01)
00:1d.2 USB Controller: Intel Corporation N10/ICH 7 Family USB UHCI Controller #3 (rev 01)
00:1d.3 USB Controller: Intel Corporation N10/ICH 7 Family USB UHCI Controller #4 (rev 01)
00:1d.7 USB Controller: Intel Corporation N10/ICH 7 Family USB2 EHCI Controller (rev 01)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev e1)
00:1f.0 ISA bridge: Intel Corporation 82801GB/GR (ICH7 Family) LPC Interface Bridge (rev 01)
00:1f.1 IDE interface: Intel Corporation 82801G (ICH7 Family) IDE Controller (rev 01)
00:1f.2 IDE interface: Intel Corporation N10/ICH7 Family SATA IDE Controller (rev 01)
00:1f.3 SMBus: Intel Corporation N10/ICH 7 Family SMBus Controller (rev 01)
03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 03)
04:00.0 Multimedia video controller: Brooktree Corporation Bt848 Video Capture (rev 12)

Any ideas on what might be going on and how to fix it are welcome. Please let
me know if you need any other information.

Thanks in advance.

--
Leandro Lucarella (AKA luca) http://llucax.com.ar/
----------------------------------------------------------------------
GPG Key: 5F5A8D05 (F8CD F9A7 BF00 5431 4145 104C 949E BFB6 5F5A 8D05)
----------------------------------------------------------------------
SEÑOR BIELSA: CON TODO RESPETO ¿USTED LO VE JUGAR A RIQUELME?
-- Crónica TV


2011-06-16 16:19:00

by Leandro Lucarella

[permalink] [raw]
Subject: Re: Graphic corruption with i915 (2.6.39.1)

Chris Wilson, el 16 de junio a las 17:01 me escribiste:
> On Thu, 16 Jun 2011 11:58:42 -0300, Leandro Lucarella <[email protected]> wrote:
> > Hi, I've been experiencing sporadic graphic corruption since a few
> > stable kernel versions and I always thought "maybe is fixed in the next
> > release", so I upgrade, but I usually experience a different problem
> > then. This problems are exposed very rarely, maybe I have no problems
> > like for a month or so.
> >
> > Now I'm using kernel 2.6.39.1 and I experience one of this problem so
> > I though it was time to report it. This is what I got in dmesg:
> >
> > [368693.756008] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
> > [368693.756014] render error detected, EIR: 0x00000010
> > [368693.756017] IPEIR: 0x00000000
> > [368693.756019] IPEHR: 0x02000004
> > [368693.756020] INSTDONE: 0xcc8fffff
> > [368693.756022] INSTPS: 0x8001e025
> > [368693.756024] INSTDONE1: 0xbfffffbf
> > [368693.756026] ACTHD: 0x00715128
> > [368693.756027] page table error
> > [368693.756029] PGTBL_ER: 0x00000002
> > [368693.756038] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 17700193 at 17700192, next 17700196)
> > [368694.258010] [drm:i915_reset] *ERROR* Failed to reset chip.
>
> If you look in /sys/kernel/debug/dri/0/i915_error_state we can see what
> garbage the DDX [xf86-video-intel] fed the gpu.

cannot access /sys/kernel/debug/dri/0/i915_error_state: No such file or directory

The debug directory is not present, should I boot or compile with some
particular option to enable it?

--
Leandro Lucarella (AKA luca) http://llucax.com.ar/
----------------------------------------------------------------------
GPG Key: 5F5A8D05 (F8CD F9A7 BF00 5431 4145 104C 949E BFB6 5F5A 8D05)
----------------------------------------------------------------------
Refalar: acto de mover el peso de la masa hacia un lugar equivocado pero
concreto. Todo refalo es, por cierto, una sucesión de pequeñísimos
movimientos a los que un centímetro es la proporción aumentada de miles
de porciones de espacio, que, al estar el piso mojado, refala.
-- Ricardo Vaporeso

2011-06-16 16:39:51

by Chris Wilson

[permalink] [raw]
Subject: Re: Graphic corruption with i915 (2.6.39.1)

On Thu, 16 Jun 2011 13:18:03 -0300, Leandro Lucarella <[email protected]> wrote:
> cannot access /sys/kernel/debug/dri/0/i915_error_state: No such file or directory
>
> The debug directory is not present, should I boot or compile with some
> particular option to enable it?

You need to compile with CONFIG_DEBUGFS and at runtime
"mount -tdebugfs debug /sys/kernel/debug".

The /sys/kernel/debug/dri/0/i915_error_state will only be populated
after a hang is detected.
-Chris

--
Chris Wilson, Intel Open Source Technology Centre

2011-06-16 16:58:32

by Leandro Lucarella

[permalink] [raw]
Subject: Re: Graphic corruption with i915 (2.6.39.1)

Chris Wilson, el 16 de junio a las 17:39 me escribiste:
> On Thu, 16 Jun 2011 13:18:03 -0300, Leandro Lucarella <[email protected]> wrote:
> > cannot access /sys/kernel/debug/dri/0/i915_error_state: No such file or directory
> >
> > The debug directory is not present, should I boot or compile with some
> > particular option to enable it?
>
> You need to compile with CONFIG_DEBUGFS and at runtime
> "mount -tdebugfs debug /sys/kernel/debug".
>
> The /sys/kernel/debug/dri/0/i915_error_state will only be populated
> after a hang is detected.

OK, I had CONFIG_DEBUGFS but never mounted it. Unfortunately I rebooted
since the problem, so now that file says: no error state collected

I'll get the contents when it happens again.

--
Leandro Lucarella (AKA luca) http://llucax.com.ar/
----------------------------------------------------------------------
GPG Key: 5F5A8D05 (F8CD F9A7 BF00 5431 4145 104C 949E BFB6 5F5A 8D05)
----------------------------------------------------------------------
The world's best known word is "okay"
The second most well-known word is "Coca-Cola"

2011-06-16 18:24:45

by Raúl Benencia

[permalink] [raw]
Subject: Re: Graphic corruption with i915 (2.6.39.1)

2011/6/16 Leandro Lucarella <[email protected]>
>
> Chris Wilson, el 16 de junio a las 17:39 me escribiste:
> > On Thu, 16 Jun 2011 13:18:03 -0300, Leandro Lucarella <[email protected]> wrote:
> > > cannot access /sys/kernel/debug/dri/0/i915_error_state: No such file or directory
> > >
> > > The debug directory is not present, should I boot or compile with some
> > > particular option to enable it?
> >
> > You need to compile with CONFIG_DEBUGFS and at runtime
> > "mount -tdebugfs debug /sys/kernel/debug".
> >
> > The /sys/kernel/debug/dri/0/i915_error_state will only be populated
> > after a hang is detected.
>
> OK, I had CONFIG_DEBUGFS but never mounted it. Unfortunately I rebooted
> since the problem, so now that file says: no error state collected
>
> I'll get the contents when it happens again.
>
> --
> Leandro Lucarella (AKA luca) ? ? ? ? ? ? ? ? ? ? http://llucax.com.ar/
> ----------------------------------------------------------------------
> GPG Key: 5F5A8D05 (F8CD F9A7 BF00 5431 4145 ?104C 949E BFB6 5F5A 8D05)
> ----------------------------------------------------------------------
> The world's best known word is "okay"
> The second most well-known word is "Coca-Cola"

I'm having exactly the same problem. It happens pretty randomly, but
I've noticed that the frequency in which it occurs is highly increased
when libflashplayer.so is used. Unfortunately I didn't compile my
kernel with CONFIG_DEBUGFS. I'll change this and if I come across
anything interesting, I'll let you know.

Cheers,
--
Ra?l Benencia