LinuxLists.cc - ESP corruption bug - what CPUs are affected?

2004-09-16 17:54:18

Subject: ESP corruption bug - what CPUs are affected?

Hello.

My question is not directly linux-related,
but I think people here can give me some clue
on an issue.

There is a "semi-official" bug in Intel CPUs,
which is described here:
http://www.intel.com/design/intarch/specupdt/27287402.PDF
chapter "Specification Clarifications"
section 4: "Use Of ESP In 16-Bit Code With 32-Bit Interrupt Handlers".

A short quote:
"ISSUE: When a 32-bit IRET is used to return to another privilege level,
and the old level uses a 4G stack (D/B bit in the segment register = 1),
while the new level uses a 64k stack (D/B bit = 0), then only the
lower word of ESP is updated."

Which means that the higher word of ESP gets
trashed. This bug beats dosemu rather badly,
but there seem to be not much info about that
bug available on the net.
What I want to find out, is what CPUs are
affected. I wrote a small test-case. It is
attached. I tried it on Intel Pentium and
on AMD Athlon - bug is present on both. But
I'd like to know if it is also present on a
Transmeta Crusoe, Cyrix and other CPUs.
Can someone please run the test-case on some
of that CPUs and see how it goes?
Note: specifying "32" as an arg to the
test-case, will make it to use the 32bit
segment for the stack, and it will not detect
the bug. It is there to prove that it detects
the Right Thing. Run it without that argument.

I'd also like to know any thoughts on whether
it is possible to work around the bug, probably
in a kernel? Well, I am not hoping on such a
possibility, but who knows...
Anyway, I'd be glad to get any info on that bug.
Why it was not fixed for so many years, looks
also like an interesting question, as for me.

Attachments:

stk.c (2.57 kB)

2004-09-16 18:41:39

by Petr Vandrovec

[permalink] [raw]

Subject: Re: ESP corruption bug - what CPUs are affected?

On 16 Sep 04 at 21:49, Stas Sergeev wrote:
>
> There is a "semi-official" bug in Intel CPUs,
> which is described here:
> http://www.intel.com/design/intarch/specupdt/27287402.PDF
> chapter "Specification Clarifications"
> section 4: "Use Of ESP In 16-Bit Code With 32-Bit Interrupt Handlers".

Not a bug, but a feature.

> What I want to find out, is what CPUs are
> affected. I wrote a small test-case. It is
> attached. I tried it on Intel Pentium and
> on AMD Athlon - bug is present on both. But
> I'd like to know if it is also present on a
> Transmeta Crusoe, Cyrix and other CPUs.

AFAIK all. There are products which depend on this behavior.

> I'd also like to know any thoughts on whether
> it is possible to work around the bug, probably
> in a kernel? Well, I am not hoping on such a
> possibility, but who knows...

IMHO you have to switch to 16bit stack, load upper bits of ESP
with target value, and then execute IRET, while 16bit SS:SP points
to same place where flat ESP pointed.

This way IRET, as stack is 16bit, uses only SP instead of ESP,
and so you can preload upper bits of ESP before executing IRET.

And I do not think that linux NMI handler survives 16bit stack, so
natural solution seems to be to create complete 16bit CPL1
environment, return to it, load ESP as you want, and then do IRET
to return to CPL2 or CPL3. Fortunately V8086 mode is not affected,
so there should be no problem with using CPL1 for this middle step.
But of course it is not something you want to do on each return
from interrupt handler... Well, or maybe you want...

> Anyway, I'd be glad to get any info on that bug.
> Why it was not fixed for so many years, looks
> also like an interesting question, as for me.

It is part of architecture now...
Petr Vandrovec

2004-09-16 19:03:32

by Denis Vlasenko

[permalink] [raw]

Subject: Re: ESP corruption bug - what CPUs are affected?

On Thursday 16 September 2004 20:49, Stas Sergeev wrote:
> Hello.
>
> My question is not directly linux-related,
> but I think people here can give me some clue
> on an issue.
>
> There is a "semi-official" bug in Intel CPUs,
> which is described here:
> http://www.intel.com/design/intarch/specupdt/27287402.PDF
> chapter "Specification Clarifications"
> section 4: "Use Of ESP In 16-Bit Code With 32-Bit Interrupt Handlers".
>
> A short quote:
> "ISSUE: When a 32-bit IRET is used to return to another privilege level,
> and the old level uses a 4G stack (D/B bit in the segment register = 1),
> while the new level uses a 64k stack (D/B bit = 0), then only the
> lower word of ESP is updated."
>
> Which means that the higher word of ESP gets
> trashed. This bug beats dosemu rather badly,
> but there seem to be not much info about that
> bug available on the net.

Well. Strictly speaking it's not a bug.
Code executes as it should. IRET returns to the correct
location. Upper 16 bits of ESP do not affect execution
in this case.

You want CPU to preserve upper bits,
but this will waste a handful of transistors
practically for no gain.

I think dosemu should just work around this.
Is there some subtle problem with that in dosemu?

> What I want to find out, is what CPUs are
> affected. I wrote a small test-case. It is
> attached. I tried it on Intel Pentium and
> on AMD Athlon - bug is present on both. But
> I'd like to know if it is also present on a
> Transmeta Crusoe, Cyrix and other CPUs.
> Can someone please run the test-case on some
> of that CPUs and see how it goes?
> Note: specifying "32" as an arg to the
> test-case, will make it to use the 32bit
> segment for the stack, and it will not detect
> the bug. It is there to prove that it detects
> the Right Thing. Run it without that argument.
>
> I'd also like to know any thoughts on whether
> it is possible to work around the bug, probably
> in a kernel? Well, I am not hoping on such a
> possibility, but who knows...
> Anyway, I'd be glad to get any info on that bug.
> Why it was not fixed for so many years, looks
> also like an interesting question, as for me.

Because it does not matter for 99.9999% of users...

# gcc -O2 stk.c
# ./a.out
sp=0xbffffb30, ss=0x7b
In sighandler: esp=bffffb10
Now sp=0xccf1fb10, ss=0x7f
Esp changed, strange...
# cat /proc/cpuinfo
processor : 0
vendor_id : AuthenticAMD
cpu family : 6
model : 8
model name : Unknown CPU Typ
stepping : 1
cpu MHz : 1743.632
cache size : 64 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse syscall mmxext 3dnowext 3dnow
--
vda

2004-09-17 18:11:37

Subject: ESP corruption bug - what CPUs are affected?

Attachments:

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Attachments:

Subject: Re: ESP corruption bug - what CPUs are affected?

Attachments:

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Attachments:

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Attachments:

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Attachments:

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected?

Subject: Re: ESP corruption bug - what CPUs are affected? (patch attempts)

Attachments: