2023-03-11 09:18:03

by Li, Xin3

[permalink] [raw]
Subject: [PATCH 1/1] x86/vdso: use the right GDT_ENTRY_CPUNODE for 32-bit getcpu() on 64-bit kernel

The vDSO getcpu() reads CPU ID from the GDT_ENTRY_CPUNODE entry when the RDPID
instruction is not available. And GDT_ENTRY_CPUNODE is defined as 28 on 32-bit
Linux kernel and 15 on 64-bit. But the 32-bit getcpu() on 64-bit Linux kernel
is compiled with 32-bit Linux kernel GDT_ENTRY_CPUNODE, i.e., 28, beyond the
64-bit Linux kernel GDT limit. Thus, it just fails _silently_.

Compile the 32-bit getcpu() on 64-bit Linux kernel with the 64-bit Linux kernel
GDT_ENTRY_CPUNODE.

Fixes: 877cff5296faa6e ("x86/vdso: Fake 32bit VDSO build on 64bit compile for vgetcpu")
Reported-by: kernel test robot <[email protected]>
https://lore.kernel.org/oe-lkp/[email protected]/
Reported-by: Shan Kang <[email protected]>
Signed-off-by: Xin Li <[email protected]>
---
arch/x86/include/asm/segment.h | 4 ++++
1 file changed, 4 insertions(+)

diff --git a/arch/x86/include/asm/segment.h b/arch/x86/include/asm/segment.h
index 794f69625780..d3b4f8797054 100644
--- a/arch/x86/include/asm/segment.h
+++ b/arch/x86/include/asm/segment.h
@@ -119,7 +119,11 @@

#define GDT_ENTRY_ESPFIX_SS 26
#define GDT_ENTRY_PERCPU 27
+#ifndef BUILD_VDSO32_64
#define GDT_ENTRY_CPUNODE 28
+#else
+#define GDT_ENTRY_CPUNODE 15
+#endif

#define GDT_ENTRY_DOUBLEFAULT_TSS 31

--
2.34.1



2023-03-13 15:22:50

by Dave Hansen

[permalink] [raw]
Subject: Re: [PATCH 1/1] x86/vdso: use the right GDT_ENTRY_CPUNODE for 32-bit getcpu() on 64-bit kernel

On 3/11/23 00:48, Xin Li wrote:
> #define GDT_ENTRY_ESPFIX_SS 26
> #define GDT_ENTRY_PERCPU 27
> +#ifndef BUILD_VDSO32_64
> #define GDT_ENTRY_CPUNODE 28
> +#else
> +#define GDT_ENTRY_CPUNODE 15
> +#endif

Isn't this kinda a hack?

First, it means that we'll now have two duplicate versions of this:

#define GDT_ENTRY_CPUNODE 15

in the same file.

Second, if any other users of fake_32bit_build.h for the VDSO show up,
they'll need a similar #ifdef.

I think I'd much rather if we define all of the GDT_ENTRY_* macros in
*one* place, then make that *one* place depend on BUILD_VDSO32_64.

Also, about the *silent* failure... Do we not have a selftest for this
somewhere?

2023-03-13 17:42:56

by Li, Xin3

[permalink] [raw]
Subject: RE: [PATCH 1/1] x86/vdso: use the right GDT_ENTRY_CPUNODE for 32-bit getcpu() on 64-bit kernel

> > +#ifndef BUILD_VDSO32_64
> > #define GDT_ENTRY_CPUNODE 28
> > +#else
> > +#define GDT_ENTRY_CPUNODE 15
> > +#endif
>
> Isn't this kinda a hack?
>
> First, it means that we'll now have two duplicate versions of this:
>
> #define GDT_ENTRY_CPUNODE 15
>
> in the same file.
>
> Second, if any other users of fake_32bit_build.h for the VDSO show up, they'll
> need a similar #ifdef.
>
> I think I'd much rather if we define all of the GDT_ENTRY_* macros in
> *one* place, then make that *one* place depend on BUILD_VDSO32_64.

Sounds a better way, let me try.

> Also, about the *silent* failure... Do we not have a selftest for this somewhere?

When lsl is used, we should check ZF which indicates whether the segment limit
is loaded successfully. Seems we need to refactor vdso_read_cpunode() a bit.

Xin

2023-03-18 08:15:12

by Li, Xin3

[permalink] [raw]
Subject: RE: [PATCH 1/1] x86/vdso: use the right GDT_ENTRY_CPUNODE for 32-bit getcpu() on 64-bit kernel

> > I think I'd much rather if we define all of the GDT_ENTRY_* macros in
> > *one* place, then make that *one* place depend on BUILD_VDSO32_64.
>
> Sounds a better way, let me try.

Hi Dave,

I tried the following patch, and it works:

diff --git a/arch/x86/include/asm/segment.h b/arch/x86/include/asm/segment.h
index 794f69625780..9d6411c65920 100644
--- a/arch/x86/include/asm/segment.h
+++ b/arch/x86/include/asm/segment.h
@@ -56,7 +56,7 @@

#define GDT_ENTRY_INVALID_SEG 0

-#ifdef CONFIG_X86_32
+#if defined(CONFIG_X86_32) && !defined(BUILD_VDSO32_64)
/*
* The layout of the per-CPU GDT under Linux:
*


It's simpler and looks reasonable to me. Is it what you suggested?

Thanks!
Xin


>
> > Also, about the *silent* failure... Do we not have a selftest for this
> somewhere?
>
> When lsl is used, we should check ZF which indicates whether the segment limit is
> loaded successfully. Seems we need to refactor vdso_read_cpunode() a bit.
>
> Xin

2023-03-29 08:52:32

by Li, Xin3

[permalink] [raw]
Subject: RE: [PATCH 1/1] x86/vdso: use the right GDT_ENTRY_CPUNODE for 32-bit getcpu() on 64-bit kernel



> -----Original Message-----
> From: Li, Xin3 <[email protected]>
> Sent: Monday, March 13, 2023 10:43 AM
> To: Hansen, Dave <[email protected]>; [email protected];
> [email protected]
> Cc: [email protected]; [email protected]; [email protected];
> [email protected]; [email protected]; [email protected]; Liu, Yujie
> <[email protected]>; Kang, Shan <[email protected]>
> Subject: RE: [PATCH 1/1] x86/vdso: use the right GDT_ENTRY_CPUNODE for 32-bit
> getcpu() on 64-bit kernel
>
> > > +#ifndef BUILD_VDSO32_64
> > > #define GDT_ENTRY_CPUNODE 28
> > > +#else
> > > +#define GDT_ENTRY_CPUNODE 15
> > > +#endif
> >
> > Isn't this kinda a hack?
> >
> > First, it means that we'll now have two duplicate versions of this:
> >
> > #define GDT_ENTRY_CPUNODE 15
> >
> > in the same file.
> >
> > Second, if any other users of fake_32bit_build.h for the VDSO show up, they'll
> > need a similar #ifdef.
> >
> > I think I'd much rather if we define all of the GDT_ENTRY_* macros in
> > *one* place, then make that *one* place depend on BUILD_VDSO32_64.
>
> Sounds a better way, let me try.
>
> > Also, about the *silent* failure... Do we not have a selftest for this somewhere?
>
> When lsl is used, we should check ZF which indicates whether the segment limit
> is loaded successfully. Seems we need to refactor vdso_read_cpunode() a bit.

Hi Dave,

How about the following patch to address the silent failure?

diff --git a/arch/x86/include/asm/segment.h b/arch/x86/include/asm/segment.h
index 794f69625780..d75ce4afff5b 100644
--- a/arch/x86/include/asm/segment.h
+++ b/arch/x86/include/asm/segment.h
@@ -254,7 +254,10 @@ static inline void vdso_read_cpunode(unsigned *cpu, unsigned *node)
*
* If RDPID is available, use it.
*/
- alternative_io ("lsl %[seg],%[p]",
+ alternative_io ("lsl %[seg],%[p]\n"
+ "jz 1f\n"
+ "mov $-1,%[p]\n"
+ "1:",
".byte 0xf3,0x0f,0xc7,0xf8", /* RDPID %eax/rax */
X86_FEATURE_RDPID,
[p] "=a" (p), [seg] "r" (__CPUNODE_SEG));


And the test then reports CPU id 4095 (not a big enough #), which can be
used to indicate a failure of the lsl instruction:

~/selftests$ sudo ./run_kselftest.sh -t x86:test_vsyscall_32
TAP version 13
1..1
# selftests: x86: test_vsyscall_32
# [RUN] test gettimeofday()
# vDSO time offsets: 0.000028 0.000000
# [OK] vDSO gettimeofday()'s timeval was okay
# [RUN] test time()
# [OK] vDSO time() is okay
# [RUN] getcpu() on CPU 0
# [FAIL] vDSO reported CPU 4095 but should be 0
# [FAIL] vDSO reported node 65535 but should be 0
# [RUN] getcpu() on CPU 1
# [FAIL] vDSO reported CPU 4095 but should be 1
# [FAIL] vDSO reported node 65535 but should be 0
not ok 1 selftests: x86: test_vsyscall_32 # exit=1

Thanks!
Xin

2023-03-29 23:19:05

by Li, Xin3

[permalink] [raw]
Subject: RE: [PATCH 1/1] x86/vdso: use the right GDT_ENTRY_CPUNODE for 32-bit getcpu() on 64-bit kernel

> > > > +#ifndef BUILD_VDSO32_64
> > > > #define GDT_ENTRY_CPUNODE 28
> > > > +#else
> > > > +#define GDT_ENTRY_CPUNODE 15
> > > > +#endif
> > >
> > > Isn't this kinda a hack?
> > >
> > > First, it means that we'll now have two duplicate versions of this:
> > >
> > > #define GDT_ENTRY_CPUNODE 15
> > >
> > > in the same file.
> > >
> > > Second, if any other users of fake_32bit_build.h for the VDSO show
> > > up, they'll need a similar #ifdef.
> > >
> > > I think I'd much rather if we define all of the GDT_ENTRY_* macros
> > > in
> > > *one* place, then make that *one* place depend on BUILD_VDSO32_64.
> >
> > Sounds a better way, let me try.
> >
> > > Also, about the *silent* failure... Do we not have a selftest for this somewhere?
> >
> > When lsl is used, we should check ZF which indicates whether the
> > segment limit is loaded successfully. Seems we need to refactor
> vdso_read_cpunode() a bit.
>
> Hi Dave,
>
> How about the following patch to address the silent failure?
>
> diff --git a/arch/x86/include/asm/segment.h b/arch/x86/include/asm/segment.h
> index 794f69625780..d75ce4afff5b 100644
> --- a/arch/x86/include/asm/segment.h
> +++ b/arch/x86/include/asm/segment.h
> @@ -254,7 +254,10 @@ static inline void vdso_read_cpunode(unsigned *cpu,
> unsigned *node)
> *
> * If RDPID is available, use it.
> */
> - alternative_io ("lsl %[seg],%[p]",
> + alternative_io ("lsl %[seg],%[p]\n"
> + "jz 1f\n"
> + "mov $-1,%[p]\n"
> + "1:",
> ".byte 0xf3,0x0f,0xc7,0xf8", /* RDPID %eax/rax */
> X86_FEATURE_RDPID,
> [p] "=a" (p), [seg] "r" (__CPUNODE_SEG));
>
>
> And the test then reports CPU id 4095 (not a big enough #), which can be used to
> indicate a failure of the lsl instruction:

Having to say, it's a *bad* idea to use a special # to indicate an error.
But there is also no reasonable errno for getcpu() to return to its caller,
saying "we had a problem in the syscall's kernel implementation".

>
> ~/selftests$ sudo ./run_kselftest.sh -t x86:test_vsyscall_32 TAP version 13
> 1..1
> # selftests: x86: test_vsyscall_32
> # [RUN] test gettimeofday()
> # vDSO time offsets: 0.000028 0.000000
> # [OK] vDSO gettimeofday()'s timeval was okay # [RUN] test time() # [OK] vDSO
> time() is okay # [RUN] getcpu() on CPU 0
> # [FAIL] vDSO reported CPU 4095 but should be 0
> # [FAIL] vDSO reported node 65535 but should be 0
> # [RUN] getcpu() on CPU 1
> # [FAIL] vDSO reported CPU 4095 but should be 1
> # [FAIL] vDSO reported node 65535 but should be 0
> not ok 1 selftests: x86: test_vsyscall_32 # exit=1
>
> Thanks!
> Xin

2023-03-30 00:12:00

by H. Peter Anvin

[permalink] [raw]
Subject: RE: [PATCH 1/1] x86/vdso: use the right GDT_ENTRY_CPUNODE for 32-bit getcpu() on 64-bit kernel

On March 29, 2023 4:11:09 PM PDT, "Li, Xin3" <[email protected]> wrote:
>> > > > +#ifndef BUILD_VDSO32_64
>> > > > #define GDT_ENTRY_CPUNODE 28
>> > > > +#else
>> > > > +#define GDT_ENTRY_CPUNODE 15
>> > > > +#endif
>> > >
>> > > Isn't this kinda a hack?
>> > >
>> > > First, it means that we'll now have two duplicate versions of this:
>> > >
>> > > #define GDT_ENTRY_CPUNODE 15
>> > >
>> > > in the same file.
>> > >
>> > > Second, if any other users of fake_32bit_build.h for the VDSO show
>> > > up, they'll need a similar #ifdef.
>> > >
>> > > I think I'd much rather if we define all of the GDT_ENTRY_* macros
>> > > in
>> > > *one* place, then make that *one* place depend on BUILD_VDSO32_64.
>> >
>> > Sounds a better way, let me try.
>> >
>> > > Also, about the *silent* failure... Do we not have a selftest for this somewhere?
>> >
>> > When lsl is used, we should check ZF which indicates whether the
>> > segment limit is loaded successfully. Seems we need to refactor
>> vdso_read_cpunode() a bit.
>>
>> Hi Dave,
>>
>> How about the following patch to address the silent failure?
>>
>> diff --git a/arch/x86/include/asm/segment.h b/arch/x86/include/asm/segment.h
>> index 794f69625780..d75ce4afff5b 100644
>> --- a/arch/x86/include/asm/segment.h
>> +++ b/arch/x86/include/asm/segment.h
>> @@ -254,7 +254,10 @@ static inline void vdso_read_cpunode(unsigned *cpu,
>> unsigned *node)
>> *
>> * If RDPID is available, use it.
>> */
>> - alternative_io ("lsl %[seg],%[p]",
>> + alternative_io ("lsl %[seg],%[p]\n"
>> + "jz 1f\n"
>> + "mov $-1,%[p]\n"
>> + "1:",
>> ".byte 0xf3,0x0f,0xc7,0xf8", /* RDPID %eax/rax */
>> X86_FEATURE_RDPID,
>> [p] "=a" (p), [seg] "r" (__CPUNODE_SEG));
>>
>>
>> And the test then reports CPU id 4095 (not a big enough #), which can be used to
>> indicate a failure of the lsl instruction:
>
>Having to say, it's a *bad* idea to use a special # to indicate an error.
>But there is also no reasonable errno for getcpu() to return to its caller,
>saying "we had a problem in the syscall's kernel implementation".
>
>>
>> ~/selftests$ sudo ./run_kselftest.sh -t x86:test_vsyscall_32 TAP version 13
>> 1..1
>> # selftests: x86: test_vsyscall_32
>> # [RUN] test gettimeofday()
>> # vDSO time offsets: 0.000028 0.000000
>> # [OK] vDSO gettimeofday()'s timeval was okay # [RUN] test time() # [OK] vDSO
>> time() is okay # [RUN] getcpu() on CPU 0
>> # [FAIL] vDSO reported CPU 4095 but should be 0
>> # [FAIL] vDSO reported node 65535 but should be 0
>> # [RUN] getcpu() on CPU 1
>> # [FAIL] vDSO reported CPU 4095 but should be 1
>> # [FAIL] vDSO reported node 65535 but should be 0
>> not ok 1 selftests: x86: test_vsyscall_32 # exit=1
>>
>> Thanks!
>> Xin
>

I don't think we should put a test in the vdso implementation. We need a self test, to be sure, and we should check that LSL works standalone (because of Oracle et al), but putting an assert like this in the vdso is like of odd at best.

If we *do* put in a test, it should trap to the kernel, not return an errno, because it is the equivalent of putting a BUG() in the kernel. In this case we can even do better, because we can execute the getcpu system call at that point.

But... why? We generally trust the kernel implementation.