2004-01-05 23:00:30

by Matthew Dobson

[permalink] [raw]
Subject: Re: [TRIVIAL PATCH] Use valid node number when unmapping CPUs

Nick Piggin wrote:
>
>
> Matthew Dobson wrote:
>
>> The cpu_2_node array for i386 is initialized to 0 for each CPU,
>> effectively mapping all CPUs to node 0 unless changed. When we unmap
>> CPUs, however, we stick a -1 in the array, mapping the CPU to an
>> invalid node. This really isn't helpful. We should map the CPU to
>> node 0, to make sure that callers of cpu_to_node() and friends aren't
>> returned a bogus node number. This trivial patch changes the
>> unmapping code to place a 0 in the node mapping for removed CPUs.
>>
>> Cheers!
>
>
>
> I'd prefer it got initialised to -1 for each cpu, and either set to -1
> or not touched during unmap.
>
>
> 0 is more bogus than the alternatives, isn't it? At least for the subset
> of CPUs not on node 0. Callers should be fixed.

Not really... These macros are usually used for things like scheduling,
memory placement and other decisions. Right now the value doesn't have
to be error-checked, because it is assumed to always return a valid
node. For these types of uses, it's far better to schedule/allocate
something on the wrong node (ie: node 0) than on an invalid node (ie:
node -1). Having a possible negative value for this will break things
when used as an array index (as it often is), and will force us to put
tests to ensure it is a valid value before using it, and introduce
possible races in the future (ie: imagine testing if CPU 17's node
mapping is non-negative, simultaneously unmapping the CPU, then using
the macro again to make a node decision. You may get a negative value
back, thus causing you to index way off the end of your array... BOOM).
If we stick with the convention that we always have a valid (even if
not *correct*) value in these arrays, the worst we should get is poor
performance, not breakage.

Cheers!

-Matt


2004-01-06 00:46:01

by Nick Piggin

[permalink] [raw]
Subject: Re: [TRIVIAL PATCH] Use valid node number when unmapping CPUs



Matthew Dobson wrote:

> Nick Piggin wrote:
>
>>
>>
>> Matthew Dobson wrote:
>>
>>> The cpu_2_node array for i386 is initialized to 0 for each CPU,
>>> effectively mapping all CPUs to node 0 unless changed. When we
>>> unmap CPUs, however, we stick a -1 in the array, mapping the CPU to
>>> an invalid node. This really isn't helpful. We should map the CPU
>>> to node 0, to make sure that callers of cpu_to_node() and friends
>>> aren't returned a bogus node number. This trivial patch changes the
>>> unmapping code to place a 0 in the node mapping for removed CPUs.
>>>
>>> Cheers!
>>
>>
>>
>>
>> I'd prefer it got initialised to -1 for each cpu, and either set to -1
>> or not touched during unmap.
>
> >
>
>>
>> 0 is more bogus than the alternatives, isn't it? At least for the subset
>> of CPUs not on node 0. Callers should be fixed.
>
>
> Not really... These macros are usually used for things like
> scheduling, memory placement and other decisions. Right now the value
> doesn't have to be error-checked, because it is assumed to always
> return a valid node. For these types of uses, it's far better to
> schedule/allocate something on the wrong node (ie: node 0) than on an
> invalid node (ie: node -1). Having a possible negative value for this
> will break things when used as an array index (as it often is), and
> will force us to put tests to ensure it is a valid value before using
> it, and introduce possible races in the future (ie: imagine testing if
> CPU 17's node mapping is non-negative, simultaneously unmapping the
> CPU, then using the macro again to make a node decision. You may get
> a negative value back, thus causing you to index way off the end of
> your array... BOOM). If we stick with the convention that we always
> have a valid (even if not *correct*) value in these arrays, the worst
> we should get is poor performance, not breakage.


OK, then keep the correct node number. No need to change it to node 0.
Having the value not change at all (1) gives you the correct information
at all times, and (2) eliminates the remaining race possibilities.


2004-03-29 15:46:05

by Nick Piggin

[permalink] [raw]
Subject: Re: [TRIVIAL PATCH] Use valid node number when unmapping CPUs



Matthew Dobson wrote:

> Nick Piggin wrote:
>
>>
>>
>> Matthew Dobson wrote:
>>
>>> The cpu_2_node array for i386 is initialized to 0 for each CPU,
>>> effectively mapping all CPUs to node 0 unless changed. When we
>>> unmap CPUs, however, we stick a -1 in the array, mapping the CPU to
>>> an invalid node. This really isn't helpful. We should map the CPU
>>> to node 0, to make sure that callers of cpu_to_node() and friends
>>> aren't returned a bogus node number. This trivial patch changes the
>>> unmapping code to place a 0 in the node mapping for removed CPUs.
>>>
>>> Cheers!
>>
>>
>>
>>
>> I'd prefer it got initialised to -1 for each cpu, and either set to -1
>> or not touched during unmap.
>
> >
>
>>
>> 0 is more bogus than the alternatives, isn't it? At least for the subset
>> of CPUs not on node 0. Callers should be fixed.
>
>
> Not really... These macros are usually used for things like
> scheduling, memory placement and other decisions. Right now the value
> doesn't have to be error-checked, because it is assumed to always
> return a valid node. For these types of uses, it's far better to
> schedule/allocate something on the wrong node (ie: node 0) than on an
> invalid node (ie: node -1). Having a possible negative value for this
> will break things when used as an array index (as it often is), and
> will force us to put tests to ensure it is a valid value before using
> it, and introduce possible races in the future (ie: imagine testing if
> CPU 17's node mapping is non-negative, simultaneously unmapping the
> CPU, then using the macro again to make a node decision. You may get
> a negative value back, thus causing you to index way off the end of
> your array... BOOM). If we stick with the convention that we always
> have a valid (even if not *correct*) value in these arrays, the worst
> we should get is poor performance, not breakage.


OK, then keep the correct node number. No need to change it to node 0.
Having the value not change at all (1) gives you the correct information
at all times, and (2) eliminates the remaining race possibilities.