LinuxLists.cc - [PATCH] move task_struct allocation to arch

Subject: Re: [PATCH] move task_struct allocation to arch

David Howells wrote:
>
> Hi Linus,
>
> This patch moves task_struct allocation from kernel/fork.c into
> arch/*/kernel/process.c.
>
> David Mosberger wrote:
>
> > David.H> What might be worth doing is to move the task_struct slab
> > David.H> cache and (de-)allocator out of fork.c and to stick it in
> > David.H> the arch somewhere. Then archs aren't bound to have the two
> > David.H> separate. So for a system that can handle lots of memory,
> > David.H> you can allocate the thread_info, task_struct and
> > David.H> supervisor stack all on one very large chunk if you so
> > David.H> wish.
> >
> > Could you do this? I'd prefer if task_info could be completely hidden
> > inside the x86/sparc arch-specific code, but if things are set up such
> > that we at least have the option to keep the stack, task_info, and
> > task_struct in a single chunk of memory (and without pointers between
> > them), I'd have much less of an issue with it.

Is this the first in a multi-step patch series, or something like that?

You just duplicated code in a generic location and pasted it into the
arch. Where's the gain in that? I do see the gain in letting the arch
allocate the task struct, but surely your patch should provide a generic
mechanism for an arch to call by default, instead of duplicating code??

Jeff

--
Jeff Garzik | "I went through my candy like hot oatmeal
Building 1024 | through an internally-buttered weasel."
MandrakeSoft | - goats.com

2002-02-14 16:33:33

Subject: Re: [PATCH] move task_struct allocation to arch

> Is this the first in a multi-step patch series, or something like that?

What makes you ask that?

> You just duplicated code in a generic location and pasted it into the
> arch. Where's the gain in that? I do see the gain in letting the arch
> allocate the task struct, but surely your patch should provide a generic
> mechanism for an arch to call by default, instead of duplicating code??

Hmmm... Is it worth going through all the fun of creating another CONFIG_xxxx
option to govern the inclusion of such code?

David

2002-02-14 16:47:16

Subject: Re: [PATCH] move task_struct allocation to arch

David Howells wrote:
> > Is this the first in a multi-step patch series, or something like that?
>
> What makes you ask that?

Because your patch just flat out duplicates code line for line into two
arches.

> > You just duplicated code in a generic location and pasted it into the
> > arch. Where's the gain in that? I do see the gain in letting the arch
> > allocate the task struct, but surely your patch should provide a generic
> > mechanism for an arch to call by default, instead of duplicating code??
>
> Hmmm... Is it worth going through all the fun of creating another CONFIG_xxxx
> option to govern the inclusion of such code?

I am wondering where you want to go with this, short term and long
term. Is the implementation of this on other arches gonna look the same
-- just line for line copy of code? With maybe ia64 as the lone
exception?

Jeff

--
Jeff Garzik | "I went through my candy like hot oatmeal
Building 1024 | through an internally-buttered weasel."
MandrakeSoft | - goats.com

2002-02-14 17:03:36

Subject: Re: [PATCH] move task_struct allocation to arch

Jeff Garzik wrote:
> David Howells wrote:
> > > Is this the first in a multi-step patch series, or something like that?
> >
> > What makes you ask that?
>
> Because your patch just flat out duplicates code line for line into two
> arches.

What I do to it next depends on the feedback I get back.

> I am wondering where you want to go with this, short term and long
> term. Is the implementation of this on other arches gonna look the same
> -- just line for line copy of code? With maybe ia64 as the lone
> exception?

Ask Linus, he asked for the task_struct/thread_info split. Various people have
complained about the two things being allocated separately (maintainers for
m68k and ia64 archs certainly, and if I remember rightly, x86_64 as well,
though I don't appear to have saved the message for that). However, DaveM
(sparc64) appears to really be in favour of it.

In any case, this is just a corollary to my main aim of getting task ornaments
included in the kernel. The userspace resumption notification was the true
prerequisite, to support which I adjusted entry.S to make sure it didn't cost
any extra clock cycles. This led to Linus requesting that everything that
entry.S needs to access be separated out into another structure (so that
entry.S never accesses task_struct).

David

2002-02-14 20:48:56

Subject: Re: [PATCH] move task_struct allocation to arch

Hi,

David Howells wrote:

> Ask Linus, he asked for the task_struct/thread_info split. Various people have
> complained about the two things being allocated separately (maintainers for
> m68k and ia64 archs certainly, and if I remember rightly, x86_64 as well,
> though I don't appear to have saved the message for that). However, DaveM
> (sparc64) appears to really be in favour of it.

Ok, so we have the following allocation possibilities.
1. allocate task_struct+thread_info+stack together.
2. allocate task_struct and thread_info+stack.
3. allocate task_struct+thread_info and stack.

1. is what we have so far and I hope Linus actually means a
task_struct/stack split. Linus, could you please clarify?
Architectures without a thread register want either the first or second
option. I don't really have an opinion on whether the first option should
be made obsolete. For architectures with a thread register the third
option makes most sense and is not really a problem, we only have to
specify the dependency between task_struct and thread_info.
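
For illustration only (not part of the original message): the three
options can be pictured as C layouts. The struct contents, STACK_SIZE and
the helper thread_info_from_sp() are assumptions made for this sketch, not
code from the patch. The helper hints at why architectures without a
thread register tend to want thread_info at the base of the stack chunk:
it can then be found by masking the stack pointer.

```c
#include <stddef.h>
#include <stdint.h>

#define STACK_SIZE 8192UL   /* assumed: size and alignment of the stack chunk */

struct task_struct { int pid; };                 /* contents elided */
struct thread_info { struct task_struct *task; unsigned long flags; };

/* Option 1: task_struct, thread_info and stack share one chunk. */
union chunk_option1 {
    struct { struct thread_info ti; struct task_struct task; } head;
    unsigned char stack[STACK_SIZE];
};

/* Option 2: thread_info and stack share a chunk; task_struct is separate. */
union chunk_option2 {
    struct thread_info ti;
    unsigned char stack[STACK_SIZE];
};

/* Option 3: task_struct and thread_info share an allocation; stack is separate. */
struct task_option3 {
    struct thread_info ti;
    struct task_struct task;
};

/* Options 1 and 2: find the chunk base by rounding the stack pointer down. */
static inline uintptr_t thread_info_from_sp(uintptr_t sp)
{
    return sp & ~(uintptr_t)(STACK_SIZE - 1);
}
```

With option 3 there is no such masking trick for the stack, which is why
a dedicated thread register is the natural way to reach the structure.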

bye, Roman

2002-02-14 23:54:30

by Richard Henderson

Subject: Re: [PATCH] move task_struct allocation to arch

On Thu, Feb 14, 2002 at 05:03:14PM +0000, David Howells wrote:
> Ask Linus, he asked for the task_struct/thread_info split. Various people have
> complained about the two things being allocated separately (maintainers for
> m68k and ia64 archs certainly, and if I remember rightly, x86_64 as well,
> though I don't appear to have saved the message for that). However, DaveM
> (sparc64) appears to really be in favour of it.

I'm not especially in favour of it for Alpha either.

As far as I can see all it's done is added a memory load on the
fast path when a constant displacement folded into the address
would previously have sufficed.
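
For illustration only (not part of the original message): a minimal C
sketch of the two ways of reaching the task structure. The struct and
function names are hypothetical; the point is only that the split layout
has to read a pointer from memory, while the combined layout needs just a
constant added to an address the code already holds.

```c
struct task_struct { int pid; };   /* contents elided */

/* Split layout: thread_info only points at a separately allocated task_struct. */
struct thread_info_split {
    struct task_struct *task;
    unsigned long flags;
};

/* Combined layout: task_struct sits at a fixed offset inside the same chunk. */
struct thread_info_combined {
    unsigned long flags;
    struct task_struct task;
};

/* Needs a load from memory to obtain the task pointer. */
static inline struct task_struct *current_split(struct thread_info_split *ti)
{
    return ti->task;
}

/* Only adds a compile-time constant to the base address; no load. */
static inline struct task_struct *current_combined(struct thread_info_combined *ti)
{
    return &ti->task;
}
```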

r~

2002-02-15 09:57:02

Subject: Re: [PATCH] move task_struct allocation to arch

David Howells wrote:
>
> Jeff Garzik wrote:
> > David Howells wrote:
> > > > Is this the first in a multi-step patch series, or something like that?
> > >
> > > What makes you ask that?
> >
> > Because your patch just flat out duplicates code line for line into two
> > arches.
>
> What I do to it next depends on the feedback I get back.

Here's my feedback, then ;-)

Your patch would be ok IMHO if you additionally changed the arches to
include task struct inside struct thread_info, getting things back down
to a single allocation for thread_info+task_struct, with 'current' once
again being a constant offset from the beginning of thread_info. [this
might require resurrecting the old patches to move task struct
definitions out of sched.h proper]

Basically, I think you are going in the right direction, but the
implementation detail of flat out copying code was what caught my
attention, and looks pretty bad to me. Two easy ways I can think of for
providing generic code would be #include <asm-generic/foo.h>, or
enabling generic code when feature macro HAVE_ARCH_FOO is not defined.
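
For illustration only (not part of the original message): a sketch of the
second approach, a generic default that is compiled only when the
architecture has not announced its own version. The macro name
HAVE_ARCH_TASK_ALLOC is hypothetical, and calloc()/free() merely stand in
for whatever allocator the generic kernel code would use.

```c
#include <stdlib.h>

struct task_struct { int pid; };   /* contents elided */

/*
 * An architecture that wants its own allocator would define
 * HAVE_ARCH_TASK_ALLOC (hypothetical name) in its headers and provide
 * both functions itself. Everyone else gets this shared default.
 */
#ifndef HAVE_ARCH_TASK_ALLOC
static inline struct task_struct *alloc_task_struct(void)
{
    /* Stand-in for the generic allocation path; returns zeroed memory. */
    return calloc(1, sizeof(struct task_struct));
}

static inline void free_task_struct(struct task_struct *tsk)
{
    free(tsk);
}
#endif
```

Either way, the default lives in one place, so only architectures with a
genuinely different layout need to carry their own copy.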

> > I am wondering where you want to go with this, short term and long
> > term. Is the implementation of this on other arches gonna look the same
> > -- just line for line copy of code? With maybe ia64 as the lone
> > exception?
>
> Ask Linus, he asked for the task_struct/thread_info split. Various people
[...]
> any extra clock cycles. This led to Linus requesting that everything that
> entry.S needs to access be separated out into another structure (so that
> entry.S never accesses task_struct).

Seems a sane enough direction...

Jeff

--
Jeff Garzik | "I went through my candy like hot oatmeal
Building 1024 | through an internally-buttered weasel."
MandrakeSoft | - goats.com

2002-02-15 10:02:22

Subject: Re: [PATCH] move task_struct allocation to arch

Jeff Garzik wrote:
> Your patch would be ok IMHO if you additionally changed the arches to
> include task struct inside struct thread_info, getting things back down
> to a single allocation for thread_info+task_struct, with 'current' once
> again being a constant offset from the beginning of thread_info. [this
> might require resurrecting the old patches to move task struct
> definitions out of sched.h proper]

Actually, you don't really need the definition of task_struct, if
include dependencies get really hairy... you really only need the size
of the task struct.

Jeff

--
Jeff Garzik | "I went through my candy like hot oatmeal
Building 1024 | through an internally-buttered weasel."
MandrakeSoft | - goats.com

2002-02-15 11:23:03

Subject: Re: [PATCH] move task_struct allocation to arch

Hi,

On Fri, 15 Feb 2002, Jeff Garzik wrote:

> > any extra clock cycles. This led to Linus requesting that everything that
> > entry.S needs to access be separated out into another structure (so that
> > entry.S never accesses task_struct).
>
> Seems a sane enough direction...

That wouldn't be a problem, if ia32 added the needed infrastructure to
calculate the structure offsets. It's not really that difficult, look at
the m68k implementation, which even gets dependencies right. AFAIK Keith
also added the needed infrastructure to his new build system.
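
For illustration only (not part of the original message): one common way
to make structure offsets visible to assembly is to have a small C helper
compute them with offsetof() and write them out as #define lines that an
assembler source can include. This is a generic sketch, not the m68k
implementation; the struct fields, the TI_* names and
format_offset_define() are assumptions.

```c
#include <stddef.h>
#include <stdio.h>

struct task_struct;   /* opaque here; only a pointer to it is stored */

struct thread_info {
    struct task_struct *task;
    unsigned long flags;
};

/*
 * Write one "#define NAME value" line into buf.
 * Returns the number of characters snprintf() would have written.
 */
static int format_offset_define(char *buf, size_t len,
                                const char *name, size_t value)
{
    return snprintf(buf, len, "#define %s %zu\n", name, value);
}
```

A build step would call this once per field, for example with
offsetof(struct thread_info, flags) for a hypothetical TI_FLAGS, and
regenerate the header whenever the structure definition changes.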

bye, Roman

2002-02-15 11:38:33

Subject: Re: [PATCH] move task_struct allocation to arch

> That wouldn't be a problem, if ia32 added the needed infrastructure to
> calculate the structure offsets. It's not really that difficult, look at
> the m68k implementation, which even gets dependencies right. AFAIK Keith
> also added the needed infrastructure to his new build system.

But the offsets aren't fixed. The task structure does not lie adjacent to the
thread_info structure.

David

2002-02-15 12:21:39

Subject: Re: [PATCH] move task_struct allocation to arch

Hi,

On Fri, 15 Feb 2002, David Howells wrote:

> > That wouldn't be a problem, if ia32 added the needed infrastructure to
> > calculate the structure offsets.
>
> But the offsets aren't fixed. The task structure does not lie adjacent to the
> thread_info structure.

That's not the problem, I meant this sentence: "This led to Linus
requesting that everything that entry.S needs to access be separated out
into another structure." Splitting the task structure and the stack page
is fine. Keeping the most important fields in the stack page is fine too,
if the architecture requires it. But the decision about what goes into
thread_info should not be made only to avoid access to task_struct from
entry.S.

bye, Roman

2002-02-15 12:57:15

Subject: Re: [PATCH] move task_struct allocation to arch

Roman Zippel wrote:

> > > That wouldn't be a problem, if ia32 added the needed infrastructure to
> > > calculate the structure offsets.
> >
> > But the offsets aren't fixed. The task structure does not lie adjacent to
> > the thread_info structure.
>
> That's not the problem, I meant this sentence: "This led to Linus
> requesting that everything that entry.S needs to access be separated out
> into another structure." Splitting the task structure and the stack page
> is fine. Keeping the most important fields in the stack page is fine too,
> if the architecture requires it.

I think I see what you meant.

However, I'm not particularly keen on the M68K offset generator. I'm not sure
why exactly, but it just doesn't seem the right way to do things... OTOH,
without making the assembler able to parse C structs, I'm not sure there is a
"right" way:-/

> But the decision about what goes into thread_info should not be made only to
> avoid access to task_struct from entry.S.

How about I quote Linus (who explained what he wanted better than I did).

Firstly, in response to me having supplied a patch that made a set of four
byte-size values as the status area in the task_struct:

| For the future, the biggest thing I'd like to see is actually to make
| "work" be a bitmap, because the "bytes are atomic" approach simply isn't
| portable anyway, so we might as well make things _explicitly_ atomic and
| use bit operations. Otherwise the alpha version of "work" would have to be
| four bytes per "bit" of information, which sounds really excessive.

And then after some discussion:

| In particular, there's been all that discussion about cache-coloring the
| "struct task_struct", and my personal suggestion for that whole can of
| worms is to have the "struct low_level" be in the one low cache-line, and
| make it contain a pointer to "struct task_struct" - and just split the two
| up completely. Then the low-level asm code would never have to even look
| at "task_struct", it would only look at this stuff.

(struct low_level became thread_info).

David

2002-02-15 13:49:48

Subject: Re: [PATCH] move task_struct allocation to arch

Hi,

On Fri, 15 Feb 2002, David Howells wrote:

> Firstly, in response to me having supplied a patch that made a set of four
> byte-size values as the status area in the task_struct:
>
> | For the future, the biggest thing I'd like to see is actually to make
> | "work" be a bitmap, because the "bytes are atomic" approach simply isn't
> | portable anyway, so we might as well make things _explicitly_ atomic and
> | use bit operations. Otherwise the alpha version of "work" would have to be
> | four bytes per "bit" of information, which sounds really excessive.

As I mentioned before, I prefer the byte approach, since atomic bit
field handling is quite expensive on most architectures, whereas a simple
set/clear byte is only one or two instructions if there is a byte
load/store instruction. So I'd really like to see the decision of
whether to use bit or byte fields left to the architecture.
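
For illustration only (not part of the original message): the two styles
of flag handling, sketched with C11 atomics rather than kernel
primitives. All names here are hypothetical. The bitmap variant needs an
atomic read-modify-write because several flags share one word; the byte
variant gives each flag its own byte, so setting it is a single store.
How cheap either one is depends on the architecture, which is the point
under discussion.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Bitmap variant: all flags share one word, so updates are atomic RMW ops. */
static inline void set_work_bit(atomic_ulong *work, unsigned int bit)
{
    atomic_fetch_or(work, 1UL << bit);
}

static inline bool test_work_bit(atomic_ulong *work, unsigned int bit)
{
    return (atomic_load(work) & (1UL << bit)) != 0;
}

/* Byte variant: one byte per flag (hypothetical field names). */
struct work_bytes {
    atomic_uchar need_resched;
    atomic_uchar sigpending;
};

/* A plain byte-sized store; no read-modify-write is required. */
static inline void set_flag_byte(atomic_uchar *flag)
{
    atomic_store_explicit(flag, 1, memory_order_relaxed);
}
```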

> And then after some discussion:
>
> | In particular, there's been all that discussion about cache-coloring the
> | "struct task_struct", and my personal suggestion for that whole can of
> | worms is to have the "struct low_level" be in the one low cache-line, and
> | make it contain a pointer to "struct task_struct" - and just split the two
> | up completely. Then the low-level asm code would never have to even look
> | at "task_struct", it would only look at this stuff.
>
> (struct low_level became thread_info).

That I can agree with. :)

bye, Roman

2002-02-15 13:52:07

Subject: Re: [PATCH] move task_struct allocation to arch

Roman Zippel wrote:
>
> Hi,
>
> On Fri, 15 Feb 2002, David Howells wrote:
>
> > Firstly, in response to me having supplied a patch that made a set of four
> > byte-size values as the status area in the task_struct:
> >
> > | For the future, the biggest thing I'd like to see is actually to make
> > | "work" be a bitmap, because the "bytes are atomic" approach simply isn't
> > | portable anyway, so we might as well make things _explicitly_ atomic and
> > | use bit operations. Otherwise the alpha version of "work" would have to be
> > | four bytes per "bit" of information, which sounds really excessive.
>
> As I mentioned before, I prefer the byte approach, since atomic bit
> field handling is quite expensive on most architectures, whereas a simple
> set/clear byte is only one or two instructions if there is a byte
> load/store instruction. So I'd really like to see the decision of
> whether to use bit or byte fields left to the architecture.

We have tons of code already using atomic test_and_set_bit type
stuff... why not just make sure your bit set/clear stuff is fast? :)

Jeff

--
Jeff Garzik | "I went through my candy like hot oatmeal
Building 1024 | through an internally-buttered weasel."
MandrakeSoft | - goats.com

2002-02-15 14:08:27

Subject: Re: [PATCH] move task_struct allocation to arch

Hi Linus,

Roman Zippel wrote:
> As I mentioned before, I prefer the byte approach, since atomic bit
> field handling is quite expensive on most architectures, whereas a simple
> set/clear byte is only one or two instructions if there is a byte
> load/store instruction. So I'd really like to see the decision of
> whether to use bit or byte fields left to the architecture.

Should I move the convenience bit operations back to the arch header, so that
the M68K guys can have byte-sized flags (which they can use TAS/TST on)?

David

2002-02-15 14:28:37

Subject: Re: [PATCH] move task_struct allocation to arch

Hi,

On Fri, 15 Feb 2002, David Howells wrote:

> Should I move the convenience bit operations back to the arch header, so that
> the M68K guys can have byte-sized flags (which they can use TAS/TST on)?

No tas needed, just byte store, so it's not only for m68k, ppc would
benefit from it too and probably some more.

bye, Roman

2002-02-15 14:22:58