2011-06-23 11:10:29

by Stefano Stabellini

[permalink] [raw]
Subject: [PATCH] x86_64: do not assume head_64.S used 4KB pages when !use_pse

From: Stefano Stabellini <[email protected]>

head_64.S, which sets up the initial page table on x86_64, is not aware
of PSE being enabled or disabled and it always allocates the initial
mapping using 2MB pages.

Therefore on x86_64 find_early_table_space shouldn't update the amount
of pages needed for pte pages depending on the size of the initial
mapping, because we know for sure that no pte pages have been allocated
yet.

Signed-off-by: Stefano Stabellini <[email protected]>
Reported-by: Ingo Molnar <[email protected]>
---
arch/x86/mm/init.c | 7 +++++++
1 files changed, 7 insertions(+), 0 deletions(-)

diff --git a/arch/x86/mm/init.c b/arch/x86/mm/init.c
index 36bacfe..1e3098b 100644
--- a/arch/x86/mm/init.c
+++ b/arch/x86/mm/init.c
@@ -42,12 +42,19 @@ static void __init find_early_table_space(unsigned long start,
(PMD_SIZE * PTRS_PER_PMD));
pmd_mapped *= (PMD_SIZE * PTRS_PER_PMD);

+ /*
+ * On x86_64 do not limit the size we need to cover with 4KB pages
+ * depending on the initial allocation because head_64.S always uses
+ * 2MB pages.
+ */
+#ifdef CONFIG_X86_32
if (start < PFN_PHYS(max_pfn_mapped)) {
if (PFN_PHYS(max_pfn_mapped) < end)
size -= PFN_PHYS(max_pfn_mapped) - start;
else
size = 0;
}
+#endif

#ifndef __PAGETABLE_PUD_FOLDED
if (end > pud_mapped) {
--
1.7.2.3


2011-06-23 12:39:49

by Penttilä Mika

[permalink] [raw]
Subject: Re: [PATCH] x86_64: do not assume head_64.S used 4KB pages when !use_pse

> From: Stefano Stabellini <[email protected]>

> head_64.S, which sets up the initial page table on x86_64, is not aware
> of PSE being enabled or disabled and it always allocates the initial
> mapping using 2MB pages.

> Therefore on x86_64 find_early_table_space shouldn't update the amount
> of pages needed for pte pages depending on the size of the initial
> mapping, because we know for sure that no pte pages have been allocated
> yet.

> Signed-off-by: Stefano Stabellini <[email protected]>
> Reported-by: Ingo Molnar <[email protected]>
> ---
> arch/x86/mm/init.c | 7 +++++++
> 1 files changed, 7 insertions(+), 0 deletions(-)

> diff --git a/arch/x86/mm/init.c b/arch/x86/mm/init.c
> index 36bacfe..1e3098b 100644
> --- a/arch/x86/mm/init.c
> +++ b/arch/x86/mm/init.c
> @@ -42,12 +42,19 @@ static void __init find_early_table_space(unsigned long start,
> (PMD_SIZE * PTRS_PER_PMD));
> pmd_mapped *= (PMD_SIZE * PTRS_PER_PMD);
>
> + /*
> + * On x86_64 do not limit the size we need to cover with 4KB pages
> + * depending on the initial allocation because head_64.S always uses
> + * 2MB pages.
> + */
> +#ifdef CONFIG_X86_32
> if (start < PFN_PHYS(max_pfn_mapped)) {
> if (PFN_PHYS(max_pfn_mapped) < end)
> size -= PFN_PHYS(max_pfn_mapped) - start;
> else
> size = 0;
> }
> +#endif

And arch/x86/mm/init.c also has:

#if defined(CONFIG_DEBUG_PAGEALLOC) || defined(CONFIG_KMEMCHECK)
/*
* For CONFIG_DEBUG_PAGEALLOC, identity mapping will use small pages.
* This will simplify cpa(), which otherwise needs to support splitting
* large pages into small in interrupt context, etc.
*/
use_pse = use_gbpages = 0;
#else
use_pse = cpu_has_pse;
use_gbpages = direct_gbpages;
#endif


So big pages are also not used for DEBUG_PAGEALLOC and KMEMCHECK configs even if head_32.S did.

--Mika




2011-06-23 12:52:39

by Stefano Stabellini

[permalink] [raw]
Subject: Re: [PATCH] x86_64: do not assume head_64.S used 4KB pages when !use_pse

On Thu, 23 Jun 2011, Penttilä Mika wrote:
> > From: Stefano Stabellini <[email protected]>
>
> > head_64.S, which sets up the initial page table on x86_64, is not aware
> > of PSE being enabled or disabled and it always allocates the initial
> > mapping using 2MB pages.
>
> > Therefore on x86_64 find_early_table_space shouldn't update the amount
> > of pages needed for pte pages depending on the size of the initial
> > mapping, because we know for sure that no pte pages have been allocated
> > yet.
>
> > Signed-off-by: Stefano Stabellini <[email protected]>
> > Reported-by: Ingo Molnar <[email protected]>
> > ---
> > arch/x86/mm/init.c | 7 +++++++
> > 1 files changed, 7 insertions(+), 0 deletions(-)
>
> > diff --git a/arch/x86/mm/init.c b/arch/x86/mm/init.c
> > index 36bacfe..1e3098b 100644
> > --- a/arch/x86/mm/init.c
> > +++ b/arch/x86/mm/init.c
> > @@ -42,12 +42,19 @@ static void __init find_early_table_space(unsigned long start,
> > (PMD_SIZE * PTRS_PER_PMD));
> > pmd_mapped *= (PMD_SIZE * PTRS_PER_PMD);
> >
> > + /*
> > + * On x86_64 do not limit the size we need to cover with 4KB pages
> > + * depending on the initial allocation because head_64.S always uses
> > + * 2MB pages.
> > + */
> > +#ifdef CONFIG_X86_32
> > if (start < PFN_PHYS(max_pfn_mapped)) {
> > if (PFN_PHYS(max_pfn_mapped) < end)
> > size -= PFN_PHYS(max_pfn_mapped) - start;
> > else
> > size = 0;
> > }
> > +#endif
>
> And arch/x86/mm/init.c also has:
>
> #if defined(CONFIG_DEBUG_PAGEALLOC) || defined(CONFIG_KMEMCHECK)
> /*
> * For CONFIG_DEBUG_PAGEALLOC, identity mapping will use small pages.
> * This will simplify cpa(), which otherwise needs to support splitting
> * large pages into small in interrupt context, etc.
> */
> use_pse = use_gbpages = 0;
> #else
> use_pse = cpu_has_pse;
> use_gbpages = direct_gbpages;
> #endif
>
>
> So big pages are also not used for DEBUG_PAGEALLOC and KMEMCHECK configs even if head_32.S did.

Right, but that is not a problem because head_32.S always uses 4KB
pages.

2011-06-23 14:50:41

by Penttilä Mika

[permalink] [raw]
Subject: RE: [PATCH] x86_64: do not assume head_64.S used 4KB pages when !use_pse

> From: Stefano Stabellini [mailto:[email protected]]
> Sent: 23. kesäkuuta 2011 15:57
> To: Penttilä Mika
> Cc: Stefano Stabellini; [email protected]
> Subject: Re: [PATCH] x86_64: do not assume head_64.S used 4KB pages
> when !use_pse
>
> On Thu, 23 Jun 2011, Penttilä Mika wrote:
> > > From: Stefano Stabellini <[email protected]>
> >
> > > head_64.S, which sets up the initial page table on x86_64, is not
> aware
> > > of PSE being enabled or disabled and it always allocates the
> initial
> > > mapping using 2MB pages.
> >
> > > Therefore on x86_64 find_early_table_space shouldn't update the
> amount
> > > of pages needed for pte pages depending on the size of the initial
> > > mapping, because we know for sure that no pte pages have been
> allocated
> > > yet.
> >
> > > Signed-off-by: Stefano Stabellini
> <[email protected]>
> > > Reported-by: Ingo Molnar <[email protected]>
> > > ---
> > > arch/x86/mm/init.c | 7 +++++++
> > > 1 files changed, 7 insertions(+), 0 deletions(-)
> >
> > > diff --git a/arch/x86/mm/init.c b/arch/x86/mm/init.c
> > > index 36bacfe..1e3098b 100644
> > > --- a/arch/x86/mm/init.c
> > > +++ b/arch/x86/mm/init.c
> > > @@ -42,12 +42,19 @@ static void __init
> find_early_table_space(unsigned long start,
> > > (PMD_SIZE * PTRS_PER_PMD));
> > > pmd_mapped *= (PMD_SIZE * PTRS_PER_PMD);
> > >
> > > + /*
> > > + * On x86_64 do not limit the size we need to cover with 4KB
> pages
> > > + * depending on the initial allocation because head_64.S always
> uses
> > > + * 2MB pages.
> > > + */
> > > +#ifdef CONFIG_X86_32
> > > if (start < PFN_PHYS(max_pfn_mapped)) {
> > > if (PFN_PHYS(max_pfn_mapped) < end)
> > > size -= PFN_PHYS(max_pfn_mapped) - start;
> > > else
> > > size = 0;
> > > }
> > > +#endif
> >
> > And arch/x86/mm/init.c also has:
> >
> > #if defined(CONFIG_DEBUG_PAGEALLOC) || defined(CONFIG_KMEMCHECK)
> > /*
> > * For CONFIG_DEBUG_PAGEALLOC, identity mapping will use
> small pages.
> > * This will simplify cpa(), which otherwise needs to support
> splitting
> > * large pages into small in interrupt context, etc.
> > */
> > use_pse = use_gbpages = 0;
> > #else
> > use_pse = cpu_has_pse;
> > use_gbpages = direct_gbpages;
> > #endif
> >
> >
> > So big pages are also not used for DEBUG_PAGEALLOC and KMEMCHECK
> configs even if head_32.S did.
>
> Right, but that is not a problem because head_32.S always uses 4KB
> pages.

We use large pages FOR PAE kernels on x86-32 there

--Mika


????{.n?+???????+%?????ݶ??w??{.n?+????{??G?????{ay?ʇڙ?,j??f???h?????????z_??(?階?ݢj"???m??????G????????????&???~???iO???z??v?^?m???? ????????I?

2011-06-23 15:15:33

by Stefano Stabellini

[permalink] [raw]
Subject: RE: [PATCH] x86_64: do not assume head_64.S used 4KB pages when !use_pse

On Thu, 23 Jun 2011, Penttilä Mika wrote:
> > > And arch/x86/mm/init.c also has:
> > >
> > > #if defined(CONFIG_DEBUG_PAGEALLOC) || defined(CONFIG_KMEMCHECK)
> > > /*
> > > * For CONFIG_DEBUG_PAGEALLOC, identity mapping will use
> > small pages.
> > > * This will simplify cpa(), which otherwise needs to support
> > splitting
> > > * large pages into small in interrupt context, etc.
> > > */
> > > use_pse = use_gbpages = 0;
> > > #else
> > > use_pse = cpu_has_pse;
> > > use_gbpages = direct_gbpages;
> > > #endif
> > >
> > >
> > > So big pages are also not used for DEBUG_PAGEALLOC and KMEMCHECK
> > configs even if head_32.S did.
> >
> > Right, but that is not a problem because head_32.S always uses 4KB
> > pages.
>
> We use large pages FOR PAE kernels on x86-32 there
>

Do you mean we use large pages for PAE kernels on x86_32 in
arch/x86/mm/init.c:init_memory_mapping?
That wouldn't be a problem for this patch.

The problem I am trying to solve occurs when head_64.S doesn't allocate
any pte pages because it is using 2MB pages, while init_memory_mapping
wants to use 4KB pages (for example because the user set
CONFIG_DEBUG_PAGEALLOC).

So on x86_32 is not going to be an issue because head_32.S doesn't use
2MB or 4MB pages as far as I can tell, even if the hardware supports
them.
Please correct me if I am wrong but I don't see any _PAGE_PSE in
head_32.S.

2011-06-23 16:26:08

by Penttilä Mika

[permalink] [raw]
Subject: RE: [PATCH] x86_64: do not assume head_64.S used 4KB pages when !use_pse

> From: Stefano Stabellini [mailto:[email protected]]
> Sent: 23. kesäkuuta 2011 18:20
> To: Penttilä Mika
> Cc: Stefano Stabellini; [email protected]
> Subject: RE: [PATCH] x86_64: do not assume head_64.S used 4KB pages
> when !use_pse
>
> On Thu, 23 Jun 2011, Penttilä Mika wrote:
> > > > And arch/x86/mm/init.c also has:
> > > >
> > > > #if defined(CONFIG_DEBUG_PAGEALLOC) || defined(CONFIG_KMEMCHECK)
> > > > /*
> > > > * For CONFIG_DEBUG_PAGEALLOC, identity mapping will use
> > > small pages.
> > > > * This will simplify cpa(), which otherwise needs to
> support
> > > splitting
> > > > * large pages into small in interrupt context, etc.
> > > > */
> > > > use_pse = use_gbpages = 0;
> > > > #else
> > > > use_pse = cpu_has_pse;
> > > > use_gbpages = direct_gbpages;
> > > > #endif
> > > >
> > > >
> > > > So big pages are also not used for DEBUG_PAGEALLOC and KMEMCHECK
> > > configs even if head_32.S did.
> > >
> > > Right, but that is not a problem because head_32.S always uses 4KB
> > > pages.
> >
> > We use large pages FOR PAE kernels on x86-32 there
> >
>
> Do you mean we use large pages for PAE kernels on x86_32 in
> arch/x86/mm/init.c:init_memory_mapping?
> That wouldn't be a problem for this patch.
>
> The problem I am trying to solve occurs when head_64.S doesn't allocate
> any pte pages because it is using 2MB pages, while init_memory_mapping
> wants to use 4KB pages (for example because the user set
> CONFIG_DEBUG_PAGEALLOC).
>
> So on x86_32 is not going to be an issue because head_32.S doesn't use
> 2MB or 4MB pages as far as I can tell, even if the hardware supports
> them.
> Please correct me if I am wrong but I don't see any _PAGE_PSE in
> head_32.S.


Yes you're right head_32.S doesn't do large pages, I remembered it would..

--Mika


????{.n?+???????+%?????ݶ??w??{.n?+????{??G?????{ay?ʇڙ?,j??f???h?????????z_??(?階?ݢj"???m??????G????????????&???~???iO???z??v?^?m???? ????????I?

2011-06-23 23:46:43

by Stefano Stabellini

[permalink] [raw]
Subject: [tip:x86/mm] x86-64, mm: Do not assume head_64.S used 4KB pages when !use_pse

Commit-ID: 2e9fbad5955f34bf1934bd6a8165c1f49ac8cc46
Gitweb: http://git.kernel.org/tip/2e9fbad5955f34bf1934bd6a8165c1f49ac8cc46
Author: Stefano Stabellini <[email protected]>
AuthorDate: Thu, 23 Jun 2011 12:14:25 +0100
Committer: H. Peter Anvin <[email protected]>
CommitDate: Thu, 23 Jun 2011 16:14:02 -0700

x86-64, mm: Do not assume head_64.S used 4KB pages when !use_pse

head_64.S, which sets up the initial page table on x86_64, is not aware
of PSE being enabled or disabled and it always allocates the initial
mapping using 2MB pages.

Therefore on x86_64 find_early_table_space shouldn't update the amount
of pages needed for pte pages depending on the size of the initial
mapping, because we know for sure that no pte pages have been allocated
yet.

Signed-off-by: Stefano Stabellini <[email protected]>
Link: http://lkml.kernel.org/r/1308827665-28890-1-git-send-email-stefano.stabellini@eu.citrix.com
Reported-by: Ingo Molnar <[email protected]>
Signed-off-by: H. Peter Anvin <[email protected]>
---
arch/x86/mm/init.c | 7 +++++++
1 files changed, 7 insertions(+), 0 deletions(-)

diff --git a/arch/x86/mm/init.c b/arch/x86/mm/init.c
index 36bacfe..e72c9f8 100644
--- a/arch/x86/mm/init.c
+++ b/arch/x86/mm/init.c
@@ -42,12 +42,19 @@ static void __init find_early_table_space(unsigned long start,
(PMD_SIZE * PTRS_PER_PMD));
pmd_mapped *= (PMD_SIZE * PTRS_PER_PMD);

+ /*
+ * On x86_64 do not limit the size we need to cover with 4KB pages
+ * depending on the initial allocation because head_64.S always uses
+ * 2MB pages.
+ */
+#ifdef CONFIG_X86_32
if (start < PFN_PHYS(max_pfn_mapped)) {
if (PFN_PHYS(max_pfn_mapped) < end)
size -= PFN_PHYS(max_pfn_mapped) - start;
else
size = 0;
}
+#endif

#ifndef __PAGETABLE_PUD_FOLDED
if (end > pud_mapped) {