
2013-08-21 10:20:03

Subject: [PATCH 0/8] x86, acpi: Move acpi_initrd_override() earlier.

This patch-set aims to move acpi_initrd_override() earlier on x86.
Some of the patches are from Yinghai's patch-set:
https://lkml.org/lkml/2013/6/14/561

The differences between this patch-set and Yinghai's original patch-set are:
1. This patch-set doesn't split acpi_initrd_override(), but calls it as a
whole operation at an early stage.
2. Allocate memory from BRK to store override tables.
(This idea is also from Yinghai.)

[Current state]

The current Linux kernel initializes acpi tables as follows:

1. Find all acpi override tables provided by users in initrd.
(Linux allows users to override acpi tables in firmware, by specifying
their own tables in initrd.)

2. Use acpica code to initialize the acpi global root table list and install all
tables into it. If an override table exists, use it to override the one
provided by firmware.

Then others can parse these tables and get useful info.

Both steps happen after direct mapping page tables are set up.

[Issues]

In the current Linux kernel, the initialization of acpi tables is too late for
new functionalities.

We have some issues about this:

* For memory hotplug, we need ACPI SRAT at early time to be aware of which memory
ranges are hotpluggable, and prevent the bootmem allocator from allocating such
memory for the kernel. (Kernel pages cannot be hotplugged because )

* As suggested by Yinghai Lu <[email protected]>, we should allocate page tables
on the local node. This also needs SRAT before direct mapping page tables are set up.

* As mentioned by Toshi Kani <[email protected]>, ACPI SPCR/DBGP/DBG2 tables
allow the OS to initialize serial console/debug ports at early boot time. The
earlier it can be initialized, the better this feature will be. These tables
are not currently used by Linux due to a licensing issue, but it could be
addressed some time soon.

[What are we doing]

We are trying to initialize acpi tables as early as possible. But the Linux kernel
allows users to override acpi tables by specifying their own tables in initrd.
So we have to do acpi_initrd_override() earlier first.

[About this patch-set]

This patch-set aims to move acpi_initrd_override() as early as possible on x86.
As suggested by Yinghai, we are trying to do it like this:

On 32bit: do it in head_32.S, before paging is enabled. In this case, we can
access initrd with physical address without page tables.

On 64bit: do it in head64.c, after paging is enabled but before direct mapping
is set up.

Also, acpi_initrd_override() needs to allocate memory for override tables.
But at such an early time, no memory allocator works. So the basic idea
from Yinghai is to use BRK. We will extend BRK by 256KB in this patch-set.

Tang Chen (6):
x86, acpi: Move table_sigs[] to stack.
x86, acpi, brk: Extend BRK 256KB to store acpi override tables.
x86, brk: Make extend_brk() available with va/pa.
x86, acpi: Make acpi_initrd_override() available with va or pa.
x86, acpi, brk: Make early_alloc_acpi_override_tables_buf() available
with va/pa.
x86, acpi: Do acpi_initrd_override() earlier in head_32.S/head64.c.

Yinghai Lu (2):
x86: Make get_ramdisk_{image|size}() global.
x86, microcode: Use get_ramdisk_{image|size}() in microcode handling.

arch/x86/include/asm/dmi.h | 2 +-
arch/x86/include/asm/setup.h | 11 +++-
arch/x86/kernel/head64.c | 4 +
arch/x86/kernel/head_32.S | 4 +
arch/x86/kernel/microcode_intel_early.c | 8 +-
arch/x86/kernel/setup.c | 93 ++++++++++++++++------
arch/x86/mm/init.c | 2 +-
arch/x86/xen/enlighten.c | 2 +-
arch/x86/xen/mmu.c | 6 +-
arch/x86/xen/p2m.c | 27 ++++---
drivers/acpi/osl.c | 130 ++++++++++++++++++++-----------
include/linux/acpi.h | 5 +-
12 files changed, 196 insertions(+), 98 deletions(-)

2013-08-21 10:18:15

by Tang Chen


Subject: [PATCH 3/8] x86, acpi: Move table_sigs[] to stack.

On 64bit, we will do acpi_initrd_override() in x86_64_start_kernel(). This is
after the CPU enables paging, but before direct mapping page tables are set up.
This is OK because we have an early page fault handler to help to access data
without direct mapping page tables.

But on 32bit, in order to keep the x86_32 and x86_64 unified code clean, we want
to do acpi_initrd_override() in head_32.S, before CPU enables paging. Without
direct mapping page tables, we need to access data with physical address.

For global variables, if we access them, we are using va. So on 32bit, we have
to convert them into pa. But for a global array, it could be too messy.

So this patch moves table_sigs[] to the stack, defining it in acpi_initrd_override().
It is no more than 36 pointers, so it is OK to put it on the stack.
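
As a rough illustration of the lookup that table_sigs[] serves, the sketch below
keeps a NULL-terminated signature list in a local array and compares the first
four bytes of a candidate signature against it. The name sketch_is_known_sig()
and the shortened list are assumptions made for illustration; they are not part
of this patch. The string literals still live in read-only data, and the sketch
does not model how they are addressed before paging is enabled.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical helper: report whether a 4-byte ACPI signature is in a
 * locally defined, NULL-terminated list. The list is abbreviated.
 */
bool sketch_is_known_sig(const char *signature)
{
	/* Local array of pointers, mirroring the idea of the patch. */
	const char * const sigs[] = { "SRAT", "SLIT", "DSDT", "SSDT", NULL };
	size_t i;

	assert(signature != NULL);

	for (i = 0; sigs[i] != NULL; i++) {
		/* ACPI signatures are exactly four bytes, not NUL-terminated. */
		if (memcmp(signature, sigs[i], 4) == 0)
			return true;
	}
	return false;
}
```

The caller must pass a buffer of at least four bytes, since the comparison
always reads four.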

Originally-From: Yinghai Lu <[email protected]>
Signed-off-by: Tang Chen <[email protected]>
---
drivers/acpi/osl.c | 23 +++++++++++------------
1 files changed, 11 insertions(+), 12 deletions(-)

diff --git a/drivers/acpi/osl.c b/drivers/acpi/osl.c
index e7effc1..06996d8 100644
--- a/drivers/acpi/osl.c
+++ b/drivers/acpi/osl.c
@@ -551,18 +551,6 @@ u8 __init acpi_table_checksum(u8 *buffer, u32 length)
return sum;
}

-/* All but ACPI_SIG_RSDP and ACPI_SIG_FACS: */
-static const char * const table_sigs[] = {
- ACPI_SIG_BERT, ACPI_SIG_CPEP, ACPI_SIG_ECDT, ACPI_SIG_EINJ,
- ACPI_SIG_ERST, ACPI_SIG_HEST, ACPI_SIG_MADT, ACPI_SIG_MSCT,
- ACPI_SIG_SBST, ACPI_SIG_SLIT, ACPI_SIG_SRAT, ACPI_SIG_ASF,
- ACPI_SIG_BOOT, ACPI_SIG_DBGP, ACPI_SIG_DMAR, ACPI_SIG_HPET,
- ACPI_SIG_IBFT, ACPI_SIG_IVRS, ACPI_SIG_MCFG, ACPI_SIG_MCHI,
- ACPI_SIG_SLIC, ACPI_SIG_SPCR, ACPI_SIG_SPMI, ACPI_SIG_TCPA,
- ACPI_SIG_UEFI, ACPI_SIG_WAET, ACPI_SIG_WDAT, ACPI_SIG_WDDT,
- ACPI_SIG_WDRT, ACPI_SIG_DSDT, ACPI_SIG_FADT, ACPI_SIG_PSDT,
- ACPI_SIG_RSDT, ACPI_SIG_XSDT, ACPI_SIG_SSDT, NULL };
-
#define ACPI_HEADER_SIZE sizeof(struct acpi_table_header)

/* Must not increase 10 or needs code modification below */
@@ -577,6 +565,17 @@ void __init acpi_initrd_override(void *data, size_t size)
struct cpio_data file;
struct cpio_data early_initrd_files[ACPI_OVERRIDE_TABLES];
char *p;
+ /* All but ACPI_SIG_RSDP and ACPI_SIG_FACS: */
+ const char * const table_sigs[] = {
+ ACPI_SIG_BERT, ACPI_SIG_CPEP, ACPI_SIG_ECDT, ACPI_SIG_EINJ,
+ ACPI_SIG_ERST, ACPI_SIG_HEST, ACPI_SIG_MADT, ACPI_SIG_MSCT,
+ ACPI_SIG_SBST, ACPI_SIG_SLIT, ACPI_SIG_SRAT, ACPI_SIG_ASF,
+ ACPI_SIG_BOOT, ACPI_SIG_DBGP, ACPI_SIG_DMAR, ACPI_SIG_HPET,
+ ACPI_SIG_IBFT, ACPI_SIG_IVRS, ACPI_SIG_MCFG, ACPI_SIG_MCHI,
+ ACPI_SIG_SLIC, ACPI_SIG_SPCR, ACPI_SIG_SPMI, ACPI_SIG_TCPA,
+ ACPI_SIG_UEFI, ACPI_SIG_WAET, ACPI_SIG_WDAT, ACPI_SIG_WDDT,
+ ACPI_SIG_WDRT, ACPI_SIG_DSDT, ACPI_SIG_FADT, ACPI_SIG_PSDT,
+ ACPI_SIG_RSDT, ACPI_SIG_XSDT, ACPI_SIG_SSDT, NULL };

if (data == NULL || size == 0)
return;
--
1.7.1

2013-08-21 10:18:50

by Tang Chen


Subject: [PATCH 2/8] x86, microcode: Use get_ramdisk_{image|size}() in microcode handling.

From: Yinghai Lu <[email protected]>

Since we made get_ramdisk_{image|size}() global, use them when we want
to access ramdisk.

Signed-off-by: Yinghai Lu <[email protected]>
Cc: Fenghua Yu <[email protected]>
Acked-by: Tejun Heo <[email protected]>
Tested-by: Thomas Renninger <[email protected]>
Reviewed-by: Tang Chen <[email protected]>
Tested-by: Tang Chen <[email protected]>
---
arch/x86/kernel/microcode_intel_early.c | 8 ++++----
1 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/arch/x86/kernel/microcode_intel_early.c b/arch/x86/kernel/microcode_intel_early.c
index 1575deb..4c58a1b 100644
--- a/arch/x86/kernel/microcode_intel_early.c
+++ b/arch/x86/kernel/microcode_intel_early.c
@@ -743,8 +743,8 @@ load_ucode_intel_bsp(void)
struct boot_params *boot_params_p;

boot_params_p = (struct boot_params *)__pa_nodebug(&boot_params);
- ramdisk_image = boot_params_p->hdr.ramdisk_image;
- ramdisk_size = boot_params_p->hdr.ramdisk_size;
+ ramdisk_image = get_ramdisk_image(boot_params_p);
+ ramdisk_size = get_ramdisk_size(boot_params_p);
initrd_start_early = ramdisk_image;
initrd_end_early = initrd_start_early + ramdisk_size;

@@ -753,8 +753,8 @@ load_ucode_intel_bsp(void)
(unsigned long *)__pa_nodebug(&mc_saved_in_initrd),
initrd_start_early, initrd_end_early, &uci);
#else
- ramdisk_image = boot_params.hdr.ramdisk_image;
- ramdisk_size = boot_params.hdr.ramdisk_size;
+ ramdisk_image = get_ramdisk_image(&boot_params);
+ ramdisk_size = get_ramdisk_size(&boot_params);
initrd_start_early = ramdisk_image + PAGE_OFFSET;
initrd_end_early = initrd_start_early + ramdisk_size;

--
1.7.1

2013-08-21 10:19:52

by Tang Chen


Subject: [PATCH 1/8] x86: Make get_ramdisk_{image|size}() global.

From: Yinghai Lu <[email protected]>

This patch does two things:

1. Make get_ramdisk_image() and get_ramdisk_size() global so that we can use
them in the later patches.

2. In later patches, we are going to call them in head_32.S before paging is
enabled. In that case, we can only use physical addresses to access global
variables like boot_params. So make them take a boot_params pointer parameter
so that we can pass va or pa to them.
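
The pointer-parameter approach described above can be sketched in user space
as follows. The struct and function names prefixed with sketch_ are
hypothetical, cut-down stand-ins for boot_params and the two helpers; only the
fields used here are modeled. The point is that the helpers never name the
global themselves, so the caller decides which address to hand in.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical reduced versions of the kernel structures. */
struct sketch_setup_header {
	uint32_t ramdisk_image;	/* low 32 bits of the initrd address */
	uint32_t ramdisk_size;	/* low 32 bits of the initrd size */
};

struct sketch_boot_params {
	struct sketch_setup_header hdr;
	uint32_t ext_ramdisk_image;	/* high 32 bits of the address */
	uint32_t ext_ramdisk_size;	/* high 32 bits of the size */
};

/* Combine the low and high halves into one 64-bit address. */
uint64_t sketch_get_ramdisk_image(const struct sketch_boot_params *bp)
{
	uint64_t image;

	assert(bp != NULL);
	image = bp->hdr.ramdisk_image;
	image |= (uint64_t)bp->ext_ramdisk_image << 32;
	return image;
}

/* Combine the low and high halves into one 64-bit size. */
uint64_t sketch_get_ramdisk_size(const struct sketch_boot_params *bp)
{
	uint64_t size;

	assert(bp != NULL);
	size = bp->hdr.ramdisk_size;
	size |= (uint64_t)bp->ext_ramdisk_size << 32;
	return size;
}
```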

Signed-off-by: Yinghai Lu <[email protected]>
Acked-by: Tejun Heo <[email protected]>
Tested-by: Thomas Renninger <[email protected]>
Reviewed-by: Tang Chen <[email protected]>
Tested-by: Tang Chen <[email protected]>
---
arch/x86/include/asm/setup.h | 3 +++
arch/x86/kernel/setup.c | 28 ++++++++++++++--------------
2 files changed, 17 insertions(+), 14 deletions(-)

diff --git a/arch/x86/include/asm/setup.h b/arch/x86/include/asm/setup.h
index b7bf350..4f71d48 100644
--- a/arch/x86/include/asm/setup.h
+++ b/arch/x86/include/asm/setup.h
@@ -106,6 +106,9 @@ void *extend_brk(size_t size, size_t align);
RESERVE_BRK(name, sizeof(type) * entries)

extern void probe_roms(void);
+u64 get_ramdisk_image(struct boot_params *bp);
+u64 get_ramdisk_size(struct boot_params *bp);
+
#ifdef __i386__

void __init i386_start_kernel(void);
diff --git a/arch/x86/kernel/setup.c b/arch/x86/kernel/setup.c
index f8ec578..5bfd4c8 100644
--- a/arch/x86/kernel/setup.c
+++ b/arch/x86/kernel/setup.c
@@ -297,19 +297,19 @@ static void __init reserve_brk(void)

#ifdef CONFIG_BLK_DEV_INITRD

-static u64 __init get_ramdisk_image(void)
+u64 __init get_ramdisk_image(struct boot_params *bp)
{
- u64 ramdisk_image = boot_params.hdr.ramdisk_image;
+ u64 ramdisk_image = bp->hdr.ramdisk_image;

- ramdisk_image |= (u64)boot_params.ext_ramdisk_image << 32;
+ ramdisk_image |= (u64)bp->ext_ramdisk_image << 32;

return ramdisk_image;
}
-static u64 __init get_ramdisk_size(void)
+u64 __init get_ramdisk_size(struct boot_params *bp)
{
- u64 ramdisk_size = boot_params.hdr.ramdisk_size;
+ u64 ramdisk_size = bp->hdr.ramdisk_size;

- ramdisk_size |= (u64)boot_params.ext_ramdisk_size << 32;
+ ramdisk_size |= (u64)bp->ext_ramdisk_size << 32;

return ramdisk_size;
}
@@ -318,8 +318,8 @@ static u64 __init get_ramdisk_size(void)
static void __init relocate_initrd(void)
{
/* Assume only end is not page aligned */
- u64 ramdisk_image = get_ramdisk_image();
- u64 ramdisk_size = get_ramdisk_size();
+ u64 ramdisk_image = get_ramdisk_image(&boot_params);
+ u64 ramdisk_size = get_ramdisk_size(&boot_params);
u64 area_size = PAGE_ALIGN(ramdisk_size);
u64 ramdisk_here;
unsigned long slop, clen, mapaddr;
@@ -358,8 +358,8 @@ static void __init relocate_initrd(void)
ramdisk_size -= clen;
}

- ramdisk_image = get_ramdisk_image();
- ramdisk_size = get_ramdisk_size();
+ ramdisk_image = get_ramdisk_image(&boot_params);
+ ramdisk_size = get_ramdisk_size(&boot_params);
printk(KERN_INFO "Move RAMDISK from [mem %#010llx-%#010llx] to"
" [mem %#010llx-%#010llx]\n",
ramdisk_image, ramdisk_image + ramdisk_size - 1,
@@ -369,8 +369,8 @@ static void __init relocate_initrd(void)
static void __init early_reserve_initrd(void)
{
/* Assume only end is not page aligned */
- u64 ramdisk_image = get_ramdisk_image();
- u64 ramdisk_size = get_ramdisk_size();
+ u64 ramdisk_image = get_ramdisk_image(&boot_params);
+ u64 ramdisk_size = get_ramdisk_size(&boot_params);
u64 ramdisk_end = PAGE_ALIGN(ramdisk_image + ramdisk_size);

if (!boot_params.hdr.type_of_loader ||
@@ -382,8 +382,8 @@ static void __init early_reserve_initrd(void)
static void __init reserve_initrd(void)
{
/* Assume only end is not page aligned */
- u64 ramdisk_image = get_ramdisk_image();
- u64 ramdisk_size = get_ramdisk_size();
+ u64 ramdisk_image = get_ramdisk_image(&boot_params);
+ u64 ramdisk_size = get_ramdisk_size(&boot_params);
u64 ramdisk_end = PAGE_ALIGN(ramdisk_image + ramdisk_size);
u64 mapped_size;

--
1.7.1

2013-08-21 10:20:20

by Tang Chen


Subject: [PATCH 7/8] x86, acpi, brk: Make early_alloc_acpi_override_tables_buf() available with va/pa.

We are using the same trick as in the previous patch.

Introduce a "bool is_phys" to early_alloc_acpi_override_tables_buf(). When it
is true, convert the va of all global variables to pa, so that we can access
them on 32bit before paging is enabled.

NOTE: Do not call printk() on 32bit before paging is enabled
because it will use global variables.
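
The is_phys selection can be modeled with plain integers. The sketch below
assumes the common 3G/1G split on 32bit, where a lowmem kernel virtual address
is the physical address plus 0xC0000000; that constant and the sketch_ names
are assumptions made for illustration, not taken from this patch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Assumption: default 3G/1G split on 32-bit x86. */
#define SKETCH_PAGE_OFFSET UINT32_C(0xC0000000)

/* Model of a va-to-pa conversion for a lowmem kernel address. */
static uint32_t sketch_pa(uint32_t va)
{
	assert(va >= SKETCH_PAGE_OFFSET);
	return va - SKETCH_PAGE_OFFSET;
}

/*
 * Pick the address through which a global should be accessed:
 * its physical address before paging is enabled, its virtual
 * address afterwards.
 */
uint32_t sketch_global_addr(uint32_t va, bool is_phys)
{
	return is_phys ? sketch_pa(va) : va;
}
```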

Signed-off-by: Tang Chen <[email protected]>
---
arch/x86/kernel/setup.c | 2 +-
drivers/acpi/osl.c | 11 ++++++++---
include/linux/acpi.h | 2 +-
3 files changed, 10 insertions(+), 5 deletions(-)

diff --git a/arch/x86/kernel/setup.c b/arch/x86/kernel/setup.c
index 1290ea7..5729cd2 100644
--- a/arch/x86/kernel/setup.c
+++ b/arch/x86/kernel/setup.c
@@ -1071,7 +1071,7 @@ void __init setup_arch(char **cmdline_p)

#if defined(CONFIG_ACPI) && defined(CONFIG_BLK_DEV_INITRD)
/* Allocate buffer to store acpi override tables in brk. */
- early_alloc_acpi_override_tables_buf();
+ early_alloc_acpi_override_tables_buf(false);
#endif

/*
diff --git a/drivers/acpi/osl.c b/drivers/acpi/osl.c
index ccdb5a6..25ba68d 100644
--- a/drivers/acpi/osl.c
+++ b/drivers/acpi/osl.c
@@ -560,10 +560,15 @@ u8 __init acpi_table_checksum(u8 *buffer, u32 length)
/* Reserve 256KB in BRK to store acpi override tables */
#define ACPI_OVERRIDE_TABLES_SIZE (256 * 1024)
RESERVE_BRK(acpi_override_tables_alloc, ACPI_OVERRIDE_TABLES_SIZE);
-void __init early_alloc_acpi_override_tables_buf(void)
+void __init early_alloc_acpi_override_tables_buf(bool is_phys)
{
- acpi_tables_addr = __pa(extend_brk(ACPI_OVERRIDE_TABLES_SIZE,
- PAGE_SIZE, false));
+ u64 *acpi_tables_addr_p;
+
+ acpi_tables_addr_p = is_phys ? (u64 *)__pa_nodebug(&acpi_tables_addr) :
+ (u64 *)&acpi_tables_addr;
+
+ *acpi_tables_addr_p = __pa_nodebug(extend_brk(ACPI_OVERRIDE_TABLES_SIZE,
+ PAGE_SIZE, is_phys));
}

/**
diff --git a/include/linux/acpi.h b/include/linux/acpi.h
index af4da51..17f2e8e 100644
--- a/include/linux/acpi.h
+++ b/include/linux/acpi.h
@@ -81,7 +81,7 @@ typedef int (*acpi_tbl_entry_handler)(struct acpi_subtable_header *header,

#ifdef CONFIG_ACPI_INITRD_TABLE_OVERRIDE
void acpi_initrd_override(void *data, size_t size, bool is_phys);
-void early_alloc_acpi_override_tables_buf(void);
+void early_alloc_acpi_override_tables_buf(bool is_phys);
#else
static inline void acpi_initrd_override(void *data, size_t size, bool is_phys)
{
--
1.7.1

2013-08-21 10:20:27

by Tang Chen


Subject: [PATCH 5/8] x86, brk: Make extend_brk() available with va/pa.

We are going to do acpi_initrd_override() at very early time:

On 32bit: do it in head_32.S, before paging is enabled. In this case, we can
access initrd with physical address without page tables.

On 64bit: do it in head64.c, after paging is enabled but before direct mapping
is set up.

On 64bit, we have an early page fault handler to help to access data
without direct mapping page tables. So it is easy to do in head64.c.

And we need to allocate memory to store override tables. At such an early time,
no memory allocator works. So we can only use BRK.

As mentioned above, on 32bit before paging is enabled, we have to access variables
with pa. So introduce a "bool is_phys" parameter to extend_brk(), and convert va
to pa if it is true.
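
For readers unfamiliar with BRK, the allocation itself is a bump allocator:
round the current end up to the requested alignment, check the limit, advance,
and zero the new area. The user-space sketch below shows only that logic. The
struct sketch_brk and sketch_extend_brk() are hypothetical; passing the state
by pointer loosely stands in for choosing between the va and pa view of
_brk_end, which cannot be reproduced outside the kernel.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical allocator state, loosely modeled on _brk_end/__brk_limit. */
struct sketch_brk {
	unsigned char *end;	/* next free byte */
	unsigned char *limit;	/* one past the reserved area */
};

/* Carve size bytes, aligned to align (a power of two), out of the area. */
void *sketch_extend_brk(struct sketch_brk *brk, size_t size, size_t align)
{
	size_t mask = align - 1;
	uintptr_t start;

	assert(brk != NULL);
	assert(align != 0 && (align & mask) == 0);

	/* Round the current end up to the requested alignment. */
	start = ((uintptr_t)brk->end + mask) & ~(uintptr_t)mask;

	/* Refuse to run past the reserved area. */
	assert(start <= (uintptr_t)brk->limit);
	assert(size <= (uintptr_t)brk->limit - start);

	brk->end = (unsigned char *)start + size;
	memset((void *)start, 0, size);
	return (void *)start;
}
```

Nothing is ever freed; the area simply grows until the limit is reached, which
is why each user has to reserve its share in advance.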

Signed-off-by: Tang Chen <[email protected]>
---
arch/x86/include/asm/dmi.h | 2 +-
arch/x86/include/asm/setup.h | 2 +-
arch/x86/kernel/setup.c | 20 ++++++++++++++------
arch/x86/mm/init.c | 2 +-
arch/x86/xen/enlighten.c | 2 +-
arch/x86/xen/mmu.c | 6 +++---
arch/x86/xen/p2m.c | 27 ++++++++++++++-------------
drivers/acpi/osl.c | 2 +-
8 files changed, 36 insertions(+), 27 deletions(-)

diff --git a/arch/x86/include/asm/dmi.h b/arch/x86/include/asm/dmi.h
index fd8f9e2..3b51d81 100644
--- a/arch/x86/include/asm/dmi.h
+++ b/arch/x86/include/asm/dmi.h
@@ -9,7 +9,7 @@

static __always_inline __init void *dmi_alloc(unsigned len)
{
- return extend_brk(len, sizeof(int));
+ return extend_brk(len, sizeof(int), false);
}

/* Use early IO mappings for DMI because it's initialized early */
diff --git a/arch/x86/include/asm/setup.h b/arch/x86/include/asm/setup.h
index 4f71d48..96d00da 100644
--- a/arch/x86/include/asm/setup.h
+++ b/arch/x86/include/asm/setup.h
@@ -75,7 +75,7 @@ extern struct boot_params boot_params;

/* exceedingly early brk-like allocator */
extern unsigned long _brk_end;
-void *extend_brk(size_t size, size_t align);
+void *extend_brk(size_t size, size_t align, bool is_phys);

/*
* Reserve space in the brk section. The name must be unique within
diff --git a/arch/x86/kernel/setup.c b/arch/x86/kernel/setup.c
index 51fcd5d..a189909 100644
--- a/arch/x86/kernel/setup.c
+++ b/arch/x86/kernel/setup.c
@@ -259,19 +259,27 @@ static inline void __init copy_edd(void)
}
#endif

-void * __init extend_brk(size_t size, size_t align)
+void * __init extend_brk(size_t size, size_t align, bool is_phys)
{
size_t mask = align - 1;
void *ret;
+ unsigned long *brk_start, *brk_end, *brk_limit;

- BUG_ON(_brk_start == 0);
+ brk_start = is_phys ? (unsigned long *)__pa_nodebug(&_brk_start) :
+ (unsigned long *)&_brk_start;
+ brk_end = is_phys ? (unsigned long *)__pa_nodebug(&_brk_end) :
+ (unsigned long *)&_brk_end;
+ brk_limit = is_phys ? (unsigned long *)__pa_nodebug(__brk_limit) :
+ (unsigned long *)__brk_limit;
+
+ BUG_ON(*brk_start == 0);
BUG_ON(align & mask);

- _brk_end = (_brk_end + mask) & ~mask;
- BUG_ON((char *)(_brk_end + size) > __brk_limit);
+ *brk_end = (*brk_end + mask) & ~mask;
+ BUG_ON((char *)(*brk_end + size) > brk_limit);

- ret = (void *)_brk_end;
- _brk_end += size;
+ ret = (void *)(*brk_end);
+ *brk_end += size;

memset(ret, 0, size);

diff --git a/arch/x86/mm/init.c b/arch/x86/mm/init.c
index 2ec29ac..189a9e2 100644
--- a/arch/x86/mm/init.c
+++ b/arch/x86/mm/init.c
@@ -86,7 +86,7 @@ void __init early_alloc_pgt_buf(void)
unsigned long tables = INIT_PGT_BUF_SIZE;
phys_addr_t base;

- base = __pa(extend_brk(tables, PAGE_SIZE));
+ base = __pa(extend_brk(tables, PAGE_SIZE, false));

pgt_buf_start = base >> PAGE_SHIFT;
pgt_buf_end = pgt_buf_start;
diff --git a/arch/x86/xen/enlighten.c b/arch/x86/xen/enlighten.c
index 193097e..2d5a34f 100644
--- a/arch/x86/xen/enlighten.c
+++ b/arch/x86/xen/enlighten.c
@@ -1629,7 +1629,7 @@ void __ref xen_hvm_init_shared_info(void)

if (!shared_info_page)
shared_info_page = (struct shared_info *)
- extend_brk(PAGE_SIZE, PAGE_SIZE);
+ extend_brk(PAGE_SIZE, PAGE_SIZE, false);
xatp.domid = DOMID_SELF;
xatp.idx = 0;
xatp.space = XENMAPSPACE_shared_info;
diff --git a/arch/x86/xen/mmu.c b/arch/x86/xen/mmu.c
index fdc3ba2..573bc50 100644
--- a/arch/x86/xen/mmu.c
+++ b/arch/x86/xen/mmu.c
@@ -1768,7 +1768,7 @@ static void __init xen_map_identity_early(pmd_t *pmd, unsigned long max_pfn)
unsigned long pfn;

level1_ident_pgt = extend_brk(sizeof(pte_t) * LEVEL1_IDENT_ENTRIES,
- PAGE_SIZE);
+ PAGE_SIZE, false);

ident_pte = 0;
pfn = 0;
@@ -1980,7 +1980,7 @@ static void __init xen_write_cr3_init(unsigned long cr3)
* swapper_pg_dir.
*/
swapper_kernel_pmd =
- extend_brk(sizeof(pmd_t) * PTRS_PER_PMD, PAGE_SIZE);
+ extend_brk(sizeof(pmd_t) * PTRS_PER_PMD, PAGE_SIZE, false);
copy_page(swapper_kernel_pmd, initial_kernel_pmd);
swapper_pg_dir[KERNEL_PGD_BOUNDARY] =
__pgd(__pa(swapper_kernel_pmd) | _PAGE_PRESENT);
@@ -2003,7 +2003,7 @@ void __init xen_setup_kernel_pagetable(pgd_t *pgd, unsigned long max_pfn)
pmd_t *kernel_pmd;

initial_kernel_pmd =
- extend_brk(sizeof(pmd_t) * PTRS_PER_PMD, PAGE_SIZE);
+ extend_brk(sizeof(pmd_t) * PTRS_PER_PMD, PAGE_SIZE, false);

max_pfn_mapped = PFN_DOWN(__pa(xen_start_info->pt_base) +
xen_start_info->nr_pt_frames * PAGE_SIZE +
diff --git a/arch/x86/xen/p2m.c b/arch/x86/xen/p2m.c
index 95fb2aa..bbdcf20 100644
--- a/arch/x86/xen/p2m.c
+++ b/arch/x86/xen/p2m.c
@@ -281,13 +281,13 @@ void __ref xen_build_mfn_list_list(void)

/* Pre-initialize p2m_top_mfn to be completely missing */
if (p2m_top_mfn == NULL) {
- p2m_mid_missing_mfn = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ p2m_mid_missing_mfn = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
p2m_mid_mfn_init(p2m_mid_missing_mfn);

- p2m_top_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ p2m_top_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
p2m_top_mfn_p_init(p2m_top_mfn_p);

- p2m_top_mfn = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ p2m_top_mfn = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
p2m_top_mfn_init(p2m_top_mfn);
} else {
/* Reinitialise, mfn's all change after migration */
@@ -322,7 +322,7 @@ void __ref xen_build_mfn_list_list(void)
* runtime. extend_brk() will BUG if we call
* it too late.
*/
- mid_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ mid_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
p2m_mid_mfn_init(mid_mfn_p);

p2m_top_mfn_p[topidx] = mid_mfn_p;
@@ -351,16 +351,16 @@ void __init xen_build_dynamic_phys_to_machine(void)

xen_max_p2m_pfn = max_pfn;

- p2m_missing = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ p2m_missing = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
p2m_init(p2m_missing);

- p2m_mid_missing = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ p2m_mid_missing = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
p2m_mid_init(p2m_mid_missing);

- p2m_top = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ p2m_top = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
p2m_top_init(p2m_top);

- p2m_identity = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ p2m_identity = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
p2m_init(p2m_identity);

/*
@@ -373,7 +373,8 @@ void __init xen_build_dynamic_phys_to_machine(void)
unsigned mididx = p2m_mid_index(pfn);

if (p2m_top[topidx] == p2m_mid_missing) {
- unsigned long **mid = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ unsigned long **mid = extend_brk(PAGE_SIZE, PAGE_SIZE,
+ false);
p2m_mid_init(mid);

p2m_top[topidx] = mid;
@@ -609,7 +610,7 @@ static bool __init early_alloc_p2m_middle(unsigned long pfn, bool check_boundary
return false;

/* Boundary cross-over for the edges: */
- p2m = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ p2m = extend_brk(PAGE_SIZE, PAGE_SIZE, false);

p2m_init(p2m);

@@ -635,7 +636,7 @@ static bool __init early_alloc_p2m(unsigned long pfn)
mid = p2m_top[topidx];
mid_mfn_p = p2m_top_mfn_p[topidx];
if (mid == p2m_mid_missing) {
- mid = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ mid = extend_brk(PAGE_SIZE, PAGE_SIZE, false);

p2m_mid_init(mid);

@@ -645,7 +646,7 @@ static bool __init early_alloc_p2m(unsigned long pfn)
}
/* And the save/restore P2M tables.. */
if (mid_mfn_p == p2m_mid_missing_mfn) {
- mid_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE);
+ mid_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
p2m_mid_mfn_init(mid_mfn_p);

p2m_top_mfn_p[topidx] = mid_mfn_p;
@@ -858,7 +859,7 @@ static void __init m2p_override_init(void)
unsigned i;

m2p_overrides = extend_brk(sizeof(*m2p_overrides) * M2P_OVERRIDE_HASH,
- sizeof(unsigned long));
+ sizeof(unsigned long), false);

for (i = 0; i < M2P_OVERRIDE_HASH; i++)
INIT_LIST_HEAD(&m2p_overrides[i]);
diff --git a/drivers/acpi/osl.c b/drivers/acpi/osl.c
index 4c1baa7..dff7fcc 100644
--- a/drivers/acpi/osl.c
+++ b/drivers/acpi/osl.c
@@ -563,7 +563,7 @@ RESERVE_BRK(acpi_override_tables_alloc, ACPI_OVERRIDE_TABLES_SIZE);
void __init early_alloc_acpi_override_tables_buf(void)
{
acpi_tables_addr = __pa(extend_brk(ACPI_OVERRIDE_TABLES_SIZE,
- PAGE_SIZE));
+ PAGE_SIZE, false));
}

void __init acpi_initrd_override(void *data, size_t size)
--
1.7.1

2013-08-21 10:20:53

by Tang Chen


Subject: [PATCH 4/8] x86, acpi, brk: Extend BRK 256KB to store acpi override tables.

When finding acpi override tables in initrd, we need to allocate memory to
store these tables. But at such an early time, we don't have any memory
allocator. The basic idea is to use BRK.

This patch reserves 256KB in BRK, and allocates it to store override tables,
instead of using memblock.

This idea is from Yinghai Lu <[email protected]>.
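
Because the BRK buffer has a fixed size, tables that do not fit have to be
skipped rather than allocated for. A minimal sketch of that policy follows;
sketch_select_tables() and SKETCH_BUF_SIZE are hypothetical names, and the
check is written as a comparison against the remaining space instead of the
add-then-subtract form used in the diff, which gives the same result for the
sizes involved.

```c
#include <assert.h>
#include <stddef.h>

/* Assumption: same 256KB budget as the reserved BRK buffer. */
#define SKETCH_BUF_SIZE ((size_t)256 * 1024)

/*
 * Walk the table lengths in order. A table is accepted if it still fits
 * in the remaining budget; otherwise it is skipped and the walk goes on.
 * accepted[i] is set to 1 or 0; the return value is the number accepted.
 */
size_t sketch_select_tables(const size_t *lengths, size_t n, int *accepted)
{
	size_t total = 0, count = 0, i;

	assert(lengths != NULL && accepted != NULL);

	for (i = 0; i < n; i++) {
		if (lengths[i] > SKETCH_BUF_SIZE - total) {
			accepted[i] = 0;	/* would overflow the buffer */
			continue;
		}
		total += lengths[i];
		accepted[i] = 1;
		count++;
	}
	return count;
}
```

A later, smaller table can still be accepted after a larger one was skipped.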

Signed-off-by: Tang Chen <[email protected]>
---
arch/x86/kernel/setup.c | 5 +++++
drivers/acpi/osl.c | 44 ++++++++++++++++++++++----------------------
include/linux/acpi.h | 1 +
3 files changed, 28 insertions(+), 22 deletions(-)

diff --git a/arch/x86/kernel/setup.c b/arch/x86/kernel/setup.c
index 5bfd4c8..51fcd5d 100644
--- a/arch/x86/kernel/setup.c
+++ b/arch/x86/kernel/setup.c
@@ -1061,6 +1061,11 @@ void __init setup_arch(char **cmdline_p)

early_alloc_pgt_buf();

+#if defined(CONFIG_ACPI) && defined(CONFIG_BLK_DEV_INITRD)
+ /* Allocate buffer to store acpi override tables in brk. */
+ early_alloc_acpi_override_tables_buf();
+#endif
+
/*
* Need to conclude brk, before memblock_x86_fill()
* it could use memblock_find_in_range, could overlap with
diff --git a/drivers/acpi/osl.c b/drivers/acpi/osl.c
index 06996d8..4c1baa7 100644
--- a/drivers/acpi/osl.c
+++ b/drivers/acpi/osl.c
@@ -48,6 +48,7 @@

#include <asm/io.h>
#include <asm/uaccess.h>
+#include <asm/setup.h>

#include <acpi/acpi.h>
#include <acpi/acpi_bus.h>
@@ -556,6 +557,15 @@ u8 __init acpi_table_checksum(u8 *buffer, u32 length)
/* Must not increase 10 or needs code modification below */
#define ACPI_OVERRIDE_TABLES 10

+/* Reserve 256KB in BRK to store acpi override tables */
+#define ACPI_OVERRIDE_TABLES_SIZE (256 * 1024)
+RESERVE_BRK(acpi_override_tables_alloc, ACPI_OVERRIDE_TABLES_SIZE);
+void __init early_alloc_acpi_override_tables_buf(void)
+{
+ acpi_tables_addr = __pa(extend_brk(ACPI_OVERRIDE_TABLES_SIZE,
+ PAGE_SIZE));
+}
+
void __init acpi_initrd_override(void *data, size_t size)
{
int sig, no, table_nr = 0, total_offset = 0;
@@ -619,7 +629,18 @@ void __init acpi_initrd_override(void *data, size_t size)
pr_info("%4.4s ACPI table found in initrd [%s%s][0x%x]\n",
table->signature, cpio_path, file.name, table->length);

+ /*
+ * If the override tables in cpio file exceeds the BRK buffer,
+ * ignore the current table and go for the next one.
+ */
all_tables_size += table->length;
+ if (all_tables_size > ACPI_OVERRIDE_TABLES_SIZE) {
+ pr_warning("ACPI OVERRIDE: ACPI override tables exceeds buffer size."
+ " Ignoring table %4.4s\n", table->signature);
+ all_tables_size -= table->length;
+ continue;
+ }
+
early_initrd_files[table_nr].data = file.data;
early_initrd_files[table_nr].size = file.size;
table_nr++;
@@ -627,34 +648,13 @@ void __init acpi_initrd_override(void *data, size_t size)
if (table_nr == 0)
return;

- acpi_tables_addr =
- memblock_find_in_range(0, max_low_pfn_mapped << PAGE_SHIFT,
- all_tables_size, PAGE_SIZE);
- if (!acpi_tables_addr) {
- WARN_ON(1);
- return;
- }
- /*
- * Only calling e820_add_reserve does not work and the
- * tables are invalid (memory got used) later.
- * memblock_reserve works as expected and the tables won't get modified.
- * But it's not enough on X86 because ioremap will
- * complain later (used by acpi_os_map_memory) that the pages
- * that should get mapped are not marked "reserved".
- * Both memblock_reserve and e820_add_region (via arch_reserve_mem_area)
- * works fine.
- */
- memblock_reserve(acpi_tables_addr, all_tables_size);
- arch_reserve_mem_area(acpi_tables_addr, all_tables_size);
-
- p = early_ioremap(acpi_tables_addr, all_tables_size);
+ p = __va(acpi_tables_addr);

for (no = 0; no < table_nr; no++) {
memcpy(p + total_offset, early_initrd_files[no].data,
early_initrd_files[no].size);
total_offset += early_initrd_files[no].size;
}
- early_iounmap(p, all_tables_size);
}
#endif /* CONFIG_ACPI_INITRD_TABLE_OVERRIDE */

diff --git a/include/linux/acpi.h b/include/linux/acpi.h
index 353ba25..381579e 100644
--- a/include/linux/acpi.h
+++ b/include/linux/acpi.h
@@ -81,6 +81,7 @@ typedef int (*acpi_tbl_entry_handler)(struct acpi_subtable_header *header,

#ifdef CONFIG_ACPI_INITRD_TABLE_OVERRIDE
void acpi_initrd_override(void *data, size_t size);
+void early_alloc_acpi_override_tables_buf(void);
#else
static inline void acpi_initrd_override(void *data, size_t size)
{
--
1.7.1

2013-08-21 10:21:53

by Tang Chen


Subject: [PATCH 8/8] x86, acpi: Do acpi_initrd_override() earlier in head_32.S/head64.c.

Introduce x86_acpi_initrd_override() to do the acpi table override job. This function
can be called before or after paging is enabled. On 32bit, it will be called before
paging is enabled. On 64bit, it will be called after paging is enabled but before
direct mapping page tables are set up.

Originally-From: Yinghai Lu <[email protected]>
Signed-off-by: Tang Chen <[email protected]>
---
arch/x86/include/asm/setup.h | 6 +++++
arch/x86/kernel/head64.c | 4 +++
arch/x86/kernel/head_32.S | 4 +++
arch/x86/kernel/setup.c | 51 ++++++++++++++++++++++++++++++++---------
4 files changed, 54 insertions(+), 11 deletions(-)

diff --git a/arch/x86/include/asm/setup.h b/arch/x86/include/asm/setup.h
index 96d00da..9f32cb4 100644
--- a/arch/x86/include/asm/setup.h
+++ b/arch/x86/include/asm/setup.h
@@ -42,6 +42,12 @@ extern void visws_early_detect(void);
static inline void visws_early_detect(void) { }
#endif

+#ifdef CONFIG_ACPI_INITRD_TABLE_OVERRIDE
+void x86_acpi_initrd_override(void);
+#else
+static inline void x86_acpi_initrd_override(void) { }
+#endif /* CONFIG_ACPI_INITRD_TABLE_OVERRIDE */
+
extern unsigned long saved_video_mode;

extern void reserve_standard_io_resources(void);
diff --git a/arch/x86/kernel/head64.c b/arch/x86/kernel/head64.c
index 55b6761..88e19b4 100644
--- a/arch/x86/kernel/head64.c
+++ b/arch/x86/kernel/head64.c
@@ -175,6 +175,10 @@ void __init x86_64_start_kernel(char * real_mode_data)
if (console_loglevel == 10)
early_printk("Kernel alive\n");

+#if defined(CONFIG_ACPI) && defined(CONFIG_BLK_DEV_INITRD)
+ x86_acpi_initrd_override();
+#endif
+
clear_page(init_level4_pgt);
/* set init_level4_pgt kernel high mapping*/
init_level4_pgt[511] = early_level4_pgt[511];
diff --git a/arch/x86/kernel/head_32.S b/arch/x86/kernel/head_32.S
index 5dd87a8..e04e13b 100644
--- a/arch/x86/kernel/head_32.S
+++ b/arch/x86/kernel/head_32.S
@@ -149,6 +149,10 @@ ENTRY(startup_32)
call load_ucode_bsp
#endif

+#if defined(CONFIG_ACPI) && defined(CONFIG_BLK_DEV_INITRD)
+ call x86_acpi_initrd_override
+#endif
+
/*
* Initialize page tables. This creates a PDE and a set of page
* tables, which are located immediately beyond __brk_base. The variable
diff --git a/arch/x86/kernel/setup.c b/arch/x86/kernel/setup.c
index 5729cd2..b48a0ff 100644
--- a/arch/x86/kernel/setup.c
+++ b/arch/x86/kernel/setup.c
@@ -833,7 +833,46 @@ static void __init trim_low_memory_range(void)
{
memblock_reserve(0, ALIGN(reserve_low, PAGE_SIZE));
}
-
+
+#ifdef CONFIG_ACPI_INITRD_TABLE_OVERRIDE
+/**
+ * x86_acpi_initrd_override - Find all acpi override tables in initrd, and copy
+ * them to acpi_tables_addr.
+ *
+ * On 32bit platform, this function is call in head_32.S, before paging is
+ * enabled. So we have to use physical address.
+ *
+ * On 64bit platform, this function is call in head_64.c, after paging is
+ * enabled but before direct mapping page tables are set up. Since we have an
+ * early page fault handler on 64bit, so it is OK to use virtual address.
+ */
+void __init x86_acpi_initrd_override(void)
+{
+ unsigned long ramdisk_image, ramdisk_size;
+ void *p = NULL;
+
+#ifdef CONFIG_X86_32
+ struct boot_params *boot_params_p;
+
+ boot_params_p = (struct boot_params *)__pa(&boot_params);
+ ramdisk_image = get_ramdisk_image(boot_params_p);
+ ramdisk_size = get_ramdisk_size(boot_params_p);
+ p = (void *)ramdisk_image;
+
+ early_alloc_acpi_override_tables_buf(true);
+ acpi_initrd_override(p, ramdisk_size, true);
+#else
+ ramdisk_image = get_ramdisk_image(&boot_params);
+ ramdisk_size = get_ramdisk_size(&boot_params);
+ if (ramdisk_image)
+ p = (void *)__va(ramdisk_image);
+
+ early_alloc_acpi_override_tables_buf(false);
+ acpi_initrd_override(p, ramdisk_size, false);
+#endif /* CONFIG_X86_32 */
+}
+#endif /* CONFIG_ACPI_INITRD_TABLE_OVERRIDE */
+
/*
* Determine if we were loaded by an EFI loader. If so, then we have also been
* passed the efi memmap, systab, etc., so we should use these data structures
@@ -1069,11 +1108,6 @@ void __init setup_arch(char **cmdline_p)

early_alloc_pgt_buf();

-#if defined(CONFIG_ACPI) && defined(CONFIG_BLK_DEV_INITRD)
- /* Allocate buffer to store acpi override tables in brk. */
- early_alloc_acpi_override_tables_buf(false);
-#endif
-
/*
* Need to conclude brk, before memblock_x86_fill()
* it could use memblock_find_in_range, could overlap with
@@ -1132,11 +1166,6 @@ void __init setup_arch(char **cmdline_p)

reserve_initrd();

-#if defined(CONFIG_ACPI) && defined(CONFIG_BLK_DEV_INITRD)
- acpi_initrd_override((void *)initrd_start, initrd_end - initrd_start,
- false);
-#endif
-
reserve_crashkernel();

vsmp_init();
--
1.7.1
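
The buffer that x86_acpi_initrd_override() fills is carved out of BRK by
early_alloc_acpi_override_tables_buf(), which in turn calls extend_brk().
For readers unfamiliar with that allocator, here is a minimal user-space
sketch of the same bump-allocation pattern. It is only an illustration:
the demo_* names and the 4096-byte area are assumptions made for this
example and are not kernel symbols, and it returns NULL on exhaustion
where the kernel's extend_brk() uses BUG_ON().

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for the reserved BRK area; the size is arbitrary. */
#define DEMO_BRK_SIZE 4096
static _Alignas(64) unsigned char demo_brk[DEMO_BRK_SIZE];
static size_t demo_brk_end;	/* offset of the first free byte */

/*
 * Bump-allocate 'size' bytes aligned to 'align' (a power of two, honoured
 * up to 64 bytes in this model). Returns NULL when the area is exhausted.
 */
static void *demo_extend_brk(size_t size, size_t align)
{
	size_t mask = align - 1;
	size_t start;

	assert((align & mask) == 0);		/* power of two only */
	start = (demo_brk_end + mask) & ~mask;	/* round the end up */
	if (start > DEMO_BRK_SIZE || size > DEMO_BRK_SIZE - start)
		return NULL;

	demo_brk_end = start + size;		/* nothing is ever freed */
	memset(demo_brk + start, 0, size);
	return demo_brk + start;
}
```

With an empty area, demo_extend_brk(10, 8) hands out offset 0, and a
following demo_extend_brk(100, 64) rounds the end up from 10 to 64. There
is no free operation, which is why the space has to be reserved up front
(RESERVE_BRK in the kernel).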

2013-08-21 10:47:03

by Tang Chen

Subject: Re: [PATCH 0/8] x86, acpi: Move acpi_initrd_override() earlier.

Hi all,

This patch-set has not been fully tested. I am sending it out first for
review. Please comment on whether we can agree on this solution.

Thanks.:)

On 08/21/2013 06:15 PM, Tang Chen wrote:
> This patch-set aims to move acpi_initrd_override() earlier on x86.
> Some of the patches are from Yinghai's patch-set:
> https://lkml.org/lkml/2013/6/14/561
>
> The differences between this patch-set and Yinghai's original patch-set are:
> 1. This patch-set doesn't split acpi_initrd_override(), but calls it as a
> whole operation at an early time.
> 2. Allocate memory from BRK to store override tables.
> (This idea is also from Yinghai.)
>
>
> [Current state]
>
> The current Linux kernel will initialize acpi tables like the following:
>
> 1. Find all acpi override tables provided by users in initrd.
> (Linux allows users to override acpi tables in firmware, by specifying
> their own tables in initrd.)
>
> 2. Use acpica code to initialize the acpi global root table list and install
> all tables into it. If an override table exists, use it to override the one
> provided by firmware.
>
> Then others can parse these tables and get useful info.
>
> Both steps happen after the direct mapping page tables are set up.
>
> [Issues]
>
> In the current Linux kernel, the initialization of acpi tables is too late for
> new functionalities.
>
> We have some issues about this:
>
> * For memory hotplug, we need ACPI SRAT at early time to be aware of which memory
> ranges are hotpluggable, and prevent bootmem allocator from allocating memory
> for the kernel. (Kernel pages cannot be hotplugged because )
>
> * As suggested by Yinghai Lu<[email protected]>, we should allocate page tables
> in local node. This also needs SRAT before direct mapping page tables are setup.
>
> * As mentioned by Toshi Kani<[email protected]>, ACPI SPCR/DBGP/DBG2 tables
> allow the OS to initialize serial console/debug ports at early boot time. The
> earlier they can be initialized, the more useful this feature is. These tables
> are not currently used by Linux due to a licensing issue, but that could be
> addressed some time soon.
>
>
> [What are we doing]
>
> We are trying to initialize acpi tables as early as possible. But the Linux
> kernel allows users to override acpi tables by specifying their own tables in
> initrd. So acpi_initrd_override() has to be moved earlier first.
>
>
> [About this patch-set]
>
> This patch-set aims to move acpi_initrd_override() as early as possible on x86.
> As suggested by Yinghai, we are trying to do it like this:
>
> On 32bit: do it in head_32.S, before paging is enabled. In this case, we can
> access initrd with physical address without page tables.
>
> On 64bit: do it in head64.c, after paging is enabled but before direct mapping
> is set up.
>
> Also, acpi_initrd_override() needs to allocate memory for override tables.
> But at such an early time, no memory allocator works. So the basic idea
> from Yinghai is to use BRK. We will extend BRK by 256KB in this patch-set.
>
>
> Tang Chen (6):
> x86, acpi: Move table_sigs[] to stack.
> x86, acpi, brk: Extend BRK 256KB to store acpi override tables.
> x86, brk: Make extend_brk() available with va/pa.
> x86, acpi: Make acpi_initrd_override() available with va or pa.
> x86, acpi, brk: Make early_alloc_acpi_override_tables_buf() available
> with va/pa.
> x86, acpi: Do acpi_initrd_override() earlier in head_32.S/head64.c.
>
> Yinghai Lu (2):
> x86: Make get_ramdisk_{image|size}() global.
> x86, microcode: Use get_ramdisk_{image|size}() in microcode handling.
>
> arch/x86/include/asm/dmi.h | 2 +-
> arch/x86/include/asm/setup.h | 11 +++-
> arch/x86/kernel/head64.c | 4 +
> arch/x86/kernel/head_32.S | 4 +
> arch/x86/kernel/microcode_intel_early.c | 8 +-
> arch/x86/kernel/setup.c | 93 ++++++++++++++++------
> arch/x86/mm/init.c | 2 +-
> arch/x86/xen/enlighten.c | 2 +-
> arch/x86/xen/mmu.c | 6 +-
> arch/x86/xen/p2m.c | 27 ++++---
> drivers/acpi/osl.c | 130 ++++++++++++++++++++-----------
> include/linux/acpi.h | 5 +-
> 12 files changed, 196 insertions(+), 98 deletions(-)
>

2013-08-21 10:48:08

by Tang Chen

Subject: [PATCH 6/8] x86, acpi: Make acpi_initrd_override() available with va or pa.

We are using the same trick as in the previous patch.

Introduce a "bool is_phys" parameter to acpi_initrd_override(). When it
is true, convert the va of all global variables to pa, so that we can
access them on 32bit before paging is enabled.

NOTE: Do not call printk() on 32bit before paging is enabled,
because it uses global variables.
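
To make the is_phys trick concrete, here is a small user-space model of
it. Everything in it is an assumption made for illustration: "physical
memory" is a 64-byte array, link-time ("virtual") addresses sit
DEMO_PAGE_OFFSET above it, and demo_pa() stands in for what
__pa_nodebug() does for such addresses. None of the demo_* names exist
in the kernel.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define DEMO_MEM_SIZE    64u		/* toy "physical" memory */
#define DEMO_PAGE_OFFSET 0xC0000000u	/* va = pa + offset in this model */

static unsigned char demo_phys_mem[DEMO_MEM_SIZE];
static bool demo_paging_on;

/* Link-time address of a global counter, playing &all_tables_size. */
static const uint32_t demo_counter_va = DEMO_PAGE_OFFSET + 16u;

/* Model of the va -> pa conversion: subtract the fixed offset. */
static uint32_t demo_pa(uint32_t va)
{
	return va - DEMO_PAGE_OFFSET;
}

/* Add 'delta' to the int stored at 'addr'; false models a fault. */
static bool demo_add(uint32_t addr, int delta)
{
	int val;

	if (demo_paging_on) {
		if (addr < DEMO_PAGE_OFFSET)
			return false;	/* not mapped in this model */
		addr -= DEMO_PAGE_OFFSET;
	}
	if (addr > DEMO_MEM_SIZE - sizeof(int))
		return false;		/* outside physical memory */

	memcpy(&val, &demo_phys_mem[addr], sizeof(val));
	val += delta;
	memcpy(&demo_phys_mem[addr], &val, sizeof(val));
	return true;
}

/* Caller-side pattern: pick the form of the address once, up front. */
static bool demo_account_table(int length, bool is_phys)
{
	uint32_t counter = is_phys ? demo_pa(demo_counter_va)
				   : demo_counter_va;

	return demo_add(counter, length);
}
```

With demo_paging_on false, only the is_phys == true call reaches the
counter; once it is set to true, only the is_phys == false call does.
That is the property the patch relies on when head_32.S passes true and
every later caller passes false.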

Originally-From: Yinghai Lu <[email protected]>
Signed-off-by: Tang Chen <[email protected]>
---
arch/x86/kernel/setup.c | 3 +-
drivers/acpi/osl.c | 68 ++++++++++++++++++++++++++++++++++------------
include/linux/acpi.h | 4 +-
3 files changed, 54 insertions(+), 21 deletions(-)

diff --git a/arch/x86/kernel/setup.c b/arch/x86/kernel/setup.c
index a189909..1290ea7 100644
--- a/arch/x86/kernel/setup.c
+++ b/arch/x86/kernel/setup.c
@@ -1133,7 +1133,8 @@ void __init setup_arch(char **cmdline_p)
reserve_initrd();

#if defined(CONFIG_ACPI) && defined(CONFIG_BLK_DEV_INITRD)
- acpi_initrd_override((void *)initrd_start, initrd_end - initrd_start);
+ acpi_initrd_override((void *)initrd_start, initrd_end - initrd_start,
+ false);
#endif

reserve_crashkernel();
diff --git a/drivers/acpi/osl.c b/drivers/acpi/osl.c
index dff7fcc..ccdb5a6 100644
--- a/drivers/acpi/osl.c
+++ b/drivers/acpi/osl.c
@@ -566,7 +566,19 @@ void __init early_alloc_acpi_override_tables_buf(void)
PAGE_SIZE, false));
}

-void __init acpi_initrd_override(void *data, size_t size)
+/**
+ * acpi_initrd_override - Initialize acpi_tables_addr with acpi override tables
+ * @data: cpio file address (va or pa)
+ * @size: size of cpio file
+ * @is_phys: true if @data is pa, false otherwise
+ *
+ * This function will find all acpi override tables provided by initrd, and
+ * store the addresses in acpi_tables_addr.
+ *
+ * This function could be called before paging is enabled. Before paging is
+ * enabled, caller should use physical address, and set @is_phys as true.
+ */
+void __init acpi_initrd_override(void *data, size_t size, bool is_phys)
{
int sig, no, table_nr = 0, total_offset = 0;
long offset = 0;
@@ -586,10 +598,17 @@ void __init acpi_initrd_override(void *data, size_t size)
ACPI_SIG_UEFI, ACPI_SIG_WAET, ACPI_SIG_WDAT, ACPI_SIG_WDDT,
ACPI_SIG_WDRT, ACPI_SIG_DSDT, ACPI_SIG_FADT, ACPI_SIG_PSDT,
ACPI_SIG_RSDT, ACPI_SIG_XSDT, ACPI_SIG_SSDT, NULL };
+ u64 *acpi_tables_addr_p = &acpi_tables_addr;
+ int *all_tables_size_p = &all_tables_size;

if (data == NULL || size == 0)
return;

+ if (is_phys) {
+ acpi_tables_addr_p = (u64 *)__pa_nodebug(&acpi_tables_addr);
+ all_tables_size_p = (int *)__pa_nodebug(&all_tables_size);
+ }
+
for (no = 0; no < ACPI_OVERRIDE_TABLES; no++) {
file = find_cpio_data(cpio_path, data, size, &offset);
if (!file.data)
@@ -599,8 +618,9 @@ void __init acpi_initrd_override(void *data, size_t size)
size -= offset;

if (file.size < sizeof(struct acpi_table_header)) {
- pr_err("ACPI OVERRIDE: Table smaller than ACPI header [%s%s]\n",
- cpio_path, file.name);
+ if (!is_phys)
+ pr_err("ACPI OVERRIDE: Table smaller than ACPI header [%s%s]\n",
+ cpio_path, file.name);
continue;
}

@@ -611,36 +631,48 @@ void __init acpi_initrd_override(void *data, size_t size)
break;

if (!table_sigs[sig]) {
- pr_err("ACPI OVERRIDE: Unknown signature [%s%s]\n",
- cpio_path, file.name);
+ if (!is_phys)
+ pr_err("ACPI OVERRIDE: Unknown signature [%s%s]\n",
+ cpio_path, file.name);
continue;
}
if (file.size != table->length) {
- pr_err("ACPI OVERRIDE: File length does not match table length [%s%s]\n",
- cpio_path, file.name);
+ if (!is_phys)
+ pr_err("ACPI OVERRIDE: File length does not match table length [%s%s]\n",
+ cpio_path, file.name);
continue;
}
if (acpi_table_checksum(file.data, table->length)) {
- pr_err("ACPI OVERRIDE: Bad table checksum [%s%s]\n",
- cpio_path, file.name);
+ if (!is_phys)
+ pr_err("ACPI OVERRIDE: Bad table checksum [%s%s]\n",
+ cpio_path, file.name);
continue;
}

- pr_info("%4.4s ACPI table found in initrd [%s%s][0x%x]\n",
- table->signature, cpio_path, file.name, table->length);
+ if (!is_phys)
+ pr_info("%4.4s ACPI table found in initrd [%s%s][0x%x]\n",
+ table->signature, cpio_path,
+ file.name, table->length);

/*
* If the override tables in cpio file exceeds the BRK buffer,
* ignore the current table and go for the next one.
*/
- all_tables_size += table->length;
- if (all_tables_size > ACPI_OVERRIDE_TABLES_SIZE) {
- pr_warning("ACPI OVERRIDE: ACPI override tables exceeds buffer size."
- " Ignoring table %4.4s\n", table->signature);
- all_tables_size -= table->length;
+ *all_tables_size_p += table->length;
+ if (*all_tables_size_p > ACPI_OVERRIDE_TABLES_SIZE) {
+ if (!is_phys)
+ pr_warning("ACPI OVERRIDE: ACPI override tables exceeds buffer size."
+ " Ignoring table %4.4s\n",
+ table->signature);
+ *all_tables_size_p -= table->length;
continue;
}

+ /*
+ * file.data is the offset of the table in initrd. If @data is
+ * pa, then we find pa. If @data is va, then we find va. No need
+ * to convert.
+ */
early_initrd_files[table_nr].data = file.data;
early_initrd_files[table_nr].size = file.size;
table_nr++;
@@ -648,8 +680,8 @@ void __init acpi_initrd_override(void *data, size_t size)
if (table_nr == 0)
return;

- p = __va(acpi_tables_addr);
-
+ p = is_phys ? (char *)(*acpi_tables_addr_p) :
+ (char *)__va(*acpi_tables_addr_p);
for (no = 0; no < table_nr; no++) {
memcpy(p + total_offset, early_initrd_files[no].data,
early_initrd_files[no].size);
diff --git a/include/linux/acpi.h b/include/linux/acpi.h
index 381579e..af4da51 100644
--- a/include/linux/acpi.h
+++ b/include/linux/acpi.h
@@ -80,10 +80,10 @@ typedef int (*acpi_tbl_entry_handler)(struct acpi_subtable_header *header,
const unsigned long end);

#ifdef CONFIG_ACPI_INITRD_TABLE_OVERRIDE
-void acpi_initrd_override(void *data, size_t size);
+void acpi_initrd_override(void *data, size_t size, bool is_phys);
void early_alloc_acpi_override_tables_buf(void);
#else
-static inline void acpi_initrd_override(void *data, size_t size)
+static inline void acpi_initrd_override(void *data, size_t size, bool is_phys)
{
}
#endif
--
1.7.1
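
The loop touched by this patch rejects a candidate file before copying it
if it is smaller than an ACPI header, if its size differs from the
header's length field, or if its checksum is bad. A minimal user-space
sketch of those three checks follows, assuming the standard 36-byte ACPI
table header layout (4-byte signature, little-endian 32-bit length at
offset 4, checksum byte at offset 9). The demo_* names are hypothetical,
and the signature lookup against table_sigs[] is left out.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define DEMO_HDR_SIZE 36u	/* size of the standard ACPI table header */

enum demo_status {
	DEMO_OK,
	DEMO_TOO_SMALL,
	DEMO_BAD_LENGTH,
	DEMO_BAD_CHECKSUM
};

/* Sum of all bytes modulo 256; a valid ACPI table sums to zero. */
static uint8_t demo_checksum(const uint8_t *buf, size_t len)
{
	uint8_t sum = 0;

	while (len--)
		sum += *buf++;
	return sum;
}

/* Read the little-endian 32-bit length field at byte offset 4. */
static uint32_t demo_table_length(const uint8_t *buf)
{
	return (uint32_t)buf[4] | (uint32_t)buf[5] << 8 |
	       (uint32_t)buf[6] << 16 | (uint32_t)buf[7] << 24;
}

/* Apply the checks in the same order as the loop in the patch. */
static enum demo_status demo_validate(const uint8_t *file, size_t file_size)
{
	if (file_size < DEMO_HDR_SIZE)
		return DEMO_TOO_SMALL;
	if (demo_table_length(file) != file_size)
		return DEMO_BAD_LENGTH;
	if (demo_checksum(file, file_size) != 0)
		return DEMO_BAD_CHECKSUM;
	return DEMO_OK;
}
```

Returning a status instead of printing fits the constraint described in
the commit message: in the is_phys case the caller cannot use printk(),
so any reporting has to wait until paging is enabled.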

2013-08-21 12:27:38

by Konrad Rzeszutek Wilk

Subject: Re: [PATCH 5/8] x86, brk: Make extend_brk() available with va/pa.

Tang Chen <[email protected]> wrote:
>We are going to do acpi_initrd_override() at very early time:
>
>On 32bit: do it in head_32.S, before paging is enabled. In this case, we can
> access initrd with physical address without page tables.
>
>On 64bit: do it in head64.c, after paging is enabled but before direct mapping
> is setup.
>
> On 64bit, we have an early page fault handler to help to access data
> with direct mapping page tables. So it is easy to do in head64.c.
>
>And we need to allocate memory to store override tables. At such an early time,
>no memory allocator works. So we can only use BRK.
>
>As mentioned above, on 32bit before paging is enabled, we have to access variables
>with pa. So introduce a "bool is_phys" parameter to extend_brk(), and convert va
>to pa if it is true.

Could you do it differently? Meaning have a global symbol (paging_enabled) which will be used by most of the functions you changed in this patch and the next ones? It would naturally be enabled when paging is on and __va addresses can be used.

That could also be used in the printk case to do a BUG_ON before paging is enabled on 32bit. Or perhaps use a different code path to deal with using __pa address.

?
>
>Signed-off-by: Tang Chen <[email protected]>
>---
> arch/x86/include/asm/dmi.h | 2 +-
> arch/x86/include/asm/setup.h | 2 +-
> arch/x86/kernel/setup.c | 20 ++++++++++++++------
> arch/x86/mm/init.c | 2 +-
> arch/x86/xen/enlighten.c | 2 +-
> arch/x86/xen/mmu.c | 6 +++---
> arch/x86/xen/p2m.c | 27 ++++++++++++++-------------
> drivers/acpi/osl.c | 2 +-
> 8 files changed, 36 insertions(+), 27 deletions(-)
>
>diff --git a/arch/x86/include/asm/dmi.h b/arch/x86/include/asm/dmi.h
>index fd8f9e2..3b51d81 100644
>--- a/arch/x86/include/asm/dmi.h
>+++ b/arch/x86/include/asm/dmi.h
>@@ -9,7 +9,7 @@
>
> static __always_inline __init void *dmi_alloc(unsigned len)
> {
>- return extend_brk(len, sizeof(int));
>+ return extend_brk(len, sizeof(int), false);
> }
>
> /* Use early IO mappings for DMI because it's initialized early */
>diff --git a/arch/x86/include/asm/setup.h b/arch/x86/include/asm/setup.h
>index 4f71d48..96d00da 100644
>--- a/arch/x86/include/asm/setup.h
>+++ b/arch/x86/include/asm/setup.h
>@@ -75,7 +75,7 @@ extern struct boot_params boot_params;
>
> /* exceedingly early brk-like allocator */
> extern unsigned long _brk_end;
>-void *extend_brk(size_t size, size_t align);
>+void *extend_brk(size_t size, size_t align, bool is_phys);
>
> /*
> * Reserve space in the brk section. The name must be unique within
>diff --git a/arch/x86/kernel/setup.c b/arch/x86/kernel/setup.c
>index 51fcd5d..a189909 100644
>--- a/arch/x86/kernel/setup.c
>+++ b/arch/x86/kernel/setup.c
>@@ -259,19 +259,27 @@ static inline void __init copy_edd(void)
> }
> #endif
>
>-void * __init extend_brk(size_t size, size_t align)
>+void * __init extend_brk(size_t size, size_t align, bool is_phys)
> {
> size_t mask = align - 1;
> void *ret;
>+ unsigned long *brk_start, *brk_end, *brk_limit;
>
>- BUG_ON(_brk_start == 0);
>+ brk_start = is_phys ? (unsigned long *)__pa_nodebug(&_brk_start) :
>+ (unsigned long *)&_brk_start;
>+ brk_end = is_phys ? (unsigned long *)__pa_nodebug(&_brk_end) :
>+ (unsigned long *)&_brk_end;
>+ brk_limit = is_phys ? (unsigned long *)__pa_nodebug(__brk_limit) :
>+ (unsigned long *)__brk_limit;
>+
>+ BUG_ON(*brk_start == 0);
> BUG_ON(align & mask);
>
>- _brk_end = (_brk_end + mask) & ~mask;
>- BUG_ON((char *)(_brk_end + size) > __brk_limit);
>+ *brk_end = (*brk_end + mask) & ~mask;
>+ BUG_ON((char *)(*brk_end + size) > brk_limit);
>
>- ret = (void *)_brk_end;
>- _brk_end += size;
>+ ret = (void *)(*brk_end);
>+ *brk_end += size;
>
> memset(ret, 0, size);
>
>diff --git a/arch/x86/mm/init.c b/arch/x86/mm/init.c
>index 2ec29ac..189a9e2 100644
>--- a/arch/x86/mm/init.c
>+++ b/arch/x86/mm/init.c
>@@ -86,7 +86,7 @@ void __init early_alloc_pgt_buf(void)
> unsigned long tables = INIT_PGT_BUF_SIZE;
> phys_addr_t base;
>
>- base = __pa(extend_brk(tables, PAGE_SIZE));
>+ base = __pa(extend_brk(tables, PAGE_SIZE, false));
>
> pgt_buf_start = base >> PAGE_SHIFT;
> pgt_buf_end = pgt_buf_start;
>diff --git a/arch/x86/xen/enlighten.c b/arch/x86/xen/enlighten.c
>index 193097e..2d5a34f 100644
>--- a/arch/x86/xen/enlighten.c
>+++ b/arch/x86/xen/enlighten.c
>@@ -1629,7 +1629,7 @@ void __ref xen_hvm_init_shared_info(void)
>
> if (!shared_info_page)
> shared_info_page = (struct shared_info *)
>- extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> xatp.domid = DOMID_SELF;
> xatp.idx = 0;
> xatp.space = XENMAPSPACE_shared_info;
>diff --git a/arch/x86/xen/mmu.c b/arch/x86/xen/mmu.c
>index fdc3ba2..573bc50 100644
>--- a/arch/x86/xen/mmu.c
>+++ b/arch/x86/xen/mmu.c
>@@ -1768,7 +1768,7 @@ static void __init xen_map_identity_early(pmd_t *pmd, unsigned long max_pfn)
> unsigned long pfn;
>
> level1_ident_pgt = extend_brk(sizeof(pte_t) * LEVEL1_IDENT_ENTRIES,
>- PAGE_SIZE);
>+ PAGE_SIZE, false);
>
> ident_pte = 0;
> pfn = 0;
>@@ -1980,7 +1980,7 @@ static void __init xen_write_cr3_init(unsigned long cr3)
> * swapper_pg_dir.
> */
> swapper_kernel_pmd =
>- extend_brk(sizeof(pmd_t) * PTRS_PER_PMD, PAGE_SIZE);
>+ extend_brk(sizeof(pmd_t) * PTRS_PER_PMD, PAGE_SIZE, false);
> copy_page(swapper_kernel_pmd, initial_kernel_pmd);
> swapper_pg_dir[KERNEL_PGD_BOUNDARY] =
> __pgd(__pa(swapper_kernel_pmd) | _PAGE_PRESENT);
>@@ -2003,7 +2003,7 @@ void __init xen_setup_kernel_pagetable(pgd_t *pgd, unsigned long max_pfn)
> pmd_t *kernel_pmd;
>
> initial_kernel_pmd =
>- extend_brk(sizeof(pmd_t) * PTRS_PER_PMD, PAGE_SIZE);
>+ extend_brk(sizeof(pmd_t) * PTRS_PER_PMD, PAGE_SIZE, false);
>
> max_pfn_mapped = PFN_DOWN(__pa(xen_start_info->pt_base) +
> xen_start_info->nr_pt_frames * PAGE_SIZE +
>diff --git a/arch/x86/xen/p2m.c b/arch/x86/xen/p2m.c
>index 95fb2aa..bbdcf20 100644
>--- a/arch/x86/xen/p2m.c
>+++ b/arch/x86/xen/p2m.c
>@@ -281,13 +281,13 @@ void __ref xen_build_mfn_list_list(void)
>
> /* Pre-initialize p2m_top_mfn to be completely missing */
> if (p2m_top_mfn == NULL) {
>- p2m_mid_missing_mfn = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ p2m_mid_missing_mfn = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> p2m_mid_mfn_init(p2m_mid_missing_mfn);
>
>- p2m_top_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ p2m_top_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> p2m_top_mfn_p_init(p2m_top_mfn_p);
>
>- p2m_top_mfn = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ p2m_top_mfn = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> p2m_top_mfn_init(p2m_top_mfn);
> } else {
> /* Reinitialise, mfn's all change after migration */
>@@ -322,7 +322,7 @@ void __ref xen_build_mfn_list_list(void)
> * runtime. extend_brk() will BUG if we call
> * it too late.
> */
>- mid_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ mid_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> p2m_mid_mfn_init(mid_mfn_p);
>
> p2m_top_mfn_p[topidx] = mid_mfn_p;
>@@ -351,16 +351,16 @@ void __init xen_build_dynamic_phys_to_machine(void)
>
> xen_max_p2m_pfn = max_pfn;
>
>- p2m_missing = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ p2m_missing = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> p2m_init(p2m_missing);
>
>- p2m_mid_missing = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ p2m_mid_missing = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> p2m_mid_init(p2m_mid_missing);
>
>- p2m_top = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ p2m_top = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> p2m_top_init(p2m_top);
>
>- p2m_identity = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ p2m_identity = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> p2m_init(p2m_identity);
>
> /*
>@@ -373,7 +373,8 @@ void __init xen_build_dynamic_phys_to_machine(void)
> unsigned mididx = p2m_mid_index(pfn);
>
> if (p2m_top[topidx] == p2m_mid_missing) {
>- unsigned long **mid = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ unsigned long **mid = extend_brk(PAGE_SIZE, PAGE_SIZE,
>+ false);
> p2m_mid_init(mid);
>
> p2m_top[topidx] = mid;
>@@ -609,7 +610,7 @@ static bool __init early_alloc_p2m_middle(unsigned long pfn, bool check_boundary
> return false;
>
> /* Boundary cross-over for the edges: */
>- p2m = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ p2m = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
>
> p2m_init(p2m);
>
>@@ -635,7 +636,7 @@ static bool __init early_alloc_p2m(unsigned long pfn)
> mid = p2m_top[topidx];
> mid_mfn_p = p2m_top_mfn_p[topidx];
> if (mid == p2m_mid_missing) {
>- mid = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ mid = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
>
> p2m_mid_init(mid);
>
>@@ -645,7 +646,7 @@ static bool __init early_alloc_p2m(unsigned long pfn)
> }
> /* And the save/restore P2M tables.. */
> if (mid_mfn_p == p2m_mid_missing_mfn) {
>- mid_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE);
>+ mid_mfn_p = extend_brk(PAGE_SIZE, PAGE_SIZE, false);
> p2m_mid_mfn_init(mid_mfn_p);
>
> p2m_top_mfn_p[topidx] = mid_mfn_p;
>@@ -858,7 +859,7 @@ static void __init m2p_override_init(void)
> unsigned i;
>
> m2p_overrides = extend_brk(sizeof(*m2p_overrides) * M2P_OVERRIDE_HASH,
>- sizeof(unsigned long));
>+ sizeof(unsigned long), false);
>
> for (i = 0; i < M2P_OVERRIDE_HASH; i++)
> INIT_LIST_HEAD(&m2p_overrides[i]);
>diff --git a/drivers/acpi/osl.c b/drivers/acpi/osl.c
>index 4c1baa7..dff7fcc 100644
>--- a/drivers/acpi/osl.c
>+++ b/drivers/acpi/osl.c
>@@ -563,7 +563,7 @@ RESERVE_BRK(acpi_override_tables_alloc, ACPI_OVERRIDE_TABLES_SIZE);
> void __init early_alloc_acpi_override_tables_buf(void)
> {
> acpi_tables_addr = __pa(extend_brk(ACPI_OVERRIDE_TABLES_SIZE,
>- PAGE_SIZE));
>+ PAGE_SIZE, false));
> }
>
> void __init acpi_initrd_override(void *data, size_t size)

2013-08-21 12:37:00

by H. Peter Anvin

Subject: Re: [PATCH 5/8] x86, brk: Make extend_brk() available with va/pa.

Global symbols are inaccessible in physical mode.

This is incidentally yet another example of "PV/weird platform violence", since in their absence it would be trivial to work around this by using segmentation.

Konrad Rzeszutek Wilk <[email protected]> wrote:
>Tang Chen <[email protected]> wrote:
>>We are going to do acpi_initrd_override() at very early time:
>>
>>On 32bit: do it in head_32.S, before paging is enabled. In this case, we can
>> access initrd with physical address without page tables.
>>
>>On 64bit: do it in head64.c, after paging is enabled but before direct mapping
>> is setup.
>>
>> On 64bit, we have an early page fault handler to help to access data
>> with direct mapping page tables. So it is easy to do in head64.c.
>>
>>And we need to allocate memory to store override tables. At such an early time,
>>no memory allocator works. So we can only use BRK.
>>
>>As mentioned above, on 32bit before paging is enabled, we have to access variables
>>with pa. So introduce a "bool is_phys" parameter to extend_brk(), and convert va
>>to pa if it is true.
>
>Could you do it differently? Meaning have a global symbol
>(paging_enabled) which will be used by most of the functions you
>changed in this patch and the next ones? It would naturally be enabled
>when paging is on and __va addresses can be used.
>
>That could also be used in the printk case to do a BUG_ON before paging
>is enabled on 32bit. Or perhaps use a different code path to deal with
>using __pa address.
>
>?

--
Sent from my mobile phone. Please excuse brevity and lack of formatting.

2013-08-21 13:06:54

Subject: [PATCH 0/8] x86, acpi: Move acpi_initrd_override() earlier.

Subject: [PATCH 3/8] x86, acpi: Move table_sigs[] to stack.

Subject: [PATCH 2/8] x86, microcode: Use get_ramdisk_{image|size}() in microcode handling.

Subject: [PATCH 1/8] x86: Make get_ramdisk_{image|size}() global.

Subject: [PATCH 7/8] x86, acpi, brk: Make early_alloc_acpi_override_tables_buf() available with va/pa.

Subject: [PATCH 5/8] x86, brk: Make extend_brk() available with va/pa.

Subject: [PATCH 4/8] x86, acpi, brk: Extend BRK 256KB to store acpi override tables.

Subject: [PATCH 8/8] x86, acpi: Do acpi_initrd_override() earlier in head_32.S/head64.c.

Subject: Re: [PATCH 0/8] x86, acpi: Move acpi_initrd_override() earlier.

Subject: [PATCH 6/8] x86, acpi: Make acpi_initrd_override() available with va or pa.

Subject: Re: [PATCH 5/8] x86, brk: Make extend_brk() available with va/pa.
