2018-08-26 09:05:23

by Jia He

[permalink] [raw]
Subject: [PATCH] irqchip/gic-v3-its: add allocation max order limitation for lpi_id_bits

There is a WARN_ON when my QDF2400 server boots up (pagesize is 4k)
============begin===============
[ 0.000000] GICv3: GIC: Using split EOI/Deactivate mode
[ 0.000000] GICv3: Distributor has no Range Selector support
[ 0.000000] GICv3: VLPI support, no direct LPI support
[ 0.000000] ACPI: SRAT not present
[ 0.000000] ITS [mem 0xff7efe0000-0xff7effffff]
[ 0.000000] ITS@0x000000ff7efe0000: Using ITS number 0
[ 0.000000] GIC: enabling workaround for ITS: QDF2400 erratum 0065
[ 0.000000] ITS@0x000000ff7efe0000: allocated 524288 Devices
@179f000000 (indirect, esz 8, psz 64K, shr 1)
[ 0.000000] ITS@0x000000ff7efe0000: allocated 8192 Interrupt
Collections @179f930000 (flat, esz 8, psz 64K, shr 1)
[ 0.000000] ITS@0x000000ff7efe0000: allocated 65536 Virtual CPUs
@179f980000 (flat, esz 8, psz 64K, shr 1)
[ 0.000000] WARNING: CPU: 0 PID: 0 at mm/page_alloc.c:4066
__alloc_pages_nodemask+0x2d8/0x1188
[ 0.000000] Modules linked in:
[ 0.000000] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.18.0+ #66
[ 0.000000] pstate: 20000085 (nzCv daIf -PAN -UAO)
[ 0.000000] pc : __alloc_pages_nodemask+0x2d8/0x1188
[ 0.000000] lr : __alloc_pages_nodemask+0x134/0x1188
[ 0.000000] sp : ffff0000091b3a30
[ 0.000000] x29: ffff0000091b3a30 x28: 0000000000000000
[ 0.000000] x27: ffff00000a045000 x26: 0000000000000000
[ 0.000000] x25: 0000000000000000 x24: ffff0000091c1000
[ 0.000000] x23: ffff0000091b3b98 x22: 000000000000000c
[ 0.000000] x21: ffff0000082dc130 x20: 0000000000000001
[ 0.000000] x19: 0000000000000000 x18: 000000000000003f
[ 0.000000] x17: 0000000000000000 x16: 0000000000000000
[ 0.000000] x15: ffffffffffffffff x14: 202c74616c662820
[ 0.000000] x13: ffff000009c5f9e0 x12: 0000000000000077
[ 0.000000] x11: 0000000000000078 x10: 0000000097110f47
[ 0.000000] x9 : 0000000000000000 x8 : 00000000009fff3f
[ 0.000000] x7 : 000000000000002b x6 : 000000000000000c
[ 0.000000] x5 : 0000000000000001 x4 : 0000000000000000
[ 0.000000] x3 : ffff000008fd1000 x2 : ffff000008fd1000
[ 0.000000] x1 : ffff0000091c1000 x0 : 0000000000000000
[ 0.000000] Call trace:
[ 0.000000] __alloc_pages_nodemask+0x2d8/0x1188
[ 0.000000] alloc_pages_current+0x8c/0xd8
[ 0.000000] its_allocate_prop_table+0x5c/0xb8
[ 0.000000] its_init+0x220/0x3c0
[ 0.000000] gic_init_bases+0x250/0x380
[ 0.000000] gic_acpi_init+0x16c/0x2a4
[ 0.000000] acpi_match_madt+0x50/0x88
[ 0.000000] acpi_table_parse_entries_array+0x180/0x204
[ 0.000000] acpi_table_parse_entries+0x60/0x84
[ 0.000000] acpi_table_parse_madt+0x40/0x4c
[ 0.000000] __acpi_probe_device_table+0x94/0xe8
[ 0.000000] irqchip_init+0x38/0x40
[ 0.000000] init_IRQ+0x70/0x9c
[ 0.000000] start_kernel+0x310/0x4c0
[ 0.000000] irq event stamp: 0
[ 0.000000] hardirqs last enabled at (0): [<0000000000000000>]
(null)
[ 0.000000] hardirqs last disabled at (0): [<0000000000000000>]
(null)
[ 0.000000] softirqs last enabled at (0): [<0000000000000000>]
(null)
[ 0.000000] softirqs last disabled at (0): [<0000000000000000>]
(null)
[ 0.000000] ---[ end trace 943781056d97862b ]---
============end============

In its_alloc_lpi_tables, lpi_id_bits is 24, without this patch,
its_allocate_prop_table will try to allocate 16M(order 12 if
pagesize=4k). Thus it causes the WARN_ON.

This patch fixes it by limiting the lpi_id_bits.

Signed-off-by: Jia He <[email protected]>
---
drivers/irqchip/irq-gic-v3-its.c | 5 ++++-
1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/drivers/irqchip/irq-gic-v3-its.c b/drivers/irqchip/irq-gic-v3-its.c
index 316a575..79e6993 100644
--- a/drivers/irqchip/irq-gic-v3-its.c
+++ b/drivers/irqchip/irq-gic-v3-its.c
@@ -1624,8 +1624,11 @@ static void its_free_prop_table(struct page *prop_page)
static int __init its_alloc_lpi_tables(void)
{
phys_addr_t paddr;
+ u32 max_bits; /*max order limitation in alloc_page*/

- lpi_id_bits = GICD_TYPER_ID_BITS(gic_rdists->gicd_typer);
+ max_bits = PAGE_SHIFT + MAX_ORDER - 1;
+ lpi_id_bits = min_t(u32, max_bits,
+ GICD_TYPER_ID_BITS(gic_rdists->gicd_typer));
gic_rdists->prop_page = its_allocate_prop_table(GFP_NOWAIT);
if (!gic_rdists->prop_page) {
pr_err("Failed to allocate PROPBASE\n");
--
1.8.3.1



2018-08-26 19:02:47

by Marc Zyngier

[permalink] [raw]
Subject: Re: [PATCH] irqchip/gic-v3-its: add allocation max order limitation for lpi_id_bits

[I'm travelling, so expect some major delays in responding to email]

Hi Jia,

On Sun, 26 Aug 2018 10:00:51 +0100,
Jia He <[email protected]> wrote:
>
> There is a WARN_ON when my QDF2400 server boots up (pagesize is 4k)

[snip]

> In its_alloc_lpi_tables, lpi_id_bits is 24, without this patch,
> its_allocate_prop_table will try to allocate 16M(order 12 if
> pagesize=4k). Thus it causes the WARN_ON.

Gah! QDF and its 24bit INTIDs... Making life hell for everyone ;-)

Sorry for breaking it.

>
> This patch fixes it by limiting the lpi_id_bits.
>
> Signed-off-by: Jia He <[email protected]>
> ---
> drivers/irqchip/irq-gic-v3-its.c | 5 ++++-
> 1 file changed, 4 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/irqchip/irq-gic-v3-its.c b/drivers/irqchip/irq-gic-v3-its.c
> index 316a575..79e6993 100644
> --- a/drivers/irqchip/irq-gic-v3-its.c
> +++ b/drivers/irqchip/irq-gic-v3-its.c
> @@ -1624,8 +1624,11 @@ static void its_free_prop_table(struct page *prop_page)
> static int __init its_alloc_lpi_tables(void)
> {
> phys_addr_t paddr;
> + u32 max_bits; /*max order limitation in alloc_page*/
>
> - lpi_id_bits = GICD_TYPER_ID_BITS(gic_rdists->gicd_typer);
> + max_bits = PAGE_SHIFT + MAX_ORDER - 1;
> + lpi_id_bits = min_t(u32, max_bits,
> + GICD_TYPER_ID_BITS(gic_rdists->gicd_typer));
> gic_rdists->prop_page = its_allocate_prop_table(GFP_NOWAIT);
> if (!gic_rdists->prop_page) {
> pr_err("Failed to allocate PROPBASE\n");
> --
> 1.8.3.1
>

I find it rather odd that we end-up with different interrupt ranges
depending on the CPU page size. Also, allocating that much memory for
LPIs is rather pointless, as we actually have a pretty low limit of
interrupts the system can deal with (see IRQ_BITMAP_BITS, which is
slightly more than 8k). I've so far seen *one* request to push it up,
but I doubt that it is a real use case.

Capping lpi_id_bits at 16 (which is what we had before) is plenty,
will save a some memory, and gives some margin before we need to push
it up again.

Thanks,

M.

--
Jazz is not dead, it just smell funny.

2018-08-27 01:43:32

by Jia He

[permalink] [raw]
Subject: Re: [PATCH] irqchip/gic-v3-its: add allocation max order limitation for lpi_id_bits

Hi Marc
Thanks for the comments

On 8/27/2018 3:01 AM, Marc Zyngier Wrote:
> [I'm travelling, so expect some major delays in responding to email]
>
> Hi Jia,
>
> On Sun, 26 Aug 2018 10:00:51 +0100,
> Jia He <[email protected]> wrote:
>>
>> There is a WARN_ON when my QDF2400 server boots up (pagesize is 4k)
>
> [snip]
>
>> In its_alloc_lpi_tables, lpi_id_bits is 24, without this patch,
>> its_allocate_prop_table will try to allocate 16M(order 12 if
>> pagesize=4k). Thus it causes the WARN_ON.
>
> Gah! QDF and its 24bit INTIDs... Making life hell for everyone ;-)
>
> Sorry for breaking it.

np, maybe QDF2400 is a little bit special

>
>>
>> This patch fixes it by limiting the lpi_id_bits.
>>
>> Signed-off-by: Jia He <[email protected]>
>> ---
>> drivers/irqchip/irq-gic-v3-its.c | 5 ++++-
>> 1 file changed, 4 insertions(+), 1 deletion(-)
>>
>> diff --git a/drivers/irqchip/irq-gic-v3-its.c b/drivers/irqchip/irq-gic-v3-its.c
>> index 316a575..79e6993 100644
>> --- a/drivers/irqchip/irq-gic-v3-its.c
>> +++ b/drivers/irqchip/irq-gic-v3-its.c
>> @@ -1624,8 +1624,11 @@ static void its_free_prop_table(struct page *prop_page)
>> static int __init its_alloc_lpi_tables(void)
>> {
>> phys_addr_t paddr;
>> + u32 max_bits; /*max order limitation in alloc_page*/
>>
>> - lpi_id_bits = GICD_TYPER_ID_BITS(gic_rdists->gicd_typer);
>> + max_bits = PAGE_SHIFT + MAX_ORDER - 1;
>> + lpi_id_bits = min_t(u32, max_bits,
>> + GICD_TYPER_ID_BITS(gic_rdists->gicd_typer));
>> gic_rdists->prop_page = its_allocate_prop_table(GFP_NOWAIT);
>> if (!gic_rdists->prop_page) {
>> pr_err("Failed to allocate PROPBASE\n");
>> --
>> 1.8.3.1
>>
>
> I find it rather odd that we end-up with different interrupt ranges
> depending on the CPU page size. Also, allocating that much memory for
> LPIs is rather pointless, as we actually have a pretty low limit of
> interrupts the system can deal with (see IRQ_BITMAP_BITS, which is
> slightly more than 8k). I've so far seen *one* request to push it up,
> but I doubt that it is a real use case.
>
> Capping lpi_id_bits at 16 (which is what we had before) is plenty,
> will save a some memory, and gives some margin before we need to push
> it up again.

Do you want me to revert commit fe8e93504 to cap the lpi_id_bits
to no greater than ITS_MAX_LPI_NRBITS(16) instead this patch?
--
Cheers,
Jia

2018-08-27 09:11:26

by Marc Zyngier

[permalink] [raw]
Subject: Re: [PATCH] irqchip/gic-v3-its: add allocation max order limitation for lpi_id_bits

On Mon, 27 Aug 2018 02:41:04 +0100,
Jia He <[email protected]> wrote:
>
> Hi Marc
> Thanks for the comments
>
> On 8/27/2018 3:01 AM, Marc Zyngier Wrote:
> > [I'm travelling, so expect some major delays in responding to email]
> >
> > Hi Jia,
> >
> > On Sun, 26 Aug 2018 10:00:51 +0100,
> > Jia He <[email protected]> wrote:
> >>
> >> There is a WARN_ON when my QDF2400 server boots up (pagesize is 4k)
> >
> > [snip]
> >
> >> In its_alloc_lpi_tables, lpi_id_bits is 24, without this patch,
> >> its_allocate_prop_table will try to allocate 16M(order 12 if
> >> pagesize=4k). Thus it causes the WARN_ON.
> >
> > Gah! QDF and its 24bit INTIDs... Making life hell for everyone ;-)
> >
> > Sorry for breaking it.
>
> np, maybe QDF2400 is a little bit special
>
> >
> >>
> >> This patch fixes it by limiting the lpi_id_bits.
> >>
> >> Signed-off-by: Jia He <[email protected]>
> >> ---
> >> drivers/irqchip/irq-gic-v3-its.c | 5 ++++-
> >> 1 file changed, 4 insertions(+), 1 deletion(-)
> >>
> >> diff --git a/drivers/irqchip/irq-gic-v3-its.c b/drivers/irqchip/irq-gic-v3-its.c
> >> index 316a575..79e6993 100644
> >> --- a/drivers/irqchip/irq-gic-v3-its.c
> >> +++ b/drivers/irqchip/irq-gic-v3-its.c
> >> @@ -1624,8 +1624,11 @@ static void its_free_prop_table(struct page *prop_page)
> >> static int __init its_alloc_lpi_tables(void)
> >> {
> >> phys_addr_t paddr;
> >> + u32 max_bits; /*max order limitation in alloc_page*/
> >>
> >> - lpi_id_bits = GICD_TYPER_ID_BITS(gic_rdists->gicd_typer);
> >> + max_bits = PAGE_SHIFT + MAX_ORDER - 1;
> >> + lpi_id_bits = min_t(u32, max_bits,
> >> + GICD_TYPER_ID_BITS(gic_rdists->gicd_typer));
> >> gic_rdists->prop_page = its_allocate_prop_table(GFP_NOWAIT);
> >> if (!gic_rdists->prop_page) {
> >> pr_err("Failed to allocate PROPBASE\n");
> >> --
> >> 1.8.3.1
> >>
> >
> > I find it rather odd that we end-up with different interrupt ranges
> > depending on the CPU page size. Also, allocating that much memory for
> > LPIs is rather pointless, as we actually have a pretty low limit of
> > interrupts the system can deal with (see IRQ_BITMAP_BITS, which is
> > slightly more than 8k). I've so far seen *one* request to push it up,
> > but I doubt that it is a real use case.
> >
> > Capping lpi_id_bits at 16 (which is what we had before) is plenty,
> > will save a some memory, and gives some margin before we need to push
> > it up again.
>
> Do you want me to revert commit fe8e93504 to cap the lpi_id_bits
> to no greater than ITS_MAX_LPI_NRBITS(16) instead this patch?

Yes, this is probably the best course of action at this time. When I
dropped this, I was mostly thinking of the LPI allocator (which now
works with an almost constant memory footprint), and didn't pay enough
attention to the property table allocation (which is still a function
of the number of LPIs).

Thanks,

M.

--
Jazz is not dead, it just smell funny.