2020-08-29 17:08:18

by Daniel Exner

[permalink] [raw]
Subject: Kernel 5.9-rc Regression: Boot failure with nvme

Hi,

(please keep me in the loop, as I'm currently not subscribed)

Both 5.9-rc1 and -rc2 fail to boot with my TOSHIBA RD400 NVMe:

[ 1.015590] ------------[ cut here ]------------
[ 1.015594] WARNING: CPU: 7 PID: 99 at mm/page_alloc.c:4864
__alloc_pages_nodemask+0x299/0x330
[ 1.015594] Modules linked in: syscopyarea xhci_pci(+)
xhci_pci_renesas sysfillrect xhci_hcd nvme aesni_intel(+) crypto_simd
sysimgblt fb_sys_fops cryptd nvme_core t10_pi glue_helper drm hwmon
scsi_mod agpgart i2c_core usbcore video wmi button dm_mirror
dm_region_hash dm_log dm_mod unix ipv6 autofs4
[ 1.015602] CPU: 7 PID: 99 Comm: kworker/u16:1 Not tainted
5.9.0-rc2-dirty #12
[ 1.015603] Hardware name: To Be Filled By O.E.M. To Be Filled By
O.E.M./Z170 Gaming K6, BIOS P7.50 10/18/2018
[ 1.015607] Workqueue: nvme-reset-wq nvme_reset_work [nvme]
[ 1.015608] RIP: 0010:__alloc_pages_nodemask+0x299/0x330
[ 1.015609] Code: 66 0f 85 46 ff ff ff e8 24 46 dd ff e9 3c ff ff ff
e8 4b 4f fc ff 48 89 c7 e9 ad fe ff ff 81 e5 00 20 00 00 0f 85 7b ff ff
ff <0f> 0b e9 74 ff ff ff 31 c0 e9 1b fe ff ff 65 48 8b 04 25 00 6d 01
[ 1.015610] RSP: 0000:ffffb3ed002abcb8 EFLAGS: 00010246
[ 1.015611] RAX: 0000000000000000 RBX: ffff9c8e827c4118 RCX:
0000000000000000
[ 1.015611] RDX: 0000000000000000 RSI: 0000000000000034 RDI:
0000000000000cc0
[ 1.015612] RBP: 0000000000000000 R08: 0000000000000000 R09:
ffffffffffffffff
[ 1.015612] R10: 0000000000000006 R11: ffffb3ee002abb97 R12:
0000000000000000
[ 1.015613] R13: 0000000000000000 R14: ffff9c8e921060b0 R15:
0000000000000cc0
[ 1.015614] FS: 0000000000000000(0000) GS:ffff9c8e96bc0000(0000)
knlGS:0000000000000000
[ 1.015615] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1.015615] CR2: 0000559988e2f4d8 CR3: 00000008433e4004 CR4:
00000000003706e0
[ 1.015616] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[ 1.015617] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
0000000000000400
[ 1.015617] Call Trace:
[ 1.015621] dma_direct_alloc_pages+0x1e9/0x2c0
[ 1.015623] ? pci_alloc_irq_vectors_affinity+0xa5/0x100
[ 1.015626] nvme_alloc_queue+0x10a/0x170 [nvme]
[ 1.015629] nvme_reset_work+0x70b/0x12b0 [nvme]
[ 1.015631] ? nvme_irq_check+0x30/0x30 [nvme]
[ 1.015634] process_one_work+0x1da/0x3d0
[ 1.015636] worker_thread+0x4a/0x3c0
[ 1.015637] ? process_one_work+0x3d0/0x3d0
[ 1.015638] kthread+0x114/0x160
[ 1.015640] ? kthread_park+0x90/0x90
[ 1.015641] ret_from_fork+0x22/0x30
[ 1.015643] ---[ end trace 268d4f4db1ef121e ]---


Resulting in:
[ 1.015644] nvme nvme0: Removing after probe failure status: -12

If you need more info, I can try to provide it.

Greetings
Daniel




2020-08-29 17:58:32

by Linus Torvalds

[permalink] [raw]
Subject: Re: Kernel 5.9-rc Regression: Boot failure with nvme

Just adding Christoph to the participants list, since at a guess it's
due to his changes whether they came from the nvme side or the dma
side..

Christoph?

Linus

On Sat, Aug 29, 2020 at 10:06 AM Daniel Exner <[email protected]> wrote:
>
> Hi,
>
> (please keep me in the loop, as I'm currently not subscribed)
>
> Both 5.9-rc1 and -rc2 fail to boot with my TOSHIBA RD400 NVMe:
>
> [dmesg trace snipped]
>
>
> Resulting in:
> [ 1.015644] nvme nvme0: Removing after probe failure status: -12
>
> If you need more info, I can try to provide it.
>
> Greetings
> Daniel

2020-08-29 17:59:09

by Christoph Hellwig

[permalink] [raw]
Subject: Re: Kernel 5.9-rc Regression: Boot failure with nvme

On Sat, Aug 29, 2020 at 10:54:47AM -0700, Linus Torvalds wrote:
> Just adding Christoph to the participants list, since at a guess it's
> due to his changes whether they came from the nvme side or the dma
> side..
>
> Christoph?

This kinda looks like the sqsize regression we had in earlier 5.9-rc,
but that should have been fixed in -rc2 with

7442ddcedc344b6fa073692f165dffdd1889e780
Author: John Garry <[email protected]>
Date: Fri Aug 14 23:34:25 2020 +0800

nvme-pci: Use u32 for nvme_dev.q_depth and nvme_queue.q_depth

Daniel, can you double check that you don't have that commit?
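The mechanism behind that class of sqsize regression can be sketched in a
few lines: a queue depth stored in a 16-bit field keeps only the low 16
bits, so a configured depth of 65536 silently wraps, and the size later
derived from it no longer matches what was requested. A shell-arithmetic
illustration (the variable names are stand-ins, not the driver's actual
fields):

```shell
#!/bin/sh
# A u16 field keeps only the low 16 bits of the queue depth, so 65536
# silently becomes 0; widening the field to u32, as the fix's title
# says, preserves it. Names are illustrative, not the nvme driver's.
depth=$(( 0x10000 ))                 # 65536 entries: needs 17 bits
u16_depth=$(( depth & 0xffff ))      # what a u16 q_depth actually stores
u32_depth=$(( depth & 0xffffffff ))  # what a u32 q_depth stores
echo "requested=$depth u16=$u16_depth u32=$u32_depth"
```

which prints "requested=65536 u16=0 u32=65536".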

>
> Linus
>
> On Sat, Aug 29, 2020 at 10:06 AM Daniel Exner <[email protected]> wrote:
> >
> > Hi,
> >
> > (please keep me in the loop, as I'm currently not subscribed)
> >
> > Both 5.9-rc1 and -rc2 fail to boot with my TOSHIBA RD400 NVMe:
> >
> > [dmesg trace snipped]
> >
> >
> > Resulting in:
> > [ 1.015644] nvme nvme0: Removing after probe failure status: -12
> >
> > If you need more info, I can try to provide it.
> >
> > Greetings
> > Daniel
---end quoted text---

2020-08-29 18:38:13

by Gabriel C

[permalink] [raw]
Subject: Re: Kernel 5.9-rc Regression: Boot failure with nvme

Am Sa., 29. Aug. 2020 um 19:57 Uhr schrieb Christoph Hellwig <[email protected]>:
>
> On Sat, Aug 29, 2020 at 10:54:47AM -0700, Linus Torvalds wrote:
> > Just adding Christoph to the participants list, since at a guess it's
> > due to his changes whether they came from the nvme side or the dma
> > side..
> >
> > Christoph?
>
> This kinda looks like the sqsize regression we had in earlier 5.9-rc,
> but that should have been fixed in -rc2 with

git tag --contains 7442ddcedc344b6fa073692f165dffdd1889e780
returns nothing; that commit only exists in master, so it will probably land in -rc3.
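For anyone following along: "git tag --contains" lists the tags that
already include a commit, so empty output means no tagged -rc has it,
while "git merge-base --is-ancestor" asks the same question of the
checked-out tree. A scratch-repo sketch of both checks (the tag and
commit below stand in for v5.9-rc2 and the fix; they are not the real
kernel history):

```shell
#!/bin/sh
# Scratch repo: the tag plays v5.9-rc2, $fix plays 7442ddce on master.
set -e
cd "$(mktemp -d)"
git init -q .
git config user.email test@example.com
git config user.name test
echo base > file && git add file && git commit -qm "base"
git tag v5.9-rc2                          # the -rc was cut here
echo fix >> file && git commit -qam "nvme-pci: use u32 for q_depth"
fix=$(git rev-parse HEAD)                 # the fix landed after the tag

git tag --contains "$fix"                 # prints nothing: no tag has it
# ...but the checked-out tree does contain it:
git merge-base --is-ancestor "$fix" HEAD && echo "fix is in HEAD"
```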

>
> 7442ddcedc344b6fa073692f165dffdd1889e780
> Author: John Garry <[email protected]>
> Date: Fri Aug 14 23:34:25 2020 +0800
>
> nvme-pci: Use u32 for nvme_dev.q_depth and nvme_queue.q_depth
>
> Daniel, can you double check that you don't have that commit?
>
> >
> > Linus
> >
> > On Sat, Aug 29, 2020 at 10:06 AM Daniel Exner <[email protected]> wrote:
> > >
> > > Hi,
> > >
> > > (please keep me in the loop, as I'm currently not subscribed)
> > >
> > > Both 5.9-rc1 and -rc2 fail to boot with my TOSHIBA RD400 NVMe:
> > >
> > > [dmesg trace snipped]
> > >
> > >
> > > Resulting in:
> > > [ 1.015644] nvme nvme0: Removing after probe failure status: -12
> > >
> > > If you need more info, I can try to provide it.
> > >
> > > Greetings
> > > Daniel
> ---end quoted text---

2020-08-29 18:41:50

by Linus Torvalds

[permalink] [raw]
Subject: Re: Kernel 5.9-rc Regression: Boot failure with nvme

On Sat, Aug 29, 2020 at 11:36 AM Gabriel C <[email protected]> wrote:
>
> > This kinda looks like the sqsize regression we had in earlier 5.9-rc,
> > but that should have been fixed in -rc2 with
>
> git tag --contains 7442ddcedc344b6fa073692f165dffdd1889e780
> returns nothing, that commit only exists in master, so probably in -rc3.

Right you are - that commit is not in rc2.

Daniel - that commit will be in rc3 when I cut that tomorrow, but if
you are willing to check current -git to verify that yes, it's fixed,
that would be lovely.

Linus

2020-08-29 19:38:30

by Daniel Exner

[permalink] [raw]
Subject: Re: Kernel 5.9-rc Regression: Boot failure with nvme

On 29.08.20 20:40, Linus Torvalds wrote:
> On Sat, Aug 29, 2020 at 11:36 AM Gabriel C <[email protected]> wrote:
>>
>>> This kinda looks like the sqsize regression we had in earlier 5.9-rc,
>>> but that should have been fixed in -rc2 with
>>
>> git tag --contains 7442ddcedc344b6fa073692f165dffdd1889e780
>> returns nothing, that commit only exists in master, so probably in -rc3.
>
> Right you are - that commit is not in rc2.
>
> Daniel - that commit will be in rc3 when I cut that tomorrow, but if
> you are willing to check current -git to verify that yes, it's fixed,
> that would be lovely.

Tried current git and indeed it boots just fine. Thanks everyone!

Greetings
Daniel

2020-08-29 21:39:27

by David Rientjes

[permalink] [raw]
Subject: Re: Kernel 5.9-rc Regression: Boot failure with nvme

On Sat, 29 Aug 2020, Christoph Hellwig wrote:

> > Just adding Christoph to the participants list, since at a guess it's
> > due to his changes whether they came from the nvme side or the dma
> > side..
> >
> > Christoph?
>
> This kinda looks like the sqsize regression we had in earlier 5.9-rc,
> but that should have been fixed in -rc2 with
>
> 7442ddcedc344b6fa073692f165dffdd1889e780
> Author: John Garry <[email protected]>
> Date: Fri Aug 14 23:34:25 2020 +0800
>
> nvme-pci: Use u32 for nvme_dev.q_depth and nvme_queue.q_depth
>
> Daniel, can you double check that you don't have that commit?
>

Looks like Daniel has confirmed that this indeed does fix his issue --
great!

Christoph, re the plan to backport the atomic DMA pool support to 5.4 LTS
for the purposes of fixing the AMD SEV allocation issues, I've composed
the following list:

e860c299ac0d dma-remap: separate DMA atomic pools from direct remap code
c84dc6e68a1d dma-pool: add additional coherent pools to map to gfp mask
54adadf9b085 dma-pool: dynamically expanding atomic pools
76a19940bd62 dma-direct: atomic allocations must come from atomic coherent pools
2edc5bb3c5cc dma-pool: add pool sizes to debugfs
1d659236fb43 dma-pool: scale the default DMA coherent pool size with memory capacity
3ee06a6d532f dma-pool: fix too large DMA pools on medium memory size systems
dbed452a078d dma-pool: decouple DMA_REMAP from DMA_COHERENT_POOL
** 633d5fce78a6 dma-direct: always align allocation size in dma_direct_alloc_pages()
** 96a539fa3bb7 dma-direct: re-encrypt memory if dma_direct_alloc_pages() fails
** 56fccf21d196 dma-direct: check return value when encrypting or decrypting memory
** 1a2b3357e860 dma-direct: add missing set_memory_decrypted() for coherent mapping
d07ae4c48690 dma-mapping: DMA_COHERENT_POOL should select GENERIC_ALLOCATOR
71cdec4fab76 dma-mapping: warn when coherent pool is depleted
567f6a6eba0c dma-direct: provide function to check physical memory area validity
23e469be6239 dma-pool: get rid of dma_in_atomic_pool()
48b6703858dd dma-pool: introduce dma_guess_pool()
81e9d894e03f dma-pool: make sure atomic pool suits device
d9765e41d8e9 dma-pool: do not allocate pool memory from CMA
9420139f516d dma-pool: fix coherent pool allocations for IOMMU mappings
d7e673ec2c8e dma-pool: Only allocate from CMA when in same memory zone

[ The commits prefixed with ** are not absolutely required for atomic DMA
  pools but rather fix other issues with SEV in the DMA layer that I
  found along the way. They likely deserve their own stable backports,
  but I added them here because backporting in order should minimize
  conflicts. We'll make a note of that in the cover letter for the
  stable backport series. ]
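Applying a series like this to a stable tree is typically done
oldest-first with git cherry-pick -x, stopping at the first conflict so
it can be resolved by hand. A self-contained sketch, with a scratch repo
standing in for 5.4.y and two placeholder commits standing in for the
dma-pool series (nothing below is the real kernel history):

```shell
#!/bin/sh
# Scratch repo: the initial branch plays 5.4.y, "upstream" carries two
# placeholder fixes standing in for the series listed above.
set -e
cd "$(mktemp -d)"
git init -q .
git config user.email test@example.com
git config user.name test
echo base > pool.c && git add pool.c && git commit -qm "base"
stable=$(git symbolic-ref --short HEAD)
git checkout -qb upstream
echo fix1 >> pool.c && git commit -qam "dma-remap: separate atomic pools"
echo fix2 >> pool.c && git commit -qam "dma-pool: add coherent pools"
git checkout -q "$stable"

# Oldest-first, as in the list above; -x records the upstream SHA in
# each backported commit message.
for c in $(git rev-list --reverse upstream ^"$stable"); do
    git cherry-pick -x "$c" || {
        echo "conflict at $c: resolve, then git cherry-pick --continue" >&2
        break
    }
done
git log --oneline
```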

Do you know of any others to add? NVMe-specific fixes, perhaps John
Garry's fix above, or Intel IOMMU fixes?