LinuxLists.cc - [regression/bisected] 4.19 cycle boot time IO stalls

2018-09-05 08:57:02

Subject: [regression/bisected] 4.19 cycle boot time IO stalls

Greetings,

I've been seeing $subject, decided to take the time to try to bisect
the little bugger. The hangs are not 100% repeatable, and while
bisection with a 5 boot go/nogo threshold seemed to go smoothly, it
ended up fingering a merge commit (sigh).

Box has an SSD (unused only by windows 10 box came with) and 3 spinning
rust buckets that I normally use with BFQ via a udev rule, but CFQ does
the same, so scheduler is seemingly irrelevant. However, in 7 crash
dumps, all of which look about like the data below the bisect log,
there is something relevant, namely the hung 'tlp' task (powersaving
script of some sort for laptops according to the package description).
That knob twiddling script is present/hung in all, making me a tad
suspicious, and indeed, testing with the final (bad) kernel, all I have
to do to eliminate hangs is to remove the 'tlp' package. Verified via
remove, 5 boots work fine, reinstall, 2 of 5 hang, remove again, 10 of
10 work fine, reinstall, 2 in a row hang.

Seems pretty certain that tlp script is what inspires bug to raise its
ugly head. WRT bisection result itself, munged merge seems far less
likely than a false negative having knocked bisection off course.

72f02ba66bd83b54054da20eae550123de84da6f is the first bad commit
git bisect start
# good: [94710cac0ef4ee177a63b5227664b38c95bbf703] Linux 4.18
git bisect good 94710cac0ef4ee177a63b5227664b38c95bbf703
# bad: [60c1f89241d49bacf71035470684a8d7b4bb46ea] Merge tag 'dma-mapping-4.19-2' of git://git.infradead.org/users/hch/dma-mapping
git bisect bad 60c1f89241d49bacf71035470684a8d7b4bb46ea
# good: [54dbe75bbf1e189982516de179147208e90b5e45] Merge tag 'drm-next-2018-08-15' of git://anongit.freedesktop.org/drm/drm
git bisect good 54dbe75bbf1e189982516de179147208e90b5e45
# bad: [d5acba26bfa097a618be425522b1ec4269d3edaf] Merge tag 'char-misc-4.19-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/char-misc
git bisect bad d5acba26bfa097a618be425522b1ec4269d3edaf
# bad: [9bd553929f68921be0f2014dd06561e0c8249a0d] Merge tag 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/rdma/rdma
git bisect bad 9bd553929f68921be0f2014dd06561e0c8249a0d
# bad: [f91e654474d413201ae578820fb63f8a811f6c4e] Merge branch 'next-integrity' of git://git.kernel.org/pub/scm/linux/kernel/git/jmorris/linux-security
git bisect bad f91e654474d413201ae578820fb63f8a811f6c4e
# good: [c1c2ad82c772966d3cdb9a4852329fa2cf71853a] Merge tag 'edac_for_4.19' of git://git.kernel.org/pub/scm/linux/kernel/git/bp/bp
git bisect good c1c2ad82c772966d3cdb9a4852329fa2cf71853a
# good: [51372570ac3c919b036e760f4ca449e81cf8e995] scsi: core: use blk_mq_run_hw_queues in scsi_kick_queue
git bisect good 51372570ac3c919b036e760f4ca449e81cf8e995
# good: [ac7da1b787d9ea43680c487613269742c48d8747] Merge branches 'clk-actions-s700', 'clk-exynos-unused', 'clk-qcom-dispcc-845', 'clk-scmi-round' and 'clk-cs2000-spdx' into clk-next
git bisect good ac7da1b787d9ea43680c487613269742c48d8747
# good: [6ff0497402ef7269ee6a72f62eb85adaa7a4768e] gpiolib: Fix of_node inconsistency
git bisect good 6ff0497402ef7269ee6a72f62eb85adaa7a4768e
# bad: [2b2f2aedba985108cbc92a761ac0d9fc4c774616] Merge tag 'gfs2-4.19.fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/gfs2/linux-gfs2
git bisect bad 2b2f2aedba985108cbc92a761ac0d9fc4c774616
# good: [1d45bb7f9d2a5cbae1e5d9a5f72adad84db4d318] gfs2: Use iomap for stuffed direct I/O reads
git bisect good 1d45bb7f9d2a5cbae1e5d9a5f72adad84db4d318
# good: [db06f826ec12bf0701ea7fc0a3c0aa00b84417c8] Merge tag 'clk-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/clk/linux
git bisect good db06f826ec12bf0701ea7fc0a3c0aa00b84417c8
# good: [3f30f929bb17877ebc1653c6f3ff41863f1ba524] gfs2: cleanup: call gfs2_rgrp_ondisk2lvb from gfs2_rgrp_out
git bisect good 3f30f929bb17877ebc1653c6f3ff41863f1ba524
# good: [dffe12a82826082d2129ef91b17b257254cb60fc] gfs2: Fix gfs2_testbit to use clone bitmaps
git bisect good dffe12a82826082d2129ef91b17b257254cb60fc
# good: [f5580d0f8bf60993a5fbc73ee04678070ffbba57] gfs2: eliminate update_rgrp_lvb_unlinked
git bisect good f5580d0f8bf60993a5fbc73ee04678070ffbba57
# bad: [72f02ba66bd83b54054da20eae550123de84da6f] Merge tag 'scsi-misc' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi
git bisect bad 72f02ba66bd83b54054da20eae550123de84da6f
# first bad commit: [72f02ba66bd83b54054da20eae550123de84da6f] Merge tag 'scsi-misc' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi

crash> ps | grep UN
417 2 7 ffff8803f8992940 UN 0.0 0 0 [kworker/u16:4]
476 2 7 ffff8803f02e9b80 UN 0.0 0 0 [jbd2/sdb3-8]
3078 3072 6 ffff8803fb70e040 UN 1.0 1706708 177212 sddm-greeter
3204 3155 0 ffff8803fb6e3700 UN 0.0 13784 3520 tlp
crash> bt 417
PID: 417 TASK: ffff8803f8992940 CPU: 7 COMMAND: "kworker/u16:4"
#0 [ffff8803f04636f0] __schedule at ffffffff815cca93
#1 [ffff8803f0463780] schedule at ffffffff815cd0a8
#2 [ffff8803f0463788] io_schedule at ffffffff8108fb72
#3 [ffff8803f0463798] blk_mq_get_tag at ffffffff812b3ff1
#4 [ffff8803f0463800] blk_mq_get_request at ffffffff812aee79
#5 [ffff8803f0463838] blk_mq_make_request at ffffffff812b153e
#6 [ffff8803f04638a8] generic_make_request at ffffffff812a54c2
#7 [ffff8803f0463900] __test_set_page_writeback at ffffffff81172a4e
#8 [ffff8803f0463950] ext4_bio_write_page at ffffffffa01530d9 [ext4]
#9 [ffff8803f0463998] mpage_submit_page at ffffffffa0134903 [ext4]
#10 [ffff8803f04639b0] mpage_process_page_bufs at ffffffffa0134a06 [ext4]
#11 [ffff8803f04639c8] mpage_prepare_extent_to_map at ffffffffa0135812 [ext4]
#12 [ffff8803f0463a98] ext4_writepages at ffffffffa0139ffd [ext4]
#13 [ffff8803f0463be8] do_writepages at ffffffff81173dec
#14 [ffff8803f0463c60] __writeback_single_inode at ffffffff8120f4fd
#15 [ffff8803f0463ca8] writeback_sb_inodes at ffffffff8120fbda
#16 [ffff8803f0463d38] __writeback_inodes_wb at ffffffff8120fe7d
#17 [ffff8803f0463d78] wb_writeback at ffffffff812101c3
#18 [ffff8803f0463e10] wb_workfn at ffffffff8121201a
#19 [ffff8803f0463e98] process_one_work at ffffffff8107e895
#20 [ffff8803f0463ed8] worker_thread at ffffffff8107eaa0
#21 [ffff8803f0463f10] kthread at ffffffff810844e3
#22 [ffff8803f0463f50] ret_from_fork at ffffffff81600205
crash> bt 476
PID: 476 TASK: ffff8803f02e9b80 CPU: 7 COMMAND: "jbd2/sdb3-8"
#0 [ffff8803f23b7c70] __schedule at ffffffff815cca93
#1 [ffff8803f23b7d00] schedule at ffffffff815cd0a8
#2 [ffff8803f23b7d08] jbd2_journal_commit_transaction at ffffffffa0100fec [jbd2]
#3 [ffff8803f23b7ea8] kjournald2 at ffffffffa010668d [jbd2]
#4 [ffff8803f23b7f10] kthread at ffffffff810844e3
#5 [ffff8803f23b7f50] ret_from_fork at ffffffff81600205
crash> bt 3078
PID: 3078 TASK: ffff8803fb70e040 CPU: 6 COMMAND: "sddm-greeter"
#0 [ffff8803ef7dfcd8] __schedule at ffffffff815cca93
#1 [ffff8803ef7dfd68] schedule at ffffffff815cd0a8
#2 [ffff8803ef7dfd70] io_schedule at ffffffff8108fb72
#3 [ffff8803ef7dfd80] generic_file_read_iter at ffffffff81167607
#4 [ffff8803ef7dfe58] __vfs_read at ffffffff811e44db
#5 [ffff8803ef7dfed0] vfs_read at ffffffff811e45d9
#6 [ffff8803ef7dff00] ksys_read at ffffffff811e4b42
#7 [ffff8803ef7dff38] do_syscall_64 at ffffffff810021ab
#8 [ffff8803ef7dff50] entry_SYSCALL_64_after_hwframe at ffffffff81600088
RIP: 00007f12702eeb78 RSP: 00007ffcbad8f1a0 RFLAGS: 00000246
RAX: ffffffffffffffda RBX: 000055aeebda4558 RCX: 00007f12702eeb78
RDX: 0000000000004000 RSI: 000055aeebda4558 RDI: 000000000000001f
RBP: 0000000000004000 R8: 0000000000000000 R9: 0000000000000000
R10: fffffffffffffef8 R11: 0000000000000246 R12: 0000000000004000
R13: 0000000000004000 R14: 000000000000001f R15: 000055aeebda4558
ORIG_RAX: 0000000000000000 CS: 0033 SS: 002b
crash> bt 3204
PID: 3204 TASK: ffff8803fb6e3700 CPU: 0 COMMAND: "tlp"
#0 [ffff8803fbb8bc20] __schedule at ffffffff815cca93
#1 [ffff8803fbb8bcb0] schedule at ffffffff815cd0a8
#2 [ffff8803fbb8bcb8] io_schedule at ffffffff8108fb72
#3 [ffff8803fbb8bcc8] generic_file_read_iter at ffffffff81167607
#4 [ffff8803fbb8bda0] __vfs_read at ffffffff811e44db
#5 [ffff8803fbb8be18] vfs_read at ffffffff811e45d9
#6 [ffff8803fbb8be48] kernel_read at ffffffff811e46ac
#7 [ffff8803fbb8be60] prepare_binprm at ffffffff811ea206
#8 [ffff8803fbb8be98] __do_execve_file at ffffffff811ebdf5
#9 [ffff8803fbb8bf10] __x64_sys_execve at ffffffff811ec1e4
#10 [ffff8803fbb8bf38] do_syscall_64 at ffffffff810021ab
#11 [ffff8803fbb8bf50] entry_SYSCALL_64_after_hwframe at ffffffff81600088
RIP: 00007fc972412477 RSP: 00007ffc81d0a458 RFLAGS: 00000202
RAX: ffffffffffffffda RBX: 0000559e02ed3ad0 RCX: 00007fc972412477
RDX: 0000559e02d583a0 RSI: 0000559e02eda3d0 RDI: 0000559e02ed3ad0
RBP: 0000559e02eda3d0 R8: 0000559e02ed66c0 R9: 0000559e02ed8180
R10: 00000000000005b8 R11: 0000000000000202 R12: 0000559e02edf9b0
R13: 0000559e02d583a0 R14: 0000000000000000 R15: 0000559e02ed53e0
ORIG_RAX: 000000000000003b CS: 0033 SS: 002b
crash>

2018-09-05 13:40:44

by Jens Axboe

[permalink] [raw]

Subject: Re: [regression/bisected] 4.19 cycle boot time IO stalls

On 9/5/18 2:55 AM, Mike Galbraith wrote:
> Greetings,
>
> I've been seeing $subject, decided to take the time to try to bisect
> the little bugger. The hangs are not 100% repeatable, and while
> bisection with a 5 boot go/nogo threshold seemed to go smoothly, it
> ended up fingering a merge commit (sigh).
>
> Box has an SSD (unused only by windows 10 box came with) and 3 spinning
> rust buckets that I normally use with BFQ via a udev rule, but CFQ does
> the same, so scheduler is seemingly irrelevant. However, in 7 crash
> dumps, all of which look about like the data below the bisect log,
> there is something relevant, namely the hung 'tlp' task (powersaving
> script of some sort for laptops according to the package description).
> That knob twiddling script is present/hung in all, making me a tad
> suspicious, and indeed, testing with the final (bad) kernel, all I have
> to do to eliminate hangs is to remove the 'tlp' package. Verified via
> remove, 5 boots work fine, reinstall, 2 of 5 hang, remove again, 10 of
> 10 work fine, reinstall, 2 in a row hang.
>
> Seems pretty certain that tlp script is what inspires bug to raise its
> ugly head. WRT bisection result itself, munged merge seems far less
> likely than a false negative having knocked bisection off course.
>
> 72f02ba66bd83b54054da20eae550123de84da6f is the first bad commit

I bet it's the host busy change from Ming, which I already
reported as being the culprit for another test failure I had. For
some reason it's not merged yet, nudge nudge Martin. You can test
by reverting:

commit 328728630d9f2bf14b82ca30b5e47489beefe361
Author: Ming Lei <[email protected]>
Date: Sun Jun 24 22:03:27 2018 +0800

scsi: core: avoid host-wide host_busy counter for scsi_mq

which was slightly modified by 265d59aacbce, so you'll want to
yank that one first.

BTW, that suse email for me hasn't worked in 12 years :-)

--
Jens Axboe

2018-09-05 16:27:28

by Martin K. Petersen

[permalink] [raw]

Subject: Re: [regression/bisected] 4.19 cycle boot time IO stalls

Jens,

> I bet it's the host busy change from Ming, which I already
> reported as being the culprit for another test failure I had. For
> some reason it's not merged yet, nudge nudge Martin.

I reverted the two offending changes last week. James just hasn't sent
Linus a pull request yet...

--
Martin K. Petersen Oracle Linux Engineering

2018-09-05 16:31:51

by Mike Galbraith

[permalink] [raw]

Subject: Re: [regression/bisected] 4.19 cycle boot time IO stalls

On Wed, 2018-09-05 at 07:39 -0600, Jens Axboe wrote:
>
> I bet it's the host busy change from Ming, which I already
> reported as being the culprit for another test failure I had. For
> some reason it's not merged yet, nudge nudge Martin. You can test
> by reverting:
>
> commit 328728630d9f2bf14b82ca30b5e47489beefe361
> Author: Ming Lei <[email protected]>
> Date: Sun Jun 24 22:03:27 2018 +0800
>
> scsi: core: avoid host-wide host_busy counter for scsi_mq
>
> which was slightly modified by 265d59aacbce, so you'll want to
> yank that one first.

Bingo (woohoo, no dog slow take 2 required).

> BTW, that suse email for me hasn't worked in 12 years :-)

Yeah, I must have double tapped mouse, and picked up one of the many
ancient artifacts in Evolution's DB. Sorry about that.

-Mike

2018-09-05 16:33:31

by James Bottomley

[permalink] [raw]

Subject: Re: [regression/bisected] 4.19 cycle boot time IO stalls

On Wed, 2018-09-05 at 12:25 -0400, Martin K. Petersen wrote:
> Jens,
>
> > I bet it's the host busy change from Ming, which I already
> > reported as being the culprit for another test failure I had. For
> > some reason it's not merged yet, nudge nudge Martin.
>
> I reverted the two offending changes last week. James just hasn't
> sent Linus a pull request yet...

Yes, I did, just before packing up my board and kit and saying goodbye
to the carefree days of summer on the beach:

https://marc.info/?l=linux-scsi&m=153613531415018&w=2

I'm actually at heathrow now and should be home tomorrow, so things
should be back to normal next week.

James

2018-09-05 16:33:43

by Martin K. Petersen

[permalink] [raw]

Subject: Re: [regression/bisected] 4.19 cycle boot time IO stalls

> I reverted the two offending changes last week. James just hasn't sent
> Linus a pull request yet...

Actually, he did earlier today.

--
Martin K. Petersen Oracle Linux Engineering

2018-09-05 16:34:45

by Linus Torvalds

[permalink] [raw]

Subject: Re: [regression/bisected] 4.19 cycle boot time IO stalls

On Wed, Sep 5, 2018 at 9:31 AM Martin K. Petersen
<[email protected]> wrote:
>
> > I reverted the two offending changes last week. James just hasn't sent
> > Linus a pull request yet...
>
> Actually, he did earlier today.

And I've just pulled and pushed it out, so it's in my public tree now.

Thanks,

Linus