2018-08-31 01:55:37

by Dennis Zhou

Subject: [PATCH 00/15] blkcg ref count refactor/cleanup + blkcg avg_lat

Hi everyone,

This is a fairly lengthy patchset that aims to clean up reference
counting for blkcgs and blkgs. There are 4 problems this patchset
tries to address:
1. fix blkcg destruction
2. always associate a bio with a blkg
3. remove the extra css ref held by bios and utilize the blkg ref
4. add average latency tracking to the blkcg core, exposed in io.stat.

First, there is a regression in blkcg destruction where references
weren't properly put, causing blkcgs to never be destroyed. Previously,
blkgs were destroyed during offlining of the blkcg. This put back the
blkcg reference each blkg holds, allowing the blkcg's ref count to reach
zero; blkcg_css_free() is then called as part of the final cleanup.

To address the first problem, 0001 reverts the broken commit, 0002
delays blkg destruction until writeback has finished, and 0003 closes
a race window between blkg association and a css that is migrating or
dying. This should fix the issue where blkg_get() was called on a blkcg
that had already begun exiting; a bio that finds itself in this window
now simply falls back to the root blkg. Oddly enough, at one point
blk-throttle was reading policy data from one blkg while associating
with a potentially different one, which is how this was exposed.

0004 also addresses a similar problem with task_css(current, ...),
where association tries to get the css of a task that is migrating
while its cgroup is dying.

Second, both blk-throttle and blk-iolatency rely on blkg association
to enable their policies. Rather than have each policy (and future
policies) implement this logic independently, this consolidates it
such that all bios are tagged with a blkg.

Third, now that every bio holds a blkg reference, the blkcg can be
referenced through it rather than maintaining an additional pointer
and reference. So let's clean this up.
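
As an illustration, after 0010 the accessor reduces to roughly the
following (this is the shape 0010 gives bio_blkcg; see that patch's
hunk in include/linux/blk-cgroup.h below):

	static inline struct blkcg *bio_blkcg(struct bio *bio)
	{
		if (bio && bio->bi_blkg)
			return bio->bi_blkg->blkcg;
		return NULL;
	}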

Finally, it seems rather useful to know on average how well IOs are
doing per cgroup. This adds an average latency statistic to the blkcg
core that encompasses IOs from all descendants.

This patchset contains the following 15 patches:

0001-Revert-blk-throttle-fix-race-between-blkcg_bio_issue.patch
0002-blkcg-delay-blkg-destruction-until-after-writeback-h.patch
0003-blkcg-use-tryget-logic-when-associating-a-blkg-with-.patch
0004-blkcg-fix-ref-count-issue-with-bio_blkcg-using-task_.patch
0005-blkcg-update-blkg_lookup_create-to-do-locking.patch
0006-blkcg-always-associate-a-bio-with-a-blkg.patch
0007-blkcg-consolidate-bio_issue_init-and-blkg-associatio.patch
0008-blkcg-associate-a-blkg-for-pages-being-evicted-by-sw.patch
0009-blkcg-associate-writeback-bios-with-a-blkg.patch
0010-blkcg-remove-bio-bi_css-and-instead-use-bio-bi_blkg.patch
0011-blkcg-remove-additional-reference-to-the-css.patch
0012-blkcg-cleanup-and-make-blk_get_rl-use-blkg_lookup_cr.patch
0013-blkcg-change-blkg-reference-counting-to-use-percpu_r.patch
0014-blkcg-rename-blkg_try_get-to-blkg_tryget.patch
0015-blkcg-add-average-latency-tracking-to-blk-cgroup.patch

0001-0003 address the regression in the blkcg cleanup path.
0004 fixes a small window race condition with task_css(current, ...).
0005 is a preparatory patch that cleans up blkg lookup create.
0006-0009 associate all bios with a blkg: regular IO, swap, writeback.
0010 removes the extra css pointer in bios.
0011 removes the implicit reference left behind in 0010.
0012 cleans up blk_get_rl, making use of the new blkg ref.
0013 changes blkg ref counting from atomic to percpu.
0014 renames and makes blkg_try_get consistent with css_tryget.
0015 adds average latency tracking as part of blk-cgroup.

This patchset is on top of axboe#for-4.19/block #b86d865cb1ca.

diffstats below:

Dennis Zhou (Facebook) (15):
Revert "blk-throttle: fix race between blkcg_bio_issue_check() and
cgroup_rmdir()"
blkcg: delay blkg destruction until after writeback has finished
blkcg: use tryget logic when associating a blkg with a bio
blkcg: fix ref count issue with bio_blkcg using task_css
blkcg: update blkg_lookup_create to do locking
blkcg: always associate a bio with a blkg
blkcg: consolidate bio_issue_init and blkg association
blkcg: associate a blkg for pages being evicted by swap
blkcg: associate writeback bios with a blkg
blkcg: remove bio->bi_css and instead use bio->bi_blkg
blkcg: remove additional reference to the css
blkcg: cleanup and make blk_get_rl use blkg_lookup_create
blkcg: change blkg reference counting to use percpu_ref
blkcg: rename blkg_try_get to blkg_tryget
blkcg: add average latency tracking to blk-cgroup

Documentation/admin-guide/cgroup-v2.rst | 14 +-
block/bio.c | 187 +++++++++++----
block/blk-cgroup.c | 298 +++++++++++++++++-------
block/blk-iolatency.c | 26 +--
block/blk-throttle.c | 12 +-
block/bounce.c | 4 +-
drivers/block/loop.c | 5 +-
drivers/md/raid0.c | 2 +-
fs/buffer.c | 10 +-
fs/ext4/page-io.c | 2 +-
include/linux/bio.h | 23 +-
include/linux/blk-cgroup.h | 167 ++++++++-----
include/linux/blk_types.h | 1 -
include/linux/cgroup.h | 2 +
include/linux/writeback.h | 5 +-
kernel/cgroup/cgroup.c | 4 +-
kernel/trace/blktrace.c | 4 +-
mm/backing-dev.c | 5 +
mm/page_io.c | 2 +-
19 files changed, 520 insertions(+), 253 deletions(-)

Thanks,
Dennis


2018-08-31 01:55:52

by Dennis Zhou

Subject: [PATCH 15/15] blkcg: add average latency tracking to blk-cgroup

From: "Dennis Zhou (Facebook)" <[email protected]>

Latency is an important metric for understanding whether or not you're
receiving adequate service from your block devices. blk-iolatency
demonstrates the utility of such information.

This patch introduces a moving average for latency tracking to
blk-cgroup. The value can be found in all non-root cgroups in io.stat.
A bio's latency is counted and propagated up to, but excluding, the
root cgroup. It uses a minimum window of 1s, and windows only elapse
with active bios. A single value is contributed to the moving average
from each window. The percpu stats are long-running, so each interval
requires calculating the delta between the previous and current reads.
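
For reference, the update uses the scheduler's fixed-point load-average
macro; a minimal sketch of what one window contributes (FIXED_1 = 2048
and FSHIFT = 11 come from linux/sched/loadavg.h):

	/* CALC_LOAD(lat_avg, BLKG_EXP_12s, mean) expands to roughly: */
	lat_avg = (lat_avg * BLKG_EXP_12s +
		   mean * (FIXED_1 - BLKG_EXP_12s)) >> FSHIFT;

With BLKG_EXP_12s = 1884, each window keeps 1884/2048 ~= 92% of the
old average, and (1884/2048)^12 ~= 1/e, matching the 1/exp decay every
12 windows described in the documentation hunk below.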

Signed-off-by: Dennis Zhou <[email protected]>
---
Documentation/admin-guide/cgroup-v2.rst | 6 +-
block/bio.c | 3 +
block/blk-cgroup.c | 117 +++++++++++++++++++++++-
include/linux/blk-cgroup.h | 9 ++
4 files changed, 127 insertions(+), 8 deletions(-)

diff --git a/Documentation/admin-guide/cgroup-v2.rst b/Documentation/admin-guide/cgroup-v2.rst
index 2dc8f95077aa..1cdc0e4279c5 100644
--- a/Documentation/admin-guide/cgroup-v2.rst
+++ b/Documentation/admin-guide/cgroup-v2.rst
@@ -1521,9 +1521,9 @@ IO Latency Interface Files

avg_lat
This is an exponential moving average with a decay rate of 1/exp
- bound by the sampling interval. The decay rate interval can be
- calculated by multiplying the win value in io.stat by the
- corresponding number of samples based on the win value.
+ every 12 samples, with a sampling rate of 1s. Only IO activity
+ can elapse a window and idle periods extend the most recent
+ window.

win
The sampling window size in milliseconds. This is the minimum
diff --git a/block/bio.c b/block/bio.c
index a0b816811e7d..2739e6f5acb7 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -1720,6 +1720,9 @@ void bio_endio(struct bio *bio)
if (!bio_integrity_endio(bio))
return;

+ if (bio->bi_blkg && bio->bi_blkg->parent)
+ blkg_record_latency(bio);
+
if (bio->bi_disk)
rq_qos_done_bio(bio->bi_disk->queue, bio);

diff --git a/block/blk-cgroup.c b/block/blk-cgroup.c
index 1eaf097e38b0..b720ca629eea 100644
--- a/block/blk-cgroup.c
+++ b/block/blk-cgroup.c
@@ -17,6 +17,7 @@
#include <linux/ioprio.h>
#include <linux/kdev_t.h>
#include <linux/module.h>
+#include <linux/sched/loadavg.h>
#include <linux/sched/signal.h>
#include <linux/err.h>
#include <linux/blkdev.h>
@@ -32,6 +33,18 @@

#define MAX_KEY_LEN 100

+/*
+ * This constant is used to fake the fixed-point moving average calculation
+ * just like load average for blkg->lat_avg. The call to CALC_LOAD folds
+ * (FIXED_1 (2048) - exp_factor) * new_sample into lat_avg. The sampling
+ * window size is fixed to 1s, so BLKG_EXP_12s is the corresponding value
+ * to create a 1/exp decay rate every 12s when windows elapse immediately.
+ * Note, windows only elapse with IO activity and idle periods extend the
+ * most recent window.
+ */
+#define BLKG_EXP_12s 1884
+#define BLKG_STAT_WIN_SIZE NSEC_PER_SEC
+
/*
* blkcg_pol_mutex protects blkcg_policy[] and policy [de]activation.
* blkcg_pol_register_mutex nests outside of it and synchronizes entire
@@ -72,6 +85,9 @@ static void blkg_free(struct blkcg_gq *blkg)
if (!blkg)
return;

+ if (blkg->rq_stat)
+ free_percpu(blkg->rq_stat);
+
for (i = 0; i < BLKCG_MAX_POLS; i++)
if (blkg->pd[i])
blkcg_policy[i]->pd_free_fn(blkg->pd[i]);
@@ -120,7 +136,7 @@ static struct blkcg_gq *blkg_alloc(struct blkcg *blkcg, struct request_queue *q,
gfp_t gfp_mask)
{
struct blkcg_gq *blkg;
- int i;
+ int i, cpu;

/* alloc and init base part */
blkg = kzalloc_node(sizeof(*blkg), gfp_mask, q->node);
@@ -159,6 +175,20 @@ static struct blkcg_gq *blkg_alloc(struct blkcg *blkcg, struct request_queue *q,
pd->plid = i;
}

+ /* init rq_stats */
+ blkg->rq_stat = __alloc_percpu_gfp(sizeof(struct blk_rq_stat),
+ __alignof__(struct blk_rq_stat),
+ gfp_mask);
+ if (!blkg->rq_stat)
+ goto err_free;
+ for_each_possible_cpu(cpu) {
+ struct blk_rq_stat *s;
+ s = per_cpu_ptr(blkg->rq_stat, cpu);
+ blk_rq_stat_init(s);
+ }
+ blk_rq_stat_init(&blkg->last_rq_stat);
+ atomic64_set(&blkg->win_start, ktime_to_ns(ktime_get()));
+
return blkg;

err_free:
@@ -981,7 +1011,7 @@ static int blkcg_print_stat(struct seq_file *sf, void *v)
const char *dname;
char *buf;
struct blkg_rwstat rwstat;
- u64 rbytes, wbytes, rios, wios, dbytes, dios;
+ u64 rbytes, wbytes, rios, wios, dbytes, dios, avg_lat;
size_t size = seq_get_buf(sf, &buf), off = 0;
int i;
bool has_stats = false;
@@ -1012,14 +1042,16 @@ static int blkcg_print_stat(struct seq_file *sf, void *v)
wios = atomic64_read(&rwstat.aux_cnt[BLKG_RWSTAT_WRITE]);
dios = atomic64_read(&rwstat.aux_cnt[BLKG_RWSTAT_DISCARD]);

+ avg_lat = div64_u64(blkg->lat_avg, NSEC_PER_USEC);
+
spin_unlock_irq(blkg->q->queue_lock);

if (rbytes || wbytes || rios || wios) {
has_stats = true;
off += scnprintf(buf+off, size-off,
- "rbytes=%llu wbytes=%llu rios=%llu wios=%llu dbytes=%llu dios=%llu",
- rbytes, wbytes, rios, wios,
- dbytes, dios);
+ "rbytes=%llu wbytes=%llu rios=%llu wios=%llu dbytes=%llu dios=%llu avg_lat=%llu",
+ rbytes, wbytes, rios, wios, dbytes, dios,
+ avg_lat);
}

if (!blkcg_debug_stats)
@@ -1638,6 +1670,81 @@ void blkcg_policy_unregister(struct blkcg_policy *pol)
}
EXPORT_SYMBOL_GPL(blkcg_policy_unregister);

+/*
+ * This aggregates the latency of all bios under this cgroup and then
+ * advances the moving average window. A window contributes a single
+ * value to the moving average regardless of how many IOs occurred.
+ */
+static void blkg_aggregate_latency(struct blkcg_gq *blkg)
+{
+ struct blk_rq_stat rq_stat;
+ struct blk_rq_stat *last_rq_stat;
+ u64 mean;
+ int cpu;
+
+ blk_rq_stat_init(&rq_stat);
+ preempt_disable();
+ for_each_online_cpu(cpu) {
+ struct blk_rq_stat *s;
+ s = per_cpu_ptr(blkg->rq_stat, cpu);
+ blk_rq_stat_sum(&rq_stat, s);
+ }
+ preempt_enable();
+
+ last_rq_stat = &blkg->last_rq_stat;
+
+ mean = div64_u64(rq_stat.nr_samples * rq_stat.mean -
+ last_rq_stat->nr_samples * last_rq_stat->mean,
+ rq_stat.nr_samples - last_rq_stat->nr_samples);
+ CALC_LOAD(blkg->lat_avg, BLKG_EXP_12s, mean);
+ blkg->last_rq_stat = rq_stat;
+}
+
+/**
+ * blkg_record_latency - records the latency of a bio
+ * @bio: bio of interest
+ *
+ * This records the latency of a bio in all nodes up to root, excluding root.
+ */
+void blkg_record_latency(struct bio *bio)
+{
+ u64 now = ktime_to_ns(ktime_get());
+ u64 start = bio_issue_time(&bio->bi_issue);
+ u64 win_start, req_time;
+ struct blkcg_gq *blkg;
+ struct blk_rq_stat *rq_stat;
+ bool issue_as_root = bio_issue_as_root_blkg(bio);
+
+ blkg = bio->bi_blkg;
+ if (!blkg)
+ return;
+
+ /*
+ * Truncate @now to the same granularity that bio_issue_time() applies
+ * to the issue timestamp, so the two values are comparable.
+ */
+ now = __bio_issue_time(now);
+
+ if (now <= start || issue_as_root)
+ return;
+
+ req_time = now - start;
+
+ while (blkg && blkg->parent) {
+ rq_stat = get_cpu_ptr(blkg->rq_stat);
+ blk_rq_stat_add(rq_stat, req_time);
+ put_cpu_ptr(rq_stat);
+
+ win_start = atomic64_read(&blkg->win_start);
+ if (now > win_start && (now - win_start) >= BLKG_STAT_WIN_SIZE)
+ if (atomic64_cmpxchg(&blkg->win_start,
+ win_start, now) == win_start)
+ blkg_aggregate_latency(blkg);
+
+ blkg = blkg->parent;
+ }
+}
+
/*
* Scale the accumulated delay based on how long it has been since we updated
* the delay. We only call this when we are adding delay, in case it's been a
diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index 0134cdd270b8..215af051f876 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -136,6 +136,11 @@ struct blkcg_gq {

struct blkg_policy_data *pd[BLKCG_MAX_POLS];

+ struct blk_rq_stat __percpu *rq_stat;
+ struct blk_rq_stat last_rq_stat;
+ atomic64_t win_start;
+ u64 lat_avg;
+
struct rcu_head rcu_head;

atomic_t use_delay;
@@ -895,6 +900,8 @@ static inline void blkcg_clear_delay(struct blkcg_gq *blkg)
}
}

+void blkg_record_latency(struct bio *bio);
+
void blkcg_add_delay(struct blkcg_gq *blkg, u64 now, u64 delta);
void blkcg_schedule_throttle(struct request_queue *q, bool use_memdelay);
void blkcg_maybe_throttle_current(void);
@@ -917,6 +924,8 @@ struct blkcg_policy {

#define blkcg_root_css ((struct cgroup_subsys_state *)ERR_PTR(-EINVAL))

+static inline void blkg_record_latency(struct bio *bio) {}
+
static inline void blkcg_maybe_throttle_current(void) { }
static inline bool blk_cgroup_congested(void) { return false; }

--
2.17.1


2018-08-31 01:56:02

by Dennis Zhou

Subject: [PATCH 12/15] blkcg: cleanup and make blk_get_rl use blkg_lookup_create

From: "Dennis Zhou (Facebook)" <[email protected]>

blk_get_rl is responsible for identifying which request_list a request
should be allocated to. Tryget logic was added earlier, but the
semantics were left unchanged.

This patch makes better use of the bio already holding a reference to
the blkg in the hot path. The cold path uses the better fallback of
blkg_lookup_create rather than just blkg_lookup before falling back to
q->root_rl. If lookup_create fails with anything but -ENODEV, it falls
back to q->root_rl; -ENODEV causes the lookup to be retried.

A clarifying comment is added to explain why q->root_rl is used rather
than the root blkg's rl.
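
For context, a minimal caller-side sketch of how the two ends pair up
(hypothetical usage; the real caller is the legacy request allocation
path):

	struct request_list *rl;

	rl = blk_get_rl(q, bio);	/* takes a blkg ref, except for root */
	/* ... allocate the request from rl ... */
	blk_put_rl(rl);			/* drops that blkg ref */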

Signed-off-by: Dennis Zhou <[email protected]>
---
include/linux/blk-cgroup.h | 53 ++++++++++++++++++++++++++------------
1 file changed, 37 insertions(+), 16 deletions(-)

diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index 3eed491e4daa..97cb82029b18 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -537,28 +537,49 @@ static inline struct request_list *blk_get_rl(struct request_queue *q,

rcu_read_lock();

- blkcg = bio_blkcg(bio);
- if (!blkcg)
+ blkg = bio->bi_blkg;
+ if (blkg) {
+ blkcg = bio->bi_blkg->blkcg;
+ if (blkcg == &blkcg_root)
+ goto rl_use_root;
+
+ blkg_get(blkg);
+ return &blkg->rl;
+ }
+
+ while (true) {
blkcg = css_to_blkcg(blkcg_css());
+ if (blkcg == &blkcg_root)
+ goto rl_use_root;

- /* bypass blkg lookup and use @q->root_rl directly for root */
- if (blkcg == &blkcg_root)
- goto root_rl;
+ blkg = blkg_lookup(blkcg, q);

- /*
- * Try to use blkg->rl. blkg lookup may fail under memory pressure
- * or if either the blkcg or queue is going away. Fall back to
- * root_rl in such cases.
- */
- blkg = blkg_lookup(blkcg, q);
- if (unlikely(!blkg))
- goto root_rl;
+ if (unlikely(!blkg))
+ blkg = __blkg_lookup_create(blkcg, q);
+
+ if (IS_ERR(blkg)) {
+ if (PTR_ERR(blkg) == -ENODEV) {
+ cpu_relax();
+ continue;
+ } else {
+ goto rl_use_root;
+ }
+ }
+
+ if (blkg_try_get(blkg))
+ break;
+ cpu_relax();
+ }

- if (!blkg_try_get(blkg))
- goto root_rl;
rcu_read_unlock();
return &blkg->rl;
-root_rl:
+
+ /*
+ * Each blkg has its own request_list, however, the root blkcg
+ * uses the request_queue's root_rl. This is to avoid most
+ * overhead for the root blkcg.
+ */
+rl_use_root:
rcu_read_unlock();
return &q->root_rl;
}
--
2.17.1


2018-08-31 01:56:24

by Dennis Zhou

Subject: [PATCH 11/15] blkcg: remove additional reference to the css

From: "Dennis Zhou (Facebook)" <[email protected]>

The previous patch in this series removed carrying around a pointer to
the css in the bio. However, the blkg association logic still relied on
taking a reference on the css to ensure we wouldn't fail in getting a
reference for the blkg.

Here we remove the implicit dependency on the css and utilize tryget and
retry logic during association. This streamlines the three ways that
association can happen: generic, swap, writeback. They now share common
association logic with separate retry mechanisms for obtaining a copy of
the css.
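
Condensed from the hunks below, the common shape all three paths now
share is a tryget-with-retry loop:

	rcu_read_lock();
	while (true) {
		css = blkcg_css();	/* cgroup_e_css() in the swap path */
		ret = __bio_associate_blkg_from_css(bio, css);
		if (ret != -ENODEV)	/* -ENODEV: css went away, retry */
			break;
		cpu_relax();
	}
	rcu_read_unlock();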

Signed-off-by: Dennis Zhou <[email protected]>
---
block/bio.c | 89 +++++++++++++++++++++++++++-----------
include/linux/blk-cgroup.h | 35 ++++-----------
include/linux/cgroup.h | 2 +
kernel/cgroup/cgroup.c | 4 +-
4 files changed, 77 insertions(+), 53 deletions(-)

diff --git a/block/bio.c b/block/bio.c
index ec55ee810503..b792bffecce1 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -1949,18 +1949,42 @@ int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg)
return 0;
}

+/**
+ * __bio_associate_blkg_from_css - internal blkg association function
+ *
+ * This in the core association function that all association paths rely on.
+ * This handles -ENOMEM, but propagates -ENODEV to allow for separate retry
+ * scenarios. This takes a reference on the blkg, which is released upon
+ * freeing of the bio.
+ */
static int __bio_associate_blkg_from_css(struct bio *bio,
struct cgroup_subsys_state *css)
{
+ struct request_queue *q = bio->bi_disk->queue;
struct blkcg_gq *blkg;
+ int ret;

rcu_read_lock();

- blkg = blkg_lookup_create(css_to_blkcg(css), bio->bi_disk->queue);
+ if (!css || !css->parent) {
+ blkg = q->root_blkg;
+ } else {
+ blkg = blkg_lookup_create(css_to_blkcg(css), q);
+
+ if (IS_ERR(blkg)) {
+ ret = PTR_ERR(blkg);
+ if (ret != -ENOMEM)
+ blkg = q->root_blkg;
+ else
+ goto afc_out;
+ }
+ }

- rcu_read_unlock();
+ ret = bio_associate_blkg(bio, blkg);

- return bio_associate_blkg(bio, blkg);
+afc_out:
+ rcu_read_unlock();
+ return ret;
}

/**
@@ -1969,14 +1993,18 @@ static int __bio_associate_blkg_from_css(struct bio *bio,
* @css: target css
*
* Associate @bio with the blkg found by combining the css's blkg and the
- * request_queue of the @bio. This takes a reference on the css that will
- * be put upon freeing of @bio.
+ * request_queue of the @bio. This falls back to the queue's root_blkg if
+ * the association fails with the css.
*/
int bio_associate_blkg_from_css(struct bio *bio,
struct cgroup_subsys_state *css)
{
- css_get(css);
- return __bio_associate_blkg_from_css(bio, css);
+ if (unlikely(bio->bi_blkg))
+ return -EBUSY;
+ /* there is no retry to get another css so fallback to the root_blkg */
+ if (__bio_associate_blkg_from_css(bio, css))
+ bio_associate_blkg(bio, bio->bi_disk->queue->root_blkg);
+ return 0;
}
EXPORT_SYMBOL_GPL(bio_associate_blkg_from_css);

@@ -1987,22 +2015,35 @@ EXPORT_SYMBOL_GPL(bio_associate_blkg_from_css);
* @page: the page to lookup the blkcg from
*
* Associate @bio with the blkg from @page's owning memcg and the respective
- * request_queue. This works like every other associate function wrt
- * references.
+ * request_queue. If cgroup_e_css returns NULL, fall back to the queue's
+ * root_blkg.
*
* Note: this must be called after bio has an associated device.
*/
int bio_associate_blkg_from_page(struct bio *bio, struct page *page)
{
struct cgroup_subsys_state *css;
+ int ret;

if (unlikely(bio->bi_blkg))
return -EBUSY;
if (!page->mem_cgroup)
return 0;
- css = cgroup_get_e_css(page->mem_cgroup->css.cgroup, &io_cgrp_subsys);

- return __bio_associate_blkg_from_css(bio, css);
+ rcu_read_lock();
+
+ while (true) {
+ css = cgroup_e_css(page->mem_cgroup->css.cgroup,
+ &io_cgrp_subsys);
+
+ ret = __bio_associate_blkg_from_css(bio, css);
+ if (ret != -ENODEV)
+ break;
+ cpu_relax();
+ }
+
+ rcu_read_unlock();
+ return ret;
}
#endif /* CONFIG_MEMCG */

@@ -2012,12 +2053,12 @@ int bio_associate_blkg_from_page(struct bio *bio, struct page *page)
* @bio: target bio
*
* Associate @bio with the blkg found from the bio's css and the request_queue.
- * If one is not found, bio_lookup_blkg creates the blkg.
+ * If one is not found, bio_lookup_blkg creates the blkg. This falls back to
+ * the queue's root_blkg if association fails.
*/
int bio_associate_create_blkg(struct request_queue *q, struct bio *bio)
{
- struct blkcg *blkcg;
- struct blkcg_gq *blkg;
+ struct cgroup_subsys_state *css;
int ret = 0;

/* someone has already associated this bio with a blkg */
@@ -2026,19 +2067,19 @@ int bio_associate_create_blkg(struct request_queue *q, struct bio *bio)

rcu_read_lock();

- blkcg = css_to_blkcg(blkcg_get_css());
+ while (true) {
+ css = blkcg_css();

- if (!blkcg->css.parent) {
- ret = bio_associate_blkg(bio, q->root_blkg);
- goto assoc_out;
+ ret = __bio_associate_blkg_from_css(bio, css);
+ if (ret != -ENODEV)
+ break;
+ cpu_relax();
}

- blkg = blkg_lookup_create(blkcg, q);
- if (IS_ERR(blkg))
- blkg = q->root_blkg;
+ /* explicitly fall back to root */
+ if (unlikely(!bio->bi_blkg))
+ bio_associate_blkg(bio, q->root_blkg);

- ret = bio_associate_blkg(bio, blkg);
-assoc_out:
rcu_read_unlock();
return ret;
}
@@ -2054,8 +2095,6 @@ void bio_disassociate_task(struct bio *bio)
bio->bi_ioc = NULL;
}
if (bio->bi_blkg) {
- /* a ref is always taken on css */
- css_put(&bio_blkcg(bio)->css);
blkg_put(bio->bi_blkg);
bio->bi_blkg = NULL;
}
diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index 3c66154709ed..3eed491e4daa 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -233,31 +233,18 @@ int blkg_conf_prep(struct blkcg *blkcg, const struct blkcg_policy *pol,
void blkg_conf_finish(struct blkg_conf_ctx *ctx);

/**
- * blkcg_get_css - find and get a reference to the css
+ * blkcg_css - find the current css
*
* Find the css associated with either the kthread or the current task.
*/
-static inline struct cgroup_subsys_state *blkcg_get_css(void)
+static inline struct cgroup_subsys_state *blkcg_css(void)
{
struct cgroup_subsys_state *css;

- rcu_read_lock();
-
css = kthread_blkcg();
- if (css) {
- css_get(css);
- } else {
- while (true) {
- css = task_css(current, io_cgrp_id);
- if (likely(css_tryget(css)))
- break;
- cpu_relax();
- }
- }
-
- rcu_read_unlock();
-
- return css;
+ if (css)
+ return css;
+ return task_css(current, io_cgrp_id);
}

static inline struct blkcg *css_to_blkcg(struct cgroup_subsys_state *css)
@@ -551,11 +538,8 @@ static inline struct request_list *blk_get_rl(struct request_queue *q,
rcu_read_lock();

blkcg = bio_blkcg(bio);
- if (blkcg) {
- css_get(&blkcg->css);
- } else {
- blkcg = css_to_blkcg(blkcg_get_css());
- }
+ if (!blkcg)
+ blkcg = css_to_blkcg(blkcg_css());

/* bypass blkg lookup and use @q->root_rl directly for root */
if (blkcg == &blkcg_root)
@@ -570,7 +554,8 @@ static inline struct request_list *blk_get_rl(struct request_queue *q,
if (unlikely(!blkg))
goto root_rl;

- blkg_get(blkg);
+ if (!blkg_try_get(blkg))
+ goto root_rl;
rcu_read_unlock();
return &blkg->rl;
root_rl:
@@ -587,8 +572,6 @@ static inline struct request_list *blk_get_rl(struct request_queue *q,
*/
static inline void blk_put_rl(struct request_list *rl)
{
- /* an additional ref is always taken for rl */
- css_put(&rl->blkg->blkcg->css);
if (rl->blkg->blkcg != &blkcg_root)
blkg_put(rl->blkg);
}
diff --git a/include/linux/cgroup.h b/include/linux/cgroup.h
index c9fdf6f57913..0c4d56acfdca 100644
--- a/include/linux/cgroup.h
+++ b/include/linux/cgroup.h
@@ -93,6 +93,8 @@ extern struct css_set init_css_set;

bool css_has_online_children(struct cgroup_subsys_state *css);
struct cgroup_subsys_state *css_from_id(int id, struct cgroup_subsys *ss);
+struct cgroup_subsys_state *cgroup_e_css(struct cgroup *cgroup,
+ struct cgroup_subsys *ss);
struct cgroup_subsys_state *cgroup_get_e_css(struct cgroup *cgroup,
struct cgroup_subsys *ss);
struct cgroup_subsys_state *css_tryget_online_from_dir(struct dentry *dentry,
diff --git a/kernel/cgroup/cgroup.c b/kernel/cgroup/cgroup.c
index 077370bf8964..d3fa4bdd7407 100644
--- a/kernel/cgroup/cgroup.c
+++ b/kernel/cgroup/cgroup.c
@@ -498,8 +498,8 @@ static struct cgroup_subsys_state *cgroup_tryget_css(struct cgroup *cgrp,
* enabled. If @ss is associated with the hierarchy @cgrp is on, this
* function is guaranteed to return non-NULL css.
*/
-static struct cgroup_subsys_state *cgroup_e_css(struct cgroup *cgrp,
- struct cgroup_subsys *ss)
+struct cgroup_subsys_state *cgroup_e_css(struct cgroup *cgrp,
+ struct cgroup_subsys *ss)
{
lockdep_assert_held(&cgroup_mutex);

--
2.17.1


2018-08-31 01:56:38

by Dennis Zhou

Subject: [PATCH 14/15] blkcg: rename blkg_try_get to blkg_tryget

From: "Dennis Zhou (Facebook)" <[email protected]>

blkg reference counting now uses percpu_ref rather than atomic_t. Let's
make this consistent with css_tryget. This renames blkg_try_get to
blkg_tryget, which now returns a bool rather than the blkg or NULL.
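
Callers now follow the css_tryget convention, e.g. (a sketch):

	if (blkg_tryget(blkg)) {
		/* a ref is held; safe to use blkg here */
		blkg_put(blkg);
	}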

Signed-off-by: Dennis Zhou <[email protected]>
---
block/bio.c | 2 +-
block/blk-cgroup.c | 3 +--
block/blk-iolatency.c | 2 +-
include/linux/blk-cgroup.h | 10 ++++------
4 files changed, 7 insertions(+), 10 deletions(-)

diff --git a/block/bio.c b/block/bio.c
index b792bffecce1..a0b816811e7d 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -1943,7 +1943,7 @@ int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg)
{
if (unlikely(bio->bi_blkg))
return -EBUSY;
- if (!blkg_try_get(blkg))
+ if (!blkg_tryget(blkg))
return -ENODEV;
bio->bi_blkg = blkg;
return 0;
diff --git a/block/blk-cgroup.c b/block/blk-cgroup.c
index bbea4b44bd8f..1eaf097e38b0 100644
--- a/block/blk-cgroup.c
+++ b/block/blk-cgroup.c
@@ -1777,8 +1777,7 @@ void blkcg_maybe_throttle_current(void)
blkg = blkg_lookup(blkcg, q);
if (!blkg)
goto out;
- blkg = blkg_try_get(blkg);
- if (!blkg)
+ if (!blkg_tryget(blkg))
goto out;
rcu_read_unlock();

diff --git a/block/blk-iolatency.c b/block/blk-iolatency.c
index 9d7052bad6f7..5a4cec54c998 100644
--- a/block/blk-iolatency.c
+++ b/block/blk-iolatency.c
@@ -628,7 +628,7 @@ static void blkiolatency_timer_fn(struct timer_list *t)
* We could be exiting, don't access the pd unless we have a
* ref on the blkg.
*/
- if (!blkg_try_get(blkg))
+ if (!blkg_tryget(blkg))
continue;

iolat = blkg_to_lat(blkg);
diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index b60d063fb0d7..0134cdd270b8 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -459,17 +459,15 @@ static inline void blkg_get(struct blkcg_gq *blkg)
}

/**
- * blkg_try_get - try and get a blkg reference
+ * blkg_tryget - try and get a blkg reference
* @blkg: blkg to get
*
* This is for use when doing an RCU lookup of the blkg. We may be in the midst
* of freeing this blkg, so we can only use it if the refcnt is not zero.
*/
-static inline struct blkcg_gq *blkg_try_get(struct blkcg_gq *blkg)
+static inline bool blkg_tryget(struct blkcg_gq *blkg)
{
- if (percpu_ref_tryget(&blkg->refcnt))
- return blkg;
- return NULL;
+ return percpu_ref_tryget(&blkg->refcnt);
}

/**
@@ -560,7 +558,7 @@ static inline struct request_list *blk_get_rl(struct request_queue *q,
}
}

- if (blkg_try_get(blkg))
+ if (blkg_tryget(blkg))
break;
cpu_relax();
}
--
2.17.1


2018-08-31 01:56:41

by Dennis Zhou

Subject: [PATCH 13/15] blkcg: change blkg reference counting to use percpu_ref

From: "Dennis Zhou (Facebook)" <[email protected]>

Now that every bio is associated with a blkg, blkg_get, blkg_try_get,
and blkg_put are on the hot path. This switches the refcnt in blkg
over to percpu_ref.
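
Condensed from the hunks below, the blkg lifecycle maps onto the
percpu_ref API as:

	percpu_ref_init(&blkg->refcnt, __blkg_release, 0, GFP_KERNEL);
						/* in blkg_create() */
	percpu_ref_tryget(&blkg->refcnt);	/* blkg_try_get() */
	percpu_ref_put(&blkg->refcnt);		/* blkg_put() */
	percpu_ref_kill(&blkg->refcnt);		/* blkg_destroy() drops
						   the initial ref */
	percpu_ref_exit(&blkg->refcnt);		/* in __blkg_release() */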

Signed-off-by: Dennis Zhou <[email protected]>
---
block/blk-cgroup.c | 56 +++++++++++++++++++++-----------------
include/linux/blk-cgroup.h | 14 +++-------
2 files changed, 35 insertions(+), 35 deletions(-)

diff --git a/block/blk-cgroup.c b/block/blk-cgroup.c
index f678cd555814..bbea4b44bd8f 100644
--- a/block/blk-cgroup.c
+++ b/block/blk-cgroup.c
@@ -84,6 +84,30 @@ static void blkg_free(struct blkcg_gq *blkg)
kfree(blkg);
}

+/*
+ * A group is RCU protected, but having an rcu lock does not mean that one
+ * can access all the fields of blkg and assume these are valid. For
+ * example, don't try to follow throtl_data and request queue links.
+ *
+ * Having a reference to blkg under an rcu allows accesses to only values
+ * local to groups like group stats and group rate limits.
+ */
+static void __blkg_release(struct percpu_ref *ref)
+{
+ struct blkcg_gq *blkg = container_of(ref, struct blkcg_gq, refcnt);
+
+ percpu_ref_exit(&blkg->refcnt);
+
+ /* release the blkcg and parent blkg refs this blkg has been holding */
+ css_put(&blkg->blkcg->css);
+ if (blkg->parent)
+ blkg_put(blkg->parent);
+
+ wb_congested_put(blkg->wb_congested);
+
+ blkg_free(blkg);
+}
+
/**
* blkg_alloc - allocate a blkg
* @blkcg: block cgroup the new blkg is associated with
@@ -110,7 +134,6 @@ static struct blkcg_gq *blkg_alloc(struct blkcg *blkcg, struct request_queue *q,
blkg->q = q;
INIT_LIST_HEAD(&blkg->q_node);
blkg->blkcg = blkcg;
- atomic_set(&blkg->refcnt, 1);

/* root blkg uses @q->root_rl, init rl only for !root blkgs */
if (blkcg != &blkcg_root) {
@@ -217,6 +240,10 @@ static struct blkcg_gq *blkg_create(struct blkcg *blkcg,
blkg_get(blkg->parent);
}

+ ret = percpu_ref_init(&blkg->refcnt, __blkg_release, 0, GFP_KERNEL);
+ if (ret)
+ goto err_cancel_ref;
+
/* invoke per-policy init */
for (i = 0; i < BLKCG_MAX_POLS; i++) {
struct blkcg_policy *pol = blkcg_policy[i];
@@ -249,6 +276,8 @@ static struct blkcg_gq *blkg_create(struct blkcg *blkcg,
blkg_put(blkg);
return ERR_PTR(ret);

+err_cancel_ref:
+ percpu_ref_exit(&blkg->refcnt);
err_put_congested:
wb_congested_put(wb_congested);
err_put_css:
@@ -378,7 +407,7 @@ static void blkg_destroy(struct blkcg_gq *blkg)
* Put the reference taken at the time of creation so that when all
* queues are gone, group can be destroyed.
*/
- blkg_put(blkg);
+ percpu_ref_kill(&blkg->refcnt);
}

/**
@@ -405,29 +434,6 @@ static void blkg_destroy_all(struct request_queue *q)
q->root_rl.blkg = NULL;
}

-/*
- * A group is RCU protected, but having an rcu lock does not mean that one
- * can access all the fields of blkg and assume these are valid. For
- * example, don't try to follow throtl_data and request queue links.
- *
- * Having a reference to blkg under an rcu allows accesses to only values
- * local to groups like group stats and group rate limits.
- */
-void __blkg_release_rcu(struct rcu_head *rcu_head)
-{
- struct blkcg_gq *blkg = container_of(rcu_head, struct blkcg_gq, rcu_head);
-
- /* release the blkcg and parent blkg refs this blkg has been holding */
- css_put(&blkg->blkcg->css);
- if (blkg->parent)
- blkg_put(blkg->parent);
-
- wb_congested_put(blkg->wb_congested);
-
- blkg_free(blkg);
-}
-EXPORT_SYMBOL_GPL(__blkg_release_rcu);
-
/*
* The next function used by blk_queue_for_each_rl(). It's a bit tricky
* because the root blkg uses @q->root_rl instead of its own rl.
diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index 97cb82029b18..b60d063fb0d7 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -126,7 +126,7 @@ struct blkcg_gq {
struct request_list rl;

/* reference count */
- atomic_t refcnt;
+ struct percpu_ref refcnt;

/* is this blkg online? protected by both blkcg and q locks */
bool online;
@@ -455,8 +455,7 @@ static inline int blkg_path(struct blkcg_gq *blkg, char *buf, int buflen)
*/
static inline void blkg_get(struct blkcg_gq *blkg)
{
- WARN_ON_ONCE(atomic_read(&blkg->refcnt) <= 0);
- atomic_inc(&blkg->refcnt);
+ percpu_ref_get(&blkg->refcnt);
}

/**
@@ -468,23 +467,18 @@ static inline void blkg_get(struct blkcg_gq *blkg)
*/
static inline struct blkcg_gq *blkg_try_get(struct blkcg_gq *blkg)
{
- if (atomic_inc_not_zero(&blkg->refcnt))
+ if (percpu_ref_tryget(&blkg->refcnt))
return blkg;
return NULL;
}

-
-void __blkg_release_rcu(struct rcu_head *rcu);
-
/**
* blkg_put - put a blkg reference
* @blkg: blkg to put
*/
static inline void blkg_put(struct blkcg_gq *blkg)
{
- WARN_ON_ONCE(atomic_read(&blkg->refcnt) <= 0);
- if (atomic_dec_and_test(&blkg->refcnt))
- call_rcu(&blkg->rcu_head, __blkg_release_rcu);
+ percpu_ref_put(&blkg->refcnt);
}

/**
--
2.17.1


2018-08-31 01:56:42

by Dennis Zhou

Subject: [PATCH 10/15] blkcg: remove bio->bi_css and instead use bio->bi_blkg

From: "Dennis Zhou (Facebook)" <[email protected]>

Prior patches ensured that all bios are now associated with some blkg.
This makes bio->bi_css unnecessary, as the blkg already maintains a
reference to the blkcg.

This patch removes the bi_css field and converts its uses to go
through bi_blkg.

Signed-off-by: Dennis Zhou <[email protected]>
---
block/bio.c | 56 ++++++++------------------------------
block/bounce.c | 2 +-
drivers/block/loop.c | 5 ++--
drivers/md/raid0.c | 2 +-
include/linux/bio.h | 7 ++---
include/linux/blk-cgroup.h | 4 +--
include/linux/blk_types.h | 1 -
kernel/trace/blktrace.c | 4 +--
8 files changed, 22 insertions(+), 59 deletions(-)

diff --git a/block/bio.c b/block/bio.c
index 97ef994a08b6..ec55ee810503 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -609,7 +609,7 @@ void __bio_clone_fast(struct bio *bio, struct bio *bio_src)
bio->bi_iter = bio_src->bi_iter;
bio->bi_io_vec = bio_src->bi_io_vec;

- bio_clone_blkcg_association(bio, bio_src);
+ bio_clone_blkg_association(bio, bio_src);

bio_issue_init(&bio->bi_issue, bio_sectors(bio));
}
@@ -1930,34 +1930,6 @@ EXPORT_SYMBOL(bioset_init_from_src);

#ifdef CONFIG_BLK_CGROUP

-/**
- * bio_associate_blkcg - associate a bio with the specified blkcg
- * @bio: target bio
- * @blkcg_css: css of the blkcg to associate
- *
- * Associate @bio with the blkcg specified by @blkcg_css. Block layer will
- * treat @bio as if it were issued by a task which belongs to the blkcg.
- *
- * This function takes an extra reference of @blkcg_css which will be put
- * when @bio is released. The caller must own @bio and is responsible for
- * synchronizing calls to this function. If @blkcg_css is NULL, a call to
- * blkcg_get_css finds the current css from the kthread or task.
- */
-int bio_associate_blkcg(struct bio *bio, struct cgroup_subsys_state *blkcg_css)
-{
- if (unlikely(bio->bi_css))
- return -EBUSY;
-
- if (blkcg_css)
- css_get(blkcg_css);
- else
- blkcg_css = blkcg_get_css();
-
- bio->bi_css = blkcg_css;
- return 0;
-}
-EXPORT_SYMBOL_GPL(bio_associate_blkcg);
-
/**
* bio_associate_blkg - associate a bio with the specified blkg
* @bio: target bio
@@ -2004,7 +1976,6 @@ int bio_associate_blkg_from_css(struct bio *bio,
struct cgroup_subsys_state *css)
{
css_get(css);
- bio->bi_css = css;
return __bio_associate_blkg_from_css(bio, css);
}
EXPORT_SYMBOL_GPL(bio_associate_blkg_from_css);
@@ -2025,12 +1996,11 @@ int bio_associate_blkg_from_page(struct bio *bio, struct page *page)
{
struct cgroup_subsys_state *css;

- if (unlikely(bio->bi_css))
+ if (unlikely(bio->bi_blkg))
return -EBUSY;
if (!page->mem_cgroup)
return 0;
css = cgroup_get_e_css(page->mem_cgroup->css.cgroup, &io_cgrp_subsys);
- bio->bi_css = css;

return __bio_associate_blkg_from_css(bio, css);
}
@@ -2056,8 +2026,7 @@ int bio_associate_create_blkg(struct request_queue *q, struct bio *bio)

rcu_read_lock();

- bio_associate_blkcg(bio, NULL);
- blkcg = bio_blkcg(bio);
+ blkcg = css_to_blkcg(blkcg_get_css());

if (!blkcg->css.parent) {
ret = bio_associate_blkg(bio, q->root_blkg);
@@ -2084,30 +2053,27 @@ void bio_disassociate_task(struct bio *bio)
put_io_context(bio->bi_ioc);
bio->bi_ioc = NULL;
}
- if (bio->bi_css) {
- css_put(bio->bi_css);
- bio->bi_css = NULL;
- }
if (bio->bi_blkg) {
+ /* a ref is always taken on css */
+ css_put(&bio_blkcg(bio)->css);
blkg_put(bio->bi_blkg);
bio->bi_blkg = NULL;
}
}

/**
- * bio_clone_blkcg_association - clone blkcg association from src to dst bio
+ * bio_clone_blkg_association - clone blkg association from src to dst bio
* @dst: destination bio
* @src: source bio
*/
-void bio_clone_blkcg_association(struct bio *dst, struct bio *src)
+void bio_clone_blkg_association(struct bio *dst, struct bio *src)
{
- if (src->bi_css)
- WARN_ON(bio_associate_blkcg(dst, src->bi_css));
-
- if (src->bi_blkg)
+ if (src->bi_blkg) {
+ css_get(&bio_blkcg(src)->css);
bio_associate_blkg(dst, src->bi_blkg);
+ }
}
-EXPORT_SYMBOL_GPL(bio_clone_blkcg_association);
+EXPORT_SYMBOL_GPL(bio_clone_blkg_association);
#endif /* CONFIG_BLK_CGROUP */

static void __init biovec_init_slabs(void)
diff --git a/block/bounce.c b/block/bounce.c
index bea3b0cbe4a7..9f7071e176a4 100644
--- a/block/bounce.c
+++ b/block/bounce.c
@@ -257,7 +257,7 @@ static struct bio *bounce_clone_bio(struct bio *bio_src, gfp_t gfp_mask,
}
}

- bio_clone_blkcg_association(bio, bio_src);
+ bio_clone_blkg_association(bio, bio_src);

bio_issue_init(&bio->bi_issue, bio_sectors(bio));

diff --git a/drivers/block/loop.c b/drivers/block/loop.c
index ea9debf59b22..abad6d15f956 100644
--- a/drivers/block/loop.c
+++ b/drivers/block/loop.c
@@ -77,6 +77,7 @@
#include <linux/falloc.h>
#include <linux/uio.h>
#include <linux/ioprio.h>
+#include <linux/blk-cgroup.h>

#include "loop.h"

@@ -1760,8 +1761,8 @@ static blk_status_t loop_queue_rq(struct blk_mq_hw_ctx *hctx,

/* always use the first bio's css */
#ifdef CONFIG_BLK_CGROUP
- if (cmd->use_aio && rq->bio && rq->bio->bi_css) {
- cmd->css = rq->bio->bi_css;
+ if (cmd->use_aio && rq->bio && rq->bio->bi_blkg) {
+ cmd->css = &bio_blkcg(rq->bio)->css;
css_get(cmd->css);
} else
#endif
diff --git a/drivers/md/raid0.c b/drivers/md/raid0.c
index ac1cffd2a09b..f3fb5bb8c82a 100644
--- a/drivers/md/raid0.c
+++ b/drivers/md/raid0.c
@@ -542,7 +542,7 @@ static void raid0_handle_discard(struct mddev *mddev, struct bio *bio)
!discard_bio)
continue;
bio_chain(discard_bio, bio);
- bio_clone_blkcg_association(discard_bio, bio);
+ bio_clone_blkg_association(discard_bio, bio);
if (mddev->gendisk)
trace_block_bio_remap(bdev_get_queue(rdev->bdev),
discard_bio, disk_devt(mddev->gendisk),
diff --git a/include/linux/bio.h b/include/linux/bio.h
index 81530d11d34d..175e5e349d30 100644
--- a/include/linux/bio.h
+++ b/include/linux/bio.h
@@ -554,16 +554,13 @@ static inline int bio_associate_blkg_from_page(struct bio *bio,
#endif

#ifdef CONFIG_BLK_CGROUP
-int bio_associate_blkcg(struct bio *bio, struct cgroup_subsys_state *blkcg_css);
int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg);
int bio_associate_blkg_from_css(struct bio *bio,
struct cgroup_subsys_state *css);
int bio_associate_create_blkg(struct request_queue *q, struct bio *bio);
void bio_disassociate_task(struct bio *bio);
-void bio_clone_blkcg_association(struct bio *dst, struct bio *src);
+void bio_clone_blkg_association(struct bio *dst, struct bio *src);
#else /* CONFIG_BLK_CGROUP */
-static inline int bio_associate_blkcg(struct bio *bio,
- struct cgroup_subsys_state *blkcg_css) { return 0; }
static inline int bio_associate_blkg(struct bio *bio,
struct blkcg_gq *blkg) { return 0; }
static inline int bio_associate_blkg_from_css(struct bio *bio,
@@ -572,7 +569,7 @@ static inline int bio_associate_blkg_from_css(struct bio *bio,
static inline int bio_associate_create_blkg(struct request_queue *q,
struct bio *bio) { return 0; }
static inline void bio_disassociate_task(struct bio *bio) { }
-static inline void bio_clone_blkcg_association(struct bio *dst,
+static inline void bio_clone_blkg_association(struct bio *dst,
struct bio *src) { }
#endif /* CONFIG_BLK_CGROUP */

diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index 55c348d66372..3c66154709ed 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -275,8 +275,8 @@ static inline struct blkcg *css_to_blkcg(struct cgroup_subsys_state *css)
*/
static inline struct blkcg *bio_blkcg(struct bio *bio)
{
- if (bio && bio->bi_css)
- return css_to_blkcg(bio->bi_css);
+ if (bio && bio->bi_blkg)
+ return bio->bi_blkg->blkcg;
return NULL;
}

diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
index f6dfb30737d8..9578c7ab1eb6 100644
--- a/include/linux/blk_types.h
+++ b/include/linux/blk_types.h
@@ -178,7 +178,6 @@ struct bio {
* release. Read comment on top of bio_associate_current().
*/
struct io_context *bi_ioc;
- struct cgroup_subsys_state *bi_css;
struct blkcg_gq *bi_blkg;
struct bio_issue bi_issue;
#endif
diff --git a/kernel/trace/blktrace.c b/kernel/trace/blktrace.c
index b951aa1fac61..cd90f653c804 100644
--- a/kernel/trace/blktrace.c
+++ b/kernel/trace/blktrace.c
@@ -776,9 +776,9 @@ blk_trace_bio_get_cgid(struct request_queue *q, struct bio *bio)
if (!bt || !(blk_tracer_flags.val & TRACE_BLK_OPT_CGROUP))
return NULL;

- if (!bio->bi_css)
+ if (!bio->bi_blkg)
return NULL;
- return cgroup_get_kernfs_id(bio->bi_css->cgroup);
+ return cgroup_get_kernfs_id(bio_blkcg(bio)->css.cgroup);
}
#else
static union kernfs_node_id *
--
2.17.1


2018-08-31 01:56:46

by Dennis Zhou

Subject: [PATCH 09/15] blkcg: associate writeback bios with a blkg

From: "Dennis Zhou (Facebook)" <[email protected]>

One of the goals of this series is to remove the bio's separate
reference to the css, which can and should be accessed via bio_blkcg.
In this patch, the wbc_init_bio call is changed such that it must be
called after a queue has been associated with the bio.
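
The required ordering, condensed from the fs/buffer.c hunk below:

	bio = bio_alloc(GFP_NOIO, 1);
	bio_set_dev(bio, bh->b_bdev);	/* associate the device first */
	/* ... fill in sector, end_io, op flags ... */
	wbc_init_bio(wbc, bio);		/* then the writeback cgroup */
	submit_bio(bio);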

Signed-off-by: Dennis Zhou <[email protected]>
---
Documentation/admin-guide/cgroup-v2.rst | 8 +++++---
fs/buffer.c | 10 +++++-----
fs/ext4/page-io.c | 2 +-
include/linux/writeback.h | 5 +++--
4 files changed, 14 insertions(+), 11 deletions(-)

diff --git a/Documentation/admin-guide/cgroup-v2.rst b/Documentation/admin-guide/cgroup-v2.rst
index 1746131bc9cb..2dc8f95077aa 100644
--- a/Documentation/admin-guide/cgroup-v2.rst
+++ b/Documentation/admin-guide/cgroup-v2.rst
@@ -1839,8 +1839,10 @@ following two functions.

wbc_init_bio(@wbc, @bio)
Should be called for each bio carrying writeback data and
- associates the bio with the inode's owner cgroup. Can be
- called anytime between bio allocation and submission.
+ associates the bio with the inode's owner cgroup and the
+ corresponding request queue. This must be called after
+ a queue (device) has been associated with the bio and
+ before submission.

wbc_account_io(@wbc, @page, @bytes)
Should be called for each data segment being written out.
@@ -1859,7 +1861,7 @@ the configuration, the bio may be executed at a lower priority and if
the writeback session is holding shared resources, e.g. a journal
entry, may lead to priority inversion. There is no one easy solution
for the problem. Filesystems can try to work around specific problem
-cases by skipping wbc_init_bio() or using bio_associate_blkcg()
+cases by skipping wbc_init_bio() or using bio_associate_create_blkg()
directly.


diff --git a/fs/buffer.c b/fs/buffer.c
index cabc045f483d..7fb8adb44583 100644
--- a/fs/buffer.c
+++ b/fs/buffer.c
@@ -3049,11 +3049,6 @@ static int submit_bh_wbc(int op, int op_flags, struct buffer_head *bh,
*/
bio = bio_alloc(GFP_NOIO, 1);

- if (wbc) {
- wbc_init_bio(wbc, bio);
- wbc_account_io(wbc, bh->b_page, bh->b_size);
- }
-
bio->bi_iter.bi_sector = bh->b_blocknr * (bh->b_size >> 9);
bio_set_dev(bio, bh->b_bdev);
bio->bi_write_hint = write_hint;
@@ -3073,6 +3068,11 @@ static int submit_bh_wbc(int op, int op_flags, struct buffer_head *bh,
op_flags |= REQ_PRIO;
bio_set_op_attrs(bio, op, op_flags);

+ if (wbc) {
+ wbc_init_bio(wbc, bio);
+ wbc_account_io(wbc, bh->b_page, bh->b_size);
+ }
+
submit_bio(bio);
return 0;
}
diff --git a/fs/ext4/page-io.c b/fs/ext4/page-io.c
index db7590178dfc..2aa62d58d8dd 100644
--- a/fs/ext4/page-io.c
+++ b/fs/ext4/page-io.c
@@ -374,13 +374,13 @@ static int io_submit_init_bio(struct ext4_io_submit *io,
bio = bio_alloc(GFP_NOIO, BIO_MAX_PAGES);
if (!bio)
return -ENOMEM;
- wbc_init_bio(io->io_wbc, bio);
bio->bi_iter.bi_sector = bh->b_blocknr * (bh->b_size >> 9);
bio_set_dev(bio, bh->b_bdev);
bio->bi_end_io = ext4_end_bio;
bio->bi_private = ext4_get_io_end(io->io_end);
io->io_bio = bio;
io->io_next_block = bh->b_blocknr;
+ wbc_init_bio(io->io_wbc, bio);
return 0;
}

diff --git a/include/linux/writeback.h b/include/linux/writeback.h
index fdfd04e348f6..738a0c24874f 100644
--- a/include/linux/writeback.h
+++ b/include/linux/writeback.h
@@ -246,7 +246,8 @@ static inline void wbc_attach_fdatawrite_inode(struct writeback_control *wbc,
*
* @bio is a part of the writeback in progress controlled by @wbc. Perform
* writeback specific initialization. This is used to apply the cgroup
- * writeback context.
+ * writeback context. Must be called after the bio has been associated with
+ * a device.
*/
static inline void wbc_init_bio(struct writeback_control *wbc, struct bio *bio)
{
@@ -257,7 +258,7 @@ static inline void wbc_init_bio(struct writeback_control *wbc, struct bio *bio)
* regular writeback instead of writing things out itself.
*/
if (wbc->wb)
- bio_associate_blkcg(bio, wbc->wb->blkcg_css);
+ bio_associate_blkg_from_css(bio, wbc->wb->blkcg_css);
}

#else /* CONFIG_CGROUP_WRITEBACK */
--
2.17.1


2018-08-31 01:56:52

by Dennis Zhou

Subject: [PATCH 08/15] blkcg: associate a blkg for pages being evicted by swap

From: "Dennis Zhou (Facebook)" <[email protected]>

A prior patch in this series added blkg association to bios issued by
cgroups. There are two other paths whose work we want to attribute
back to the appropriate cgroup: swap and writeback. Here we modify the
way swap tags bios so they include the blkg. Writeback will be tackled
in the next patch.

Signed-off-by: Dennis Zhou <[email protected]>
---
block/bio.c | 83 ++++++++++++++++++++++++++++++++-------------
include/linux/bio.h | 11 ++++--
mm/page_io.c | 2 +-
3 files changed, 68 insertions(+), 28 deletions(-)

diff --git a/block/bio.c b/block/bio.c
index ab41f5b7eb1f..97ef994a08b6 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -1930,30 +1930,6 @@ EXPORT_SYMBOL(bioset_init_from_src);

#ifdef CONFIG_BLK_CGROUP

-#ifdef CONFIG_MEMCG
-/**
- * bio_associate_blkcg_from_page - associate a bio with the page's blkcg
- * @bio: target bio
- * @page: the page to lookup the blkcg from
- *
- * Associate @bio with the blkcg from @page's owning memcg. This works like
- * every other associate function wrt references.
- */
-int bio_associate_blkcg_from_page(struct bio *bio, struct page *page)
-{
- struct cgroup_subsys_state *blkcg_css;
-
- if (unlikely(bio->bi_css))
- return -EBUSY;
- if (!page->mem_cgroup)
- return 0;
- blkcg_css = cgroup_get_e_css(page->mem_cgroup->css.cgroup,
- &io_cgrp_subsys);
- bio->bi_css = blkcg_css;
- return 0;
-}
-#endif /* CONFIG_MEMCG */
-
/**
* bio_associate_blkcg - associate a bio with the specified blkcg
* @bio: target bio
@@ -2001,6 +1977,65 @@ int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg)
return 0;
}

+static int __bio_associate_blkg_from_css(struct bio *bio,
+ struct cgroup_subsys_state *css)
+{
+ struct blkcg_gq *blkg;
+
+ rcu_read_lock();
+
+ blkg = blkg_lookup_create(css_to_blkcg(css), bio->bi_disk->queue);
+
+ rcu_read_unlock();
+
+ return bio_associate_blkg(bio, blkg);
+}
+
+/**
+ * bio_associate_blkg_from_css - associate a bio with a specified css
+ * @bio: target bio
+ * @css: target css
+ *
+ * Associate @bio with the blkg found by combining the css's blkg and the
+ * request_queue of the @bio. This takes a reference on the css that will
+ * be put upon freeing of @bio.
+ */
+int bio_associate_blkg_from_css(struct bio *bio,
+ struct cgroup_subsys_state *css)
+{
+ css_get(css);
+ bio->bi_css = css;
+ return __bio_associate_blkg_from_css(bio, css);
+}
+EXPORT_SYMBOL_GPL(bio_associate_blkg_from_css);
+
+#ifdef CONFIG_MEMCG
+/**
+ * bio_associate_blkg_from_page - associate a bio with the page's blkg
+ * @bio: target bio
+ * @page: the page to lookup the blkcg from
+ *
+ * Associate @bio with the blkg from @page's owning memcg and the respective
+ * request_queue. This works like every other associate function wrt
+ * references.
+ *
+ * Note: this must be called after bio has an associated device.
+ */
+int bio_associate_blkg_from_page(struct bio *bio, struct page *page)
+{
+ struct cgroup_subsys_state *css;
+
+ if (unlikely(bio->bi_css))
+ return -EBUSY;
+ if (!page->mem_cgroup)
+ return 0;
+ css = cgroup_get_e_css(page->mem_cgroup->css.cgroup, &io_cgrp_subsys);
+ bio->bi_css = css;
+
+ return __bio_associate_blkg_from_css(bio, css);
+}
+#endif /* CONFIG_MEMCG */
+
/**
* bio_associate_create_blkg - associate a bio with a blkg from q
* @q: request_queue where bio is going
diff --git a/include/linux/bio.h b/include/linux/bio.h
index d4626108b9d7..81530d11d34d 100644
--- a/include/linux/bio.h
+++ b/include/linux/bio.h
@@ -547,15 +547,17 @@ do { \
disk_devt((bio)->bi_disk)

#if defined(CONFIG_MEMCG) && defined(CONFIG_BLK_CGROUP)
-int bio_associate_blkcg_from_page(struct bio *bio, struct page *page);
+int bio_associate_blkg_from_page(struct bio *bio, struct page *page);
#else
-static inline int bio_associate_blkcg_from_page(struct bio *bio,
- struct page *page) { return 0; }
+static inline int bio_associate_blkg_from_page(struct bio *bio,
+ struct page *page) { return 0; }
#endif

#ifdef CONFIG_BLK_CGROUP
int bio_associate_blkcg(struct bio *bio, struct cgroup_subsys_state *blkcg_css);
int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg);
+int bio_associate_blkg_from_css(struct bio *bio,
+ struct cgroup_subsys_state *css);
int bio_associate_create_blkg(struct request_queue *q, struct bio *bio);
void bio_disassociate_task(struct bio *bio);
void bio_clone_blkcg_association(struct bio *dst, struct bio *src);
@@ -564,6 +566,9 @@ static inline int bio_associate_blkcg(struct bio *bio,
struct cgroup_subsys_state *blkcg_css) { return 0; }
static inline int bio_associate_blkg(struct bio *bio,
struct blkcg_gq *blkg) { return 0; }
+static inline int bio_associate_blkg_from_css(struct bio *bio,
+ struct cgroup_subsys_state *css)
+{ return 0; }
static inline int bio_associate_create_blkg(struct request_queue *q,
struct bio *bio) { return 0; }
static inline void bio_disassociate_task(struct bio *bio) { }
diff --git a/mm/page_io.c b/mm/page_io.c
index aafd19ec1db4..573d3663d846 100644
--- a/mm/page_io.c
+++ b/mm/page_io.c
@@ -339,7 +339,7 @@ int __swap_writepage(struct page *page, struct writeback_control *wbc,
goto out;
}
bio->bi_opf = REQ_OP_WRITE | REQ_SWAP | wbc_to_write_flags(wbc);
- bio_associate_blkcg_from_page(bio, page);
+ bio_associate_blkg_from_page(bio, page);
count_swpout_vm_event(page);
set_page_writeback(page);
unlock_page(page);
--
2.17.1


2018-08-31 01:56:55

by Dennis Zhou

Subject: [PATCH 06/15] blkcg: always associate a bio with a blkg

From: "Dennis Zhou (Facebook)" <[email protected]>

Previously, blkgs were only assigned as needed by blk-iolatency and
blk-throttle. bio->bi_css was also always being associated while the
blkg was being looked up and then thrown away in blkcg_bio_issue_check.

This patch begins the cleanup of bio->bi_css and bio->bi_blkg by always
associating a blkg in blkcg_bio_issue_check. It tries to create the
blkg, but if that is not possible, it falls back to using the
request_queue's root_blkg. Therefore, a bio will always be associated
with a blkg.

A missing definition for bio_associate_blkg is also added for parity in
bio.h.
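
Condensed from the hunk below, the association logic with its root
fallback looks like:

	if (!bio->bi_blkg) {		/* not already associated */
		blkcg = bio_blkcg(bio);
		if (!blkcg->css.parent) {
			bio_associate_blkg(bio, q->root_blkg);
		} else {
			blkg = blkg_lookup_create(blkcg, q);
			if (IS_ERR(blkg))
				blkg = q->root_blkg;	/* fallback */
			bio_associate_blkg(bio, blkg);
		}
	}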

Signed-off-by: Dennis Zhou <[email protected]>
---
block/bio.c | 41 ++++++++++++++++++++++++++++++++++++++
include/linux/bio.h | 5 +++++
include/linux/blk-cgroup.h | 19 ++----------------
3 files changed, 48 insertions(+), 17 deletions(-)

diff --git a/block/bio.c b/block/bio.c
index 09a31e4d46bb..e937f9681188 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -1999,6 +1999,44 @@ int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg)
return 0;
}

+/**
+ * bio_associate_create_blkg - associate a bio with a blkg from q
+ * @q: request_queue where bio is going
+ * @bio: target bio
+ *
+ * Associate @bio with the blkg found from the bio's css and the request_queue.
+ * If one is not found, bio_lookup_blkg creates the blkg.
+ */
+int bio_associate_create_blkg(struct request_queue *q, struct bio *bio)
+{
+ struct blkcg *blkcg;
+ struct blkcg_gq *blkg;
+ int ret = 0;
+
+ /* someone has already associated this bio with a blkg */
+ if (bio->bi_blkg)
+ return ret;
+
+ rcu_read_lock();
+
+ bio_associate_blkcg(bio, NULL);
+ blkcg = bio_blkcg(bio);
+
+ if (!blkcg->css.parent) {
+ ret = bio_associate_blkg(bio, q->root_blkg);
+ goto assoc_out;
+ }
+
+ blkg = blkg_lookup_create(blkcg, q);
+ if (IS_ERR(blkg))
+ blkg = q->root_blkg;
+
+ ret = bio_associate_blkg(bio, blkg);
+assoc_out:
+ rcu_read_unlock();
+ return ret;
+}
+
/**
* bio_disassociate_task - undo bio_associate_current()
* @bio: target bio
@@ -2028,6 +2066,9 @@ void bio_clone_blkcg_association(struct bio *dst, struct bio *src)
{
if (src->bi_css)
WARN_ON(bio_associate_blkcg(dst, src->bi_css));
+
+ if (src->bi_blkg)
+ bio_associate_blkg(dst, src->bi_blkg);
}
EXPORT_SYMBOL_GPL(bio_clone_blkcg_association);
#endif /* CONFIG_BLK_CGROUP */
diff --git a/include/linux/bio.h b/include/linux/bio.h
index 51371740d2a8..d4626108b9d7 100644
--- a/include/linux/bio.h
+++ b/include/linux/bio.h
@@ -556,11 +556,16 @@ static inline int bio_associate_blkcg_from_page(struct bio *bio,
#ifdef CONFIG_BLK_CGROUP
int bio_associate_blkcg(struct bio *bio, struct cgroup_subsys_state *blkcg_css);
int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg);
+int bio_associate_create_blkg(struct request_queue *q, struct bio *bio);
void bio_disassociate_task(struct bio *bio);
void bio_clone_blkcg_association(struct bio *dst, struct bio *src);
#else /* CONFIG_BLK_CGROUP */
static inline int bio_associate_blkcg(struct bio *bio,
struct cgroup_subsys_state *blkcg_css) { return 0; }
+static inline int bio_associate_blkg(struct bio *bio,
+ struct blkcg_gq *blkg) { return 0; }
+static inline int bio_associate_create_blkg(struct request_queue *q,
+ struct bio *bio) { return 0; }
static inline void bio_disassociate_task(struct bio *bio) { }
static inline void bio_clone_blkcg_association(struct bio *dst,
struct bio *src) { }
diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index ea2dd6e6baf2..9931ec2f4e9e 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -824,29 +824,15 @@ static inline bool blk_throtl_bio(struct request_queue *q, struct blkcg_gq *blkg
static inline bool blkcg_bio_issue_check(struct request_queue *q,
struct bio *bio)
{
- struct blkcg *blkcg;
struct blkcg_gq *blkg;
bool throtl = false;

- rcu_read_lock();
-
- /* associate blkcg if bio hasn't attached one */
- bio_associate_blkcg(bio, NULL);
- blkcg = bio_blkcg(bio);
-
- blkg = blkg_lookup(blkcg, q);
- if (unlikely(!blkg)) {
- spin_lock_irq(q->queue_lock);
- blkg = __blkg_lookup_create(blkcg, q);
- if (IS_ERR(blkg))
- blkg = NULL;
- spin_unlock_irq(q->queue_lock);
- }
+ bio_associate_create_blkg(q, bio);
+ blkg = bio->bi_blkg;

throtl = blk_throtl_bio(q, blkg, bio);

if (!throtl) {
- blkg = blkg ?: q->root_blkg;
/*
* If the bio is flagged with BIO_QUEUE_ENTERED it means this
* is a split bio and we would have already accounted for the
@@ -858,7 +844,6 @@ static inline bool blkcg_bio_issue_check(struct request_queue *q,
blkg_rwstat_add(&blkg->stat_ios, bio->bi_opf, 1);
}

- rcu_read_unlock();
return !throtl;
}

--
2.17.1


2018-08-31 01:57:02

by Dennis Zhou

Subject: [PATCH 07/15] blkcg: consolidate bio_issue_init and blkg association

From: "Dennis Zhou (Facebook)" <[email protected]>

This removes the now-duplicate association logic in blk-throttle and
blk-iolatency. bio_issue_init is moved into blkcg_bio_issue_check and
into the bio clone variants to allow for the future addition of a
latency moving average for IOs.

Signed-off-by: Dennis Zhou <[email protected]>
---
block/bio.c | 2 ++
block/blk-iolatency.c | 24 +-----------------------
block/blk-throttle.c | 13 +------------
block/bounce.c | 2 ++
include/linux/blk-cgroup.h | 2 ++
5 files changed, 8 insertions(+), 35 deletions(-)

diff --git a/block/bio.c b/block/bio.c
index e937f9681188..ab41f5b7eb1f 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -610,6 +610,8 @@ void __bio_clone_fast(struct bio *bio, struct bio *bio_src)
bio->bi_io_vec = bio_src->bi_io_vec;

bio_clone_blkcg_association(bio, bio_src);
+
+ bio_issue_init(&bio->bi_issue, bio_sectors(bio));
}
EXPORT_SYMBOL(__bio_clone_fast);

diff --git a/block/blk-iolatency.c b/block/blk-iolatency.c
index 22b2ff0440cc..9d7052bad6f7 100644
--- a/block/blk-iolatency.c
+++ b/block/blk-iolatency.c
@@ -395,34 +395,12 @@ static void blkcg_iolatency_throttle(struct rq_qos *rqos, struct bio *bio,
spinlock_t *lock)
{
struct blk_iolatency *blkiolat = BLKIOLATENCY(rqos);
- struct blkcg *blkcg;
- struct blkcg_gq *blkg;
- struct request_queue *q = rqos->q;
+ struct blkcg_gq *blkg = bio->bi_blkg;
bool issue_as_root = bio_issue_as_root_blkg(bio);

if (!blk_iolatency_enabled(blkiolat))
return;

- rcu_read_lock();
- bio_associate_blkcg(bio, NULL);
- blkcg = bio_blkcg(bio);
- blkg = blkg_lookup(blkcg, q);
- if (unlikely(!blkg)) {
- if (!lock)
- spin_lock_irq(q->queue_lock);
- blkg = __blkg_lookup_create(blkcg, q);
- if (IS_ERR(blkg))
- blkg = NULL;
- if (!lock)
- spin_unlock_irq(q->queue_lock);
- }
- if (!blkg)
- goto out;
-
- bio_issue_init(&bio->bi_issue, bio_sectors(bio));
- bio_associate_blkg(bio, blkg);
-out:
- rcu_read_unlock();
while (blkg && blkg->parent) {
struct iolatency_grp *iolat = blkg_to_lat(blkg);
if (!iolat) {
diff --git a/block/blk-throttle.c b/block/blk-throttle.c
index c626e1f7cdcd..f2b355338894 100644
--- a/block/blk-throttle.c
+++ b/block/blk-throttle.c
@@ -2126,21 +2126,11 @@ static inline void throtl_update_latency_buckets(struct throtl_data *td)
}
#endif

-static void blk_throtl_assoc_bio(struct throtl_grp *tg, struct bio *bio)
-{
-#ifdef CONFIG_BLK_DEV_THROTTLING_LOW
- /* fallback to root_blkg if we fail to get a blkg ref */
- if (bio->bi_css && bio_associate_blkg(bio, tg_to_blkg(tg)))
- bio_associate_blkg(bio, bio->bi_disk->queue->root_blkg);
- bio_issue_init(&bio->bi_issue, bio_sectors(bio));
-#endif
-}
-
bool blk_throtl_bio(struct request_queue *q, struct blkcg_gq *blkg,
struct bio *bio)
{
struct throtl_qnode *qn = NULL;
- struct throtl_grp *tg = blkg_to_tg(blkg ?: q->root_blkg);
+ struct throtl_grp *tg = blkg_to_tg(blkg);
struct throtl_service_queue *sq;
bool rw = bio_data_dir(bio);
bool throttled = false;
@@ -2159,7 +2149,6 @@ bool blk_throtl_bio(struct request_queue *q, struct blkcg_gq *blkg,
if (unlikely(blk_queue_bypass(q)))
goto out_unlock;

- blk_throtl_assoc_bio(tg, bio);
blk_throtl_update_idletime(tg);

sq = &tg->service_queue;
diff --git a/block/bounce.c b/block/bounce.c
index bc63b3a2d18c..bea3b0cbe4a7 100644
--- a/block/bounce.c
+++ b/block/bounce.c
@@ -259,6 +259,8 @@ static struct bio *bounce_clone_bio(struct bio *bio_src, gfp_t gfp_mask,

bio_clone_blkcg_association(bio, bio_src);

+ bio_issue_init(&bio->bi_issue, bio_sectors(bio));
+
return bio;
}

diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index 9931ec2f4e9e..55c348d66372 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -844,6 +844,8 @@ static inline bool blkcg_bio_issue_check(struct request_queue *q,
blkg_rwstat_add(&blkg->stat_ios, bio->bi_opf, 1);
}

+ bio_issue_init(&bio->bi_issue, bio_sectors(bio));
+
return !throtl;
}

--
2.17.1


2018-08-31 01:57:13

by Dennis Zhou

[permalink] [raw]
Subject: [PATCH 04/15] blkcg: fix ref count issue with bio_blkcg using task_css

From: "Dennis Zhou (Facebook)" <[email protected]>

The accessor function bio_blkcg either returns the blkcg associated with
the bio or finds one in the current context. This can cause an issue
when trying to associate a bio with a blkcg. Particularly, it's the
third case that is problematic:

return css_to_blkcg(task_css(current, io_cgrp_id));

As the above may race against task migration and the cgroup exiting, it
is not always ok to take a reference on the blkcg returned from
bio_blkcg.

This patch adds association ahead of calling bio_blkcg rather than
after, making association a required and explicit step along the code
paths that call bio_blkcg. blk_get_rl is modified as well to take a
reference to the blkcg it may use, and blk_put_rl will always put the
reference back. Association is also moved above the bio_blkcg call in
blk-iolatency to ensure it will not return NULL.
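
As a sketch of the calling convention this establishes (the concrete
call sites are in the diff below):

	rcu_read_lock();

	/* associate first; a NULL css asks it to find the current one */
	bio_associate_blkcg(bio, NULL);

	/* guaranteed non-NULL now that association has been done */
	blkcg = bio_blkcg(bio);

	/* ... look up and use the blkg under RCU ... */

	rcu_read_unlock();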

Signed-off-by: Dennis Zhou <[email protected]>
---
block/bio.c | 10 +++++--
block/blk-iolatency.c | 2 +-
include/linux/blk-cgroup.h | 53 ++++++++++++++++++++++++++++++++------
3 files changed, 54 insertions(+), 11 deletions(-)

diff --git a/block/bio.c b/block/bio.c
index 4473ccd22987..09a31e4d46bb 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -1962,13 +1962,19 @@ int bio_associate_blkcg_from_page(struct bio *bio, struct page *page)
*
* This function takes an extra reference of @blkcg_css which will be put
* when @bio is released. The caller must own @bio and is responsible for
- * synchronizing calls to this function.
+ * synchronizing calls to this function. If @blkcg_css is NULL, a call to
+ * blkcg_get_css finds the current css from the kthread or task.
*/
int bio_associate_blkcg(struct bio *bio, struct cgroup_subsys_state *blkcg_css)
{
if (unlikely(bio->bi_css))
return -EBUSY;
- css_get(blkcg_css);
+
+ if (blkcg_css)
+ css_get(blkcg_css);
+ else
+ blkcg_css = blkcg_get_css();
+
bio->bi_css = blkcg_css;
return 0;
}
diff --git a/block/blk-iolatency.c b/block/blk-iolatency.c
index 19923f8a029d..62fdd9002c29 100644
--- a/block/blk-iolatency.c
+++ b/block/blk-iolatency.c
@@ -404,8 +404,8 @@ static void blkcg_iolatency_throttle(struct rq_qos *rqos, struct bio *bio,
return;

rcu_read_lock();
+ bio_associate_blkcg(bio, NULL);
blkcg = bio_blkcg(bio);
- bio_associate_blkcg(bio, &blkcg->css);
blkg = blkg_lookup(blkcg, q);
if (unlikely(!blkg)) {
if (!lock)
diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index c7386464ec4c..d3cafb1eda48 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -230,22 +230,52 @@ int blkg_conf_prep(struct blkcg *blkcg, const struct blkcg_policy *pol,
char *input, struct blkg_conf_ctx *ctx);
void blkg_conf_finish(struct blkg_conf_ctx *ctx);

+/**
+ * blkcg_get_css - find and get a reference to the css
+ *
+ * Find the css associated with either the kthread or the current task.
+ */
+static inline struct cgroup_subsys_state *blkcg_get_css(void)
+{
+ struct cgroup_subsys_state *css;
+
+ rcu_read_lock();
+
+ css = kthread_blkcg();
+ if (css) {
+ css_get(css);
+ } else {
+ while (true) {
+ css = task_css(current, io_cgrp_id);
+ if (likely(css_tryget(css)))
+ break;
+ cpu_relax();
+ }
+ }
+
+ rcu_read_unlock();
+
+ return css;
+}

static inline struct blkcg *css_to_blkcg(struct cgroup_subsys_state *css)
{
return css ? container_of(css, struct blkcg, css) : NULL;
}

+/**
+ * bio_blkcg - grab the blkcg associated with a bio
+ * @bio: target bio
+ *
+ * This returns the blkcg associated with a bio, NULL if not associated.
+ * Callers are expected to either handle NULL or know association has been
+ * done prior to calling this.
+ */
static inline struct blkcg *bio_blkcg(struct bio *bio)
{
- struct cgroup_subsys_state *css;
-
if (bio && bio->bi_css)
return css_to_blkcg(bio->bi_css);
- css = kthread_blkcg();
- if (css)
- return css_to_blkcg(css);
- return css_to_blkcg(task_css(current, io_cgrp_id));
+ return NULL;
}

static inline bool blk_cgroup_congested(void)
@@ -519,6 +549,11 @@ static inline struct request_list *blk_get_rl(struct request_queue *q,
rcu_read_lock();

blkcg = bio_blkcg(bio);
+ if (blkcg) {
+ css_get(&blkcg->css);
+ } else {
+ blkcg = css_to_blkcg(blkcg_get_css());
+ }

/* bypass blkg lookup and use @q->root_rl directly for root */
if (blkcg == &blkcg_root)
@@ -550,6 +585,8 @@ static inline struct request_list *blk_get_rl(struct request_queue *q,
*/
static inline void blk_put_rl(struct request_list *rl)
{
+ /* an additional ref is always taken for rl */
+ css_put(&rl->blkg->blkcg->css);
if (rl->blkg->blkcg != &blkcg_root)
blkg_put(rl->blkg);
}
@@ -790,10 +827,10 @@ static inline bool blkcg_bio_issue_check(struct request_queue *q,
bool throtl = false;

rcu_read_lock();
- blkcg = bio_blkcg(bio);

/* associate blkcg if bio hasn't attached one */
- bio_associate_blkcg(bio, &blkcg->css);
+ bio_associate_blkcg(bio, NULL);
+ blkcg = bio_blkcg(bio);

blkg = blkg_lookup(blkcg, q);
if (unlikely(!blkg)) {
--
2.17.1


2018-08-31 01:57:21

by Dennis Zhou

[permalink] [raw]
Subject: [PATCH 01/15] Revert "blk-throttle: fix race between blkcg_bio_issue_check() and cgroup_rmdir()"

From: "Dennis Zhou (Facebook)" <[email protected]>

This reverts commit 4c6994806f708559c2812b73501406e21ae5dcd0.

Destroying blkgs is tricky because of the nature of the relationship. A
blkg should go away when either a blkcg or a request_queue goes away.
However, blkgs pin the blkcg to ensure it remains valid. To break this
cycle, when a blkcg is offlined, blkgs put back their css ref. This
eventually lets css_free() get called which frees the blkcg.

The above commit (4c6994806f70) breaks this order of events by trying to
destroy blkgs in css_free(). As the blkgs still hold references to the
blkcg, css_free() is never called.

The race between blkcg_bio_issue_check() and cgroup_rmdir() will be
addressed in the following patch by delaying destruction of a blkg until
all writeback associated with the blkcg has been finished.
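
To make the intended ordering concrete, here is a minimal sketch of the
reference cycle and how it is meant to be broken (an illustrative
pseudo-sequence, not literal kernel code):

	/* creation: each blkg pins its blkcg's css */
	css_get(&blkcg->css);

	/* offline: destroying a blkg puts that css ref back */
	blkg_destroy(blkg);	/* leads to css_put(&blkcg->css) */

	/* only after every blkg ref is returned can this run */
	blkcg_css_free(css);	/* finally frees the blkcg */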

Fixes: 4c6994806f70 ("blk-throttle: fix race between blkcg_bio_issue_check() and cgroup_rmdir()")
Signed-off-by: Dennis Zhou <[email protected]>
Cc: Jiufei Xue <[email protected]>
Cc: Joseph Qi <[email protected]>
Cc: Tejun Heo <[email protected]>
Cc: Jens Axboe <[email protected]>
---
block/blk-cgroup.c | 78 ++++++++------------------------------
include/linux/blk-cgroup.h | 1 -
2 files changed, 16 insertions(+), 63 deletions(-)

diff --git a/block/blk-cgroup.c b/block/blk-cgroup.c
index 694595b29b8f..2998e4f095d1 100644
--- a/block/blk-cgroup.c
+++ b/block/blk-cgroup.c
@@ -310,28 +310,11 @@ struct blkcg_gq *blkg_lookup_create(struct blkcg *blkcg,
}
}

-static void blkg_pd_offline(struct blkcg_gq *blkg)
-{
- int i;
-
- lockdep_assert_held(blkg->q->queue_lock);
- lockdep_assert_held(&blkg->blkcg->lock);
-
- for (i = 0; i < BLKCG_MAX_POLS; i++) {
- struct blkcg_policy *pol = blkcg_policy[i];
-
- if (blkg->pd[i] && !blkg->pd[i]->offline &&
- pol->pd_offline_fn) {
- pol->pd_offline_fn(blkg->pd[i]);
- blkg->pd[i]->offline = true;
- }
- }
-}
-
static void blkg_destroy(struct blkcg_gq *blkg)
{
struct blkcg *blkcg = blkg->blkcg;
struct blkcg_gq *parent = blkg->parent;
+ int i;

lockdep_assert_held(blkg->q->queue_lock);
lockdep_assert_held(&blkcg->lock);
@@ -340,6 +323,13 @@ static void blkg_destroy(struct blkcg_gq *blkg)
WARN_ON_ONCE(list_empty(&blkg->q_node));
WARN_ON_ONCE(hlist_unhashed(&blkg->blkcg_node));

+ for (i = 0; i < BLKCG_MAX_POLS; i++) {
+ struct blkcg_policy *pol = blkcg_policy[i];
+
+ if (blkg->pd[i] && pol->pd_offline_fn)
+ pol->pd_offline_fn(blkg->pd[i]);
+ }
+
if (parent) {
blkg_rwstat_add_aux(&parent->stat_bytes, &blkg->stat_bytes);
blkg_rwstat_add_aux(&parent->stat_ios, &blkg->stat_ios);
@@ -382,7 +372,6 @@ static void blkg_destroy_all(struct request_queue *q)
struct blkcg *blkcg = blkg->blkcg;

spin_lock(&blkcg->lock);
- blkg_pd_offline(blkg);
blkg_destroy(blkg);
spin_unlock(&blkcg->lock);
}
@@ -1058,54 +1047,21 @@ static struct cftype blkcg_legacy_files[] = {
* @css: css of interest
*
* This function is called when @css is about to go away and responsible
- * for offlining all blkgs pd and killing all wbs associated with @css.
- * blkgs pd offline should be done while holding both q and blkcg locks.
- * As blkcg lock is nested inside q lock, this function performs reverse
- * double lock dancing.
+ * for shooting down all blkgs associated with @css. blkgs should be
+ * removed while holding both q and blkcg locks. As blkcg lock is nested
+ * inside q lock, this function performs reverse double lock dancing.
*
* This is the blkcg counterpart of ioc_release_fn().
*/
static void blkcg_css_offline(struct cgroup_subsys_state *css)
{
struct blkcg *blkcg = css_to_blkcg(css);
- struct blkcg_gq *blkg;

spin_lock_irq(&blkcg->lock);

- hlist_for_each_entry(blkg, &blkcg->blkg_list, blkcg_node) {
- struct request_queue *q = blkg->q;
-
- if (spin_trylock(q->queue_lock)) {
- blkg_pd_offline(blkg);
- spin_unlock(q->queue_lock);
- } else {
- spin_unlock_irq(&blkcg->lock);
- cpu_relax();
- spin_lock_irq(&blkcg->lock);
- }
- }
-
- spin_unlock_irq(&blkcg->lock);
-
- wb_blkcg_offline(blkcg);
-}
-
-/**
- * blkcg_destroy_all_blkgs - destroy all blkgs associated with a blkcg
- * @blkcg: blkcg of interest
- *
- * This function is called when blkcg css is about to free and responsible for
- * destroying all blkgs associated with @blkcg.
- * blkgs should be removed while holding both q and blkcg locks. As blkcg lock
- * is nested inside q lock, this function performs reverse double lock dancing.
- */
-static void blkcg_destroy_all_blkgs(struct blkcg *blkcg)
-{
- spin_lock_irq(&blkcg->lock);
while (!hlist_empty(&blkcg->blkg_list)) {
struct blkcg_gq *blkg = hlist_entry(blkcg->blkg_list.first,
- struct blkcg_gq,
- blkcg_node);
+ struct blkcg_gq, blkcg_node);
struct request_queue *q = blkg->q;

if (spin_trylock(q->queue_lock)) {
@@ -1117,7 +1073,10 @@ static void blkcg_destroy_all_blkgs(struct blkcg *blkcg)
spin_lock_irq(&blkcg->lock);
}
}
+
spin_unlock_irq(&blkcg->lock);
+
+ wb_blkcg_offline(blkcg);
}

static void blkcg_css_free(struct cgroup_subsys_state *css)
@@ -1125,8 +1084,6 @@ static void blkcg_css_free(struct cgroup_subsys_state *css)
struct blkcg *blkcg = css_to_blkcg(css);
int i;

- blkcg_destroy_all_blkgs(blkcg);
-
mutex_lock(&blkcg_pol_mutex);

list_del(&blkcg->all_blkcgs_node);
@@ -1480,11 +1437,8 @@ void blkcg_deactivate_policy(struct request_queue *q,

list_for_each_entry(blkg, &q->blkg_list, q_node) {
if (blkg->pd[pol->plid]) {
- if (!blkg->pd[pol->plid]->offline &&
- pol->pd_offline_fn) {
+ if (pol->pd_offline_fn)
pol->pd_offline_fn(blkg->pd[pol->plid]);
- blkg->pd[pol->plid]->offline = true;
- }
pol->pd_free_fn(blkg->pd[pol->plid]);
blkg->pd[pol->plid] = NULL;
}
diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index 34aec30e06c7..1615cdd4c797 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -89,7 +89,6 @@ struct blkg_policy_data {
/* the blkg and policy id this per-policy data belongs to */
struct blkcg_gq *blkg;
int plid;
- bool offline;
};

/*
--
2.17.1


2018-08-31 01:57:34

by Dennis Zhou

[permalink] [raw]
Subject: [PATCH 05/15] blkcg: update blkg_lookup_create to do locking

From: "Dennis Zhou (Facebook)" <[email protected]>

To know when to create a blkg, the general pattern is to do a
blkg_lookup; if that fails, take the lock, look up again, and if that
also fails, finally create. It doesn't make much sense for every call
site that wants creation to write this out itself.

This changes blkg_lookup_create to do the locking and implement this
pattern. The old blkg_lookup_create is renamed to __blkg_lookup_create.
If a call site wants to do its own error handling or already owns the
queue lock, it can use __blkg_lookup_create.
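
With this, a typical creation call site collapses to something like the
following sketch (the root fallback shown is only one way a caller
might handle the error; it is not mandated by this patch):

	blkg = blkg_lookup_create(blkcg, q);
	if (IS_ERR(blkg))
		blkg = q->root_blkg;	/* or propagate the error */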

Signed-off-by: Dennis Zhou <[email protected]>
---
block/blk-cgroup.c | 31 ++++++++++++++++++++++++++++---
block/blk-iolatency.c | 2 +-
include/linux/blk-cgroup.h | 4 +++-
3 files changed, 32 insertions(+), 5 deletions(-)

diff --git a/block/blk-cgroup.c b/block/blk-cgroup.c
index d7114308a480..f678cd555814 100644
--- a/block/blk-cgroup.c
+++ b/block/blk-cgroup.c
@@ -259,7 +259,7 @@ static struct blkcg_gq *blkg_create(struct blkcg *blkcg,
}

/**
- * blkg_lookup_create - lookup blkg, try to create one if not there
+ * __blkg_lookup_create - lookup blkg, try to create one if not there
* @blkcg: blkcg of interest
* @q: request_queue of interest
*
@@ -272,8 +272,8 @@ static struct blkcg_gq *blkg_create(struct blkcg *blkcg,
* value on error. If @q is dead, returns ERR_PTR(-EINVAL). If @q is not
* dead and bypassing, returns ERR_PTR(-EBUSY).
*/
-struct blkcg_gq *blkg_lookup_create(struct blkcg *blkcg,
- struct request_queue *q)
+struct blkcg_gq *__blkg_lookup_create(struct blkcg *blkcg,
+ struct request_queue *q)
{
struct blkcg_gq *blkg;

@@ -310,6 +310,31 @@ struct blkcg_gq *blkg_lookup_create(struct blkcg *blkcg,
}
}

+/**
+ * blkg_lookup_create - find or create a blkg
+ * @blkcg: target block cgroup
+ * @q: target request_queue
+ *
+ * This looks up or creates the blkg representing the unique pair
+ * of the blkcg and the request_queue.
+ */
+struct blkcg_gq *blkg_lookup_create(struct blkcg *blkcg,
+ struct request_queue *q)
+{
+ struct blkcg_gq *blkg = blkg_lookup(blkcg, q);
+ unsigned long flags;
+
+ if (unlikely(!blkg)) {
+ spin_lock_irqsave(q->queue_lock, flags);
+
+ blkg = __blkg_lookup_create(blkcg, q);
+
+ spin_unlock_irqrestore(q->queue_lock, flags);
+ }
+
+ return blkg;
+}
+
static void blkg_destroy(struct blkcg_gq *blkg)
{
struct blkcg *blkcg = blkg->blkcg;
diff --git a/block/blk-iolatency.c b/block/blk-iolatency.c
index 62fdd9002c29..22b2ff0440cc 100644
--- a/block/blk-iolatency.c
+++ b/block/blk-iolatency.c
@@ -410,7 +410,7 @@ static void blkcg_iolatency_throttle(struct rq_qos *rqos, struct bio *bio,
if (unlikely(!blkg)) {
if (!lock)
spin_lock_irq(q->queue_lock);
- blkg = blkg_lookup_create(blkcg, q);
+ blkg = __blkg_lookup_create(blkcg, q);
if (IS_ERR(blkg))
blkg = NULL;
if (!lock)
diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index d3cafb1eda48..ea2dd6e6baf2 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -184,6 +184,8 @@ extern struct cgroup_subsys_state * const blkcg_root_css;

struct blkcg_gq *blkg_lookup_slowpath(struct blkcg *blkcg,
struct request_queue *q, bool update_hint);
+struct blkcg_gq *__blkg_lookup_create(struct blkcg *blkcg,
+ struct request_queue *q);
struct blkcg_gq *blkg_lookup_create(struct blkcg *blkcg,
struct request_queue *q);
int blkcg_init_queue(struct request_queue *q);
@@ -835,7 +837,7 @@ static inline bool blkcg_bio_issue_check(struct request_queue *q,
blkg = blkg_lookup(blkcg, q);
if (unlikely(!blkg)) {
spin_lock_irq(q->queue_lock);
- blkg = blkg_lookup_create(blkcg, q);
+ blkg = __blkg_lookup_create(blkcg, q);
if (IS_ERR(blkg))
blkg = NULL;
spin_unlock_irq(q->queue_lock);
--
2.17.1


2018-08-31 01:57:49

by Dennis Zhou

[permalink] [raw]
Subject: [PATCH 02/15] blkcg: delay blkg destruction until after writeback has finished

From: "Dennis Zhou (Facebook)" <[email protected]>

Currently, blkcg destruction relies on a sequence of events:
1. Destruction starts. blkcg_css_offline() is called and blkgs
release their reference to the blkcg. This immediately destroys
the cgwbs (writeback).
2. With blkgs giving up their reference, the blkcg ref count should
become zero and eventually call blkcg_css_free() which finally
frees the blkcg.

Jiufei Xue reported that there is a race between blkcg_bio_issue_check()
and cgroup_rmdir(). To remedy this, blkg destruction becomes contingent
on the completion of all writeback associated with the blkcg. A count of
the number of cgwbs is maintained and once that goes to zero, blkg
destruction can follow. This should prevent premature blkg destruction.

The new process for blkcg cleanup is as follows:
1. Destruction starts. blkcg_css_offline() is called which offlines
writeback. Blkg destruction is delayed on the nr_cgwbs count to
avoid punting potentially large amounts of outstanding writeback
to root while maintaining any ongoing policies.
2. When the nr_cgwbs becomes zero, blkcg_destroy_blkgs() is called and
handles destruction of blkgs. This is where the css reference held
by each blkg is released.
3. Once the blkcg ref count goes to zero, blkcg_css_free() is called.
This finally frees the blkcg.

It seems that in the past blk-throttle did something hard to follow:
it took data from one blkg while associating with a potentially
different one via current. The simplification and unification of what
blk-throttle does is what exposed this.

Fixes: 08e18eab0c579 ("block: add bi_blkg to the bio for cgroups")
Signed-off-by: Dennis Zhou <[email protected]>
Cc: Jiufei Xue <[email protected]>
Cc: Joseph Qi <[email protected]>
Cc: Tejun Heo <[email protected]>
Cc: Josef Bacik <[email protected]>
Cc: Jens Axboe <[email protected]>
---
block/blk-cgroup.c | 53 ++++++++++++++++++++++++++++++++------
include/linux/blk-cgroup.h | 29 +++++++++++++++++++++
mm/backing-dev.c | 5 ++++
3 files changed, 79 insertions(+), 8 deletions(-)

diff --git a/block/blk-cgroup.c b/block/blk-cgroup.c
index 2998e4f095d1..d7114308a480 100644
--- a/block/blk-cgroup.c
+++ b/block/blk-cgroup.c
@@ -1042,21 +1042,59 @@ static struct cftype blkcg_legacy_files[] = {
{ } /* terminate */
};

+/*
+ * blkcg destruction is a three-stage process.
+ *
+ * 1. Destruction starts. The blkcg_css_offline() callback is invoked
+ * which offlines writeback. Here we tie the next stage of blkg destruction
+ * to the completion of writeback associated with the blkcg. This lets us
+ * avoid punting potentially large amounts of outstanding writeback to root
+ * while maintaining any ongoing policies. The next stage is triggered when
+ * the nr_cgwbs count goes to zero.
+ *
+ * 2. When the nr_cgwbs count goes to zero, blkcg_destroy_blkgs() is called
+ * and handles the destruction of blkgs. Here the css reference held by
+ * the blkg is put back eventually allowing blkcg_css_free() to be called.
+ * This work may occur in cgwb_release_workfn() on the cgwb_release
+ * workqueue. Any submitted ios that fail to get the blkg ref will be
+ * punted to the root_blkg.
+ *
+ * 3. Once the blkcg ref count goes to zero, blkcg_css_free() is called.
+ * This finally frees the blkcg.
+ */
+
/**
* blkcg_css_offline - cgroup css_offline callback
* @css: css of interest
*
- * This function is called when @css is about to go away and responsible
- * for shooting down all blkgs associated with @css. blkgs should be
- * removed while holding both q and blkcg locks. As blkcg lock is nested
- * inside q lock, this function performs reverse double lock dancing.
- *
- * This is the blkcg counterpart of ioc_release_fn().
+ * This function is called when @css is about to go away. Here the cgwbs are
+ * offlined first and only once writeback associated with the blkcg has
+ * finished do we start step 2 (see above).
*/
static void blkcg_css_offline(struct cgroup_subsys_state *css)
{
struct blkcg *blkcg = css_to_blkcg(css);

+ /* this prevents anyone from attaching or migrating to this blkcg */
+ wb_blkcg_offline(blkcg);
+
+ /* allow the count to go to zero */
+ blkcg_cgwb_dec(blkcg);
+}
+
+/**
+ * blkcg_destroy_blkgs - responsible for shooting down blkgs
+ * @blkcg: blkcg of interest
+ *
+ * blkgs should be removed while holding both q and blkcg locks. As blkcg lock
+ * is nested inside q lock, this function performs reverse double lock dancing.
+ * Destroying the blkgs releases the reference held on the blkcg's css allowing
+ * blkcg_css_free to eventually be called.
+ *
+ * This is the blkcg counterpart of ioc_release_fn().
+ */
+void blkcg_destroy_blkgs(struct blkcg *blkcg)
+{
spin_lock_irq(&blkcg->lock);

while (!hlist_empty(&blkcg->blkg_list)) {
@@ -1075,8 +1113,6 @@ static void blkcg_css_offline(struct cgroup_subsys_state *css)
}

spin_unlock_irq(&blkcg->lock);
-
- wb_blkcg_offline(blkcg);
}

static void blkcg_css_free(struct cgroup_subsys_state *css)
@@ -1146,6 +1182,7 @@ blkcg_css_alloc(struct cgroup_subsys_state *parent_css)
INIT_HLIST_HEAD(&blkcg->blkg_list);
#ifdef CONFIG_CGROUP_WRITEBACK
INIT_LIST_HEAD(&blkcg->cgwb_list);
+ atomic_set(&blkcg->nr_cgwbs, 1);
#endif
list_add_tail(&blkcg->all_blkcgs_node, &all_blkcgs);

diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
index 1615cdd4c797..c7386464ec4c 100644
--- a/include/linux/blk-cgroup.h
+++ b/include/linux/blk-cgroup.h
@@ -56,6 +56,7 @@ struct blkcg {
struct list_head all_blkcgs_node;
#ifdef CONFIG_CGROUP_WRITEBACK
struct list_head cgwb_list;
+ atomic_t nr_cgwbs;
#endif
};

@@ -386,6 +387,34 @@ static inline struct blkcg *cpd_to_blkcg(struct blkcg_policy_data *cpd)
return cpd ? cpd->blkcg : NULL;
}

+/**
+ * blkcg_cgwb_inc - increment the count for cgwb_list
+ * @blkcg: blkcg of interest
+ *
+ * This is used to count the number of active wbs related to a blkcg.
+ */
+static inline void blkcg_cgwb_inc(struct blkcg *blkcg)
+{
+ atomic_inc(&blkcg->nr_cgwbs);
+}
+
+extern void blkcg_destroy_blkgs(struct blkcg *blkcg);
+
+/**
+ * blkcg_cgwb_dec - decrement the count for cgwb_list
+ * @blkcg: blkcg of interest
+ *
+ * This is used to count the number of active wbs related to a blkcg.
+ * When this count goes to zero, all active wbs have finished, so the
+ * blkcg can be destroyed; this triggers blkg destruction when
+ * nr_cgwbs drops to zero.
+ */
+static inline void blkcg_cgwb_dec(struct blkcg *blkcg)
+{
+ if (atomic_dec_and_test(&blkcg->nr_cgwbs))
+ blkcg_destroy_blkgs(blkcg);
+}
+
/**
* blkg_path - format cgroup path of blkg
* @blkg: blkg of interest
diff --git a/mm/backing-dev.c b/mm/backing-dev.c
index 2e5d3df0853d..92342d38f0c6 100644
--- a/mm/backing-dev.c
+++ b/mm/backing-dev.c
@@ -494,6 +494,7 @@ static void cgwb_release_workfn(struct work_struct *work)
{
struct bdi_writeback *wb = container_of(work, struct bdi_writeback,
release_work);
+ struct blkcg *blkcg = css_to_blkcg(wb->blkcg_css);

mutex_lock(&wb->bdi->cgwb_release_mutex);
wb_shutdown(wb);
@@ -502,6 +503,9 @@ static void cgwb_release_workfn(struct work_struct *work)
css_put(wb->blkcg_css);
mutex_unlock(&wb->bdi->cgwb_release_mutex);

+ /* this triggers destruction of blkgs if nr_cgwbs becomes zero */
+ blkcg_cgwb_dec(blkcg);
+
fprop_local_destroy_percpu(&wb->memcg_completions);
percpu_ref_exit(&wb->refcnt);
wb_exit(wb);
@@ -600,6 +604,7 @@ static int cgwb_create(struct backing_dev_info *bdi,
list_add_tail_rcu(&wb->bdi_node, &bdi->wb_list);
list_add(&wb->memcg_node, memcg_cgwb_list);
list_add(&wb->blkcg_node, blkcg_cgwb_list);
+ blkcg_cgwb_inc(blkcg);
css_get(memcg_css);
css_get(blkcg_css);
}
--
2.17.1


2018-08-31 01:57:51

by Dennis Zhou

[permalink] [raw]
Subject: [PATCH 03/15] blkcg: use tryget logic when associating a blkg with a bio

From: "Dennis Zhou (Facebook)" <[email protected]>

There is a very small chance a bio gets caught up in a really
unfortunate race between a task migration, a cgroup exiting, and itself
trying to associate with a blkg. This is due to css offlining being
performed after the css->refcnt is killed, which triggers removal of
blkgs whose blkg->refcnt reaches 0.

To avoid this, association with a blkg should use tryget and fallback to
using the root_blkg.

Fixes: 08e18eab0c579 ("block: add bi_blkg to the bio for cgroups")
Signed-off-by: Dennis Zhou <[email protected]>
Cc: Jiufei Xue <[email protected]>
Cc: Joseph Qi <[email protected]>
Cc: Tejun Heo <[email protected]>
Cc: Josef Bacik <[email protected]>
Cc: Jens Axboe <[email protected]>
---
block/bio.c | 3 ++-
block/blk-throttle.c | 5 +++--
2 files changed, 5 insertions(+), 3 deletions(-)

diff --git a/block/bio.c b/block/bio.c
index 04969b392c72..4473ccd22987 100644
--- a/block/bio.c
+++ b/block/bio.c
@@ -1987,7 +1987,8 @@ int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg)
{
if (unlikely(bio->bi_blkg))
return -EBUSY;
- blkg_get(blkg);
+ if (!blkg_try_get(blkg))
+ return -ENODEV;
bio->bi_blkg = blkg;
return 0;
}
diff --git a/block/blk-throttle.c b/block/blk-throttle.c
index a3eede00d302..c626e1f7cdcd 100644
--- a/block/blk-throttle.c
+++ b/block/blk-throttle.c
@@ -2129,8 +2129,9 @@ static inline void throtl_update_latency_buckets(struct throtl_data *td)
static void blk_throtl_assoc_bio(struct throtl_grp *tg, struct bio *bio)
{
#ifdef CONFIG_BLK_DEV_THROTTLING_LOW
- if (bio->bi_css)
- bio_associate_blkg(bio, tg_to_blkg(tg));
+ /* fallback to root_blkg if we fail to get a blkg ref */
+ if (bio->bi_css && bio_associate_blkg(bio, tg_to_blkg(tg)))
+ bio_associate_blkg(bio, bio->bi_disk->queue->root_blkg);
bio_issue_init(&bio->bi_issue, bio_sectors(bio));
#endif
}
--
2.17.1


2018-08-31 09:04:10

by kernel test robot

[permalink] [raw]
Subject: Re: [PATCH 06/15] blkcg: always associate a bio with a blkg

Hi Dennis,

Thank you for the patch! Perhaps something to improve:

[auto build test WARNING on block/for-next]
[also build test WARNING on v4.19-rc1 next-20180831]
[if your patch is applied to the wrong git tree, please drop us a note to help improve the system]

url: https://github.com/0day-ci/linux/commits/Dennis-Zhou/blkcg-ref-count-refactor-cleanup-blkcg-avg_lat/20180831-161742
base: https://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux-block.git for-next
config: x86_64-randconfig-x019-201834 (attached as .config)
compiler: gcc-7 (Debian 7.3.0-16) 7.3.0
reproduce:
# save the attached .config to linux build tree
make ARCH=x86_64

All warnings (new ones prefixed by >>):

In file included from include/linux/blkdev.h:21:0,
from block//partitions/check.h:3,
from block//partitions/check.c:22:
>> include/linux/bio.h:566:17: warning: 'struct blkcg_gq' declared inside parameter list will not be visible outside of this definition or declaration
struct blkcg_gq *blkg) { return 0; }
^~~~~~~~

vim +566 include/linux/bio.h

555
556 #ifdef CONFIG_BLK_CGROUP
557 int bio_associate_blkcg(struct bio *bio, struct cgroup_subsys_state *blkcg_css);
558 int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg);
559 int bio_associate_create_blkg(struct request_queue *q, struct bio *bio);
560 void bio_disassociate_task(struct bio *bio);
561 void bio_clone_blkcg_association(struct bio *dst, struct bio *src);
562 #else /* CONFIG_BLK_CGROUP */
563 static inline int bio_associate_blkcg(struct bio *bio,
564 struct cgroup_subsys_state *blkcg_css) { return 0; }
565 static inline int bio_associate_blkg(struct bio *bio,
> 566 struct blkcg_gq *blkg) { return 0; }
567 static inline int bio_associate_create_blkg(struct request_queue *q,
568 struct bio *bio) { return 0; }
569 static inline void bio_disassociate_task(struct bio *bio) { }
570 static inline void bio_clone_blkcg_association(struct bio *dst,
571 struct bio *src) { }
572 #endif /* CONFIG_BLK_CGROUP */
573

---
0-DAY kernel test infrastructure Open Source Technology Center
https://lists.01.org/pipermail/kbuild-all Intel Corporation


2018-08-31 09:33:06

by kernel test robot

[permalink] [raw]
Subject: Re: [PATCH 07/15] blkcg: consolidate bio_issue_init and blkg association

Hi Dennis,

Thank you for the patch! Yet something to improve:

[auto build test ERROR on block/for-next]
[also build test ERROR on v4.19-rc1 next-20180831]
[if your patch is applied to the wrong git tree, please drop us a note to help improve the system]

url: https://github.com/0day-ci/linux/commits/Dennis-Zhou/blkcg-ref-count-refactor-cleanup-blkcg-avg_lat/20180831-161742
base: https://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux-block.git for-next
config: x86_64-randconfig-x017-201834 (attached as .config)
compiler: gcc-7 (Debian 7.3.0-16) 7.3.0
reproduce:
# save the attached .config to linux build tree
make ARCH=x86_64

All errors (new ones prefixed by >>):

In file included from block/bounce.c:13:0:
include/linux/bio.h:566:17: warning: 'struct blkcg_gq' declared inside parameter list will not be visible outside of this definition or declaration
struct blkcg_gq *blkg) { return 0; }
^~~~~~~~
block/bounce.c: In function 'bounce_clone_bio':
>> block/bounce.c:262:23: error: 'struct bio' has no member named 'bi_issue'; did you mean 'bi_disk'?
bio_issue_init(&bio->bi_issue, bio_sectors(bio));
^~~~~~~~
bi_disk

vim +262 block/bounce.c

197
198 static struct bio *bounce_clone_bio(struct bio *bio_src, gfp_t gfp_mask,
199 struct bio_set *bs)
200 {
201 struct bvec_iter iter;
202 struct bio_vec bv;
203 struct bio *bio;
204
205 /*
206 * Pre immutable biovecs, __bio_clone() used to just do a memcpy from
207 * bio_src->bi_io_vec to bio->bi_io_vec.
208 *
209 * We can't do that anymore, because:
210 *
211 * - The point of cloning the biovec is to produce a bio with a biovec
212 * the caller can modify: bi_idx and bi_bvec_done should be 0.
213 *
214 * - The original bio could've had more than BIO_MAX_PAGES biovecs; if
215 * we tried to clone the whole thing bio_alloc_bioset() would fail.
216 * But the clone should succeed as long as the number of biovecs we
217 * actually need to allocate is fewer than BIO_MAX_PAGES.
218 *
219 * - Lastly, bi_vcnt should not be looked at or relied upon by code
220 * that does not own the bio - reason being drivers don't use it for
221 * iterating over the biovec anymore, so expecting it to be kept up
222 * to date (i.e. for clones that share the parent biovec) is just
223 * asking for trouble and would force extra work on
224 * __bio_clone_fast() anyways.
225 */
226
227 bio = bio_alloc_bioset(gfp_mask, bio_segments(bio_src), bs);
228 if (!bio)
229 return NULL;
230 bio->bi_disk = bio_src->bi_disk;
231 bio->bi_opf = bio_src->bi_opf;
232 bio->bi_write_hint = bio_src->bi_write_hint;
233 bio->bi_iter.bi_sector = bio_src->bi_iter.bi_sector;
234 bio->bi_iter.bi_size = bio_src->bi_iter.bi_size;
235
236 switch (bio_op(bio)) {
237 case REQ_OP_DISCARD:
238 case REQ_OP_SECURE_ERASE:
239 case REQ_OP_WRITE_ZEROES:
240 break;
241 case REQ_OP_WRITE_SAME:
242 bio->bi_io_vec[bio->bi_vcnt++] = bio_src->bi_io_vec[0];
243 break;
244 default:
245 bio_for_each_segment(bv, bio_src, iter)
246 bio->bi_io_vec[bio->bi_vcnt++] = bv;
247 break;
248 }
249
250 if (bio_integrity(bio_src)) {
251 int ret;
252
253 ret = bio_integrity_clone(bio, bio_src, gfp_mask);
254 if (ret < 0) {
255 bio_put(bio);
256 return NULL;
257 }
258 }
259
260 bio_clone_blkcg_association(bio, bio_src);
261
> 262 bio_issue_init(&bio->bi_issue, bio_sectors(bio));
263
264 return bio;
265 }
266

---
0-DAY kernel test infrastructure Open Source Technology Center
https://lists.01.org/pipermail/kbuild-all Intel Corporation


2018-08-31 10:07:15

by kernel test robot

[permalink] [raw]
Subject: Re: [PATCH 06/15] blkcg: always associate a bio with a blkg

Hi Dennis,

Thank you for the patch! Perhaps something to improve:

[auto build test WARNING on block/for-next]
[also build test WARNING on v4.19-rc1 next-20180831]
[if your patch is applied to the wrong git tree, please drop us a note to help improve the system]

url: https://github.com/0day-ci/linux/commits/Dennis-Zhou/blkcg-ref-count-refactor-cleanup-blkcg-avg_lat/20180831-161742
base: https://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux-block.git for-next
config: i386-randconfig-a1-201834 (attached as .config)
compiler: gcc-4.9 (Debian 4.9.4-2) 4.9.4
reproduce:
# save the attached .config to linux build tree
make ARCH=i386

All warnings (new ones prefixed by >>):

In file included from include/linux/blkdev.h:21:0,
from drivers/usb//image/microtek.c:135:
>> include/linux/bio.h:566:17: warning: 'struct blkcg_gq' declared inside parameter list
struct blkcg_gq *blkg) { return 0; }
^
>> include/linux/bio.h:566:17: warning: its scope is only this definition or declaration, which is probably not what you want

vim +566 include/linux/bio.h

555
556 #ifdef CONFIG_BLK_CGROUP
557 int bio_associate_blkcg(struct bio *bio, struct cgroup_subsys_state *blkcg_css);
558 int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg);
559 int bio_associate_create_blkg(struct request_queue *q, struct bio *bio);
560 void bio_disassociate_task(struct bio *bio);
561 void bio_clone_blkcg_association(struct bio *dst, struct bio *src);
562 #else /* CONFIG_BLK_CGROUP */
563 static inline int bio_associate_blkcg(struct bio *bio,
564 struct cgroup_subsys_state *blkcg_css) { return 0; }
565 static inline int bio_associate_blkg(struct bio *bio,
> 566 struct blkcg_gq *blkg) { return 0; }
567 static inline int bio_associate_create_blkg(struct request_queue *q,
568 struct bio *bio) { return 0; }
569 static inline void bio_disassociate_task(struct bio *bio) { }
570 static inline void bio_clone_blkcg_association(struct bio *dst,
571 struct bio *src) { }
572 #endif /* CONFIG_BLK_CGROUP */
573

---
0-DAY kernel test infrastructure Open Source Technology Center
https://lists.01.org/pipermail/kbuild-all Intel Corporation


2018-08-31 10:35:01

by kernel test robot

[permalink] [raw]
Subject: Re: [PATCH 15/15] blkcg: add average latency tracking to blk-cgroup

Hi Dennis,

Thank you for the patch! Yet something to improve:

[auto build test ERROR on block/for-next]
[also build test ERROR on v4.19-rc1 next-20180831]
[if your patch is applied to the wrong git tree, please drop us a note to help improve the system]

url: https://github.com/0day-ci/linux/commits/Dennis-Zhou/blkcg-ref-count-refactor-cleanup-blkcg-avg_lat/20180831-161742
base: https://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux-block.git for-next
config: x86_64-randconfig-x019-201834 (attached as .config)
compiler: gcc-7 (Debian 7.3.0-16) 7.3.0
reproduce:
# save the attached .config to linux build tree
make ARCH=x86_64

All errors (new ones prefixed by >>):

In file included from block/bio.c:20:0:
include/linux/bio.h:565:17: warning: 'struct blkcg_gq' declared inside parameter list will not be visible outside of this definition or declaration
struct blkcg_gq *blkg) { return 0; }
^~~~~~~~
block/bio.c: In function '__bio_clone_fast':
block/bio.c:614:23: error: 'struct bio' has no member named 'bi_issue'; did you mean 'bi_disk'?
bio_issue_init(&bio->bi_issue, bio_sectors(bio));
^~~~~~~~
bi_disk
block/bio.c: In function 'bio_endio':
>> block/bio.c:1750:11: error: 'struct bio' has no member named 'bi_blkg'; did you mean 'bi_flags'?
if (bio->bi_blkg && bio->bi_blkg->parent)
^~~~~~~
bi_flags
block/bio.c:1750:27: error: 'struct bio' has no member named 'bi_blkg'; did you mean 'bi_flags'?
if (bio->bi_blkg && bio->bi_blkg->parent)
^~~~~~~
bi_flags

vim +1750 block/bio.c

1727
1728 /**
1729 * bio_endio - end I/O on a bio
1730 * @bio: bio
1731 *
1732 * Description:
1733 * bio_endio() will end I/O on the whole bio. bio_endio() is the preferred
1734 * way to end I/O on a bio. No one should call bi_end_io() directly on a
1735 * bio unless they own it and thus know that it has an end_io function.
1736 *
1737 * bio_endio() can be called several times on a bio that has been chained
1738 * using bio_chain(). The ->bi_end_io() function will only be called the
1739 * last time. At this point the BLK_TA_COMPLETE tracing event will be
1740 * generated if BIO_TRACE_COMPLETION is set.
1741 **/
1742 void bio_endio(struct bio *bio)
1743 {
1744 again:
1745 if (!bio_remaining_done(bio))
1746 return;
1747 if (!bio_integrity_endio(bio))
1748 return;
1749
> 1750 if (bio->bi_blkg && bio->bi_blkg->parent)
1751 blkg_record_latency(bio);
1752
1753 if (bio->bi_disk)
1754 rq_qos_done_bio(bio->bi_disk->queue, bio);
1755
1756 /*
1757 * Need to have a real endio function for chained bios, otherwise
1758 * various corner cases will break (like stacking block devices that
1759 * save/restore bi_end_io) - however, we want to avoid unbounded
1760 * recursion and blowing the stack. Tail call optimization would
1761 * handle this, but compiling with frame pointers also disables
1762 * gcc's sibling call optimization.
1763 */
1764 if (bio->bi_end_io == bio_chain_endio) {
1765 bio = __bio_chain_endio(bio);
1766 goto again;
1767 }
1768
1769 if (bio->bi_disk && bio_flagged(bio, BIO_TRACE_COMPLETION)) {
1770 trace_block_bio_complete(bio->bi_disk->queue, bio,
1771 blk_status_to_errno(bio->bi_status));
1772 bio_clear_flag(bio, BIO_TRACE_COMPLETION);
1773 }
1774
1775 blk_throtl_bio_endio(bio);
1776 /* release cgroup info */
1777 bio_uninit(bio);
1778 if (bio->bi_end_io)
1779 bio->bi_end_io(bio);
1780 }
1781 EXPORT_SYMBOL(bio_endio);
1782

---
0-DAY kernel test infrastructure Open Source Technology Center
https://lists.01.org/pipermail/kbuild-all Intel Corporation


2018-08-31 11:19:40

by kernel test robot

[permalink] [raw]
Subject: Re: [PATCH 07/15] blkcg: consolidate bio_issue_init and blkg association

Hi Dennis,

Thank you for the patch! Yet something to improve:

[auto build test ERROR on block/for-next]
[also build test ERROR on v4.19-rc1 next-20180831]
[if your patch is applied to the wrong git tree, please drop us a note to help improve the system]

url: https://github.com/0day-ci/linux/commits/Dennis-Zhou/blkcg-ref-count-refactor-cleanup-blkcg-avg_lat/20180831-161742
base: https://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux-block.git for-next
config: i386-randconfig-a0-201834 (attached as .config)
compiler: gcc-4.9 (Debian 4.9.4-2) 4.9.4
reproduce:
# save the attached .config to linux build tree
make ARCH=i386

All errors (new ones prefixed by >>):

In file included from block/bounce.c:13:0:
include/linux/bio.h:566:17: warning: 'struct blkcg_gq' declared inside parameter list
struct blkcg_gq *blkg) { return 0; }
^
include/linux/bio.h:566:17: warning: its scope is only this definition or declaration, which is probably not what you want
block/bounce.c: In function 'bounce_clone_bio':
>> block/bounce.c:262:21: error: 'struct bio' has no member named 'bi_issue'
bio_issue_init(&bio->bi_issue, bio_sectors(bio));
^

vim +262 block/bounce.c

197
198 static struct bio *bounce_clone_bio(struct bio *bio_src, gfp_t gfp_mask,
199 struct bio_set *bs)
200 {
201 struct bvec_iter iter;
202 struct bio_vec bv;
203 struct bio *bio;
204
205 /*
206 * Pre immutable biovecs, __bio_clone() used to just do a memcpy from
207 * bio_src->bi_io_vec to bio->bi_io_vec.
208 *
209 * We can't do that anymore, because:
210 *
211 * - The point of cloning the biovec is to produce a bio with a biovec
212 * the caller can modify: bi_idx and bi_bvec_done should be 0.
213 *
214 * - The original bio could've had more than BIO_MAX_PAGES biovecs; if
215 * we tried to clone the whole thing bio_alloc_bioset() would fail.
216 * But the clone should succeed as long as the number of biovecs we
217 * actually need to allocate is fewer than BIO_MAX_PAGES.
218 *
219 * - Lastly, bi_vcnt should not be looked at or relied upon by code
220 * that does not own the bio - reason being drivers don't use it for
221 * iterating over the biovec anymore, so expecting it to be kept up
222 * to date (i.e. for clones that share the parent biovec) is just
223 * asking for trouble and would force extra work on
224 * __bio_clone_fast() anyways.
225 */
226
227 bio = bio_alloc_bioset(gfp_mask, bio_segments(bio_src), bs);
228 if (!bio)
229 return NULL;
230 bio->bi_disk = bio_src->bi_disk;
231 bio->bi_opf = bio_src->bi_opf;
232 bio->bi_write_hint = bio_src->bi_write_hint;
233 bio->bi_iter.bi_sector = bio_src->bi_iter.bi_sector;
234 bio->bi_iter.bi_size = bio_src->bi_iter.bi_size;
235
236 switch (bio_op(bio)) {
237 case REQ_OP_DISCARD:
238 case REQ_OP_SECURE_ERASE:
239 case REQ_OP_WRITE_ZEROES:
240 break;
241 case REQ_OP_WRITE_SAME:
242 bio->bi_io_vec[bio->bi_vcnt++] = bio_src->bi_io_vec[0];
243 break;
244 default:
245 bio_for_each_segment(bv, bio_src, iter)
246 bio->bi_io_vec[bio->bi_vcnt++] = bv;
247 break;
248 }
249
250 if (bio_integrity(bio_src)) {
251 int ret;
252
253 ret = bio_integrity_clone(bio, bio_src, gfp_mask);
254 if (ret < 0) {
255 bio_put(bio);
256 return NULL;
257 }
258 }
259
260 bio_clone_blkcg_association(bio, bio_src);
261
> 262 bio_issue_init(&bio->bi_issue, bio_sectors(bio));
263
264 return bio;
265 }
266

---
0-DAY kernel test infrastructure Open Source Technology Center
https://lists.01.org/pipermail/kbuild-all Intel Corporation


2018-08-31 12:54:03

by kernel test robot

[permalink] [raw]
Subject: Re: [PATCH 15/15] blkcg: add average latency tracking to blk-cgroup

Hi Dennis,

Thank you for the patch! Perhaps something to improve:

[auto build test WARNING on block/for-next]
[also build test WARNING on v4.19-rc1 next-20180831]
[if your patch is applied to the wrong git tree, please drop us a note to help improve the system]

url: https://github.com/0day-ci/linux/commits/Dennis-Zhou/blkcg-ref-count-refactor-cleanup-blkcg-avg_lat/20180831-161742
base: https://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux-block.git for-next
config: i386-randconfig-x017-201834 (attached as .config)
compiler: gcc-7 (Debian 7.3.0-16) 7.3.0
reproduce:
# save the attached .config to linux build tree
make ARCH=i386

All warnings (new ones prefixed by >>):

In file included from block/bio.c:20:0:
include/linux/bio.h:565:17: warning: 'struct blkcg_gq' declared inside parameter list will not be visible outside of this definition or declaration
struct blkcg_gq *blkg) { return 0; }
^~~~~~~~
block/bio.c: In function '__bio_clone_fast':
block/bio.c:614:23: error: 'struct bio' has no member named 'bi_issue'; did you mean 'bi_disk'?
bio_issue_init(&bio->bi_issue, bio_sectors(bio));
^~~~~~~~
bi_disk
In file included from include/asm-generic/bug.h:5:0,
from arch/x86/include/asm/bug.h:83,
from include/linux/bug.h:5,
from include/linux/mmdebug.h:5,
from include/linux/mm.h:9,
from block/bio.c:18:
block/bio.c: In function 'bio_endio':
block/bio.c:1750:11: error: 'struct bio' has no member named 'bi_blkg'; did you mean 'bi_flags'?
if (bio->bi_blkg && bio->bi_blkg->parent)
^
include/linux/compiler.h:58:30: note: in definition of macro '__trace_if'
if (__builtin_constant_p(!!(cond)) ? !!(cond) : \
^~~~
>> block/bio.c:1750:2: note: in expansion of macro 'if'
if (bio->bi_blkg && bio->bi_blkg->parent)
^~
block/bio.c:1750:27: error: 'struct bio' has no member named 'bi_blkg'; did you mean 'bi_flags'?
if (bio->bi_blkg && bio->bi_blkg->parent)
^
include/linux/compiler.h:58:30: note: in definition of macro '__trace_if'
if (__builtin_constant_p(!!(cond)) ? !!(cond) : \
^~~~
>> block/bio.c:1750:2: note: in expansion of macro 'if'
if (bio->bi_blkg && bio->bi_blkg->parent)
^~
block/bio.c:1750:11: error: 'struct bio' has no member named 'bi_blkg'; did you mean 'bi_flags'?
if (bio->bi_blkg && bio->bi_blkg->parent)
^
include/linux/compiler.h:58:42: note: in definition of macro '__trace_if'
if (__builtin_constant_p(!!(cond)) ? !!(cond) : \
^~~~
>> block/bio.c:1750:2: note: in expansion of macro 'if'
if (bio->bi_blkg && bio->bi_blkg->parent)
^~
block/bio.c:1750:27: error: 'struct bio' has no member named 'bi_blkg'; did you mean 'bi_flags'?
if (bio->bi_blkg && bio->bi_blkg->parent)
^
include/linux/compiler.h:58:42: note: in definition of macro '__trace_if'
if (__builtin_constant_p(!!(cond)) ? !!(cond) : \
^~~~
>> block/bio.c:1750:2: note: in expansion of macro 'if'
if (bio->bi_blkg && bio->bi_blkg->parent)
^~
block/bio.c:1750:11: error: 'struct bio' has no member named 'bi_blkg'; did you mean 'bi_flags'?
if (bio->bi_blkg && bio->bi_blkg->parent)
^
include/linux/compiler.h:69:16: note: in definition of macro '__trace_if'
______r = !!(cond); \
^~~~
>> block/bio.c:1750:2: note: in expansion of macro 'if'
if (bio->bi_blkg && bio->bi_blkg->parent)
^~
block/bio.c:1750:27: error: 'struct bio' has no member named 'bi_blkg'; did you mean 'bi_flags'?
if (bio->bi_blkg && bio->bi_blkg->parent)
^
include/linux/compiler.h:69:16: note: in definition of macro '__trace_if'
______r = !!(cond); \
^~~~
>> block/bio.c:1750:2: note: in expansion of macro 'if'
if (bio->bi_blkg && bio->bi_blkg->parent)
^~

vim +/if +1750 block/bio.c

1727
1728 /**
1729 * bio_endio - end I/O on a bio
1730 * @bio: bio
1731 *
1732 * Description:
1733 * bio_endio() will end I/O on the whole bio. bio_endio() is the preferred
1734 * way to end I/O on a bio. No one should call bi_end_io() directly on a
1735 * bio unless they own it and thus know that it has an end_io function.
1736 *
1737 * bio_endio() can be called several times on a bio that has been chained
1738 * using bio_chain(). The ->bi_end_io() function will only be called the
1739 * last time. At this point the BLK_TA_COMPLETE tracing event will be
1740 * generated if BIO_TRACE_COMPLETION is set.
1741 **/
1742 void bio_endio(struct bio *bio)
1743 {
1744 again:
1745 if (!bio_remaining_done(bio))
1746 return;
1747 if (!bio_integrity_endio(bio))
1748 return;
1749
> 1750 if (bio->bi_blkg && bio->bi_blkg->parent)
1751 blkg_record_latency(bio);
1752
1753 if (bio->bi_disk)
1754 rq_qos_done_bio(bio->bi_disk->queue, bio);
1755
1756 /*
1757 * Need to have a real endio function for chained bios, otherwise
1758 * various corner cases will break (like stacking block devices that
1759 * save/restore bi_end_io) - however, we want to avoid unbounded
1760 * recursion and blowing the stack. Tail call optimization would
1761 * handle this, but compiling with frame pointers also disables
1762 * gcc's sibling call optimization.
1763 */
1764 if (bio->bi_end_io == bio_chain_endio) {
1765 bio = __bio_chain_endio(bio);
1766 goto again;
1767 }
1768
1769 if (bio->bi_disk && bio_flagged(bio, BIO_TRACE_COMPLETION)) {
1770 trace_block_bio_complete(bio->bi_disk->queue, bio,
1771 blk_status_to_errno(bio->bi_status));
1772 bio_clear_flag(bio, BIO_TRACE_COMPLETION);
1773 }
1774
1775 blk_throtl_bio_endio(bio);
1776 /* release cgroup info */
1777 bio_uninit(bio);
1778 if (bio->bi_end_io)
1779 bio->bi_end_io(bio);
1780 }
1781 EXPORT_SYMBOL(bio_endio);
1782

---
0-DAY kernel test infrastructure Open Source Technology Center
https://lists.01.org/pipermail/kbuild-all Intel Corporation


2018-08-31 15:29:06

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 02/15] blkcg: delay blkg destruction until after writeback has finished

On Thu, Aug 30, 2018 at 09:53:43PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> Currently, blkcg destruction relies on a sequence of events:
> 1. Destruction starts. blkcg_css_offline() is called and blkgs
> release their reference to the blkcg. This immediately destroys
> the cgwbs (writeback).
> 2. With blkgs giving up their reference, the blkcg ref count should
> become zero and eventually call blkcg_css_free() which finally
> frees the blkcg.
>
> Jiufei Xue reported that there is a race between blkcg_bio_issue_check()
> and cgroup_rmdir(). To remedy this, blkg destruction becomes contingent
> on the completion of all writeback associated with the blkcg. A count of
> the number of cgwbs is maintained and once that goes to zero, blkg
> destruction can follow. This should prevent premature blkg destruction.
>
> The new process for blkcg cleanup is as follows:
> 1. Destruction starts. blkcg_css_offline() is called which offlines
> writeback. Blkg destruction is delayed on the nr_cgwbs count to
> avoid punting potentially large amounts of outstanding writeback
> to root while maintaining any ongoing policies.
> 2. When the nr_cgwbs becomes zero, blkcg_destroy_blkgs() is called and
> handles destruction of blkgs. This is where the css reference held
> by each blkg is released.
> 3. Once the blkcg ref count goes to zero, blkcg_css_free() is called.
> This finally frees the blkcg.
>
> It seems that in the past blk-throttle did something hard to follow:
> it took data from one blkg while associating with a potentially
> different one via current. The simplification and unification of what
> blk-throttle does is what exposed this.
>

So the general approach is correct, but it's sort of confusing because you are
using nr_cgwbs as a reference counter: it's set to 1 at blkcg creation time
regardless of whether or not there's an associated wb cg. So instead, why not
just have a refcount_t ref, set it to 1 on creation, make the wb cg take a
ref when it's attached, and then just do the get/put like normal and clean up as
you have below? What you are doing is a reference counter masquerading as a
count of the wb cgs; just add full ref counting to the blkcg and call it a day,
it'll be much less confusing. Thanks,

Josef
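
For clarity, a minimal sketch of what Josef is suggesting, assuming
refcount_t (the helper names are hypothetical, not from the posted
patch):

	struct blkcg {
		...
		refcount_t cgwb_refcnt;	/* base ref + one per attached wb */
	};

	/* in blkcg_css_alloc(): refcount_set(&blkcg->cgwb_refcnt, 1); */

	static inline void blkcg_cgwb_get(struct blkcg *blkcg)
	{
		refcount_inc(&blkcg->cgwb_refcnt);
	}

	static inline void blkcg_cgwb_put(struct blkcg *blkcg)
	{
		/* css_offline drops the base ref; the last put destroys
		 * the blkgs */
		if (refcount_dec_and_test(&blkcg->cgwb_refcnt))
			blkcg_destroy_blkgs(blkcg);
	}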

2018-08-31 15:32:05

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 03/15] blkcg: use tryget logic when associating a blkg with a bio

On Thu, Aug 30, 2018 at 09:53:44PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> There is a very small chance a bio gets caught up in a really
> unfortunate race between a task migration, a cgroup exiting, and itself
> trying to associate with a blkg. This is due to css offlining being
> performed after the css->refcnt is killed, which triggers removal of
> blkgs whose blkg->refcnt reaches 0.
>
> To avoid this, association with a blkg should use tryget and fallback to
> using the root_blkg.
>
> Fixes: 08e18eab0c579 ("block: add bi_blkg to the bio for cgroups")
> Signed-off-by: Dennis Zhou <[email protected]>
> Cc: Jiufei Xue <[email protected]>
> Cc: Joseph Qi <[email protected]>
> Cc: Tejun Heo <[email protected]>
> Cc: Josef Bacik <[email protected]>
> Cc: Jens Axboe <[email protected]>
> ---
> block/bio.c | 3 ++-
> block/blk-throttle.c | 5 +++--
> 2 files changed, 5 insertions(+), 3 deletions(-)
>
> diff --git a/block/bio.c b/block/bio.c
> index 04969b392c72..4473ccd22987 100644
> --- a/block/bio.c
> +++ b/block/bio.c
> @@ -1987,7 +1987,8 @@ int bio_associate_blkg(struct bio *bio, struct blkcg_gq *blkg)
> {
> if (unlikely(bio->bi_blkg))
> return -EBUSY;
> - blkg_get(blkg);
> + if (!blkg_try_get(blkg))
> + return -ENODEV;
> bio->bi_blkg = blkg;
> return 0;
> }
> diff --git a/block/blk-throttle.c b/block/blk-throttle.c
> index a3eede00d302..c626e1f7cdcd 100644
> --- a/block/blk-throttle.c
> +++ b/block/blk-throttle.c
> @@ -2129,8 +2129,9 @@ static inline void throtl_update_latency_buckets(struct throtl_data *td)
> static void blk_throtl_assoc_bio(struct throtl_grp *tg, struct bio *bio)
> {
> #ifdef CONFIG_BLK_DEV_THROTTLING_LOW
> - if (bio->bi_css)
> - bio_associate_blkg(bio, tg_to_blkg(tg));
> + /* fallback to root_blkg if we fail to get a blkg ref */
> + if (bio->bi_css && bio_associate_blkg(bio, tg_to_blkg(tg)))
> + bio_associate_blkg(bio, bio->bi_disk->queue->root_blkg);

Except if we've already associated a blkg this is just extra. Can we do

if (bio->bi_css && (bio_associate_blkg(bio, tg_to_blkg(tg)) == -ENODEV))

to make it clear that we're only attaching it to the root if we failed to attach
a blkg at all? Thanks,

Josef
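
Put together, Josef's suggested variant would look something like this
sketch (not the posted code):

static void blk_throtl_assoc_bio(struct throtl_grp *tg, struct bio *bio)
{
#ifdef CONFIG_BLK_DEV_THROTTLING_LOW
	/* fall back to root_blkg only when taking the blkg ref failed */
	if (bio->bi_css &&
	    (bio_associate_blkg(bio, tg_to_blkg(tg)) == -ENODEV))
		bio_associate_blkg(bio, bio->bi_disk->queue->root_blkg);
	bio_issue_init(&bio->bi_issue, bio_sectors(bio));
#endif
}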

2018-08-31 15:46:42

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 08/15] blkcg: associate a blkg for pages being evicted by swap

On Thu, Aug 30, 2018 at 09:53:49PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> A prior patch in this series added blkg association to bios issued by
> cgroups. There are two other paths that we want to attribute work back
> to the appropriate cgroup: swap and writeback. Here we modify the way
> swap tags bios to include the blkg. Writeback will be tackle in the next
> patch.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Reviewed-by: Josef Bacik <[email protected]>

Thanks,

Josef

2018-08-31 15:47:28

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 09/15] blkcg: associate writeback bios with a blkg

On Thu, Aug 30, 2018 at 09:53:50PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> One of the goals of this series is to remove a separate reference to
> the css of the bio. This can and should be accessed via bio_blkcg. In
> this patch, the wbc_init_bio call is changed such that it must be called
> after a queue has been associated with the bio.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Reviewed-by: Josef Bacik <[email protected]>

Thanks,

Josef

2018-08-31 15:48:49

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 10/15] blkcg: remove bio->bi_css and instead use bio->bi_blkg

On Thu, Aug 30, 2018 at 09:53:51PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> Prior patches ensured that all bios are now associated with some blkg.
> This now makes bio->bi_css unnecessary as blkg maintains a reference to
> the blkcg already.
>
> This patch removes the field bi_css and transfers corresponding uses to
> access via bi_blkg.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Reviewed-by: Josef Bacik <[email protected]>

Thanks,

Josef

2018-08-31 15:51:19

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 13/15] blkcg: change blkg reference counting to use percpu_ref

On Thu, Aug 30, 2018 at 09:53:54PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> Now that every bio is associated with a blkg, this puts the use of
> blkg_get, blkg_try_get, and blkg_put on the hot path. This switches over
> the refcnt in blkg to use percpu_ref.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Reviewed-by: Josef Bacik <[email protected]>

Thanks,

Josef

2018-08-31 15:53:27

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 14/15] blkcg: rename blkg_try_get to blkg_tryget

On Thu, Aug 30, 2018 at 09:53:55PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> blkg reference counting now uses percpu_ref rather than atomic_t. Let's
> make this consistent with css_tryget. This renames blkg_try_get to
> blkg_tryget and now returns a bool rather than the blkg or NULL.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Reviewed-by: Josef Bacik <[email protected]>

Thanks,

Josef

2018-08-31 16:40:46

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 04/15] blkcg: fix ref count issue with bio_blkcg using task_css

On Thu, Aug 30, 2018 at 09:53:45PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> The accessor function bio_blkcg either returns the blkcg associated with
> the bio or finds one in the current context. This can cause an issue
> when trying to associate a bio with a blkcg. Particularly, it's the
> third case that is problematic:
>
> return css_to_blkcg(task_css(current, io_cgrp_id));
>
> As the above may race against task migration and the cgroup exiting, it
> is not always ok to take a reference on the blkcg returned from
> bio_blkcg.
>
> This patch adds association ahead of calling bio_blkcg rather than
> after. This prevents makes association a required and explicit step
> along the code paths for calling bio_blkcg. blk_get_rl is modified
> as well to get a reference to the blkcg it may use and blk_put_rl
> will always put the reference back. Association is also moved above the
> bio_blkcg call to ensure it will not return NULL in blk-iolatency.
>
> Signed-off-by: Dennis Zhou <[email protected]>
> ---
> block/bio.c | 10 +++++--
> block/blk-iolatency.c | 2 +-
> include/linux/blk-cgroup.h | 53 ++++++++++++++++++++++++++++++++------
> 3 files changed, 54 insertions(+), 11 deletions(-)
>
> diff --git a/block/bio.c b/block/bio.c
> index 4473ccd22987..09a31e4d46bb 100644
> --- a/block/bio.c
> +++ b/block/bio.c
> @@ -1962,13 +1962,19 @@ int bio_associate_blkcg_from_page(struct bio *bio, struct page *page)
> *
> * This function takes an extra reference of @blkcg_css which will be put
> * when @bio is released. The caller must own @bio and is responsible for
> - * synchronizing calls to this function.
> + * synchronizing calls to this function. If @blkcg_css is NULL, a call to
> + * blkcg_get_css finds the current css from the kthread or task.
> */
> int bio_associate_blkcg(struct bio *bio, struct cgroup_subsys_state *blkcg_css)
> {
> if (unlikely(bio->bi_css))
> return -EBUSY;
> - css_get(blkcg_css);
> +
> + if (blkcg_css)
> + css_get(blkcg_css);
> + else
> + blkcg_css = blkcg_get_css();
> +
> bio->bi_css = blkcg_css;
> return 0;
> }
> diff --git a/block/blk-iolatency.c b/block/blk-iolatency.c
> index 19923f8a029d..62fdd9002c29 100644
> --- a/block/blk-iolatency.c
> +++ b/block/blk-iolatency.c
> @@ -404,8 +404,8 @@ static void blkcg_iolatency_throttle(struct rq_qos *rqos, struct bio *bio,
> return;
>
> rcu_read_lock();
> + bio_associate_blkcg(bio, NULL);
> blkcg = bio_blkcg(bio);
> - bio_associate_blkcg(bio, &blkcg->css);
> blkg = blkg_lookup(blkcg, q);
> if (unlikely(!blkg)) {
> if (!lock)
> diff --git a/include/linux/blk-cgroup.h b/include/linux/blk-cgroup.h
> index c7386464ec4c..d3cafb1eda48 100644
> --- a/include/linux/blk-cgroup.h
> +++ b/include/linux/blk-cgroup.h
> @@ -230,22 +230,52 @@ int blkg_conf_prep(struct blkcg *blkcg, const struct blkcg_policy *pol,
> char *input, struct blkg_conf_ctx *ctx);
> void blkg_conf_finish(struct blkg_conf_ctx *ctx);
>
> +/**
> + * blkcg_get_css - find and get a reference to the css
> + *
> + * Find the css associated with either the kthread or the current task.
> + */
> +static inline struct cgroup_subsys_state *blkcg_get_css(void)
> +{
> + struct cgroup_subsys_state *css;
> +
> + rcu_read_lock();
> +
> + css = kthread_blkcg();
> + if (css) {
> + css_get(css);
> + } else {
> + while (true) {
> + css = task_css(current, io_cgrp_id);
> + if (likely(css_tryget(css)))
> + break;
> + cpu_relax();

Does this work? I'm ignorant of what cpu_relax() does, but it seems that if
we're rcu_read_lock()'ed here we aren't going to quiesce, so if we fail to
get the css here we simply aren't going to get it unless we go to sleep,
right? An honest question, because this is all magic to me; I'd like to
understand how this isn't going to infinite loop on us if css_tryget(css)
fails.

> + }
> + }
> +
> + rcu_read_unlock();
> +
> + return css;
> +}
>
> static inline struct blkcg *css_to_blkcg(struct cgroup_subsys_state *css)
> {
> return css ? container_of(css, struct blkcg, css) : NULL;
> }
>
> +/**
> + * bio_blkcg - grab the blkcg associated with a bio
> + * @bio: target bio
> + *
> + * This returns the blkcg associated with a bio, NULL if not associated.
> + * Callers are expected to either handle NULL or know association has been
> + * done prior to calling this.
> + */
> static inline struct blkcg *bio_blkcg(struct bio *bio)
> {
> - struct cgroup_subsys_state *css;
> -
> if (bio && bio->bi_css)
> return css_to_blkcg(bio->bi_css);
> - css = kthread_blkcg();
> - if (css)
> - return css_to_blkcg(css);
> - return css_to_blkcg(task_css(current, io_cgrp_id));
> + return NULL;
> }
>

So this is fine per se, but I know recently I was doing a bio_blkcg(NULL) to get
whatever the blkcg was for the current task. I threw that work away so I'm not
worried about me, but have you made sure nobody else is doing something similar?

> static inline bool blk_cgroup_congested(void)
> @@ -519,6 +549,11 @@ static inline struct request_list *blk_get_rl(struct request_queue *q,
> rcu_read_lock();
>
> blkcg = bio_blkcg(bio);
> + if (blkcg) {
> + css_get(&blkcg->css);
> + } else {
> + blkcg = css_to_blkcg(blkcg_get_css());
> + }

Kill these extra braces please. Thanks,

Josef

2018-08-31 16:40:48

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 05/15] blkcg: update blkg_lookup_create to do locking

On Thu, Aug 30, 2018 at 09:53:46PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> To know when to create a blkg, the general pattern is to do a
> blkg_lookup and if that fails, lock and then do a lookup again and if
> that fails finally create. It doesn't make much sense for everyone who
> wants to do creation to write this themselves.
>
> This changes blkg_lookup_create to do locking and implement this
> pattern. The old blkg_lookup_create is renamed to __blkg_lookup_create.
> If a call site wants to do its own error handling or already owns the
> queue lock, they can use __blkg_lookup_create.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Reviewed-by: Josef Bacik <[email protected]>

Thanks,

Josef

2018-08-31 16:59:38

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 07/15] blkcg: consolidate bio_issue_init and blkg association

On Thu, Aug 30, 2018 at 09:53:48PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> This removes the now duplicate association logic in blk-throttle and
> blk-iolatency. bio_issue_init is moved into blkcg_bio_issue_check and
> into the bio clone variants to allow for the future addition of a
> latency moving average for IOs.
>
> Signed-off-by: Dennis Zhou <[email protected]>
> ---
> block/bio.c | 2 ++
> block/blk-iolatency.c | 24 +-----------------------
> block/blk-throttle.c | 13 +------------
> block/bounce.c | 2 ++
> include/linux/blk-cgroup.h | 2 ++
> 5 files changed, 8 insertions(+), 35 deletions(-)
>
> diff --git a/block/bio.c b/block/bio.c
> index e937f9681188..ab41f5b7eb1f 100644
> --- a/block/bio.c
> +++ b/block/bio.c
> @@ -610,6 +610,8 @@ void __bio_clone_fast(struct bio *bio, struct bio *bio_src)
> bio->bi_io_vec = bio_src->bi_io_vec;
>
> bio_clone_blkcg_association(bio, bio_src);
> +
> + bio_issue_init(&bio->bi_issue, bio_sectors(bio));
> }
> EXPORT_SYMBOL(__bio_clone_fast);
>
> diff --git a/block/blk-iolatency.c b/block/blk-iolatency.c
> index 22b2ff0440cc..9d7052bad6f7 100644
> --- a/block/blk-iolatency.c
> +++ b/block/blk-iolatency.c
> @@ -395,34 +395,12 @@ static void blkcg_iolatency_throttle(struct rq_qos *rqos, struct bio *bio,
> spinlock_t *lock)
> {
> struct blk_iolatency *blkiolat = BLKIOLATENCY(rqos);
> - struct blkcg *blkcg;
> - struct blkcg_gq *blkg;
> - struct request_queue *q = rqos->q;
> + struct blkcg_gq *blkg = bio->bi_blkg;
> bool issue_as_root = bio_issue_as_root_blkg(bio);
>
> if (!blk_iolatency_enabled(blkiolat))
> return;
>
> - rcu_read_lock();
> - bio_associate_blkcg(bio, NULL);
> - blkcg = bio_blkcg(bio);
> - blkg = blkg_lookup(blkcg, q);
> - if (unlikely(!blkg)) {
> - if (!lock)
> - spin_lock_irq(q->queue_lock);
> - blkg = __blkg_lookup_create(blkcg, q);
> - if (IS_ERR(blkg))
> - blkg = NULL;
> - if (!lock)
> - spin_unlock_irq(q->queue_lock);
> - }
> - if (!blkg)
> - goto out;
> -
> - bio_issue_init(&bio->bi_issue, bio_sectors(bio));
> - bio_associate_blkg(bio, blkg);
> -out:
> - rcu_read_unlock();

Move this removal to the previous patch, so you keep this patch solely about the
bio_issue_init. Thanks,

Josef

2018-08-31 20:20:35

by Dennis Zhou

[permalink] [raw]
Subject: Re: [PATCH 02/15] blkcg: delay blkg destruction until after writeback has finished

Hi Josef,

On Fri, Aug 31, 2018 at 11:27:07AM -0400, Josef Bacik wrote:
> So the general approach is correct, but it's sort of confusing because you are
> using nr_cgwbs as a reference counter, because it's set at 1 at blkg creation
time regardless of whether or not there's an associated wb cg. So instead why not
> just have a refcount_t ref, set it to 1 on creation and make the wb cg take a
> ref when it's attached, and then just do the get/put like normal and cleanup as
> you have below? What you are doing is a reference counter masquerading as a
> count of the wb cg's, just add full ref counting to the blkcg and call it a day,
> it'll be much less confusing. Thanks,

Yeah, that makes more sense. I've switched to using refcount_t and
renamed it to wbcg_refcnt. The corresponding actions have been renamed.

I've also fixed the kbuild error in v2.
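
For v2 this looks roughly like the sketch below. The counter name is as
described above; the helper names and the blkcg_destroy_blkgs() cleanup
hook are only illustrative and may differ in the actual v2:

	static inline void blkcg_wbcg_get(struct blkcg *blkcg)
	{
		refcount_inc(&blkcg->wbcg_refcnt);
	}

	/* the last put, once writeback detaches, triggers blkg destruction */
	static inline void blkcg_wbcg_put(struct blkcg *blkcg)
	{
		if (refcount_dec_and_test(&blkcg->wbcg_refcnt))
			blkcg_destroy_blkgs(blkcg);
	}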

Thanks,
Dennis

2018-08-31 20:22:01

by Dennis Zhou

[permalink] [raw]
Subject: Re: [PATCH 03/15] blkcg: use tryget logic when associating a blkg with a bio

Hi Josef,

On Fri, Aug 31, 2018 at 11:30:08AM -0400, Josef Bacik wrote:
> On Thu, Aug 30, 2018 at 09:53:44PM -0400, Dennis Zhou wrote:
> > diff --git a/block/blk-throttle.c b/block/blk-throttle.c
> > index a3eede00d302..c626e1f7cdcd 100644
> > --- a/block/blk-throttle.c
> > +++ b/block/blk-throttle.c
> > @@ -2129,8 +2129,9 @@ static inline void throtl_update_latency_buckets(struct throtl_data *td)
> > static void blk_throtl_assoc_bio(struct throtl_grp *tg, struct bio *bio)
> > {
> > #ifdef CONFIG_BLK_DEV_THROTTLING_LOW
> > - if (bio->bi_css)
> > - bio_associate_blkg(bio, tg_to_blkg(tg));
> > + /* fallback to root_blkg if we fail to get a blkg ref */
> > + if (bio->bi_css && bio_associate_blkg(bio, tg_to_blkg(tg)))
> > + bio_associate_blkg(bio, bio->bi_disk->queue->root_blkg);
>
> Except if we've already associated a blkg this is just extra, can we do
>
> if (bio->bi_css && (bio_associate_blkg(bio, tg_to_blkg(tg)) == -ENODEV))
>
> to make it clear that we're only attaching it to the root if we failed to attach
> a blkg at all? Thanks,

Done in v2.
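
The hunk in v2 should read roughly as follows (a sketch combining the
original diff with your suggestion; the exact v2 code may differ
slightly):

	static void blk_throtl_assoc_bio(struct throtl_grp *tg, struct bio *bio)
	{
	#ifdef CONFIG_BLK_DEV_THROTTLING_LOW
		/* fall back to root_blkg only if we failed to take a blkg ref */
		if (bio->bi_css &&
		    (bio_associate_blkg(bio, tg_to_blkg(tg)) == -ENODEV))
			bio_associate_blkg(bio, bio->bi_disk->queue->root_blkg);
	#endif
	}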

Thanks,
Dennis

2018-08-31 23:06:48

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 04/15] blkcg: fix ref count issue with bio_blkcg using task_css

Hello,

On Fri, Aug 31, 2018 at 11:35:39AM -0400, Josef Bacik wrote:
> > +static inline struct cgroup_subsys_state *blkcg_get_css(void)
> > +{
> > + struct cgroup_subsys_state *css;
> > +
> > + rcu_read_lock();
> > +
> > + css = kthread_blkcg();
> > + if (css) {
> > + css_get(css);
> > + } else {
> > + while (true) {
> > + css = task_css(current, io_cgrp_id);
> > + if (likely(css_tryget(css)))
> > + break;
> > + cpu_relax();
>
> Does this work? I'm ignorant of what cpu_relax() does, but it seems that if
> we're rcu_read_lock()'ed here we aren't going to quiesce, so if we fail to
> get the css here we simply aren't going to get it unless we go to sleep,
> right? An honest question, because this is all magic to me; I'd like to
> understand how this isn't going to infinite loop on us if css_tryget(css)
> fails.

The only time css_tryget() on task_css(current, xxx) can fail is if it
races against the current thread migrating away from that cgroup and
that cgroup is now getting destroyed. IOW,

1. For css_tryget() to fail, the cgroup must be dying.
2. The cgroup must be empty for it to be dying.
3. current must have already been migrated away to a different cgroup.

So, the above happens only when racing against css_set_move_task() -
it's seeing the old css pointer. As the membership pointer switching
must already have happened, all it's waiting for is the new css
membership pointer to be propagated on the polling cpu, making
a cpu_relax() busy loop the right thing to do.

This pattern is also used in task_get_css() and cgroup_sk_alloc().
Given that it's a bit tricky, it probably would be worthwhile to
factor out and document it.
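
FWIW, task_get_css() already encapsulates exactly this loop. Quoting from
include/linux/cgroup.h, lightly trimmed, so treat this as a reference
sketch rather than gospel:

	static inline struct cgroup_subsys_state *
	task_get_css(struct task_struct *task, int subsys_id)
	{
		struct cgroup_subsys_state *css;

		rcu_read_lock();
		while (true) {
			css = task_css(task, subsys_id);
			/* can only fail while racing a migration off a dying cgroup */
			if (likely(css_tryget_online(css)))
				break;
			cpu_relax();
		}
		rcu_read_unlock();
		return css;
	}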

Once Josef's other concerns are addressed, please feel free to add

Acked-by: Tejun Heo <[email protected]>

Thanks.

--
tejun

2018-08-31 23:11:39

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 05/15] blkcg: update blkg_lookup_create to do locking

On Thu, Aug 30, 2018 at 09:53:46PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> To know when to create a blkg, the general pattern is to do a
> blkg_lookup and if that fails, lock and then do a lookup again and if
> that fails finally create. It doesn't make much sense for everyone who
> wants to do creation to write this themselves.
>
> This changes blkg_lookup_create to do locking and implement this
> pattern. The old blkg_lookup_create is renamed to __blkg_lookup_create.
> If a call site wants to do its own error handling or already owns the
> queue lock, they can use __blkg_lookup_create.
>
> Signed-off-by: Dennis Zhou <[email protected]>

It looks a bit weird w/o actual users, might be worthwhile to mention
that future patches will add users.
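
For readers following along, the pattern being consolidated is roughly
the below (a sketch; locking details follow the current ->queue_lock
API):

	struct blkcg_gq *blkg_lookup_create(struct blkcg *blkcg,
					    struct request_queue *q)
	{
		struct blkcg_gq *blkg = blkg_lookup(blkcg, q);

		if (unlikely(!blkg)) {
			spin_lock_irq(q->queue_lock);
			/* __blkg_lookup_create() re-checks under the lock */
			blkg = __blkg_lookup_create(blkcg, q);
			spin_unlock_irq(q->queue_lock);
		}

		return blkg;
	}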

Acked-by: Tejun Heo <[email protected]>

Thanks.

--
tejun

2018-08-31 23:17:40

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 06/15] blkcg: always associate a bio with a blkg

Hello,

On Thu, Aug 30, 2018 at 09:53:47PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> Previously, blkg's were only assigned as needed by blk-iolatency and
> blk-throttle. bio->css was also always being associated while blkg was
> being looked up and then thrown away in blkcg_bio_issue_check.
>
> This patch beings the cleanup of bio->css and bio->bi_blkg by always
^
begins

> +int bio_associate_create_blkg(struct request_queue *q, struct bio *bio)
> +{
> + struct blkcg *blkcg;
> + struct blkcg_gq *blkg;
> + int ret = 0;
> +
> + /* someone has already associated this bio with a blkg */
> + if (bio->bi_blkg)
> + return ret;
> +
> + rcu_read_lock();
> +
> + bio_associate_blkcg(bio, NULL);
> + blkcg = bio_blkcg(bio);
> +
> + if (!blkcg->css.parent) {
> + ret = bio_associate_blkg(bio, q->root_blkg);
> + goto assoc_out;
> + }
> +
> + blkg = blkg_lookup_create(blkcg, q);
> + if (IS_ERR(blkg))
> + blkg = q->root_blkg;
> +
> + ret = bio_associate_blkg(bio, blkg);
> +assoc_out:

Maybe if/else instead of goto?

Other than that,

Acked-by: Tejun Heo <[email protected]>

Thanks.

--
tejun

2018-08-31 23:46:54

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 07/15] blkcg: consolidate bio_issue_init and blkg association

On Thu, Aug 30, 2018 at 09:53:48PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> This removes the now duplicate association logic in blk-throttle and
> blk-iolatency. bio_issue_init is moved into blkcg_bio_issue_check and
> into the bio clone variants to allow for the future addition of a
> latency moving average for IOs.
>
> Signed-off-by: Dennis Zhou <[email protected]>

I don't have strong feelings about where the removal goes, so please
feel free to reorganize patches per Josef's suggestion.

Acked-by: Tejun Heo <[email protected]>

Thanks.

--
tejun

2018-08-31 23:49:50

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 08/15] blkcg: associate a blkg for pages being evicted by swap

On Thu, Aug 30, 2018 at 09:53:49PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> A prior patch in this series added blkg association to bios issued by
> cgroups. There are two other paths that we want to attribute work back
> to the appropriate cgroup: swap and writeback. Here we modify the way
> swap tags bios to include the blkg. Writeback will be tackled in the next
> patch.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Acked-by: Tejun Heo <[email protected]>

Thanks.

--
tejun

2018-08-31 23:55:18

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 09/15] blkcg: associate writeback bios with a blkg

On Thu, Aug 30, 2018 at 09:53:50PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> One of the goals of this series is to remove a separate reference to
> the css of the bio. This can and should be accessed via bio_blkcg. In
> this patch, the wbc_init_bio call is changed such that it must be called
> after a queue has been associated with the bio.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Acked-by: Tejun Heo <[email protected]>

Thanks.

--
tejun

2018-09-01 00:14:34

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 10/15] blkcg: remove bio->bi_css and instead use bio->bi_blkg

On Thu, Aug 30, 2018 at 09:53:51PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> Prior patches ensured that all bios are now associated with some blkg.
> This now makes bio->bi_css unnecessary as blkg maintains a reference to
> the blkcg already.
>
> This patch removes the field bi_css and transfers corresponding uses to
> access via bi_blkg.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Acked-by: Tejun Heo <[email protected]>

Thanks.

--
tejun

2018-09-01 00:27:52

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 11/15] blkcg: remove additional reference to the css

Hello,

On Thu, Aug 30, 2018 at 09:53:52PM -0400, Dennis Zhou wrote:
> - css = cgroup_get_e_css(page->mem_cgroup->css.cgroup, &io_cgrp_subsys);
>
> - return __bio_associate_blkg_from_css(bio, css);
> + rcu_read_lock();
> +
> + while (true) {
> + css = cgroup_e_css(page->mem_cgroup->css.cgroup,
> + &io_cgrp_subsys);

So, while they seem similar cgroup_e_css() and cgroup_get_e_css()
behave very differently in terms of locking. cgroup_e_css() can only
be used under cgroup_mutex because it is used during migration and has
to test cgroup_ss_mask(). The right thing to do here would be
renaming cgroup_e_css() to something else and add a new implementation
which operates in the same way as cgroup_get_e_css().

BTW, this should have triggered lockdep warning. I'd strongly
recommend testing with lockdep enabled.

Other than that, looks good to me.

Thanks.

--
tejun

2018-09-01 00:30:23

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 12/15] blkcg: cleanup and make blk_get_rl use blkg_lookup_create

On Thu, Aug 30, 2018 at 09:53:53PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> blk_get_rl is responsible for identifying which request_list a request
> should be allocated to. Try get logic was added earlier, but
> semantically the logic was not changed.
>
> This patch makes better use of the bio already having a reference to the
> blkg in the hot path. The cold path uses a better fallback of
> blkg_lookup_create rather than just blkg_lookup and then falling back to
> the q->root_rl. If lookup_create fails with anything but -ENODEV, it
> falls back to q->root_rl.
>
> A clarifying comment is added to explain why q->root_rl is used rather
> than the root blkg's rl.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Acked-by: Tejun Heo <[email protected]>

We're replicating that retry busy loop a lot. It'd be really great to
factor that out and document what it's doing.

Thanks.

--
tejun

2018-09-01 00:33:22

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 13/15] blkcg: change blkg reference counting to use percpu_ref

Hello,

On Thu, Aug 30, 2018 at 09:53:54PM -0400, Dennis Zhou wrote:
> @@ -217,6 +240,10 @@ static struct blkcg_gq *blkg_create(struct blkcg *blkcg,
> blkg_get(blkg->parent);
> }
>
> + ret = percpu_ref_init(&blkg->refcnt, __blkg_release, 0, GFP_KERNEL);

So, while this would work in some configs, you can't depend on an RCU
grace period inside percpu_ref. blkg is now a percpu-reference-counted
and RCU-protected object - it has to explicitly go through an RCU grace
period before release.

Thanks.

--
tejun

2018-09-01 00:34:13

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 14/15] blkcg: rename blkg_try_get to blkg_tryget

On Thu, Aug 30, 2018 at 09:53:55PM -0400, Dennis Zhou wrote:
> From: "Dennis Zhou (Facebook)" <[email protected]>
>
> blkg reference counting now uses percpu_ref rather than atomic_t. Let's
> make this consistent with css_tryget. This renames blkg_try_get to
> blkg_tryget, which now returns a bool rather than the blkg or NULL.
>
> Signed-off-by: Dennis Zhou <[email protected]>

Acked-by: Tejun Heo <[email protected]>

Thanks.

--
tejun

2018-09-01 00:36:36

by Tejun Heo

[permalink] [raw]
Subject: Re: [PATCH 00/15] blkcg ref count refactor/cleanup + blkcg avg_lat

Hello,

On Thu, Aug 30, 2018 at 09:53:41PM -0400, Dennis Zhou wrote:
> This is a fairly lengthy patchset that aims to cleanup reference
> counting for blkcgs and blkgs. There are 4 problems that this patchset
> tries to address:
> 1. fix blkcg destruction
> 2. always associate a bio with a blkg
> 3. remove the extra css ref held by bios and utilize the blkg ref
> 4. add average latency tracking to blkcg core in io.stat.

1 is already merged. Reviewed 2, 3 parts. Generally looks great.
Let's try to address the found issues and get these two parts merged
first.

Thanks.

--
tejun

2018-09-06 15:23:45

by Dennis Zhou

[permalink] [raw]
Subject: Re: [PATCH 04/15] blkcg: fix ref count issue with bio_blkcg using task_css

On Fri, Aug 31, 2018 at 11:35:39AM -0400, Josef Bacik wrote:
> On Thu, Aug 30, 2018 at 09:53:45PM -0400, Dennis Zhou wrote:
> > From: "Dennis Zhou (Facebook)" <[email protected]>
> > +/**
> > + * blkcg_get_css - find and get a reference to the css
> > + *
> > + * Find the css associated with either the kthread or the current task.
> > + */
> > +static inline struct cgroup_subsys_state *blkcg_get_css(void)
> > +{
> > + struct cgroup_subsys_state *css;
> > +
> > + rcu_read_lock();
> > +
> > + css = kthread_blkcg();
> > + if (css) {
> > + css_get(css);
> > + } else {
> > + while (true) {
> > + css = task_css(current, io_cgrp_id);
> > + if (likely(css_tryget(css)))
> > + break;
> > + cpu_relax();
>
> Does this work? I'm ignorant of what cpu_relax() does, but it seems that if
> we're rcu_read_lock()'ed here we aren't going to quiesce, so if we fail to
> get the css here we simply aren't going to get it unless we go to sleep,
> right? An honest question, because this is all magic to me; I'd like to
> understand how this isn't going to infinite loop on us if css_tryget(css)
> fails.
>

Tejun replied earlier with an in-depth answer. Thanks Tejun! I'll make
sure to add a comment detailing what's going on.

> > +/**
> > + * bio_blkcg - grab the blkcg associated with a bio
> > + * @bio: target bio
> > + *
> > + * This returns the blkcg associated with a bio, NULL if not associated.
> > + * Callers are expected to either handle NULL or know association has been
> > + * done prior to calling this.
> > + */
> > static inline struct blkcg *bio_blkcg(struct bio *bio)
> > {
> > - struct cgroup_subsys_state *css;
> > -
> > if (bio && bio->bi_css)
> > return css_to_blkcg(bio->bi_css);
> > - css = kthread_blkcg();
> > - if (css)
> > - return css_to_blkcg(css);
> > - return css_to_blkcg(task_css(current, io_cgrp_id));
> > + return NULL;
> > }
> >
>
> So this is fine per se, but I know recently I was doing a bio_blkcg(NULL) to get
> whatever the blkcg was for the current task. I threw that work away so I'm not
> worried about me, but have you made sure nobody else is doing something similar?
>

Initially I thought the BFQ and CFQ stuff only interacted with bios
which should already be associated. Turns out during init, they rely on
bio_blkcg to read from current and then do the wrong thing of
hard-associating with it (_get vs _tryget).

I've created a __bio_blkcg which is identical to the old function with
notes to not use it. Making changes to BFQ and CFQ would take a good bit
more work to make sure I'm not breaking what they're expecting to do, so
I leave that to future work.
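
For reference, __bio_blkcg is just the old body carried over with a
warning attached (sketch):

	/*
	 * DO NOT USE in new code: the returned css may be mid-migration or
	 * dying, so a plain css_get() on it is not always safe.
	 */
	static inline struct blkcg *__bio_blkcg(struct bio *bio)
	{
		struct cgroup_subsys_state *css;

		if (bio && bio->bi_css)
			return css_to_blkcg(bio->bi_css);
		css = kthread_blkcg();
		if (css)
			return css_to_blkcg(css);
		return css_to_blkcg(task_css(current, io_cgrp_id));
	}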

> > static inline bool blk_cgroup_congested(void)
> > @@ -519,6 +549,11 @@ static inline struct request_list *blk_get_rl(struct request_queue *q,
> > rcu_read_lock();
> >
> > blkcg = bio_blkcg(bio);
> > + if (blkcg) {
> > + css_get(&blkcg->css);
> > + } else {
> > + blkcg = css_to_blkcg(blkcg_get_css());
> > + }
>
> Kill these extra braces please. Thanks,

Done.
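
i.e. the hunk now reads:

	blkcg = bio_blkcg(bio);
	if (blkcg)
		css_get(&blkcg->css);
	else
		blkcg = css_to_blkcg(blkcg_get_css());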

Thanks,
Dennis

2018-09-06 21:30:34

by Dennis Zhou

[permalink] [raw]
Subject: Re: [PATCH 07/15] blkcg: consolidate bio_issue_init and blkg association

On Fri, Aug 31, 2018 at 11:42:26AM -0400, Josef Bacik wrote:
> On Thu, Aug 30, 2018 at 09:53:48PM -0400, Dennis Zhou wrote:
> > diff --git a/block/blk-iolatency.c b/block/blk-iolatency.c
> > index 22b2ff0440cc..9d7052bad6f7 100644
> > --- a/block/blk-iolatency.c
> > +++ b/block/blk-iolatency.c
> > @@ -395,34 +395,12 @@ static void blkcg_iolatency_throttle(struct rq_qos *rqos, struct bio *bio,
> > spinlock_t *lock)
> > {
> > struct blk_iolatency *blkiolat = BLKIOLATENCY(rqos);
> > - struct blkcg *blkcg;
> > - struct blkcg_gq *blkg;
> > - struct request_queue *q = rqos->q;
> > + struct blkcg_gq *blkg = bio->bi_blkg;
> > bool issue_as_root = bio_issue_as_root_blkg(bio);
> >
> > if (!blk_iolatency_enabled(blkiolat))
> > return;
> >
> > - rcu_read_lock();
> > - bio_associate_blkcg(bio, NULL);
> > - blkcg = bio_blkcg(bio);
> > - blkg = blkg_lookup(blkcg, q);
> > - if (unlikely(!blkg)) {
> > - if (!lock)
> > - spin_lock_irq(q->queue_lock);
> > - blkg = __blkg_lookup_create(blkcg, q);
> > - if (IS_ERR(blkg))
> > - blkg = NULL;
> > - if (!lock)
> > - spin_unlock_irq(q->queue_lock);
> > - }
> > - if (!blkg)
> > - goto out;
> > -
> > - bio_issue_init(&bio->bi_issue, bio_sectors(bio));
> > - bio_associate_blkg(bio, blkg);
> > -out:
> > - rcu_read_unlock();
>
> Move this removal to the previous patch, so you keep this patch solely about the
> bio_issue_init. Thanks,
>

I've moved this removal to the previous patch and addressed the kbuild
errors.

Thanks,
Dennis

2018-09-06 21:31:34

by Dennis Zhou

[permalink] [raw]
Subject: Re: [PATCH 11/15] blkcg: remove additional reference to the css

On Fri, Aug 31, 2018 at 05:26:03PM -0700, Tejun Heo wrote:
> Hello,
>
> On Thu, Aug 30, 2018 at 09:53:52PM -0400, Dennis Zhou wrote:
> > - css = cgroup_get_e_css(page->mem_cgroup->css.cgroup, &io_cgrp_subsys);
> >
> > - return __bio_associate_blkg_from_css(bio, css);
> > + rcu_read_lock();
> > +
> > + while (true) {
> > + css = cgroup_e_css(page->mem_cgroup->css.cgroup,
> > + &io_cgrp_subsys);
>
> So, while they seem similar cgroup_e_css() and cgroup_get_e_css()
> behave very differently in terms of locking. cgroup_e_css() can only
> be used under cgroup_mutex because it is used during migration and has
> to test cgroup_ss_mask(). The right thing to do here would be
> renaming cgroup_e_css() to something else and add a new implementation
> which operates in the same way as cgroup_get_e_css().
>
> BTW, this should have triggered lockdep warning. I'd strongly
> recommend testing with lockdep enabled.
>
> Other than that, looks good to me.
>

I see. I've renamed the original cgroup_e_css() to
cgroup_e_css_by_mask() and then did what cgroup_get_e_css() did without
the get part in the new cgroup_e_css().
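
The new cgroup_e_css() then looks roughly like the below (a sketch; the
real version needs the matching comments and RCU annotations, and
callers must be in an RCU read section and css_tryget() themselves):

	struct cgroup_subsys_state *cgroup_e_css(struct cgroup *cgrp,
						 struct cgroup_subsys *ss)
	{
		struct cgroup_subsys_state *css;

		do {
			css = cgroup_css(cgrp, ss);
			if (css)
				return css;
			cgrp = cgroup_parent(cgrp);
		} while (cgrp);

		return init_css_set.subsys[ss->id];
	}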

Thanks,
Dennis

2018-09-06 21:32:35

by Dennis Zhou

[permalink] [raw]
Subject: Re: [PATCH 13/15] blkcg: change blkg reference counting to use percpu_ref

On Fri, Aug 31, 2018 at 05:31:59PM -0700, Tejun Heo wrote:
> Hello,
>
> On Thu, Aug 30, 2018 at 09:53:54PM -0400, Dennis Zhou wrote:
> > @@ -217,6 +240,10 @@ static struct blkcg_gq *blkg_create(struct blkcg *blkcg,
> > blkg_get(blkg->parent);
> > }
> >
> > + ret = percpu_ref_init(&blkg->refcnt, __blkg_release, 0, GFP_KERNEL);
>
> So, while this would work in some configs, you can't depend on an RCU
> grace period inside percpu_ref. blkg is now a percpu-reference-counted
> and RCU-protected object - it has to explicitly go through an RCU grace
> period before release.
>

Ah ok. I've made it into a call_rcu which should work.
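
i.e. something like the below (a sketch; it assumes an rcu_head embedded
in struct blkcg_gq, and the exact set of refs released may differ):

	static void __blkg_release(struct rcu_head *rcu)
	{
		struct blkcg_gq *blkg = container_of(rcu, struct blkcg_gq, rcu_head);

		/* release the blkcg and parent refs this blkg has been holding */
		css_put(&blkg->blkcg->css);
		if (blkg->parent)
			blkg_put(blkg->parent);

		blkg_free(blkg);
	}

	/* percpu_ref release callback: defer the real free past an RCU GP */
	static void blkg_release(struct percpu_ref *ref)
	{
		struct blkcg_gq *blkg = container_of(ref, struct blkcg_gq, refcnt);

		call_rcu(&blkg->rcu_head, __blkg_release);
	}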

Thanks,
Dennis

2018-09-06 23:01:42

by Dennis Zhou

[permalink] [raw]
Subject: Re: [PATCH 06/15] blkcg: always associate a bio with a blkg

On Fri, Aug 31, 2018 at 04:16:09PM -0700, Tejun Heo wrote:
> Hello,
>
> On Thu, Aug 30, 2018 at 09:53:47PM -0400, Dennis Zhou wrote:
> > From: "Dennis Zhou (Facebook)" <[email protected]>
> >
> > Previously, blkg's were only assigned as needed by blk-iolatency and
> > blk-throttle. bio->css was also always being associated while blkg was
> > being looked up and then thrown away in blkcg_bio_issue_check.
> >
> > This patch beings the cleanup of bio->css and bio->bi_blkg by always
> ^
> begins
>

Thanks!

> > +int bio_associate_create_blkg(struct request_queue *q, struct bio *bio)
> > +{
> > + struct blkcg *blkcg;
> > + struct blkcg_gq *blkg;
> > + int ret = 0;
> > +
> > + /* someone has already associated this bio with a blkg */
> > + if (bio->bi_blkg)
> > + return ret;
> > +
> > + rcu_read_lock();
> > +
> > + bio_associate_blkcg(bio, NULL);
> > + blkcg = bio_blkcg(bio);
> > +
> > + if (!blkcg->css.parent) {
> > + ret = bio_associate_blkg(bio, q->root_blkg);
> > + goto assoc_out;
> > + }
> > +
> > + blkg = blkg_lookup_create(blkcg, q);
> > + if (IS_ERR(blkg))
> > + blkg = q->root_blkg;
> > +
> > + ret = bio_associate_blkg(bio, blkg);
> > +assoc_out:
>
> Maybe if/else instead of goto?
>

Yeah, no need for goto.
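
So v2 will read roughly:

	if (!blkcg->css.parent) {
		ret = bio_associate_blkg(bio, q->root_blkg);
	} else {
		blkg = blkg_lookup_create(blkcg, q);
		if (IS_ERR(blkg))
			blkg = q->root_blkg;

		ret = bio_associate_blkg(bio, blkg);
	}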

Thanks,
Dennis

2018-09-07 03:18:08

by Chen, Rong A

[permalink] [raw]
Subject: [LKP] [blkcg] 6ef69a3a0b: WARNING:suspicious_RCU_usage

FYI, we noticed the following commit (built with gcc-7):

commit: 6ef69a3a0b4ac904f7c3b9cb78b5d51520dc84f4 ("[PATCH 13/15] blkcg: change blkg reference counting to use percpu_ref")
url: https://github.com/0day-ci/linux/commits/Dennis-Zhou/blkcg-ref-count-refactor-cleanup-blkcg-avg_lat/20180831-161742
base: https://git.kernel.org/cgit/linux/kernel/git/axboe/linux-block.git for-next

in testcase: trinity
with following parameters:

runtime: 300s

test-description: Trinity is a linux system call fuzz tester.
test-url: http://codemonkey.org.uk/projects/trinity/


on test machine: qemu-system-i386 -enable-kvm -cpu SandyBridge -m 256M

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):


+------------------------------------------------------------------------------------+------------+------------+
| | 22f657e287 | 6ef69a3a0b |
+------------------------------------------------------------------------------------+------------+------------+
| boot_successes | 0 | 0 |
| boot_failures | 14 | 33 |
| WARNING:at_mm/slab_common.c:#kmalloc_slab | 14 | 33 |
| EIP:kmalloc_slab | 14 | 33 |
| Mem-Info | 14 | 33 |
| WARNING:at_arch/x86/mm/dump_pagetables.c:#note_page | 14 | 31 |
| EIP:note_page | 14 | 31 |
| WARNING:suspicious_RCU_usage | 0 | 33 |
| include/linux/rcupdate.h:#Illegal_context_switch_in_RCU_read-side_critical_section | 0 | 33 |
| BUG:sleeping_function_called_from_invalid_context_at_kernel/locking/mutex.c | 0 | 33 |
+------------------------------------------------------------------------------------+------------+------------+



[ 5.313007] WARNING: suspicious RCU usage
[ 5.313705] 4.19.0-rc1-00175-g6ef69a3 #633 Tainted: G W
[ 5.314812] -----------------------------
[ 5.315231] include/linux/rcupdate.h:302 Illegal context switch in RCU read-side critical section!
[ 5.315231]
[ 5.315231] other info that might help us debug this:
[ 5.315231]
[ 5.315231]
[ 5.315231] rcu_scheduler_active = 2, debug_locks = 1
[ 5.315231] 4 locks held by swapper/1:
[ 5.315231] #0: (ptrval) (&dev->mutex){....}, at: __driver_attach+0x45/0xb0
[ 5.315231] #1: (ptrval) (ide_cfg_mtx){+.+.}, at: ide_port_setup_devices+0x1c/0x120
[ 5.315231] #2: (ptrval) (rcu_read_lock){....}, at: blkcg_init_queue+0x21/0x160
[ 5.315231] #3: (ptrval) (&(&q->__queue_lock)->rlock){....}, at: blkcg_init_queue+0x5e/0x160
[ 5.315231]
[ 5.315231] stack backtrace:
[ 5.315231] CPU: 0 PID: 1 Comm: swapper Tainted: G W 4.19.0-rc1-00175-g6ef69a3 #633
[ 5.315231] Call Trace:
[ 5.315231] ? dump_stack+0x16/0x26
[ 5.315231] ? lockdep_rcu_suspicious+0x91/0xa0
[ 5.315231] ? ___might_sleep+0x182/0x230
[ 5.315231] ? blkg_alloc+0x140/0x140
[ 5.315231] ? __might_sleep+0x2d/0x80
[ 5.315231] ? __mutex_lock+0x21/0x4e0
[ 5.315231] ? kvm_sched_clock_read+0x14/0x30
[ 5.315231] ? sched_clock+0x9/0x10
[ 5.315231] ? sched_clock_local+0x87/0x160
[ 5.315231] ? blkg_alloc+0x140/0x140
[ 5.315231] ? mutex_lock_killable_nested+0x14/0x20
[ 5.315231] ? pcpu_alloc+0x2c5/0x610
[ 5.315231] ? pcpu_alloc+0x2c5/0x610
[ 5.315231] ? kfree+0xdd/0x140
[ 5.315231] ? blkg_alloc+0x140/0x140
[ 5.315231] ? __alloc_percpu_gfp+0xb/0x10
[ 5.315231] ? percpu_ref_init+0x1e/0x90
[ 5.315231] ? blkg_create+0x18f/0x510
[ 5.315231] ? blkcg_init_queue+0x6c/0x160
[ 5.315231] ? blkcg_init_queue+0x21/0x160
[ 5.315231] ? blk_alloc_queue_node+0x2c5/0x370
[ 5.315231] ? ide_port_setup_devices+0x77/0x120
[ 5.315231] ? ide_host_register+0x567/0x5e0
[ 5.315231] ? ide_pci_init_two+0x56b/0x800
[ 5.315231] ? sched_clock_local+0x87/0x160
[ 5.315231] ? _raw_spin_unlock_irqrestore+0x2a/0x50
[ 5.315231] ? lockdep_hardirqs_on+0xec/0x1a0
[ 5.315231] ? _raw_spin_unlock_irqrestore+0x2a/0x50
[ 5.315231] ? trace_hardirqs_on+0x36/0xe0
[ 5.315231] ? __pm_runtime_resume+0x4e/0x80
[ 5.315231] ? ide_pci_init_one+0xd/0x10
[ 5.315231] ? piix_init_one+0x16/0x20
[ 5.315231] ? pci_device_probe+0xb5/0x140
[ 5.315231] ? really_probe+0x19b/0x290
[ 5.315231] ? driver_probe_device+0x49/0x140
[ 5.315231] ? __driver_attach+0xa9/0xb0
[ 5.315231] ? driver_probe_device+0x140/0x140
[ 5.315231] ? bus_for_each_dev+0x4f/0x80
[ 5.315231] ? driver_attach+0x14/0x20
[ 5.315231] ? driver_probe_device+0x140/0x140
[ 5.315231] ? bus_add_driver+0x157/0x1e0
[ 5.315231] ? pci_bus_num_vf+0x10/0x10
[ 5.315231] ? driver_register+0x51/0xe0
[ 5.315231] ? pdc202new_ide_init+0x16/0x16
[ 5.315231] ? __pci_register_driver+0x4b/0x50
[ 5.315231] ? piix_ide_init+0x8f/0x94
[ 5.315231] ? do_one_initcall+0xa1/0x1a7
[ 5.315231] ? rcu_read_lock_sched_held+0x4f/0x70
[ 5.315231] ? trace_initcall_level+0x57/0x80
[ 5.315231] ? kernel_init_freeable+0xdb/0x180
[ 5.315231] ? kernel_init_freeable+0x100/0x180
[ 5.315231] ? rest_init+0x90/0x90
[ 5.315231] ? kernel_init+0x8/0xf0
[ 5.315231] ? ret_from_fork+0x19/0x24
[ 5.315231] BUG: sleeping function called from invalid context at kernel/locking/mutex.c:908
[ 5.315231] in_atomic(): 1, irqs_disabled(): 1, pid: 1, name: swapper
[ 5.315231] 4 locks held by swapper/1:
[ 5.315231] #0: (ptrval) (&dev->mutex){....}, at: __driver_attach+0x45/0xb0
[ 5.315231] #1: (ptrval) (ide_cfg_mtx){+.+.}, at: ide_port_setup_devices+0x1c/0x120
[ 5.315231] #2: (ptrval) (rcu_read_lock){....}, at: blkcg_init_queue+0x21/0x160
[ 5.315231] #3: (ptrval) (&(&q->__queue_lock)->rlock){....}, at: blkcg_init_queue+0x5e/0x160
[ 5.315231] irq event stamp: 996210
[ 5.315231] hardirqs last enabled at (996209): [<47b49149>] kmem_cache_alloc_trace+0xa9/0x250
[ 5.315231] hardirqs last disabled at (996210): [<48cfea62>] _raw_spin_lock_irq+0x12/0x60
[ 5.315231] softirqs last enabled at (996106): [<48d01516>] __do_softirq+0x246/0x344
[ 5.315231] softirqs last disabled at (996097): [<47a0a74c>] do_softirq_own_stack+0x1c/0x30
[ 5.315231] CPU: 0 PID: 1 Comm: swapper Tainted: G W 4.19.0-rc1-00175-g6ef69a3 #633
[ 5.315231] Call Trace:
[ 5.315231] ? dump_stack+0x16/0x26
[ 5.315231] ? ___might_sleep+0x13b/0x230
[ 5.315231] ? blkg_alloc+0x140/0x140
[ 5.315231] ? __might_sleep+0x2d/0x80
[ 5.315231] ? __mutex_lock+0x21/0x4e0
[ 5.315231] ? kvm_sched_clock_read+0x14/0x30
[ 5.315231] ? sched_clock+0x9/0x10
[ 5.315231] ? sched_clock_local+0x87/0x160
[ 5.315231] ? blkg_alloc+0x140/0x140
[ 5.315231] ? mutex_lock_killable_nested+0x14/0x20
[ 5.315231] ? pcpu_alloc+0x2c5/0x610
[ 5.315231] ? pcpu_alloc+0x2c5/0x610
[ 5.315231] ? kfree+0xdd/0x140
[ 5.315231] ? blkg_alloc+0x140/0x140
[ 5.315231] ? __alloc_percpu_gfp+0xb/0x10
[ 5.315231] ? percpu_ref_init+0x1e/0x90
[ 5.315231] ? blkg_create+0x18f/0x510
[ 5.315231] ? blkcg_init_queue+0x6c/0x160
[ 5.315231] ? blkcg_init_queue+0x21/0x160
[ 5.315231] ? blk_alloc_queue_node+0x2c5/0x370
[ 5.315231] ? ide_port_setup_devices+0x77/0x120
[ 5.315231] ? ide_host_register+0x567/0x5e0
[ 5.315231] ? ide_pci_init_two+0x56b/0x800
[ 5.315231] ? sched_clock_local+0x87/0x160
[ 5.315231] ? _raw_spin_unlock_irqrestore+0x2a/0x50
[ 5.315231] ? lockdep_hardirqs_on+0xec/0x1a0
[ 5.315231] ? _raw_spin_unlock_irqrestore+0x2a/0x50
[ 5.315231] ? trace_hardirqs_on+0x36/0xe0
[ 5.315231] ? __pm_runtime_resume+0x4e/0x80
[ 5.315231] ? ide_pci_init_one+0xd/0x10
[ 5.315231] ? piix_init_one+0x16/0x20
[ 5.315231] ? pci_device_probe+0xb5/0x140
[ 5.315231] ? really_probe+0x19b/0x290
[ 5.315231] ? driver_probe_device+0x49/0x140
[ 5.315231] ? __driver_attach+0xa9/0xb0
[ 5.315231] ? driver_probe_device+0x140/0x140
[ 5.315231] ? bus_for_each_dev+0x4f/0x80
[ 5.315231] ? driver_attach+0x14/0x20
[ 5.315231] ? driver_probe_device+0x140/0x140
[ 5.315231] ? bus_add_driver+0x157/0x1e0
[ 5.315231] ? pci_bus_num_vf+0x10/0x10
[ 5.315231] ? driver_register+0x51/0xe0
[ 5.315231] ? pdc202new_ide_init+0x16/0x16
[ 5.315231] ? __pci_register_driver+0x4b/0x50
[ 5.315231] ? piix_ide_init+0x8f/0x94
[ 5.315231] ? do_one_initcall+0xa1/0x1a7
[ 5.315231] ? rcu_read_lock_sched_held+0x4f/0x70
[ 5.315231] ? trace_initcall_level+0x57/0x80
[ 5.315231] ? kernel_init_freeable+0xdb/0x180
[ 5.315231] ? kernel_init_freeable+0x100/0x180
[ 5.315231] ? rest_init+0x90/0x90
[ 5.315231] ? kernel_init+0x8/0xf0
[ 5.315231] ? ret_from_fork+0x19/0x24
[ 5.418590] ide_generic: please use "probe_mask=0x3f" module parameter for probing all legacy ISA IDE ports
[ 5.420208] Loading iSCSI transport class v2.0-870.
[ 5.424773] rdac: device handler registered
[ 5.425612] hp_sw: device handler registered
[ 5.426442] alua: device handler registered
[ 5.427168] st: Version 20160209, fixed bufsize 32768, s/g segs 256
[ 5.428294] osst :I: Tape driver with OnStream support version 0.99.4
[ 5.428294] osst :I: $Id: osst.c,v 1.73 2005/01/01 21:13:34 wriede Exp $
[ 5.431683] Rounding down aligned max_sectors from 4294967295 to 4294967288
[ 5.433091] db_root: cannot open: /etc/target
[ 5.434136] SSFDC read-only Flash Translation layer
[ 5.435107] L440GX flash mapping: failed to find PIIX4 ISA bridge, cannot continue
[ 5.436418] device id = 2440
[ 5.436921] device id = 2480
[ 5.437430] device id = 24c0
[ 5.437931] device id = 24d0
[ 5.438464] device id = 25a1
[ 5.438975] device id = 2670
[ 5.439673] slram: not enough parameters.
[ 5.557989] No valid DiskOnChip devices found
[ 5.575575] [nandsim] warning: read_byte: unexpected data output cycle, state is STATE_READY return 0x0
[ 5.577267] [nandsim] warning: read_byte: unexpected data output cycle, state is STATE_READY return 0x0
[ 5.578759] [nandsim] warning: read_byte: unexpected data output cycle, state is STATE_READY return 0x0
[ 5.580241] [nandsim] warning: read_byte: unexpected data output cycle, state is STATE_READY return 0x0
[ 5.581727] [nandsim] warning: read_byte: unexpected data output cycle, state is STATE_READY return 0x0
[ 5.583206] [nandsim] warning: read_byte: unexpected data output cycle, state is STATE_READY return 0x0
[ 5.584700] nand: device found, Manufacturer ID: 0x98, Chip ID: 0x39
[ 5.585730] nand: Toshiba NAND 128MiB 1,8V 8-bit
[ 5.586474] nand: 128 MiB, SLC, erase size: 16 KiB, page size: 512, OOB size: 16
[ 5.588284] flash size: 128 MiB
[ 5.588800] page size: 512 bytes
[ 5.589327] OOB area size: 16 bytes
[ 5.589887] sector size: 16 KiB
[ 5.591573] pages number: 262144


To reproduce:

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> job-script # job-script is attached in this email



Thanks,
Rong, Chen


Attachments:
(No filename) (11.55 kB)
config-4.19.0-rc1-00175-g6ef69a3 (132.76 kB)
job-script (3.96 kB)
dmesg.xz (31.46 kB)
Download all attachments

2018-09-07 04:00:04

by Chen, Rong A

[permalink] [raw]
Subject: [LKP] [blkcg] c02c58dab2: WARNING:at_block/blk-throttle.c:#blk_throtl_bio

FYI, we noticed the following commit (built with gcc-6):

commit: c02c58dab2480ec45dc43e1e10970d763e6b7f1f ("[PATCH 06/15] blkcg: always associate a bio with a blkg")
url: https://github.com/0day-ci/linux/commits/Dennis-Zhou/blkcg-ref-count-refactor-cleanup-blkcg-avg_lat/20180831-161742
base: https://git.kernel.org/cgit/linux/kernel/git/axboe/linux-block.git for-next

in testcase: trinity
with following parameters:

runtime: 300s

test-description: Trinity is a linux system call fuzz tester.
test-url: http://codemonkey.org.uk/projects/trinity/


on test machine: qemu-system-x86_64 -enable-kvm -cpu host -smp 2 -m 1G

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):


+------------------------------------------------------------------+------------+------------+
| | 1a3eeea831 | c02c58dab2 |
+------------------------------------------------------------------+------------+------------+
| boot_successes | 6 | 0 |
| boot_failures | 6 | 16 |
| invoked_oom-killer:gfp_mask=0x | 4 | 6 |
| Mem-Info | 5 | 9 |
| Kernel_panic-not_syncing:Out_of_memory_and_no_killable_processes | 4 | 6 |
| Out_of_memory:Kill_process | 2 | 3 |
| WARNING:at_block/blk-throttle.c:#blk_throtl_bio | 0 | 10 |
| RIP:blk_throtl_bio | 0 | 10 |
+------------------------------------------------------------------+------------+------------+



[ 120.023103] WARNING: CPU: 1 PID: 1 at block/blk-throttle.c:2149 blk_throtl_bio+0xdaf/0x2490
[ 120.051033] CPU: 1 PID: 1 Comm: swapper/0 Not tainted 4.19.0-rc1-00168-gc02c58d #1
[ 120.074200] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.10.2-1 04/01/2014
[ 120.100912] RIP: 0010:blk_throtl_bio+0xdaf/0x2490
[ 120.114291] Code: 08 84 d2 0f 85 1a 13 00 00 66 41 81 4f 14 00 02 e9 ed f3 ff ff 48 83 05 ce 31 ff 05 01 e9 f3 fb ff ff 48 83 05 39 45 ff 05 01 <0f> 0b 48 83 05 37 45 ff 05 01 e9 25 f3 ff ff 49 8d bf 28 02 00 00
[ 120.167531] RSP: 0000:ffff880030e36fd0 EFLAGS: 00010202
[ 120.184191] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[ 120.204216] RDX: 0000000000000000 RSI: ffffffffa72f04c0 RDI: 0000000000000206
[ 120.224215] RBP: ffff880030e370a0 R08: 0000000002384946 R09: 00000000023a2d90
[ 120.247539] R10: ffffed00062bc4fa R11: ffff8800315e27d3 R12: ffff880030f60150
[ 120.267576] R13: ffff88001c7a5b00 R14: ffff880030794400 R15: ffff880030f60140
[ 120.287560] FS: 0000000000000000(0000) GS:ffff880031400000(0000) knlGS:0000000000000000
[ 120.314198] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 120.330963] CR2: 0000000000000000 CR3: 000000000fc6d000 CR4: 00000000000006a0
[ 120.350859] Call Trace:
[ 120.360930] ? bio_associate_create_blkg+0x30d/0x950
[ 120.374222] ? reacquire_held_locks+0x400/0x400
[ 120.387606] ? bio_associate_blkg+0x240/0x240
[ 120.404166] ? do_raw_spin_unlock+0x16a/0x2d0
[ 120.417570] ? _raw_spin_unlock+0x5c/0xa0
[ 120.430924] generic_make_request_checks+0x8fa/0x14b0
[ 120.444270] ? percpu_ref_put_many+0x1c0/0x1c0
[ 120.460957] ? kasan_check_write+0x24/0x30
[ 120.474243] ? sched_clock_local+0x99/0x1c0
[ 120.487567] generic_make_request+0x237/0xdf0
[ 120.500949] ? sched_clock_cpu+0x20c/0x2a0
[ 120.514268] ? blk_plug_queued_count+0x180/0x180
[ 120.530925] ? debug_smp_processor_id+0x1f/0x30
[ 120.544243] submit_bio+0x2a0/0x410
[ 120.557564] ? submit_bio+0x2a0/0x410
[ 120.568470] ? lock_acquire+0x112/0x1d0
[ 120.580903] ? guard_bio_eod+0xb0/0x420
[ 120.590936] ? direct_make_request+0x240/0x240
[ 120.607501] ? guard_bio_eod+0x1b6/0x420
[ 120.617592] ? bio_add_page+0xd0/0x100
[ 120.630913] submit_bh_wbc+0x526/0x840
[ 120.640978] ? unlock_buffer+0x40/0x40
[ 120.654304] block_read_full_page+0x807/0xba0
[ 120.667591] ? bh_submit_read+0x240/0x240
[ 120.673290] ? create_page_buffers+0x210/0x210
[ 120.691014] ? add_to_page_cache_locked+0x20/0x20
[ 120.707532] ? alloc_page_interleave+0x139/0x1b0
[ 120.724230] ? __next_node_in+0x59/0x70
[ 120.734304] blkdev_readpage+0x1b/0x30
[ 120.747597] do_read_cache_page+0x795/0x1150
[ 120.760923] ? blkdev_writepages+0x20/0x20
[ 120.774234] ? kasan_unpoison_shadow+0x3d/0x60
[ 120.787530] ? preempt_count_add+0x159/0x210
[ 120.800913] ? pagecache_get_page+0x6f0/0x6f0
[ 120.814162] ? __this_cpu_preempt_check+0x1b/0x30
[ 120.830921] ? kasan_unpoison_shadow+0x3d/0x60
[ 120.844270] ? kasan_alloc_pages+0x40/0x50
[ 120.857909] ? get_page_from_freelist+0x2023/0x33b0
[ 120.870909] read_cache_page+0x53/0x90
[ 120.884132] read_dev_sector+0xc8/0x2c0
[ 120.897509] ? set_info+0x110/0x110
[ 120.907509] msdos_partition+0x231/0x2610
[ 120.920894] ? memcpy+0x6d/0x80
[ 120.930915] ? vsnprintf+0x96f/0x1cf0
[ 120.944176] ? set_info+0x110/0x110
[ 120.984325] ? snprintf+0x8f/0xb0
[ 120.997559] ? snprintf+0x8f/0xb0
[ 121.007498] ? vscnprintf+0x40/0x40
[ 121.017520] ? __next_node_in+0x59/0x70
[ 121.030968] ? set_info+0x110/0x110
[ 121.044221] ? set_info+0x110/0x110
[ 121.057506] check_partition+0x3db/0x7d0
[ 121.070920] rescan_partitions+0x192/0xac0
[ 121.080900] ? __might_sleep+0xad/0x1e0
[ 121.094193] ? bd_set_size+0x305/0x3c0
[ 121.107505] __blkdev_get+0x8c3/0x13b0
[ 121.117547] ? bd_set_size+0x3c0/0x3c0
[ 121.130907] ? debug_smp_processor_id+0x1f/0x30
[ 121.144195] blkdev_get+0x41c/0x9f0
[ 121.157679] ? refcount_sub_and_test_checked+0x100/0x1e0
[ 121.170883] ? __blkdev_get+0x13b0/0x13b0
[ 121.184233] ? do_raw_spin_unlock+0x16a/0x2d0
[ 121.197551] ? refcount_dec_and_test_checked+0x19/0x30
[ 121.214214] ? kobject_put+0x61/0x5a0
[ 121.227913] __device_add_disk+0xfc6/0x12d0
[ 121.241719] ? lock_acquire+0x112/0x1d0
[ 121.254191] ? bdget_disk+0xb0/0xb0
[ 121.291132] ? lockdep_init_map+0x11/0x20
[ 121.304263] ? lockdep_init_map+0x11/0x20
[ 121.317606] ? __raw_spin_lock_init+0x3d/0x120
[ 121.331152] ? device_initialize+0x2d3/0x3e0
[ 121.344254] device_add_disk+0x16/0x20
[ 121.357510] null_add_dev+0xcd5/0x1fe0
[ 121.370892] null_init+0x4cb/0x6c6
[ 121.380903] ? pkt_init+0x578/0x578
[ 121.394231] do_one_initcall+0x191/0x3f3
[ 121.407482] ? start_kernel+0xa52/0xa52
[ 121.417522] ? kasan_unpoison_shadow+0x3d/0x60
[ 121.434217] kernel_init_freeable+0x52f/0x6bb
[ 121.447500] ? rest_init+0x1a0/0x1a0
[ 121.457489] kernel_init+0x16/0x220
[ 121.470867] ? rest_init+0x1a0/0x1a0
[ 121.480914] ret_from_fork+0x1f/0x30
[ 121.494348] ---[ end trace bb75a6ff13d6153b ]---


To reproduce:

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> job-script # job-script is attached in this email



Thanks,
Rong, Chen


Attachments:
(No filename) (7.11 kB)
config-4.19.0-rc1-00168-gc02c58d (119.98 kB)
job-script (4.01 kB)
dmesg.xz (19.66 kB)
trinity (31.07 kB)
Download all attachments

2018-09-11 02:37:14

by Chen, Rong A

[permalink] [raw]
Subject: [LKP] [blkcg] 22f657e287: general_protection_fault:#[##]

FYI, we noticed the following commit (built with gcc-7):

commit: 22f657e2876612270ad346b7f5ba2493ba434d41 ("[PATCH 12/15] blkcg: cleanup and make blk_get_rl use blkg_lookup_create")
url: https://github.com/0day-ci/linux/commits/Dennis-Zhou/blkcg-ref-count-refactor-cleanup-blkcg-avg_lat/20180831-161742
base: https://git.kernel.org/cgit/linux/kernel/git/axboe/linux-block.git for-next

in testcase: trinity
with following parameters:

runtime: 300s

test-description: Trinity is a linux system call fuzz tester.
test-url: http://codemonkey.org.uk/projects/trinity/


on test machine: qemu-system-x86_64 -enable-kvm -cpu Haswell,+smep,+smap -smp 2 -m 512M

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):


+------------------------------------------------------------------+------------+------------+
| | f743a58719 | 22f657e287 |
+------------------------------------------------------------------+------------+------------+
| boot_successes | 3 | 0 |
| boot_failures | 10 | 16 |
| invoked_oom-killer:gfp_mask=0x | 6 | 6 |
| Mem-Info | 6 | 6 |
| Kernel_panic-not_syncing:Out_of_memory_and_no_killable_processes | 6 | 6 |
| IP-Config:Auto-configuration_of_network_failed | 4 | |
| general_protection_fault:#[##] | 0 | 10 |
| RIP:get_request | 0 | 10 |
| Kernel_panic-not_syncing:Fatal_exception | 0 | 10 |
+------------------------------------------------------------------+------------+------------+



[ 93.607840] SCSI Media Changer driver v0.25
[ 93.667470] scsi host0: scsi_debug: version 0188 [20180128]
[ 93.667470] dev_size_mb=8, opts=0x0, submit_queues=1, statistics=0
[ 93.756552] kasan: CONFIG_KASAN_INLINE enabled
[ 93.766196] kasan: GPF could be caused by NULL-ptr deref or user memory access
[ 93.766196] general protection fault: 0000 [#1] PREEMPT KASAN
[ 93.766196] CPU: 0 PID: 27 Comm: kworker/u2:1 Not tainted 4.19.0-rc1-00174-g22f657e #1
[ 93.766196] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.10.2-1 04/01/2014
[ 93.766196] Workqueue: events_unbound async_run_entry_fn
[ 93.766196] RIP: 0010:get_request+0x11f/0xe24
[ 93.766196] Code: 83 b8 f0 00 00 00 00 74 02 0f 0b e8 6b 78 46 ff 48 8b 44 24 10 48 8d 78 60 48 b8 00 00 00 00 00 fc ff df 48 89 fa 48 c1 ea 03 <80> 3c 02 00 74 05 e8 6d 16 63 ff 48 8b 44 24 10 48 bd 00 00 00 00
[ 93.766196] RSP: 0000:ffff880016c07850 EFLAGS: 00010006
[ 93.766196] RAX: dffffc0000000000 RBX: dffffc0000000000 RCX: 0000000000000008
[ 93.766196] RDX: 000000000000000c RSI: 0000000000000020 RDI: 0000000000000060
[ 93.766196] RBP: ffff88001463b390 R08: 0000000000600000 R09: ffffed0002d80f0f
[ 93.766196] R10: 0000000000000000 R11: ffff880016c07877 R12: 0000000000600000
[ 93.766196] R13: 0000000000000000 R14: 0000000000000020 R15: ffff880014639540
[ 93.766196] FS: 0000000000000000(0000) GS:ffffffff8427e000(0000) knlGS:0000000000000000
[ 93.766196] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 93.766196] CR2: 0000000000000000 CR3: 000000000422c001 CR4: 00000000000206b0
[ 93.766196] Call Trace:
[ 93.766196] ? blk_rq_init+0x27c/0x27c
[ 93.766196] ? blk_exit_rl+0x55/0x55
[ 93.766196] ? __wake_up_common_lock+0x140/0x140
[ 93.766196] ? tracer_preempt_on+0x16/0x25
[ 93.766196] ? preempt_count_sub+0x12d/0x136
[ 93.766196] ? task_unlock+0xa/0x1a
[ 93.766196] ? create_task_io_context+0x2c7/0x2cf
[ 93.766196] blk_get_request+0x14d/0x277
[ 93.766196] __scsi_execute+0x67/0x466
[ 93.766196] scsi_probe_and_add_lun+0x399/0x1d14
[ 93.766196] ? rpm_resume+0xad5/0xb05
[ 93.766196] ? scsi_sanitize_inquiry_string+0x77/0x77
[ 93.766196] ? rpm_put_suppliers+0x10e/0x10e
[ 93.766196] ? scsi_target_reap_ref_release+0x6a/0x6a
[ 93.766196] ? tracer_preempt_on+0x16/0x25
[ 93.766196] ? preempt_count_sub+0x12d/0x136
[ 93.766196] __scsi_scan_target+0x130/0x6af
[ 93.766196] ? __free_pages+0x3c/0x3c
[ 93.766196] ? scsi_probe_and_add_lun+0x1d14/0x1d14
[ 93.766196] ? rpm_resume+0xad5/0xb05
[ 93.766196] ? rpm_put_suppliers+0x10e/0x10e
[ 93.766196] ? __switch_to_asm+0x30/0x60
[ 93.766196] ? ___might_sleep+0xac/0x33e
[ 93.766196] scsi_scan_channel+0xcb/0xe8
[ 93.766196] scsi_scan_host_selected+0x1ca/0x201
[ 93.766196] ? do_scsi_scan_host+0x18a/0x18a
[ 93.766196] do_scan_async+0x3e/0x2ff
[ 93.766196] ? do_scsi_scan_host+0x18a/0x18a
[ 93.766196] async_run_entry_fn+0x1c5/0x33c
[ 93.766196] process_one_work+0x4c0/0x6cd
[ 93.766196] ? preempt_count_sub+0x12d/0x136
[ 93.766196] worker_thread+0x4b3/0x610
[ 93.766196] ? __kthread_parkme+0x9f/0x148
[ 93.766196] kthread+0x2c5/0x2d4
[ 93.766196] ? process_scheduled_works+0x6d/0x6d
[ 93.766196] ? __kthread_cancel_work+0x16b/0x16b
[ 93.766196] ret_from_fork+0x35/0x40
[ 93.766196] ---[ end trace a8869917661828b0 ]---
[ 93.766196] RIP: 0010:get_request+0x11f/0xe24
[ 93.766196] Code: 83 b8 f0 00 00 00 00 74 02 0f 0b e8 6b 78 46 ff 48 8b 44 24 10 48 8d 78 60 48 b8 00 00 00 00 00 fc ff df 48 89 fa 48 c1 ea 03 <80> 3c 02 00 74 05 e8 6d 16 63 ff 48 8b 44 24 10 48 bd 00 00 00 00
[ 93.766196] RSP: 0000:ffff880016c07850 EFLAGS: 00010006
[ 93.766196] RAX: dffffc0000000000 RBX: dffffc0000000000 RCX: 0000000000000008
[ 93.766196] RDX: 000000000000000c RSI: 0000000000000020 RDI: 0000000000000060
[ 93.766196] RBP: ffff88001463b390 R08: 0000000000600000 R09: ffffed0002d80f0f
[ 93.766196] R10: 0000000000000000 R11: ffff880016c07877 R12: 0000000000600000
[ 93.766196] R13: 0000000000000000 R14: 0000000000000020 R15: ffff880014639540
[ 93.766196] FS: 0000000000000000(0000) GS:ffffffff8427e000(0000) knlGS:0000000000000000
[ 93.766196] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 93.766196] CR2: 0000000000000000 CR3: 000000000422c001 CR4: 00000000000206b0
[ 93.766196] Kernel panic - not syncing: Fatal exception
[ 93.766196] Kernel Offset: disabled

Elapsed time: 100

To reproduce:

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> job-script # job-script is attached in this email



Thanks,
Rong, Chen


Attachments:
(No filename) (6.63 kB)
config-4.19.0-rc1-00174-g22f657e (116.92 kB)
job-script (3.77 kB)
dmesg.xz (10.59 kB)
Download all attachments