Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754802Ab3H1Srn (ORCPT ); Wed, 28 Aug 2013 14:47:43 -0400 Received: from mail-pa0-f54.google.com ([209.85.220.54]:51601 "EHLO mail-pa0-f54.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753173Ab3H1Srl (ORCPT ); Wed, 28 Aug 2013 14:47:41 -0400 MIME-Version: 1.0 In-Reply-To: References: <52132243.8050803@profihost.ag> <521323D1.1070503@profihost.ag> <20130821234715.GA4630@moria.home.lan> <5215A130.9080508@profihost.ag> <20130822054336.GA14935@kmo-pixel> <5215A8A8.90900@profihost.ag> <20130822060215.GB14935@kmo-pixel> <5215BEA4.6040101@profihost.ag> <521BAACB.4070003@profihost.ag> <20130828060521.GC8032@kmo-pixel> Date: Wed, 28 Aug 2013 11:47:40 -0700 Message-ID: Subject: Re: bcache: Fix a writeback performance regression From: kernel neophyte To: Stefan Priebe - Profihost AG Cc: Kent Overstreet , Jens Axboe , "linux-kernel@vger.kernel.org" , "linux-bcache@vger.kernel.org" Content-Type: text/plain; charset=ISO-8859-1 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 46224 Lines: 780 On Wed, Aug 28, 2013 at 11:38 AM, Stefan Priebe - Profihost AG wrote: > I don't had one for a few days. Which kernel so you use? 3.10 kernel with all of kent's stable patches and perf patches.. and all defaults except for: echo writeback > /sys/block/bcache0/bcache/cache_mode echo 0 > /sys/block/bcache0/bcache/sequential_cutoff echo 0 > /sys/fs/bcache/1e26e5ca-5cb9-4c5a-ad7e-3c787524a9dc/congested_read_threshold_us echo 0 > /sys/fs/bcache/1e26e5ca-5cb9-4c5a-ad7e-3c787524a9dc/congested_write_threshold_us -Neo > > Stefan > > This mail was sent with my iPhone. > > Am 28.08.2013 um 20:12 schrieb kernel neophyte : > >> On Tue, Aug 27, 2013 at 11:05 PM, Kent Overstreet wrote: >>> On Tue, Aug 27, 2013 at 08:37:54PM -0700, kernel neophyte wrote: >>>> Hi Kent, >>>> >>>> I am still seeing deadlock: >>> >>> Seeing blk_throtl_bio (what asshole misspells words in their function >>> names, so when you go to grep for them you grep for the wrong thing?) in >>> the backtrace - is that what it takes to hit it? >> >> I don't think so, >> I had another deadlock: >> >> [ 1179.193990] bio: create slab at 2 >> [ 1179.194491] bcache: run_cache_set() invalidating existing data >> [ 1179.212308] bcache: register_cache() registered cache device rxd1 >> [ 1203.846220] bcache: register_bdev() registered backing device sda3 >> [ 1227.780088] bcache: bch_cached_dev_attach() Caching sda3 as bcache0 >> on set a079dda3-27fd-4b1c-a616-d46110bfae6c >> [ 2162.497950] INFO: task kworker/0:0:4 blocked for more than 120 seconds. >> [ 2162.497961] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.497967] kworker/0:0 D 0000000000000000 0 4 2 0x00000000 >> [ 2162.497982] Workqueue: events write_dirty_finish >> [ 2162.497986] ffff882fa6ac9b30 0000000000000046 ffffffff81511b8c >> ffffffff815137c0 >> [ 2162.497992] ffff882fa6a84cb0 ffff882fa6ac9fd8 ffff882fa6ac9fd8 >> ffff882fa6ac9fd8 >> [ 2162.497996] ffff882f78139990 ffff882fa6a84cb0 ffffffff815117a1 >> ffff882fa6a84cb0 >> [ 2162.498001] Call Trace: >> [ 2162.498010] [] ? bch_btree_map_nodes_recurse+0x6c/0x170 >> [ 2162.498015] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498019] [] ? bch_btree_node_get+0x71/0x280 >> [ 2162.498028] [] schedule+0x29/0x70 >> [ 2162.498034] [] rwsem_down_read_failed+0x9d/0xe5 >> [ 2162.498043] [] call_rwsem_down_read_failed+0x14/0x30 >> [ 2162.498048] [] ? down_read+0x24/0x2b >> [ 2162.498052] [] __bch_btree_map_nodes+0xe5/0x1c0 >> [ 2162.498057] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498066] [] ? update_curr+0x141/0x1f0 >> [ 2162.498072] [] ? dirty_io_destructor+0xe/0x10 >> [ 2162.498076] [] bch_btree_insert+0xb4/0x120 >> [ 2162.498082] [] write_dirty_finish+0x1e3/0x270 >> [ 2162.498087] [] ? pick_next_task_fair+0x6b/0x150 >> [ 2162.498093] [] ? dequeue_task+0x93/0xb0 >> [ 2162.498100] [] process_one_work+0x174/0x490 >> [ 2162.498105] [] worker_thread+0x11b/0x370 >> [ 2162.498110] [] ? manage_workers.isra.23+0x2d0/0x2d0 >> [ 2162.498115] [] kthread+0xc0/0xd0 >> [ 2162.498120] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498127] [] ret_from_fork+0x7c/0xb0 >> [ 2162.498131] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498145] INFO: task bcache_allocato:1766 blocked for more than >> 120 seconds. >> [ 2162.498149] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.498154] bcache_allocato D 0000000000000001 0 1766 2 0x00000000 >> [ 2162.498158] ffff8822630f3d88 0000000000000046 ffff8822630f3da8 >> ffffffff810808ad >> [ 2162.498163] ffff882f891de640 ffff8822630f3fd8 ffff8822630f3fd8 >> ffff8822630f3fd8 >> [ 2162.498167] ffff882f81a91990 ffff882f891de640 ffff882f88b33320 >> ffff8801fad20d98 >> [ 2162.498172] Call Trace: >> [ 2162.498177] [] ? dequeue_task_fair+0x2cd/0x530 >> [ 2162.498182] [] schedule+0x29/0x70 >> [ 2162.498187] [] schedule_preempt_disabled+0xe/0x10 >> [ 2162.498192] [] __mutex_lock_slowpath+0x112/0x1b0 >> [ 2162.498197] [] mutex_lock+0x2a/0x50 >> [ 2162.498202] [] bch_allocator_thread+0x10f/0xe20 >> [ 2162.498207] [] ? bch_bucket_add_unused+0xe0/0xe0 >> [ 2162.498211] [] kthread+0xc0/0xd0 >> [ 2162.498215] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498220] [] ret_from_fork+0x7c/0xb0 >> [ 2162.498224] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498228] INFO: task bcache_writebac:1771 blocked for more than >> 120 seconds. >> [ 2162.498233] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.498237] bcache_writebac D 0000000000000000 0 1771 2 0x00000000 >> [ 2162.498241] ffff88197d2adc58 0000000000000046 ffff88197d2add48 >> ffffffff811269e2 >> [ 2162.498245] ffff882f891d8000 ffff88197d2adfd8 ffff88197d2adfd8 >> ffff88197d2adfd8 >> [ 2162.498250] ffff882f81a91990 ffff882f891d8000 ffff88197d2adc68 >> ffff882f89550ba0 >> [ 2162.498254] Call Trace: >> [ 2162.498264] [] ? __alloc_pages_nodemask+0x152/0x9b0 >> [ 2162.498269] [] schedule+0x29/0x70 >> [ 2162.498274] [] schedule_timeout+0x1e5/0x250 >> [ 2162.498280] [] ? try_to_wake_up+0x24f/0x2b0 >> [ 2162.498285] [] __down_common+0x8e/0xe3 >> [ 2162.498293] [] ? alloc_pages_current+0x90/0x170 >> [ 2162.498298] [] __down+0x1d/0x1f >> [ 2162.498304] [] down+0x41/0x50 >> [ 2162.498310] [] bch_writeback_thread+0x35d/0x7f0 >> [ 2162.498315] [] ? idle_balance+0xeb/0x150 >> [ 2162.498321] [] ? write_dirty+0xc0/0xc0 >> [ 2162.498325] [] kthread+0xc0/0xd0 >> [ 2162.498329] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498334] [] ret_from_fork+0x7c/0xb0 >> [ 2162.498338] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498342] INFO: task kworker/0:2:1789 blocked for more than 120 seconds. >> [ 2162.498346] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.498351] kworker/0:2 D ffffffff81813a60 0 1789 2 0x00000000 >> [ 2162.498357] Workqueue: events write_dirty_finish >> [ 2162.498360] ffff88277062fb30 0000000000000046 ffffffff81511b8c >> ffffffff815137c0 >> [ 2162.498364] ffff882fa6786640 ffff88277062ffd8 ffff88277062ffd8 >> ffff88277062ffd8 >> [ 2162.498368] ffffffff81c10440 ffff882fa6786640 ffffffff815117a1 >> ffff882fa6786640 >> [ 2162.498373] Call Trace: >> [ 2162.498378] [] ? bch_btree_map_nodes_recurse+0x6c/0x170 >> [ 2162.498382] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498387] [] ? bch_btree_node_get+0x71/0x280 >> [ 2162.498392] [] schedule+0x29/0x70 >> [ 2162.498397] [] rwsem_down_read_failed+0x9d/0xe5 >> [ 2162.498402] [] call_rwsem_down_read_failed+0x14/0x30 >> [ 2162.498407] [] ? down_read+0x24/0x2b >> [ 2162.498411] [] __bch_btree_map_nodes+0xe5/0x1c0 >> [ 2162.498416] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498420] [] ? update_curr+0x141/0x1f0 >> [ 2162.498425] [] ? dirty_io_destructor+0xe/0x10 >> [ 2162.498430] [] bch_btree_insert+0xb4/0x120 >> [ 2162.498435] [] write_dirty_finish+0x1e3/0x270 >> [ 2162.498439] [] ? pick_next_task_fair+0x6b/0x150 >> [ 2162.498444] [] ? dequeue_task+0x93/0xb0 >> [ 2162.498449] [] process_one_work+0x174/0x490 >> [ 2162.498454] [] worker_thread+0x11b/0x370 >> [ 2162.498459] [] ? manage_workers.isra.23+0x2d0/0x2d0 >> [ 2162.498463] [] kthread+0xc0/0xd0 >> [ 2162.498467] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498472] [] ret_from_fork+0x7c/0xb0 >> [ 2162.498476] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498480] INFO: task iozone:2052 blocked for more than 120 seconds. >> [ 2162.498484] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.498488] iozone D 0000000000000001 0 2052 1944 0x00000000 >> [ 2162.498492] ffff882f7e2d3978 0000000000000082 ffff882f7e2d3938 >> ffffffff81301d7a >> [ 2162.498497] ffff882f8a116640 ffff882f7e2d3fd8 ffff882f7e2d3fd8 >> ffff882f7e2d3fd8 >> [ 2162.498501] ffff882f88b33320 ffff882f8a116640 ffff882f7e2d3978 >> ffff882fbf2339f8 >> [ 2162.498505] Call Trace: >> [ 2162.498511] [] ? generic_make_request+0xca/0x100 >> [ 2162.498517] [] schedule+0x29/0x70 >> [ 2162.498522] [] io_schedule+0x8f/0xd0 >> [ 2162.498530] [] do_blockdev_direct_IO+0x1a7c/0x1fb0 >> [ 2162.498539] [] ? ext2_get_blocks+0xa60/0xa60 [ext2] >> [ 2162.498546] [] __blockdev_direct_IO+0x55/0x60 >> [ 2162.498552] [] ? ext2_get_blocks+0xa60/0xa60 [ext2] >> [ 2162.498556] [] ? update_curr+0x141/0x1f0 >> [ 2162.498562] [] ext2_direct_IO+0x79/0xe0 [ext2] >> [ 2162.498567] [] ? ext2_get_blocks+0xa60/0xa60 [ext2] >> [ 2162.498572] [] ? set_next_entity+0xa5/0xc0 >> [ 2162.498579] [] ? current_fs_time+0x16/0x60 >> [ 2162.498585] [] generic_file_direct_write+0xc6/0x180 >> [ 2162.498590] [] __generic_file_aio_write+0x2dd/0x3b0 >> [ 2162.498595] [] ? mutex_unlock+0x14/0x20 >> [ 2162.498600] [] generic_file_aio_write+0x69/0xd0 >> [ 2162.498607] [] do_sync_write+0x7a/0xb0 >> [ 2162.498612] [] vfs_write+0xce/0x1e0 >> [ 2162.498617] [] ? generic_file_llseek_size+0x110/0x110 >> [ 2162.498621] [] SyS_write+0x52/0xa0 >> [ 2162.498627] [] system_call_fastpath+0x16/0x1b >> [ 2162.498630] INFO: task kworker/0:1:2293 blocked for more than 120 seconds. >> [ 2162.498635] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.498639] kworker/0:1 D ffffffff81813a60 0 2293 2 0x00000000 >> [ 2162.498645] Workqueue: events write_dirty_finish >> [ 2162.498647] ffff882f8a05bb30 0000000000000046 ffffffff81511b8c >> ffffffff815137c0 >> [ 2162.498652] ffff882f8a163320 ffff882f8a05bfd8 ffff882f8a05bfd8 >> ffff882f8a05bfd8 >> [ 2162.498656] ffffffff81c10440 ffff882f8a163320 ffffffff815117a1 >> ffff882f8a163320 >> [ 2162.498661] Call Trace: >> [ 2162.498665] [] ? bch_btree_map_nodes_recurse+0x6c/0x170 >> [ 2162.498669] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498674] [] ? bch_btree_node_get+0x71/0x280 >> [ 2162.498679] [] schedule+0x29/0x70 >> [ 2162.498684] [] rwsem_down_read_failed+0x9d/0xe5 >> [ 2162.498689] [] call_rwsem_down_read_failed+0x14/0x30 >> [ 2162.498693] [] ? down_read+0x24/0x2b >> [ 2162.498698] [] __bch_btree_map_nodes+0xe5/0x1c0 >> [ 2162.498702] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498707] [] ? update_curr+0x141/0x1f0 >> [ 2162.498712] [] ? dirty_io_destructor+0xe/0x10 >> [ 2162.498716] [] bch_btree_insert+0xb4/0x120 >> [ 2162.498721] [] write_dirty_finish+0x1e3/0x270 >> [ 2162.498726] [] ? pick_next_task_fair+0x6b/0x150 >> [ 2162.498731] [] ? dequeue_task+0x93/0xb0 >> [ 2162.498735] [] process_one_work+0x174/0x490 >> [ 2162.498740] [] worker_thread+0x11b/0x370 >> [ 2162.498745] [] ? manage_workers.isra.23+0x2d0/0x2d0 >> [ 2162.498749] [] kthread+0xc0/0xd0 >> [ 2162.498753] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498758] [] ret_from_fork+0x7c/0xb0 >> [ 2162.498762] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498766] INFO: task kworker/0:3:2534 blocked for more than 120 seconds. >> [ 2162.498772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.498777] kworker/0:3 D ffffffff81813a60 0 2534 2 0x00000000 >> [ 2162.498783] Workqueue: events write_dirty_finish >> [ 2162.498785] ffff88237d0bdb30 0000000000000046 ffffffff81511b8c >> ffffffff815137c0 >> [ 2162.498790] ffff882fa66fe640 ffff88237d0bdfd8 ffff88237d0bdfd8 >> ffff88237d0bdfd8 >> [ 2162.498794] ffffffff81c10440 ffff882fa66fe640 ffffffff815117a1 >> ffff882fa66fe640 >> [ 2162.498799] Call Trace: >> [ 2162.498803] [] ? bch_btree_map_nodes_recurse+0x6c/0x170 >> [ 2162.498807] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498812] [] ? bch_btree_node_get+0x71/0x280 >> [ 2162.498817] [] schedule+0x29/0x70 >> [ 2162.498822] [] rwsem_down_read_failed+0x9d/0xe5 >> [ 2162.498826] [] call_rwsem_down_read_failed+0x14/0x30 >> [ 2162.498831] [] ? down_read+0x24/0x2b >> [ 2162.498836] [] __bch_btree_map_nodes+0xe5/0x1c0 >> [ 2162.498840] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498845] [] ? update_curr+0x141/0x1f0 >> [ 2162.498849] [] ? dirty_io_destructor+0xe/0x10 >> [ 2162.498854] [] bch_btree_insert+0xb4/0x120 >> [ 2162.498859] [] write_dirty_finish+0x1e3/0x270 >> [ 2162.498864] [] ? pick_next_task_fair+0x6b/0x150 >> [ 2162.498868] [] ? dequeue_task+0x93/0xb0 >> [ 2162.498873] [] process_one_work+0x174/0x490 >> [ 2162.498878] [] worker_thread+0x11b/0x370 >> [ 2162.498882] [] ? manage_workers.isra.23+0x2d0/0x2d0 >> [ 2162.498886] [] kthread+0xc0/0xd0 >> [ 2162.498890] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498896] [] ret_from_fork+0x7c/0xb0 >> [ 2162.498900] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.498903] INFO: task kworker/0:4:2537 blocked for more than 120 seconds. >> [ 2162.498908] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.498914] kworker/0:4 D 0000000000000000 0 2537 2 0x00000000 >> [ 2162.498920] Workqueue: events write_dirty_finish >> [ 2162.498922] ffff882f76d8fb30 0000000000000046 ffffffff81511b8c >> ffffffff815137c0 >> [ 2162.498926] ffff882f88b30000 ffff882f76d8ffd8 ffff882f76d8ffd8 >> ffff882f76d8ffd8 >> [ 2162.498931] ffff882f78beb320 ffff882f88b30000 ffffffff815117a1 >> ffff882f88b30000 >> [ 2162.498935] Call Trace: >> [ 2162.498940] [] ? bch_btree_map_nodes_recurse+0x6c/0x170 >> [ 2162.498944] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498948] [] ? bch_btree_node_get+0x71/0x280 >> [ 2162.498953] [] schedule+0x29/0x70 >> [ 2162.498958] [] rwsem_down_read_failed+0x9d/0xe5 >> [ 2162.498963] [] call_rwsem_down_read_failed+0x14/0x30 >> [ 2162.498968] [] ? down_read+0x24/0x2b >> [ 2162.498972] [] __bch_btree_map_nodes+0xe5/0x1c0 >> [ 2162.498977] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.498981] [] ? update_curr+0x141/0x1f0 >> [ 2162.498986] [] ? dirty_io_destructor+0xe/0x10 >> [ 2162.498990] [] bch_btree_insert+0xb4/0x120 >> [ 2162.498995] [] write_dirty_finish+0x1e3/0x270 >> [ 2162.499000] [] ? pick_next_task_fair+0x6b/0x150 >> [ 2162.499004] [] ? dequeue_task+0x93/0xb0 >> [ 2162.499009] [] process_one_work+0x174/0x490 >> [ 2162.499014] [] worker_thread+0x11b/0x370 >> [ 2162.499019] [] ? manage_workers.isra.23+0x2d0/0x2d0 >> [ 2162.499023] [] kthread+0xc0/0xd0 >> [ 2162.499027] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.499032] [] ret_from_fork+0x7c/0xb0 >> [ 2162.499036] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.499039] INFO: task kworker/1:1:2544 blocked for more than 120 seconds. >> [ 2162.499044] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.499050] kworker/1:1 D ffffffff81813d40 0 2544 2 0x00000000 >> [ 2162.499057] Workqueue: bcache bch_data_insert_keys >> [ 2162.499059] ffff882fa677b3f8 0000000000000046 ffff882fa677b418 >> ffff882f8a709000 >> [ 2162.499063] ffff882f88b33320 ffff882fa677bfd8 ffff882fa677bfd8 >> ffff882fa677bfd8 >> [ 2162.499068] ffff882fa6471990 ffff882f88b33320 ffff882f88b33320 >> ffff8801fad20d98 >> [ 2162.499072] Call Trace: >> [ 2162.499077] [] schedule+0x29/0x70 >> [ 2162.499082] [] schedule_preempt_disabled+0xe/0x10 >> [ 2162.499087] [] __mutex_lock_slowpath+0x112/0x1b0 >> [ 2162.499092] [] mutex_lock+0x2a/0x50 >> [ 2162.499096] [] bch_mca_shrink+0x1b5/0x2f0 >> [ 2162.499102] [] ? prune_super+0x162/0x1b0 >> [ 2162.499109] [] shrink_slab+0x154/0x300 >> [ 2162.499113] [] ? resched_task+0x68/0x70 >> [ 2162.499118] [] ? check_preempt_curr+0x75/0xa0 >> [ 2162.499126] [] ? fragmentation_index+0x19/0x70 >> [ 2162.499131] [] do_try_to_free_pages+0x20f/0x4b0 >> [ 2162.499137] [] try_to_free_pages+0xe4/0x1a0 >> [ 2162.499143] [] __alloc_pages_nodemask+0x60c/0x9b0 >> [ 2162.499149] [] alloc_pages_current+0xba/0x170 >> [ 2162.499155] [] __get_free_pages+0xe/0x40 >> [ 2162.499159] [] mca_data_alloc+0x73/0x1d0 >> [ 2162.499163] [] mca_alloc+0x277/0x470 >> [ 2162.499168] [] bch_btree_node_alloc+0x8c/0x1c0 >> [ 2162.499173] [] btree_split+0x110/0x5c0 >> [ 2162.499179] [] ? scsi_pool_alloc_command+0x33/0x80 >> [ 2162.499184] [] ? bch_keylist_pop_front+0x47/0x50 >> [ 2162.499188] [] ? bch_btree_insert_keys+0x56/0x250 >> [ 2162.499193] [] ? update_group_power+0x13f/0x220 >> [ 2162.499198] [] bch_btree_insert_node+0xb2/0x2f0 >> [ 2162.499202] [] btree_insert_fn+0x28/0x50 >> [ 2162.499207] [] bch_btree_map_nodes_recurse+0x6c/0x170 >> [ 2162.499211] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.499216] [] ? down_write+0x16/0x40 >> [ 2162.499221] [] ? bch_btree_node_get+0x71/0x280 >> [ 2162.499225] [] bch_btree_map_nodes_recurse+0x110/0x170 >> [ 2162.499230] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.499235] [] ? dio_bio_end_io+0x5a/0x90 >> [ 2162.499239] [] ? update_curr+0x141/0x1f0 >> [ 2162.499244] [] __bch_btree_map_nodes+0x13e/0x1c0 >> [ 2162.499248] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.499254] [] ? bch_journal+0x42f/0x4b0 >> [ 2162.499258] [] bch_btree_insert+0xb4/0x120 >> [ 2162.499264] [] bch_data_insert_keys+0x3e/0x160 >> [ 2162.499268] [] process_one_work+0x174/0x490 >> [ 2162.499273] [] worker_thread+0x11b/0x370 >> [ 2162.499278] [] ? manage_workers.isra.23+0x2d0/0x2d0 >> [ 2162.499282] [] kthread+0xc0/0xd0 >> [ 2162.499286] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.499292] [] ret_from_fork+0x7c/0xb0 >> [ 2162.499296] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.499299] INFO: task kworker/0:5:2552 blocked for more than 120 seconds. >> [ 2162.499304] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >> disables this message. >> [ 2162.499310] kworker/0:5 D 0000000000000000 0 2552 2 0x00000000 >> [ 2162.499411] Workqueue: events write_dirty_finish >> [ 2162.499413] ffff88226314db30 0000000000000046 ffffffff81511b8c >> ffffffff815137c0 >> [ 2162.499418] ffff882fa6781990 ffff88226314dfd8 ffff88226314dfd8 >> ffff88226314dfd8 >> [ 2162.499422] ffff882fa6a84cb0 ffff882fa6781990 ffffffff815117a1 >> ffff882fa6781990 >> [ 2162.499427] Call Trace: >> [ 2162.499431] [] ? bch_btree_map_nodes_recurse+0x6c/0x170 >> [ 2162.499435] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.499440] [] ? bch_btree_node_get+0x71/0x280 >> [ 2162.499445] [] schedule+0x29/0x70 >> [ 2162.499450] [] rwsem_down_read_failed+0x9d/0xe5 >> [ 2162.499455] [] call_rwsem_down_read_failed+0x14/0x30 >> [ 2162.499460] [] ? down_read+0x24/0x2b >> [ 2162.499464] [] __bch_btree_map_nodes+0xe5/0x1c0 >> [ 2162.499468] [] ? bch_btree_insert_node+0x2f0/0x2f0 >> [ 2162.499473] [] ? dirty_io_destructor+0xe/0x10 >> [ 2162.499478] [] bch_btree_insert+0xb4/0x120 >> [ 2162.499483] [] write_dirty_finish+0x1e3/0x270 >> [ 2162.499488] [] ? pick_next_task_fair+0x6b/0x150 >> [ 2162.499492] [] ? dequeue_task+0x93/0xb0 >> [ 2162.499497] [] process_one_work+0x174/0x490 >> [ 2162.499502] [] worker_thread+0x11b/0x370 >> [ 2162.499507] [] ? manage_workers.isra.23+0x2d0/0x2d0 >> [ 2162.499511] [] kthread+0xc0/0xd0 >> [ 2162.499515] [] ? flush_kthread_worker+0xb0/0xb0 >> [ 2162.499520] [] ret_from_fork+0x7c/0xb0 >> [ 2162.499524] [] ? flush_kthread_worker+0xb0/0xb0 >> >> >> >>>> [ 930.169234] bcache: bch_cached_dev_attach() Caching sda3 as bcache0 >>>> on set 06496904-10ca-489a-ae75-68c6a07d3db1 >>>> [ 2522.956188] INFO: task bcache_writebac:2058 blocked for more than >>>> 120 seconds. >>>> [ 2522.956199] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>> disables this message. >>>> [ 2522.956205] bcache_writebac D ffffffff81813a60 0 2058 2 0x00000000 >>>> [ 2522.956212] ffff881c39c1fcd8 0000000000000046 0000000000000001 >>>> 0000000000000001 >>>> [ 2522.956218] ffff882f8b56b320 ffff881c39c1ffd8 ffff881c39c1ffd8 >>>> ffff881c39c1ffd8 >>>> [ 2522.956222] ffff882fa6aeb320 ffff882f8b56b320 ffff882f8b56b320 >>>> ffff882f8b56b320 >>>> [ 2522.956227] Call Trace: >>>> [ 2522.956241] [] schedule+0x29/0x70 >>>> [ 2522.956247] [] rwsem_down_write_failed+0xf5/0x1a0 >>>> [ 2522.956255] [] call_rwsem_down_write_failed+0x13/0x20 >>>> [ 2522.956261] [] ? down_write+0x31/0x40 >>>> [ 2522.956269] [] bch_writeback_thread+0x62/0x7f0 >>>> [ 2522.956278] [] ? idle_balance+0xeb/0x150 >>>> [ 2522.956284] [] ? write_dirty+0xc0/0xc0 >>>> [ 2522.956290] [] kthread+0xc0/0xd0 >>>> [ 2522.956294] [] ? flush_kthread_worker+0xb0/0xb0 >>>> [ 2522.956302] [] ret_from_fork+0x7c/0xb0 >>>> [ 2522.956306] [] ? flush_kthread_worker+0xb0/0xb0 >>>> [ 2522.956313] INFO: task kworker/u64:1:2433 blocked for more than 120 seconds. >>>> [ 2522.956317] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>> disables this message. >>>> [ 2522.956322] kworker/u64:1 D ffffffff81813a60 0 2433 2 0x00000000 >>>> [ 2522.956332] Workqueue: writeback bdi_writeback_workfn (flush-252:0) >>>> [ 2522.956336] ffff882f899d1560 0000000000000046 ffffffff81120635 >>>> 0000000000000100 >>>> [ 2522.956340] ffff88219b488000 ffff882f899d1fd8 ffff882f899d1fd8 >>>> ffff882f899d1fd8 >>>> [ 2522.956345] ffff882fa6aeb320 ffff88219b488000 ffffffff8131ad55 >>>> ffff88219b488000 >>>> [ 2522.956349] Call Trace: >>>> [ 2522.956359] [] ? mempool_alloc_slab+0x15/0x20 >>>> [ 2522.956367] [] ? blk_throtl_bio+0x3b5/0x540 >>>> [ 2522.956372] [] schedule+0x29/0x70 >>>> [ 2522.956378] [] rwsem_down_read_failed+0x9d/0xe5 >>>> [ 2522.956383] [] call_rwsem_down_read_failed+0x14/0x30 >>>> [ 2522.956388] [] ? down_read+0x24/0x2b >>>> [ 2522.956394] [] cached_dev_make_request+0x752/0xf00 >>>> [ 2522.956400] [] ? generic_make_request_checks+0x1e8/0x3a0 >>>> [ 2522.956405] [] generic_make_request+0xca/0x100 >>>> [ 2522.956409] [] submit_bio+0x79/0x160 >>>> [ 2522.956417] [] ? bio_alloc_bioset+0xa0/0x1d0 >>>> [ 2522.956421] [] _submit_bh+0x13f/0x200 >>>> [ 2522.956425] [] submit_bh+0x10/0x20 >>>> [ 2522.956430] [] __block_write_full_page+0x1d8/0x360 >>>> [ 2522.956439] [] ? scsi_request_fn+0xbb/0x530 >>>> [ 2522.956444] [] ? end_buffer_async_read+0x130/0x130 >>>> [ 2522.956449] [] ? I_BDEV+0x10/0x10 >>>> [ 2522.956454] [] ? I_BDEV+0x10/0x10 >>>> [ 2522.956459] [] block_write_full_page_endio+0xca/0x100 >>>> [ 2522.956464] [] block_write_full_page+0x15/0x20 >>>> [ 2522.956469] [] blkdev_writepage+0x18/0x20 >>>> [ 2522.956473] [] __writepage+0x17/0x40 >>>> [ 2522.956477] [] write_cache_pages+0x20e/0x460 >>>> [ 2522.956481] [] ? set_page_dirty_lock+0x60/0x60 >>>> [ 2522.956486] [] ? __set_page_dirty+0x71/0xc0 >>>> [ 2522.956490] [] generic_writepages+0x4a/0x70 >>>> [ 2522.956495] [] do_writepages+0x20/0x40 >>>> [ 2522.956501] [] __writeback_single_inode+0x45/0x280 >>>> [ 2522.956507] [] ? __alloc_pages_nodemask+0x152/0x9b0 >>>> [ 2522.956513] [] writeback_sb_inodes+0x19d/0x3c0 >>>> [ 2522.956518] [] __writeback_inodes_wb+0x9e/0xd0 >>>> [ 2522.956523] [] wb_writeback+0x27b/0x320 >>>> [ 2522.956529] [] ? get_nr_dirty_inodes+0x58/0x80 >>>> [ 2522.956534] [] wb_check_old_data_flush+0x9f/0xb0 >>>> [ 2522.956539] [] wb_do_writeback+0x151/0x1d0 >>>> [ 2522.956547] [] ? set_worker_desc+0x6f/0x80 >>>> [ 2522.956552] [] bdi_writeback_workfn+0x7a/0x200 >>>> [ 2522.956557] [] process_one_work+0x174/0x490 >>>> [ 2522.956562] [] worker_thread+0x11b/0x370 >>>> [ 2522.956567] [] ? manage_workers.isra.23+0x2d0/0x2d0 >>>> [ 2522.956571] [] kthread+0xc0/0xd0 >>>> [ 2522.956575] [] ? flush_kthread_worker+0xb0/0xb0 >>>> [ 2522.956581] [] ret_from_fork+0x7c/0xb0 >>>> [ 2522.956585] [] ? flush_kthread_worker+0xb0/0xb0 >>>> [ 2522.956588] INFO: task iozone:2435 blocked for more than 120 seconds. >>>> [ 2522.956593] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>> disables this message. >>>> [ 2522.956597] iozone D 0000000000000001 0 2435 1545 0x00000000 >>>> [ 2522.956602] ffff882a75671978 0000000000000082 ffff882a75671938 >>>> ffffffff81301d7a >>>> [ 2522.956606] ffff882f88dc1990 ffff882a75671fd8 ffff882a75671fd8 >>>> ffff882a75671fd8 >>>> [ 2522.956611] ffff882f8ec91990 ffff882f88dc1990 ffff882a75671978 >>>> ffff882fbf2539f8 >>>> [ 2522.956615] Call Trace: >>>> [ 2522.956619] [] ? generic_make_request+0xca/0x100 >>>> [ 2522.956625] [] schedule+0x29/0x70 >>>> [ 2522.956630] [] io_schedule+0x8f/0xd0 >>>> [ 2522.956636] [] do_blockdev_direct_IO+0x1a7c/0x1fb0 >>>> [ 2522.956645] [] ? ext2_get_blocks+0xa60/0xa60 [ext2] >>>> [ 2522.956652] [] __blockdev_direct_IO+0x55/0x60 >>>> [ 2522.956658] [] ? ext2_get_blocks+0xa60/0xa60 [ext2] >>>> [ 2522.956662] [] ? update_curr+0x141/0x1f0 >>>> [ 2522.956669] [] ext2_direct_IO+0x79/0xe0 [ext2] >>>> [ 2522.956674] [] ? ext2_get_blocks+0xa60/0xa60 [ext2] >>>> [ 2522.956680] [] ? current_fs_time+0x16/0x60 >>>> [ 2522.956685] [] generic_file_direct_write+0xc6/0x180 >>>> [ 2522.956690] [] __generic_file_aio_write+0x2dd/0x3b0 >>>> [ 2522.956696] [] generic_file_aio_write+0x69/0xd0 >>>> [ 2522.956702] [] do_sync_write+0x7a/0xb0 >>>> [ 2522.956706] [] ? fsnotify+0x1f8/0x2b0 >>>> [ 2522.956712] [] vfs_write+0xce/0x1e0 >>>> [ 2522.956716] [] SyS_write+0x52/0xa0 >>>> [ 2522.956722] [] system_call_fastpath+0x16/0x1b >>>> [ 2522.956726] INFO: task kworker/2:1:2470 blocked for more than 120 seconds. >>>> [ 2522.956730] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>> disables this message. >>>> [ 2522.956735] kworker/2:1 D ffffffff81813d40 0 2470 2 0x00000000 >>>> [ 2522.956743] Workqueue: bcache bch_data_insert_keys >>>> [ 2522.956745] ffff882a5b0c9628 0000000000000046 ffff882a5b0c9648 >>>> 0000004000000000 >>>> [ 2522.956749] ffff882f8ec91990 ffff882a5b0c9fd8 ffff882a5b0c9fd8 >>>> ffff882a5b0c9fd8 >>>> [ 2522.956754] ffff882fa6489990 ffff882f8ec91990 ffff882f8ec91990 >>>> ffff8828dba00d98 >>>> [ 2522.956758] Call Trace: >>>> [ 2522.956763] [] schedule+0x29/0x70 >>>> [ 2522.956769] [] schedule_preempt_disabled+0xe/0x10 >>>> [ 2522.956774] [] __mutex_lock_slowpath+0x112/0x1b0 >>>> [ 2522.956779] [] mutex_lock+0x2a/0x50 >>>> [ 2522.956783] [] bch_mca_shrink+0x1b5/0x2f0 >>>> [ 2522.956791] [] ? prune_super+0x162/0x1b0 >>>> [ 2522.956799] [] shrink_slab+0x154/0x300 >>>> [ 2522.956805] [] ? resched_task+0x68/0x70 >>>> [ 2522.956810] [] ? check_preempt_curr+0x75/0xa0 >>>> [ 2522.956816] [] ? fragmentation_index+0x19/0x70 >>>> [ 2522.956822] [] do_try_to_free_pages+0x20f/0x4b0 >>>> [ 2522.956827] [] try_to_free_pages+0xe4/0x1a0 >>>> [ 2522.956833] [] __alloc_pages_nodemask+0x60c/0x9b0 >>>> [ 2522.956843] [] alloc_pages_current+0xba/0x170 >>>> [ 2522.956848] [] __get_free_pages+0xe/0x40 >>>> [ 2522.956852] [] mca_data_alloc+0x73/0x1d0 >>>> [ 2522.956857] [] mca_bucket_alloc+0x14a/0x1f0 >>>> [ 2522.956861] [] mca_alloc+0x360/0x470 >>>> [ 2522.956866] [] ? __bch_bset_search+0x1d1/0x480 >>>> [ 2522.956871] [] bch_btree_node_get+0x10e/0x280 >>>> [ 2522.956875] [] bch_btree_map_nodes_recurse+0xe2/0x170 >>>> [ 2522.956880] [] ? bch_btree_insert_node+0x2f0/0x2f0 >>>> [ 2522.956886] [] ? dio_bio_end_io+0x5a/0x90 >>>> [ 2522.956890] [] ? update_curr+0x141/0x1f0 >>>> [ 2522.956895] [] __bch_btree_map_nodes+0x13e/0x1c0 >>>> [ 2522.956899] [] ? bch_btree_insert_node+0x2f0/0x2f0 >>>> [ 2522.956905] [] ? bch_journal+0x42f/0x4b0 >>>> [ 2522.956909] [] bch_btree_insert+0xb4/0x120 >>>> [ 2522.956915] [] bch_data_insert_keys+0x3e/0x160 >>>> [ 2522.956920] [] process_one_work+0x174/0x490 >>>> [ 2522.956925] [] worker_thread+0x11b/0x370 >>>> [ 2522.956930] [] ? manage_workers.isra.23+0x2d0/0x2d0 >>>> [ 2522.956933] [] kthread+0xc0/0xd0 >>>> [ 2522.956938] [] ? flush_kthread_worker+0xb0/0xb0 >>>> [ 2522.956943] [] ret_from_fork+0x7c/0xb0 >>>> [ 2522.956947] [] ? flush_kthread_worker+0xb0/0xb0 >>>> [ 2522.956951] INFO: task kworker/3:2:2471 blocked for more than 120 seconds. >>>> [ 2522.956955] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>> disables this message. >>>> [ 2522.956959] kworker/3:2 D ffffffff81813a60 0 2471 2 0x00000000 >>>> [ 2522.956966] Workqueue: events update_writeback_rate >>>> [ 2522.956968] ffff882a5d125cd0 0000000000000046 0000000000000003 >>>> ffff882a5d125d68 >>>> [ 2522.956972] ffff882f8ec94cb0 ffff882a5d125fd8 ffff882a5d125fd8 >>>> ffff882a5d125fd8 >>>> [ 2522.956977] ffff882fa6aeb320 ffff882f8ec94cb0 0000000200000003 >>>> ffff882f8ec94cb0 >>>> [ 2522.956981] Call Trace: >>>> [ 2522.956987] [] schedule+0x29/0x70 >>>> [ 2522.956992] [] rwsem_down_read_failed+0x9d/0xe5 >>>> [ 2522.956997] [] call_rwsem_down_read_failed+0x14/0x30 >>>> [ 2522.957002] [] ? down_read+0x24/0x2b >>>> [ 2522.957007] [] update_writeback_rate+0x30/0x230 >>>> [ 2522.957011] [] process_one_work+0x174/0x490 >>>> [ 2522.957017] [] worker_thread+0x11b/0x370 >>>> [ 2522.957021] [] ? manage_workers.isra.23+0x2d0/0x2d0 >>>> [ 2522.957025] [] kthread+0xc0/0xd0 >>>> [ 2522.957030] [] ? flush_kthread_worker+0xb0/0xb0 >>>> [ 2522.957035] [] ret_from_fork+0x7c/0xb0 >>>> [ 2522.957039] [] ? flush_kthread_worker+0xb0/0xb0 >>>> [ 2522.957048] INFO: task kworker/u64:2:2540 blocked for more than 120 seconds. >>>> [ 2522.957052] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>> disables this message. >>>> [ 2522.957056] kworker/u64:2 D ffffffff81813a60 0 2540 2 0x00000000 >>>> [ 2522.957062] Workqueue: bch_btree_io btree_node_write_work >>>> [ 2522.957064] ffff882a4fdc9778 0000000000000046 ffff882a4fdc9738 >>>> ffff882f86e3be58 >>>> [ 2522.957069] ffff882fa4bc1990 ffff882a4fdc9fd8 ffff882a4fdc9fd8 >>>> ffff882a4fdc9fd8 >>>> [ 2522.957073] ffff882fa6aeb320 ffff882fa4bc1990 ffff882f86e3be00 >>>> ffff8828dba00d98 >>>> [ 2522.957078] Call Trace: >>>> [ 2522.957083] [] schedule+0x29/0x70 >>>> [ 2522.957088] [] schedule_preempt_disabled+0xe/0x10 >>>> [ 2522.957092] [] __mutex_lock_slowpath+0x112/0x1b0 >>>> [ 2522.957102] [] ? ata_scsiop_mode_sense+0x380/0x380 >>>> [ 2522.957107] [] mutex_lock+0x2a/0x50 >>>> [ 2522.957111] [] bch_mca_shrink+0x1b5/0x2f0 >>>> [ 2522.957116] [] ? prune_super+0x162/0x1b0 >>>> [ 2522.957121] [] shrink_slab+0x154/0x300 >>>> [ 2522.957126] [] ? resched_task+0x68/0x70 >>>> [ 2522.957130] [] ? check_preempt_curr+0x75/0xa0 >>>> [ 2522.957135] [] ? fragmentation_index+0x19/0x70 >>>> [ 2522.957140] [] do_try_to_free_pages+0x20f/0x4b0 >>>> [ 2522.957146] [] try_to_free_pages+0xe4/0x1a0 >>>> [ 2522.957152] [] __alloc_pages_nodemask+0x60c/0x9b0 >>>> [ 2522.957158] [] alloc_pages_current+0xba/0x170 >>>> [ 2522.957163] [] __get_free_pages+0xe/0x40 >>>> [ 2522.957168] [] __btree_sort+0x48/0x230 >>>> [ 2522.957173] [] ? __bch_btree_iter_init+0x7c/0xc0 >>>> [ 2522.957178] [] bch_btree_sort_partial+0x101/0x120 >>>> [ 2522.957182] [] ? __btree_node_write_done+0x100/0x100 >>>> [ 2522.957187] [] bch_btree_sort_lazy+0x68/0x90 >>>> [ 2522.957191] [] bch_btree_node_write+0x36a/0x4a0 >>>> [ 2522.957196] [] ? idle_balance+0xeb/0x150 >>>> [ 2522.957201] [] ? pwq_activate_delayed_work+0x4c/0xb0 >>>> [ 2522.957205] [] btree_node_write_work+0x57/0x80 >>>> [ 2522.957210] [] process_one_work+0x174/0x490 >>>> [ 2522.957215] [] worker_thread+0x11b/0x370 >>>> [ 2522.957219] [] ? manage_workers.isra.23+0x2d0/0x2d0 >>>> [ 2522.957223] [] kthread+0xc0/0xd0 >>>> [ 2522.957228] [] ? flush_kthread_worker+0xb0/0xb0 >>>> [ 2522.957233] [] ret_from_fork+0x7c/0xb0 >>>> [ 2522.957237] [] ? flush_kthread_worker+0xb0/0xb0 >>>> >>>> >>>> I can easily reproduce this, please let me know if you require any >>>> info/test any patch. >>>> >>>> Thanks, >>>> Neo >>>> >>>> On Mon, Aug 26, 2013 at 12:21 PM, Stefan Priebe wrote: >>>>> Hi Kent, >>>>> >>>>> a new one under 3.10: >>>>> >>>>> 2013-08-26 21:05:30 INFO: task ceph-osd:8939 blocked for more than 120 >>>>> seconds. >>>>> 2013-08-26 21:05:30 "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>>> disables this message. >>>>> 2013-08-26 21:05:30 ceph-osd D ffffffff8160f760 0 8939 1 0x00000000 >>>>> 2013-08-26 21:05:30 ffff880bd9a61820 0000000000000086 ffff880bd9a61fd8 >>>>> 0000000000012c80 >>>>> 2013-08-26 21:05:30 ffff880bd9a60010 0000000000012c80 0000000000012c80 >>>>> 0000000000012c80 >>>>> 2013-08-26 21:05:30 ffff880bd9a61fd8 0000000000012c80 ffff880bd7abb1c0 >>>>> ffff880c48dee380 >>>>> 2013-08-26 21:05:29 [] xfs_vm_readpages+0x18/0x20 >>>>> 2013-08-26 21:05:29 [] read_pages+0x43/0x100 >>>>> 2013-08-26 21:05:29 [] ? __page_cache_alloc+0x9f/0xc0 >>>>> 2013-08-26 21:05:29 [] >>>>> __do_page_cache_readahead+0x14b/0x160 >>>>> 2013-08-26 21:05:29 [] ra_submit+0x1c/0x20 >>>>> 2013-08-26 21:05:29 [] ondemand_readahead+0x115/0x240 >>>>> 2013-08-26 21:05:29 [] >>>>> page_cache_sync_readahead+0x2e/0x40 >>>>> 2013-08-26 21:05:29 [] T.1003+0x33e/0x430 >>>>> 2013-08-26 21:05:29 [] >>>>> generic_file_aio_read+0xe0/0x220 >>>>> 2013-08-26 21:05:29 [] xfs_file_aio_read+0x15a/0x2a0 >>>>> 2013-08-26 21:05:29 [] do_sync_read+0x7a/0xb0 >>>>> 2013-08-26 21:05:29 [] vfs_read+0xb1/0x130 >>>>> 2013-08-26 21:05:29 [] SyS_pread64+0x97/0xa0 >>>>> 2013-08-26 21:05:29 [] system_call_fastpath+0x16/0x1b >>>>> 2013-08-26 21:05:29 INFO: task ceph-osd:8896 blocked for more than 120 >>>>> seconds. >>>>> 2013-08-26 21:05:29 "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>>> disables this message. >>>>> 2013-08-26 21:05:29 ceph-osd D ffffffff8160f760 0 8896 1 0x00000000 >>>>> 2013-08-26 21:05:29 ffff880bd9a9dc28 0000000000000086 ffff880bd9a9dfd8 >>>>> 0000000000012c80 >>>>> 2013-08-26 21:05:29 ffff880bd9a9c010 0000000000012c80 0000000000012c80 >>>>> 0000000000012c80 >>>>> 2013-08-26 21:05:29 ffff880bd9a9dfd8 0000000000012c80 ffff880bd79298e0 >>>>> ffff880c48dd6380 >>>>> 2013-08-26 21:05:29 Call Trace: >>>>> 2013-08-26 21:05:29 [] ? __lock_page+0x70/0x70 >>>>> 2013-08-26 21:05:28 "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>>> disables this message. >>>>> 2013-08-26 21:05:28 ceph-osd D ffffffff8160f760 0 8798 1 0x00000000 >>>>> 2013-08-26 21:05:28 ffff880bd998b740 0000000000000086 ffff880bd998bfd8 >>>>> 0000000000012c80 >>>>> 2013-08-26 21:05:28 ffff880bd998a010 0000000000012c80 0000000000012c80 >>>>> 0000000000012c80 >>>>> 2013-08-26 21:05:28 ffff880bd998bfd8 0000000000012c80 ffff880bd99c0000 >>>>> ffff880c48dd6380 >>>>> 2013-08-26 21:05:28 Call Trace: >>>>> 2013-08-26 21:05:28 [] schedule+0x24/0x70 >>>>> 2013-08-26 21:05:28 [] >>>>> rwsem_down_read_failed+0x9d/0xe5 >>>>> 2013-08-26 21:05:28 [] >>>>> call_rwsem_down_read_failed+0x14/0x30 >>>>> 2013-08-26 21:05:28 [] ? down_read+0x12/0x20 >>>>> 2013-08-26 21:05:28 [] btree_read_async+0xa2/0x1a0 >>>>> [bcache] >>>>> 2013-08-26 21:05:28 [] closure_queue+0x43/0x60 >>>>> [bcache] >>>>> 2013-08-26 21:05:28 [] T.1050+0x63/0x70 [bcache] >>>>> 2013-08-26 21:05:28 [] >>>>> cached_dev_make_request+0x20a/0x350 [bcache] >>>>> 2013-08-26 21:05:28 [] generic_make_request+0xc2/0x100 >>>>> 2013-08-26 21:05:28 [] submit_bio+0x67/0x130 >>>>> 2013-08-26 21:05:28 [] do_mpage_readpage+0x2a0/0x660 >>>>> 2013-08-26 21:05:28 [] ? >>>>> __inc_zone_page_state+0x2e/0x30 >>>>> 2013-08-26 21:05:28 [] ? >>>>> add_to_page_cache_locked+0x81/0x110 >>>>> 2013-08-26 21:05:28 [] mpage_readpages+0xfa/0x150 >>>>> 2013-08-26 21:05:28 [] ? >>>>> xfs_get_blocks_direct+0x20/0x20 >>>>> 2013-08-26 21:05:28 [] ? >>>>> xfs_get_blocks_direct+0x20/0x20 >>>>> 2013-08-26 21:05:28 [] ? >>>>> __alloc_pages_nodemask+0x123/0x240 >>>>> 2013-08-26 21:05:27 [] ? __close_fd+0x43/0x80 >>>>> 2013-08-26 21:05:27 [] SyS_fdatasync+0xe/0x20 >>>>> 2013-08-26 21:05:27 [] system_call_fastpath+0x16/0x1b >>>>> 2013-08-26 21:05:27 INFO: task ceph-osd:8746 blocked for more than 120 >>>>> seconds. >>>>> 2013-08-26 21:05:27 "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" >>>>> disables this message. >>>>> 2013-08-26 21:05:27 ceph-osd D ffffffff8160f760 0 8746 1 0x00000000 >>>>> 2013-08-26 21:05:27 ffff880c2dde1d38 0000000000000086 ffff880c2dde1fd8 >>>>> 0000000000012c80 >>>>> 2013-08-26 21:05:27 ffff880c2dde0010 0000000000012c80 0000000000012c80 >>>>> 0000000000012c80 >>>>> 2013-08-26 21:05:27 ffff880c2dde1fd8 0000000000012c80 ffff880c239eb1c0 >>>>> ffff880c48dd6380 >>>>> 2013-08-26 21:05:27 Call Trace: >>>>> 2013-08-26 21:05:27 [] schedule+0x24/0x70 >>>>> 2013-08-26 21:05:27 [] schedule_timeout+0x16d/0x200 >>>>> 2013-08-26 21:05:27 [] ? >>>>> __queue_delayed_work+0xaa/0x1a0 >>>>> 2013-08-26 21:05:27 [] ? >>>>> try_to_grab_pending+0x109/0x190 >>>>> 2013-08-26 21:05:27 [] wait_for_completion+0x95/0x110 >>>>> 2013-08-26 21:05:27 [] ? try_to_wake_up+0x2a0/0x2a0 >>>>> 2013-08-26 21:05:27 [] ? bdi_queue_work+0x77/0xc0 >>>>> 2013-08-26 21:05:27 [] >>>>> writeback_inodes_sb_nr+0x83/0xb0 >>>>> 2013-08-26 21:05:27 [] writeback_inodes_sb+0x5a/0x70 >>>>> 2013-08-26 21:05:27 [] __sync_filesystem+0x4a/0x50 >>>>> 2013-08-26 21:05:27 [] sync_filesystem+0x32/0x60 >>>>> 2013-08-26 21:05:27 [] SyS_syncfs+0x50/0x90 >>>>> 2013-08-26 21:05:27 [] system_call_fastpath+0x16/0x1b >>>>> 2013-08-26 21:05:27 INFO: task ceph-osd:8798 blocked for more than 120 >>>>> seconds. >>>>> >>>>> Stefan >>>>> >>>>> Am 22.08.2013 09:32, schrieb Stefan Priebe - Profihost AG: >>>>> >>>>>> great! >>>>>> >>>>>> Everything seems to work fine now! Except read_dirty always going to >>>>>> negative values after a reboot. >>>>>> >>>>>> Stefan >>>>>> >>>>>> Am 22.08.2013 08:02, schrieb Kent Overstreet: >>>>>>> >>>>>>> On Thu, Aug 22, 2013 at 07:59:04AM +0200, Stefan Priebe wrote: >>>>>>>> >>>>>>>> >>>>>>>>> schedule_timeout() is not the same as >>>>>>>>> schedule_timeout_interruptible(). >>>>>>>> >>>>>>>> >>>>>>>> just search and replace? So i can try on my own. >>>>>>> >>>>>>> >>>>>>> The one in read_dirty(), line ~330 >>>>>>> -- >>>>>>> To unsubscribe from this list: send the line "unsubscribe linux-bcache" >>>>>>> in >>>>>>> the body of a message to majordomo@vger.kernel.org >>>>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>>>> -- >>>>> To unsubscribe from this list: send the line "unsubscribe linux-bcache" in >>>>> the body of a message to majordomo@vger.kernel.org >>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-bcache" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/