Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933793AbcKJOff (ORCPT ); Thu, 10 Nov 2016 09:35:35 -0500 Received: from arcturus.aphlor.org ([188.246.204.175]:48362 "EHLO arcturus.aphlor.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933033AbcKJOfd (ORCPT ); Thu, 10 Nov 2016 09:35:33 -0500 Date: Thu, 10 Nov 2016 09:35:20 -0500 From: Dave Jones To: Chris Mason Cc: Linus Torvalds , Jens Axboe , Andy Lutomirski , Andy Lutomirski , Al Viro , Josef Bacik , David Sterba , linux-btrfs , Linux Kernel , Dave Chinner Subject: Re: btrfs btree_ctree_super fault Message-ID: <20161110143520.6fdu34afv45ui3y6@codemonkey.org.uk> Mail-Followup-To: Dave Jones , Chris Mason , Linus Torvalds , Jens Axboe , Andy Lutomirski , Andy Lutomirski , Al Viro , Josef Bacik , David Sterba , linux-btrfs , Linux Kernel , Dave Chinner References: <203e0319-bc9b-245c-e162-709267540d22@fb.com> <20161026233808.GC15247@clm-mbp.thefacebook.com> <20161026234751.e66xyzjiwifvbuha@codemonkey.org.uk> <20161031185514.b22zvbxvga4xcinz@codemonkey.org.uk> <20161031194454.GA49877@clm-mbp.thefacebook.com> <20161106165539.ybwm6rqvzh2k6uja@codemonkey.org.uk> <20161108145912.fcjvwxcpqgd7kjei@codemonkey.org.uk> <01d76d90-8d90-e09b-40a0-63488425348d@fb.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <01d76d90-8d90-e09b-40a0-63488425348d@fb.com> User-Agent: NeoMutt/20161104 (1.7.1) X-Spam-Flag: skipped (authorised relay user) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3598 Lines: 75 On Tue, Nov 08, 2016 at 10:08:04AM -0500, Chris Mason wrote: > > And another new one: > > > > kernel BUG at fs/btrfs/ctree.c:3172! > > > > Call Trace: > > [] __btrfs_drop_extents+0xb00/0xe30 [btrfs] > > We've been hunting this one for at least two years. It's the white > whale of btrfs bugs. Josef has a semi-reliable reproducer now, but I > think it's not the same as the pagevec based problems you reported earlier. Great, now for whatever reason, I'm hitting this over and over. Even better, after the last time I hit it, it reboot and this happened during boot.. BTRFS info (device sda6): disk space caching is enabled BTRFS info (device sda6): has skinny extents BTRFS info (device sda3): disk space caching is enabled ------------[ cut here ]------------ WARNING: CPU: 1 PID: 443 at fs/btrfs/file.c:546 btrfs_drop_extent_cache+0x411/0x420 [btrfs] CPU: 1 PID: 443 Comm: mount Not tainted 4.9.0-rc4-think+ #1 ffffc90000c4b468 ffffffff813b66bc 0000000000000000 0000000000000000 ffffc90000c4b4a8 ffffffff81086d2b 0000022200c4b488 000000000002f265 40c8dded1afd6000 ffff8804ff5cddc8 ffff8804ef26f2b8 40c8dded1afd5000 Call Trace: [] dump_stack+0x4f/0x73 [] __warn+0xcb/0xf0 [] warn_slowpath_null+0x1d/0x20 [] btrfs_drop_extent_cache+0x411/0x420 [btrfs] [] ? alloc_debug_processing+0x73/0x1b0 [] __btrfs_drop_extents+0x44f/0xe30 [btrfs] [] ? btrfs_alloc_path+0x1a/0x20 [btrfs] [] ? btrfs_alloc_path+0x1a/0x20 [btrfs] [] ? kmem_cache_alloc+0x2aa/0x330 [] ? btrfs_alloc_path+0x1a/0x20 [btrfs] [] btrfs_drop_extents+0x79/0xa0 [btrfs] [] replay_one_extent+0x1e1/0x710 [btrfs] [] replay_one_buffer+0x26d/0x7e0 [btrfs] [] ? ___slab_alloc.constprop.83+0x27c/0x5c0 [] ? btrfs_alloc_path+0x1a/0x20 [btrfs] [] ? debug_smp_processor_id+0x17/0x20 [] walk_up_log_tree+0xeb/0x240 [btrfs] [] walk_log_tree+0xa6/0x1d0 [btrfs] [] btrfs_recover_log_trees+0x1dc/0x460 [btrfs] [] ? replay_one_extent+0x710/0x710 [btrfs] [] open_ctree+0x2575/0x2670 [btrfs] [] btrfs_mount+0xd0b/0xe10 [btrfs] [] ? pcpu_alloc+0x2d4/0x660 [] ? lockdep_init_map+0x61/0x200 [] ? __init_waitqueue_head+0x3b/0x50 [] mount_fs+0x14/0xa0 [] vfs_kern_mount+0x6b/0x150 [] btrfs_mount+0x2c8/0xe10 [btrfs] [] ? pcpu_alloc+0x2d4/0x660 [] ? lockdep_init_map+0x61/0x200 [] ? lockdep_init_map+0x61/0x200 [] ? __init_waitqueue_head+0x3b/0x50 [] mount_fs+0x14/0xa0 [] vfs_kern_mount+0x6b/0x150 [] do_mount+0x1c2/0xda0 [] ? memdup_user+0x60/0x90 [] SyS_mount+0x83/0xd0 [] do_syscall_64+0x61/0x170 [] entry_SYSCALL64_slow_path+0x25/0x25 ---[ end trace d3fa03bb9c115bbe ]--- BTRFS: error (device sda3) in btrfs_replay_log:2491: errno=-17 Object already exists (Failed to recover log tree) BTRFS error (device sda3): cleaner transaction attach returned -30 BTRFS error (device sda3): open_ctree failed Guess I'll hit it with btrfsck and hope for the best.. Dave