Date: Mon, 19 Aug 2019 19:46:00 +0200
From: David Sterba
To: Christophe Leroy
Cc: erhard_f@mailbox.org, Chris Mason, Josef Bacik, David Sterba, Andrew Morton, linux-kernel@vger.kernel.org, linuxppc-dev@lists.ozlabs.org, linux-btrfs@vger.kernel.org, linux-mm@kvack.org, stable@vger.kernel.org
Subject: Re: [PATCH] btrfs: fix allocation of bitmap pages.
Message-ID: <20190819174600.GN24086@twin.jikos.cz>
Reply-To: dsterba@suse.cz
Mail-Followup-To: dsterba@suse.cz, Christophe Leroy, erhard_f@mailbox.org, Chris Mason, Josef Bacik, David Sterba, Andrew Morton, linux-kernel@vger.kernel.org, linuxppc-dev@lists.ozlabs.org, linux-btrfs@vger.kernel.org, linux-mm@kvack.org, stable@vger.kernel.org
References: <20190817074439.84C6C1056A3@localhost.localdomain>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20190817074439.84C6C1056A3@localhost.localdomain>
User-Agent: Mutt/1.5.23.1 (2014-03-12)
Sender: linux-kernel-owner@vger.kernel.org
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org

On Sat, Aug 17, 2019 at 07:44:39AM +0000, Christophe Leroy wrote:
> Various notifications of type "BUG kmalloc-4096 () : Redzone
> overwritten" have been observed recently in various parts of
> the kernel. After some time, it has been made a relation with
> the use of BTRFS filesystem.
>
> [ 22.809700] BUG kmalloc-4096 (Tainted: G W ): Redzone overwritten
> [ 22.809971] -----------------------------------------------------------------------------
>
> [ 22.810286] INFO: 0xbe1a5921-0xfbfc06cd. First byte 0x0 instead of 0xcc
> [ 22.810866] INFO: Allocated in __load_free_space_cache+0x588/0x780 [btrfs] age=22 cpu=0 pid=224
> [ 22.811193] __slab_alloc.constprop.26+0x44/0x70
> [ 22.811345] kmem_cache_alloc_trace+0xf0/0x2ec
> [ 22.811588] __load_free_space_cache+0x588/0x780 [btrfs]
> [ 22.811848] load_free_space_cache+0xf4/0x1b0 [btrfs]
> [ 22.812090] cache_block_group+0x1d0/0x3d0 [btrfs]
> [ 22.812321] find_free_extent+0x680/0x12a4 [btrfs]
> [ 22.812549] btrfs_reserve_extent+0xec/0x220 [btrfs]
> [ 22.812785] btrfs_alloc_tree_block+0x178/0x5f4 [btrfs]
> [ 22.813032] __btrfs_cow_block+0x150/0x5d4 [btrfs]
> [ 22.813262] btrfs_cow_block+0x194/0x298 [btrfs]
> [ 22.813484] commit_cowonly_roots+0x44/0x294 [btrfs]
> [ 22.813718] btrfs_commit_transaction+0x63c/0xc0c [btrfs]
> [ 22.813973] close_ctree+0xf8/0x2a4 [btrfs]
> [ 22.814107] generic_shutdown_super+0x80/0x110
> [ 22.814250] kill_anon_super+0x18/0x30
> [ 22.814437] btrfs_kill_super+0x18/0x90 [btrfs]
> [ 22.814590] INFO: Freed in proc_cgroup_show+0xc0/0x248 age=41 cpu=0 pid=83
> [ 22.814841] proc_cgroup_show+0xc0/0x248
> [ 22.814967] proc_single_show+0x54/0x98
> [ 22.815086] seq_read+0x278/0x45c
> [ 22.815190] __vfs_read+0x28/0x17c
> [ 22.815289] vfs_read+0xa8/0x14c
> [ 22.815381] ksys_read+0x50/0x94
> [ 22.815475] ret_from_syscall+0x0/0x38
>
> Commit 69d2480456d1 ("btrfs: use copy_page for copying pages instead
> of memcpy") changed the way bitmap blocks are copied. But although
> bitmaps have the size of a page, they were allocated with kzalloc().
>
> Most of the time, kzalloc() allocates aligned blocks of memory, so
> copy_page() can be used. But when some debug options like SLAB_DEBUG
> are activated, kzalloc() may return unaligned pointer.
>
> On powerpc, memcpy(), copy_page() and other copying functions use
> 'dcbz' instruction which provides an entire zeroed cacheline to avoid
> memory read when the intention is to overwrite a full line.
> Functions like memcpy() are written to care about partial cachelines at the
> start and end of the destination, but copy_page() assumes it gets pages.

This assumption is not documented, nor are any pitfalls mentioned, in
include/asm-generic/page.h, which provides the generic implementation. As an
API user I cannot check each arch implementation for additional
constraints, and I would expect copy_page() to deal with the boundary cases
the same way as the arch-specific memcpy implementations do.

Another thing that is lost is the slub debugging support for all
architectures, because get_zeroed_page() lacks the red zones and sanity
checks.

I find working with raw pages in this code a bit inconsistent with the
rest of btrfs code, but that's rather minor compared to the above.

Summing it up, I think that the proper fix should go to the copy_page
implementation on architectures that require it, or the copy_page
constraints should be made clear.
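
To make the constraint concrete, here is a minimal userspace sketch, not
kernel code. All sketch_* names and the 4096-byte page size are hypothetical
and only illustrate the idea: one variant assumes page-aligned pointers the
way an arch copy_page() might, the other tolerates an unaligned buffer (such
as one shifted by a debug red zone) by falling back to memcpy().

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Assumed page size for the sketch; real kernels use PAGE_SIZE. */
#define SKETCH_PAGE_SIZE 4096u

/* True when p sits exactly on a page boundary. */
static bool sketch_page_aligned(const void *p)
{
	return ((uintptr_t)p & (SKETCH_PAGE_SIZE - 1)) == 0;
}

/*
 * Stand-in for an arch copy_page() that assumes whole, aligned pages.
 * The assert spells out the otherwise undocumented constraint.
 */
static void sketch_copy_page(void *to, const void *from)
{
	assert(sketch_page_aligned(to) && sketch_page_aligned(from));
	memcpy(to, from, SKETCH_PAGE_SIZE);
}

/*
 * Boundary-tolerant variant: uses the page copy only when both pointers
 * are aligned, otherwise copies one page worth of bytes with memcpy().
 */
static void sketch_copy_page_safe(void *to, const void *from)
{
	if (sketch_page_aligned(to) && sketch_page_aligned(from))
		sketch_copy_page(to, from);
	else
		memcpy(to, from, SKETCH_PAGE_SIZE);
}
```

With a destination that is offset from a page boundary,
sketch_copy_page_safe() writes exactly one page worth of bytes and leaves
the bytes on either side untouched, which is the behaviour I would expect
from an API that does not state its alignment requirements.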