Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp5827946ybv; Wed, 12 Feb 2020 00:48:03 -0800 (PST) X-Google-Smtp-Source: APXvYqzRuXbgejZeAve8V36lleUwY3dc2GBFqKvD5Kd9jyRc5wtbhFEpQFIUVfrClaqD1J2RyZsX X-Received: by 2002:aca:54cc:: with SMTP id i195mr5438504oib.126.1581497283183; Wed, 12 Feb 2020 00:48:03 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1581497283; cv=none; d=google.com; s=arc-20160816; b=KVEd4ZSMgDgEtkh+LBGbPltJv0/8XxNWUk4sQ5gslqVZJDnYScP1zkR45Yy8pd701k XRrOsOtLt0B+UHZuA8Ca5xG5ZgsAub/elTjrnF0MfiWL8klZuqOBFRlDygSc5O5l1hIY tPsNFwBh67Q9/n7d8jpG7FVvr67chtXMU91/lhJnnvBUiHmmgFJ5vQBCbFXoMa1O2MxR MOqWabGkFLwR9pRqmmh+MOfx2il3rJm4CNBf1Q27uC6lEoN6cqmagzZ5tutQAs6XX2L8 HL0ZxiMnA7tYlzH02nIArzgIwqwDYHQ5LcVAbKO0eBMIVVDf71nPZ+50YxD5DUb8+t4d rpNw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=PJnU6KLabGR3PmoLsacou67qOKm9yLXeqiQnNl6k70I=; b=h9w5p2RHIUycTqveagkOaSBU+R9pZX3OURbc9r85cR30yL8nuI+86A4KyYkBJ9BUnU A2usN0j0EjIx3gcu4ChoqYjCV9Y9Un/oCyUyuEZlVJINf1WZ2v6ySvApXojln3T7fPsA sdQQroKuCrEcOiqN8MU2OFXSpH563o2hIJ7VkN8nckvf/t0o/5NxbssBVN0b6Hm185YP sNsgFTctBDoiUbYY9laGVAC1dKJTe30oX7Y0EY4Tm2d/kzhfIihKnLyNl69ULnn+PgFt KQGjXVxFABJKYE+40ct+R3FLauuGcNjW2bCavA5JDNB5M5LZqIps1TjSi0CB6WsePXl3 stBg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e77si3432685ote.153.2020.02.12.00.47.52; Wed, 12 Feb 2020 00:48:03 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728516AbgBLIro (ORCPT + 99 others); Wed, 12 Feb 2020 03:47:44 -0500 Received: from mx2.suse.de ([195.135.220.15]:60894 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728544AbgBLIro (ORCPT ); Wed, 12 Feb 2020 03:47:44 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id 0EFA8AC9D; Wed, 12 Feb 2020 08:47:42 +0000 (UTC) Received: by quack2.suse.cz (Postfix, from userid 1000) id 04FC01E0E01; Wed, 12 Feb 2020 09:47:41 +0100 (CET) Date: Wed, 12 Feb 2020 09:47:40 +0100 From: Jan Kara To: "zhangyi (F)" Cc: Jan Kara , tytso@mit.edu, linux-ext4@vger.kernel.org, luoshijie1@huawei.com, zhangxiaoxu5@huawei.com Subject: Re: [PATCH 2/2] jbd2: do not clear the BH_Mapped flag when forgetting a metadata buffer Message-ID: <20200212084740.GB25573@quack2.suse.cz> References: <20200203140458.37397-1-yi.zhang@huawei.com> <20200203140458.37397-3-yi.zhang@huawei.com> <20200206114647.GB3994@quack2.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-ext4-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org On Tue 11-02-20 14:51:10, zhangyi (F) wrote: > On 2020/2/6 19:46, Jan Kara wrote: > > On Mon 03-02-20 22:04:58, zhangyi (F) wrote: > >> Commit 904cdbd41d74 ("jbd2: clear dirty flag when revoking a buffer from > >> an older transaction") set the BH_Freed flag when forgetting a metadata > >> buffer which belongs to the committing transaction, it indicate the > >> committing process clear dirty bits when it is done with the buffer. But > >> it also clear the BH_Mapped flag at the same time, which may trigger > >> below NULL pointer oops when block_size < PAGE_SIZE. > >> > >> rmdir 1 kjournald2 mkdir 2 > >> jbd2_journal_commit_transaction > >> commit transaction N > >> jbd2_journal_forget > >> set_buffer_freed(bh1) > >> jbd2_journal_commit_transaction > >> commit transaction N+1 > >> ... > >> clear_buffer_mapped(bh1) > >> ext4_getblk(bh2 ummapped) > >> ... > >> grow_dev_page > >> init_page_buffers > >> bh1->b_private=NULL > >> bh2->b_private=NULL > >> jbd2_journal_put_journal_head(jh1) > >> __journal_remove_journal_head(hb1) > >> jh1 is NULL and trigger oops > >> > >> *) Dir entry block bh1 and bh2 belongs to one page, and the bh2 has > >> already been unmapped. > >> > >> For the metadata buffer we forgetting, clear the dirty flags is enough, > >> so this patch add BH_Unmap flag for the journal_unmap_buffer() case and > >> keep the mapped flag for the metadata buffer. > >> > >> Fixes: 904cdbd41d74 ("jbd2: clear dirty flag when revoking a buffer from an older transaction") > >> Signed-off-by: zhangyi (F) > [..] > > > > Also rather than introducing this new buffer_unmap bit, I'd use the fact > > this special treatment is needed only for buffers coming from the block device > > mapping. And we can check for that like: > > > > /* > > * We can (and need to) unmap buffer only for normal mappings. > > * Block device buffers need to stay mapped all the time. > > * We need to be careful about the check because the page > > * mapping can get cleared under our hands. > > */ > > mapping = READ_ONCE(bh->b_page->mapping); > > if (mapping && !sb_is_blkdev_sb(mapping->host->i_sb)) { > > ... > > } > > Think about it again, it may missing clearing of mapped flag if 'mapping' > of journalled data page was cleared, and finally trigger exception if > we reuse the buffer again. So I think it should be: > > if (!(mapping && sb_is_blkdev_sb(mapping->host->i_sb))) { > ... > } Well, if b_page->mapping got cleared, it means the page got fully truncated and in such case buffers can never be reused - the page and buffers will be freed once we are done with them. So what you are concerned about cannot happen. But you're right it is good to explain this in the comment. Honza -- Jan Kara SUSE Labs, CR