Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp1952971iob; Thu, 19 May 2022 20:26:40 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwaxkXYPhYjWCqsH+sXa5qL6aYEaC5AjhJMQWcmFpJiu0uI6EEyKfdswkkHDhIJMn7N5yDh X-Received: by 2002:a05:6a02:106:b0:36c:96dd:8c17 with SMTP id bg6-20020a056a02010600b0036c96dd8c17mr6889887pgb.190.1653017199978; Thu, 19 May 2022 20:26:39 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1653017199; cv=none; d=google.com; s=arc-20160816; b=X35oQnLdB/2HUKXckmPSYdSBJJTN6B5a8Q1L8l/Ll51YLG4Aznj5euy1m2rn7G+CoI PoO3IMTepCVacpCvsm+Pj+t58loKxLa80sPg6eDyttHKKL/EDjFMZ8y632kqHKuqeKNh 51a+mBcUusOEe7YaTlF+RxPKN0G+nGrsBVfz8Zo+4KYkuBrECDTE9sgs5Ru5jSq1Qd8+ fwnz53DzECQGWd1S+ANZA9Dw0WGBdUGaKlm0IpAHl7dxc7aDf8XrsMGPe3eunNMigexN 3+n2KH7vH5RJCifzsof++G1bsmd9BGP+TM+diqbwYJSqb4Ke4o6Bobz/6xdLZoOAjC/M 2CMQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=H/GxLQaB7GezXLedyohaAuk7CJEHblBrWxweQUPeBzs=; b=BtIErNQBJ5oA/c3wI31KtuDi+wmVHqRGuEARlW+52qmwbJtldS67lMB6fsexHcOmJ6 +/6cjx6c9trUnvcqOc+aHRcSL0co/06o/2FJO2OzDM7LVnpGtsN7/tl5jude2ndlH8Aa qIcY9HrEQ7gM9E7d4xHi0rB51yqMeFAgrR0Ex/c4tLqcTFsSXh4GTG4s8QhIttrYmdSJ UJatW/Q0MXe5/Pfi7Ut5lnWAn5SclMbzcNPHECx/9HLA8nIS3HfKeobnu+aHuz8/OeTP gRJ6JDoXYhf51i7J5o/vAkx53Xzqg28jXAFZfSnzB0zyJa32mXa6S0eL+83E7ysi353Q L6WQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel-dk.20210112.gappssmtp.com header.s=20210112 header.b=eryMaoyV; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id a21-20020a631a15000000b003f276bebf30si8682983pga.393.2022.05.19.20.26.25; Thu, 19 May 2022 20:26:39 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel-dk.20210112.gappssmtp.com header.s=20210112 header.b=eryMaoyV; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230117AbiESMa1 (ORCPT + 99 others); Thu, 19 May 2022 08:30:27 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44264 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230213AbiESMaY (ORCPT ); Thu, 19 May 2022 08:30:24 -0400 Received: from mail-pf1-x42c.google.com (mail-pf1-x42c.google.com [IPv6:2607:f8b0:4864:20::42c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5848CBA57C for ; Thu, 19 May 2022 05:30:23 -0700 (PDT) Received: by mail-pf1-x42c.google.com with SMTP id v11so5028775pff.6 for ; Thu, 19 May 2022 05:30:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kernel-dk.20210112.gappssmtp.com; s=20210112; h=message-id:date:mime-version:user-agent:subject:content-language:to :cc:references:from:in-reply-to:content-transfer-encoding; bh=H/GxLQaB7GezXLedyohaAuk7CJEHblBrWxweQUPeBzs=; b=eryMaoyVK6JbYQ3PjtfoZF0MUDDkQtXC03BMyUCFiKHn8IZ50uHRroCWYqVjhWC8/F hLrREtxLxevnRTZ9hRF2IRqngVvN6F8fIeW2N99YdY/pV+wq3WWeNRj/PGWcUzsVlwAA 6ZS9nQRLp+JVPBIC1yMBKZnd2wseOgVbw0wHluYWNRI8NAuyholwUKGTdTRip7gfciXE 3DwAfbyP5XAffOh1N//V87st+ZiE/b5NAgSWLR4Zz6zPeCKvXzZI4+e2tGh8/cgkKZgN fvEXesW9mTc5R7wEW9MrX/BDFe88syyt9jCqOj8/+ZWD4b3dPDQ1ciPVNTGmreGv8jSR 4Vfw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:message-id:date:mime-version:user-agent:subject :content-language:to:cc:references:from:in-reply-to :content-transfer-encoding; bh=H/GxLQaB7GezXLedyohaAuk7CJEHblBrWxweQUPeBzs=; b=2pAt8bwc5GJZgTDr67IxZQx6+GVXQK/piXGOJb6vVuYCgPOOMth6mlpfoSEtdleIzc lAt9dnaSEXsL/zcPi4sQ3IidlrGYHLKb2niDYYRAMAwezA9GGHIJWB3ZXEhDq8EnBbtr kIZSuxTAAYhh4LYqNb886BTD1I3eGas1wdQ6tkUpYK5dsbp3Hqf5+PVVlSsYVvvzfLcA blfvp0laY1E5JxYIKf4besHAAMxSAgamAQNh3yzHhrVdmhEQTIcjlHG6r8x3HS70kVDX 02RhUydDVQS5cOxanWdbfYlon7d8SdpOAraUUBeL/8nGo2JVEk3KcBGWFcJxjL52GGHw HyzA== X-Gm-Message-State: AOAM532c+YHXCFWkBxXQL/PuXO0uyb3HCbvWS1HXGb4Hiwpomh29HzxE oVFfitlfcidTqQo9vhI01ivG/g== X-Received: by 2002:aa7:8149:0:b0:518:f2e:220d with SMTP id d9-20020aa78149000000b005180f2e220dmr4456864pfn.65.1652963422725; Thu, 19 May 2022 05:30:22 -0700 (PDT) Received: from [192.168.1.100] ([198.8.77.157]) by smtp.gmail.com with ESMTPSA id g23-20020a170902d5d700b00161a8886635sm3559966plh.286.2022.05.19.05.30.21 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Thu, 19 May 2022 05:30:21 -0700 (PDT) Message-ID: Date: Thu, 19 May 2022 06:30:20 -0600 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux aarch64; rv:91.0) Gecko/20100101 Thunderbird/91.8.1 Subject: =?UTF-8?Q?Re=3a_=5bPATCH_v3_1/1=5d_fs-writeback=3a_writeback=5fsb?= =?UTF-8?Q?=5finodes=ef=bc=9aRecalculate_=27wrote=27_according_skipped_pages?= Content-Language: en-US To: Zhihao Cheng , hch@lst.de, torvalds@linux-foundation.org, mingo@redhat.com, viro@zeniv.linux.org.uk Cc: linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, yukuai3@huawei.com References: <20220510133805.1988292-1-chengzhihao1@huawei.com> From: Jens Axboe In-Reply-To: <20220510133805.1988292-1-chengzhihao1@huawei.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,NICE_REPLY_A,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 5/10/22 7:38 AM, Zhihao Cheng wrote: > Commit 505a666ee3fc ("writeback: plug writeback in wb_writeback() and > writeback_inodes_wb()") has us holding a plug during wb_writeback, which > may cause a potential ABBA dead lock: > > wb_writeback fat_file_fsync > blk_start_plug(&plug) > for (;;) { > iter i-1: some reqs have been added into plug->mq_list // LOCK A > iter i: > progress = __writeback_inodes_wb(wb, work) > . writeback_sb_inodes // fat's bdev > . __writeback_single_inode > . . generic_writepages > . . __block_write_full_page > . . . . __generic_file_fsync > . . . . sync_inode_metadata > . . . . writeback_single_inode > . . . . __writeback_single_inode > . . . . fat_write_inode > . . . . __fat_write_inode > . . . . sync_dirty_buffer // fat's bdev > . . . . lock_buffer(bh) // LOCK B > . . . . submit_bh > . . . . blk_mq_get_tag // LOCK A > . . . trylock_buffer(bh) // LOCK B > . . . redirty_page_for_writepage > . . . wbc->pages_skipped++ > . . --wbc->nr_to_write > . wrote += write_chunk - wbc.nr_to_write // wrote > 0 > . requeue_inode > . redirty_tail_locked > if (progress) // progress > 0 > continue; > iter i+1: > queue_io > // similar process with iter i, infinite for-loop ! > } > blk_finish_plug(&plug) // flush plug won't be called > > Above process triggers a hungtask like: > [ 399.044861] INFO: task bb:2607 blocked for more than 30 seconds. > [ 399.046824] Not tainted 5.18.0-rc1-00005-gefae4d9eb6a2-dirty > [ 399.051539] task:bb state:D stack: 0 pid: 2607 ppid: > 2426 flags:0x00004000 > [ 399.051556] Call Trace: > [ 399.051570] __schedule+0x480/0x1050 > [ 399.051592] schedule+0x92/0x1a0 > [ 399.051602] io_schedule+0x22/0x50 > [ 399.051613] blk_mq_get_tag+0x1d3/0x3c0 > [ 399.051640] __blk_mq_alloc_requests+0x21d/0x3f0 > [ 399.051657] blk_mq_submit_bio+0x68d/0xca0 > [ 399.051674] __submit_bio+0x1b5/0x2d0 > [ 399.051708] submit_bio_noacct+0x34e/0x720 > [ 399.051718] submit_bio+0x3b/0x150 > [ 399.051725] submit_bh_wbc+0x161/0x230 > [ 399.051734] __sync_dirty_buffer+0xd1/0x420 > [ 399.051744] sync_dirty_buffer+0x17/0x20 > [ 399.051750] __fat_write_inode+0x289/0x310 > [ 399.051766] fat_write_inode+0x2a/0xa0 > [ 399.051783] __writeback_single_inode+0x53c/0x6f0 > [ 399.051795] writeback_single_inode+0x145/0x200 > [ 399.051803] sync_inode_metadata+0x45/0x70 > [ 399.051856] __generic_file_fsync+0xa3/0x150 > [ 399.051880] fat_file_fsync+0x1d/0x80 > [ 399.051895] vfs_fsync_range+0x40/0xb0 > [ 399.051929] __x64_sys_fsync+0x18/0x30 > > In my test, 'need_resched()' (which is imported by 590dca3a71 "fs-writeback: > unplug before cond_resched in writeback_sb_inodes") in function > 'writeback_sb_inodes()' seldom comes true, unless cond_resched() is deleted > from write_cache_pages(). > > Fix it by correcting wrote number according number of skipped pages > in writeback_sb_inodes(). > > Goto Link to find a reproducer. I can take this one for 5.19, thanks. -- Jens Axboe