Message-ID: <017845ae-fbae-70f6-5f9e-29aff2742b8c@dustymabe.com>
Date: Fri, 26 Aug 2022 12:15:22 -0400
To: Jens Axboe,
    linux-block@vger.kernel.org, linux-kernel@vger.kernel.org
Cc: hch@lst.de
From: Dusty Mabe
Subject: regression caused by block: freeze the queue earlier in del_gendisk

Hey All,

I think I've found a regression introduced by:

a09b314 ("block: freeze the queue earlier in del_gendisk")

In Fedora CoreOS we have tests that set up RAID1 on the /boot/ and /root/
partitions and then remove one of the disks to simulate a failure. Recently
this test started timing out occasionally. Looking a bit closer, it appears
instances are getting stuck during reboot with a bunch of looping messages:

```
[ 17.978854] block device autoloading is deprecated and will be removed.
[ 17.982555] block device autoloading is deprecated and will be removed.
[ 17.985537] block device autoloading is deprecated and will be removed.
[ 17.987546] block device autoloading is deprecated and will be removed.
[ 17.989540] block device autoloading is deprecated and will be removed.
[ 17.991547] block device autoloading is deprecated and will be removed.
[ 17.993555] block device autoloading is deprecated and will be removed.
[ 17.995539] block device autoloading is deprecated and will be removed.
[ 17.997577] block device autoloading is deprecated and will be removed.
[ 17.999544] block device autoloading is deprecated and will be removed.
[ 22.979465] blkdev_get_no_open: 1666 callbacks suppressed
...
...
...
[ 618.221270] blkdev_get_no_open: 1664 callbacks suppressed
[ 618.221273] block device autoloading is deprecated and will be removed.
[ 618.224274] block device autoloading is deprecated and will be removed.
[ 618.227267] block device autoloading is deprecated and will be removed.
[ 618.229274] block device autoloading is deprecated and will be removed.
[ 618.231277] block device autoloading is deprecated and will be removed.
[ 618.233277] block device autoloading is deprecated and will be removed.
[ 618.235282] block device autoloading is deprecated and will be removed.
[ 618.237370] block device autoloading is deprecated and will be removed.
[ 618.239356] block device autoloading is deprecated and will be removed.
[ 618.241290] block device autoloading is deprecated and will be removed.
```

Using the Fedora kernels I narrowed it down to being introduced between
`kernel-5.19.0-0.rc3.27.fc37` (good) and `kernel-5.19.0-0.rc4.33.fc37` (bad).

I then did a bisect and found:

```
$ git bisect bad
a09b314005f3a0956ebf56e01b3b80339df577cc is the first bad commit
commit a09b314005f3a0956ebf56e01b3b80339df577cc
Author: Christoph Hellwig
Date:   Tue Jun 14 09:48:27 2022 +0200

    block: freeze the queue earlier in del_gendisk

    Freeze the queue earlier in del_gendisk so that the state does not
    change while we remove debugfs and sysfs files.

    Ming mentioned that being able to observer request in debugfs might
    be useful while the queue is being frozen in del_gendisk, which is
    made possible by this change.

    Signed-off-by: Christoph Hellwig
    Link: https://lore.kernel.org/r/20220614074827.458955-5-hch@lst.de
    Signed-off-by: Jens Axboe

 block/genhd.c | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)
```

Reverting this commit on top of latest git master (4c612826b) gave me
successful results.

Any ideas on what could be amiss here? Luckily the patch is tiny so hopefully
it might be obvious.
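For a rough sense of how hot the loop is, the first burst in the log above can be turned into a call rate. This is only a sketch: the function name `estimate_call_rate` and the `SAMPLE` string are made up for illustration, and it assumes the "callbacks suppressed" count covers the window that began with the first printed message of the burst.

```python
import re

# "[ 17.978854] message" -> (timestamp, message)
LINE_RE = re.compile(r"\[\s*(\d+\.\d+)\]\s+(.*)")
# "blkdev_get_no_open: 1666 callbacks suppressed" -> (function, count)
SUPPRESSED_RE = re.compile(r"(\w+): (\d+) callbacks suppressed")


def estimate_call_rate(log_text):
    """Return (calls, seconds, calls_per_second) for the first burst.

    Assumption: printed + suppressed messages all belong to the window
    that starts at the first printed autoloading message.
    """
    window_start = None
    printed = 0
    for raw in log_text.splitlines():
        match = LINE_RE.match(raw.strip())
        if not match:
            continue  # skip "..." and blank lines
        timestamp, message = float(match.group(1)), match.group(2)
        suppressed = SUPPRESSED_RE.match(message)
        if suppressed and window_start is not None:
            total = printed + int(suppressed.group(2))
            elapsed = timestamp - window_start
            return total, elapsed, total / elapsed
        if "autoloading is deprecated" in message:
            if window_start is None:
                window_start = timestamp
            printed += 1
    return None  # no complete burst found


# First burst copied from the log above.
SAMPLE = """\
[ 17.978854] block device autoloading is deprecated and will be removed.
[ 17.982555] block device autoloading is deprecated and will be removed.
[ 17.985537] block device autoloading is deprecated and will be removed.
[ 17.987546] block device autoloading is deprecated and will be removed.
[ 17.989540] block device autoloading is deprecated and will be removed.
[ 17.991547] block device autoloading is deprecated and will be removed.
[ 17.993555] block device autoloading is deprecated and will be removed.
[ 17.995539] block device autoloading is deprecated and will be removed.
[ 17.997577] block device autoloading is deprecated and will be removed.
[ 17.999544] block device autoloading is deprecated and will be removed.
[ 22.979465] blkdev_get_no_open: 1666 callbacks suppressed
"""

if __name__ == "__main__":
    calls, seconds, rate = estimate_call_rate(SAMPLE)
    print(f"{calls} calls in {seconds:.3f}s, roughly {rate:.0f} calls/s")
```

Under that assumption the first burst works out to 1676 calls in about 5 seconds, so something is retrying the open a few hundred times per second for the whole ten minutes the instance is stuck.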

More details (including logs) in the following locations:

- https://bugzilla.redhat.com/show_bug.cgi?id=2121791
- https://github.com/coreos/fedora-coreos-tracker/issues/1282

Thanks!
Dusty