Received: by 2002:a05:6a10:a841:0:0:0:0 with SMTP id d1csp618720pxy; Wed, 28 Apr 2021 10:35:16 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyUI6B8PD4xhJhXRycv1ihqJyaRVdvhmcTPbwjL/YUf38D5ZUVw3TdHi9s+jaa6so5rG18Y X-Received: by 2002:a17:906:a10e:: with SMTP id t14mr30186877ejy.103.1619631315294; Wed, 28 Apr 2021 10:35:15 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1619631315; cv=none; d=google.com; s=arc-20160816; b=g/36g3Npdc8Mq3EZKzhUwQydj4ZPMZOZdfObrQYamPeZ7LB390aDkSzYpcOX7E2TE3 5+rEM0E8GfzarhSN9PFoRhL1V1EoEYPcfHbRgbgrc92CEQO2D5BPMc4FgHIKssnUk12G LbTrqSUce3aJAZtItWiA5iwsHybVUiTBozajwyNFzdeI6uI3w3qkSvwq/7AB8wCgQgiV Xw+7OZq7lLGlDugdmjxiGaCiRQaVwfMEPw7ON+YY1Tn0HGE85fZB1QLESZ9iC377QMEl cwT2GB7I2O2+Swx9tJUa1TfCEWjwzDtxxblflB8MnZQtqaElsTOLlzwQHITLM4F6Spg5 fl4A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=vApQDLBUW+cngBPQzIa6nFbj0v2dudA0q2YvFfhKQFE=; b=oqJagr/Lr7DgQwx3172A4lXh360Gt4tCNQ2idHPLdoZwTxe7H/QL2BS8AEThFFiGqw Zg/SwMgLk/ucFP0pVX5DAXfOMhnhYYSMSDDGXsJkfHjBC/uJ2nFupaaav1qsxqnHoAqr UAiG2d2dHS2ucfReOU5DcCr5+FgjE02T9pqfOPAwnZcVSJPyjSYfNgpNg1NuU77+Opxo iQguSuDS0IOQMiRm10Cghm6nwzyT4aP5Y6dW4FaE8mIHL3pmKL1HYLRrqRZwuWvitfeC z/oR16vefVUa8VASLzN4yLMeHmAvwSe58Ful1SoJfblIq8HcvYGRo8mbvCv/GncmG0D6 9XQA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id c2si579238ejx.595.2021.04.28.10.34.51; Wed, 28 Apr 2021 10:35:15 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S239255AbhD1OEY (ORCPT + 99 others); Wed, 28 Apr 2021 10:04:24 -0400 Received: from outgoing-auth-1.mit.edu ([18.9.28.11]:34416 "EHLO outgoing.mit.edu" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S230057AbhD1OEW (ORCPT ); Wed, 28 Apr 2021 10:04:22 -0400 Received: from cwcc.thunk.org (pool-72-74-133-215.bstnma.fios.verizon.net [72.74.133.215]) (authenticated bits=0) (User authenticated as tytso@ATHENA.MIT.EDU) by outgoing.mit.edu (8.14.7/8.12.4) with ESMTP id 13SE3HW3018990 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Wed, 28 Apr 2021 10:03:17 -0400 Received: by cwcc.thunk.org (Postfix, from userid 15806) id DB1D515C3C3D; Wed, 28 Apr 2021 10:03:16 -0400 (EDT) Date: Wed, 28 Apr 2021 10:03:16 -0400 From: "Theodore Ts'o" To: kernel test robot Cc: Harshad Shirwadkar , LKML , Linux Memory Management List , lkp@lists.01.org, lkp@intel.com, dm-devel@redhat.com Subject: Re: [ext4] 21175ca434: mdadm-selftests.enchmarks/mdadm-selftests/tests/01r1fail.fail Message-ID: References: <20210427081539.GF32408@xsang-OptiPlex-9020> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20210427081539.GF32408@xsang-OptiPlex-9020> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org (Hmm, why did you cc linux-km on this report? I would have thought dm-devel would have made more sense?) On Tue, Apr 27, 2021 at 04:15:39PM +0800, kernel test robot wrote: > > FYI, we noticed the following commit (built with gcc-9): > > commit: 21175ca434c5d49509b73cf473618b01b0b85437 ("ext4: make prefetch_block_bitmaps default") > https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master > > in testcase: mdadm-selftests > version: mdadm-selftests-x86_64-5d518de-1_20201008 > with following parameters: > > disk: 1HDD > test_prefix: 01r1 > ucode: 0x21 So this failure makes no sense to me. Looking at the kmesg failure logs, it's failing in the md layer: kern :info : [ 99.775514] md/raid1:md0: not clean -- starting background reconstruction kern :info : [ 99.783372] md/raid1:md0: active with 3 out of 4 mirrors kern :info : [ 99.789735] md0: detected capacity change from 0 to 37888 kern :info : [ 99.796216] md: resync of RAID array md0 kern :crit : [ 99.900450] md/raid1:md0: Disk failure on loop2, disabling device. md/raid1:md0: Operation continuing on 2 devices. kern :crit : [ 99.918281] md/raid1:md0: Disk failure on loop1, disabling device. md/raid1:md0: Operation continuing on 1 devices. kern :info : [ 100.835833] md: md0: resync interrupted. kern :info : [ 101.852898] md: resync of RAID array md0 kern :info : [ 101.858347] md: md0: resync done. user :notice: [ 102.109684] /lkp/benchmarks/mdadm-selftests/tests/01r1fail... FAILED - see /var/tmp/01r1fail.log and /var/tmp/fail01r1fail.log for details The referenced commit just turns block bitmap prefetching in ext4. This should not cause md to failure; if so, that's an md bug, not an ext4 bug. There should not be anything that the file system is doing that would cause the kernel to think there is a disk failure. By the way, the reproduction instructions aren't working currently: > To reproduce: > > git clone https://github.com/intel/lkp-tests.git > cd lkp-tests > bin/lkp install job.yaml # job file is attached in this email This fails because lkp is trying to apply a patch which does not apply with the current version of the md tools. > bin/lkp split-job --compatible job.yaml > bin/lkp run compatible-job.yaml And the current versions lkp don't generate a compatible-job.yaml file when you run "lkp split-job --compatable"; instead it generates a new yaml file with a set of random characters to generate a unique name. (What Multics parlance would be called a "shriek name"[1] :-) Since I was having trouble running the reproduction; could you send the /var/tmp/*fail.logs so we could have a bit more insight what is going on? Thanks! - Ted