Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp304166ybb; Thu, 19 Mar 2020 22:35:54 -0700 (PDT) X-Google-Smtp-Source: ADFU+vsGhH9sn7wfyUYRSaLZOj/Iz2r7WwE+5OjYWENJn/ZmClI2R/hU8QIBepCpjWpfuqgm5Uz0 X-Received: by 2002:a9d:5c83:: with SMTP id a3mr295403oti.347.1584682554546; Thu, 19 Mar 2020 22:35:54 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1584682554; cv=none; d=google.com; s=arc-20160816; b=GDMD7vmjocx+nle3TMuwfznWK4P7YWT1IaCLySpP9cwWq3H/3V+nH2X+abmgsHiuDi 4yNZeaaUeBPMrdXez/4JZQFSvNTDY47U/QYbWkMCse21t2BcOmuXcSpmuGFEqA3oNHjl h4W5EonUqG/tNgibrsBuV4gV7N12DJG+Q8Ljb49cdOFxQK2g91hLu03oAQkgA0Ba0OhG dSdfUlUhUS/fmkAME72KcoObHFxSUQgqjbtyRNRq3DEUDGA6gqepjvveuSz465BlyNG3 1w75vYGdHuIfvHJl8Qqn2qlI5UzN7x+aqeay1LSPXIJEpN4TFosmFGPO7+IMINr9rQR7 dEFQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date:from :references:cc:to:subject; bh=TG3Xgi2igm1e1fCd0MFpqPsPRpygerlt36XhR69XN7A=; b=Fo5Haf4EZdxQmk387/mqdMOlkoRy0+tRLZyz4CHG77XTPVFedPsehfeGi9fsx9Xfob HFkxMHtjqgsOCnprj4YjRQ4uKXmwNcgrWhdYbLopU9z62HVK+nJ6v/yPWK1sxa2Lv03G L6QbPjaDPKYYapGqKl+cKv0Dei60d5/vGkP4nUCxF2incC482g5FO3sycCgGUftguqmP Y59aE6zgmtCzWXK2ZLYwUVx8J1eB5HrNA8GfEO6PV0Bj7l6XfO9HUPHy74QmCW+DwSm3 p1/AZqCinf50/dz9zNREGq5UIfTpqJG3jh/2vuqNu/EE1XvyMwy09i0ZMurabD7HJWFW i3SA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u10si2151509oib.91.2020.03.19.22.35.33; Thu, 19 Mar 2020 22:35:54 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726767AbgCTFfB (ORCPT + 99 others); Fri, 20 Mar 2020 01:35:01 -0400 Received: from mx0a-001b2d01.pphosted.com ([148.163.156.1]:60690 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725883AbgCTFfA (ORCPT ); Fri, 20 Mar 2020 01:35:00 -0400 Received: from pps.filterd (m0098393.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id 02K5XeI1028516 for ; Fri, 20 Mar 2020 01:34:59 -0400 Received: from e06smtp02.uk.ibm.com (e06smtp02.uk.ibm.com [195.75.94.98]) by mx0a-001b2d01.pphosted.com with ESMTP id 2yu863w4xj-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Fri, 20 Mar 2020 01:34:59 -0400 Received: from localhost by e06smtp02.uk.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Fri, 20 Mar 2020 05:34:57 -0000 Received: from b06cxnps4076.portsmouth.uk.ibm.com (9.149.109.198) by e06smtp02.uk.ibm.com (192.168.101.132) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Fri, 20 Mar 2020 05:34:53 -0000 Received: from d06av26.portsmouth.uk.ibm.com (d06av26.portsmouth.uk.ibm.com [9.149.105.62]) by b06cxnps4076.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id 02K5Yqjf25821294 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Fri, 20 Mar 2020 05:34:53 GMT Received: from d06av26.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id D5496AE045; Fri, 20 Mar 2020 05:34:52 +0000 (GMT) Received: from d06av26.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id B7AD0AE04D; Fri, 20 Mar 2020 05:34:51 +0000 (GMT) Received: from localhost.localdomain (unknown [9.199.45.54]) by d06av26.portsmouth.uk.ibm.com (Postfix) with ESMTP; Fri, 20 Mar 2020 05:34:51 +0000 (GMT) Subject: Re: Ext4 corruption with VM images as 3 > drop_caches To: linux-ext4@vger.kernel.org, "Theodore Y. Ts'o" Cc: "Aneesh Kumar K.V" , Jan Kara References: <87pndagw7s.fsf@linux.ibm.com> From: Ritesh Harjani Date: Fri, 20 Mar 2020 11:04:50 +0530 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.7.2 MIME-Version: 1.0 In-Reply-To: <87pndagw7s.fsf@linux.ibm.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit X-TM-AS-GCONF: 00 x-cbid: 20032005-0008-0000-0000-000003606B07 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 20032005-0009-0000-0000-00004A81CB29 Message-Id: <20200320053451.B7AD0AE04D@d06av26.portsmouth.uk.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.138,18.0.645 definitions=2020-03-19_10:2020-03-19,2020-03-19 signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 phishscore=0 lowpriorityscore=0 malwarescore=0 spamscore=0 clxscore=1015 impostorscore=0 mlxlogscore=999 mlxscore=0 suspectscore=0 adultscore=0 bulkscore=0 priorityscore=1501 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2003020000 definitions=main-2003200022 Sender: linux-ext4-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org On 3/19/20 6:54 PM, Ritesh Harjani wrote: > > > On 3/18/20 9:17 AM, Aneesh Kumar K.V wrote: >> Hi, >> >> With new vm install I am finding corruption with the vm image if I >> follow up the install with echo 3 > /proc/sys/vm/drop_caches >> >> The file system reports below error. >> >> Begin: Running /scripts/local-bottom ... done. >> Begin: Running /scripts/init-bottom ... >> [    4.916017] EXT4-fs error (device vda2): ext4_lookup:1700: inode >> #787185: comm sh: iget: checksum invalid >> done. >> [    5.244312] EXT4-fs error (device vda2): ext4_lookup:1700: inode >> #917954: comm init: iget: checksum invalid >> [    5.257246] EXT4-fs error (device vda2): ext4_lookup:1700: inode >> #917954: comm init: iget: checksum invalid >> /sbin/init: error while loading shared libraries: libc.so.6: cannot >> open shared object file: Error 74 >> [    5.271207] Kernel panic - not syncing: Attempted to kill init! >> exitcode=0x00007f00 >> >> And debugfs reports >> >> debugfs:  stat <917954> >> Inode: 917954   Type: bad type    Mode:  0000   Flags: 0x0 >> Generation: 0    Version: 0x00000000 >> User:     0   Group:     0   Size: 0 >> File ACL: 0 >> Links: 0   Blockcount: 0 >> Fragment:  Address: 0    Number: 0    Size: 0 >> ctime: 0x00000000 -- Wed Dec 31 18:00:00 1969 >> atime: 0x00000000 -- Wed Dec 31 18:00:00 1969 >> mtime: 0x00000000 -- Wed Dec 31 18:00:00 1969 >> Size of extra inode fields: 0 >> Inode checksum: 0x00000000 >> BLOCKS: >> debugfs: >> >> Bisecting this finds >> Commit 244adf6426ee31a83f397b700d964cff12a247d3("ext4: make >> dioread_nolock the default") >> as bad. If I revert the same on top of linus >> upstream(fb33c6510d5595144d585aa194d377cf74d31911) >> I don't hit the corrupttion anymore. > > Tried replicating this and could easily replicate it on Power box. > I tried to reproduce this on x86 too, but could not reproduce on x86. > Now one difference on Power could be that pagesize is 64K and fs > blocksize is 4K. > > The issue looks like the guest qemu image file is not properly written > back, after host does echo 3 > drop_caches. (correct me if this is not > the case). Ok. So tried this issue with passing "cache=directsync" parameter to drive file. This parameter says it should bypass the host side page cache. With this parameter, I don't see this issue on Power box. -ritesh > > I tried replicating via below test, but it could not reproduce. > > Any idea what kind of unit test could be written for this? > I am not sure how exactly qemu is writing to it's image file. > > > 1. Create 2 files. "mmap-file", "mmap-data". > 2. "mmap-file" is a 2GB sparse file. Then at some random offsets (tried > with both 64KB align and 4KB align offsets), try to write > pagesize/blocksize amount of known data pattern. > 3. These offsets (which are pagesize/blocksize align) are recorded into > "mmap-data" file via normal read/write calls. > 4. Then after we wrote to both files, we munmap the "mmap-file" and > close both of these files. > 5. Then we do echo 3 > drop_caches. > 6. Then in the verify phase, using the offsets written in "mmap-data" > file, I read the "mmap-file" to verify if it's contents are proper or > not. > With that could not reproduce this issue. > > > -ritesh > >