From: Sandeep Joshi Subject: process hangs in ext4_sync_file Date: Mon, 21 Oct 2013 18:09:02 +0530 Message-ID: Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 To: linux-ext4@vger.kernel.org Return-path: Received: from mail-vb0-f52.google.com ([209.85.212.52]:51496 "EHLO mail-vb0-f52.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753319Ab3JUMjF (ORCPT ); Mon, 21 Oct 2013 08:39:05 -0400 Received: by mail-vb0-f52.google.com with SMTP id f12so3804208vbg.25 for ; Mon, 21 Oct 2013 05:39:02 -0700 (PDT) Sender: linux-ext4-owner@vger.kernel.org List-ID: I am seeing a problem reported 4 years earlier https://lkml.org/lkml/2009/3/12/226 (same stack as seen by Alexander) The problem is reproducible. Let me know if you need any info in addition to that seen below. I have multiple threads in a process doing heavy IO on a ext4 filesystem mounted with (discard, noatime) on a SSD or HDD. This is on Linux 3.8.0-29-generic #42~precise1-Ubuntu SMP Wed Aug 14 16:19:23 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux For upto minutes at a time, one of the threads seems to hang in sync to disk. When I check the thread stack in /proc, I find that the stack is one of the following two ] sleep_on_page+0xe/0x20 [] wait_on_page_bit+0x78/0x80 [] filemap_fdatawait_range+0x10c/0x1a0 [] filemap_write_and_wait_range+0x68/0x80 [] ext4_sync_file+0x6f/0x2b0 [] vfs_fsync+0x2b/0x40 [] sys_msync+0x143/0x1d0 [] system_call_fastpath+0x1a/0x1f [] 0xffffffffffffffff OR [] jbd2_log_wait_commit+0xb5/0x130 [] jbd2_complete_transaction+0x53/0x90 [] ext4_sync_file+0x1ed/0x2b0 [] vfs_fsync+0x2b/0x40 [] sys_msync+0x143/0x1d0 [] system_call_fastpath+0x1a/0x1f [] 0xffffffffffffffff Any clues? -Sandeep