Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752783Ab0KDTx6 (ORCPT ); Thu, 4 Nov 2010 15:53:58 -0400 Received: from fxip-0047f.externet.hu ([88.209.222.127]:43428 "EHLO pomaz-ex.szeredi.hu" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751570Ab0KDTx4 (ORCPT ); Thu, 4 Nov 2010 15:53:56 -0400 To: Andrea Arcangeli CC: dave@linux.vnet.ibm.com, miklos@szeredi.hu, linux-fsdevel@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, shenlinf@cn.ibm.com, volobuev@us.ibm.com, mel@linux.vnet.ibm.com, dingc@cn.ibm.com, lnxninja@us.ibm.com In-reply-to: <20101104164144.GI11602@random.random> (message from Andrea Arcangeli on Thu, 4 Nov 2010 17:41:44 +0100) Subject: Re: Deadlocks with transparent huge pages and userspace fs daemons References: <1288817005.4235.11393.camel@nimitz> <20101104164144.GI11602@random.random> Message-Id: From: Miklos Szeredi Date: Thu, 04 Nov 2010 20:53:28 +0100 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1572 Lines: 36 On Thu, 4 Nov 2010, Andrea Arcangeli wrote: > On Wed, Nov 03, 2010 at 01:43:25PM -0700, Dave Hansen wrote: > > some IBM testers ran into some deadlocks. It appears that the > > khugepaged process is trying to migrate one of a filesystem daemon's > > pages while khugepaged holds the daemon's mmap_sem for write. > > Correct. So now I'm wondering what happens if some library of this > daemon happens to execute a munmap that calls split_vma and allocates > memory while holding the mmap_sem, and the memory allocation triggers > I/O that will have to be executed by the daemon. mmap_sem is not really relevant here(*), page lock is. And in vmscan.c, there's not a single blocking lock_page(). Also, as I mentioned, fuse does writeback in a special way: it copies dirty pages to non-page cache pages which don't interact in any way with reclaim. Fuse writeback is instantaneous from the reclaim PoV. > I think this could be fixed in userland, this applies to openvpn too > if used as nfs backend. How? Thanks, Miklos (*) In the original gpfs trace it is relevant but only because the page migration is triggered by khugepaged. In the reproduced example the page migration is triggered directly by an allocation. Since page migration does blocking lock_page(), there's really no way to avoid a deadlock in that case. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/