Return-Path: Received: from rcsinet10.oracle.com ([148.87.113.121]:57511 "EHLO rcsinet10.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751751Ab1ASC7I (ORCPT ); Tue, 18 Jan 2011 21:59:08 -0500 Date: Wed, 19 Jan 2011 10:56:33 +0800 From: Wengang Wang To: Chuck Lever Cc: Nick Piggin , Trond Myklebust , linux-fsdevel , Linux NFS Mailing List Subject: Re: NFS dio aio bug Message-ID: <20110119025633.GA7713@laptop.jp.oracle.com> References: <4E70BD9B-DB23-42EC-B28D-998E16FEC189@oracle.com> Content-Type: text/plain; charset=us-ascii In-Reply-To: Sender: linux-nfs-owner@vger.kernel.org List-ID: MIME-Version: 1.0 On 11-01-18 17:01, Chuck Lever wrote: > > On Jan 14, 2011, at 4:48 PM, Nick Piggin wrote: > > > On Sat, Jan 15, 2011 at 3:08 AM, Chuck Lever wrote: > >> Hi Nick- > >> > >> On Jan 13, 2011, at 8:29 PM, Nick Piggin wrote: > >> > >>> Hi Trond, > >>> > >>> I'm getting use after frees in aio code in NFS > >> > >> Can you describe how to reproduce this? > > > > It was with the aio-dio stress code from xfstests, #207 I think > > or 208. > > > > Running it for a short time and then ^C ing it would tend to > > trigger it. > > > > I'll have to get you more details after I come back from > > travelling in a week's time. > > I can reproduce this fairly reliably here with test 208. ^C makes it happen almost immediately, but it can also trigger sometimes just by running the test. > > > > >> > >>> [ 2703.396766] Call Trace: > >>> [ 2703.396858] [] ? native_sched_clock+0x27/0x80 > >>> [ 2703.396959] [] ? put_lock_stats+0xe/0x40 > >>> [ 2703.397058] [] ? lock_release_holdtime+0xa8/0x140 > >>> [ 2703.397159] [] lock_acquire+0x95/0x1b0 > >>> [ 2703.397260] [] ? aio_put_req+0x2b/0x60 > >>> [ 2703.397361] [] ? get_parent_ip+0x11/0x50 > >>> [ 2703.397464] [] _raw_spin_lock_irq+0x41/0x80 > >>> [ 2703.397564] [] ? aio_put_req+0x2b/0x60 > >>> [ 2703.397662] [] aio_put_req+0x2b/0x60 > >>> [ 2703.397761] [] do_io_submit+0x2be/0x7c0 > >>> [ 2703.397895] [] sys_io_submit+0xb/0x10 > >>> [ 2703.397995] [] system_call_fastpath+0x16/0x1b > >>> > >>> Adding some tracing, it is due to nfs completing the request then > >>> returning something other than -EIOCBQUEUED, so aio.c > >>> also completes the request. > >> > >> Is this with reads, writes, or both? Are the I/O requests smaller than, equal to, or larger than rsize or wsize? > >> > >> We have a related bug report: hitting the BUG at fs/aio.c:552 (OEL5) and similar for more recent kernels. Looks like dreq refcounting is faulty somehow. > > > > I only saw it with writes. The request was being completed > > in nfs direct write path when I added some tracing. It was > > very easy to reproduce, I just didn't have time to bisect it, > > but I can do that when I get back if you don't have it solved > > by then. > > I'm looking at it now. This looks exactly like the symptoms of our bug. Thanks for the report and the reproducer. Yes, there is a fs/aio.c:552 bug filed. I can reproduce it as will and it happens with both read and write. Currently, I have this anylysis(correct me if I am wrong): In the sys_io_submit path, the vfs doesn't hope the FS under ground drop the ki_users in "error" case. Actually it is -ERESTARTSYS, in my test, instead of -EIOCBQUEUED. But seems the nfs drops ki_users in error case as well as successful cases too. I now have a v1 patch to "fix" it. I will post it later after more thinking about it. thanks, wengang.