Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751862Ab3IQAFW (ORCPT ); Mon, 16 Sep 2013 20:05:22 -0400 Received: from dkim1.fusionio.com ([66.114.96.53]:36369 "EHLO dkim1.fusionio.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751216Ab3IQAFT (ORCPT ); Mon, 16 Sep 2013 20:05:19 -0400 X-ASG-Debug-ID: 1379376318-0421b502223dbb80001-xx1T2L X-Barracuda-Envelope-From: JBacik@fusionio.com Date: Mon, 16 Sep 2013 20:05:16 -0400 From: Josef Bacik To: Andrew Morton CC: Josef Bacik , , , , Subject: Re: [PATCH] rwsem: add rwsem_is_contended Message-ID: <20130917000516.GJ2446@localhost.localdomain> X-ASG-Orig-Subj: Re: [PATCH] rwsem: add rwsem_is_contended References: <1377872041-390-1-git-send-email-jbacik@fusionio.com> <20130916160547.371b74f91511a42ac263449e@linux-foundation.org> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Disposition: inline In-Reply-To: <20130916160547.371b74f91511a42ac263449e@linux-foundation.org> User-Agent: Mutt/1.5.21 (2011-07-01) X-Originating-IP: [10.101.1.160] X-Barracuda-Connect: cas1.int.fusionio.com[10.101.1.40] X-Barracuda-Start-Time: 1379376318 X-Barracuda-Encrypted: AES128-SHA X-Barracuda-URL: http://10.101.1.181:8000/cgi-mod/mark.cgi X-Barracuda-Spam-Score: 0.00 X-Barracuda-Spam-Status: No, SCORE=0.00 using global scores of TAG_LEVEL=1000.0 QUARANTINE_LEVEL=1000.0 KILL_LEVEL=9.0 tests= X-Barracuda-Spam-Report: Code version 3.2, rules version 3.2.2.140674 Rule breakdown below pts rule name description ---- ---------------------- -------------------------------------------------- Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1876 Lines: 36 On Mon, Sep 16, 2013 at 04:05:47PM -0700, Andrew Morton wrote: > On Fri, 30 Aug 2013 10:14:01 -0400 Josef Bacik wrote: > > > Btrfs uses an rwsem to control access to its extent tree. Threads will hold a > > read lock on this rwsem while they scan the extent tree, and if need_resched() > > they will drop the lock and schedule. The transaction commit needs to take a > > write lock for this rwsem for a very short period to switch out the commit > > roots. If there are a lot of threads doing this caching operation we can starve > > out the committers which slows everybody out. To address this we want to add > > this functionality to see if our rwsem has anybody waiting to take a write lock > > so we can drop it and schedule for a bit to allow the commit to continue. > > Thanks, > > > > This sounds rather nasty and hacky. Rather then working around a > locking shortcoming in a caller it would be better to fix/enhance the > core locking code. What would such a change need to do? > > Presently rwsem waiters are fifo-queued, are they not? So the commit > thread will eventually get that lock. Apparently that's not working > adequately for you but I don't fully understand what it is about these > dynamics which is causing observable problems. > So the problem is not that its normal lock starvation, it's more our particular use case that is causing the starvation. We can have lots of people holding readers and simply never give them up for long periods of time, which is why we need this is_contended helper so we know to drop things and let the committer through. Thanks, Josef -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/