Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751591Ab2FMEVi (ORCPT ); Wed, 13 Jun 2012 00:21:38 -0400 Received: from mga01.intel.com ([192.55.52.88]:46811 "EHLO mga01.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750761Ab2FMEVh (ORCPT ); Wed, 13 Jun 2012 00:21:37 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.71,315,1320652800"; d="scan'208";a="164932956" Date: Wed, 13 Jun 2012 12:21:15 +0800 From: Fengguang Wu To: Dave Chinner Cc: Wanpeng Li , linux-kernel@vger.kernel.org, Gavin Shan , Wanpeng Li Subject: Re: [PATCH] writeback: avoid race when update bandwidth Message-ID: <20120613042115.GA25842@localhost> References: <1339496803-2885-1-git-send-email-liwp.linux@gmail.com> <20120612112129.GA16639@localhost> <20120613035647.GU22848@dastard> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20120613035647.GU22848@dastard> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2261 Lines: 56 On Wed, Jun 13, 2012 at 01:56:47PM +1000, Dave Chinner wrote: > On Tue, Jun 12, 2012 at 07:21:29PM +0800, Fengguang Wu wrote: > > On Tue, Jun 12, 2012 at 06:26:43PM +0800, Wanpeng Li wrote: > > > From: Wanpeng Li > > > > That email address is no longer in use? > > > > > Since bdi->wb.list_lock is used to protect the b_* lists, > > > so the flushers who call wb_writeback to writeback pages will > > > stuck when bandwidth update policy holds this lock. In order > > > to avoid this race we can introduce a new bandwidth_lock who > > > is responsible for protecting bandwidth update policy. > > This is not a race condition - it is a lock contention condition. Nod. > > This looks good to me. wb.list_lock could be contended and it's better > > for bdi_update_bandwidth() to use a standalone and hardly contended > > lock. > > I'm not sure it will be "hardly contended". That's a global lock, so > now we'll end up with updates on different bdis contending and it's > not uncommon to see a couple of thousand processes on large machines > beating on balance_dirty_pages(). Putting a global scope lock > around such a function doesn't seem like a good solution to me. It's more about the number of bdi's than the number of processes that matters. Because here is a per-bdi 200ms ratelimit: bdi_update_bandwidth(): if (time_is_after_eq_jiffies(bdi->bw_time_stamp + BANDWIDTH_INTERVAL)) return; // lock it So a global should be enough when there are only dozens of disks. However, the global bandwidth_lock will probably become a problem when there comes hundreds of disks. If there are (or will be) such setups, I'm fine to revert to the old per-bdi locking. > Oh, and if you want to remove the dirty_lock from > global_update_limit(), then replacing the lock with a cmpxchg loop > will do it just fine.... Yes. But to be frank, I don't care about that dirty_lock at all, because it has its own 200ms rate limiting :-) Thanks, Fengguang -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/