Received: by 2002:a4a:311b:0:0:0:0:0 with SMTP id k27-v6csp4869867ooa; Tue, 14 Aug 2018 11:43:01 -0700 (PDT) X-Google-Smtp-Source: AA+uWPxxOqpMrRYgor7KaHdo8oyASwgHe9SWlq/7GIs9X4sizQWdhDXaWIECYBZlsCIKbIGST4j6 X-Received: by 2002:a62:68c3:: with SMTP id d186-v6mr24720154pfc.70.1534272181536; Tue, 14 Aug 2018 11:43:01 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1534272181; cv=none; d=google.com; s=arc-20160816; b=Temc7A+/OLpXvt3YeBL/+me5ZZQiv2W7ENYUrGjEg4b2kfTV86SLJnIZdbNuvLNsNQ FGcGymCWDq/dtEqQfjcGSlUjMHc3LzrTkJKNlGW3q0bw2N2EbkS+WdRRbwQZA4rT1Ew+ zy7OyMBP685q1GzeC6pTjXubRAIWZI5xYrv/XNFh/Smdim+OjLqGDP4RWUabP5NQ4Wh7 8T1bG6OBjxjyB7DqxD6Cet9HiPc9mNXgNji23gfLIL5h+UT1nXnKm/zweNOXtIVPnl2s vE4MPDa7Rv069AXyx4bKTlVkA1AdEHSgZyDD4DYBjhonzO1+XcGST7uN20KrPuYqiuEx BKmw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=mks/At/tdeTrBCOD+WnHFyB1ndIn+HPM2roKXeLKDFE=; b=CVkCZoiQAA0qyQhaEQfIOHv/i/g+znUWDz80LK2RBqQDTHvN6tukKaONSlxO1B9Inm KuHXp0bA7SsCVvfA2nkouy41trb6L0CRC34j2mGk3fjHJlzikxc9OabyB3w1JfbwefFH c7DWB/khgiD0IbAvyTUySx349nh+1j0RNZOWdcCxWtEzbmDGBz+bdBU4yX+FwJXzQYTf 7WtfRSblMRZmnbdE/JH/ySeLzKR649EfPonfO6Q5E1JiGDV+MhGz1oF6fTCwbS0HfPDE UE72kTmUqlrQIrv2WJLg4MUzTAHLO0zkB2U6bNuQCJxiBjAIrgT7/OqaEVztA9lASk97 4hQg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m11-v6si17149326pla.45.2018.08.14.11.42.44; Tue, 14 Aug 2018 11:43:01 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728239AbeHNVaQ (ORCPT + 99 others); Tue, 14 Aug 2018 17:30:16 -0400 Received: from fieldses.org ([173.255.197.46]:58848 "EHLO fieldses.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728146AbeHNVaQ (ORCPT ); Tue, 14 Aug 2018 17:30:16 -0400 Received: by fieldses.org (Postfix, from userid 2815) id C01C21F67; Tue, 14 Aug 2018 14:41:46 -0400 (EDT) Date: Tue, 14 Aug 2018 14:41:46 -0400 From: "J. Bruce Fields" To: NeilBrown Cc: Jeff Layton , Alexander Viro , Martin Wilck , linux-fsdevel@vger.kernel.org, Frank Filz , linux-kernel@vger.kernel.org Subject: Re: [PATCH 0/5 v2] locks: avoid thundering-herd wake-ups Message-ID: <20180814184146.GM7906@fieldses.org> References: <153421852728.24426.2111161640156686201.stgit@noble> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <153421852728.24426.2111161640156686201.stgit@noble> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org This version looks correct to me, and simpler. I'll be curious to hear whatever you learn from testing! --b. On Tue, Aug 14, 2018 at 01:56:51PM +1000, NeilBrown wrote: > > V2, which added wake_non_conflicts() was more broken than V1 - as > Bruce explained there is no transitivity in the blocking relation > between locks. > So this series takes a simpler approach. > It still attached waiters between other waiters as necessary to ensure > that: > - a waiter is blocked by it's parent (fl->blocker) and all further > ancestors, and > - the list of waiters on fl_blocked are mutually non-conflicting. > > When a lock (the root of a tree of requests) is released, only its > immediate children (fl_blocked) are woken. > When any lock is woken (either because its fl_blocker was released > to due to a signal or similar) it with either: > - be granted > - be aborted > - be re-queued beneath some other lock. > > In the first case tree of blocked locks is moved across to the newly > created lock, and the invariants still hold. > In the order two cases, the tree or blocked waiters are all detached > and woken. > > Note that this series has not received much testing yet. > > Original description: > If you have a many-core machine, and have many threads all wanting to > briefly lock a give file (udev is known to do this), you can get quite > poor performance. > > When one thread releases a lock, it wakes up all other threads that > are waiting (classic thundering-herd) - one will get the lock and the > others go to sleep. > When you have few cores, this is not very noticeable: by the time the > 4th or 5th thread gets enough CPU time to try to claim the lock, the > earlier threads have claimed it, done what was needed, and released. > With 50+ cores, the contention can easily be measured. > > This patchset creates a tree of pending lock request in which siblings > don't conflict and each lock request does conflict with its parent. > When a lock is released, only requests which don't conflict with each > other a woken. > > Testing shows that lock-acquisitions-per-second is now fairly stable even > as number of contending process goes to 1000. Without this patch, > locks-per-second drops off steeply after a few 10s of processes. > > There is a small cost to this extra complexity. > At 20 processes running a particular test on 72 cores, the lock > acquisitions per second drops from 1.8 million to 1.4 million with > this patch. For 100 processes, this patch still provides 1.4 million > while without this patch there are about 700,000. > > NeilBrown > > --- > > NeilBrown (5): > fs/locks: rename some lists and pointers. > fs/locks: split out __locks_wake_up_blocks(). > fs/locks: allow a lock request to block other requests. > fs/locks: change all *_conflict() functions to return bool. > fs/locks: create a tree of dependent requests. > > > fs/cifs/file.c | 2 - > fs/locks.c | 156 ++++++++++++++++++++++++++------------- > include/linux/fs.h | 7 +- > include/trace/events/filelock.h | 16 ++-- > 4 files changed, 119 insertions(+), 62 deletions(-) > > -- > Signature