Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp1298742imm; Wed, 8 Aug 2018 14:29:38 -0700 (PDT) X-Google-Smtp-Source: AA+uWPzM1SfPKGNzio1zHsiBdsWNnpOvHxvoWcz4qm08DI7PiNYTwt8tIGp9g7jZZyCMINviR6uB X-Received: by 2002:a17:902:9a06:: with SMTP id v6-v6mr4140444plp.316.1533763778441; Wed, 08 Aug 2018 14:29:38 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533763778; cv=none; d=google.com; s=arc-20160816; b=YLg6c2DQ7KGo4Mr3i+cC816XddLqh5QKsRF10lYd+opsKWdFqpR2NivgJ7bdxWm/Ip YwG3W9OjKkFPDa3IQlO24FY2tLCiRyn3vFw7TKRIQCLxATIscbOhYIqk79+cVdtKmBr8 WbvJhEd+1hDGu5Uz4EiwGf9XOVYHTrtWyBzqwQVL1D3+l2OoqH34eYOYrB+/l6F+1LTP u6QmgPptsCsnaKw3hN7PS4XbVVGz7v3EVmJJJzimAOH0w3giWtqLbis1Fj21xSgvm640 xnpEEoZbdP1zBM529UIurD6twbTu4b82xqUhEwdQv0vXcwMST7I0PHC7H7Q/6WySgFu5 8KRQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=8eK288FeyrTHnFASa7i8P6hoFyNl1tV6NaBZpxCU9hs=; b=OwccKB9EYASlu2M6Jhgormfq/aYz5K/3TVtpKaMltS/9D5O5Ot1PzUQb2PJhWOLlmC r3isT4fqx+FJbIkej+nEBWSE5W1jFlwCNt9so/Lh+U+LQL8BGVajXiTewQUbJ4NnS1xp TkAqVQO1MXOvEY8YlnDwnjw7dSbg6Hivj9gabPEqdJOQLMcjpObjljyApEFQcgQ1DezJ 4U9XWrj9H9SBGn700TU7XGwCk7NaOwTqXJUI3qy2vonWAIniO2VYM97id7wp0+x64Fd+ 4YIMU8BZ1g+b/7GCCBMBA/xjLPxAVbDQYD+F94rJwP6lrZ4Poy8RofjHkuYXg5XhFKaV kdKA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d33-v6si4979719pgd.245.2018.08.08.14.29.24; Wed, 08 Aug 2018 14:29:38 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730021AbeHHXuC (ORCPT + 99 others); Wed, 8 Aug 2018 19:50:02 -0400 Received: from fieldses.org ([173.255.197.46]:47224 "EHLO fieldses.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728233AbeHHXuC (ORCPT ); Wed, 8 Aug 2018 19:50:02 -0400 Received: by fieldses.org (Postfix, from userid 2815) id 597AEBD3; Wed, 8 Aug 2018 17:28:32 -0400 (EDT) Date: Wed, 8 Aug 2018 17:28:32 -0400 From: "J. Bruce Fields" To: NeilBrown Cc: Jeff Layton , Alexander Viro , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Martin Wilck Subject: Re: [PATCH 0/4] locks: avoid thundering-herd wake-ups Message-ID: <20180808212832.GF23873@fieldses.org> References: <153369219467.12605.13472423449508444601.stgit@noble> <20180808195445.GD23873@fieldses.org> <20180808200912.GE23873@fieldses.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180808200912.GE23873@fieldses.org> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Aug 08, 2018 at 04:09:12PM -0400, J. Bruce Fields wrote: > On Wed, Aug 08, 2018 at 03:54:45PM -0400, J. Bruce Fields wrote: > > On Wed, Aug 08, 2018 at 11:51:07AM +1000, NeilBrown wrote: > > > If you have a many-core machine, and have many threads all wanting to > > > briefly lock a give file (udev is known to do this), you can get quite > > > poor performance. > > > > > > When one thread releases a lock, it wakes up all other threads that > > > are waiting (classic thundering-herd) - one will get the lock and the > > > others go to sleep. > > > When you have few cores, this is not very noticeable: by the time the > > > 4th or 5th thread gets enough CPU time to try to claim the lock, the > > > earlier threads have claimed it, done what was needed, and released. > > > With 50+ cores, the contention can easily be measured. > > > > > > This patchset creates a tree of pending lock request in which siblings > > > don't conflict and each lock request does conflict with its parent. > > > When a lock is released, only requests which don't conflict with each > > > other a woken. > > > > Are you sure you aren't depending on the (incorrect) assumption that "X > > blocks Y" is a transitive relation? > > > > OK I should be able to answer that question myself, my patience for > > code-reading is at a real low this afternoon.... > > In other words, is there the possibility of a tree of, say, exclusive > locks with (offset, length) like: > > (0, 2) waiting on (1, 2) waiting on (2, 2) waiting on (0, 4) > > and when waking (0, 4) you could wake up (2, 2) but not (0, 2), leaving > a process waiting without there being an actual conflict. After batting it back and forth with Jeff on IRC.... So do I understand right that when we wake a waiter, we leave its own tree of waiters intact, and when it wakes if it finds a conflict it just adds it lock (with tree of waiters) in to the tree of the conflicting lock? If so then yes I think that depends on the transitivity assumption--you're assuming that finding a conflict between the root of the tree and a lock proves that all the other members of the tree also conflict. So maybe this example works. (All locks are exclusive and written (offset, length), X->Y means X is waiting on Y.) process acquires (0,3) 2nd process requests (1,2), is put to sleep. 3rd process requests (0,2), is put to sleep. The tree of waiters now looks like (0,2)->(1,2)->(0,3) (0,3) is unlocked. A 4th process races in and locks (2,2). The 2nd process wakes up, sees this new conflict, and waits on (2,2). Now the tree looks like (0,2)->(1,2)->(2,2), and (0,2) is waiting for no reason. ? --b.