Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp643639imm; Wed, 18 Jul 2018 08:16:46 -0700 (PDT) X-Google-Smtp-Source: AAOMgpc+rka6od0HH1zjDMR5r4+vNHCcWAUV9Y/Sqzqhag+7GFvy1t9KE9s5/YUu1mAUK4Cp/WXm X-Received: by 2002:a17:902:9a8a:: with SMTP id w10-v6mr6299116plp.333.1531927006642; Wed, 18 Jul 2018 08:16:46 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1531927006; cv=none; d=google.com; s=arc-20160816; b=X6bnuf84tx4jETecfYXbpnL8wnZi1Bs/GxTRVdasylfpvjMiyBTeDQJyoxq4XRAW93 rkk2WMwh4twXCK0ECeiTIGuJ77cSHvMAJKG285T6qWoqi2OrjNya6fAJqFZY29kvOhd7 OtX8h7NKvV+2keBg2mMr5bqQCeoCs9SG9LHY57rrUmJmUYXhA4kXUNUEftOvxwnvC6OE 76DHZzS9ICWMI4P76aJHR2prXxmpfGlufyfoW44bcaFxyk8o5SRXtWpoO4D/DQXva8sg R4vrWzm0IQuscZCBPiU+or8FOu2AmPOICitJUgB4iWu3hTpg2NC66ofyFGpKVPVEDLKz 1NPg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:organization:from:references:cc:to:subject :arc-authentication-results; bh=W15DDaKHMb7cMncT7FBkbUyDKfLQECEthdqB+DidQcc=; b=btfzkSlo7jUegTGYDG5MSyF6aucVO8IaNU177JHE/PX4HLua6Bj7zSZsJA+QbeKe03 lUIY9jVT+zMbVht+xEoyMNLp2TkhiLHj7hPybqbgYMnXaBoJcP6EePRmAti7KJ68Bxcz g139oqAErtyTCHcGxFXqV4xHbJs5gZsaMAvXCoSmbIbNPeybAjgAUCoYMKpQ7lJcbFxL CNHSApLOpe65TMVJ5fWxKauo6l6F8MzzvA9CFBu44KAzxq6hwrIPiSfmQEyJHC+bcH/p kkMBKxBFTirj3ebBnNEwdWtI4M1cQ9/J5v2WA2KWUv1p+3OvLGgieBxHDn0quqGf+KTw y5yA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id s26-v6si3439226pge.339.2018.07.18.08.16.31; Wed, 18 Jul 2018 08:16:46 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1731154AbeGRPyF (ORCPT + 99 others); Wed, 18 Jul 2018 11:54:05 -0400 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:52276 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1730998AbeGRPyF (ORCPT ); Wed, 18 Jul 2018 11:54:05 -0400 Received: from smtp.corp.redhat.com (int-mx03.intmail.prod.int.rdu2.redhat.com [10.11.54.3]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 6C045401EF07; Wed, 18 Jul 2018 15:15:42 +0000 (UTC) Received: from llong.remote.csb (dhcp-17-175.bos.redhat.com [10.18.17.175]) by smtp.corp.redhat.com (Postfix) with ESMTP id 273A9111E402; Wed, 18 Jul 2018 15:15:41 +0000 (UTC) Subject: Re: [PATCH v2] locking/rwsem: Take read lock immediate if queue empty with no writer To: Peter Zijlstra , Ingo Molnar , Will Deacon Cc: linux-kernel@vger.kernel.org, Mark Ray , Joe Mario , Scott Norton References: <1531506653-5244-1-git-send-email-longman@redhat.com> From: Waiman Long Organization: Red Hat Message-ID: Date: Wed, 18 Jul 2018 11:15:40 -0400 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.2.0 MIME-Version: 1.0 In-Reply-To: <1531506653-5244-1-git-send-email-longman@redhat.com> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Content-Language: en-US X-Scanned-By: MIMEDefang 2.78 on 10.11.54.3 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.5]); Wed, 18 Jul 2018 15:15:42 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.5]); Wed, 18 Jul 2018 15:15:42 +0000 (UTC) for IP:'10.11.54.3' DOMAIN:'int-mx03.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'longman@redhat.com' RCPT:'' Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 07/13/2018 02:30 PM, Waiman Long wrote: > It was discovered that a constant stream of readers might cause the > count to go negative most of the time after an initial trigger by a > writer even if no writer was present afterward. As a result, most of the > readers would have to go through the slowpath reducing their performance. > > To avoid that from happening, an additional check is added to detect > the special case that the reader in the critical section is the only > one in the wait queue and no writer is present. When that happens, it > can just have the lock and return immediately without further action. > Other incoming readers won't see a waiter is present and be forced > into the slowpath. > > The additional code is in the slowpath and so should not have an impact > on rwsem performance. However, in the special case listed above, it may > greatly improve performance. > > The issue was found in a customer site where they had an application > that pounded on the pread64 syscalls heavily on an XFS filesystem. The > application was run in a recent 4-socket boxes with a lot of CPUs. They > saw significant spinlock contention in the rwsem_down_read_failed() call. > With this patch applied, the system CPU usage went from 85% to 57%, > and the spinlock contention in the pread64 syscalls was gone. > > v2: Add customer testing results and remove wording that may cause > confusion. > > Signed-off-by: Waiman Long > --- > kernel/locking/rwsem-xadd.c | 13 ++++++++++++- > 1 file changed, 12 insertions(+), 1 deletion(-) > > diff --git a/kernel/locking/rwsem-xadd.c b/kernel/locking/rwsem-xadd.c > index 3064c50..bf0570e 100644 > --- a/kernel/locking/rwsem-xadd.c > +++ b/kernel/locking/rwsem-xadd.c > @@ -233,8 +233,19 @@ static void __rwsem_mark_wake(struct rw_semaphore *sem, > waiter.type = RWSEM_WAITING_FOR_READ; > > raw_spin_lock_irq(&sem->wait_lock); > - if (list_empty(&sem->wait_list)) > + if (list_empty(&sem->wait_list)) { > + /* > + * In the unlikely event that the task is the only one in > + * the wait queue and a writer isn't present, it can have > + * the lock and return immediately without going through > + * the remaining slowpath code. > + */ > + if (unlikely(atomic_long_read(&sem->count) >= 0)) { > + raw_spin_unlock_irq(&sem->wait_lock); > + return sem; > + } > adjustment += RWSEM_WAITING_BIAS; > + } > list_add_tail(&waiter.list, &sem->wait_list); > > /* we're now waiting on the lock, but no longer actively locking */ Ping! Any comment on this one? Cheers, Longman