Received: by 10.223.176.46 with SMTP id f43csp3879332wra; Tue, 23 Jan 2018 00:13:30 -0800 (PST) X-Google-Smtp-Source: AH8x225ln0kKD7kY/89TL6Iwi6Qyg5dbZVsm5fSo08Mm7VRnsbulK7D/UZnzXfkJVdjXAcH/v+oI X-Received: by 10.98.12.23 with SMTP id u23mr9781165pfi.81.1516695210549; Tue, 23 Jan 2018 00:13:30 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1516695210; cv=none; d=google.com; s=arc-20160816; b=blXquw/bg7PTfqW9pLoHszw1fpKuoJd5XjcBZ5LGv983gcRLAB6vBZHn+IEaLuRvB5 ehtZ6gDi82JwxkY5gc0Unq5AYD1X8rHXo4D+aMH4yvUhVbVfzOqt3iPjN8hGJhknwxBv elutSHP2bDAoJ7rWn8fW6ZIxN3P86rZxfJ6vX39ErjTlCIF9Y4mBZJNQHWlnaSWmPulY PTedcaeuqRwYn/afO/zSay2hRJQ//mH5u8S5rkU5lRo9XaCD52WoA8aS7lCpnvCw/SEH ixctGlIayccUfB6Ibh+hWJsoypLbReLiOlX2FxH/iGnm3Po58slOS2/IeBP97gt6VG9S yCWA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:arc-authentication-results; bh=40pMp3QetqANpc2L6b7fl+Z6c+U0JYwIAFTup3QisF8=; b=hASBZcEE9nvldjN7rwNmGCsKjqM3LpLb97uyHKvpee1kiNpdhIqd2e8H9ZGdI+3En8 3aOopT3da0gKxFKdmW3yE3mjG6R5Mqgx8CMMheeHHjX7/PEJx2E9xFSg/SUJymqdcWEA fhLKraQzkSt6zydCXHG2kEMgiIAuHPwjl0TBQM63FB+7E/MFgNOdYSF5hneFFtoLLaC/ r7SDaZyLkkg0dwOHu15tEtTKNFktGzcDo70T+cAY93oFOLbxPKSk9URiHQtCfv9PQ14O 2XRrMjZTm3iz5Cl/5G+Osj9cHk6DBCuUxXbVcnq446QnreDjdrbhwug+N7hbPmoGJ2Cb rEug== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k74si1109450pfb.69.2018.01.23.00.13.16; Tue, 23 Jan 2018 00:13:30 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751402AbeAWIMK (ORCPT + 99 others); Tue, 23 Jan 2018 03:12:10 -0500 Received: from szxga06-in.huawei.com ([45.249.212.32]:38668 "EHLO huawei.com" rhost-flags-OK-FAIL-OK-FAIL) by vger.kernel.org with ESMTP id S1750907AbeAWIIq (ORCPT ); Tue, 23 Jan 2018 03:08:46 -0500 Received: from DGGEMS409-HUB.china.huawei.com (unknown [172.30.72.58]) by Forcepoint Email with ESMTP id 1E6EB13C9A7B9; Tue, 23 Jan 2018 16:08:32 +0800 (CST) Received: from huawei.com (10.175.102.37) by DGGEMS409-HUB.china.huawei.com (10.3.19.209) with Microsoft SMTP Server id 14.3.361.1; Tue, 23 Jan 2018 16:08:26 +0800 From: To: CC: , , , , Subject: [PATCH RFC 01/16] prcu: Add PRCU implementation Date: Tue, 23 Jan 2018 15:59:26 +0800 Message-ID: <1516694381-20333-2-git-send-email-lianglihao@huawei.com> X-Mailer: git-send-email 1.7.12.4 In-Reply-To: <1516694381-20333-1-git-send-email-lianglihao@huawei.com> References: <1516694381-20333-1-git-send-email-lianglihao@huawei.com> MIME-Version: 1.0 Content-Type: text/plain X-Originating-IP: [10.175.102.37] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Heng Zhang This RCU implementation (PRCU) is based on a fast consensus protocol published in the following paper: Fast Consensus Using Bounded Staleness for Scalable Read-mostly Synchronization. Haibo Chen, Heng Zhang, Ran Liu, Binyu Zang, and Haibing Guan. IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016. https://dl.acm.org/citation.cfm?id=3024114.3024143 Signed-off-by: Heng Zhang Signed-off-by: Lihao Liang --- include/linux/prcu.h | 37 +++++++++++++++ kernel/rcu/Makefile | 2 +- kernel/rcu/prcu.c | 125 +++++++++++++++++++++++++++++++++++++++++++++++++++ kernel/sched/core.c | 2 + 4 files changed, 165 insertions(+), 1 deletion(-) create mode 100644 include/linux/prcu.h create mode 100644 kernel/rcu/prcu.c diff --git a/include/linux/prcu.h b/include/linux/prcu.h new file mode 100644 index 00000000..653b4633 --- /dev/null +++ b/include/linux/prcu.h @@ -0,0 +1,37 @@ +#ifndef __LINUX_PRCU_H +#define __LINUX_PRCU_H + +#include +#include +#include + +#define CONFIG_PRCU + +struct prcu_local_struct { + unsigned int locked; + unsigned int online; + unsigned long long version; +}; + +struct prcu_struct { + atomic64_t global_version; + atomic_t active_ctr; + struct mutex mtx; + wait_queue_head_t wait_q; +}; + +#ifdef CONFIG_PRCU +void prcu_read_lock(void); +void prcu_read_unlock(void); +void synchronize_prcu(void); +void prcu_note_context_switch(void); + +#else /* #ifdef CONFIG_PRCU */ + +#define prcu_read_lock() do {} while (0) +#define prcu_read_unlock() do {} while (0) +#define synchronize_prcu() do {} while (0) +#define prcu_note_context_switch() do {} while (0) + +#endif /* #ifdef CONFIG_PRCU */ +#endif /* __LINUX_PRCU_H */ diff --git a/kernel/rcu/Makefile b/kernel/rcu/Makefile index 23803c7d..8791419c 100644 --- a/kernel/rcu/Makefile +++ b/kernel/rcu/Makefile @@ -2,7 +2,7 @@ # and is generally not a function of system call inputs. KCOV_INSTRUMENT := n -obj-y += update.o sync.o +obj-y += update.o sync.o prcu.o obj-$(CONFIG_CLASSIC_SRCU) += srcu.o obj-$(CONFIG_TREE_SRCU) += srcutree.o obj-$(CONFIG_TINY_SRCU) += srcutiny.o diff --git a/kernel/rcu/prcu.c b/kernel/rcu/prcu.c new file mode 100644 index 00000000..a00b9420 --- /dev/null +++ b/kernel/rcu/prcu.c @@ -0,0 +1,125 @@ +#include +#include +#include +#include +#include + +#include + +DEFINE_PER_CPU_SHARED_ALIGNED(struct prcu_local_struct, prcu_local); + +struct prcu_struct global_prcu = { + .global_version = ATOMIC64_INIT(0), + .active_ctr = ATOMIC_INIT(0), + .mtx = __MUTEX_INITIALIZER(global_prcu.mtx), + .wait_q = __WAIT_QUEUE_HEAD_INITIALIZER(global_prcu.wait_q) +}; +struct prcu_struct *prcu = &global_prcu; + +static inline void prcu_report(struct prcu_local_struct *local) +{ + unsigned long long global_version; + unsigned long long local_version; + + global_version = atomic64_read(&prcu->global_version); + local_version = local->version; + if (global_version > local_version) + cmpxchg(&local->version, local_version, global_version); +} + +void prcu_read_lock(void) +{ + struct prcu_local_struct *local; + + local = get_cpu_ptr(&prcu_local); + if (!local->online) { + WRITE_ONCE(local->online, 1); + smp_mb(); + } + + local->locked++; + put_cpu_ptr(&prcu_local); +} +EXPORT_SYMBOL(prcu_read_lock); + +void prcu_read_unlock(void) +{ + int locked; + struct prcu_local_struct *local; + + barrier(); + local = get_cpu_ptr(&prcu_local); + locked = local->locked; + if (locked) { + local->locked--; + if (locked == 1) + prcu_report(local); + put_cpu_ptr(&prcu_local); + } else { + put_cpu_ptr(&prcu_local); + if (!atomic_dec_return(&prcu->active_ctr)) + wake_up(&prcu->wait_q); + } +} +EXPORT_SYMBOL(prcu_read_unlock); + +static void prcu_handler(void *info) +{ + struct prcu_local_struct *local; + + local = this_cpu_ptr(&prcu_local); + if (!local->locked) + WRITE_ONCE(local->version, atomic64_read(&prcu->global_version)); +} + +void synchronize_prcu(void) +{ + int cpu; + cpumask_t cpus; + unsigned long long version; + struct prcu_local_struct *local; + + version = atomic64_add_return(1, &prcu->global_version); + mutex_lock(&prcu->mtx); + + local = get_cpu_ptr(&prcu_local); + local->version = version; + put_cpu_ptr(&prcu_local); + + cpumask_clear(&cpus); + for_each_possible_cpu(cpu) { + local = per_cpu_ptr(&prcu_local, cpu); + if (!READ_ONCE(local->online)) + continue; + if (READ_ONCE(local->version) < version) { + smp_call_function_single(cpu, prcu_handler, NULL, 0); + cpumask_set_cpu(cpu, &cpus); + } + } + + for_each_cpu(cpu, &cpus) { + local = per_cpu_ptr(&prcu_local, cpu); + while (READ_ONCE(local->version) < version) + cpu_relax(); + } + + if (atomic_read(&prcu->active_ctr)) + wait_event(prcu->wait_q, !atomic_read(&prcu->active_ctr)); + + mutex_unlock(&prcu->mtx); +} +EXPORT_SYMBOL(synchronize_prcu); + +void prcu_note_context_switch(void) +{ + struct prcu_local_struct *local; + + local = get_cpu_ptr(&prcu_local); + if (local->locked) { + atomic_add(local->locked, &prcu->active_ctr); + local->locked = 0; + } + local->online = 0; + prcu_report(local); + put_cpu_ptr(&prcu_local); +} diff --git a/kernel/sched/core.c b/kernel/sched/core.c index 326d4f88..a308581b 100644 --- a/kernel/sched/core.c +++ b/kernel/sched/core.c @@ -15,6 +15,7 @@ #include #include #include +#include #include #include @@ -3383,6 +3384,7 @@ static void __sched notrace __schedule(bool preempt) local_irq_disable(); rcu_note_context_switch(preempt); + prcu_note_context_switch(); /* * Make sure that signal_pending_state()->signal_pending() below -- 2.14.1.729.g59c0ea183