From: Wedson Almeida Filho
Date: Wed, 28 Dec 2022 10:14:15 +0000
Subject: Re: [PATCH 1/7] rust: sync: add `Arc` for ref-counted allocations
To: Alice Ryhl
Cc: rust-for-linux@vger.kernel.org, Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, linux-kernel@vger.kernel.org, Will Deacon, Peter Zijlstra, Mark Rutland
References: <20221228060346.352362-1-wedsonaf@gmail.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, 28 Dec 2022 at 10:02, Alice Ryhl wrote:
>
> On 12/28/22 07:03, Wedson Almeida Filho wrote:
> > This is a basic implementation of `Arc` backed by C's `refcount_t`. It
> > allows Rust code to idiomatically allocate memory that is ref-counted.
> >
> > Cc: Will Deacon
> > Cc: Peter Zijlstra
> > Cc: Boqun Feng
> > Cc: Mark Rutland
> > Signed-off-by: Wedson Almeida Filho
>
> Reviewed-by: Alice Ryhl

Thanks for reviewing!

> Instead of Box::leak, it would be more idiomatic to use Box::into_raw,
> but both approaches will work.

`Box::into_raw` returns a `*mut T`, whose conversion to `NonNull` is
fallible (because it could be null). `Box::leak`, OTOH, returns an
`&mut T`, which cannot be null, so it can be converted to `NonNull`
infallibly.

> Regards,
> Alice Ryhl
>
> > ---
> >  rust/bindings/bindings_helper.h |   1 +
> >  rust/bindings/lib.rs            |   1 +
> >  rust/helpers.c                  |  19 ++++
> >  rust/kernel/lib.rs              |   1 +
> >  rust/kernel/sync.rs             |  10 ++
> >  rust/kernel/sync/arc.rs         | 157 ++++++++++++++++++++++++++++++++
> >  6 files changed, 189 insertions(+)
> >  create mode 100644 rust/kernel/sync.rs
> >  create mode 100644 rust/kernel/sync/arc.rs
> >
> > diff --git a/rust/bindings/bindings_helper.h b/rust/bindings/bindings_helper.h
> > index c48bc284214a..75d85bd6c592 100644
> > --- a/rust/bindings/bindings_helper.h
> > +++ b/rust/bindings/bindings_helper.h
> > @@ -7,6 +7,7 @@
> >   */
> >
> >  #include <linux/slab.h>
> > +#include <linux/refcount.h>
> >
> >  /* `bindgen` gets confused at certain things. */
> >  const gfp_t BINDINGS_GFP_KERNEL = GFP_KERNEL;
> > diff --git a/rust/bindings/lib.rs b/rust/bindings/lib.rs
> > index 6c50ee62c56b..7b246454e009 100644
> > --- a/rust/bindings/lib.rs
> > +++ b/rust/bindings/lib.rs
> > @@ -41,6 +41,7 @@ mod bindings_raw {
> >  #[allow(dead_code)]
> >  mod bindings_helper {
> >      // Import the generated bindings for types.
> > +    use super::bindings_raw::*;
> >      include!(concat!(
> >          env!("OBJTREE"),
> >          "/rust/bindings/bindings_helpers_generated.rs"
> > diff --git a/rust/helpers.c b/rust/helpers.c
> > index b4f15eee2ffd..09a4d93f9d62 100644
> > --- a/rust/helpers.c
> > +++ b/rust/helpers.c
> > @@ -20,6 +20,7 @@
> >
> >  #include <linux/bug.h>
> >  #include <linux/build_bug.h>
> > +#include <linux/refcount.h>
> >
> >  __noreturn void rust_helper_BUG(void)
> >  {
> > @@ -27,6 +28,24 @@ __noreturn void rust_helper_BUG(void)
> >  }
> >  EXPORT_SYMBOL_GPL(rust_helper_BUG);
> >
> > +refcount_t rust_helper_REFCOUNT_INIT(int n)
> > +{
> > +	return (refcount_t)REFCOUNT_INIT(n);
> > +}
> > +EXPORT_SYMBOL_GPL(rust_helper_REFCOUNT_INIT);
> > +
> > +void rust_helper_refcount_inc(refcount_t *r)
> > +{
> > +	refcount_inc(r);
> > +}
> > +EXPORT_SYMBOL_GPL(rust_helper_refcount_inc);
> > +
> > +bool rust_helper_refcount_dec_and_test(refcount_t *r)
> > +{
> > +	return refcount_dec_and_test(r);
> > +}
> > +EXPORT_SYMBOL_GPL(rust_helper_refcount_dec_and_test);
> > +
> >  /*
> >   * We use `bindgen`'s `--size_t-is-usize` option to bind the C `size_t` type
> >   * as the Rust `usize` type, so we can use it in contexts where Rust
> > diff --git a/rust/kernel/lib.rs b/rust/kernel/lib.rs
> > index 53040fa9e897..ace064a3702a 100644
> > --- a/rust/kernel/lib.rs
> > +++ b/rust/kernel/lib.rs
> > @@ -31,6 +31,7 @@ mod static_assert;
> >  #[doc(hidden)]
> >  pub mod std_vendor;
> >  pub mod str;
> > +pub mod sync;
> >  pub mod types;
> >
> >  #[doc(hidden)]
> > diff --git a/rust/kernel/sync.rs b/rust/kernel/sync.rs
> > new file mode 100644
> > index 000000000000..39b379dd548f
> > --- /dev/null
> > +++ b/rust/kernel/sync.rs
> > @@ -0,0 +1,10 @@
> > +// SPDX-License-Identifier: GPL-2.0
> > +
> > +//! Synchronisation primitives.
> > +//!
> > +//! This module contains the kernel APIs related to synchronisation that have been ported or
> > +//! wrapped for usage by Rust code in the kernel.
> > +
> > +mod arc;
> > +
> > +pub use arc::Arc;
> > diff --git a/rust/kernel/sync/arc.rs b/rust/kernel/sync/arc.rs
> > new file mode 100644
> > index 000000000000..22290eb5ab9b
> > --- /dev/null
> > +++ b/rust/kernel/sync/arc.rs
> > @@ -0,0 +1,157 @@
> > +// SPDX-License-Identifier: GPL-2.0
> > +
> > +//! A reference-counted pointer.
> > +//!
> > +//! This module implements a way for users to create reference-counted objects and pointers to
> > +//! them. Such a pointer automatically increments and decrements the count, and drops the
> > +//! underlying object when it reaches zero. It is also safe to use concurrently from multiple
> > +//! threads.
> > +//!
> > +//! It is different from the standard library's [`Arc`] in a few ways:
> > +//! 1. It is backed by the kernel's `refcount_t` type.
> > +//! 2. It does not support weak references, which allows it to be half the size.
> > +//! 3. It saturates the reference count instead of aborting when it goes over a threshold.
> > +//! 4. It does not provide a `get_mut` method, so the ref counted object is pinned.
> > +//!
> > +//! [`Arc`]: https://doc.rust-lang.org/std/sync/struct.Arc.html
> > +
> > +use crate::{bindings, error::Result, types::Opaque};
> > +use alloc::boxed::Box;
> > +use core::{marker::PhantomData, ops::Deref, ptr::NonNull};
> > +
> > +/// A reference-counted pointer to an instance of `T`.
> > +///
> > +/// The reference count is incremented when new instances of [`Arc`] are created, and decremented
> > +/// when they are dropped. When the count reaches zero, the underlying `T` is also dropped.
> > +///
> > +/// # Invariants
> > +///
> > +/// The reference count on an instance of [`Arc`] is always non-zero.
> > +/// The object pointed to by [`Arc`] is always pinned.
> > +///
> > +/// # Examples
> > +///
> > +/// ```
> > +/// use kernel::sync::Arc;
> > +///
> > +/// struct Example {
> > +///     a: u32,
> > +///     b: u32,
> > +/// }
> > +///
> > +/// // Create a ref-counted instance of `Example`.
> > +/// let obj = Arc::try_new(Example { a: 10, b: 20 })?;
> > +///
> > +/// // Get a new pointer to `obj` and increment the refcount.
> > +/// let cloned = obj.clone();
> > +///
> > +/// // Assert that both `obj` and `cloned` point to the same underlying object.
> > +/// assert!(core::ptr::eq(&*obj, &*cloned));
> > +///
> > +/// // Destroy `obj` and decrement its refcount.
> > +/// drop(obj);
> > +///
> > +/// // Check that the values are still accessible through `cloned`.
> > +/// assert_eq!(cloned.a, 10);
> > +/// assert_eq!(cloned.b, 20);
> > +///
> > +/// // The refcount drops to zero when `cloned` goes out of scope, and the memory is freed.
> > +/// ```
> > +pub struct Arc<T: ?Sized> {
> > +    ptr: NonNull<ArcInner<T>>,
> > +    _p: PhantomData<ArcInner<T>>,
> > +}
> > +
> > +#[repr(C)]
> > +struct ArcInner<T: ?Sized> {
> > +    refcount: Opaque<bindings::refcount_t>,
> > +    data: T,
> > +}
> > +
> > +// SAFETY: It is safe to send `Arc<T>` to another thread when the underlying `T` is `Sync` because
> > +// it effectively means sharing `&T` (which is safe because `T` is `Sync`); additionally, it needs
> > +// `T` to be `Send` because any thread that has an `Arc<T>` may ultimately access `T` directly, for
> > +// example, when the reference count reaches zero and `T` is dropped.
> > +unsafe impl<T: ?Sized + Sync + Send> Send for Arc<T> {}
> > +
> > +// SAFETY: It is safe to send `&Arc<T>` to another thread when the underlying `T` is `Sync` for the
> > +// same reason as above. `T` needs to be `Send` as well because a thread can clone an `&Arc<T>`
> > +// into an `Arc<T>`, which may lead to `T` being accessed by the same reasoning as above.
> > +unsafe impl<T: ?Sized + Sync + Send> Sync for Arc<T> {}
> > +
> > +impl<T> Arc<T> {
> > +    /// Constructs a new reference counted instance of `T`.
> > +    pub fn try_new(contents: T) -> Result<Self> {
> > +        // INVARIANT: The refcount is initialised to a non-zero value.
> > +        let value = ArcInner {
> > +            // SAFETY: There are no safety requirements for this FFI call.
> > +            refcount: Opaque::new(unsafe { bindings::REFCOUNT_INIT(1) }),
> > +            data: contents,
> > +        };
> > +
> > +        let inner = Box::try_new(value)?;
> > +
> > +        // SAFETY: We just created `inner` with a reference count of 1, which is owned by the new
> > +        // `Arc` object.
> > +        Ok(unsafe { Self::from_inner(Box::leak(inner).into()) })
> > +    }
> > +}
> > +
> > +impl<T: ?Sized> Arc<T> {
> > +    /// Constructs a new [`Arc`] from an existing [`ArcInner`].
> > +    ///
> > +    /// # Safety
> > +    ///
> > +    /// The caller must ensure that `inner` points to a valid location and has a non-zero reference
> > +    /// count, one of which will be owned by the new [`Arc`] instance.
> > +    unsafe fn from_inner(inner: NonNull<ArcInner<T>>) -> Self {
> > +        // INVARIANT: By the safety requirements, the invariants hold.
> > +        Arc {
> > +            ptr: inner,
> > +            _p: PhantomData,
> > +        }
> > +    }
> > +}
> > +
> > +impl<T: ?Sized> Deref for Arc<T> {
> > +    type Target = T;
> > +
> > +    fn deref(&self) -> &Self::Target {
> > +        // SAFETY: By the type invariant, there is necessarily a reference to the object, so it is
> > +        // safe to dereference it.
> > +        unsafe { &self.ptr.as_ref().data }
> > +    }
> > +}
> > +
> > +impl<T: ?Sized> Clone for Arc<T> {
> > +    fn clone(&self) -> Self {
> > +        // INVARIANT: C `refcount_inc` saturates the refcount, so it cannot overflow to zero.
> > +        // SAFETY: By the type invariant, there is necessarily a reference to the object, so it is
> > +        // safe to increment the refcount.
> > +        unsafe { bindings::refcount_inc(self.ptr.as_ref().refcount.get()) };
> > +
> > +        // SAFETY: We just incremented the refcount. This increment is now owned by the new `Arc`.
> > +        unsafe { Self::from_inner(self.ptr) }
> > +    }
> > +}
> > +
> > +impl<T: ?Sized> Drop for Arc<T> {
> > +    fn drop(&mut self) {
> > +        // SAFETY: By the type invariant, there is necessarily a reference to the object. We cannot
> > +        // touch `refcount` after it's decremented to a non-zero value because another thread/CPU
> > +        // may concurrently decrement it to zero and free it. It is ok to have a raw pointer to
> > +        // freed/invalid memory as long as it is never dereferenced.
> > +        let refcount = unsafe { self.ptr.as_ref() }.refcount.get();
> > +
> > +        // INVARIANT: If the refcount reaches zero, there are no other instances of `Arc`, and
> > +        // this instance is being dropped, so the broken invariant is not observable.
> > +        // SAFETY: Also by the type invariant, we are allowed to decrement the refcount.
> > +        let is_zero = unsafe { bindings::refcount_dec_and_test(refcount) };
> > +        if is_zero {
> > +            // The count reached zero, we must free the memory.
> > +            //
> > +            // SAFETY: The pointer was initialised from the result of `Box::leak`.
> > +            unsafe { Box::from_raw(self.ptr.as_ptr()) };
> > +        }
> > +    }
> > +}
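
For anyone following the `Box::leak` vs `Box::into_raw` point above, here is a rough userspace sketch of the difference. It assumes `std` rather than the kernel's `alloc`/`core` environment, and `via_leak` and `via_into_raw` are hypothetical helper names used only for illustration; they are not part of the patch.

```rust
use std::ptr::NonNull;

/// Hypothetical helper: goes through `Box::leak`, which yields `&mut T`.
/// A reference is never null, so `NonNull::from` cannot fail.
fn via_leak<T>(b: Box<T>) -> NonNull<T> {
    NonNull::from(Box::leak(b))
}

/// Hypothetical helper: goes through `Box::into_raw`, which yields `*mut T`.
/// The type system does not know the raw pointer is non-null, so the safe
/// conversion `NonNull::new` returns an `Option` that has to be handled.
fn via_into_raw<T>(b: Box<T>) -> Option<NonNull<T>> {
    NonNull::new(Box::into_raw(b))
}

fn main() {
    let p = via_leak(Box::new(10u32));
    let q = via_into_raw(Box::new(20u32)).expect("a Box pointer is never null");

    // SAFETY: both pointers come from live `Box` allocations that have not
    // been freed yet, and nothing else accesses them.
    unsafe {
        assert_eq!(*p.as_ref(), 10);
        assert_eq!(*q.as_ref(), 20);

        // Rebuild the boxes so the allocations are freed, mirroring what
        // `Drop for Arc` does with `Box::from_raw`.
        drop(Box::from_raw(p.as_ptr()));
        drop(Box::from_raw(q.as_ptr()));
    }
}
```

With `Box::into_raw` the caller has to either handle the `Option` or reach for `NonNull::new_unchecked` in an extra `unsafe` block, which is why the `Box::leak(inner).into()` form is the shorter one in `try_new`.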