Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 2F43AC433EF for ; Tue, 23 Nov 2021 10:20:24 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235216AbhKWKXa (ORCPT ); Tue, 23 Nov 2021 05:23:30 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:32978 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231221AbhKWKX3 (ORCPT ); Tue, 23 Nov 2021 05:23:29 -0500 Received: from mail-qt1-x833.google.com (mail-qt1-x833.google.com [IPv6:2607:f8b0:4864:20::833]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C047BC061574 for ; Tue, 23 Nov 2021 02:20:21 -0800 (PST) Received: by mail-qt1-x833.google.com with SMTP id f20so19344693qtb.4 for ; Tue, 23 Nov 2021 02:20:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=F+kxZkQQT7OHNmGMuUCVRF4t6nUAJ3c6sxl3xAFz4pk=; b=lw6p6cfcs8K3sP7S7ZJg8SrjbF0rowhzO3XZEQeqhWqd/rKOUMcyg0A3t/skz1HlPy Ru4dxvCiIHm4M6Y86yjdAz1SdtfZnY4P1xuxvJfZxiWYPgjyKzbXFg6M3ApqSkYG5dBb Zowe6xc7Re7uMOhVnYI8xKdMLNcZguTEy7zCIaW6oGmApqXc9GZGn3T0K4MC51hBfM4Z z9yEbgGBWNlpcMIbPrxAAvg++xp+OcqAoKH4piIIaQP11H32e8+Top4hcqq+bDEPpWXq ZmlcxtW4UkPW5cDU8lI8YBhrs6HGO+9JbKqadRA9ajE/4jtb/ZwJF3MOXlVSsmHQdjbk 2Vtg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=F+kxZkQQT7OHNmGMuUCVRF4t6nUAJ3c6sxl3xAFz4pk=; b=VThiYfplVFykVQJVXOupDeemd9sQ73exHM6zoL//XsODEb5WgQOmLYbVIrVpwWz7yf UDCO7GNafczm3FB1KkYVQyqUqV/zSDjPVwXo/qX0Ru7WE+597gQ5ZufhHPIX/mEvM07S TfOb4xvKiPh0q6mqvRhwVkMukjUEKtvLBboQ+VBxLVlGYK6LqVgHNQE2M0t0kTLlbR6t 8pov5A3mmcvYT5Cxuj2ULoRqzHmDbUI4nlsfAhvHxlV9E5/YNSk5+pu35MgvEiu+XLLZ UC947Vw+wOwcK7RY8j51MBzhKoYaUfYEeN0DccGIyilwDBODhmXzHJgvZBsmSkuFfpBV Y+Xw== X-Gm-Message-State: AOAM532LPYqXMjH8VX63dsPPYO6KQIKl5ou0b8Pu8OeDRGnYN5o+Umpv DCpCY6Tl7AkT3uEJYNC3s2UHKwLnX5j3KFqMxCRngg== X-Google-Smtp-Source: ABdhPJzhfMUonhREpiLejz8RriF63os5QyQ1fmOi2gaXPdkixZcXDAfGNJxBHNs+6l40BwgiRtGAj8srNdIJ53LbUao= X-Received: by 2002:a05:622a:1045:: with SMTP id f5mr4828127qte.319.1637662820622; Tue, 23 Nov 2021 02:20:20 -0800 (PST) MIME-Version: 1.0 References: <20211123051658.3195589-1-pcc@google.com> <20211123051658.3195589-3-pcc@google.com> In-Reply-To: From: Alexander Potapenko Date: Tue, 23 Nov 2021 11:19:44 +0100 Message-ID: Subject: Re: [PATCH v2 2/5] uaccess-buffer: add core code To: Dmitry Vyukov Cc: Peter Collingbourne , Catalin Marinas , Will Deacon , Ingo Molnar , Peter Zijlstra , Juri Lelli , Vincent Guittot , Dietmar Eggemann , Steven Rostedt , Ben Segall , Mel Gorman , Daniel Bristot de Oliveira , Thomas Gleixner , Andy Lutomirski , Kees Cook , Andrew Morton , Masahiro Yamada , Sami Tolvanen , YiFei Zhu , Colin Ian King , Mark Rutland , Frederic Weisbecker , Viresh Kumar , Andrey Konovalov , Gabriel Krisman Bertazi , Chris Hyser , Daniel Vetter , Chris Wilson , Arnd Bergmann , Christian Brauner , "Eric W. Biederman" , Alexey Gladkov , Ran Xiaokai , David Hildenbrand , Xiaofeng Cao , Cyrill Gorcunov , Thomas Cedeno , Marco Elver , linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, Evgenii Stepanov Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Nov 23, 2021 at 10:56 AM Dmitry Vyukov wrote: > > .On Tue, 23 Nov 2021 at 06:17, Peter Collingbourne wrote= : > > > > Add the core code to support uaccess logging. Subsequent patches will > > hook this up to the arch-specific kernel entry and exit code for > > certain architectures. > > I don't see where we block signals when a user writes to the addr. I > expected to see some get_user from the addr somewhere in the signal > handling code. What am I missing? > > > Link: https://linux-review.googlesource.com/id/I6581765646501a5631b281d= 670903945ebadc57d > > Signed-off-by: Peter Collingbourne > > --- > > v2: > > - New interface that avoids multiple syscalls per real syscall and > > is arch-generic > > - Avoid logging uaccesses done by BPF programs > > - Add documentation > > - Split up into multiple patches > > - Various code moves, renames etc as requested by Marco > > > > arch/Kconfig | 5 + > > fs/exec.c | 2 + > > include/linux/instrumented.h | 5 +- > > include/linux/sched.h | 4 + > > include/linux/uaccess-buffer-log-hooks.h | 59 +++++++++++ > > include/linux/uaccess-buffer.h | 79 ++++++++++++++ > > include/uapi/linux/prctl.h | 3 + > > include/uapi/linux/uaccess-buffer.h | 25 +++++ > > kernel/Makefile | 1 + > > kernel/bpf/helpers.c | 6 +- > > kernel/fork.c | 3 + > > kernel/signal.c | 4 +- > > kernel/sys.c | 6 ++ > > kernel/uaccess-buffer.c | 125 +++++++++++++++++++++++ > > 14 files changed, 324 insertions(+), 3 deletions(-) > > create mode 100644 include/linux/uaccess-buffer-log-hooks.h > > create mode 100644 include/linux/uaccess-buffer.h > > create mode 100644 include/uapi/linux/uaccess-buffer.h > > create mode 100644 kernel/uaccess-buffer.c > > > > diff --git a/arch/Kconfig b/arch/Kconfig > > index 26b8ed11639d..6030298a7e9a 100644 > > --- a/arch/Kconfig > > +++ b/arch/Kconfig > > @@ -1302,6 +1302,11 @@ config ARCH_HAS_PARANOID_L1D_FLUSH > > config DYNAMIC_SIGFRAME > > bool > > > > +config HAVE_ARCH_UACCESS_BUFFER > > + bool > > + help > > + Select if the architecture's syscall entry/exit code supports= uaccess buffers. > > + > > source "kernel/gcov/Kconfig" > > > > source "scripts/gcc-plugins/Kconfig" > > diff --git a/fs/exec.c b/fs/exec.c > > index 537d92c41105..5f30314f3ec6 100644 > > --- a/fs/exec.c > > +++ b/fs/exec.c > > @@ -65,6 +65,7 @@ > > #include > > #include > > #include > > +#include > > > > #include > > #include > > @@ -1313,6 +1314,7 @@ int begin_new_exec(struct linux_binprm * bprm) > > me->personality &=3D ~bprm->per_clear; > > > > clear_syscall_work_syscall_user_dispatch(me); > > + uaccess_buffer_set_descriptor_addr_addr(0); > > > > /* > > * We have to apply CLOEXEC before we change whether the proces= s is > > diff --git a/include/linux/instrumented.h b/include/linux/instrumented.= h > > index 42faebbaa202..c96be1695614 100644 > > --- a/include/linux/instrumented.h > > +++ b/include/linux/instrumented.h > > @@ -2,7 +2,7 @@ > > > > /* > > * This header provides generic wrappers for memory access instrumenta= tion that > > - * the compiler cannot emit for: KASAN, KCSAN. > > + * the compiler cannot emit for: KASAN, KCSAN, uaccess buffers. > > */ > > #ifndef _LINUX_INSTRUMENTED_H > > #define _LINUX_INSTRUMENTED_H > > @@ -11,6 +11,7 @@ > > #include > > #include > > #include > > +#include > > > > /** > > * instrument_read - instrument regular read access > > @@ -117,6 +118,7 @@ instrument_copy_to_user(void __user *to, const void= *from, unsigned long n) > > { > > kasan_check_read(from, n); > > kcsan_check_read(from, n); > > + uaccess_buffer_log_write(to, n); > > } > > > > /** > > @@ -134,6 +136,7 @@ instrument_copy_from_user(const void *to, const voi= d __user *from, unsigned long > > { > > kasan_check_write(to, n); > > kcsan_check_write(to, n); > > + uaccess_buffer_log_read(from, n); > > } > > > > #endif /* _LINUX_INSTRUMENTED_H */ > > diff --git a/include/linux/sched.h b/include/linux/sched.h > > index 78c351e35fec..1f978deaa3f8 100644 > > --- a/include/linux/sched.h > > +++ b/include/linux/sched.h > > @@ -1484,6 +1484,10 @@ struct task_struct { > > struct callback_head l1d_flush_kill; > > #endif > > > > +#ifdef CONFIG_HAVE_ARCH_UACCESS_BUFFER > > + struct uaccess_buffer_info uaccess_buffer; > > +#endif > > Shouldn't this be controlled by an additional config that a user can > enable/disable? > If I am reading this correctly, the current implementation forces > uaccess logging for all arches that support it. Some embed kernels may > not want this. > > > > /* > > * New fields for task_struct should be added above here, so th= at > > * they are included in the randomized portion of task_struct. > > diff --git a/include/linux/uaccess-buffer-log-hooks.h b/include/linux/u= access-buffer-log-hooks.h > > new file mode 100644 > > index 000000000000..bddc84ddce32 > > --- /dev/null > > +++ b/include/linux/uaccess-buffer-log-hooks.h > > @@ -0,0 +1,59 @@ > > +/* SPDX-License-Identifier: GPL-2.0 */ > > +#ifndef _LINUX_UACCESS_BUFFER_LOG_HOOKS_H > > +#define _LINUX_UACCESS_BUFFER_LOG_HOOKS_H > > + > > +#ifdef CONFIG_HAVE_ARCH_UACCESS_BUFFER > > + > > +struct uaccess_buffer_info { > > + /* > > + * The pointer to pointer to struct uaccess_descriptor. This is= the > > + * value controlled by prctl(PR_SET_UACCESS_DESCRIPTOR_ADDR_ADD= R). > > + */ > > + struct uaccess_descriptor __user *__user *desc_ptr_ptr; > > + > > + /* > > + * The pointer to struct uaccess_descriptor read at syscall ent= ry time. > > + */ > > + struct uaccess_descriptor __user *desc_ptr; > > + > > + /* > > + * A pointer to the kernel's temporary copy of the uaccess log = for the > > + * current syscall. We log to a kernel buffer in order to avoid= leaking > > + * timing information to userspace. > > + */ > > + struct uaccess_buffer_entry *kbegin; > > + > > + /* > > + * The position of the next uaccess buffer entry for the curren= t > > + * syscall. > > + */ > > + struct uaccess_buffer_entry *kcur; > > + > > + /* > > + * A pointer to the end of the kernel's uaccess log. > > + */ > > + struct uaccess_buffer_entry *kend; > > + > > + /* > > + * The pointer to the userspace uaccess log, as read from the > > + * struct uaccess_descriptor. > > + */ > > + struct uaccess_buffer_entry __user *ubegin; > > +}; > > + > > +void uaccess_buffer_log_read(const void __user *from, unsigned long n)= ; > > +void uaccess_buffer_log_write(void __user *to, unsigned long n); > > + > > +#else > > + > > +static inline void uaccess_buffer_log_read(const void __user *from, > > + unsigned long n) > > +{ > > +} > > +static inline void uaccess_buffer_log_write(void __user *to, unsigned = long n) > > +{ > > +} > > + > > +#endif > > + > > +#endif /* _LINUX_UACCESS_BUFFER_LOG_HOOKS_H */ > > diff --git a/include/linux/uaccess-buffer.h b/include/linux/uaccess-buf= fer.h > > new file mode 100644 > > index 000000000000..14261368d3a9 > > --- /dev/null > > +++ b/include/linux/uaccess-buffer.h > > @@ -0,0 +1,79 @@ > > +/* SPDX-License-Identifier: GPL-2.0 */ > > +#ifndef _LINUX_UACCESS_BUFFER_H > > +#define _LINUX_UACCESS_BUFFER_H > > + > > +#include > > +#include > > + > > +#include > > + > > +#ifdef CONFIG_HAVE_ARCH_UACCESS_BUFFER > > + > > +static inline bool uaccess_buffer_maybe_blocked(struct task_struct *ts= k) > > +{ > > + return tsk->uaccess_buffer.desc_ptr_ptr; > > +} > > + > > +void __uaccess_buffer_syscall_entry(void); > > +static inline void uaccess_buffer_syscall_entry(void) > > +{ > > + if (uaccess_buffer_maybe_blocked(current)) > > + __uaccess_buffer_syscall_entry(); > > +} > > + > > +void __uaccess_buffer_syscall_exit(void); > > +static inline void uaccess_buffer_syscall_exit(void) > > +{ > > + if (uaccess_buffer_maybe_blocked(current)) > > + __uaccess_buffer_syscall_exit(); > > +} > > + > > +bool __uaccess_buffer_pre_exit_loop(void); > > +static inline bool uaccess_buffer_pre_exit_loop(void) > > +{ > > + if (!uaccess_buffer_maybe_blocked(current)) > > + return false; > > + return __uaccess_buffer_pre_exit_loop(); > > +} > > + > > +void __uaccess_buffer_post_exit_loop(void); > > +static inline void uaccess_buffer_post_exit_loop(bool pending) > > +{ > > + if (pending) > > + __uaccess_buffer_post_exit_loop(); > > +} > > + > > +void uaccess_buffer_cancel_log(struct task_struct *tsk); > > +int uaccess_buffer_set_descriptor_addr_addr(unsigned long addr); > > + > > +#else > > + > > +static inline bool uaccess_buffer_maybe_blocked(struct task_struct *ts= k) > > +{ > > + return false; > > +} > > + > > +static inline void uaccess_buffer_syscall_entry(void) > > +{ > > +} > > +static inline void uaccess_buffer_syscall_exit(void) > > +{ > > +} > > +static inline bool uaccess_buffer_pre_exit_loop(void) > > +{ > > + return false; > > +} > > +static inline void uaccess_buffer_post_exit_loop(bool pending) > > +{ > > +} > > +static inline void uaccess_buffer_cancel_log(struct task_struct *tsk) > > +{ > > +} > > + > > +static inline int uaccess_buffer_set_descriptor_addr_addr(unsigned lon= g addr) > > +{ > > + return -EINVAL; > > +} > > +#endif > > + > > +#endif /* _LINUX_UACCESS_BUFFER_H */ > > diff --git a/include/uapi/linux/prctl.h b/include/uapi/linux/prctl.h > > index bb73e9a0b24f..74b37469c7b3 100644 > > --- a/include/uapi/linux/prctl.h > > +++ b/include/uapi/linux/prctl.h > > @@ -272,4 +272,7 @@ struct prctl_mm_map { > > # define PR_SCHED_CORE_SCOPE_THREAD_GROUP 1 > > # define PR_SCHED_CORE_SCOPE_PROCESS_GROUP 2 > > > > +/* Configure uaccess logging feature */ > > +#define PR_SET_UACCESS_DESCRIPTOR_ADDR_ADDR 63 > > + > > #endif /* _LINUX_PRCTL_H */ > > diff --git a/include/uapi/linux/uaccess-buffer.h b/include/uapi/linux/u= access-buffer.h > > new file mode 100644 > > index 000000000000..619b17dc25c4 > > --- /dev/null > > +++ b/include/uapi/linux/uaccess-buffer.h > > @@ -0,0 +1,25 @@ > > +/* SPDX-License-Identifier: GPL-2.0 WITH Linux-syscall-note */ > > +#ifndef _UAPI_LINUX_UACCESS_BUFFER_H > > +#define _UAPI_LINUX_UACCESS_BUFFER_H > > + > > +/* Location of the uaccess log. */ > > +struct uaccess_descriptor { > > + /* Address of the uaccess_buffer_entry array. */ > > + __u64 addr; > > + /* Size of the uaccess_buffer_entry array in number of elements= . */ > > + __u64 size; > > +}; > > + > > +/* Format of the entries in the uaccess log. */ > > +struct uaccess_buffer_entry { > > + /* Address being accessed. */ > > + __u64 addr; > > + /* Number of bytes that were accessed. */ > > + __u64 size; > > + /* UACCESS_BUFFER_* flags. */ > > + __u64 flags; > > +}; > > + > > +#define UACCESS_BUFFER_FLAG_WRITE 1 /* access was a write */ > > + > > +#endif /* _UAPI_LINUX_UACCESS_BUFFER_H */ > > diff --git a/kernel/Makefile b/kernel/Makefile > > index 186c49582f45..d4d9be5146c3 100644 > > --- a/kernel/Makefile > > +++ b/kernel/Makefile > > @@ -114,6 +114,7 @@ obj-$(CONFIG_KCSAN) +=3D kcsan/ > > obj-$(CONFIG_SHADOW_CALL_STACK) +=3D scs.o > > obj-$(CONFIG_HAVE_STATIC_CALL_INLINE) +=3D static_call.o > > obj-$(CONFIG_CFI_CLANG) +=3D cfi.o > > +obj-$(CONFIG_HAVE_ARCH_UACCESS_BUFFER) +=3D uaccess-buffer.o > > > > obj-$(CONFIG_PERF_EVENTS) +=3D events/ > > > > diff --git a/kernel/bpf/helpers.c b/kernel/bpf/helpers.c > > index 649f07623df6..167b50177066 100644 > > --- a/kernel/bpf/helpers.c > > +++ b/kernel/bpf/helpers.c > > @@ -637,7 +637,11 @@ const struct bpf_func_proto bpf_event_output_data_= proto =3D { > > BPF_CALL_3(bpf_copy_from_user, void *, dst, u32, size, > > const void __user *, user_ptr) > > { > > - int ret =3D copy_from_user(dst, user_ptr, size); > > + /* > > + * Avoid copy_from_user() here as it may leak information about= the BPF > > + * program to userspace via the uaccess buffer. > > + */ > > + int ret =3D raw_copy_from_user(dst, user_ptr, size); > > Here I am more concerned about KASAN/KMSAN checks. > What exactly is the attack vector here? Are these accesses secret? If there are concerns about leaking information in this particular case, any other call to copy_from_user() added in the future will be prone to the same issues. So if uaccess logging is meant for production use, it should be possible to lock the feature down from unwanted users. > Can't the same info be obtained using userfaultfd/unmapping memory? > > raw_copy_from_user also skips access_ok, is it ok? > > > > if (unlikely(ret)) { > > memset(dst, 0, size); > > diff --git a/kernel/fork.c b/kernel/fork.c > > index 3244cc56b697..c7abe7e7c7cd 100644 > > --- a/kernel/fork.c > > +++ b/kernel/fork.c > > @@ -96,6 +96,7 @@ > > #include > > #include > > #include > > +#include > > > > #include > > #include > > @@ -890,6 +891,8 @@ static struct task_struct *dup_task_struct(struct t= ask_struct *orig, int node) > > if (memcg_charge_kernel_stack(tsk)) > > goto free_stack; > > > > + uaccess_buffer_cancel_log(tsk); > > Why do we need this? > tsk is a newly allocated task_struct. If I am not mistaken, it's not > zero initialized, so are we kfree'ing garbage? > But we may need to do something with tasks after arch_dup_task_struct. > > > stack_vm_area =3D task_stack_vm_area(tsk); > > > > err =3D arch_dup_task_struct(tsk, orig); > > diff --git a/kernel/signal.c b/kernel/signal.c > > index a629b11bf3e0..69bf21518bd0 100644 > > --- a/kernel/signal.c > > +++ b/kernel/signal.c > > @@ -45,6 +45,7 @@ > > #include > > #include > > #include > > +#include > > > > #define CREATE_TRACE_POINTS > > #include > > @@ -1031,7 +1032,8 @@ static void complete_signal(int sig, struct task_= struct *p, enum pid_type type) > > if (sig_fatal(p, sig) && > > !(signal->flags & SIGNAL_GROUP_EXIT) && > > !sigismember(&t->real_blocked, sig) && > > - (sig =3D=3D SIGKILL || !p->ptrace)) { > > + (sig =3D=3D SIGKILL || > > + !(p->ptrace || uaccess_buffer_maybe_blocked(p)))) { > > Why do we need this change? > > > /* > > * This signal will be fatal to the whole group. > > */ > > diff --git a/kernel/sys.c b/kernel/sys.c > > index 8fdac0d90504..c71a9a9c0f68 100644 > > --- a/kernel/sys.c > > +++ b/kernel/sys.c > > @@ -42,6 +42,7 @@ > > #include > > #include > > #include > > +#include > > > > #include > > #include > > @@ -2530,6 +2531,11 @@ SYSCALL_DEFINE5(prctl, int, option, unsigned lon= g, arg2, unsigned long, arg3, > > error =3D sched_core_share_pid(arg2, arg3, arg4, arg5); > > break; > > #endif > > + case PR_SET_UACCESS_DESCRIPTOR_ADDR_ADDR: > > + if (arg3 || arg4 || arg5) > > + return -EINVAL; > > + error =3D uaccess_buffer_set_descriptor_addr_addr(arg2)= ; > > + break; > > default: > > error =3D -EINVAL; > > break; > > diff --git a/kernel/uaccess-buffer.c b/kernel/uaccess-buffer.c > > new file mode 100644 > > index 000000000000..e1c6d6ab9af8 > > --- /dev/null > > +++ b/kernel/uaccess-buffer.c > > @@ -0,0 +1,125 @@ > > +// SPDX-License-Identifier: GPL-2.0 > > +/* > > + * Support for uaccess logging via uaccess buffers. > > + * > > + * Copyright (C) 2021, Google LLC. > > + */ > > + > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > + > > +static void uaccess_buffer_log(unsigned long addr, unsigned long size, > > + unsigned long flags) > > +{ > > + struct uaccess_buffer_info *buf =3D ¤t->uaccess_buffer; > > + struct uaccess_buffer_entry *entry =3D buf->kcur; > > + > > + if (!entry || unlikely(uaccess_kernel())) > > + return; > > + entry->addr =3D addr; > > + entry->size =3D size; > > + entry->flags =3D flags; > > + > > + ++buf->kcur; > > + if (buf->kcur =3D=3D buf->kend) > > + buf->kcur =3D 0; > > =3D NULL; > > > +} > > + > > +void uaccess_buffer_log_read(const void __user *from, unsigned long n) > > +{ > > + uaccess_buffer_log((unsigned long)from, n, 0); > > +} > > +EXPORT_SYMBOL(uaccess_buffer_log_read); > > + > > +void uaccess_buffer_log_write(void __user *to, unsigned long n) > > +{ > > + uaccess_buffer_log((unsigned long)to, n, UACCESS_BUFFER_FLAG_WR= ITE); > > +} > > +EXPORT_SYMBOL(uaccess_buffer_log_write); > > + > > +int uaccess_buffer_set_descriptor_addr_addr(unsigned long addr) > > +{ > > + current->uaccess_buffer.desc_ptr_ptr =3D > > + (struct uaccess_descriptor __user * __user *)addr; > > + uaccess_buffer_cancel_log(current); > > Is this necessary? It looks more reasonable and useful to not call > cancel. In most cases the user won't setup it twice/change. But if the > user anyhow asked to trace the prctl, why not trace it? > > > + return 0; > > +} > > + > > +bool __uaccess_buffer_pre_exit_loop(void) > > +{ > > + struct uaccess_buffer_info *buf =3D ¤t->uaccess_buffer; > > + struct uaccess_descriptor __user *desc_ptr; > > + sigset_t tmp_mask; > > + > > + if (get_user(desc_ptr, buf->desc_ptr_ptr) || !desc_ptr) > > + return false; > > + > > + current->real_blocked =3D current->blocked; > > + sigfillset(&tmp_mask); > > + set_current_blocked(&tmp_mask); > > + return true; > > +} > > + > > +void __uaccess_buffer_post_exit_loop(void) > > +{ > > + spin_lock_irq(¤t->sighand->siglock); > > + current->blocked =3D current->real_blocked; > > + recalc_sigpending(); > > + spin_unlock_irq(¤t->sighand->siglock); > > +} > > + > ;> +void uaccess_buffer_cancel_log(struct task_struct *tsk) > > +{ > > + struct uaccess_buffer_info *buf =3D &tsk->uaccess_buffer; > > + > > + if (buf->kcur) { > > uaccess_buffer_log sets kcur to NULL on overflow and we call this > function from a middle of fork, it looks strange that kfree'ing > something depends on the previous buffer overflow. Should we check > kbegin here? > > > + buf->kcur =3D 0; > > =3D NULL > I would also set kend to NULL to not leave a dangling pointer. > > > + kfree(buf->kbegin); > > + } > > +} > > + > > +void __uaccess_buffer_syscall_entry(void) > > +{ > > + struct uaccess_buffer_info *buf =3D ¤t->uaccess_buffer; > > + struct uaccess_descriptor desc; > > + > > + if (get_user(buf->desc_ptr, buf->desc_ptr_ptr) || !buf->desc_pt= r || > > + put_user(0, buf->desc_ptr_ptr) || > > + copy_from_user(&desc, buf->desc_ptr, sizeof(desc))) > > + return; > > + > > + if (desc.size > 1024) > > + desc.size =3D 1024; > > + > > + buf->kbegin =3D kmalloc_array( > > + desc.size, sizeof(struct uaccess_buffer_entry), GFP_KER= NEL); > > Is not handling error intentional here? > Maybe it's better to check the error just to make the code more > explicit (and maybe prevent some future bugs). > > Actually an interesting attack vector. If you can make this kmalloc > fail, you can prevent sanitizers from detecting any of the bad > accesses :) > > Does it make sense to flag the error somewhere in the desc?... or I am > thinking if we should pre-allocate the buffer, if we start tracing a > task, we will trace lots of syscalls, so avoiding kmalloc/kfree per > syscall can make sense. What do you think? > > > + buf->kcur =3D buf->kbegin; > > + buf->kend =3D buf->kbegin + desc.size; > > + buf->ubegin =3D (struct uaccess_buffer_entry __user *)desc.addr= ; > > +} > > + > > +void __uaccess_buffer_syscall_exit(void) > > +{ > > + struct uaccess_buffer_info *buf =3D ¤t->uaccess_buffer; > > + u64 num_entries =3D buf->kcur - buf->kbegin; > > + struct uaccess_descriptor desc; > > + > > + if (!buf->kcur) > > uaccess_buffer_log sets kcur to NULL on overflow. I think we need to > check kbegin here. > > > > + return; > > + > > + desc.addr =3D (u64)(buf->ubegin + num_entries); > > + desc.size =3D buf->kend - buf->kcur; > > Is the user expected to use size in any of reasonable scenarios? > We cap size at 1024 at entry, so the size can be truncated here. > > > + buf->kcur =3D 0; > > =3D NULL > > > + if (copy_to_user(buf->ubegin, buf->kbegin, > > + num_entries * sizeof(struct uaccess_buffer_ent= ry)) =3D=3D 0) > > + (void)copy_to_user(buf->desc_ptr, &desc, sizeof(desc)); > > + > > + kfree(buf->kbegin); > > What if we enter exit/exit_group with logging enabled, won't we leak the = buffer? > > > +} > > -- > > 2.34.0.rc2.393.gf8c9666880-goog > > --=20 Alexander Potapenko Software Engineer Google Germany GmbH Erika-Mann-Stra=C3=9Fe, 33 80636 M=C3=BCnchen Gesch=C3=A4ftsf=C3=BChrer: Paul Manicle, Halimah DeLaine Prado Registergericht und -nummer: Hamburg, HRB 86891 Sitz der Gesellschaft: Hamburg