Received: by 2002:a05:6359:c8b:b0:c7:702f:21d4 with SMTP id go11csp2696371rwb; Sat, 24 Sep 2022 13:15:11 -0700 (PDT) X-Google-Smtp-Source: AMsMyM6191bfEGl5hgXAXF4ztXCXbiaNOLvVMy4vUP8KCpwSwMXP0VcOVuE+K+L1uG+YgDZtvhgn X-Received: by 2002:a17:902:d2c7:b0:176:c8a4:2f2 with SMTP id n7-20020a170902d2c700b00176c8a402f2mr14791055plc.119.1664050511235; Sat, 24 Sep 2022 13:15:11 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1664050511; cv=none; d=google.com; s=arc-20160816; b=PZxMN1/e2xA/ZrYkG2RPq3bjIyYY/GN/AMLk6AIPeOjCmdJAiJp5g0LvCOhSP/ofpl q05gxdcgR32rwoV0mz9D4LW5cbpR+qK8YS8Hp1wrZqjxGnUgnXr+vuF9Hc9ySsezEx8G kazo/23TZPwFgU/K8bypKot80kyAA7uWXL+8u+tMUa7exHylEcMPSnMwsR1fJpGv/P17 drEng/f5XG3HFZ2f9IeFGoNG6o4rS1Jmv2mEoAg4f6hnbmd1hMYoPJ9NBULn1y0DRYpP x5pBec7aFgDc/cw4rX+Z00TtfBmEaUMV5E71s77q48ziC6va9ogs0JkBgTqkabAI8ldN 1AtA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date; bh=SDt1bwjqAson8N0b8VnZTshuG5Hxr1Y8QzuC2GBoycg=; b=ulIm+CdOioa98zzUdYJ4jftVp3kglqXdgzb19kmbADANkUzrtg2QcehPnA9kcpjSuT ACuwQk0SpvvgksJvYKnGRINdil1JZXsGHQYfVdqzBtMhKGOHCAGX/6fA9kFYWXMT+zu5 jkcHuiG+1j7qMT05FazcKK86idfOSHZmpMhY46O3ven+lUgxenMroOsK5i3VfYNpLf6V dmArVM6HyPixyQlrkvpZrM34HC0yjLF+AEkce7fHbNn1InmxO/athtwnlQIWbeFN539D 5/mxO+OqoBNlgoHPlX8B3MJdP+cKttsGl55nesaBNqHDagc/mcHIB4TWlbhcoT+ialAb 6WAg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id e6-20020a63d946000000b004393aac966dsi13552015pgj.337.2022.09.24.13.14.52; Sat, 24 Sep 2022 13:15:11 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230396AbiIXUBo (ORCPT + 99 others); Sat, 24 Sep 2022 16:01:44 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37740 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230168AbiIXUBm (ORCPT ); Sat, 24 Sep 2022 16:01:42 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C333144549 for ; Sat, 24 Sep 2022 13:01:39 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 4720061053 for ; Sat, 24 Sep 2022 20:01:39 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id C0982C433C1; Sat, 24 Sep 2022 20:01:37 +0000 (UTC) Date: Sat, 24 Sep 2022 16:01:36 -0400 From: Steven Rostedt To: Jeff Xie Cc: mingo@redhat.com, mhiramat@kernel.org, zanussi@kernel.org, linux-kernel@vger.kernel.org, chensong_2000@189.cn Subject: Re: [PATCH v15 2/4] trace/objtrace: Get the value of the object Message-ID: <20220924160136.5029e942@rorschach.local.home> In-Reply-To: <20220819032706.695212-3-xiehuan09@gmail.com> References: <20220819032706.695212-1-xiehuan09@gmail.com> <20220819032706.695212-3-xiehuan09@gmail.com> X-Mailer: Claws Mail 3.17.8 (GTK+ 2.24.33; x86_64-pc-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-6.7 required=5.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,RCVD_IN_DNSWL_HI,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 19 Aug 2022 11:27:04 +0800 Jeff Xie wrote: Hi Jeff, I finally (sorry about the long wait) got a chance to take a look at this code and I really like it a lot! and I want to get this into the next merge window. I have one issue with it though, and that's with the syntax. > Using objtrace trigger to get the value of the object which from the kernel > function parameter. > > Syntax: > objtrace:add:obj[,offset][:type][:count][if ] I'm thinking instead of using the above syntax that is new, instead use the syntax that is used by kprobes, eprobes and uprobes. That is: objtrace:+offset(obj):type That is, instead of: objtrace:add:arg1,0x28:u32:1 have: objtrace:+0x28(arg1):u32 Perhaps we can add for count for greater than 1: obtrace:+0x28(arg1):u32[2] for two items. Then we could do even more complex analysis where we can dereference a pointer within a structure to another pointer: obtrace:+0x16(+0x28(arg1)):u32[2] Which will look at arg1, add 0x28 to it. dereference that location, then add 0x16 to the value, and then dereference that location as well. This code is available in the kprobe code that eprobes also uses: See process_fetch_insn() in kernel/trace/trace_eprobe.c and the parsing of the string is in kernel/trace/trace_probe.c: parse_probe_arg() I think doing this will make it much more extensive, not to mention it will match the syntax of other code in the tracing infrastructure. What do you think? -- Steve > > Usage: > # echo 'p bio_add_page arg1=$arg1' > ./kprobe_events > # gdb vmlinux > (gdb) p &(((struct bio *)0)->bi_iter.bi_size) > $1 = (unsigned int *) 0x28 > # echo 'objtrace:add:arg1,0x28:u32:1 if comm == "cat"' > ./events/kprobes/ \ > p_bio_add_page_0/trigger > # cat /test.txt > > Signed-off-by: Jeff Xie > Acked-by: Masami Hiramatsu (Google) > --- > kernel/trace/trace.c | 2 +- > kernel/trace/trace_entries.h | 5 +- > kernel/trace/trace_object.c | 187 ++++++++++++++++++++++++++++++++--- > kernel/trace/trace_output.c | 6 +- > 4 files changed, 181 insertions(+), 19 deletions(-) > > diff --git a/kernel/trace/trace.c b/kernel/trace/trace.c > index 2c2477dea0f2..c2ffc2235b7b 100644 > --- a/kernel/trace/trace.c > +++ b/kernel/trace/trace.c > @@ -5622,7 +5622,7 @@ static const char readme_msg[] = > "\t disable_hist::\n" > #endif > #ifdef CONFIG_TRACE_OBJECT > - "\t objtrace:add:obj[:count][if ]\n" > + "\t objtrace:add:obj[,offset][:type][:count][if ]\n" > #endif > #ifdef CONFIG_STACKTRACE > "\t\t stacktrace\n" > diff --git a/kernel/trace/trace_entries.h b/kernel/trace/trace_entries.h > index bb120d9498a9..2407c45a568c 100644 > --- a/kernel/trace/trace_entries.h > +++ b/kernel/trace/trace_entries.h > @@ -413,8 +413,9 @@ FTRACE_ENTRY(object, trace_object_entry, > __field( unsigned long, ip ) > __field( unsigned long, parent_ip ) > __field( unsigned long, object ) > + __field( unsigned long, value ) > ), > > - F_printk(" %ps <-- %ps object:%lx\n", > - (void *)__entry->ip, (void *)__entry->parent_ip, __entry->object) > + F_printk(" %ps <-- %ps object:%lx value:%lx\n", (void *)__entry->ip, > + (void *)__entry->parent_ip, __entry->object, __entry->value) > ); > diff --git a/kernel/trace/trace_object.c b/kernel/trace/trace_object.c > index 19ec4b1c0186..611341abeb7c 100644 > --- a/kernel/trace/trace_object.c > +++ b/kernel/trace/trace_object.c > @@ -16,8 +16,14 @@ static const int max_args_num = 6; > static void exit_trace_object(struct trace_array *tr); > static int init_trace_object(struct trace_array *tr); > > +/* > + * get the offset from the special object and > + * the type size of the value > + */ > struct object_instance { > void *obj; > + int obj_offset; > + int obj_value_type_size; > struct trace_array *tr; > }; > > @@ -25,9 +31,23 @@ struct object_instance { > struct objtrace_trigger_data { > struct ftrace_event_field *field; > char objtrace_cmd[OBJTRACE_CMD_LEN]; > + int obj_offset; > + int obj_value_type_size; > struct trace_array *tr; > }; > > +/* get the type size for the special object */ > +struct objtrace_fetch_type { > + char *name; > + int type_size; > +}; > + > +enum objattr { > + OBJ_OFFSET, > + OBJ_VAL_TYPE_SIZE, > + MAX_OBJ_ATTR > +}; > + > /* objtrace data with fops and objtrace_instances */ > struct objtrace_data { > struct list_head head; > @@ -67,6 +87,35 @@ static bool object_exist(void *obj, struct trace_array *tr) > return false; > } > > +static int get_object_attr(void *obj, int objattr, > + struct trace_array *tr, int *result) > +{ > + int i, max; > + struct objtrace_data *obj_data; > + > + obj_data = get_obj_data(tr); > + if (!obj_data) > + return -EINVAL; > + > + max = READ_ONCE(obj_data->num_traced_obj); > + smp_rmb(); > + for (i = 0; i < max; i++) { > + if (obj_data->traced_obj[i].obj == obj) { > + switch (objattr) { > + case OBJ_OFFSET: > + *result = obj_data->traced_obj[i].obj_offset; > + return 0; > + case OBJ_VAL_TYPE_SIZE: > + *result = obj_data->traced_obj[i].obj_value_type_size; > + return 0; > + default: > + return -EINVAL; > + } > + } > + } > + return -EINVAL; > +} > + > static bool object_empty(struct trace_array *tr) > { > struct objtrace_data *obj_data; > @@ -78,7 +127,8 @@ static bool object_empty(struct trace_array *tr) > return !READ_ONCE(obj_data->num_traced_obj); > } > > -static void set_trace_object(void *obj, struct trace_array *tr) > +static void set_trace_object(void *obj, int obj_offset, > + int obj_value_type_size, struct trace_array *tr) > { > unsigned long flags; > struct object_instance *obj_ins; > @@ -103,6 +153,8 @@ static void set_trace_object(void *obj, struct trace_array *tr) > } > obj_ins = &obj_data->traced_obj[READ_ONCE(obj_data->num_traced_obj)]; > obj_ins->obj = obj; > + obj_ins->obj_value_type_size = obj_value_type_size; > + obj_ins->obj_offset = obj_offset; > obj_ins->tr = tr; > /* make sure the num_traced_obj update always appears after traced_obj update */ > smp_wmb(); > @@ -112,7 +164,7 @@ static void set_trace_object(void *obj, struct trace_array *tr) > } > > static void submit_trace_object(unsigned long ip, unsigned long parent_ip, > - unsigned long object, struct trace_array *tr) > + unsigned long object, unsigned long value, struct trace_array *tr) > { > > struct trace_buffer *buffer = tr->array_buffer.buffer; > @@ -129,10 +181,43 @@ static void submit_trace_object(unsigned long ip, unsigned long parent_ip, > entry->ip = ip; > entry->parent_ip = parent_ip; > entry->object = object; > + entry->value = value; > > trace_buffer_unlock_commit(tr, buffer, event, trace_ctx); > } > > +static inline long get_object_value(unsigned long *val, void *obj, int type_size) > +{ > + char tmp[sizeof(u64)]; > + long ret = 0; > + > + ret = copy_from_kernel_nofault(tmp, obj, sizeof(tmp)); > + if (ret) > + return ret; > + switch (type_size) { > + case 1: { > + *val = (unsigned long)*(u8 *)tmp; > + break; > + } > + case 2: { > + *val = (unsigned long)*(u16 *)tmp; > + break; > + } > + case 4: { > + *val = (unsigned long)*(u32 *)tmp; > + break; > + } > + case 8: { > + *val = (unsigned long)*(u64 *)tmp; > + break; > + } > + default: > + return -EINVAL; > + } > + > + return 0; > +} > + > static void > trace_object_events_call(unsigned long ip, unsigned long parent_ip, > struct ftrace_ops *op, struct ftrace_regs *fregs) > @@ -140,7 +225,8 @@ trace_object_events_call(unsigned long ip, unsigned long parent_ip, > struct pt_regs *pt_regs = ftrace_get_regs(fregs); > struct trace_array *tr = op->private; > struct trace_array_cpu *data; > - unsigned long obj; > + int ret, val_type_size, obj_offset; > + unsigned long obj, val; > long disabled; > int cpu, n; > > @@ -154,8 +240,21 @@ trace_object_events_call(unsigned long ip, unsigned long parent_ip, > goto out; > for (n = 0; n < max_args_num; n++) { > obj = regs_get_kernel_argument(pt_regs, n); > - if (object_exist((void *)obj, tr)) > - submit_trace_object(ip, parent_ip, obj, tr); > + if (object_exist((void *)obj, tr)) { > + ret = get_object_attr((void *)obj, OBJ_OFFSET, > + tr, &obj_offset); > + if (unlikely(ret) < 0) > + goto out; > + ret = get_object_attr((void *)obj, OBJ_VAL_TYPE_SIZE, > + tr, &val_type_size); > + if (unlikely(ret) < 0) > + goto out; > + if (get_object_value(&val, (void *)(obj + obj_offset), > + val_type_size)) > + continue; > + > + submit_trace_object(ip, parent_ip, obj, val, tr); > + } > /* The parameters of a function may match multiple objects */ > } > } > @@ -176,9 +275,23 @@ trace_object_trigger(struct event_trigger_data *data, > > field = obj_data->field; > memcpy(&obj, rec + field->offset, sizeof(obj)); > - set_trace_object(obj, tr); > + /* set the offset from the special object and the type size of the value*/ > + set_trace_object(obj, obj_data->obj_offset, > + obj_data->obj_value_type_size, tr); > } > > +static const struct objtrace_fetch_type objtrace_fetch_types[] = { > + {"u8", 1}, > + {"x8", 1}, > + {"u16", 2}, > + {"x16", 2}, > + {"u32", 4}, > + {"x32", 4}, > + {"u64", 8}, > + {"x64", 8}, > + {NULL, 0}, > +}; > + > static void > trace_object_trigger_free(struct event_trigger_data *data) > { > @@ -213,14 +326,25 @@ static int > event_trigger_print(const char *name, struct seq_file *m, > void *data, char *filter_str, void *objtrace_data) > { > + int i; > long count = (long)data; > struct objtrace_trigger_data *obj_data = objtrace_data; > + const char *value_type_name; > > seq_puts(m, name); > > seq_printf(m, ":%s", obj_data->objtrace_cmd); > seq_printf(m, ":%s", obj_data->field->name); > + if (obj_data->obj_offset) > + seq_printf(m, ",0x%x", obj_data->obj_offset); > > + for (i = 0; objtrace_fetch_types[i].name; i++) { > + if (objtrace_fetch_types[i].type_size == obj_data->obj_value_type_size) { > + value_type_name = objtrace_fetch_types[i].name; > + break; > + } > + } > + seq_printf(m, ":%s", value_type_name); > if (count == -1) > seq_puts(m, ":unlimited"); > else > @@ -303,16 +427,18 @@ event_object_trigger_parse(struct event_command *cmd_ops, > struct event_trigger_data *trigger_data; > struct objtrace_trigger_data *obj_data; > struct ftrace_event_field *field; > - char *objtrace_cmd, *arg; > - char *param, *filter; > - int ret; > + char *objtrace_cmd, *obj; > + char *param, *filter, *str, *type; > + int ret, i, def_type_size, obj_value_type_size = 0; > + char *tmp_saved_param; > + long offset = 0; > bool remove; > > remove = event_trigger_check_remove(glob); > > /* > * separate the param and the filter: > - * objtrace:add:OBJ[:COUNT] [if filter] > + * objtrace:add:OBJ[,OFFS][:TYPE][:COUNT] [if filter] > */ > ret = event_trigger_separate_filter(param_and_filter, ¶m, &filter, true); > if (ret) > @@ -324,11 +450,44 @@ event_object_trigger_parse(struct event_command *cmd_ops, > return -EINVAL; > } > > - arg = strsep(¶m, ":"); > - if (!arg) > + obj = strsep(¶m, ":"); > + if (!obj) > return -EINVAL; > > - field = trace_find_event_field(file->event_call, arg); > + str = strchr(obj, ','); > + if (!str) > + offset = 0; > + else { > + *str++ = '\0'; > + ret = kstrtol(str, 0, &offset); > + if (ret) > + return -EINVAL; > + } > + def_type_size = sizeof(void *); > + if (!param) { > + obj_value_type_size = def_type_size; > + goto skip_get_type; > + } > + tmp_saved_param = param; > + type = strsep(¶m, ":"); > + if (!type) > + obj_value_type_size = def_type_size; > + /* if this is the trigger count */ > + else if (isdigit(type[0])) { > + obj_value_type_size = def_type_size; > + param = tmp_saved_param; > + } else { > + for (i = 0; objtrace_fetch_types[i].name; i++) { > + if (strcmp(objtrace_fetch_types[i].name, type) == 0) { > + obj_value_type_size = objtrace_fetch_types[i].type_size; > + break; > + } > + } > + } > + if (!obj_value_type_size) > + return -EINVAL; > +skip_get_type: > + field = trace_find_event_field(file->event_call, obj); > if (!field) > return -EINVAL; > > @@ -345,6 +504,8 @@ event_object_trigger_parse(struct event_command *cmd_ops, > return -ENOMEM; > > obj_data->field = field; > + obj_data->obj_offset = offset; > + obj_data->obj_value_type_size = obj_value_type_size; > obj_data->tr = file->tr; > snprintf(obj_data->objtrace_cmd, OBJTRACE_CMD_LEN, objtrace_cmd); > > diff --git a/kernel/trace/trace_output.c b/kernel/trace/trace_output.c > index 34ff7b4dc521..a45871b52dcc 100644 > --- a/kernel/trace/trace_output.c > +++ b/kernel/trace/trace_output.c > @@ -1561,6 +1561,7 @@ static enum print_line_t trace_object_print(struct trace_iterator *iter, int fla > trace_assign_type(field, iter->ent); > print_fn_trace(s, field->ip, field->parent_ip, flags); > trace_seq_printf(s, " object:0x%lx", field->object); > + trace_seq_printf(s, " value:0x%lx", field->value); > trace_seq_putc(s, '\n'); > > return trace_handle_return(s); > @@ -1573,9 +1574,8 @@ static enum print_line_t trace_object_raw(struct trace_iterator *iter, int flags > > trace_assign_type(field, iter->ent); > > - trace_seq_printf(&iter->seq, "%lx %lx\n", > - field->ip, > - field->parent_ip); > + trace_seq_printf(&iter->seq, "%lx %lx %lx %lx\n", field->ip, > + field->parent_ip, field->object, field->value); > > return trace_handle_return(&iter->seq); > }