Received: by 2002:ac2:464d:0:0:0:0:0 with SMTP id s13csp1997181lfo; Sat, 28 May 2022 13:10:40 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzXjmc6ULOzRRkoc+fL3TQt0riAVEcZT8y0DCy2mo66I8hKRjes+gRgaHweEAusbzlZApmv X-Received: by 2002:a17:903:32d2:b0:161:d485:613f with SMTP id i18-20020a17090332d200b00161d485613fmr49122865plr.173.1653768640470; Sat, 28 May 2022 13:10:40 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1653768640; cv=none; d=google.com; s=arc-20160816; b=IW8dGP/IYmp2CFW6T0vPkL+q3hkibGF6+ipL/ROdh59RcAi971FusWM3FzfMUaan3I sqKRkX6NYesdCyowdeoETeVVvmU31xMcpdNTT0ybun4cLusax+qpQBEgXZkIQu5eEGQk DxqEZOmjrpISx7BbVlr374mxFoTIP5r7ZBDMYcQXoENalb5uNsxUfRy7KpJB5Vp6dlj6 cefdqujaZiC5pMO7sJvp2jP03e7zPQiBSS/JvpzdTjsTUDI8c20hrLRYjfdazhXZMS3T 5t6NBxGR6h9in3/ijPM88dAr1SQAHhTnW3jK9AbNulJwcxT7xT570xGd1KU3K0kNlEBV bJgQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=6mnq6s94t7HVDlK1KKnF1BOHF1n74alGQX0XJ6Y9vKE=; b=gII48bvjk8MgQFwUTZ+FJTCcQj5Ec+3itdNE9t0OWXM+oQEY/J4WxYobb5/X0ZbCXu hi4D6nmhitqCggMPgMYYuFdU6B1RCEaPKu45ocpekfz4CKHg06H76D+rWZNZfoC6Oud5 pC6tnCI2mXWEYoiSK+1xlvj7fz8oggsdft9+ee23DIb33EQddKNGGmjriMKLdLDOqnzi tIyFPLE0U5Lzqcf9q7HZM4zaNMR75sxt9vrGxUZd6TsKNhO234TW77q/eAms7D4FKxbY V/oiOqUJFORd/zpeHn2qS70MwSHqy3Puk0UkA14Nndr75GLubGCyjoux9q0E4P0DKYVW M76A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b="BR3zd/vB"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id j12-20020a65558c000000b003f5f0e7a705si11209140pgs.190.2022.05.28.13.10.40 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sat, 28 May 2022 13:10:40 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b="BR3zd/vB"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id E7FF112AB0A; Sat, 28 May 2022 12:24:03 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1348574AbiE0SwO (ORCPT + 99 others); Fri, 27 May 2022 14:52:14 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41616 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238626AbiE0SwM (ORCPT ); Fri, 27 May 2022 14:52:12 -0400 Received: from mail-wr1-x433.google.com (mail-wr1-x433.google.com [IPv6:2a00:1450:4864:20::433]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 8E9EF18B00 for ; Fri, 27 May 2022 11:52:08 -0700 (PDT) Received: by mail-wr1-x433.google.com with SMTP id h11so3225712wrb.0 for ; Fri, 27 May 2022 11:52:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=6mnq6s94t7HVDlK1KKnF1BOHF1n74alGQX0XJ6Y9vKE=; b=BR3zd/vBbmw1kqJ+9veXoJPVyr84j7xUtlwS+kx2VF6cEeRX56XW2ceoNXenDm9cZF 2+DmTQCWed/9NzrE3uAczYkvyLQ484ZJfmb7Jq99BEeHPk/dCA4R4tNBMp9ptoB6DLjR 4RhF4sPLrM7ojQITMHh/++RTnYQ8wqRDWhyUZd7j/V1axvFB0KABe07CqLntG8G6unap CYUOysMLIuOsOf6dO2599uUE2wbyLoj3UshMAZ8oF5nwCkY/iRREw0X4yFGJ8RrKHyKn 2dQIMVyCFq4qPRqviXpWzvrQNbjJW7qZRPywaMLukSHwzb8qtFq4jTRAN7VD2NsD1x1R 5qVQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=6mnq6s94t7HVDlK1KKnF1BOHF1n74alGQX0XJ6Y9vKE=; b=Zwz95OWJ8rb1PK9rSgcUFSBnusLes8tspDS4mXx1t3A5Z0ueQsXdvsNEaNai64DNNf kLpH0RPUs8q2M9yo6S8rJIjUBuO3sUFhZth3ATumerFU5iTTUH+Oqla4sHfYwtpJ0Z7j zzzLS2tv0ptCFdoGi4isqC8M2vdqIu9NC+ZvLGQihMg8ksvfXlKnSnVhgJTw0irROeyv Vyp70edK4IvqAn+82A8knzbSbp8bQiWjNsqMWNDrykdacThYiaQyI2Ewf6rxZekh6qLD xyi0zsuUmAGbt0umYNj+Z1hXT71UxVohp4k/Uq1Ay2v8g4/RP8V63CF3a07j7MuEbuVA Az/w== X-Gm-Message-State: AOAM530SR6bsbFqRDDcgh7rqcGpIkSFz7jcR8y4aaR7jvFp2FZFM1iNl sV6tW8IdhMJJEOnVouIkQia46aSSJnED3h40vfGOPw== X-Received: by 2002:a5d:598f:0:b0:20c:83c9:b05b with SMTP id n15-20020a5d598f000000b0020c83c9b05bmr36529153wri.343.1653677526611; Fri, 27 May 2022 11:52:06 -0700 (PDT) MIME-Version: 1.0 References: <20220527182039.673248-1-maskray@google.com> In-Reply-To: <20220527182039.673248-1-maskray@google.com> From: Ian Rogers Date: Fri, 27 May 2022 11:51:54 -0700 Message-ID: Subject: Re: [PATCH v2] perf: Fix segbase for ld.lld linked objects To: Fangrui Song Cc: Peter Zijlstra , Ingo Molnar , Arnaldo Carvalho de Melo , linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org, llvm@lists.linux.dev, Sebastian Ullrich Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-9.5 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE, USER_IN_DEF_DKIM_WL autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, May 27, 2022 at 11:20 AM Fangrui Song wrote: > > segbase is the address of .eh_frame_hdr and table_data is segbase plus > the header size. find_proc_info computes segbase as `map->start + > segbase - map->pgoff` which is wrong when > > * .eh_frame_hdr and .text are in different PT_LOAD program headers > * and their p_vaddr difference does not equal their p_offset difference > > Since 10.0, ld.lld's default --rosegment -z noseparate-code layout has > such R and RX PT_LOAD program headers. > > ld.lld (default) => perf report fails to unwind `perf record > --call-graph dwarf` recorded data > ld.lld --no-rosegment => ok (trivial, no R PT_LOAD) > ld.lld -z separate-code => ok but by luck: there are two PT_LOAD but > their p_vaddr difference equals p_offset difference > > ld.bfd -z noseparate-code => ok (trivial, no R PT_LOAD) > ld.bfd -z separate-code (default for Linux/x86) => ok but by luck: > there are two PT_LOAD but their p_vaddr difference equals p_offset > difference > > To fix the issue, compute segbase as dso's base address plus > PT_GNU_EH_FRAME's p_vaddr. The base address is computed by iterating > over all dso-associated maps and then subtract the first PT_LOAD p_vaddr > (the minimum guaranteed by generic ABI) from the minimum address. > > In libunwind, find_proc_info transitively called by unw_step is cached, > so the iteration overhead is acceptable. > > Reported-by: Sebastian Ullrich > Link: https://github.com/ClangBuiltLinux/linux/issues/1646 > Signed-off-by: Fangrui Song > Cc: Ian Rogers Reviewed-by: Ian Rogers Thanks! Ian > -- > Changes from v1: > * Fix elf_base_address to use the first PT_LOAD > * Use dso::elf_base_addr which is a constant even if the dso is loaded into multiple processes > --- > tools/perf/util/dso.h | 2 + > tools/perf/util/unwind-libunwind-local.c | 107 ++++++++++++++++------- > 2 files changed, 78 insertions(+), 31 deletions(-) > > diff --git a/tools/perf/util/dso.h b/tools/perf/util/dso.h > index 3a9fd4d389b5..97047a11282b 100644 > --- a/tools/perf/util/dso.h > +++ b/tools/perf/util/dso.h > @@ -196,7 +196,9 @@ struct dso { > u32 status_seen; > u64 file_size; > struct list_head open_entry; > + u64 elf_base_addr; > u64 debug_frame_offset; > + u64 eh_frame_hdr_addr; > u64 eh_frame_hdr_offset; > } data; > /* bpf prog information */ > diff --git a/tools/perf/util/unwind-libunwind-local.c b/tools/perf/util/unwind-libunwind-local.c > index 41e29fc7648a..37622699c91a 100644 > --- a/tools/perf/util/unwind-libunwind-local.c > +++ b/tools/perf/util/unwind-libunwind-local.c > @@ -169,30 +169,64 @@ static int __dw_read_encoded_value(u8 **p, u8 *end, u64 *val, > __v; \ > }) > > -static u64 elf_section_offset(int fd, const char *name) > +static int elf_section_address_and_offset(int fd, const char *name, u64 *address, u64 *offset) > { > Elf *elf; > GElf_Ehdr ehdr; > GElf_Shdr shdr; > - u64 offset = 0; > + int ret; > > elf = elf_begin(fd, PERF_ELF_C_READ_MMAP, NULL); > if (elf == NULL) > - return 0; > + return -1; > > - do { > - if (gelf_getehdr(elf, &ehdr) == NULL) > - break; > + if (gelf_getehdr(elf, &ehdr) == NULL) > + goto out_err; > > - if (!elf_section_by_name(elf, &ehdr, &shdr, name, NULL)) > - break; > - > - offset = shdr.sh_offset; > - } while (0); > + if (!elf_section_by_name(elf, &ehdr, &shdr, name, NULL)) > + goto out_err; > > + *address = shdr.sh_addr; > + *offset = shdr.sh_offset; > + ret = 0; > +out_err: > elf_end(elf); > + return ret; > +} > + > +#ifndef NO_LIBUNWIND_DEBUG_FRAME > +static u64 elf_section_offset(int fd, const char *name) > +{ > + u64 address, offset; > + > + if (elf_section_address_and_offset(fd, name, &address, &offset)) > + return 0; > + > return offset; > } > +#endif > + > +static u64 elf_base_address(int fd) > +{ > + Elf *elf = elf_begin(fd, PERF_ELF_C_READ_MMAP, NULL); > + GElf_Phdr phdr; > + u64 retval = 0; > + size_t i, phdrnum = 0; > + > + if (elf == NULL) > + return 0; > + (void)elf_getphdrnum(elf, &phdrnum); > + /* PT_LOAD segments are sorted by p_vaddr, so the first has the minimum p_vaddr. */ > + for (i = 0; i < phdrnum; i++) { > + if (gelf_getphdr(elf, i, &phdr) && phdr.p_type == PT_LOAD) { > + retval = phdr.p_vaddr & -getpagesize(); > + break; > + } > + } > + > + elf_end(elf); > + return retval; > +} > > #ifndef NO_LIBUNWIND_DEBUG_FRAME > static int elf_is_exec(int fd, const char *name) > @@ -248,8 +282,7 @@ struct eh_frame_hdr { > } __packed; > > static int unwind_spec_ehframe(struct dso *dso, struct machine *machine, > - u64 offset, u64 *table_data, u64 *segbase, > - u64 *fde_count) > + u64 offset, u64 *table_data_offset, u64 *fde_count) > { > struct eh_frame_hdr hdr; > u8 *enc = (u8 *) &hdr.enc; > @@ -265,35 +298,47 @@ static int unwind_spec_ehframe(struct dso *dso, struct machine *machine, > dw_read_encoded_value(enc, end, hdr.eh_frame_ptr_enc); > > *fde_count = dw_read_encoded_value(enc, end, hdr.fde_count_enc); > - *segbase = offset; > - *table_data = (enc - (u8 *) &hdr) + offset; > + *table_data_offset = enc - (u8 *) &hdr; > return 0; > } > > -static int read_unwind_spec_eh_frame(struct dso *dso, struct machine *machine, > +static int read_unwind_spec_eh_frame(struct dso *dso, struct unwind_info *ui, > u64 *table_data, u64 *segbase, > u64 *fde_count) > { > - int ret = -EINVAL, fd; > - u64 offset = dso->data.eh_frame_hdr_offset; > + struct map *map; > + u64 base_addr = UINT64_MAX; > + int ret, fd; > > - if (offset == 0) { > - fd = dso__data_get_fd(dso, machine); > + if (dso->data.eh_frame_hdr_offset == 0) { > + fd = dso__data_get_fd(dso, ui->machine); > if (fd < 0) > return -EINVAL; > > /* Check the .eh_frame section for unwinding info */ > - offset = elf_section_offset(fd, ".eh_frame_hdr"); > - dso->data.eh_frame_hdr_offset = offset; > + ret = elf_section_address_and_offset(fd, ".eh_frame_hdr", > + &dso->data.eh_frame_hdr_addr, > + &dso->data.eh_frame_hdr_offset); > + dso->data.elf_base_addr = elf_base_address(fd); > dso__data_put_fd(dso); > + if (ret || dso->data.eh_frame_hdr_offset == 0) > + return -EINVAL; > } > > - if (offset) > - ret = unwind_spec_ehframe(dso, machine, offset, > - table_data, segbase, > - fde_count); > - > - return ret; > + maps__for_each_entry(ui->thread->maps, map) { > + if (map->dso == dso && map->start < base_addr) > + base_addr = map->start; > + } > + base_addr -= dso->data.elf_base_addr; > + /* Address of .eh_frame_hdr */ > + *segbase = base_addr + dso->data.eh_frame_hdr_addr; > + ret = unwind_spec_ehframe(dso, ui->machine, dso->data.eh_frame_hdr_offset, > + table_data, fde_count); > + if (ret) > + return ret; > + /* binary_search_table offset plus .eh_frame_hdr address */ > + *table_data += *segbase; > + return 0; > } > > #ifndef NO_LIBUNWIND_DEBUG_FRAME > @@ -388,14 +433,14 @@ find_proc_info(unw_addr_space_t as, unw_word_t ip, unw_proc_info_t *pi, > pr_debug("unwind: find_proc_info dso %s\n", map->dso->name); > > /* Check the .eh_frame section for unwinding info */ > - if (!read_unwind_spec_eh_frame(map->dso, ui->machine, > + if (!read_unwind_spec_eh_frame(map->dso, ui, > &table_data, &segbase, &fde_count)) { > memset(&di, 0, sizeof(di)); > di.format = UNW_INFO_FORMAT_REMOTE_TABLE; > di.start_ip = map->start; > di.end_ip = map->end; > - di.u.rti.segbase = map->start + segbase - map->pgoff; > - di.u.rti.table_data = map->start + table_data - map->pgoff; > + di.u.rti.segbase = segbase; > + di.u.rti.table_data = table_data; > di.u.rti.table_len = fde_count * sizeof(struct table_entry) > / sizeof(unw_word_t); > ret = dwarf_search_unwind_table(as, ip, &di, pi, > -- > 2.36.1.124.g0e6072fb45-goog >