Received: by 2002:a05:6602:18e:0:0:0:0 with SMTP id m14csp3432762ioo; Mon, 30 May 2022 01:50:52 -0700 (PDT) X-Google-Smtp-Source: ABdhPJy+22j7Uc7ASbjF/2JaYYNeHBF23oW6q9DWMZQ2OMpyu5AZ7cXIgXxY89jOgrUyfjAH7JKI X-Received: by 2002:a17:903:1111:b0:15f:7f0:bbf3 with SMTP id n17-20020a170903111100b0015f07f0bbf3mr55050387plh.12.1653900652243; Mon, 30 May 2022 01:50:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1653900652; cv=none; d=google.com; s=arc-20160816; b=I2OS5ACzpcX5EeCcWuDzqukRkfhzfFQbbjjXNnT4dySYTb9VXTps8yX7JZRN2ehaJl RgONwmo4kcyzQgSUB+FkKvm5HYUOeK/2yi45ivX24hscIgojGw5F4d7CkST0U9c2jydx cuRhPhKJ/9MnACN3XtgMqmFQwPsyquw7S9ZCd2EAs/w7rH0EM2Luulpsc8ADonJOm9Uc co5KcE4/qCpT6iCJHk/OI/LXWLTv2KJ8zmjFJ3FcM+88jD2mUTaIEWxnQEM1g06q2zUp UbYXCFb8nTbPeszvIeTqhR/HF4nwlTLmtU1K0powyEi+uomc58F2BJrVqHy76tYxLsxU ItuA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:date:from:dkim-signature; bh=v+BD4H/bYYlyGcYIgteRpzX9kFCxi0nwABmpdcOcsDQ=; b=vWnHNfzKsyvXAc34qsIi7TLboFXYcBDyyQtgDr1voyLVlyhp2d1BoQcNVH9hDH4hzM VMF2Xjv7z+IquTZM5dMMYvD1wOjbGbp6aOB/FaI424dskrqx4PYv/8WVlKVEmWHKcyQc bWuzhn/zogdRCVbKWPod8H8aBFzrYOviqgTgLBx4LdSHxdq2OdzmxVlsiFQRNqML9+JP b5o/dsI73dzTWRdp6cFRysSokWcs7xxOzTKQC3S/xVe5CtxQvOV/zvTvuBK90n/CZ3EB msTXDfyURjIix07lcNB09lV6fajYubQ5Kgc1dNbrK+kwxHfeQDTt3sBocR9pj6VJsmM3 g5hA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=V7tucomF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id q17-20020a17090311d100b00161abfb3230si16668285plh.363.2022.05.30.01.50.38; Mon, 30 May 2022 01:50:52 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=V7tucomF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230516AbiE2UOx (ORCPT + 99 others); Sun, 29 May 2022 16:14:53 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49222 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229515AbiE2UOv (ORCPT ); Sun, 29 May 2022 16:14:51 -0400 Received: from mail-wr1-x42d.google.com (mail-wr1-x42d.google.com [IPv6:2a00:1450:4864:20::42d]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 20D2854BCB; Sun, 29 May 2022 13:14:50 -0700 (PDT) Received: by mail-wr1-x42d.google.com with SMTP id e25so1562213wra.11; Sun, 29 May 2022 13:14:50 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=from:date:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=v+BD4H/bYYlyGcYIgteRpzX9kFCxi0nwABmpdcOcsDQ=; b=V7tucomFehUmShYn5bdOQ2JTUXCbe23GwkQwE3zo4brScPHIlWcVsqja7ViYdgYUtx mjoI/MfDXup4+YkleFKaA6QYx85W5mSa2qaAmU1xL1Q6tfCfqrNAECQtCxdhSW+d/4Cw MTHJjKK1wB/IJHpsOG4TSGLZEiDi6w/ipPtjpYoYMPsPG/xWEsZwC066vBjWBoZVehkO W28I29hiikT/D09J0UBzqR06EghsjsZZboJZcKZ6z3j7wxOohyztuYwVDKFM7RfvyyxX c8xpytOfzqITV/7A/HFW1QklzFfg9a5HfsRCLByza9TZChQa7zYgOGPeybHMpZ+ve0cZ lEBg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:date:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=v+BD4H/bYYlyGcYIgteRpzX9kFCxi0nwABmpdcOcsDQ=; b=qPpNBFTnN9eK48A6jxiK6wU7gsTsNslIs0cii9xE5D2v9CDlVNuJEsBI9DGudImweZ 09R0+osAvlwCJSit5I8HUJTe+DxY9eHmwbaxCc9S1VnCDATqAH+qIRfRBiXhF3w8W2ZP bv4XEP/snuNW+kFougzrpg0L0SBHYVurNmrmEsA20pbiLsKFzsWX0oZGy3ndGoTwlPJJ NdtAu2R1TSEp3vQA7bTaNo9rPlBT1JZfKZRDU3DtEtDj1OMsKy9g1QuWIFMhXDi+WoWz 9FhhmjngPK1AnUQvrylzeWHZNDn/BkEE/TUqxKtusQWQTY3g4bJ7JryOvhi4/8khHpN3 eDbw== X-Gm-Message-State: AOAM532HaiYin14oaOOfR8C5X870/3mwxaL4o/wRPkfPQDuvSZA6I88A RqvjDc+UIkqeuZTJSmfePzc= X-Received: by 2002:a05:6000:1186:b0:210:413:aa6b with SMTP id g6-20020a056000118600b002100413aa6bmr17323392wrx.383.1653855288489; Sun, 29 May 2022 13:14:48 -0700 (PDT) Received: from krava ([83.240.62.49]) by smtp.gmail.com with ESMTPSA id p2-20020a5d59a2000000b0020c7ec0fdf4sm8515776wrr.117.2022.05.29.13.14.47 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 29 May 2022 13:14:47 -0700 (PDT) From: Jiri Olsa X-Google-Original-From: Jiri Olsa Date: Sun, 29 May 2022 22:14:45 +0200 To: Ian Rogers , Jan Kratochvil Cc: Jiri Olsa , Fangrui Song , Peter Zijlstra , Ingo Molnar , Arnaldo Carvalho de Melo , linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org, llvm@lists.linux.dev, Sebastian Ullrich Subject: Re: [PATCH v2] perf: Fix segbase for ld.lld linked objects Message-ID: References: <20220527182039.673248-1-maskray@google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, May 28, 2022 at 07:49:13AM -0700, Ian Rogers wrote: > On Sat, May 28, 2022 at 5:06 AM Jiri Olsa wrote: > > > > On Fri, May 27, 2022 at 11:20:39AM -0700, Fangrui Song wrote: > > > segbase is the address of .eh_frame_hdr and table_data is segbase plus > > > the header size. find_proc_info computes segbase as `map->start + > > > segbase - map->pgoff` which is wrong when > > > > > > * .eh_frame_hdr and .text are in different PT_LOAD program headers > > > * and their p_vaddr difference does not equal their p_offset difference > > > > > > Since 10.0, ld.lld's default --rosegment -z noseparate-code layout has > > > such R and RX PT_LOAD program headers. > > > > > > ld.lld (default) => perf report fails to unwind `perf record > > > --call-graph dwarf` recorded data > > > ld.lld --no-rosegment => ok (trivial, no R PT_LOAD) > > > ld.lld -z separate-code => ok but by luck: there are two PT_LOAD but > > > their p_vaddr difference equals p_offset difference > > > > > > ld.bfd -z noseparate-code => ok (trivial, no R PT_LOAD) > > > ld.bfd -z separate-code (default for Linux/x86) => ok but by luck: > > > there are two PT_LOAD but their p_vaddr difference equals p_offset > > > difference > > > > > > To fix the issue, compute segbase as dso's base address plus > > > PT_GNU_EH_FRAME's p_vaddr. The base address is computed by iterating > > > over all dso-associated maps and then subtract the first PT_LOAD p_vaddr > > > (the minimum guaranteed by generic ABI) from the minimum address. > > > > > > In libunwind, find_proc_info transitively called by unw_step is cached, > > > so the iteration overhead is acceptable. > > > > > > Reported-by: Sebastian Ullrich > > > Link: https://github.com/ClangBuiltLinux/linux/issues/1646 > > > Signed-off-by: Fangrui Song > > > Cc: Ian Rogers > > > > nice, is this a problem for libdw dwarf unwind as well? > > libdw also is broken with lld but not bfd. When I switch on verbose > output (-vv) with perf report I see: > > overlapping maps: > 202000-203000 0 a.out > 202000-203000 0 a.out > > and: > > unwind: failed with 'address range overlaps an existing module' > > The sense I get from this is libdw is rejecting the file before > anything we can do in perf. Jan, any idea if can this be fixed in libdw? thanks, jirka > > Thanks, > Ian > > > you should be able to build it with: > > > > $ make NO_LIBUNWIND=1 > > > > thanks, > > jirka > > > > > > > > -- > > > Changes from v1: > > > * Fix elf_base_address to use the first PT_LOAD > > > * Use dso::elf_base_addr which is a constant even if the dso is loaded into multiple processes > > > --- > > > tools/perf/util/dso.h | 2 + > > > tools/perf/util/unwind-libunwind-local.c | 107 ++++++++++++++++------- > > > 2 files changed, 78 insertions(+), 31 deletions(-) > > > > > > diff --git a/tools/perf/util/dso.h b/tools/perf/util/dso.h > > > index 3a9fd4d389b5..97047a11282b 100644 > > > --- a/tools/perf/util/dso.h > > > +++ b/tools/perf/util/dso.h > > > @@ -196,7 +196,9 @@ struct dso { > > > u32 status_seen; > > > u64 file_size; > > > struct list_head open_entry; > > > + u64 elf_base_addr; > > > u64 debug_frame_offset; > > > + u64 eh_frame_hdr_addr; > > > u64 eh_frame_hdr_offset; > > > } data; > > > /* bpf prog information */ > > > diff --git a/tools/perf/util/unwind-libunwind-local.c b/tools/perf/util/unwind-libunwind-local.c > > > index 41e29fc7648a..37622699c91a 100644 > > > --- a/tools/perf/util/unwind-libunwind-local.c > > > +++ b/tools/perf/util/unwind-libunwind-local.c > > > @@ -169,30 +169,64 @@ static int __dw_read_encoded_value(u8 **p, u8 *end, u64 *val, > > > __v; \ > > > }) > > > > > > -static u64 elf_section_offset(int fd, const char *name) > > > +static int elf_section_address_and_offset(int fd, const char *name, u64 *address, u64 *offset) > > > { > > > Elf *elf; > > > GElf_Ehdr ehdr; > > > GElf_Shdr shdr; > > > - u64 offset = 0; > > > + int ret; > > > > > > elf = elf_begin(fd, PERF_ELF_C_READ_MMAP, NULL); > > > if (elf == NULL) > > > - return 0; > > > + return -1; > > > > > > - do { > > > - if (gelf_getehdr(elf, &ehdr) == NULL) > > > - break; > > > + if (gelf_getehdr(elf, &ehdr) == NULL) > > > + goto out_err; > > > > > > - if (!elf_section_by_name(elf, &ehdr, &shdr, name, NULL)) > > > - break; > > > - > > > - offset = shdr.sh_offset; > > > - } while (0); > > > + if (!elf_section_by_name(elf, &ehdr, &shdr, name, NULL)) > > > + goto out_err; > > > > > > + *address = shdr.sh_addr; > > > + *offset = shdr.sh_offset; > > > + ret = 0; > > > +out_err: > > > elf_end(elf); > > > + return ret; > > > +} > > > + > > > +#ifndef NO_LIBUNWIND_DEBUG_FRAME > > > +static u64 elf_section_offset(int fd, const char *name) > > > +{ > > > + u64 address, offset; > > > + > > > + if (elf_section_address_and_offset(fd, name, &address, &offset)) > > > + return 0; > > > + > > > return offset; > > > } > > > +#endif > > > + > > > +static u64 elf_base_address(int fd) > > > +{ > > > + Elf *elf = elf_begin(fd, PERF_ELF_C_READ_MMAP, NULL); > > > + GElf_Phdr phdr; > > > + u64 retval = 0; > > > + size_t i, phdrnum = 0; > > > + > > > + if (elf == NULL) > > > + return 0; > > > + (void)elf_getphdrnum(elf, &phdrnum); > > > + /* PT_LOAD segments are sorted by p_vaddr, so the first has the minimum p_vaddr. */ > > > + for (i = 0; i < phdrnum; i++) { > > > + if (gelf_getphdr(elf, i, &phdr) && phdr.p_type == PT_LOAD) { > > > + retval = phdr.p_vaddr & -getpagesize(); > > > + break; > > > + } > > > + } > > > + > > > + elf_end(elf); > > > + return retval; > > > +} > > > > > > #ifndef NO_LIBUNWIND_DEBUG_FRAME > > > static int elf_is_exec(int fd, const char *name) > > > @@ -248,8 +282,7 @@ struct eh_frame_hdr { > > > } __packed; > > > > > > static int unwind_spec_ehframe(struct dso *dso, struct machine *machine, > > > - u64 offset, u64 *table_data, u64 *segbase, > > > - u64 *fde_count) > > > + u64 offset, u64 *table_data_offset, u64 *fde_count) > > > { > > > struct eh_frame_hdr hdr; > > > u8 *enc = (u8 *) &hdr.enc; > > > @@ -265,35 +298,47 @@ static int unwind_spec_ehframe(struct dso *dso, struct machine *machine, > > > dw_read_encoded_value(enc, end, hdr.eh_frame_ptr_enc); > > > > > > *fde_count = dw_read_encoded_value(enc, end, hdr.fde_count_enc); > > > - *segbase = offset; > > > - *table_data = (enc - (u8 *) &hdr) + offset; > > > + *table_data_offset = enc - (u8 *) &hdr; > > > return 0; > > > } > > > > > > -static int read_unwind_spec_eh_frame(struct dso *dso, struct machine *machine, > > > +static int read_unwind_spec_eh_frame(struct dso *dso, struct unwind_info *ui, > > > u64 *table_data, u64 *segbase, > > > u64 *fde_count) > > > { > > > - int ret = -EINVAL, fd; > > > - u64 offset = dso->data.eh_frame_hdr_offset; > > > + struct map *map; > > > + u64 base_addr = UINT64_MAX; > > > + int ret, fd; > > > > > > - if (offset == 0) { > > > - fd = dso__data_get_fd(dso, machine); > > > + if (dso->data.eh_frame_hdr_offset == 0) { > > > + fd = dso__data_get_fd(dso, ui->machine); > > > if (fd < 0) > > > return -EINVAL; > > > > > > /* Check the .eh_frame section for unwinding info */ > > > - offset = elf_section_offset(fd, ".eh_frame_hdr"); > > > - dso->data.eh_frame_hdr_offset = offset; > > > + ret = elf_section_address_and_offset(fd, ".eh_frame_hdr", > > > + &dso->data.eh_frame_hdr_addr, > > > + &dso->data.eh_frame_hdr_offset); > > > + dso->data.elf_base_addr = elf_base_address(fd); > > > dso__data_put_fd(dso); > > > + if (ret || dso->data.eh_frame_hdr_offset == 0) > > > + return -EINVAL; > > > } > > > > > > - if (offset) > > > - ret = unwind_spec_ehframe(dso, machine, offset, > > > - table_data, segbase, > > > - fde_count); > > > - > > > - return ret; > > > + maps__for_each_entry(ui->thread->maps, map) { > > > + if (map->dso == dso && map->start < base_addr) > > > + base_addr = map->start; > > > + } > > > + base_addr -= dso->data.elf_base_addr; > > > + /* Address of .eh_frame_hdr */ > > > + *segbase = base_addr + dso->data.eh_frame_hdr_addr; > > > + ret = unwind_spec_ehframe(dso, ui->machine, dso->data.eh_frame_hdr_offset, > > > + table_data, fde_count); > > > + if (ret) > > > + return ret; > > > + /* binary_search_table offset plus .eh_frame_hdr address */ > > > + *table_data += *segbase; > > > + return 0; > > > } > > > > > > #ifndef NO_LIBUNWIND_DEBUG_FRAME > > > @@ -388,14 +433,14 @@ find_proc_info(unw_addr_space_t as, unw_word_t ip, unw_proc_info_t *pi, > > > pr_debug("unwind: find_proc_info dso %s\n", map->dso->name); > > > > > > /* Check the .eh_frame section for unwinding info */ > > > - if (!read_unwind_spec_eh_frame(map->dso, ui->machine, > > > + if (!read_unwind_spec_eh_frame(map->dso, ui, > > > &table_data, &segbase, &fde_count)) { > > > memset(&di, 0, sizeof(di)); > > > di.format = UNW_INFO_FORMAT_REMOTE_TABLE; > > > di.start_ip = map->start; > > > di.end_ip = map->end; > > > - di.u.rti.segbase = map->start + segbase - map->pgoff; > > > - di.u.rti.table_data = map->start + table_data - map->pgoff; > > > + di.u.rti.segbase = segbase; > > > + di.u.rti.table_data = table_data; > > > di.u.rti.table_len = fde_count * sizeof(struct table_entry) > > > / sizeof(unw_word_t); > > > ret = dwarf_search_unwind_table(as, ip, &di, pi, > > > -- > > > 2.36.1.124.g0e6072fb45-goog > > >