Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp1028051iob; Fri, 13 May 2022 20:12:12 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyWl9DXojdVpSz+cHuWRGILIIt9h18fJFUlapgqry4UjwZQPk5qKjGSdLfNZwGLnLU3RwwH X-Received: by 2002:a5d:56c9:0:b0:20a:c54c:70e2 with SMTP id m9-20020a5d56c9000000b0020ac54c70e2mr6250629wrw.415.1652497932097; Fri, 13 May 2022 20:12:12 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1652497932; cv=none; d=google.com; s=arc-20160816; b=rbSIow0RivCWhF+MujLBnmBvgd5rPeAZtODOdCORXOe5cBiXoKCq2W0UeTCvjZMATl ipzce6ia8cmzMrUPcTCRy31YCbjwq80UncBnUdd1vCk6lh0ETI6ZBm/iOUHAFOEOTTxL TbCndW6vUvri7TkGGY/Iobdl7M/jc9XtYZzwcfr6DObj8H6r7ecVsGLEOPbbVIF/yZwf qVK9FfayaDvyzrYobnxxXhMVXWeTUaXltb7ivpXiu2YZ15O3iTspG9gqfZd8F52k4Nov u8L01UbMQy4Ll9IPqbKAXIRXl+JK004j6n9tyBKEKA8T8ECW24i2G0a24p1IBem+hMoT aGdA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=yAcmV9FDTz7FaesnF1N1N4V0GnEb7KHKInbH9fhNWU8=; b=rHzb4Ou6cbVj39HLEZ0/QxAJqg6FTKctJvoFiqZO1pZ5bwbXKOkA2T1/Imo+Y7ud/w K3NLunVKnGMHlotKQGp75Y7zk+j+RXkFkmDhfVKzgfXHm/zYqjatTKGv/mODI7sk4b/j OCJA/0jS63kssBG8VEWg1NPXBzk+aad7xfD4e1K57TqTZf+3kQoTvGiQl7ZIf6hv6mSk LDpRZrmyipS4iFFpzVEPxcy0LbV8pzwiRb1sE6karkN3ZgyBMs0sh1U05tJ1BkIhI3oB KtwZeVxAR1rpgC82ITEw8y6Ogrf0zcb7evrpdlJr1jGrgWAPSfyX2xoEf0dyUXC4gZt5 MhhQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=QHETTkWn; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id r11-20020a5d52cb000000b0020cd284962dsi3077800wrv.254.2022.05.13.20.12.11 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 13 May 2022 20:12:12 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=QHETTkWn; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 321113D0931; Fri, 13 May 2022 16:52:27 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1376640AbiEMDXG (ORCPT + 99 others); Thu, 12 May 2022 23:23:06 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44156 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1347229AbiEMDXF (ORCPT ); Thu, 12 May 2022 23:23:05 -0400 Received: from mail-ed1-x52f.google.com (mail-ed1-x52f.google.com [IPv6:2a00:1450:4864:20::52f]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4627A1F1597 for ; Thu, 12 May 2022 20:23:03 -0700 (PDT) Received: by mail-ed1-x52f.google.com with SMTP id c12so8409488edv.10 for ; Thu, 12 May 2022 20:23:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=yAcmV9FDTz7FaesnF1N1N4V0GnEb7KHKInbH9fhNWU8=; b=QHETTkWnyZGXpgnl8ePep7ok3z4DDLg8Zrn78o8rL2LbxEGESiKNwCyhDu+suMQN88 MKkRdkThW5qr7PfDduYHY/DI2t/dOxZQH5Kqf5y8deyq9fWPMHI4SbQ01YNQkNNLSy8+ zFM/m1i5LNj2m9W0ZtVxFbxbdBSIaT3XIBeSJoA+hNNzXhSEZKCs2vzuB6CQPw4iHyah r3QAdDQSsMVMxnN10Mx3bsbUxF28M4+5mW0eIpzf+CSFQR4n1brBGyn7Rn68y4Nxcu5y 88q2sNvQ6FJJWMGWKZn8RvVaQDt/ca41bIPO5AS37/qmWapBx3wadrRYJ/NThtYNPF99 cn5w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=yAcmV9FDTz7FaesnF1N1N4V0GnEb7KHKInbH9fhNWU8=; b=i7l+HPcfDWMyXsVgXdHpB3q/7R7cHyU47VoxbOMffX7jd7mTtAmIwdWn9fdj+iuPNE E2BkKRIJsLv/dMLnVI7XOKehE6Kg3Jpy8E9GvUoCG4C4zebBCepV9I7GbNs/Eobnd6iE R9Wsg6/3G8iMYsRJD0xDYOt7yQbOxfjhal5HVaLPTs+5XBAHCoAPmvBR/ZParncUrt3O hgv6wN5vWZ5Yp2WLmqiLfRsp5NR6jJTT8+ZlZYkFgv0KOIK178K+HBhzbjY9dgkBjdQG cxX7gelssj7WkLMgXG/vm7m4k5FOY3Bh3Ig5zltJMuowUBY+ehw3HplduYJoZaBG7TE6 WlYw== X-Gm-Message-State: AOAM533mJQyAjm7Fw0W937FmrlHpomtQKEBo/rKdoZrMhAl73NqqhuXh yzrPkdUJ2CdxzL/HHIOW694hST1orP2q/MbwRMA= X-Received: by 2002:a05:6402:1694:b0:425:ae5e:4843 with SMTP id a20-20020a056402169400b00425ae5e4843mr38942566edv.415.1652412181572; Thu, 12 May 2022 20:23:01 -0700 (PDT) MIME-Version: 1.0 References: <29ff99f86e3da965b6e46c1cc2d72ce6528c17c3.1652382321.git.jpoimboe@kernel.org> In-Reply-To: <29ff99f86e3da965b6e46c1cc2d72ce6528c17c3.1652382321.git.jpoimboe@kernel.org> From: Kaiwan N Billimoria Date: Fri, 13 May 2022 08:52:45 +0530 Message-ID: Subject: Re: [PATCH] scripts/faddr2line: Fix overlapping text section failures To: Josh Poimboeuf Cc: Linux Kernel Mailing List , Peter Zijlstra Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-1.7 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,RDNS_NONE, SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Works perfectly! Thanks Josh, awaiting the merge into mainline ... On Fri, May 13, 2022 at 12:35 AM Josh Poimboeuf wrote: > > There have been some recent reports of faddr2line failures: > > $ scripts/faddr2line sound/soundcore.ko sound_devnode+0x5/0x35 > bad symbol size: base: 0x0000000000000000 end: 0x0000000000000000 > > $ ./scripts/faddr2line vmlinux.o enter_from_user_mode+0x24 > bad symbol size: base: 0x0000000000005fe0 end: 0x0000000000005fe0 > > The problem is that faddr2line is based on 'nm', which has a major > limitation: it doesn't know how to distinguish between different text > sections. So if an offset exists in multiple text sections in the > object, it may fail. > > Rewrite faddr2line to be section-aware, by basing it on readelf. > > Fixes: 67326666e2d4 ("scripts: add script for translating stack dump function offsets") > Reported-by: Kaiwan N Billimoria > Reported-by: Peter Zijlstra > Signed-off-by: Josh Poimboeuf > --- > scripts/faddr2line | 150 +++++++++++++++++++++++++++++---------------- > 1 file changed, 97 insertions(+), 53 deletions(-) > > diff --git a/scripts/faddr2line b/scripts/faddr2line > index 6c6439f69a72..0e6268d59883 100755 > --- a/scripts/faddr2line > +++ b/scripts/faddr2line > @@ -44,17 +44,6 @@ > set -o errexit > set -o nounset > > -READELF="${CROSS_COMPILE:-}readelf" > -ADDR2LINE="${CROSS_COMPILE:-}addr2line" > -SIZE="${CROSS_COMPILE:-}size" > -NM="${CROSS_COMPILE:-}nm" > - > -command -v awk >/dev/null 2>&1 || die "awk isn't installed" > -command -v ${READELF} >/dev/null 2>&1 || die "readelf isn't installed" > -command -v ${ADDR2LINE} >/dev/null 2>&1 || die "addr2line isn't installed" > -command -v ${SIZE} >/dev/null 2>&1 || die "size isn't installed" > -command -v ${NM} >/dev/null 2>&1 || die "nm isn't installed" > - > usage() { > echo "usage: faddr2line [--list] ..." >&2 > exit 1 > @@ -69,6 +58,14 @@ die() { > exit 1 > } > > +READELF="${CROSS_COMPILE:-}readelf" > +ADDR2LINE="${CROSS_COMPILE:-}addr2line" > +AWK="awk" > + > +command -v ${AWK} >/dev/null 2>&1 || die "${AWK} isn't installed" > +command -v ${READELF} >/dev/null 2>&1 || die "${READELF} isn't installed" > +command -v ${ADDR2LINE} >/dev/null 2>&1 || die "${ADDR2LINE} isn't installed" > + > # Try to figure out the source directory prefix so we can remove it from the > # addr2line output. HACK ALERT: This assumes that start_kernel() is in > # init/main.c! This only works for vmlinux. Otherwise it falls back to > @@ -76,7 +73,7 @@ die() { > find_dir_prefix() { > local objfile=$1 > > - local start_kernel_addr=$(${READELF} -sW $objfile | awk '$8 == "start_kernel" {printf "0x%s", $2}') > + local start_kernel_addr=$(${READELF} --symbols --wide $objfile | ${AWK} '$8 == "start_kernel" {printf "0x%s", $2}') > [[ -z $start_kernel_addr ]] && return > > local file_line=$(${ADDR2LINE} -e $objfile $start_kernel_addr) > @@ -97,86 +94,133 @@ __faddr2line() { > local dir_prefix=$3 > local print_warnings=$4 > > - local func=${func_addr%+*} > + local sym_name=${func_addr%+*} > local offset=${func_addr#*+} > offset=${offset%/*} > - local size= > - [[ $func_addr =~ "/" ]] && size=${func_addr#*/} > + local user_size= > + [[ $func_addr =~ "/" ]] && user_size=${func_addr#*/} > > - if [[ -z $func ]] || [[ -z $offset ]] || [[ $func = $func_addr ]]; then > + if [[ -z $sym_name ]] || [[ -z $offset ]] || [[ $sym_name = $func_addr ]]; then > warn "bad func+offset $func_addr" > DONE=1 > return > fi > > # Go through each of the object's symbols which match the func name. > - # In rare cases there might be duplicates. > - file_end=$(${SIZE} -Ax $objfile | awk '$1 == ".text" {print $2}') > - while read symbol; do > - local fields=($symbol) > - local sym_base=0x${fields[0]} > - local sym_type=${fields[1]} > - local sym_end=${fields[3]} > - > - # calculate the size > - local sym_size=$(($sym_end - $sym_base)) > + # In rare cases there might be duplicates, in which case we print all > + # matches. > + while read line; do > + local fields=($line) > + local sym_addr=0x${fields[1]} > + local sym_elf_size=${fields[2]} > + local sym_sec=${fields[6]} > + > + # Get the section size: > + local sec_size=$(${READELF} --section-headers --wide $objfile | > + sed 's/\[ /\[/' | > + ${AWK} -v sec=$sym_sec '$1 == "[" sec "]" { print "0x" $6; exit }') > + > + if [[ -z $sec_size ]]; then > + warn "bad section size: section: $sym_sec" > + DONE=1 > + return > + fi > + > + # Calculate the symbol size. > + # > + # Unfortunately we can't use the ELF size, because kallsyms > + # also includes the padding bytes in its size calculation. For > + # kallsyms, the size calculation is the distance between the > + # symbol and the next symbol in a sorted list. > + local sym_size > + local cur_sym_addr > + local found=0 > + while read line; do > + local fields=($line) > + cur_sym_addr=0x${fields[1]} > + local cur_sym_elf_size=${fields[2]} > + local cur_sym_name=${fields[7]:-} > + > + if [[ $cur_sym_addr = $sym_addr ]] && > + [[ $cur_sym_elf_size = $sym_elf_size ]] && > + [[ $cur_sym_name = $sym_name ]]; then > + found=1 > + continue > + fi > + > + if [[ $found = 1 ]]; then > + sym_size=$(($cur_sym_addr - $sym_addr)) > + [[ $sym_size -lt $sym_elf_size ]] && continue; > + found=2 > + break > + fi > + done < <(${READELF} --symbols --wide $objfile | ${AWK} -v sec=$sym_sec '$7 == sec' | sort --key=2) > + > + if [[ $found = 0 ]]; then > + warn "can't find symbol: sym_name: $sym_name sym_sec: $sym_sec sym_addr: $sym_addr sym_elf_size: $sym_elf_size" > + DONE=1 > + return > + fi > + > + # If nothing was found after the symbol, assume it's the last > + # symbol in the section. > + [[ $found = 1 ]] && sym_size=$(($sec_size - $sym_addr)) > + > if [[ -z $sym_size ]] || [[ $sym_size -le 0 ]]; then > - warn "bad symbol size: base: $sym_base end: $sym_end" > + warn "bad symbol size: sym_addr: $sym_addr cur_sym_addr: $cur_sym_addr" > DONE=1 > return > fi > + > sym_size=0x$(printf %x $sym_size) > > - # calculate the address > - local addr=$(($sym_base + $offset)) > + # Calculate the section address from user-supplied offset: > + local addr=$(($sym_addr + $offset)) > if [[ -z $addr ]] || [[ $addr = 0 ]]; then > - warn "bad address: $sym_base + $offset" > + warn "bad address: $sym_addr + $offset" > DONE=1 > return > fi > addr=0x$(printf %x $addr) > > - # weed out non-function symbols > - if [[ $sym_type != t ]] && [[ $sym_type != T ]]; then > - [[ $print_warnings = 1 ]] && > - echo "skipping $func address at $addr due to non-function symbol of type '$sym_type'" > - continue > - fi > - > - # if the user provided a size, make sure it matches the symbol's size > - if [[ -n $size ]] && [[ $size -ne $sym_size ]]; then > + # If the user provided a size, make sure it matches the symbol's size: > + if [[ -n $user_size ]] && [[ $user_size -ne $sym_size ]]; then > [[ $print_warnings = 1 ]] && > - echo "skipping $func address at $addr due to size mismatch ($size != $sym_size)" > + echo "skipping $sym_name address at $addr due to size mismatch ($user_size != $sym_size)" > continue; > fi > > - # make sure the provided offset is within the symbol's range > + # Make sure the provided offset is within the symbol's range: > if [[ $offset -gt $sym_size ]]; then > [[ $print_warnings = 1 ]] && > - echo "skipping $func address at $addr due to size mismatch ($offset > $sym_size)" > + echo "skipping $sym_name address at $addr due to size mismatch ($offset > $sym_size)" > continue > fi > > - # separate multiple entries with a blank line > + # In case of duplicates or multiple addresses specified on the > + # cmdline, separate multiple entries with a blank line: > [[ $FIRST = 0 ]] && echo > FIRST=0 > > - # pass real address to addr2line > - echo "$func+$offset/$sym_size:" > - local file_lines=$(${ADDR2LINE} -fpie $objfile $addr | sed "s; $dir_prefix\(\./\)*; ;") > - [[ -z $file_lines ]] && return > + echo "$sym_name+$offset/$sym_size:" > > + # Pass section address to addr2line and strip absolute paths > + # from the output: > + local output=$(${ADDR2LINE} -fpie $objfile $addr | sed "s; $dir_prefix\(\./\)*; ;") > + [[ -z $output ]] && continue > + > + # Default output (non --list): > if [[ $LIST = 0 ]]; then > - echo "$file_lines" | while read -r line > + echo "$output" | while read -r line > do > echo $line > done > DONE=1; > - return > + continue > fi > > - # show each line with context > - echo "$file_lines" | while read -r line > + # For --list, show each line with its corresponding source code: > + echo "$output" | while read -r line > do > echo > echo $line > @@ -184,12 +228,12 @@ __faddr2line() { > n1=$[$n-5] > n2=$[$n+5] > f=$(echo $line | sed 's/.*at \(.\+\):.*/\1/g') > - awk 'NR>=strtonum("'$n1'") && NR<=strtonum("'$n2'") { if (NR=='$n') printf(">%d<", NR); else printf(" %d ", NR); printf("\t%s\n", $0)}' $f > + ${AWK} 'NR>=strtonum("'$n1'") && NR<=strtonum("'$n2'") { if (NR=='$n') printf(">%d<", NR); else printf(" %d ", NR); printf("\t%s\n", $0)}' $f > done > > DONE=1 > > - done < <(${NM} -n $objfile | awk -v fn=$func -v end=$file_end '$3 == fn { found=1; line=$0; start=$1; next } found == 1 { found=0; print line, "0x"$1 } END {if (found == 1) print line, end; }') > + done < <(${READELF} --symbols --wide $objfile | ${AWK} -v fn=$sym_name '$4 == "FUNC" && $8 == fn') > } > > [[ $# -lt 2 ]] && usage > -- > 2.34.1 >