From: Vimal Agrawal
To: mcgrof@kernel.org
Cc: hch@infradead.org, masahiroy@kernel.org, michal.lkml@markovi.net,
    ndesaulniers@google.com, JBeulich@suse.com, jeffm@suse.com,
    sam@ravnborg.org, linux-kbuild@vger.kernel.org, jeyu@kernel.org,
    linux-kernel@vger.kernel.org, avimalin@gmail.com, nishit.shah@sophos.com,
    Vimal Agrawal, Dirk VanDerMerwe
Subject: [PATCH v5] modules: add heuristic when stripping unneeded symbols
Date: Fri, 4 Feb 2022 14:09:32 +0530
Message-Id: <20220204083933.136096-1-vimal.agrawal@sophos.com>
X-Mailer: git-send-email 2.32.0
In-Reply-To: <20220203060641.119522-1-vimal.agrawal@sophos.com>
References: <20220203060641.119522-1-vimal.agrawal@sophos.com>
X-Mailing-List: linux-kernel@vger.kernel.org

If kernel modules are stripped of symbols for some reason, stack traces in
dmesg do not show the symbol name for an address.
Sometimes only the absolute address is printed (when there is no good match
with any symbol).

This was seen with OpenWrt, which uses INSTALL_MOD_STRIP="--strip-unneeded"
at kernel/module build/install time, so modules are stripped of unneeded
symbols.

[245864.699580] do_nmi+0x12f/0x370
[245864.699583] end_repeat_nmi+0x16/0x50
[245864.699585] RIP: 0010:0xffffffffc06b67ec <<<<<<<<
[245864.699585] RSP: 0000:ffffaaa540cffe48 EFLAGS: 00000097
[245864.699586] RAX: 0000000000000001 RBX: ffff93357a729000 RCX: 0000000000000001
[245864.699587] RDX: ffff93357a729050 RSI: 0000000000000000 RDI: ffff93357a729000
[245864.699588] RBP: ffff9335cf521300 R08: 0000000000000001 R09: 0000000000000004
[245864.699588] R10: ffffaaa545b23ed0 R11: 0000000000000001 R12: ffffffffc06b61a0
[245864.699589] R13: ffffaaa540cffe60 R14: ffff9335c77fa3c0 R15: ffff9335cf51d7c0
[245864.699590] ? 0xffffffffc06b61a0
[245864.699592] ? 0xffffffffc06b67ec <<<<<<<<
[245864.699593] ? 0xffffffffc06b67ec
[245864.699594]

Note that RIP: 0010:0xffffffffc06b67ec and 0xffffffffc06b67ec are printed as
absolute addresses in the stack trace above. There is no easy way to decode
them if the box crashes, because the load address of the specific module is
lost.

This changes the symbol decoding (in kernel/module.c) so that it prints the
offset from the start of the section (.text or .init.text) when there is no
good match with any symbol.

In such cases the address is now decoded to [module]+offset/size or
[module __init]+offset/size, depending on where the address lies (in the
core/.text or the init/.init.text section of the module).

One can use objdump/readelf/nm to find symbols by their offset from the
.init.text and .text sections.

steps to reproduce the problem:
-------------------------------
1. Add WARN_ON_ONCE(1) in a module, e.g. test_module.c
2. Build and strip the module using the --strip-unneeded option
3. Load the module and check RIP in dmesg

tests done:
-----------
1. Added WARN_ON_ONCE(1) in functions of a module for testing
-------------------------------------------------------------
[ 407.934085] CPU: 0 PID: 2956 Comm: insmod Tainted: G W E 5.16.0-rc5-next-20211220+ #2
[ 407.934087] Hardware name: innotek GmbH VirtualBox/VirtualBox, BIOS VirtualBox 12/01/2006
[ 407.934088] RIP: 0010:[module __init]+0x4/0x7 [test_module]
[ 407.934097] Code: Unable to access opcode bytes at RIP 0xffffffffc07edfda.
[ 407.934098] RSP: 0018:ffffb21440487c20 EFLAGS: 00010202
[ 407.934100] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[ 407.934101] RDX: 0000000000000000 RSI: ffffffff9c38e5e1 RDI: 0000000000000001
[ 407.934102] RBP: ffffb21440487c28 R08: 0000000000000000 R09: ffffb21440487a20
[ 407.934103] R10: ffffb21440487a18 R11: ffffffff9c755248 R12: ffffffffc07ee007
[ 407.934104] R13: ffff92a0f1e260b0 R14: 0000000000000000 R15: 0000000000000000
[ 407.934105] FS: 00007f578ebc4400(0000) GS:ffff92a1c0e00000(0000) knlGS:0000000000000000
[ 407.934107] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 407.934108] CR2: ffffffffc07edfda CR3: 00000000063ea006 CR4: 00000000000706f0
[ 407.934113] Call Trace:
[ 407.934114]
[ 407.934116] ? init_module+0x55/0xff9 [test_module]
...
[ 407.934232] CPU: 0 PID: 2956 Comm: insmod Tainted: G W E 5.16.0-rc5-next-20211220+ #2
[ 407.934234] Hardware name: innotek GmbH VirtualBox/VirtualBox, BIOS VirtualBox 12/01/2006
[ 407.934242] RIP: 0010:[module]+0x4/0x7 [test_module]
[ 407.934248] Code: Unable to access opcode bytes at RIP 0xffffffffc07e1fda.
[ 407.934249] RSP: 0018:ffffb21440487c20 EFLAGS: 00010202
[ 407.934251] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[ 407.934252] RDX: 0000000000000000 RSI: ffffffff9c38e5e1 RDI: 0000000000000001
[ 407.934253] RBP: ffffb21440487c28 R08: 0000000000000000 R09: ffffb21440487a20
[ 407.934254] R10: ffffb21440487a18 R11: ffffffff9c755248 R12: ffffffffc07ee007
[ 407.934255] R13: ffff92a0f1e260b0 R14: 0000000000000000 R15: 0000000000000000
[ 407.934256] FS: 00007f578ebc4400(0000) GS:ffff92a1c0e00000(0000) knlGS:0000000000000000
[ 407.934257] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 407.934258] CR2: ffffffffc07e1fda CR3: 00000000063ea006 CR4: 00000000000706f0
[ 407.934260] Call Trace:
[ 407.934260]
[ 407.934261] ? init_module+0x5a/0xff9 [test_module]

Note that RIP is now decoded to an offset from the module start or the init
start.

Tested on linux-next (tag next-20211220).

Signed-off-by: Vimal Agrawal
Acked-by: Nishit Shah
Suggested-by: Dirk VanDerMerwe
---
 kernel/module.c | 26 +++++++++++++++++++++++---
 1 file changed, 23 insertions(+), 3 deletions(-)

diff --git a/kernel/module.c b/kernel/module.c
index 24dab046e16c..4de15c06e760 100644
--- a/kernel/module.c
+++ b/kernel/module.c
@@ -4270,14 +4270,21 @@ static const char *find_kallsyms_symbol(struct module *mod,
 					unsigned long *offset)
 {
 	unsigned int i, best = 0;
-	unsigned long nextval, bestval;
+	unsigned long baseval, nextval, bestval;
 	struct mod_kallsyms *kallsyms = rcu_dereference_sched(mod->kallsyms);
+	char *module_base_name;
 
 	/* At worse, next value is at end of module */
-	if (within_module_init(addr, mod))
+	if (within_module_init(addr, mod)) {
+		baseval = (unsigned long)mod->init_layout.base;
 		nextval = (unsigned long)mod->init_layout.base+mod->init_layout.text_size;
-	else
+		module_base_name = "[module __init]";
+
+	} else {
+		baseval = (unsigned long)mod->core_layout.base;
 		nextval = (unsigned long)mod->core_layout.base+mod->core_layout.text_size;
+		module_base_name = "[module]";
+	}
 
 	bestval = kallsyms_symbol_value(&kallsyms->symtab[best]);
 
@@ -4308,6 +4315,19 @@ static const char *find_kallsyms_symbol(struct module *mod,
 			nextval = thisval;
 	}
 
+	if ((is_module_text_address(addr) &&
+	     (bestval < baseval || bestval > nextval))) {
+		/*
+		 * return MODULE base and offset if we could not find
+		 * any best match for text address
+		 */
+		if (size)
+			*size = nextval - baseval;
+		if (offset)
+			*offset = addr - baseval;
+		return module_base_name;
+	}
+
 	if (!best)
 		return NULL;
 
-- 
2.32.0
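
For readers who want to try the fallback arithmetic outside the kernel, here
is a minimal userspace sketch. It is not part of the patch. The names
struct text_region, region_fallback() and describe_addr() are hypothetical,
the range check on addr is only a simplified stand-in for
is_module_text_address(), and nextval is assumed to be the end of the text
region (the "at worst" case where no later symbol narrows it).

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical stand-in for one text region of a module. */
struct text_region {
	unsigned long long base;      /* start of .text or .init.text */
	unsigned long long text_size; /* size of the text in this region */
	const char *label;            /* "[module]" or "[module __init]" */
};

/*
 * Returns the region label and fills *size and *offset when bestval
 * (the value of the closest symbol found) lies outside
 * [base, nextval]. Returns NULL when the symbol is usable as-is or
 * when addr is not inside the region's text.
 */
static const char *region_fallback(const struct text_region *r,
				   unsigned long long addr,
				   unsigned long long bestval,
				   unsigned long long nextval,
				   unsigned long long *size,
				   unsigned long long *offset)
{
	unsigned long long baseval = r->base;

	/* Simplified assumption: "text address" means inside this region. */
	if (addr < baseval || addr >= baseval + r->text_size)
		return NULL;
	/* The best symbol is inside the region: normal decoding applies. */
	if (bestval >= baseval && bestval <= nextval)
		return NULL;
	if (size)
		*size = nextval - baseval;
	if (offset)
		*offset = addr - baseval;
	return r->label;
}

/* Formats addr as label+0xoffset/0xsize, or as a raw address. */
static int describe_addr(char *buf, size_t len, const struct text_region *r,
			 unsigned long long addr, unsigned long long bestval)
{
	unsigned long long size = 0, offset = 0;
	unsigned long long nextval = r->base + r->text_size;
	const char *name;

	assert(buf != NULL && len > 0);
	name = region_fallback(r, addr, bestval, nextval, &size, &offset);
	if (!name)
		return snprintf(buf, len, "0x%llx", addr);
	return snprintf(buf, len, "%s+0x%llx/0x%llx", name, offset, size);
}
```

With an assumed region of base 0xffffffffc07e2000, text_size 0x7 and label
"[module]", calling describe_addr() for base + 0x4 with a bestval of 0
produces "[module]+0x4/0x7", the same shape as the RIP lines in the test
output. The base address here is made up for illustration.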