Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1759363AbbLCMli (ORCPT ); Thu, 3 Dec 2015 07:41:38 -0500 Received: from us01smtprelay-2.synopsys.com ([198.182.47.9]:47405 "EHLO smtprelay.synopsys.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751927AbbLCMlh (ORCPT ); Thu, 3 Dec 2015 07:41:37 -0500 From: Vineet Gupta To: CC: , , , Vineet Gupta Subject: [RFC 00/17] ARC Dwarf unwinder improvements Date: Thu, 3 Dec 2015 18:10:58 +0530 Message-ID: <1449146475-15335-1-git-send-email-vgupta@synopsys.com> X-Mailer: git-send-email 1.9.1 MIME-Version: 1.0 Content-Type: text/plain X-Originating-IP: [10.12.197.182] Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2089 Lines: 54 Hi guys, In light of perf -g stalling (as unwinder was taking ~3million cycles for non existent entries), I've revamped the dwarf unwinder. There are some optim tweaks and much of it is "De-generalization" for things which we can safely assume on ARC. Crude Instrumentation shows following improvements per unwinder call: - Avg time come down from ~4650 cycles to ~2794 cycles (+40%) - Max time come down 9793 cycles to 5987 cycles This is on a SMP FPGA config @ 75 MHz It seems much of time (65%) is taken for binary lookup thru ~12k FDE entries, roughly 13 lookups, each likely a dcache miss. git://git.kernel.org/pub/scm/linux/kernel/git/vgupta/arc # topic-unwinder-rework-4-instrument -Vineet Vineet Gupta (17): ARC: dw2 unwind: Elide generation of const propagated clones ARC: dw2 unwind: remove unused cruft ARC: dw2 unwind: Remove handling of for signal frame ARC: dw2 unwind: Remove FP based unwinding ARC: dw2 unwind: Better printing ARC: dw2 unwind: Don't verify Main FDE Table size everytime ARC: dw2 unwind: Refactor the FDE lookup table (eh_frame_header) code ARC: dw2 unwind: Don't verify FDE lookup table metadata ARC: dw2 unwind: Use striaght forward code to implement binary lookup ARC: dw2 unwind: CIE parsing/validation done only once at startup ARC: dw2 unwind: Elide REG_INVALID check ARC: dw2 unwind: Elide a loop if DW_CFA_register not present ARC: dw2 unwind: Assume all regs to be unsigned long ARC: dw2 unwind: No need for __get_user ARC: dw2 unwind: Single exit point for instrumentation ARC: dw2 unwind: skip regs not updated xxx: instrument arch/arc/include/asm/unwind.h | 47 +-- arch/arc/kernel/Makefile | 1 + arch/arc/kernel/unwind.c | 806 ++++++++++++++++-------------------------- 3 files changed, 313 insertions(+), 541 deletions(-) -- 1.9.1 -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/