Received: by 2002:a25:7ec1:0:0:0:0:0 with SMTP id z184csp4270497ybc; Fri, 15 Nov 2019 01:45:58 -0800 (PST) X-Google-Smtp-Source: APXvYqzxO3UxUvLLOmc3fWuoiI17poxAaHtTrCV2i9GiA8Ojgh1vtrIoq996hC/n1fjFCF55ZzUF X-Received: by 2002:a17:906:c2d6:: with SMTP id ch22mr12839465ejb.262.1573811158121; Fri, 15 Nov 2019 01:45:58 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1573811158; cv=none; d=google.com; s=arc-20160816; b=Jw7qb0mbKMHPZvuo6/a53ryuKP08k22wJBJhoUl6ltjvbReSo+f1Y8+eMJI/k34PG6 X4fD6iSXvDKKRkEX3YLTEXfCiT5DgyieqHZfx8RunYkZcXQpQismTIpDLVQbhwlb1JqE NzpVII9Yh9gD1RMrw4Se2wV//xemCH4FIXOgvbWNxeji4H2WqFs9CrzN91ChboQn4xT7 n+lrL38qduRDpKYLTaKxcnxwiDSiSXJB+QM1zFUR+siuU1qYUvm9o4JyNlsq/kmVku4n JUOWwebcc+8WFtgJTNV32Hu49Y235mh8IG32kJY/cjTIrvueIvRCjcGyfNpgAKzk2iEm eOlw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=Nd+GWqpY6xytXpZ7fQNDkvv/bDp7nSKSF2zNgI28YP8=; b=I7wb6yj6DKLHvPKl38A4mZ9UpdV7oaP+tqWlpuRwzTX6AhoYe1I/ZN+/gUjyA0zTyd zQHnHYzTdzQCahOEZGG7L1T5IvMIVluUGVxB39xuZY17N71AEXmWbUA5UmrlLVM7Qm1l 84WfEQmSFYQ1jdtClACK4wkAVUfeO1yZro0u/c5wy5HhkJJm92sZ/cMByLbJYOMI2UG8 Ju0SwqkzE8tU2qzVBzmqo1uEYnNCkSDP5HTOQFgS232KAfN2SqUm7fNtSnjmWOpVO8f6 oPGj4CO3R0pWUQTGQKjRC2CYgmrERb0YUjAbWaJQSAeUPwd/rkubqcPLWrllgqZkA1qQ ojXQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id l3si5033424ejr.86.2019.11.15.01.45.30; Fri, 15 Nov 2019 01:45:58 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727543AbfKOJn4 (ORCPT + 99 others); Fri, 15 Nov 2019 04:43:56 -0500 Received: from out30-43.freemail.mail.aliyun.com ([115.124.30.43]:34080 "EHLO out30-43.freemail.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727355AbfKOJny (ORCPT ); Fri, 15 Nov 2019 04:43:54 -0500 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R911e4;CH=green;DM=||false|;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01e01422;MF=shile.zhang@linux.alibaba.com;NM=1;PH=DS;RN=11;SR=0;TI=SMTPD_---0Ti8RdaY_1573811029; Received: from ali-6c96cfdd1403.local(mailfrom:shile.zhang@linux.alibaba.com fp:SMTPD_---0Ti8RdaY_1573811029) by smtp.aliyun-inc.com(127.0.0.1); Fri, 15 Nov 2019 17:43:51 +0800 Subject: Re: [RFC PATCH v3 6/7] scripts/sorttable: Add ORC unwind tables sort concurrently To: Peter Zijlstra Cc: Josh Poimboeuf , Masahiro Yamada , Michal Marek , Thomas Gleixner , Ingo Molnar , Borislav Petkov , x86@kernel.org, "H . Peter Anvin" , linux-kernel@vger.kernel.org, linux-kbuild@vger.kernel.org References: <20191115064750.47888-1-shile.zhang@linux.alibaba.com> <20191115064750.47888-7-shile.zhang@linux.alibaba.com> <20191115090723.GS4114@hirez.programming.kicks-ass.net> From: Shile Zhang Message-ID: <9594afbc-52bc-5ae7-4a19-8fc4b36a1abd@linux.alibaba.com> Date: Fri, 15 Nov 2019 17:43:49 +0800 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.14; rv:60.0) Gecko/20100101 Thunderbird/60.9.0 MIME-Version: 1.0 In-Reply-To: <20191115090723.GS4114@hirez.programming.kicks-ass.net> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/11/15 17:07, Peter Zijlstra wrote: > On Fri, Nov 15, 2019 at 02:47:49PM +0800, Shile Zhang wrote: > >> +#if defined(SORTTABLE_64) && defined(UNWINDER_ORC_ENABLED) >> +/* ORC unwinder only support X86_64 */ >> +#include >> +#include >> +#include >> + >> +#define ORC_REG_UNDEFINED 0 >> +#define ERRSTRING_MAXSZ 256 >> + >> +struct orc_entry { >> + s16 sp_offset; >> + s16 bp_offset; >> + unsigned sp_reg:4; >> + unsigned bp_reg:4; >> + unsigned type:2; >> + unsigned end:1; >> +} __attribute__((packed)); >> + >> +struct orctable_info { >> + size_t orc_size; >> + size_t orc_ip_size; >> +} orctable; > There's ./arch/x86/include/asm/orc_types.h for this. Please don't > duplicate. objtool uses that same header. Good catch! Thanks for your kindly reminder! I'll remove it. >> +/** >> + * sort - sort an array of elements >> + * @base: pointer to data to sort >> + * @num: number of elements >> + * @size: size of each element >> + * @cmp_func: pointer to comparison function >> + * @swap_func: pointer to swap function >> + * >> + * This function does a heapsort on the given array. You may provide a >> + * swap_func function optimized to your element type. >> + * >> + * Sorting time is O(n log n) both on average and worst-case. While >> + * qsort is about 20% faster on average, it suffers from exploitable >> + * O(n*n) worst-case behavior and extra memory requirements that make >> + * it less suitable for kernel use. >> + * >> + * This code token out of /lib/sort.c. >> + */ >> +static void sort(void *base, size_t num, size_t size, >> + int (*cmp_func)(const void *, const void *), >> + void (*swap_func)(void *, void *, int size)) >> +{ >> + /* pre-scale counters for performance */ >> + int i = (num/2 - 1) * size, n = num * size, c, r; >> + >> + /* heapify */ >> + for ( ; i >= 0; i -= size) { >> + for (r = i; r * 2 + size < n; r = c) { >> + c = r * 2 + size; >> + if (c < n - size && >> + cmp_func(base + c, base + c + size) < 0) >> + c += size; >> + if (cmp_func(base + r, base + c) >= 0) >> + break; >> + swap_func(base + r, base + c, size); >> + } >> + } >> + >> + /* sort */ >> + for (i = n - size; i > 0; i -= size) { >> + swap_func(base, base + i, size); >> + for (r = 0; r * 2 + size < i; r = c) { >> + c = r * 2 + size; >> + if (c < i - size && >> + cmp_func(base + c, base + c + size) < 0) >> + c += size; >> + if (cmp_func(base + r, base + c) >= 0) >> + break; >> + swap_func(base + r, base + c, size); >> + } >> + } >> +} > Do we really need to copy the heapsort implementation? That is, why not > use libc's qsort() ? This is userspace after all. Yes, I think qsort is better choice than copy-paste here. But qsort does not support customized swap func, which is needed for ORC unwind swap two tables together. I think it's hard to do with qsort, so I used sort same with original orc unwind table sort.