Received: by 2002:a05:6358:d09b:b0:dc:cd0c:909e with SMTP id jc27csp14243269rwb; Sun, 27 Nov 2022 20:45:37 -0800 (PST) X-Google-Smtp-Source: AA0mqf72FqRxEjPu1szTAy8dYoOPawGQ8a54/wXVYNC5xpDibpvx6BqUmMwKG2+E3HiyfDbmn0ut X-Received: by 2002:a17:906:a58:b0:7ad:b45c:dbca with SMTP id x24-20020a1709060a5800b007adb45cdbcamr30613774ejf.388.1669610737308; Sun, 27 Nov 2022 20:45:37 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1669610737; cv=none; d=google.com; s=arc-20160816; b=nKsboDnL1mSSgniBcz5zYvbX50U1cVQ26jlbhbu5gF+8/ZRVzftIMgAjbqkK1R1wuK I/dv1/HkfOfCftiR6QlhWOFG4zQA/qkC2jwOGC9yKbo68xe0WkJ3nL9jUvFgmiLzGSCA Mr20+d0PimVN1RrLXiq8487+TiplqDFcMOSq56g9zsuICn9Lp1iIGpSAH1eTW8KBgi0u /4e/y/zTYK3XjDZAZ0SpTKl1F63H5Lf8mfaPh2H72V173SlEblg8Je8Jfi+Tru6sF+Rm Tu1pU6tdLYKU5kfYYxrrGaWSydFBg7Xme/sXz5IdducfhDnstGTER7bu8bsAupwjKmG3 a7uw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=j8is9grJlEIcW1zm8IYQ/4d0JawRYOIGTyqoOOkKQgo=; b=JCDsdCRaNJsMrMY4CnCj1UVxHAcy6Kj5y6xdcGSWMRyneGbzYU4Wt0xIBd2iJiM8sd R8irkZT+8CC3kM6cPlpDTywxuHDQ+qdeYz4EfYCCPICeLtGxAyY8ArISkSbZu6cDaJGn ZdOHbpteftYgnWTtF06KczPD7R9CU6Uzct/v9oSEKrp5YwUQU1dflfMpuV6iBNmjqaDe oynY9BJYwE3Xslazvp1fIvTmAtGIfN8+R/RarTXZWuLT2qmWrkoeBebHNTrExBwWGk1S qhCm8UqJtEcVUfumE4Qq1+HaSXU7hDcIewUG+SQMHzRGFA6SOjvIP9C4pN7threWJk2Q 4QSw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=antgroup.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id dn20-20020a17090794d400b007ae754729d1si1739699ejc.883.2022.11.27.20.45.17; Sun, 27 Nov 2022 20:45:37 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=antgroup.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229518AbiK1EVl (ORCPT + 84 others); Sun, 27 Nov 2022 23:21:41 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55784 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229893AbiK1EUq (ORCPT ); Sun, 27 Nov 2022 23:20:46 -0500 Received: from out0-138.mail.aliyun.com (out0-138.mail.aliyun.com [140.205.0.138]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id DEEDC384 for ; Sun, 27 Nov 2022 20:19:42 -0800 (PST) X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R781e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018047192;MF=houwenlong.hwl@antgroup.com;NM=1;PH=DS;RN=17;SR=0;TI=SMTPD_---.QITc3YH_1669609178; Received: from localhost(mailfrom:houwenlong.hwl@antgroup.com fp:SMTPD_---.QITc3YH_1669609178) by smtp.aliyun-inc.com; Mon, 28 Nov 2022 12:19:38 +0800 Date: Mon, 28 Nov 2022 12:19:38 +0800 From: "Hou Wenlong" To: "H. Peter Anvin" Cc: linux-kernel@vger.kernel.org, Juergen Gross , "Srivatsa S. Bhat (VMware)" , Alexey Makhalov , VMware PV-Drivers Reviewers , Thomas Gleixner , Ingo Molnar , Borislav Petkov , Dave Hansen , x86@kernel.org, Peter Zijlstra , Josh Poimboeuf , Kees Cook , Song Liu , Nadav Amit , virtualization@lists.linux-foundation.org Subject: Re: [PATCH v2] x86/paravirt: Use relative reference for original instruction Message-ID: <20221128041938.GA42449@k08j02272.eu95sqa> References: <73c9ffac157087da78af9fca59cf7d8db7f17226.1669290510.git.houwenlong.hwl@antgroup.com> <20221128030320.GA101008@k08j02272.eu95sqa> <169A82BE-E5A9-4DB6-9CBE-055699F00213@zytor.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <169A82BE-E5A9-4DB6-9CBE-055699F00213@zytor.com> User-Agent: Mutt/1.5.21 (2010-09-15) X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,SPF_HELO_NONE, SPF_PASS,UNPARSEABLE_RELAY autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Nov 27, 2022 at 07:34:39PM -0800, H. Peter Anvin wrote: > On November 27, 2022 7:03:20 PM PST, Hou Wenlong wrote: > >On Sun, Nov 27, 2022 at 09:24:34AM -0800, H. Peter Anvin wrote: > >> On November 24, 2022 3:51:53 AM PST, Hou Wenlong wrote: > >> >Similar to the alternative patching, use relative reference for original > >> >instruction rather than absolute one, which saves 8 bytes for one entry > >> >on x86_64. And it could generate R_X86_64_PC32 relocation instead of > >> >R_X86_64_64 relocation, which also reduces relocation metadata on > >> >relocatable builds. And the alignment could be hard coded to be 4 now. > >> > > >> >Signed-off-by: Hou Wenlong > >> >--- > >> > arch/x86/include/asm/paravirt.h | 10 +++++----- > >> > arch/x86/include/asm/paravirt_types.h | 8 ++++---- > >> > arch/x86/kernel/alternative.c | 8 +++++--- > >> > 3 files changed, 14 insertions(+), 12 deletions(-) > >> > > >> >diff --git a/arch/x86/include/asm/paravirt.h b/arch/x86/include/asm/paravirt.h > >> >index 2851bc2339d5..e56065ea73f2 100644 > >> >--- a/arch/x86/include/asm/paravirt.h > >> >+++ b/arch/x86/include/asm/paravirt.h > >> >@@ -735,16 +735,16 @@ extern void default_banner(void); > >> > > >> > #else /* __ASSEMBLY__ */ > >> > > >> >-#define _PVSITE(ptype, ops, word, algn) \ > >> >+#define _PVSITE(ptype, ops) \ > >> > 771:; \ > >> > ops; \ > >> > 772:; \ > >> > .pushsection .parainstructions,"a"; \ > >> >- .align algn; \ > >> >- word 771b; \ > >> >+ .align 4; \ > >> >+ .long 771b-.; \ > >> > .byte ptype; \ > >> > .byte 772b-771b; \ > >> >- _ASM_ALIGN; \ > >> >+ .align 4; \ > >> > .popsection > >> > > >> > > >> >@@ -752,7 +752,7 @@ extern void default_banner(void); > >> > #ifdef CONFIG_PARAVIRT_XXL > >> > > >> > #define PARA_PATCH(off) ((off) / 8) > >> >-#define PARA_SITE(ptype, ops) _PVSITE(ptype, ops, .quad, 8) > >> >+#define PARA_SITE(ptype, ops) _PVSITE(ptype, ops) > >> > #define PARA_INDIRECT(addr) *addr(%rip) > >> > > >> > #ifdef CONFIG_DEBUG_ENTRY > >> >diff --git a/arch/x86/include/asm/paravirt_types.h b/arch/x86/include/asm/paravirt_types.h > >> >index 8c1da419260f..68952ae07a3f 100644 > >> >--- a/arch/x86/include/asm/paravirt_types.h > >> >+++ b/arch/x86/include/asm/paravirt_types.h > >> >@@ -5,7 +5,7 @@ > >> > #ifndef __ASSEMBLY__ > >> > /* These all sit in the .parainstructions section to tell us what to patch. */ > >> > struct paravirt_patch_site { > >> >- u8 *instr; /* original instructions */ > >> >+ s32 instr_offset; /* original instructions */ > >> > u8 type; /* type of this instruction */ > >> > u8 len; /* length of original instruction */ > >> > }; > >> >@@ -273,11 +273,11 @@ extern struct paravirt_patch_template pv_ops; > >> > #define _paravirt_alt(insn_string, type) \ > >> > "771:\n\t" insn_string "\n" "772:\n" \ > >> > ".pushsection .parainstructions,\"a\"\n" \ > >> >- _ASM_ALIGN "\n" \ > >> >- _ASM_PTR " 771b\n" \ > >> >+ " .align 4\n" \ > >> >+ " .long 771b-.\n" \ > >> > " .byte " type "\n" \ > >> > " .byte 772b-771b\n" \ > >> >- _ASM_ALIGN "\n" \ > >> >+ " .align 4\n" \ > >> > ".popsection\n" > >> > > >> > /* Generate patchable code, with the default asm parameters. */ > >> >diff --git a/arch/x86/kernel/alternative.c b/arch/x86/kernel/alternative.c > >> >index 111b809f0ac2..6eea563a098d 100644 > >> >--- a/arch/x86/kernel/alternative.c > >> >+++ b/arch/x86/kernel/alternative.c > >> >@@ -1232,20 +1232,22 @@ void __init_or_module apply_paravirt(struct paravirt_patch_site *start, > >> > { > >> > struct paravirt_patch_site *p; > >> > char insn_buff[MAX_PATCH_LEN]; > >> >+ u8 *instr; > >> > > >> > for (p = start; p < end; p++) { > >> > unsigned int used; > >> > > >> >+ instr = (u8 *)&p->instr_offset + p->instr_offset; > >> > BUG_ON(p->len > MAX_PATCH_LEN); > >> > /* prep the buffer with the original instructions */ > >> >- memcpy(insn_buff, p->instr, p->len); > >> >- used = paravirt_patch(p->type, insn_buff, (unsigned long)p->instr, p->len); > >> >+ memcpy(insn_buff, instr, p->len); > >> >+ used = paravirt_patch(p->type, insn_buff, (unsigned long)instr, p->len); > >> > > >> > BUG_ON(used > p->len); > >> > > >> > /* Pad the rest with nops */ > >> > add_nops(insn_buff + used, p->len - used); > >> >- text_poke_early(p->instr, insn_buff, p->len); > >> >+ text_poke_early(instr, insn_buff, p->len); > >> > } > >> > } > >> > extern struct paravirt_patch_site __start_parainstructions[], > >> > >> Any reason that you couldn't use the same patching code? > > > >Sorry, what do you mean using the same patching code? Do you > >mean that share some code between apply_alternatives() and > >apply_paravirt()? > > Yes. Abstract the facility rather than duplicate. The structure of patching entry is different between paravirt patching and alternative patching. The only same logic of those two patching functions is iteration in the section. The patching way is really diffferent. I can abstract the facility like this: #define for_each_patch_entry(p, start, end, patch_func) \ for (p = start; p < end; p++) { \ u8 *instr; \ char insn_buff[MAX_PATCH_LEN]; \ instr = p->instr_offset + p->instr_offset; \ BUG_ON(p->len > MAX_PATCH_LEN); \ used = patch_func(p, instr, insn_buff); \ add_nops(insn_buff + used, p->len - used); \ text_poke_early(instr, insn_buff, p->len); \ } But I think it is ugly :(. Do you have any better ideas?