Received: by 2002:a05:6359:c8b:b0:c7:702f:21d4 with SMTP id go11csp1772436rwb; Fri, 23 Sep 2022 19:07:25 -0700 (PDT) X-Google-Smtp-Source: AMsMyM7rUW6BQ8Bv7O6k21aInhZJHgBL4oqyTPl+CBjPHPgtJdyA7imnjmy3rJsKSWUO8WNWG8Rf X-Received: by 2002:a17:903:22d0:b0:177:e5b2:c598 with SMTP id y16-20020a17090322d000b00177e5b2c598mr11362663plg.56.1663985245533; Fri, 23 Sep 2022 19:07:25 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1663985245; cv=none; d=google.com; s=arc-20160816; b=WwzGTmj3IvZi2EIaOOihqiPU5GQRaikmF+fDq/CRwOD2/5fuPWrp3BrOi9iQKkur/4 x4dBDZLQtKYIX7HUfsCtZITCLf04R/p46DWHWS6u6CAfFcVOHx3FV5ctslPaK/ZIgudT 8l/cfYtkG0ieM+U6mQCOs60oCbg7oVyjPCZgFXQhDfRIAVrgGwEKAAOFKlS8gaLqLC99 5WM9VU06lOHdjleC23nRwS5pJBeDJInxX/j32C8vALQtjzQqYtOFWYaD3eDerulM2Ipq 5Sg88QoBfOt5e1SSbnor3FGbjC7KmtvcJlmNMu7u9XOB7eAByAhwek8DM7X6clDSrmmP QEqA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:subject:user-agent:mime-version:date:message-id; bh=Rlymp3+wPaJvsujvUscm1mMCVlrG7i5JhVNx3oFgGt4=; b=pkIXEOfEG1vNAmvv2GefDG91043tD47WdtF3fl08sUuAfP1MYcriE4JI7UWMbTX76H OaMGyGJyJxNKm3CTicJOw1vba7Zypppqh6B6OToGYr+doXkxgGXEljUOXrIsZygjRo7Q pc600ZHCe8kE2nYjC7uImZxMuYAZHqDrI8dtkjZ3SoRu8TmYwh8JISlABA5KPVW5pEdl tIqepvcXDxOFbRdc4G34H8+fyjGEJ7lH/+amo5DXqMEbUHMphkEqKs4ky+NE6zCl4FfI 9h7JFhI5YgQW2XMQgiVgx5CT3NhAeCeePvPOBz2pMjsDU1SLIsCy6Xz5e9sDHg/3kdHN 2zZQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id 13-20020a63010d000000b0043c22e9127bsi5596245pgb.831.2022.09.23.19.07.14; Fri, 23 Sep 2022 19:07:25 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232990AbiIXBwe (ORCPT + 99 others); Fri, 23 Sep 2022 21:52:34 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:33630 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232982AbiIXBwc (ORCPT ); Fri, 23 Sep 2022 21:52:32 -0400 Received: from szxga03-in.huawei.com (szxga03-in.huawei.com [45.249.212.189]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 0ACFB10196D; Fri, 23 Sep 2022 18:52:32 -0700 (PDT) Received: from kwepemi500012.china.huawei.com (unknown [172.30.72.57]) by szxga03-in.huawei.com (SkyGuard) with ESMTP id 4MZBmT4r3qzHqM9; Sat, 24 Sep 2022 09:50:17 +0800 (CST) Received: from [10.67.110.108] (10.67.110.108) by kwepemi500012.china.huawei.com (7.221.188.12) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.31; Sat, 24 Sep 2022 09:52:29 +0800 Message-ID: <19ffdae6-8484-08e5-f408-ab39a97ce5c0@huawei.com> Date: Sat, 24 Sep 2022 09:52:28 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.4.0 Subject: Re: [PATCH 3/3] arm64/kprobe: Optimize the performance of patching single-step slot To: Mark Rutland CC: , , , , , , , , , , , , , References: <20220923084658.99304-1-liaochang1@huawei.com> <20220923084658.99304-4-liaochang1@huawei.com> From: "liaochang (A)" In-Reply-To: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8bit X-Originating-IP: [10.67.110.108] X-ClientProxiedBy: dggems702-chm.china.huawei.com (10.3.19.179) To kwepemi500012.china.huawei.com (7.221.188.12) X-CFilter-Loop: Reflected X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_MED,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 在 2022/9/23 20:39, Mark Rutland 写道: > On Fri, Sep 23, 2022 at 04:46:58PM +0800, Liao Chang wrote: >> Single-step slot would not be used until kprobe is enabled, that means >> no race condition occurs on it under SMP, hence it is safe to pacth ss >> slot without stopping machine. > > I think this is correct, but this depends on a couple of subtleties, > importantly: > > * That the I-cache maintenance for these instructions is complete *before* the > kprobe BRK is written (and aarch64_insn_patch_text_nosync() ensures this, but > just omits causing a Context-Synchronization-Event on all CPUS). So in order to guarantee the I-cache maintenance is observed on all CPUS, it needs to be followed by a explicit Context-Synchronization-Event, perhaps it is better to place ISB before kprobe BRK is written. > > * That the kprobe BRK results in an exception (and consequently a > Context-Synchronoization-Event), which ensures that the CPU will fetch the > single-step slot instructions *after* this, ensuring that the new > instructions are used. Yes, because of single-step slot is installed int the BRK execption handler, so it is not necessary to generate Context-Synchronization-Event via ISB mentioned above... Thanks. > > It would be good if we could call that out explicitly. > > Thanks, > Mark. > >> Since I and D caches are coherent within single-step slot from >> aarch64_insn_patch_text_nosync(), hence no need to do it again via >> flush_icache_range(). >> >> Acked-by: Masami Hiramatsu (Google) >> Signed-off-by: Liao Chang >> --- >> arch/arm64/kernel/probes/kprobes.c | 7 ++----- >> 1 file changed, 2 insertions(+), 5 deletions(-) >> >> diff --git a/arch/arm64/kernel/probes/kprobes.c b/arch/arm64/kernel/probes/kprobes.c >> index d1d182320245..29b98bc12833 100644 >> --- a/arch/arm64/kernel/probes/kprobes.c >> +++ b/arch/arm64/kernel/probes/kprobes.c >> @@ -44,13 +44,10 @@ post_kprobe_handler(struct kprobe *, struct kprobe_ctlblk *, struct pt_regs *); >> static void __kprobes arch_prepare_ss_slot(struct kprobe *p) >> { >> kprobe_opcode_t *addr = p->ainsn.api.insn; >> - void *addrs[] = {addr, addr + 1}; >> - u32 insns[] = {p->opcode, BRK64_OPCODE_KPROBES_SS}; >> >> /* prepare insn slot */ >> - aarch64_insn_patch_text(addrs, insns, 2); >> - >> - flush_icache_range((uintptr_t)addr, (uintptr_t)(addr + MAX_INSN_SIZE)); >> + aarch64_insn_patch_text_nosync(addr, p->opcode); >> + aarch64_insn_patch_text_nosync(addr + 1, BRK64_OPCODE_KPROBES_SS); >> >> /* >> * Needs restoring of return address after stepping xol. >> -- >> 2.17.1 >> > > . -- BR, Liao, Chang