Date: Wed, 29 Jul 2020 10:50:54 +0900
From: Masami Hiramatsu
To: Ard Biesheuvel
Cc: Mike Rapoport, Jarkko Sakkinen, Ingo Molnar,
 Linux Kernel Mailing List, linux-mm@kvack.org, Andi Kleen,
 Peter Zijlstra, "Naveen N. Rao", Anil S Keshavamurthy,
 "David S. Miller", Jessica Yu
Subject: Re: [PATCH v5 5/6] kprobes: Use text_alloc() and text_free()
Message-Id: <20200729105054.06f74749eb933c08342e6dd6@kernel.org>
References: <20200724050553.1724168-1-jarkko.sakkinen@linux.intel.com>
 <20200724050553.1724168-6-jarkko.sakkinen@linux.intel.com>
 <20200724092746.GD517988@gmail.com>
 <20200725031648.GG17052@linux.intel.com>
 <20200726081408.GB2927915@kernel.org>
 <20200728171715.0800093e2226e3d72b04a3ae@kernel.org>
 <20200728223545.ce4ff78cac73b571a27bb357@kernel.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, 28 Jul 2020 20:51:08 +0300
Ard Biesheuvel wrote:

> On Tue, 28 Jul 2020 at 16:35, Masami Hiramatsu wrote:
> >
> > On Tue, 28 Jul 2020 13:56:43 +0300
> > Ard Biesheuvel wrote:
> >
> > > On Tue, 28 Jul 2020 at 11:17, Masami Hiramatsu wrote:
> > > > > Masami or Peter should correct me if I am wrong, but it seems to me
> > > > > that the way kprobes uses these pages does not require them to be in
> > > > > relative branching range of the core kernel on any architecture, given
> > > > > that they are populated with individual instruction opcodes that are
> > > > > executed in single step mode, and relative branches are emulated (when
> > > > > needed)
> > > >
> > > > Actually, x86 and arm has the "relative branching range" requirements
> > > > for the jump optimized kprobes. For the other architectures, I think
> > > > we don't need it. Only executable text buffer is needed.
> > > >
> > >
> > > Thanks for the explanation. Today, arm64 uses the definition below.
> > >
> > > void *alloc_insn_page(void)
> > > {
> > >         return __vmalloc_node_range(PAGE_SIZE, 1, VMALLOC_START, VMALLOC_END,
> > >                         GFP_KERNEL, PAGE_KERNEL_ROX, VM_FLUSH_RESET_PERMS,
> > >                         NUMA_NO_NODE, __builtin_return_address(0));
> > > }
> > >
> > > Do you think we could use that as the generic implementation if we use
> > > MODULES_START/_END as the allocation window?
> >
> > Yes, but for the generic implementation, we don't need to consider the
> > relative branching range since we can override it for x86 and arm.
> > (and that will be almost same as module_alloc() default code)
>
> Indeed. So having kprobes specific macros that default to
> VMALLOC_START/END but can be overridden would be sufficient.
>
> > BTW, is PAGE_KERNEL_ROX flag available generically?
> >
>
> Turns out that it is not :-(

Hmm, in that case, we need to use PAGE_KERNEL_EXEC.

As a result, would it look similar to this? :)

void * __weak module_alloc(unsigned long size)
{
        return __vmalloc_node_range(size, 1, VMALLOC_START, VMALLOC_END,
                        GFP_KERNEL, PAGE_KERNEL_EXEC, VM_FLUSH_RESET_PERMS,
                        NUMA_NO_NODE, __builtin_return_address(0));
}

The major difference between module_alloc() and kprobes' alloc_insn_page()
is that alloc_insn_page() makes the page ROX after allocating it, and *ONLY*
on x86 and arm64.

$ git grep -w alloc_insn_page -- arch
arch/arm64/kernel/probes/kprobes.c:void *alloc_insn_page(void)
arch/x86/kernel/kprobes/core.c:void *alloc_insn_page(void)

However, since module_alloc() has its own arch-dependent implementation on
most of the major architectures, if we implement an independent
text_alloc_kprobe(), we need to make dead copies of module_alloc() for each
architecture.

$ git grep 'module_alloc(unsigned' arch/
arch/arm/kernel/module.c:void *module_alloc(unsigned long size)
arch/arm64/kernel/module.c:void *module_alloc(unsigned long size)
arch/mips/kernel/module.c:void *module_alloc(unsigned long size)
arch/nds32/kernel/module.c:void *module_alloc(unsigned long size)
arch/nios2/kernel/module.c:void *module_alloc(unsigned long size)
arch/parisc/kernel/module.c:void *module_alloc(unsigned long size)
arch/riscv/kernel/module.c:void *module_alloc(unsigned long size)
arch/s390/kernel/module.c:void *module_alloc(unsigned long size)
arch/sparc/kernel/module.c:void *module_alloc(unsigned long size)
arch/unicore32/kernel/module.c:void *module_alloc(unsigned long size)
arch/x86/kernel/module.c:void *module_alloc(unsigned long size)

It seems that some constraints on module_alloc() exist for the above
architectures.

Anyway, the requirements for kprobes' text_alloc() are:

- It must be executable on architectures that use single-stepping
  out-of-line. (and need to be registered to KASAN?)
- It must be ROX if implemented (currently only for x86 and arm64).
- It must be within relative branching range, but only on x86 and arm.

Thank you,

-- 
Masami Hiramatsu
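
The three requirements listed above could be written down as a small
per-architecture policy table. What follows is only a userspace sketch for
discussion, not kernel code: every kp_* name is hypothetical, the two
"window" values merely stand in for the VMALLOC and MODULES ranges mentioned
in this thread, and the single-step flags in the example entries are an
illustrative assumption. Only the ROX and branch-range flags are taken from
the list above.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for PAGE_KERNEL_EXEC and PAGE_KERNEL_ROX. */
enum kp_prot { KP_PROT_EXEC, KP_PROT_ROX };

/* Hypothetical stand-ins for the VMALLOC and MODULES allocation windows. */
enum kp_window { KP_WINDOW_VMALLOC, KP_WINDOW_MODULES };

/* One entry per architecture, mirroring the three listed requirements. */
struct kp_arch_traits {
	const char *name;
	bool single_step_out_of_line;	/* needs an executable buffer */
	bool has_rox;			/* ROX mapping is implemented */
	bool needs_branch_range;	/* jump optimized kprobes */
};

/* What a text allocation for kprobes would have to satisfy. */
struct kp_text_policy {
	bool executable;
	enum kp_prot prot;
	enum kp_window window;
};

/* Derive the allocation policy from the architecture traits. */
struct kp_text_policy kp_text_policy_for(const struct kp_arch_traits *arch)
{
	struct kp_text_policy policy;

	assert(arch != NULL);
	policy.executable = arch->single_step_out_of_line;
	policy.prot = arch->has_rox ? KP_PROT_ROX : KP_PROT_EXEC;
	policy.window = arch->needs_branch_range ?
			KP_WINDOW_MODULES : KP_WINDOW_VMALLOC;
	return policy;
}

/* Example entries; single_step_out_of_line is assumed for illustration. */
const struct kp_arch_traits kp_x86 = {
	.name = "x86", .single_step_out_of_line = true,
	.has_rox = true, .needs_branch_range = true,
};
const struct kp_arch_traits kp_arm64 = {
	.name = "arm64", .single_step_out_of_line = true,
	.has_rox = true, .needs_branch_range = false,
};
const struct kp_arch_traits kp_arm = {
	.name = "arm", .single_step_out_of_line = true,
	.has_rox = false, .needs_branch_range = true,
};
```

With these entries, kp_text_policy_for(&kp_arm) selects the module window
with plain executable protection, while kp_text_policy_for(&kp_arm64) selects
the vmalloc window with ROX. An architecture without special needs would fall
through to the generic default (vmalloc window, executable only), which is
the case the __weak implementation is meant to cover.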