Received: by 2002:a25:ca44:0:0:0:0:0 with SMTP id a65csp205121ybg; Tue, 28 Jul 2020 04:00:21 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwLw6thOVSlZ0WU53e6fx10W5y89h6VoAIfpVQCS+9BkIKbYA8n/Dyddeqv55plMpwFpdLB X-Received: by 2002:a17:907:7255:: with SMTP id ds21mr872505ejc.44.1595934021337; Tue, 28 Jul 2020 04:00:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1595934021; cv=none; d=google.com; s=arc-20160816; b=KreozzkJUnTrmJvuVJ6kYYVtb683rnNO145dhpeSABEiJ4KZPXYDvXuqxILs1FNJut 7aLQETLqiiBEyEr87M0tY1Ixkhv0MZsPkNiC8CjF0BRNyPN7XWTjVl+deFJeURvrC4Tv e7Z0ewfcTGnQ3asAHJzKW1cwNAltpXYb7Qa7JZ3xPLh/Kt0H7GM8TPoOKP1jHWyJM6q1 pE20spu3hUpu35i7901+KSE5Lm6XJgTMiLLp8o8jHLvRrF/hIPxQmD07tdoMD64SweMj Qd/+QxMJESvAkz9dfH17yq/W+faWBt7lt6OXO70MmPY64/As8K58LIhQnkPulZvICCO5 oPGA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=hMaIzU0CQG5VsDB9Q1oxGuL/5Kw9sF/vvnRCz2yl0V8=; b=ynEUIezvf2fUxbuHibq+Qt8tQ/JeHI2gVKwcmdicmS3ilzVb1J+VhFcRfn2aTtGNSQ I5YSmJZZtS4Jg4PgQEZyt7vmchYc+CE2pdV1cbLvkUe5H1d+fVhZ1GzLkDJ4/zqguqhw kpSj+QK3xZjJ4cSozfe8YYpBjJpYj8GTJdMjBCR9BQfdvVklyVNkHRKR2rjq0TCJEDCw gNfOAGtdWCzz9YslHHjNmOdWoVT6T2j2YK6YOitzFoIC0P8lFTu0QhRsEQhaZIzviWRl uyg+Cd36Tb8eCudh8QuW7Xa65jGRwEBVBjHluqKqNNK8eWpHGzDyeCAZKxbqFnT+VquG 1RWg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=JWZMBJ83; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id k12si7340001edl.489.2020.07.28.03.59.58; Tue, 28 Jul 2020 04:00:21 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=JWZMBJ83; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728918AbgG1K45 (ORCPT + 99 others); Tue, 28 Jul 2020 06:56:57 -0400 Received: from mail.kernel.org ([198.145.29.99]:33646 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728588AbgG1K44 (ORCPT ); Tue, 28 Jul 2020 06:56:56 -0400 Received: from mail-ot1-f53.google.com (mail-ot1-f53.google.com [209.85.210.53]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id C7018208E4 for ; Tue, 28 Jul 2020 10:56:55 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1595933815; bh=FgcTnYXgA0rEllPu4tLOBKhCeVE/MwrCLcbdciPSbZY=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=JWZMBJ83jcH+bMosW6PNvkNvoWQPeb7JhsUAcIsXH0RZeNllMlPJT1xMLvgmjTKPp vUm6eCWf+sHFJ3LKfsBi4FI6OX2BHOaj8nhfqe2CxFJR1ERRt0Km/1EDolfGIxGQla Qv2zNsS8W/alO3sphsiOVCTdgD560qEw8lMxAVG0= Received: by mail-ot1-f53.google.com with SMTP id o72so8991278ota.11 for ; Tue, 28 Jul 2020 03:56:55 -0700 (PDT) X-Gm-Message-State: AOAM530FjhwZAwWhFNjMBQXDnxdsuEbvyjSItjwxt3lQnTEBA9Z3Edki +QTsE2eoOrYzQOBZvAj+vdZ6u9y6dtEheUTeTq0= X-Received: by 2002:a9d:3a04:: with SMTP id j4mr10807490otc.108.1595933815109; Tue, 28 Jul 2020 03:56:55 -0700 (PDT) MIME-Version: 1.0 References: <20200724050553.1724168-1-jarkko.sakkinen@linux.intel.com> <20200724050553.1724168-6-jarkko.sakkinen@linux.intel.com> <20200724092746.GD517988@gmail.com> <20200725031648.GG17052@linux.intel.com> <20200726081408.GB2927915@kernel.org> <20200728171715.0800093e2226e3d72b04a3ae@kernel.org> In-Reply-To: <20200728171715.0800093e2226e3d72b04a3ae@kernel.org> From: Ard Biesheuvel Date: Tue, 28 Jul 2020 13:56:43 +0300 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH v5 5/6] kprobes: Use text_alloc() and text_free() To: Masami Hiramatsu Cc: Mike Rapoport , Jarkko Sakkinen , Ingo Molnar , Linux Kernel Mailing List , linux-mm@kvack.org, Andi Kleen , Peter Zijlstra , "Naveen N. Rao" , Anil S Keshavamurthy , "David S. Miller" , Jessica Yu Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 28 Jul 2020 at 11:17, Masami Hiramatsu wrote: > > On Sun, 26 Jul 2020 19:06:20 +0300 > Ard Biesheuvel wrote: > > > On Sun, 26 Jul 2020 at 11:14, Mike Rapoport wrote: > > > > > > On Sat, Jul 25, 2020 at 06:16:48AM +0300, Jarkko Sakkinen wrote: > > > > On Fri, Jul 24, 2020 at 11:27:46AM +0200, Ingo Molnar wrote: > > > > > > > > > > * Jarkko Sakkinen wrote: > > > > > > > > > > > Use text_alloc() and text_free() instead of module_alloc() and > > > > > > module_memfree() when an arch provides them. > > > > > > > > > > > > Cc: linux-mm@kvack.org > > > > > > Cc: Andi Kleen > > > > > > Cc: Masami Hiramatsu > > > > > > Cc: Peter Zijlstra > > > > > > Signed-off-by: Jarkko Sakkinen > > > > > > --- > > > > > > kernel/kprobes.c | 9 +++++++++ > > > > > > 1 file changed, 9 insertions(+) > > > > > > > > > > > > diff --git a/kernel/kprobes.c b/kernel/kprobes.c > > > > > > index 4e46d96d4e16..611fcda9f6bf 100644 > > > > > > --- a/kernel/kprobes.c > > > > > > +++ b/kernel/kprobes.c > > > > > > @@ -40,6 +40,7 @@ > > > > > > #include > > > > > > #include > > > > > > #include > > > > > > +#include > > > > > > > > > > > > #define KPROBE_HASH_BITS 6 > > > > > > #define KPROBE_TABLE_SIZE (1 << KPROBE_HASH_BITS) > > > > > > @@ -111,12 +112,20 @@ enum kprobe_slot_state { > > > > > > > > > > > > void __weak *alloc_insn_page(void) > > > > > > { > > > > > > +#ifdef CONFIG_ARCH_HAS_TEXT_ALLOC > > > > > > + return text_alloc(PAGE_SIZE); > > > > > > +#else > > > > > > return module_alloc(PAGE_SIZE); > > > > > > +#endif > > > > > > } > > > > > > > > > > > > void __weak free_insn_page(void *page) > > > > > > { > > > > > > +#ifdef CONFIG_ARCH_HAS_TEXT_ALLOC > > > > > > + text_free(page); > > > > > > +#else > > > > > > module_memfree(page); > > > > > > +#endif > > > > > > } > > > > > > > > > > I've read the observations in the other threads, but this #ifdef > > > > > jungle is silly, it's a de-facto open coded text_alloc() with a > > > > > module_alloc() fallback... > > > > > > > > In the previous version I had: > > > > > > > > https://lore.kernel.org/lkml/20200717030422.679972-4-jarkko.sakkinen@linux.intel.com/ > > > > > > > > and I had just calls to text_alloc() and text_free() in corresponding > > > > snippet to the above. > > > > > > > > I got this feedback from Mike: > > > > > > > > https://lore.kernel.org/lkml/20200718162359.GA2919062@kernel.org/ > > > > > > > > I'm not still sure that I fully understand this feedback as I don't see > > > > any inherent and obvious difference to the v4. In that version fallbacks > > > > are to module_alloc() and module_memfree() and text_alloc() and > > > > text_memfree() can be overridden by arch. > > > > > > Let me try to elaborate. > > > > > > There are several subsystems that need to allocate memory for executable > > > text. As it happens, they use module_alloc() with some abilities for > > > architectures to override this behaviour. > > > > > > For many architectures, it would be enough to rename modules_alloc() to > > > text_alloc(), make it built-in and this way allow removing dependency on > > > MODULES. > > > > > > Yet, some architectures have different restrictions for code allocation > > > for different subsystems so it would make sense to have more than one > > > variant of text_alloc() and a single config option ARCH_HAS_TEXT_ALLOC > > > won't be sufficient. > > > > > > I liked Mark's suggestion to have text_alloc_() and proposed > > > a way to introduce text_alloc_kprobes() along with > > > HAVE_KPROBES_TEXT_ALLOC to enable arch overrides of this function. > > > > > > The major difference between your v4 and my suggestion is that I'm not > > > trying to impose a single ARCH_HAS_TEXT_ALLOC as an alternative to > > > MODULES but rather to use per subsystem config option, e.g. > > > HAVE_KPROBES_TEXT_ALLOC. > > > > > > Another thing, which might be worth doing regardless of the outcome of > > > this discussion is to rename alloc_insn_pages() to text_alloc_kprobes() > > > because the former is way too generic and does not emphasize that the > > > instruction page is actually used by kprobes only. > > The name of the insn_pages came from the struct kprobe_insn_page, so > if there is a text_alloc_kprobe(), I'm OK to rename it. (anyway, that > is an allocation operator, we don't call it directly.) > > > Masami or Peter should correct me if I am wrong, but it seems to me > > that the way kprobes uses these pages does not require them to be in > > relative branching range of the core kernel on any architecture, given > > that they are populated with individual instruction opcodes that are > > executed in single step mode, and relative branches are emulated (when > > needed) > > Actually, x86 and arm has the "relative branching range" requirements > for the jump optimized kprobes. For the other architectures, I think > we don't need it. Only executable text buffer is needed. > Thanks for the explanation. Today, arm64 uses the definition below. void *alloc_insn_page(void) { return __vmalloc_node_range(PAGE_SIZE, 1, VMALLOC_START, VMALLOC_END, GFP_KERNEL, PAGE_KERNEL_ROX, VM_FLUSH_RESET_PERMS, NUMA_NO_NODE, __builtin_return_address(0)); } Do you think we could use that as the generic implementation if we use MODULES_START/_END as the allocation window?