Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp914311pxj; Fri, 14 May 2021 20:59:23 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwzrHJDuSe0ibBB+USVUOjEMRzgbjqVKT39P+lXSngVrDdI+uwL2LomWRw2Atokvq2UKMCv X-Received: by 2002:a05:6402:2712:: with SMTP id y18mr62018352edd.41.1621051163211; Fri, 14 May 2021 20:59:23 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1621051163; cv=none; d=google.com; s=arc-20160816; b=f/Y/j+rurpnjvtx70MLwxtmlm36a4BE0j0mD+ltmE5QFqQqs1kppu8/JTMsiE9kaNY XvA5FGQ+XRvwl6LkDX7gxAGEo7eBid0sT1ogyQwarGgj2po96wqMcvWmuXcYCKOUuFKn JerFV5h2u105rNv/xplQ8Ph3qwp85imEhII0vKWCyCRHo4TIWOA/z2YFwq0+SGr6zEt3 /F+TRqSCYqYA9a0jQ15U4yLp1t1qtnJ1pcS4q9h3KSnq0Mex3M5NkD244OeOloWc2cEW molTbZEaho2ypxN4KakF4zYgNHaE1SqRpzf6YxrYOi4lQFFKKX+DIEMKXYc/H54LC5Po 6kZg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:to:from:subject:mime-version:message-id:date :dkim-signature; bh=c6kHqF9mJQYVoqbE7Z/Zt84eKpUDeq460Qh7vhm+vhE=; b=D4XQqVTggZjT4UO1h+lYTnRfqIdDnZ3X4Cor+3ppld1MmkAyEiOCTzdztnkfe3nSUB AOsypV/tU5wKLvB15ArJsMKLrlQqSBYl8c0+OEwn5njZW6Cbp70xEgjXQD8RuWsfJO5r 7a+quWI7VJgFUjLxAzVqSftOeCO6edtDjsDF5V6oqzdKS5IeRmCk+zzYFcLwBVSd30kZ Ch4aW369yKCVBWFUQe16CBuBhea3ID0rSQB3FoAmqjL6FdTmZwWOETgZ94Wx6XQFkw8F v/dk0ZRZqp4m2jM5MslmKF7s3kdR+o1Yi103CflzQZrN7a8JvWeX3joC2OXVp4e5uVvt qi7g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=BpCJqUDv; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id y10si7650519ejp.136.2021.05.14.20.59.00; Fri, 14 May 2021 20:59:23 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=BpCJqUDv; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234681AbhEOBIE (ORCPT + 99 others); Fri, 14 May 2021 21:08:04 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53276 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234677AbhEOBIE (ORCPT ); Fri, 14 May 2021 21:08:04 -0400 Received: from mail-yb1-xb49.google.com (mail-yb1-xb49.google.com [IPv6:2607:f8b0:4864:20::b49]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 10267C06174A for ; Fri, 14 May 2021 18:06:51 -0700 (PDT) Received: by mail-yb1-xb49.google.com with SMTP id s8-20020a5b04480000b029049fb35700b9so1378751ybp.5 for ; Fri, 14 May 2021 18:06:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=date:message-id:mime-version:subject:from:to; bh=c6kHqF9mJQYVoqbE7Z/Zt84eKpUDeq460Qh7vhm+vhE=; b=BpCJqUDvDyGuB3hF0sUqajb22Nkkel1SKN78FTdQqPaYwQEWCe4sDEVdohBIqR6DX/ +/6W83lVKqgZ6Clx2KlQgIhw4Gmx+K4zouhLVROJl9RQGNMhmt0bryDVX8XNUpD3ha36 /TKIhcZa3CXGmUu9QCsXrRMK5X1LLUgSiUck1gH8mVHzA/mJyUQ5akQDzfHU2rWkKxvl ip93JlZg/L2e8i2NuQhx61inMdjwIAdX9veLWJn1Zf0KfMnXCQuNCHrL+XaXwWHvpX9R xnNfSKKyyi/WmgOuLmtBFXZQO3Wjs0RJFltAER+zPtMWGvFYUzfpzByiTfCu5mFutjdn 2yVA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:message-id:mime-version:subject:from:to; bh=c6kHqF9mJQYVoqbE7Z/Zt84eKpUDeq460Qh7vhm+vhE=; b=kAclrwkl6ugivdRBq0PxdzbaI1FrRtmL4y6pUt//TuEz6AtcgQQsP8Nya9DpP5mjp3 UTTIytPEt26W1kgYVZ6xjYRl2h8R/jegJtFGXg2IQzzcKxOPZTA5BdCS5fAdmyxDQdDb Soh8OqGDVvLR6uZuiEPpGt2aDLXXKrENq4Gaay2IkQxIWVaq4TfwcszrQ2SujhQW5n4/ 4f/sX2rUjeWXUUmIYOL31ifLBaVADPTQPehzcl7oPeSeWG9P2kTRjA8h9JWqte7+/cdu EF+VXWmf1JPfttQomZj0QiRe97msS5kblVIk6JRfAK6E4QYvnfRLZy1Y4YYuFN+Uvw2e nStg== X-Gm-Message-State: AOAM533kezwazC0duu3MBypZNhKNUPlgWFl+J6c9iQh30eSIYOqJCyKj 3g/JsN4W199Ws9OpYYe4r68c7exG49Gh X-Received: from eugenis.svl.corp.google.com ([2620:15c:2ce:200:9c55:d7a2:8f8f:5c4e]) (user=eugenis job=sendgmr) by 2002:a25:820e:: with SMTP id q14mr8001948ybk.152.1621040810242; Fri, 14 May 2021 18:06:50 -0700 (PDT) Date: Fri, 14 May 2021 18:06:43 -0700 Message-Id: <20210515010643.2340448-1-eugenis@google.com> Mime-Version: 1.0 X-Mailer: git-send-email 2.31.1.751.gd2f1c929bd-goog Subject: [PATCH] kasan: speed up mte_set_mem_tag_range From: Evgenii Stepanov To: Andrey Ryabinin , Alexander Potapenko , Andrey Konovalov , Dmitry Vyukov , Catalin Marinas , Will Deacon , Steven Price , Peter Collingbourne , Evgenii Stepanov , kasan-dev@googlegroups.com, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Use DC GVA / DC GZVA to speed up KASan memory tagging in HW tags mode. The first cacheline is always tagged using STG/STZG even if the address is cacheline-aligned, as benchmarks show it is faster than a conditional branch. Signed-off-by: Evgenii Stepanov --- arch/arm64/include/asm/mte-kasan.h | 40 +------------------ arch/arm64/lib/Makefile | 2 + arch/arm64/lib/mte-kasan.S | 63 ++++++++++++++++++++++++++++++ 3 files changed, 66 insertions(+), 39 deletions(-) create mode 100644 arch/arm64/lib/mte-kasan.S diff --git a/arch/arm64/include/asm/mte-kasan.h b/arch/arm64/include/asm/mte-kasan.h index ddd4d17cf9a0..e29a0e2ab35c 100644 --- a/arch/arm64/include/asm/mte-kasan.h +++ b/arch/arm64/include/asm/mte-kasan.h @@ -48,45 +48,7 @@ static inline u8 mte_get_random_tag(void) return mte_get_ptr_tag(addr); } -/* - * Assign allocation tags for a region of memory based on the pointer tag. - * Note: The address must be non-NULL and MTE_GRANULE_SIZE aligned and - * size must be non-zero and MTE_GRANULE_SIZE aligned. - */ -static inline void mte_set_mem_tag_range(void *addr, size_t size, - u8 tag, bool init) -{ - u64 curr, end; - - if (!size) - return; - - curr = (u64)__tag_set(addr, tag); - end = curr + size; - - /* - * 'asm volatile' is required to prevent the compiler to move - * the statement outside of the loop. - */ - if (init) { - do { - asm volatile(__MTE_PREAMBLE "stzg %0, [%0]" - : - : "r" (curr) - : "memory"); - curr += MTE_GRANULE_SIZE; - } while (curr != end); - } else { - do { - asm volatile(__MTE_PREAMBLE "stg %0, [%0]" - : - : "r" (curr) - : "memory"); - curr += MTE_GRANULE_SIZE; - } while (curr != end); - } -} - +void mte_set_mem_tag_range(void *addr, size_t size, u8 tag, bool init); void mte_enable_kernel_sync(void); void mte_enable_kernel_async(void); void mte_init_tags(u64 max_tag); diff --git a/arch/arm64/lib/Makefile b/arch/arm64/lib/Makefile index d31e1169d9b8..c06ada79a437 100644 --- a/arch/arm64/lib/Makefile +++ b/arch/arm64/lib/Makefile @@ -18,3 +18,5 @@ obj-$(CONFIG_CRC32) += crc32.o obj-$(CONFIG_FUNCTION_ERROR_INJECTION) += error-inject.o obj-$(CONFIG_ARM64_MTE) += mte.o + +obj-$(CONFIG_KASAN_HW_TAGS) += mte-kasan.o diff --git a/arch/arm64/lib/mte-kasan.S b/arch/arm64/lib/mte-kasan.S new file mode 100644 index 000000000000..9f6975e2af60 --- /dev/null +++ b/arch/arm64/lib/mte-kasan.S @@ -0,0 +1,63 @@ +/* SPDX-License-Identifier: GPL-2.0-only */ +/* + * Copyright (C) 2021 Google Inc. + */ +#include +#include + +#include + + .arch armv8.5-a+memtag + + .macro __set_mem_tag_range, stg, gva, start, size, linesize, tmp1, tmp2, tmp3 + add \tmp3, \start, \size + cmp \size, \linesize, lsl #1 + b.lt .Lsmtr3_\@ + + sub \tmp1, \linesize, #1 + bic \tmp2, \tmp3, \tmp1 + orr \tmp1, \start, \tmp1 + +.Lsmtr1_\@: + \stg \start, [\start], #MTE_GRANULE_SIZE + cmp \start, \tmp1 + b.lt .Lsmtr1_\@ + +.Lsmtr2_\@: + dc \gva, \start + add \start, \start, \linesize + cmp \start, \tmp2 + b.lt .Lsmtr2_\@ + +.Lsmtr3_\@: + cmp \start, \tmp3 + b.ge .Lsmtr4_\@ + \stg \start, [\start], #MTE_GRANULE_SIZE + b .Lsmtr3_\@ +.Lsmtr4_\@: + .endm + +/* + * Assign allocation tags for a region of memory based on the pointer tag. + * Note: The address must be non-NULL and MTE_GRANULE_SIZE aligned and + * size must be non-zero and MTE_GRANULE_SIZE aligned. + * x0 - start address + * x1 - region size + * x2 - tag + * x3 - bool init + */ +SYM_FUNC_START(mte_set_mem_tag_range) + mrs x4, dczid_el0 + and w4, w4, #0xf + mov x5, #4 + lsl x4, x5, x4 + + bfi x0, x2, #56, #8 + + cbz x3, .Lnoinit + __set_mem_tag_range stzg, gzva, x0, x1, x4, x2, x3, x5 + ret +.Lnoinit: + __set_mem_tag_range stg, gva, x0, x1, x4, x2, x3, x5 + ret +SYM_FUNC_END(mte_set_mem_tag_range) -- 2.31.1.751.gd2f1c929bd-goog