Received: by 2002:a05:6358:7058:b0:131:369:b2a3 with SMTP id 24csp5463919rwp; Mon, 17 Jul 2023 04:48:45 -0700 (PDT) X-Google-Smtp-Source: APBJJlHk5vvGWFwZfpBxUviI0/4w7nxY28atMGpAxj/o2GsOkcR0U3E7JmaTsLPRzc3z7wRkfXlb X-Received: by 2002:aa7:cd17:0:b0:51e:1837:d5be with SMTP id b23-20020aa7cd17000000b0051e1837d5bemr13166843edw.24.1689594525349; Mon, 17 Jul 2023 04:48:45 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689594525; cv=none; d=google.com; s=arc-20160816; b=TKNEqG4uzGYD2JABveisN9jCg/OK+AI5qRMc/TIV+OU0u+ljLDP0KsVFxiLhQaVAc/ DvEfzJnP9r8zDGiQ4HsbOJ1kGdAq+zfzLDlOEt7i+x1XZrRZDHdL3WHLOgShD2RnTnXY BVsUQbPikp9YVP76hJrEZIk1YthWmQovDkTrWdxfTTESk+Y9/R+lMq+AyLdMOl5wD0mm stfsyp5g9W9+zKaUQYOK5zjShO+DyE5pw1BVApS6vp2sHbYtcIqHInenTUQJgixFjGRX HljzQJ4ix0CH5z61Fu3kodm2gaep3Ru0V0EZBYEnDL69oaXS+KIGpLWTIBK5pKHstS4O 9RVg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:from:subject:message-id:mime-version:date :dkim-signature; bh=xYv4V4b0xoYvwQpX2K4ZRcvLQvhVYz6YxW8k/QuP+50=; fh=O56S7FQoQXDYw0ENjMcjZy7yKrdXtC51zI+f09bWPts=; b=cSqGb9FzD8X0NIKwTEoUeWa0D+IongYt3MetIdEHTQaJlx4RWrPXbsWszKy8D1rl9U LOcNeXq9C50bPNfJkdIhp4NZ6wYFwAqSmMjdD1xORIel4xs8vm8/iSwxMk1hxVztMn93 qYDEG5LG5kczXPWAWv2UiV/Bkt8fkjc0DPXSESFCmGEXiRa6sQELg7tlUJTQebx0tgKp UWAMTG8MMPrIPbnPeGy8DeN2zJv304YrG/kOE6ZajofisuDaI16AZPHHr/Rsx1DP1kws voJtIOqjDJQnD/+aWiUQRxzZTwcl6ID5WksdPmSkxARdIJYP7l3A0s9v1thWDCKC7iUC QUCQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20221208 header.b=JswCLXIy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id b6-20020aa7d486000000b0051bedb63244si13139183edr.372.2023.07.17.04.48.20; Mon, 17 Jul 2023 04:48:45 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20221208 header.b=JswCLXIy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229732AbjGQLhP (ORCPT + 99 others); Mon, 17 Jul 2023 07:37:15 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:46716 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229669AbjGQLhP (ORCPT ); Mon, 17 Jul 2023 07:37:15 -0400 Received: from mail-yb1-xb49.google.com (mail-yb1-xb49.google.com [IPv6:2607:f8b0:4864:20::b49]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id F144DBF for ; Mon, 17 Jul 2023 04:37:13 -0700 (PDT) Received: by mail-yb1-xb49.google.com with SMTP id 3f1490d57ef6-c926075a50cso8591235276.1 for ; Mon, 17 Jul 2023 04:37:13 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20221208; t=1689593833; x=1692185833; h=cc:to:from:subject:message-id:mime-version:date:from:to:cc:subject :date:message-id:reply-to; bh=xYv4V4b0xoYvwQpX2K4ZRcvLQvhVYz6YxW8k/QuP+50=; b=JswCLXIyw4aNMNLj4Os5G6MDwT36EIoBrxHx0J5DU1NgUCBeXfgLJxmSB13+rw6YwG +xueK+VeS7KyZAxDokA/xpDESoMC9moujQtUitazxJaMMw3SG0P4bGjMfBSk/1Yx2Pvy E0LDj+pR1Yx8SpWbmJIbU8obQayy0crOe0nwPDFaL8/dROh8Y0Za8tn6q0MW8LSM78fN /FtUJCJPzbZarIYNqQTRLSVknHi+k/WEGOm16T4r1HlYR3K/tV+gL2Y/mza5ViqxXRd+ XtycMHmxEBxqEZUJEWhrv5HybxGNp+qTBtYZKWpJ65XPaK7pUqwp+n9WA2hTm7K+L+jd 3cxA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1689593833; x=1692185833; h=cc:to:from:subject:message-id:mime-version:date:x-gm-message-state :from:to:cc:subject:date:message-id:reply-to; bh=xYv4V4b0xoYvwQpX2K4ZRcvLQvhVYz6YxW8k/QuP+50=; b=Br5Da33Ks65UPgDHR1QQy/2xS1U2un4lStqV23E7bFzYcjXDlnrnEh3CTQoY5UgAX0 xsxjUQBl02fb8q7H6sLgdcwX26C8pgKLVrXvKtXr2+m8H0QFs4tfTHOGmhVgKROQIM+O 6yg6kRIFkU/hkObW50kZ+RR/0YQifO0zl+5wgK70wxlgpjQzW3DZy3ZjTRauJvprZ99C wHaINS9n+BAvC49sGzyT1Vumj6kKrptJyfuK0xxGBbypdDsCF9mfhjGNvu6vLWX3cDce /ba/VYmC8mvtFvnFQ78Ef+8wkc3eQJb93nl43hCAQGhIKGe+p1/iwbLsPiPV0x1A/UU5 XGUA== X-Gm-Message-State: ABy/qLb8Ku8L+2fb8ttWFd/wMe4l8gl3hsg/cjecJaW2/XmwFXEzE/c0 91GuF+dRAfq33fiyUlKBReya6cxr3xU= X-Received: from glider.muc.corp.google.com ([2a00:79e0:9c:201:3b6e:8102:6db8:d83f]) (user=glider job=sendgmr) by 2002:a05:6902:11c9:b0:c2c:1b68:99b0 with SMTP id n9-20020a05690211c900b00c2c1b6899b0mr186434ybu.5.1689593833274; Mon, 17 Jul 2023 04:37:13 -0700 (PDT) Date: Mon, 17 Jul 2023 13:37:03 +0200 Mime-Version: 1.0 X-Mailer: git-send-email 2.41.0.255.g8b1d071c50-goog Message-ID: <20230717113709.328671-1-glider@google.com> Subject: [PATCH v3 0/5] Implement MTE tag compression for swapped pages From: Alexander Potapenko To: glider@google.com, catalin.marinas@arm.com, will@kernel.org, pcc@google.com, andreyknvl@gmail.com, andriy.shevchenko@linux.intel.com, linux@rasmusvillemoes.dk, yury.norov@gmail.com Cc: linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, eugenis@google.com, syednwaris@gmail.com, william.gray@linaro.org Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-9.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE, USER_IN_DEF_DKIM_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Currently, when MTE pages are swapped out, the tags are kept in the memory, occupying 128 bytes per page. This is especially problematic for devices that use zram-backed in-memory swap, because tags stored uncompressed in the heap effectively reduce the available amount of swap memory. The RLE-based EA0 algorithm suggested by Evgenii Stepanov and implemented in this patch series is able to efficiently compress 128-byte tag buffers, resulting in practical compression ratio between 2.5x and 20x. In most cases it is possible to store the compressed data in 63-bit Xarray values, resulting in no extra memory allocations. Our measurements show that EA0 provides better compression than existing kernel compression algorithms (LZ4, LZO, LZ4HC, ZSTD) can offer, because EA0 specifically targets 128-byte buffers. To implement compression/decompression, we also extend with methods to set/get bit values at arbitrary places in the map. We refactor arch/arm64/mm/mteswap.c to support both the compressed (CONFIG_ARM64_MTE_COMP) and non-compressed case. For the former, in addition to tag compression, we move tag allocation from kmalloc() to separate kmem caches, providing greater locality and relaxing the alignment requirements. v3: - as suggested by Andy Shevchenko, use bitmap_get_value()/bitmap_set_value() written by Syed Nayyar Waris - switched to unsigned long to reduce typecasts - simplified the compression code v2: - as suggested by Yuri Norov, replace the poorly implemented struct bitq with Alexander Potapenko (5): lib/bitmap: add bitmap_{set,get}_value() lib/test_bitmap: add tests for bitmap_{set,get}_value() arm64: mte: implement CONFIG_ARM64_MTE_COMP arm64: mte: add a test for MTE tags compression arm64: mte: add compression support to mteswap.c arch/arm64/Kconfig | 20 ++ arch/arm64/include/asm/mtecomp.h | 60 +++++ arch/arm64/mm/Makefile | 7 + arch/arm64/mm/mtecomp.c | 406 +++++++++++++++++++++++++++++++ arch/arm64/mm/mteswap.c | 20 +- arch/arm64/mm/mteswap.h | 12 + arch/arm64/mm/mteswap_comp.c | 52 ++++ arch/arm64/mm/mteswap_nocomp.c | 38 +++ arch/arm64/mm/test_mtecomp.c | 177 ++++++++++++++ include/linux/bitmap.h | 57 +++++ lib/test_bitmap.c | 33 +++ 11 files changed, 871 insertions(+), 11 deletions(-) create mode 100644 arch/arm64/include/asm/mtecomp.h create mode 100644 arch/arm64/mm/mtecomp.c create mode 100644 arch/arm64/mm/mteswap.h create mode 100644 arch/arm64/mm/mteswap_comp.c create mode 100644 arch/arm64/mm/mteswap_nocomp.c create mode 100644 arch/arm64/mm/test_mtecomp.c -- 2.41.0.255.g8b1d071c50-goog