From: Matteo Croce
To: linux-riscv@lists.infradead.org
Cc: linux-kernel@vger.kernel.org, linux-arch@vger.kernel.org,
    Paul Walmsley, Palmer Dabbelt, Albert Ou, Atish Patra,
    Emil Renner Berthing, Akira Tsukamoto, Drew Fustini, Bin Meng,
    David Laight, Guo Ren, Christoph Hellwig
Subject: [PATCH v4 0/3] riscv: optimized mem* functions
Date: Sun, 19 Sep 2021 21:21:01 +0200
Message-Id: <20210919192104.98592-1-mcroce@linux.microsoft.com>

Replace the assembly mem{cpy,move,set} with C equivalents.
Try to access RAM with the largest bit width possible, but without
doing unaligned accesses.

A further improvement could be to use multiple reads and writes, as the
assembly version was trying to do.

Tested on a BeagleV Starlight with a SiFive U74 core, where the
improvement is noticeable.

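For readers who want a concrete picture of "largest bit width possible,
but without doing unaligned accesses", here is a minimal userspace
sketch. It is not the code from this series: sketch_memcpy, sketch_word
and SKETCH_MIN_THRESHOLD are hypothetical names, and the only value
taken from the cover letter is the 16-byte threshold mentioned in the
v1 -> v2 changelog. The sketch assumes GCC or Clang (for the may_alias
attribute) and covers only the simple case where source and destination
share the same misalignment; otherwise it falls back to a byte loop.

```c
#include <stddef.h>
#include <stdint.h>

/* Assumed cutoff: shorter copies are not worth the alignment setup. */
#define SKETCH_MIN_THRESHOLD 16

/*
 * Native machine word. may_alias (a GCC/Clang extension) lets this
 * userspace sketch access arbitrary buffers through the type without
 * violating strict aliasing.
 */
typedef unsigned long __attribute__((__may_alias__)) sketch_word;

void *sketch_memcpy(void *dest, const void *src, size_t count)
{
	unsigned char *d = dest;
	const unsigned char *s = src;
	const uintptr_t mask = sizeof(sketch_word) - 1;

	/*
	 * Word-sized accesses are used only when both pointers have the
	 * same offset within a word, so aligning one aligns the other.
	 */
	if (count >= SKETCH_MIN_THRESHOLD &&
	    (((uintptr_t)d ^ (uintptr_t)s) & mask) == 0) {
		/* Head: copy single bytes until the pointers are aligned. */
		while ((uintptr_t)d & mask) {
			*d++ = *s++;
			count--;
		}

		/* Body: one aligned word load and store per iteration. */
		while (count >= sizeof(sketch_word)) {
			*(sketch_word *)d = *(const sketch_word *)s;
			d += sizeof(sketch_word);
			s += sizeof(sketch_word);
			count -= sizeof(sketch_word);
		}
	}

	/* Tail, short copies, and mutually misaligned buffers. */
	while (count--)
		*d++ = *s++;

	return dest;
}
```

A call such as sketch_memcpy(dst, src, len) behaves like memcpy() for
non-overlapping buffers. The "multiple reads and writes" improvement
mentioned above would correspond to unrolling the body loop so that
several words are loaded before they are stored.
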
v3 -> v4:
- incorporate changes from proposed generic version:
  https://lore.kernel.org/lkml/20210617152754.17960-1-mcroce@linux.microsoft.com/

v2 -> v3:
- alias mem* to __mem* and not vice versa
- use __alias instead of a tail call

v1 -> v2:
- reduce the threshold from 64 to 16 bytes
- fix KASAN build
- optimize memset

Matteo Croce (3):
  riscv: optimized memcpy
  riscv: optimized memmove
  riscv: optimized memset

 arch/riscv/include/asm/string.h |  18 ++--
 arch/riscv/kernel/Makefile      |   1 -
 arch/riscv/kernel/riscv_ksyms.c |  17 ----
 arch/riscv/lib/Makefile         |   4 +-
 arch/riscv/lib/memcpy.S         | 108 ----------------------
 arch/riscv/lib/memmove.S        |  64 -------------
 arch/riscv/lib/memset.S         | 113 -----------------------
 arch/riscv/lib/string.c         | 154 ++++++++++++++++++++++++++++++++
 8 files changed, 164 insertions(+), 315 deletions(-)
 delete mode 100644 arch/riscv/kernel/riscv_ksyms.c
 delete mode 100644 arch/riscv/lib/memcpy.S
 delete mode 100644 arch/riscv/lib/memmove.S
 delete mode 100644 arch/riscv/lib/memset.S
 create mode 100644 arch/riscv/lib/string.c

-- 
2.31.1