Received: by 2002:a05:6a10:f3d0:0:0:0:0 with SMTP id a16csp1766365pxv; Sat, 10 Jul 2021 14:33:55 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzmoJDj0+kRX+fpuQ3DwgSHvptpXMcPRf7stXLJuV9ogugD5BrWWcK+mYHwXCbnNr1VT8Ya X-Received: by 2002:a05:6402:2681:: with SMTP id w1mr55948471edd.275.1625952835515; Sat, 10 Jul 2021 14:33:55 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1625952835; cv=none; d=google.com; s=arc-20160816; b=kHjMTcRIVWVkdi3R2ABwmpTiFlQnjpBj+C4HqlOqu9I7XxoBdDlkIuiR/Q3+FAhUJ3 Yr93wFZquxPkmDyXJea6Uo/0f99/ptDHZH/51XUL8l0nG0sOel4W7zudtwnWKCF+XSZV wfMxCkKZKdDdb2RXsuuNkkqbXopvPQb1CQ10vXbvIZZ7N13TMbmoYHVHn2G+f4SLcvnM TfWf1LXkkm+/pP4jrxT+HyHFbZHwZo2Bni1Q1mMW6gQqfPuXSD9qN6+8+8EoAWHL9Rzq Dj3fkUC8UZJB9Ub4fHR5Z2cS9l7sU60/Bs1Cjr6aqGFCzHxK7J6vmF5fNi73xpeqQJ6F 5O3A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=wkrNRfC+kCNESABWMDP2BHuD9GTPlQfrGrj1yx8nQys=; b=e3o/9OR8PpAHjFuSqFBDNNrRGaFF0k5R/cEUnvjUxm3Sy4gB1L/6r1uR4tbXAmQkUd mGw2fdAZ9rNYnPPZSnotlsOuUOPPQG/kFVjyCg0NS4PLAgw46mfw1X5Ol1r05iw0ccUE 5m42cLPfKZFrwOSSRg19ewi78/eUAt3eDVB+Vi44nrmZ38e3flwnqmIZRQ1FNjBW9KIP MnagTdFcHh0zvFV9zRUlQZb2dQE+STNmtdujMAQ5YN3IeVm8XDJfk4rdRyhuv5e2oYd2 19VTkeaKforn10buKY4QnQDtGLuDl4MomfmhJbnv6SAHrG66HWI+FC0Mdqncu9/4mE1V 0UaA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linux-foundation.org header.s=korg header.b=YumewwPz; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id v16si12189235edy.327.2021.07.10.14.33.20; Sat, 10 Jul 2021 14:33:55 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linux-foundation.org header.s=korg header.b=YumewwPz; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229674AbhGJVd4 (ORCPT + 99 others); Sat, 10 Jul 2021 17:33:56 -0400 Received: from mail.kernel.org ([198.145.29.99]:56490 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229515AbhGJVd4 (ORCPT ); Sat, 10 Jul 2021 17:33:56 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 318D661353; Sat, 10 Jul 2021 21:31:10 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linux-foundation.org; s=korg; t=1625952670; bh=BhBrfVjiyW6tp+kvAEQqA+E6E0jDxEmrABGM+LD2iDE=; h=Date:From:To:Cc:Subject:In-Reply-To:References:From; b=YumewwPz/9BP0RTGzpUxeqL4PJJ4OfJkwnfq1BY5RJzjIZuRPzCzTJQlke7FWeRex l+9/3WdNu6ozxDiKthf+6mHMStX4b4pkvnMW+/rCy5B1+t+v2Mx78IXMh809uXo2zb CslWQUnTE4guJikE8f/uBeNbhimRkTvWYDzXUYmY= Date: Sat, 10 Jul 2021 14:31:09 -0700 From: Andrew Morton To: Matteo Croce Cc: linux-kernel@vger.kernel.org, Nick Kossifidis , Guo Ren , Christoph Hellwig , David Laight , Palmer Dabbelt , Emil Renner Berthing , Drew Fustini , linux-arch@vger.kernel.org, Nick Desaulniers , linux-riscv@lists.infradead.org Subject: Re: [PATCH v2 0/3] lib/string: optimized mem* functions Message-Id: <20210710143109.fd5062902ef4d5d59e83f5bb@linux-foundation.org> In-Reply-To: <20210702123153.14093-1-mcroce@linux.microsoft.com> References: <20210702123153.14093-1-mcroce@linux.microsoft.com> X-Mailer: Sylpheed 3.5.1 (GTK+ 2.24.32; x86_64-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 2 Jul 2021 14:31:50 +0200 Matteo Croce wrote: > From: Matteo Croce > > Rewrite the generic mem{cpy,move,set} so that memory is accessed with > the widest size possible, but without doing unaligned accesses. > > This was originally posted as C string functions for RISC-V[1], but as > there was no specific RISC-V code, it was proposed for the generic > lib/string.c implementation. > > Tested on RISC-V and on x86_64 by undefining __HAVE_ARCH_MEM{CPY,SET,MOVE} > and HAVE_EFFICIENT_UNALIGNED_ACCESS. > > These are the performances of memcpy() and memset() of a RISC-V machine > on a 32 mbyte buffer: > > memcpy: > original aligned: 75 Mb/s > original unaligned: 75 Mb/s > new aligned: 114 Mb/s > new unaligned: 107 Mb/s > > memset: > original aligned: 140 Mb/s > original unaligned: 140 Mb/s > new aligned: 241 Mb/s > new unaligned: 241 Mb/s Did you record the x86_64 performance? Which other architectures are affected by this change?