Received: by 2002:a05:7412:f584:b0:e2:908c:2ebd with SMTP id eh4csp84671rdb; Sat, 2 Sep 2023 00:43:39 -0700 (PDT) X-Google-Smtp-Source: AGHT+IH0A0MmfshVmq7pwNrHHkPa/f5Qgrhmvlntl6ISNwt92lGVwJTwlimQai95iAiExBZclqzB X-Received: by 2002:a05:6e02:d0c:b0:34b:ad7a:383f with SMTP id g12-20020a056e020d0c00b0034bad7a383fmr5101258ilj.0.1693640618748; Sat, 02 Sep 2023 00:43:38 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1693640618; cv=none; d=google.com; s=arc-20160816; b=DtArl+7E5IwhjvHinGzkGzGy7hHiS1UIUa2f2dWJlVamsiNbMWDr996x2QDNkVomld BSiAJVE9xVSehcbVZnNRUsPOvVcmyhd5h7P2VwgQQvLrLdEWVtN0OQeeLWBDExdQeHcu Zw5EtWUrnQsfjaB7sF12MwS8ndeipwzGGfczUJ/sbuz8WumrzgfHFFNrOj9NwwR/L2Xr MYznNOov+/NrOARorwwL/XEIt6O1W0E8qEeyYNBVWHISVjPhkMUZxc9/gNb3KbTSfg5k /gV72W6DPs0DwpZA/34e1InSNynTkLo4QekjN93oUboaZjJ4+qS7vAW+DUYSrC6BDFwX PgTw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=uRGQEprw3AuEXGqSurI5N/IxqW3m2OoQiSwJ7I+oKcA=; fh=+YlS+3bgiBAEkNSyYLZDuKHkrOncfg8Vu7nMjlZe7ig=; b=ZpZr9tSn9UbVfLzOnD2xEiD8o2c1qcBQGJ4GQT6d1EgB6T72tR4dLlMcyJD65aCMyr W2AvxEoMmHSsSzdUhTVtQYn5R1vCXkTbPSPZLZAXuby9++e6oDkG+Sw/NbcceudRibku 19snGVaDYKJawgVEgCkrQfnjcU57bd0AL5AMle1MCqplR5aeI5jh7Zn2KNVkCtcVLVmy gueaZZLjUZo78J8ZWwo4In90LG7QiniDX5b3bQH02Z6nDXTGYJQbhe9bEQtuPphTV8SG vJmPJRuMiXUXaOMvcyzax+cJ6Lj/UM0TR7k9BwFie419Av0E8O99Q+WG9WDK6A6XC37C nD6A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id z4-20020a17090ad78400b00267ed8359e2si4209477pju.55.2023.09.02.00.43.23; Sat, 02 Sep 2023 00:43:37 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1351652AbjIBGW4 (ORCPT + 99 others); Sat, 2 Sep 2023 02:22:56 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43076 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231330AbjIBGW4 (ORCPT ); Sat, 2 Sep 2023 02:22:56 -0400 Received: from 1wt.eu (ded1.1wt.eu [163.172.96.212]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 8948B10F9 for ; Fri, 1 Sep 2023 23:22:52 -0700 (PDT) Received: (from willy@localhost) by pcw.home.local (8.15.2/8.15.2/Submit) id 3826MbQZ023182; Sat, 2 Sep 2023 08:22:37 +0200 Date: Sat, 2 Sep 2023 08:22:37 +0200 From: Willy Tarreau To: Ammar Faizi Cc: Alviro Iskandar Setiawan , Thomas =?iso-8859-1?Q?Wei=DFschuh?= , David Laight , Nicholas Rosenberg , Michael William Jonathan , GNU/Weeb Mailing List , Linux Kernel Mailing List Subject: Re: [RFC PATCH v2 1/4] tools/nolibc: x86-64: Use `rep movsb` for `memcpy()` and `memmove()` Message-ID: <20230902062237.GA23141@1wt.eu> References: <20230902055045.2138405-1-ammarfaizi2@gnuweeb.org> <20230902055045.2138405-2-ammarfaizi2@gnuweeb.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_PASS,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, Sep 02, 2023 at 01:11:06PM +0700, Ammar Faizi wrote: > On Sat, Sep 02, 2023 at 01:07:50PM +0700, Alviro Iskandar Setiawan wrote: > > Btw, sir, this can be simplified more by merging the forward copy > > path, only using two "rep movsb" for both memmove() and memcpy() > > should be enough? > > ``` > > __asm__ ( > > ".section .text.nolibc_memmove_memcpy\n" > > ".weak memmove\n" > > ".weak memcpy\n" > > "memmove:\n" > > "movq %rdx, %rcx\n" > > "movq %rdi, %rdx\n" > > "movq %rdi, %rax\n" > > "subq %rsi, %rdx\n" > > "cmpq %rcx, %rdx\n" > > "jnb __nolibc_forward_copy\n" > > "leaq -1(%rdi, %rcx, 1), %rdi\n" > > "leaq -1(%rsi, %rcx, 1), %rsi\n" > > "std\n" > > "rep movsb\n" > > "cld\n" > > "retq\n" > > > > "memcpy:\n" > > "movq %rdi, %rax\n" > > "movq %rdx, %rcx\n" > > "__nolibc_forward_copy:\n" > > "rep movsb\n" > > "retq\n" > > ); > > ``` > > Looks good. I'll apply that change. Note that in this case we simply don't need a special version of memcpy(), memmove() is always OK for this, so you can simplify this further by starting with: memcpy: memmove: ... Willy