Received: by 2002:a05:6a10:1287:0:0:0:0 with SMTP id d7csp456045pxv; Thu, 22 Jul 2021 04:30:38 -0700 (PDT) X-Google-Smtp-Source: ABdhPJw7XZL948EghM2mKoTxTaZLd9Fdo2+PLwn70/j+CxUG8LXdmltASoXaVzcgc8539NoOjVJX X-Received: by 2002:a05:6602:218c:: with SMTP id b12mr19707005iob.82.1626953438306; Thu, 22 Jul 2021 04:30:38 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1626953438; cv=none; d=google.com; s=arc-20160816; b=C7OremUzv5ODS1iRUaJhSK7OfpOUYAmmZuo5QggrKn6+OjsUrA32vyCyecKvZwYgtF uYxOvz7xNR+hBsODhnXdD+n+Lwgbd/Vyz1HVsfH0pPK7FZwDreJfwHj0XmOQ7sqlnTYw 2iEC7jUZdX4cxUY6pAT3AbMR8DB0t1r5KpP8CqlDaSrTuAT8RANgpNBrvI1eKxuTxCuP eIfiAiXyBSV3RJ5QxOxZ0Za4+V4KQ4E9SGQZdwv4F2qn8x33mUoQPAmVe5nUTNkbnYvV IUrV62BwTs8FVWUa73gDIFk6jvPZHcKHIkunrhxqNhMnxDV7noYVyrxARXsct4GKRWY2 j7Bw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:dkim-signature; bh=0IUu5cxIaYOqC1XS0AMKZ7KuvInfL/EoTMBPiEMolG0=; b=KZlpYQ1nl++ayc5lTvJa7m/WNXLIYMK/BHbQmHcUAzkdNYwj60o4yx7G0T4oL5dkF7 5FiStq6s18d1Zum7qCGH8T60APhN62hjAI9Ii+styGS7uFf3RfnpPYGydN6QGqN5xYFi 4PcdsBF0lZoXOe0sCoqDYnUnUBUVeaffTWbu+cbGgwrg6xxHYl3rXdw4+798yZoVL566 5GtqGD1TSreLwbrQJoO4oD4T9enbdzI8iGZmvQKt1fPWIvFAOFUjFlAJE1AMp+pr3YN5 rcGf9pVW4qCExVajvDkMqSQdUMc1ba62gbl7cWVq8016FhS8a8AKUomc33fNuu+ztAGI LSxA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=QCYAMSL8; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id l15si31279847jad.101.2021.07.22.04.30.24; Thu, 22 Jul 2021 04:30:38 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=QCYAMSL8; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231683AbhGVKrg (ORCPT + 99 others); Thu, 22 Jul 2021 06:47:36 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60640 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231599AbhGVKrg (ORCPT ); Thu, 22 Jul 2021 06:47:36 -0400 Received: from mail-wm1-x331.google.com (mail-wm1-x331.google.com [IPv6:2a00:1450:4864:20::331]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3294EC061575; Thu, 22 Jul 2021 04:28:10 -0700 (PDT) Received: by mail-wm1-x331.google.com with SMTP id f190so3113701wmf.4; Thu, 22 Jul 2021 04:28:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=0IUu5cxIaYOqC1XS0AMKZ7KuvInfL/EoTMBPiEMolG0=; b=QCYAMSL8dCv+avqi2vEPn3dZ2pBZ3RjQ92wroER3XLXz/tLodZFXvsxNXJj6w/PnHT To/ChckV6ckCgrTAHDOJHsbe0tmtO5fBladJWMENDRgBndkl1q+EPsziWlBl7S4U3rnJ +J8pul7NkJEoMguS7huKS0zSBy8DnP39sm1vI+uwE8nRtrBdy/m56A4CVdS5I3fc855b z+bvgeU8agsIhbfZpNsjHzrYXMVWg14FLDGjmNe0jR7KRgNF9Z34A/bmTRUXZ4wedXsy BWTNG6L10GoHhC9vEH4BtV7slhIBPiD54r6E4X/c5T2FLw6YvFqTLN6derFOmx4wvB/E o9Ow== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=0IUu5cxIaYOqC1XS0AMKZ7KuvInfL/EoTMBPiEMolG0=; b=Rk8bFICy0gL1JVteCzxiVNg7LFBu7Nna617Tzcm3nQb9CS6KK7OhqJMy2CumD+umzw Zu4fn/X5YBvpeiQm9Hl1SEI4GBvT4AOjDIxs5H04CHod04W+LZUmGDsTaigydOc3vrr/ kueVDu1/9rAnG1WAZlh5LKF1AAWksAb/vChAVUOpsM+XJDOnmSQ7ETB8ivp1JCoMKfiX KIqEQmatdWw0sI8+57mb/R1eDR9fYHbxfIIg4qw1QxnL164uQI9aPTQAacvyFa3M7vVF ZMBnMEI5gO96S3D+ljjGfgLVyCox+4SDdWWwkGCi9RqP67D56AGDIMYRVpoSmOcwYmHg LfnQ== X-Gm-Message-State: AOAM53166/kYdqBGQV/INxaVf4mw3adiiRr7J6cWZ+NFljIMHVB8nCY8 XFxEWhiViguckamZVZxF08I= X-Received: by 2002:a05:600c:19d3:: with SMTP id u19mr8962355wmq.115.1626953288815; Thu, 22 Jul 2021 04:28:08 -0700 (PDT) Received: from [10.20.1.172] ([185.138.176.94]) by smtp.gmail.com with ESMTPSA id r2sm3319495wmh.34.2021.07.22.04.28.08 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Thu, 22 Jul 2021 04:28:08 -0700 (PDT) Subject: Re: [PATCH] lib/string: Bring optimized memcmp from glibc To: Linus Torvalds , Nikolay Borisov Cc: Linux Kernel Mailing List , Nick Desaulniers , linux-fsdevel , Dave Chinner References: <20210721135926.602840-1-nborisov@suse.com> From: Nikolay Borisov Message-ID: Date: Thu, 22 Jul 2021 14:28:07 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.11.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 21.07.21 г. 22:26, Linus Torvalds wrote: > On Wed, Jul 21, 2021 at 11:45 AM Linus Torvalds > wrote: >> >> I can do the mutual alignment too, but I'd actually prefer to do it as >> a separate patch, for when there are numbers for that. >> >> And I wouldn't do it as a byte-by-byte case, because that's just stupid. > > Here's that "try to align one of the pointers in order to avoid the > lots-of-unaligned case" patch. > > It's not quite as simple, and the generated assembly isn't quite as > obvious. But it still generates code that looks good, it's just that > the code to align the first pointer ends up being a bit harder to > read. > This one also works, tested only on x86-64. Looking at the perf diff: 30.44% -28.66% [kernel.vmlinux] [k] memcmp Comparing your 2 version that you submitted the difference is: 1.05% +0.72% [kernel.vmlinux] [k] memcmp So the pointer alignment one is slightly more expensive. However those measurements were done only on x86-64. Now on a more practical note, IIUC your 2nd version makes sense if the cost of doing a one unaligned access in the loop body is offset by the fact we are doing a native word-sized comparison, right?