Received: by 2002:ab2:3350:0:b0:1f4:6588:b3a7 with SMTP id o16csp1335116lqe; Mon, 8 Apr 2024 06:17:41 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCWJzUCbyaDrtF8u02polXQNQ+t7e1lEq2ccVLePL6Kegfz5IGmKrl3MOWwpBHxZg9CJibbatT+vOq/p9rKzXjnE4B54nd7Mdi73/kQBHw== X-Google-Smtp-Source: AGHT+IGxxRTsrZ/4PW12gjJug6Kk+ZSmndUU9+SLaqMOXsng993IwnPHjClFuF8vA/2Y4uFXiNxP X-Received: by 2002:a05:6102:3019:b0:479:f418:dc7a with SMTP id s25-20020a056102301900b00479f418dc7amr2501880vsa.3.1712582260842; Mon, 08 Apr 2024 06:17:40 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1712582260; cv=pass; d=google.com; s=arc-20160816; b=q0cH4yPVoZyKFHuTUnLM3FfM/ZxbR68KUvGt1B75HJw9IS24MnKzWl0Jxn/oIaQ8sn YVVlcqhue4vwd8sAfRZj8l2+BeVjkgr6Rw6lJjPUXSwRLUzPWclCKTr5pgaxtAqGj5X5 5K0g47cLMKEfTF3xX0dWcFjkfuatjo2M2VXnoVE5FzPuJggfKga/V7VHPZuBrW6BwQnJ B1MLi5wx2bbk//Ggq0bHlCderZ8JVh2WxAVgFnK7ba3OZ6SwisT4ayvICCHZkzBqQzYg 20vEzYFq9DYhZuFLvQxX6j1pZnPI+2xvKGCUpUg4qvMXpyFzPFDxBu58S36hy1DqBkvh EGxg== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=in-reply-to:content-disposition:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:references:message-id:subject:cc :to:from:date:dkim-signature; bh=Qw87mtV3W+oJdJAJ1W99CO6w9UlaRy4c/rHwvqmH5Js=; fh=ad2OA34kW9lx5OokKR1Mp02R0ChuE5QaHwb8xE4nF7s=; b=YgrCq+Ziz3j1GAgQJ/nzXfb1rJCxU01HAegiyy3Eaj+0t0PvnfRNxJlzcjxbVn2ePL Zd2+CyQK5EMg3EzMECFjDgnoGnVT9897USy6OeavXu2ZI1Qe1Vnqz1IG/DYqEvijxudG N3kzhsRx3lMEYeeh4QqgKozZq0XbQXu+mJcB+7k+oo2WdEsFy3RAW/zLQqZDtq4SEy+w ECOHiiOkfq4tggSHDn/6aZyLJrfTazythAJMFulenp0Mb3h/+RxsdjUXy0CEaFsZ2x7b CeYNDCJ6Z0lX5+VEFtTL1fysBN07MhnHCp5ChlRHrgZYX2cLzYWBdgu7FloL8vEp7LoQ OdwA==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=JRggopEW; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-crypto+bounces-3405-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.199.223 as permitted sender) smtp.mailfrom="linux-crypto+bounces-3405-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from ny.mirrors.kernel.org (ny.mirrors.kernel.org. [147.75.199.223]) by mx.google.com with ESMTPS id e8-20020ab078c8000000b007e51a17a349si1047410uau.136.2024.04.08.06.17.40 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 08 Apr 2024 06:17:40 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-crypto+bounces-3405-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.199.223 as permitted sender) client-ip=147.75.199.223; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=JRggopEW; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-crypto+bounces-3405-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.199.223 as permitted sender) smtp.mailfrom="linux-crypto+bounces-3405-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ny.mirrors.kernel.org (Postfix) with ESMTPS id 6445D1C215E4 for ; Mon, 8 Apr 2024 13:17:36 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 016A77EF1E; Mon, 8 Apr 2024 13:12:41 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="JRggopEW" X-Original-To: linux-crypto@vger.kernel.org Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id B0B537E575; Mon, 8 Apr 2024 13:12:40 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=10.30.226.201 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1712581960; cv=none; b=Oek6g7nGBasEA8a5xTsrwO3D9SQ7OQpwxoPXU1lAYmZ9+dBqmoaMmxOTyBdNpOmKEjIsvq0yPnlI+HeeVYzNnbOunjow5Z0SOlnDsk0RgqMgSs15iCsq+Ha1YcXc+nWRw09D1Np5AE3i/lGfLfQOg4cOqQnC1cRFKZpIdEU57XI= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1712581960; c=relaxed/simple; bh=FDyXl726c9B9U4ZoE70AwoSNthND7FSk/8PQFnBAXIE=; h=Date:From:To:Cc:Subject:Message-ID:References:MIME-Version: Content-Type:Content-Disposition:In-Reply-To; b=ObdbRN2wKUUuU5+B60kDNJ4bY/uQvNjfCi2wI2R+ittPEkCeUqkymYFj1AwFJOZaYBw6cJAHdM1IFha3D/Mx5dxqikrIFdMWn8M/c0wgJUgblq0n7u9e6z33WsMLAB6PCF4spJ3KQ+RsgxYJErYYBi9Qyd1fvBCOqMaLAQc0aeg= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b=JRggopEW; arc=none smtp.client-ip=10.30.226.201 Received: by smtp.kernel.org (Postfix) with ESMTPSA id 16EE6C43390; Mon, 8 Apr 2024 13:12:40 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1712581960; bh=FDyXl726c9B9U4ZoE70AwoSNthND7FSk/8PQFnBAXIE=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=JRggopEWyiJOQ1Goi48EjKmJ/RpM5o2bYUPDpk/CcnySwjHAkM5mGQ7Cn/S4ZcaNx RpSumi87UT83FC6tnrmfTshrX5rbp9LmmkvtA34WKQDhVYx/Fz1Gno6Tj4cSu/GdBt LNx5wHvReuahjW6/wua7KE6bOtRS247xbfNqRDK1uYnQ9UdjcAQYMkYsICzu+YCpV4 V10Ko0DqGMgkVgFoYJJTMyn+iROs60s3FTYysS13ST/4HOWeS09xNdEprDXqeyZcZ4 cFWv+gIlUBb4AnPo9QETsE3eSN8OKoDFgLhOWIG1RNtYsUGb7A1kty7KkLKVVX9b/d +Vt8lQqJi8wyA== Date: Mon, 8 Apr 2024 09:12:38 -0400 From: Eric Biggers To: Stefan Kanthak Cc: linux-kernel@vger.kernel.org, linux-crypto@vger.kernel.org, tim.c.chen@linux.intel.com, sean.m.gulley@intel.com Subject: Re: [PATCH 1/2] crypto: s(h)aving 40+ bytes off arch/x86/crypto/sha256_ni_asm.S Message-ID: <20240408131238.GC732@quark.localdomain> References: <5EEE09A9021540A5AAD8BFEEE915512D@H270> Precedence: bulk X-Mailing-List: linux-crypto@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <5EEE09A9021540A5AAD8BFEEE915512D@H270> On Mon, Apr 08, 2024 at 11:26:52AM +0200, Stefan Kanthak wrote: > @@ -315,11 +315,11 @@ > jne .Lloop0 > > /* Write hash values back in the correct order */ > - pshufd $0x1B, STATE0, STATE0 /* FEBA */ > - pshufd $0xB1, STATE1, STATE1 /* DCHG */ > movdqa STATE0, MSGTMP4 > - pblendw $0xF0, STATE1, STATE0 /* DCBA */ > - palignr $8, MSGTMP4, STATE1 /* HGFE */ > + punpcklqdq STATE1, STATE0 /* GHEF */ > + punpckhqdq MSGTMP4, STATE1 /* ABCD */ > + pshufd $0xB1, STATE0, STATE0 /* HGFE */ > + pshufd $0x1B, STATE1, STATE1 /* DCBA */ > > movdqu STATE0, 0*16(DIGEST_PTR) > movdqu STATE1, 1*16(DIGEST_PTR) Please make sure to run the crypto self-tests too. The above is storing the two halves of the state in the wrong order. Thanks, - Eric