Date: Mon, 8 Apr 2024 08:31:08 -0400
From: Eric Biggers
To: David Laight
Cc: Ard Biesheuvel, "linux-crypto@vger.kernel.org", "x86@kernel.org", "linux-kernel@vger.kernel.org", Andy Lutomirski, "Chang S .
    Bae"
Subject: Re: [PATCH 0/6] Faster AES-XTS on modern x86_64 CPUs
Message-ID: <20240408123108.GA732@quark.localdomain>
References: <20240326080305.402382-1-ebiggers@kernel.org>
    <20240326164755.GB1524@sol.localdomain>
    <6629b8120807458ab76e1968056f5e10@AcuMS.aculab.com>
    <20240404013529.GB24248@quark.localdomain>
    <142077804bee45daac3b0fad8bc4c2fe@AcuMS.aculab.com>
    <20240405191904.GA1205@quark.localdomain>

On Mon, Apr 08, 2024 at 07:41:44AM +0000, David Laight wrote:
> From: Eric Biggers
> > Sent: 05 April 2024 20:19
> ...
> > I did some tests on Sapphire Rapids using a system call that I customized to do
> > nothing except possibly a kernel_fpu_begin / kernel_fpu_end pair.
> >
> > On average the bare syscall took 70 ns.  The syscall with the kernel_fpu_begin /
> > kernel_fpu_end pair took 160 ns if the userspace program used xmm only, 340 ns
> > if it used ymm, or 360 ns if it used zmm...
> >
> > Note that without the kernel_fpu_begin / kernel_fpu_end pair, AES-NI
> > instructions cannot be used and the alternative would be xts(ecb(aes-generic)).
> > On the same CPU, encrypting a single 512-byte sector with xts(ecb(aes-generic))
> > takes about 2235ns.  With xts-aes-vaes-avx10_512 it takes 75 ns...
>
> So most of the cost of a single 512-byte sector is the kernel_fpu_begin().
> But it is so much slower any other way it is still faster.
>

Yes.  To clarify, the 75 ns time I mentioned for a 512-byte sector is the
average for repeated calls, amortizing the XSAVE and XRSTOR.  For a real single
512-byte sector that eats the entire cost of the XSAVE and XRSTOR by itself, if
all state is in-use it should be about 75 + (360 - 70) = 365 ns (based on the
syscall benchmarks I did), with the XSAVE and XRSTOR accounting for 80% of that
time.

But yes, that's still over 6 times faster than the scalar alternative.

- Eric
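
For anyone who wants to re-check the estimate above, here is a minimal sketch of the arithmetic in Python. The inputs are the numbers quoted in this thread (Sapphire Rapids, zmm state in use); the constant and function names are made up for illustration, and the result is an estimate derived from the syscall benchmark, not a direct measurement.

```python
# Figures quoted in the thread, all in nanoseconds (Sapphire Rapids).
BARE_SYSCALL_NS = 70         # syscall that does nothing
SYSCALL_FPU_ZMM_NS = 360     # same syscall + kernel_fpu_begin/end, zmm in use
XTS_VAES_AMORTIZED_NS = 75   # xts-aes-vaes-avx10_512, 512-byte sector, repeated calls
XTS_GENERIC_NS = 2235        # xts(ecb(aes-generic)), 512-byte sector


def fpu_overhead_ns(syscall_with_fpu_ns: int, bare_syscall_ns: int) -> int:
    """Estimated XSAVE + XRSTOR cost: FPU-using syscall minus bare syscall."""
    return syscall_with_fpu_ns - bare_syscall_ns


def isolated_sector_ns(amortized_ns: int, overhead_ns: int) -> int:
    """Estimated cost of one sector that pays the full save/restore itself."""
    return amortized_ns + overhead_ns


overhead = fpu_overhead_ns(SYSCALL_FPU_ZMM_NS, BARE_SYSCALL_NS)
total = isolated_sector_ns(XTS_VAES_AMORTIZED_NS, overhead)
fpu_share = overhead / total            # fraction of time spent in XSAVE/XRSTOR
speedup = XTS_GENERIC_NS / total        # versus the scalar implementation

print(f"XSAVE+XRSTOR overhead: {overhead} ns")   # 290 ns
print(f"isolated sector:       {total} ns")      # 365 ns
print(f"save/restore share:    {fpu_share:.0%}")
print(f"speedup vs. generic:   {speedup:.1f}x")
```

Substituting 160 or 340 for SYSCALL_FPU_ZMM_NS gives the corresponding estimates for the xmm-only and ymm cases from the same benchmark.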