From: Herbert Xu Subject: Re: [PATCH v3 0/4] Patchset to use PCLMULQDQ to accelerate CRC-T10DIF checksum computation Date: Tue, 14 May 2013 10:35:26 +0800 Message-ID: <20130514023526.GB19306@gondor.apana.org.au> References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Cc: "H. Peter Anvin" , "David S. Miller" , "Martin K. Petersen" , James Bottomley , Matthew Wilcox , Jim Kukunas , Keith Busch , Erdinc Ozturk , Vinodh Gopal , James Guilford , Wajdi Feghali , Jussi Kivilinna , linux-kernel , linux-crypto@vger.kernel.org, linux-scsi@vger.kernel.org To: Tim Chen Return-path: Received: from ringil.hengli.com.au ([178.18.16.133]:58595 "EHLO fornost.hengli.com.au" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1755783Ab3ENCfz (ORCPT ); Mon, 13 May 2013 22:35:55 -0400 Content-Disposition: inline In-Reply-To: Sender: linux-crypto-owner@vger.kernel.org List-ID: On Wed, May 01, 2013 at 12:52:47PM -0700, Tim Chen wrote: > Currently the CRC-T10DIF checksum is computed using a generic table lookup > algorithm. By switching the checksum to PCLMULQDQ based computation, > we can speedup the computation by 8x for checksumming 512 bytes and > even more for larger buffer size. This will improve performance of SCSI > drivers turning on the CRC-T10IDF checksum. In our SSD based experiments, > we have seen increase disk throughput by 3.5x with T10DIF for 512 byte > block size. > > This patch set provides the x86_64 routine using PCLMULQDQ instruction > and switches the crc_t10dif library function to use the faster PCLMULQDQ > based routine when available. > > Tim > > v3 > 1. Update the crct10dif crypto transform used in the crct10dif library in a safe way. > 2. Load the accelerated t10dif transform for the x86_64 cpus that support it. > 3. Added generic crct10dif crypto transform. All applied. Thanks Tim. -- Email: Herbert Xu Home Page: http://gondor.apana.org.au/~herbert/ PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt