Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp5078244imu; Sat, 1 Dec 2018 08:41:37 -0800 (PST) X-Google-Smtp-Source: AFSGD/UGpbJv0oY1plxxDBTz64AM8a39lHYRcyvzy9MYF6FGo1+uYjbqdncFiSxFPYMeM2vd0Gj1 X-Received: by 2002:a63:d157:: with SMTP id c23mr8247596pgj.170.1543682496938; Sat, 01 Dec 2018 08:41:36 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543682496; cv=none; d=google.com; s=arc-20160816; b=P7wLFCDv/949u7xdfRj5dRYbV5sgG/QzG1XhHamIcAGJwbWJCG3anUZCnwEN8+IDFP XhkncPjFZGTVW4aW6mbTGexRrRf37OPaIgJR+Ydh1HbU2BUo1pWQQj8OVX9zsnZ0VmS6 krdI0qXnYHUO6jUkmmt5JiqU4FMH3gi+Tvkdz03FeYsUBLHnX+CBv9rutoSW/yVbQ/Un 8O33kU003XZ984I9drqme09YQaJSkAaVTJZ3sqgE5HrIlJnTag0stPBAPu/1v1BpP+32 oIEZkVjuuLCS4sRdzgGvkfAgflnMi1LQlK3INbi1xx6MrF8yCNW1lABWfTL6JIGPIezo 6NYw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:date:cc:to:from:subject:message-id; bh=Q0zha1aFTWiMeyQYv5XRQ0gjYnrn85oy61cUEwgT/wY=; b=YyYAXc8P5cHutCy+NYy7Mdl3QfEMhBoAGMfcLhR7OrF64DyAXgGTra4YVzR1t8hJZU Kj2d52YyLIXV0xmjMrKWasfXXYJuAlkDbI1cbRORx/AKwjrDB++NyUL9tmb+S1q5EsOd 21O+UYL4IbYixz81UjMcBBev7muYFOmBA6nrgMdwVl+JTxaZORaHZtNZm0nM2apGdP8u Jt1+2UUwL21D9ZRDyWXM5twciRyqbzepRcJtzi47zfSUpSUtZJ+4aE+xNJR7OoD+3RtS a7RQh56AwxnEgLFbJFw4440FhS3ZFIJIEyVJJMwBIlvoe/3UArSStNYyX49II1o/5s98 SPuw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c6si8500209plr.414.2018.12.01.08.41.22; Sat, 01 Dec 2018 08:41:36 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726950AbeLBDxi (ORCPT + 99 others); Sat, 1 Dec 2018 22:53:38 -0500 Received: from sitav-80046.hsr.ch ([152.96.80.46]:41214 "EHLO mail.strongswan.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726778AbeLBDxh (ORCPT ); Sat, 1 Dec 2018 22:53:37 -0500 Received: from book (unknown [IPv6:2a01:2a8:8106:7c01:b5f0:8f23:2aa:553]) by mail.strongswan.org (Postfix) with ESMTPSA id F0FCE40128; Sat, 1 Dec 2018 17:40:40 +0100 (CET) Message-ID: <99ed681fa4d3233b18ae9328a14f9e23971073cb.camel@strongswan.org> Subject: Re: [PATCH v2 4/6] crypto: x86/chacha20 - add XChaCha20 support From: Martin Willi To: Eric Biggers , linux-crypto@vger.kernel.org Cc: Paul Crowley , Milan Broz , "Jason A . Donenfeld" , linux-kernel@vger.kernel.org Date: Sat, 01 Dec 2018 17:40:40 +0100 In-Reply-To: <20181129230217.158038-5-ebiggers@kernel.org> References: <20181129230217.158038-1-ebiggers@kernel.org> <20181129230217.158038-5-ebiggers@kernel.org> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.28.5-0ubuntu0.18.04.1 Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > An SSSE3 implementation of single-block HChaCha20 is also added so > that XChaCha20 can use it rather than the generic > implementation. This required refactoring the ChaCha permutation > into its own function. > [...] > +ENTRY(chacha20_block_xor_ssse3) > + # %rdi: Input state matrix, s > + # %rsi: up to 1 data block output, o > + # %rdx: up to 1 data block input, i > + # %rcx: input/output length in bytes > + > + # x0..3 = s0..3 > + movdqa 0x00(%rdi),%xmm0 > + movdqa 0x10(%rdi),%xmm1 > + movdqa 0x20(%rdi),%xmm2 > + movdqa 0x30(%rdi),%xmm3 > + movdqa %xmm0,%xmm8 > + movdqa %xmm1,%xmm9 > + movdqa %xmm2,%xmm10 > + movdqa %xmm3,%xmm11 > + > + mov %rcx,%rax > + call chacha20_permute > + > # o0 = i0 ^ (x0 + s0) > paddd %xmm8,%xmm0 > cmp $0x10,%rax > @@ -189,6 +198,23 @@ ENTRY(chacha20_block_xor_ssse3) > > ENDPROC(chacha20_block_xor_ssse3) > > +ENTRY(hchacha20_block_ssse3) > + # %rdi: Input state matrix, s > + # %rsi: output (8 32-bit words) > + > + movdqa 0x00(%rdi),%xmm0 > + movdqa 0x10(%rdi),%xmm1 > + movdqa 0x20(%rdi),%xmm2 > + movdqa 0x30(%rdi),%xmm3 > + > + call chacha20_permute AFAIK, the general convention is to create proper stack frames using FRAME_BEGIN/END for non leaf-functions. Should chacha20_permute() callers do so? For the other parts: Reviewed-by: Martin Willi