Message-Id: <484bcfaccc1ec3d91b74aeaaa26a0ae66fe0955a.1527160868.git.christophe.leroy@c-s.fr>
From: Christophe Leroy
Subject: [PATCH] powerpc/32: Optimise __csum_partial()
To: Benjamin Herrenschmidt, Paul Mackerras, Michael Ellerman, segher@kernel.crashing.org
Cc: linux-kernel@vger.kernel.org, linuxppc-dev@lists.ozlabs.org
Date: Thu, 24 May 2018 11:22:27 +0000 (UTC)
X-Mailing-List: linux-kernel@vger.kernel.org

Improve __csum_partial by interleaving loads and adds.

On an 8xx, it brings neither improvement nor degradation.
On an 83xx, it brings a 25% improvement.

Signed-off-by: Christophe Leroy
---
 arch/powerpc/lib/checksum_32.S | 13 +++++++++++--
 1 file changed, 11 insertions(+), 2 deletions(-)

diff --git a/arch/powerpc/lib/checksum_32.S b/arch/powerpc/lib/checksum_32.S
index d2238ea82209..aa224069f93a 100644
--- a/arch/powerpc/lib/checksum_32.S
+++ b/arch/powerpc/lib/checksum_32.S
@@ -47,16 +47,25 @@ _GLOBAL(__csum_partial)
 	bdnz	2b
 21:	srwi.	r6,r4,4		/* # blocks of 4 words to do */
 	beq	3f
+	lwz	r0,4(r3)
 	mtctr	r6
-22:	lwz	r0,4(r3)
 	lwz	r6,8(r3)
+	adde	r5,r5,r0
 	lwz	r7,12(r3)
+	adde	r5,r5,r6
 	lwzu	r8,16(r3)
+	adde	r5,r5,r7
+	bdz	23f
+22:	lwz	r0,4(r3)
+	adde	r5,r5,r8
+	lwz	r6,8(r3)
 	adde	r5,r5,r0
+	lwz	r7,12(r3)
 	adde	r5,r5,r6
+	lwzu	r8,16(r3)
 	adde	r5,r5,r7
-	adde	r5,r5,r8
 	bdnz	22b
+23:	adde	r5,r5,r8
 3:	andi.	r0,r4,2
 	beq+	4f
 	lhz	r0,4(r3)
-- 
2.13.3
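
For readers less familiar with PowerPC assembly, the reordering in the 4-word block loop can be sketched in C. This is only an illustration under stated assumptions, not the kernel implementation: add_carry(), csum_blocks_plain() and csum_blocks_interleaved() are hypothetical names, and the end-around carry fold in add_carry() is a stand-in for the adde carry chain (it gives a ones'-complement sum, but not necessarily the same intermediate register values). The point is the schedule: the first three adds are peeled out in front of the loop and the last add is moved behind it, so each add consumes a word loaded several instructions earlier.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: 32-bit add with end-around carry.
 * Assumption: used here in place of the adde carry chain. */
static uint32_t add_carry(uint32_t sum, uint32_t w)
{
	uint64_t t = (uint64_t)sum + w;

	return (uint32_t)t + (uint32_t)(t >> 32);
}

/* Shape of the loop before the patch: four loads, then four adds. */
uint32_t csum_blocks_plain(const uint32_t *p, size_t nblocks, uint32_t sum)
{
	while (nblocks--) {
		uint32_t a = p[0], b = p[1], c = p[2], d = p[3];

		p += 4;
		sum = add_carry(sum, a);
		sum = add_carry(sum, b);
		sum = add_carry(sum, c);
		sum = add_carry(sum, d);
	}
	return sum;
}

/* Shape of the loop after the patch: loads and adds alternate. */
uint32_t csum_blocks_interleaved(const uint32_t *p, size_t nblocks, uint32_t sum)
{
	uint32_t a, b, c, d;

	if (nblocks == 0)		/* beq 3f */
		return sum;

	/* Prologue: first block, with the add of d deferred. */
	a = p[0];
	b = p[1]; sum = add_carry(sum, a);
	c = p[2]; sum = add_carry(sum, b);
	d = p[3]; sum = add_carry(sum, c);
	p += 4;

	while (--nblocks) {		/* bdz 23f / bdnz 22b */
		a = p[0]; sum = add_carry(sum, d);	/* d from previous block */
		b = p[1]; sum = add_carry(sum, a);
		c = p[2]; sum = add_carry(sum, b);
		d = p[3]; sum = add_carry(sum, c);
		p += 4;
	}

	return add_carry(sum, d);	/* label 23: last deferred add */
}
```

Both functions add the words in the same order, so they return the same value for any input; only the distance between each load and the add that uses it changes.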