Received: by 2002:ac0:a594:0:0:0:0:0 with SMTP id m20-v6csp104585imm; Mon, 21 May 2018 03:04:22 -0700 (PDT) X-Google-Smtp-Source: AB8JxZosYfgv/NOrF+K0I8s5F5UwVAU3sXPNdDuDn5MdmocY6zSH25xfmT7C756rOS1ODFoqAN5s X-Received: by 2002:a17:902:82ca:: with SMTP id u10-v6mr19896634plz.160.1526897062626; Mon, 21 May 2018 03:04:22 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1526897062; cv=none; d=google.com; s=arc-20160816; b=yI4rd7mdckTZ8MnO8WdJ7r/yQcgqkMPzzAoG6wv4x6M/HfPZFcX/Uc1VAA4NxsaKzg qv1DmMZfpmZI0EQV798v3sc2gxBM5erNnBBLxcdV6JTMRJ2ImBbM0IoTy8zs9/runT0w W7tf4CfUeyxaLb59PY1tILXoyYNRrQDamof+3mvnavGGDekVoVjwJDPz5iRioVRZjdY7 qPFDsgFCxgpd4Ito4+U5jEH/6lZxKHWp6hGOgp32SeQvsW5hS8qmxMLWEkDKPnIlRa5r mBtrjHoN8d7rq6xv7r+MzQb6YjfxAvckFXMj9SXjvxlElwPAGR2lIsRztA6teTceg4Sf akWg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:date:message-id:subject:cc:from:to :in-reply-to:arc-authentication-results; bh=2M20AuWYjz4NduwrpmWROEGDLneQeWIQ3dMqE6xUAbg=; b=iOAC9iyaRIpnXFbbsVCSvTUl0c152q1Bw7BH7lFda7UZ3VTwCla4jgQb4FkvO2PAqq 25+iIdYaV3KVUvVMZhOrq+slRvkvDqZsRu22DdF1XeQXvjzNU+zM0yV0FT87KV/b2t3o QcPwnqvukiToaFU/VrXSYrlHQTe/G7XTcTB1r4ego1Q9xvfF62JhO83Zx9qHhiMjd/qm bB+sedy+s5QEjDfKjNtHL6KYMUJJqrH1gTSeumsrJYZwqchO0J/JELnXiaNhEKu79I3Y /IJstnZe8Rk3Xj93EdnjfImtD8GZG/YHAz+vwnhv2+DjnOtiPApvOMj8tJW7vC6xkYyA 2X5Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 203-v6si13589048pfa.60.2018.05.21.03.04.08; Mon, 21 May 2018 03:04:22 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752627AbeEUKDY (ORCPT + 99 others); Mon, 21 May 2018 06:03:24 -0400 Received: from ozlabs.org ([203.11.71.1]:46911 "EHLO ozlabs.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752880AbeEUKB2 (ORCPT ); Mon, 21 May 2018 06:01:28 -0400 Received: by ozlabs.org (Postfix, from userid 1034) id 40qDnZ2Vrmz9s31; Mon, 21 May 2018 20:01:22 +1000 (AEST) X-powerpc-patch-notification: thanks X-powerpc-patch-commit: 96f391cf40ee5c9201cc7b55abe3903761e6a2e2 In-Reply-To: <20180410063437.217D2653BC@po15720vm.idsi0.si.c-s.fr> To: Christophe Leroy , Benjamin Herrenschmidt , Paul Mackerras , Scott Wood From: Michael Ellerman Cc: Shile Zhang , linuxppc-dev@lists.ozlabs.org, linux-kernel@vger.kernel.org Subject: Re: Revert "powerpc/64: Fix checksum folding in csum_add()" Message-Id: <40qDnZ2Vrmz9s31@ozlabs.org> Date: Mon, 21 May 2018 20:01:22 +1000 (AEST) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 2018-04-10 at 06:34:37 UTC, Christophe Leroy wrote: > This reverts commit 6ad966d7303b70165228dba1ee8da1a05c10eefe. > > That commit was pointless, because csum_add() sums two 32 bits > values, so the sum is 0x1fffffffe at the maximum. > And then when adding upper part (1) and lower part (0xfffffffe), > the result is 0xffffffff which doesn't carry. > Any lower value will not carry either. > > And behind the fact that this commit is useless, it also kills the > whole purpose of having an arch specific inline csum_add() > because the resulting code gets even worse than what is obtained > with the generic implementation of csum_add() > > 0000000000000240 <.csum_add>: > 240: 38 00 ff ff li r0,-1 > 244: 7c 84 1a 14 add r4,r4,r3 > 248: 78 00 00 20 clrldi r0,r0,32 > 24c: 78 89 00 22 rldicl r9,r4,32,32 > 250: 7c 80 00 38 and r0,r4,r0 > 254: 7c 09 02 14 add r0,r9,r0 > 258: 78 09 00 22 rldicl r9,r0,32,32 > 25c: 7c 00 4a 14 add r0,r0,r9 > 260: 78 03 00 20 clrldi r3,r0,32 > 264: 4e 80 00 20 blr > > In comparison, the generic implementation of csum_add() gives: > > 0000000000000290 <.csum_add>: > 290: 7c 63 22 14 add r3,r3,r4 > 294: 7f 83 20 40 cmplw cr7,r3,r4 > 298: 7c 10 10 26 mfocrf r0,1 > 29c: 54 00 ef fe rlwinm r0,r0,29,31,31 > 2a0: 7c 60 1a 14 add r3,r0,r3 > 2a4: 78 63 00 20 clrldi r3,r3,32 > 2a8: 4e 80 00 20 blr > > And the reverted implementation for PPC64 gives: > > 0000000000000240 <.csum_add>: > 240: 7c 84 1a 14 add r4,r4,r3 > 244: 78 80 00 22 rldicl r0,r4,32,32 > 248: 7c 80 22 14 add r4,r0,r4 > 24c: 78 83 00 20 clrldi r3,r4,32 > 250: 4e 80 00 20 blr > > Fixes: 6ad966d7303b7 ("powerpc/64: Fix checksum folding in csum_add()") > Signed-off-by: Christophe Leroy > Acked-by: Paul Mackerras Applied to powerpc next, thanks. https://git.kernel.org/powerpc/c/96f391cf40ee5c9201cc7b55abe390 cheers