Date: Thu, 17 May 2018 09:10:52 +1000
From: Paul Mackerras
To: Christophe Leroy
Cc: Benjamin Herrenschmidt, Michael Ellerman, Scott Wood, linux-kernel@vger.kernel.org, linuxppc-dev@lists.ozlabs.org, Shile Zhang
Subject: Re: [PATCH] Revert "powerpc/64: Fix checksum folding in csum_add()"
Message-ID: <20180516231052.GA24778@fergus.ozlabs.ibm.com>
References: <20180410063437.217D2653BC@po15720vm.idsi0.si.c-s.fr>
In-Reply-To: <20180410063437.217D2653BC@po15720vm.idsi0.si.c-s.fr>

On Tue, Apr 10, 2018 at 08:34:37AM +0200, Christophe Leroy wrote:
> This reverts commit 6ad966d7303b70165228dba1ee8da1a05c10eefe.
>
> That commit was pointless, because csum_add() sums two 32 bits
> values, so the sum is 0x1fffffffe at the maximum.
> And then when adding upper part (1) and lower part (0xfffffffe),
> the result is 0xffffffff which doesn't carry.
> Any lower value will not carry either.
>
> And behind the fact that this commit is useless, it also kills the
> whole purpose of having an arch specific inline csum_add()
> because the resulting code gets even worse than what is obtained
> with the generic implementation of csum_add()
>
> 0000000000000240 <.csum_add>:
>  240:   38 00 ff ff     li      r0,-1
>  244:   7c 84 1a 14     add     r4,r4,r3
>  248:   78 00 00 20     clrldi  r0,r0,32
>  24c:   78 89 00 22     rldicl  r9,r4,32,32
>  250:   7c 80 00 38     and     r0,r4,r0
>  254:   7c 09 02 14     add     r0,r9,r0
>  258:   78 09 00 22     rldicl  r9,r0,32,32
>  25c:   7c 00 4a 14     add     r0,r0,r9
>  260:   78 03 00 20     clrldi  r3,r0,32
>  264:   4e 80 00 20     blr
>
> In comparison, the generic implementation of csum_add() gives:
>
> 0000000000000290 <.csum_add>:
>  290:   7c 63 22 14     add     r3,r3,r4
>  294:   7f 83 20 40     cmplw   cr7,r3,r4
>  298:   7c 10 10 26     mfocrf  r0,1
>  29c:   54 00 ef fe     rlwinm  r0,r0,29,31,31
>  2a0:   7c 60 1a 14     add     r3,r0,r3
>  2a4:   78 63 00 20     clrldi  r3,r3,32
>  2a8:   4e 80 00 20     blr
>
> And the reverted implementation for PPC64 gives:
>
> 0000000000000240 <.csum_add>:
>  240:   7c 84 1a 14     add     r4,r4,r3
>  244:   78 80 00 22     rldicl  r0,r4,32,32
>  248:   7c 80 22 14     add     r4,r0,r4
>  24c:   78 83 00 20     clrldi  r3,r4,32
>  250:   4e 80 00 20     blr
>
> Fixes: 6ad966d7303b7 ("powerpc/64: Fix checksum folding in csum_add()")
> Signed-off-by: Christophe Leroy

Seems I was right first time... :)

Acked-by: Paul Mackerras
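
For reference, the folding argument in the quoted commit message can be checked with a small C sketch. The function names below are hypothetical, and the bodies are inferred from the three disassembly listings quoted above rather than copied from the kernel tree, so treat this as an illustration under those assumptions and not as the actual kernel source.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Single fold (assumed shape of the code the revert restores):
 * add in 64 bits, then add the carry bit (bit 32) back into the low word.
 * The 64-bit sum is at most 0x1fffffffe, so when the carry is 1 the low
 * word is at most 0xfffffffe and adding 1 cannot overflow 32 bits.
 */
static uint32_t csum_add_single_fold(uint32_t csum, uint32_t addend)
{
    uint64_t res = (uint64_t)csum + addend;
    return (uint32_t)(res + (res >> 32));
}

/*
 * Double fold (assumed shape of the code being reverted):
 * mask and fold once, then fold a second time. The second fold always
 * adds 0 here, for the reason given in the comment above.
 */
static uint32_t csum_add_double_fold(uint32_t csum, uint32_t addend)
{
    uint64_t res = (uint64_t)csum + addend;
    res = (res & 0xffffffffULL) + (res >> 32);
    return (uint32_t)(res + (res >> 32));
}

/*
 * Generic form (assumed): 32-bit add, then add 1 if the add wrapped,
 * detected by the result being smaller than one of the operands.
 */
static uint32_t csum_add_generic(uint32_t csum, uint32_t addend)
{
    uint32_t res = csum + addend;
    return res + (res < addend);
}

/* Hypothetical helper: assert that all three forms agree on edge cases. */
static void csum_add_selfcheck(void)
{
    static const uint32_t v[] = {
        0u, 1u, 0x7fffffffu, 0x80000000u, 0xfffffffeu, 0xffffffffu
    };
    const unsigned n = sizeof(v) / sizeof(v[0]);

    for (unsigned i = 0; i < n; i++) {
        for (unsigned j = 0; j < n; j++) {
            uint32_t a = csum_add_single_fold(v[i], v[j]);
            assert(a == csum_add_double_fold(v[i], v[j]));
            assert(a == csum_add_generic(v[i], v[j]));
        }
    }
}
```

Calling csum_add_selfcheck() exercises the worst case 0xffffffff + 0xffffffff, where the 64-bit sum is 0x1fffffffe and a single fold already yields 0xffffffff without a further carry.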