Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933403AbeAJIiH (ORCPT + 1 other); Wed, 10 Jan 2018 03:38:07 -0500 Received: from mail-lf0-f68.google.com ([209.85.215.68]:36668 "EHLO mail-lf0-f68.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932320AbeAJIiE (ORCPT ); Wed, 10 Jan 2018 03:38:04 -0500 X-Google-Smtp-Source: ACJfBov6eosJaP7pKeVm/1QfPu4dPqerKsN3xbwmiOkgMCS/B2BOBanlQRkLbZmYP2uLbQOPOf6kXxc6ARUDkcSnjuU= MIME-Version: 1.0 In-Reply-To: References: <20180109151847.30258-1-cmo@melexis.com> <1515525824.9619.103.camel@perches.com> From: Crt Mori Date: Wed, 10 Jan 2018 09:37:21 +0100 Message-ID: Subject: Re: [PATCH v12 1/3] lib: Add strongly typed 64bit int_sqrt To: Joe Perches Cc: Jonathan Cameron , Ingo Molnar , Andrew Morton , Kees Cook , Rusty Russell , Ian Abbott , Larry Finger , Niklas Soderlund , Thomas Gleixner , Krzysztof Kozlowski , Masahiro Yamada , linux-kernel@vger.kernel.org, Linux Iio , Peter Zijlstra , David Laight Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Return-Path: On 10 January 2018 at 09:33, Crt Mori wrote: > On 10 January 2018 at 09:15, Crt Mori wrote: >> On 9 January 2018 at 20:23, Joe Perches wrote: >>> On Tue, 2018-01-09 at 16:18 +0100, Crt Mori wrote: >>>> There is no option to perform 64bit integer sqrt on 32bit platform. >>>> Added stronger typed int_sqrt64 enables the 64bit calculations to >>>> be performed on 32bit platforms. Using same algorithm as int_sqrt() >>>> with strong typing provides enough precision also on 32bit platforms, >>>> but it sacrifices some performance. >>> [] >>>> diff --git a/lib/int_sqrt.c b/lib/int_sqrt.c >>> [] >>>> @@ -36,3 +37,34 @@ unsigned long int_sqrt(unsigned long x) >>>> return y; >>>> } >>>> EXPORT_SYMBOL(int_sqrt); >>>> + >>>> +#if BITS_PER_LONG < 64 >>>> +/** >>>> + * int_sqrt64 - strongly typed int_sqrt function when minimum 64 bit input >>>> + * is expected. >>>> + * @x: 64bit integer of which to calculate the sqrt >>>> + */ >>>> +u32 int_sqrt64(u64 x) >>>> +{ >>>> + u64 b, m; >>>> + u32 y = 0; >>>> + >>>> + if (x <= 1) >>>> + return x; >>> >>> I think this should instead be: >>> >>> if (x <= INT_MAX) >>> return int_sqrt((int)x); >>> >>> to reduce the loop cost below when the >>> value is small enough. >>> >> >> In existing int_sqrt its only 1 and I assume that is more to protect >> from loop execution with 0 or 1. Since there is no difference (except >> fls64) with int_sqrt I assume there is no need to call it to avoid >> loop? >> > > Nevermind, I see what you mean (should have thought longer before I > written). The cost of below loop is because of 64bit calculation is > not native on 32bit and we could just use 32bit calculation in that > loop. Will send v13 with a fix for this. > Shouldn't I rather make it if (x <= ULONG_MAX) return int_sqrt((unsigned long) x); >>>> + >>>> + m = 1ULL << (fls64(x) & ~1ULL); >>>> + while (m != 0) { >>>> + b = y + m; >>>> + y >>= 1; >>>> + >>>> + if (x >= b) { >>>> + x -= b; >>>> + y += m; >>>> + } >>>> + m >>= 2; >>>> + } >>>> + >>>> + return y; >>>> +} >>>> +EXPORT_SYMBOL(int_sqrt64); >>>> +#endif