Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S935436AbeAITXv (ORCPT + 1 other); Tue, 9 Jan 2018 14:23:51 -0500 Received: from smtprelay0128.hostedemail.com ([216.40.44.128]:42826 "EHLO smtprelay.hostedemail.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S933847AbeAITXt (ORCPT ); Tue, 9 Jan 2018 14:23:49 -0500 X-Session-Marker: 6A6F6540706572636865732E636F6D X-HE-Tag: wave66_7828114b0394c X-Filterd-Recvd-Size: 2509 Message-ID: <1515525824.9619.103.camel@perches.com> Subject: Re: [PATCH v12 1/3] lib: Add strongly typed 64bit int_sqrt From: Joe Perches To: Crt Mori , Jonathan Cameron Cc: Ingo Molnar , Andrew Morton , Kees Cook , Rusty Russell , Ian Abbott , Larry Finger , Niklas Soderlund , Thomas Gleixner , Krzysztof Kozlowski , Masahiro Yamada , linux-kernel@vger.kernel.org, linux-iio@vger.kernel.org, Peter Zijlstra , David Laight Date: Tue, 09 Jan 2018 11:23:44 -0800 In-Reply-To: <20180109151847.30258-1-cmo@melexis.com> References: <20180109151847.30258-1-cmo@melexis.com> Content-Type: text/plain; charset="ISO-8859-1" X-Mailer: Evolution 3.26.1-1 Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Return-Path: On Tue, 2018-01-09 at 16:18 +0100, Crt Mori wrote: > There is no option to perform 64bit integer sqrt on 32bit platform. > Added stronger typed int_sqrt64 enables the 64bit calculations to > be performed on 32bit platforms. Using same algorithm as int_sqrt() > with strong typing provides enough precision also on 32bit platforms, > but it sacrifices some performance. [] > diff --git a/lib/int_sqrt.c b/lib/int_sqrt.c [] > @@ -36,3 +37,34 @@ unsigned long int_sqrt(unsigned long x) > return y; > } > EXPORT_SYMBOL(int_sqrt); > + > +#if BITS_PER_LONG < 64 > +/** > + * int_sqrt64 - strongly typed int_sqrt function when minimum 64 bit input > + * is expected. > + * @x: 64bit integer of which to calculate the sqrt > + */ > +u32 int_sqrt64(u64 x) > +{ > + u64 b, m; > + u32 y = 0; > + > + if (x <= 1) > + return x; I think this should instead be: if (x <= INT_MAX) return int_sqrt((int)x); to reduce the loop cost below when the value is small enough. > + > + m = 1ULL << (fls64(x) & ~1ULL); > + while (m != 0) { > + b = y + m; > + y >>= 1; > + > + if (x >= b) { > + x -= b; > + y += m; > + } > + m >>= 2; > + } > + > + return y; > +} > +EXPORT_SYMBOL(int_sqrt64); > +#endif