Date: Sun, 18 Mar 2018 15:23:27 +0100
From: Lukas Wunner
To: Rasmus Villemoes
Cc: Laura Abbott, Linus Walleij, Kees Cook, linux-gpio@vger.kernel.org,
	linux-kernel@vger.kernel.org, kernel-hardening@lists.openwall.com,
	Mathias Duckeck, Nandor Han, Semi Malinen, Patrice Chotard
Subject: Re: [PATCH 1/4] gpio: Remove VLA from gpiolib
Message-ID: <20180318142327.GA23761@wunner.de>
References: <20180310001021.6437-1-labbott@redhat.com>
	<20180310001021.6437-2-labbott@redhat.com>
	<20180317082509.GA2579@wunner.de>
In-Reply-To: <20180317082509.GA2579@wunner.de>
X-Mailing-List: linux-kernel@vger.kernel.org

On Sat, Mar 17, 2018 at 09:25:09AM +0100, Lukas Wunner wrote:
> On Mon, Mar 12, 2018 at 04:00:36PM +0100, Rasmus Villemoes wrote:
> > On 2018-03-10 01:10, Laura Abbott wrote:
> > > @@ -2887,14 +2909,30 @@ void gpiod_set_array_value_complex(bool raw, bool can_sleep,
> > >  
> > >  	while (i < array_size) {
> > >  		struct gpio_chip *chip = desc_array[i]->gdev->chip;
> > > -		unsigned long mask[BITS_TO_LONGS(chip->ngpio)];
> > > -		unsigned long bits[BITS_TO_LONGS(chip->ngpio)];
> > > +		unsigned long *mask;
> > > +		unsigned long *bits;
> > >  		int count = 0;
> > >  
> > > +		mask = kmalloc_array(BITS_TO_LONGS(chip->ngpio),
> > > +				     sizeof(*mask),
> > > +				     can_sleep ? GFP_KERNEL : GFP_ATOMIC);
> > > +
> > > +		if (!mask)
> > > +			return;
> > > +
> > > +		bits = kmalloc_array(BITS_TO_LONGS(chip->ngpio),
> > > +				     sizeof(*bits),
> > > +				     can_sleep ? GFP_KERNEL : GFP_ATOMIC);
> > > +
> > > +		if (!bits) {
> > > +			kfree(mask);
> > > +			return;
> > > +		}
> > > +
> > >  		if (!can_sleep)
> > >  			WARN_ON(chip->can_sleep);
> > >  
> > > -		memset(mask, 0, sizeof(mask));
> > > +		memset(mask, 0, sizeof(*mask));
> > 
> > Other random thoughts: maybe two allocations for each loop iteration is
> > a bit much. Maybe do a first pass over the array and collect the maximal
> > chip->ngpio, do the memory allocation and freeing outside the loop (then
> > you'd of course need to preserve the memset() with appropriate length
> > computed). And maybe even just do one allocation, making bits point at
> > the second half.
> 
> I think those are great ideas because the function is kind of a hotpath
> and usage of VLAs was motivated by the desire to make it fast.
> 
> I'd go one step further and store the maximum ngpio of all registered
> chips in a global variable (and update it in gpiochip_add_data_with_key()),
> then allocate 2 * max_ngpio once before entering the loop (as you've
> suggested). That would avoid the first pass to determine the maximum
> chip->ngpio. In most systems max_ngpio will be < 64, so one or two
> unsigned longs depending on the arch's bitness.

Actually, scratch that. If ngpio is usually smallish, we can just allocate
reasonably sized space for mask and bits on the stack, and fall back to
the kcalloc slowpath only if chip->ngpio exceeds that limit. Basically
the below (likewise compile-tested only), this is on top of Laura's patch,
could be squashed together. Let me know what you think, thanks.

-- >8 --
Subject: [PATCH] gpio: Add fastpath to gpiod_get/set_array_value_complex()

Signed-off-by: Lukas Wunner
---
 drivers/gpio/gpiolib.c | 76 ++++++++++++++++++++++++--------------------------
 1 file changed, 37 insertions(+), 39 deletions(-)

diff --git a/drivers/gpio/gpiolib.c b/drivers/gpio/gpiolib.c
index 429bc251392b..ffc67b0b866c 100644
--- a/drivers/gpio/gpiolib.c
+++ b/drivers/gpio/gpiolib.c
@@ -2432,6 +2432,8 @@ static int gpio_chip_get_multiple(struct gpio_chip *chip,
 	return -EIO;
 }
 
+#define FASTPATH_NGPIO 256
+
 int gpiod_get_array_value_complex(bool raw, bool can_sleep,
 				  unsigned int array_size,
 				  struct gpio_desc **desc_array,
@@ -2441,27 +2443,24 @@ int gpiod_get_array_value_complex(bool raw, bool can_sleep,
 
 	while (i < array_size) {
 		struct gpio_chip *chip = desc_array[i]->gdev->chip;
-		unsigned long *mask;
-		unsigned long *bits;
+		unsigned long fastpath[2 * BITS_TO_LONGS(FASTPATH_NGPIO)];
+		unsigned long *slowpath = NULL, *mask, *bits;
 		int first, j, ret;
 
-		mask = kcalloc(BITS_TO_LONGS(chip->ngpio),
-			       sizeof(*mask),
-			       can_sleep ? GFP_KERNEL : GFP_ATOMIC);
-
-		if (!mask)
-			return -ENOMEM;
-
-		bits = kcalloc(BITS_TO_LONGS(chip->ngpio),
-			       sizeof(*bits),
-			       can_sleep ? GFP_KERNEL : GFP_ATOMIC);
-
-		if (!bits) {
-			kfree(mask);
-			return -ENOMEM;
+		if (likely(chip->ngpio <= FASTPATH_NGPIO)) {
+			memset(fastpath, 0, sizeof(fastpath));
+			mask = fastpath;
+			bits = fastpath + BITS_TO_LONGS(FASTPATH_NGPIO);
+		} else {
+			slowpath = kcalloc(2 * BITS_TO_LONGS(chip->ngpio),
+					   sizeof(*slowpath),
+					   can_sleep ? GFP_KERNEL : GFP_ATOMIC);
+			if (!slowpath)
+				return -ENOMEM;
+			mask = slowpath;
+			bits = slowpath + BITS_TO_LONGS(chip->ngpio);
 		}
 
-
 		if (!can_sleep)
 			WARN_ON(chip->can_sleep);
 
@@ -2478,8 +2477,8 @@ int gpiod_get_array_value_complex(bool raw, bool can_sleep,
 
 		ret = gpio_chip_get_multiple(chip, mask, bits);
 		if (ret) {
-			kfree(bits);
-			kfree(mask);
+			if (slowpath)
+				kfree(slowpath);
 			return ret;
 		}
 
@@ -2493,8 +2492,9 @@ int gpiod_get_array_value_complex(bool raw, bool can_sleep,
 			value_array[j] = value;
 			trace_gpio_value(desc_to_gpio(desc), 1, value);
 		}
-		kfree(bits);
-		kfree(mask);
+
+		if (slowpath)
+			kfree(slowpath);
 	}
 	return 0;
 }
@@ -2699,24 +2699,22 @@ int gpiod_set_array_value_complex(bool raw, bool can_sleep,
 
 	while (i < array_size) {
 		struct gpio_chip *chip = desc_array[i]->gdev->chip;
-		unsigned long *mask;
-		unsigned long *bits;
+		unsigned long fastpath[2 * BITS_TO_LONGS(FASTPATH_NGPIO)];
+		unsigned long *slowpath = NULL, *mask, *bits;
 		int count = 0;
 
-		mask = kcalloc(BITS_TO_LONGS(chip->ngpio),
-			       sizeof(*mask),
-			       can_sleep ? GFP_KERNEL : GFP_ATOMIC);
-
-		if (!mask)
-			return -ENOMEM;
-
-		bits = kcalloc(BITS_TO_LONGS(chip->ngpio),
-			       sizeof(*bits),
-			       can_sleep ? GFP_KERNEL : GFP_ATOMIC);
-
-		if (!bits) {
-			kfree(mask);
-			return -ENOMEM;
+		if (likely(chip->ngpio <= FASTPATH_NGPIO)) {
+			memset(fastpath, 0, sizeof(fastpath));
+			mask = fastpath;
+			bits = fastpath + BITS_TO_LONGS(FASTPATH_NGPIO);
+		} else {
+			slowpath = kcalloc(2 * BITS_TO_LONGS(chip->ngpio),
+					   sizeof(*slowpath),
+					   can_sleep ? GFP_KERNEL : GFP_ATOMIC);
+			if (!slowpath)
+				return -ENOMEM;
+			mask = slowpath;
+			bits = slowpath + BITS_TO_LONGS(chip->ngpio);
 		}
 
 		if (!can_sleep)
@@ -2753,8 +2751,8 @@ int gpiod_set_array_value_complex(bool raw, bool can_sleep,
 		if (count != 0)
 			gpio_chip_set_multiple(chip, mask, bits);
 
-		kfree(mask);
-		kfree(bits);
+		if (slowpath)
+			kfree(slowpath);
 	}
 	return 0;
 }
-- 
2.16.2
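
For anyone who wants to try the fastpath/slowpath idea outside the kernel,
the following is a minimal userspace sketch of the same pattern. It is an
illustration under assumptions, not part of the patch: count_lines() is a
hypothetical helper, calloc()/free() stand in for kcalloc()/kfree(), and
BITS_PER_LONG and BITS_TO_LONGS() are redefined locally rather than taken
from kernel headers. With FASTPATH_NGPIO at 256, the on-stack buffer for
mask plus bits comes to 64 bytes whether long is 32 or 64 bits wide.

```c
#include <assert.h>
#include <limits.h>
#include <stdlib.h>
#include <string.h>

/* Local stand-ins for the kernel macros (assumption: userspace build). */
#define BITS_PER_LONG    (sizeof(unsigned long) * CHAR_BIT)
#define BITS_TO_LONGS(n) (((n) + BITS_PER_LONG - 1) / BITS_PER_LONG)
#define FASTPATH_NGPIO   256	/* threshold used in the patch */

/*
 * Hypothetical helper, not a gpiolib function: marks each requested line
 * in mask and bits, then returns the number of distinct lines marked, or
 * -1 if the heap allocation fails. *used_heap reports the path taken.
 */
static int count_lines(unsigned int ngpio, const unsigned int *lines,
		       size_t nlines, int *used_heap)
{
	/* Constant-sized array, so this is not a VLA. */
	unsigned long fastpath[2 * BITS_TO_LONGS(FASTPATH_NGPIO)];
	unsigned long *slowpath = NULL, *mask, *bits;
	unsigned int n;
	size_t i;
	int count = 0;

	if (ngpio <= FASTPATH_NGPIO) {
		/* Common case: both bitmaps live in one stack buffer. */
		memset(fastpath, 0, sizeof(fastpath));
		mask = fastpath;
		bits = fastpath + BITS_TO_LONGS(FASTPATH_NGPIO);
	} else {
		/* Rare case: one zeroed heap block, bits is the second half. */
		slowpath = calloc(2 * BITS_TO_LONGS(ngpio), sizeof(*slowpath));
		if (!slowpath)
			return -1;
		mask = slowpath;
		bits = slowpath + BITS_TO_LONGS(ngpio);
	}
	*used_heap = (slowpath != NULL);

	for (i = 0; i < nlines; i++) {
		n = lines[i];
		assert(n < ngpio);
		mask[n / BITS_PER_LONG] |= 1UL << (n % BITS_PER_LONG);
		bits[n / BITS_PER_LONG] |= 1UL << (n % BITS_PER_LONG);
	}

	for (n = 0; n < ngpio; n++) {
		unsigned long bit = 1UL << (n % BITS_PER_LONG);

		if ((mask[n / BITS_PER_LONG] & bit) &&
		    (bits[n / BITS_PER_LONG] & bit))
			count++;
	}

	free(slowpath);	/* free(NULL) is a no-op on the fastpath */
	return count;
}
```

Calling count_lines() with ngpio = 64 stays on the stack, while ngpio = 300
takes the heap path. The sketch calls free() unconditionally because
free(NULL) is defined to do nothing; kfree(NULL) is likewise a no-op in the
kernel, so the "if (slowpath)" checks in the patch are a style choice rather
than a correctness requirement.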