Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932835Ab0BCUk3 (ORCPT ); Wed, 3 Feb 2010 15:40:29 -0500 Received: from ovro.ovro.caltech.edu ([192.100.16.2]:46273 "EHLO ovro.ovro.caltech.edu" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932826Ab0BCUkZ (ORCPT ); Wed, 3 Feb 2010 15:40:25 -0500 Date: Wed, 3 Feb 2010 12:40:23 -0800 From: "Ira W. Snyder" To: Andrew Morton Cc: Stefani Seibold , linux-kernel , Andi Kleen , Greg Kroah-Hartman , Alan Cox , Theodore Tso Subject: Re: [PATCH] enhanced reimplemention of the kfifo API Message-ID: <20100203204023.GA4544@ovro.caltech.edu> References: <1264597243.3607.24.camel@wall-e> <20100203120504.6da9b000.akpm@linux-foundation.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20100203120504.6da9b000.akpm@linux-foundation.org> User-Agent: Mutt/1.5.20 (2009-06-14) X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0 (ovro.ovro.caltech.edu); Wed, 03 Feb 2010 12:40:25 -0800 (PST) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 19171 Lines: 629 On Wed, Feb 03, 2010 at 12:05:04PM -0800, Andrew Morton wrote: > On Wed, 27 Jan 2010 14:00:43 +0100 > Stefani Seibold wrote: > > > This is a complete reimplementation of the new kfifo API, which is now > > really generic, type save and type definable. > > > > > > ... > > > > diff -u -N -r -p linux-2.6.33-rc5.dummy/include/linux/kfifo.h linux-2.6.33-rc5.new/include/linux/kfifo.h > > --- linux-2.6.33-rc5.dummy/include/linux/kfifo.h 1970-01-01 01:00:00.000000000 +0100 > > +++ linux-2.6.33-rc5.new/include/linux/kfifo.h 2010-01-27 13:34:06.324991898 +0100 > > > > ... > > > > +/* > > + * DECLARE_KFIFO_PTR - macro to declare a fifo pointer object > > + * @fifo: name of the declared fifo > > + * @type: type of the fifo elements > > + * @size: the number of elements in the fifo, this must be a power of 2. > > + */ > > Throughout this patch the comments are in kerneldoc form, but they > don't use the /** opening token, so the kerneldoc tools will ignore them. > > Please fix that up and test it. > > > +#define DECLARE_KFIFO_PTR(fifo, type) STRUCT_KFIFO_PTR(type) fifo > > + > > +/* helper macro */ > > +#define __is_kfifo_ptr(fifo) (sizeof(*fifo) == sizeof(struct __kfifo)) > > This is weird. It's a compile-time constant. What's it here for? > > > + > > +#define __kfifo_initializer(fifo) \ > > + (typeof(fifo)) { \ > > + { \ > > + { \ > > + .in = 0, \ > > + .out = 0, \ > > + .mask = __is_kfifo_ptr(&fifo) ? \ > > + 0 : \ > > + sizeof((fifo).buf)/sizeof(*(fifo).buf) - 1, \ > > + .esize = sizeof(*(fifo).buf), \ > > + .data = __is_kfifo_ptr(&fifo) ? \ > > + NULL : \ > > + (fifo).buf, \ > > + } \ > > + } \ > > + } > > + > > +/* > > + * INIT_KFIFO - Initialize a fifo declared by DECLARE_KFIFO > > + * @fifo: name of the declared fifo datatype > > + */ > > +#define INIT_KFIFO(fifo) fifo = __kfifo_initializer(fifo) > > Some versions of gcc generate quite poor code for > > struct foo f = { ... }; > > It would generate an instance of the struct into the stack and then > would memcpy it over, iirc. Please take a look, see if that's > happening with this construct (might need to use an old gcc version) > and if so, see if there's a fix? > > > +/* > > + * DECLARE_KFIFO - macro to declare a fifo object > > + * @fifo: name of the declared fifo > > + * @type: type of the fifo elements > > + * @size: the number of elements in the fifo, this must be a power of 2. > > + */ > > +#define DECLARE_KFIFO(fifo, type, size) STRUCT_KFIFO(type, size) fifo > > + > > +/* > > + * DEFINE_KFIFO - macro to define and initialize a fifo > > + * @fifo: name of the declared fifo datatype > > + * @type: type of the fifo elements > > + * @size: the number of elements in the fifo, this must be a power of 2. > > + * > > + * Note: the macro can be used for global and local fifo data type variables > > + */ > > +#define DEFINE_KFIFO(fifo, type, size) \ > > + DECLARE_KFIFO(fifo, type, size) = __kfifo_initializer(fifo) > > + > > +static inline unsigned int __must_check __kfifo_check(unsigned int val) > > +{ > > + return val; > > +} > > "check" is a poor term. Check what? For what? Unclear. > > If I knew what this was supposed to check, I could suggest a better > name. But it's called "check", so I don't know :( > > > +/* > > + * kfifo_initialized - Check if kfifo is initialized. > > + * @fifo: fifo to check > > + * Return %true if FIFO is initialized, otherwise %false. > > + * Assumes the fifo was 0 before. > > + * > > + * Note: for in place fifo's this macro returns alway true > > + */ > > +#define kfifo_initialized(fifo) ((fifo)->kfifo.data != NULL) > > kfifo_initialized() is suspicious. Any code which doesn't know whether > its kfifo was initialised is probably confused, lame and broken. > Fortunately kfifo_initialized() is not used anywhere. Please prepare a > separate patch to kill it sometime. > > > +/* > > + * kfifo_esize - returns the size of the element managed by the fifo > > + * @fifo: the fifo to be used. > > + */ > > +#define kfifo_esize(fifo) ((fifo)->kfifo.esize) > > This doesn't seem to be used anywhere either. > > > +/* > > + * kfifo_recsize - returns the size of the record length field > > + * @fifo: the fifo to be used. > > + */ > > +#define kfifo_recsize(fifo) (sizeof(*(fifo)->rectype)) > > Neither is this. What's going on? > > > +/* > > + * kfifo_size - returns the size of the fifo in elements > > + * @fifo: the fifo to be used. > > + */ > > +#define kfifo_size(fifo) ((fifo)->kfifo.mask + 1) > > + > > +/* > > + * kfifo_reset - removes the entire fifo content > > + * @fifo: the fifo to be used. > > + * > > + * Note: usage of kfifo_reset() is dangerous. It should be only called when the > > + * fifo is exclusived locked or when it is secured that no other thread is > > + * accessing the fifo. > > + */ > > +#define kfifo_reset(fifo) \ > > +(void)({ \ > > + typeof(fifo + 1) __tmp = (fifo); \ > > What does the "+ 1" trick do? > > > + __tmp->kfifo.in = __tmp->kfifo.out = 0; \ > > +}) > > + > > > > ... > > > > +/* > > + * kfifo_put - put data into the fifo > > + * @fifo: the fifo to be used. > > + * @val: the data to be added. > > + * > > + * This macro copies the given value into the fifo. > > + * > > + * Note that with only one concurrent reader and one concurrent > > + * writer, you don't need extra locking to use these macro. > > There are quite a few typos in the comments in this patch. > > > > > ... > > > > +/* > > + * kfifo_out_locked - gets some data from the fifo using a spinlock for locking > > + * @fifo: the fifo to be used. > > + * @buf: pointer to the storage buffer > > + * @n: max. number of elements to get > > + * @lock: pointer to the spinlock to use for locking. > > + * > > + * This macro get the data from the fifo and return the numbers of elements > > + * copied. > > + */ > > +#define kfifo_out_locked(fifo, buf, n, lock) \ > > +__kfifo_check( \ > > +({ \ > > + unsigned long __flags; \ > > + unsigned int __ret; \ > > + spin_lock_irqsave(lock, __flags); \ > > + __ret = kfifo_out(fifo, buf, n); \ > > + spin_unlock_irqrestore(lock, __flags); \ > > + __ret; \ > > +}) \ > > +) > > This is poorly named. Generally "foo_locked" is to be called when the > caller has already taken the lock. This identifier inverts that > convention. > > > +/* > > + * kfifo_from_user - puts some data from user space into the fifo > > + * @fifo: the fifo to be used. > > + * @from: pointer to the data to be added. > > + * @len: the length of the data to be added. > > + * @copied: pointer to output variable to store the number of copied bytes. > > + * > > + * This macro copies at most @len bytes from the @from into the > > + * fifo, depending of the available space and returns -EFAULT/0 > > + * > > + * Note that with only one concurrent reader and one concurrent > > + * writer, you don't need extra locking to use these macro. > > + */ > > +#define kfifo_from_user(fifo, from, len, copied) \ > > +__kfifo_check( \ > > +({ \ > > + typeof(fifo + 1) __tmp = (fifo); \ > > + const void __user *__from = (from); \ > > + unsigned int __len = (len); \ > > + unsigned int *__copied = (copied); \ > > + const size_t __recsize = sizeof(*__tmp->rectype); \ > > + struct __kfifo *__kfifo = &__tmp->kfifo; \ > > + (__recsize) ? \ > > + __kfifo_from_user_r(__kfifo, __from, __len, __copied, __recsize) : \ > > + __kfifo_from_user(__kfifo, __from, __len, __copied); \ > > +}) \ > > +) > > + > > +/* > > + * kfifo_to_user - copies data from the fifo into user space > > + * @fifo: the fifo to be used. > > + * @to: where the data must be copied. > > + * @len: the size of the destination buffer. > > + * @copied: pointer to output variable to store the number of copied bytes. > > + * > > + * This macro copies at most @len bytes from the fifo into the > > + * @to buffer and returns -EFAULT/0. > > + * > > + * Note that with only one concurrent reader and one concurrent > > + * writer, you don't need extra locking to use these macro. > > + */ > > +#define kfifo_to_user(fifo, to, len, copied) \ > > +__kfifo_check( \ > > +({ \ > > + typeof(fifo + 1) __tmp = (fifo); \ > > + void __user *__to = (to); \ > > + unsigned int __len = (len); \ > > + unsigned int *__copied = (copied); \ > > + const size_t __recsize = sizeof(*__tmp->rectype); \ > > + struct __kfifo *__kfifo = &__tmp->kfifo; \ > > + (__recsize) ? \ > > + __kfifo_to_user_r(__kfifo, __to, __len, __copied, __recsize) : \ > > + __kfifo_to_user(__kfifo, __to, __len, __copied); \ > > +}) \ > > +) > > Do these have any callers? > As a driver author, I have use for these functions. I've basically implemented a scatterlist_from_user() function in my own driver, which isn't upstream yet. See below for more. > > +/* > > + * kfifo_dma_in_prepare - setup a scatterlist for DMA input > > + * @fifo: the fifo to be used. > > + * @sgl: pointer to the scatterlist array. > > + * @nents: number of entries in the scatterlist array > > + * @len: number of elements to transfer. > > + * > > + * This macro fills a scatterlist for DMA input. > > + * It returns the number entries in the scatterlist array > > + * > > + * Note that with only one concurrent reader and one concurrent > > + * writer, you don't need extra locking to use these macros. > > + */ > > +#define kfifo_dma_in_prepare(fifo, sgl, nents, len) \ > > +({ \ > > + typeof(fifo + 1) __tmp = (fifo); \ > > + struct scatterlist *__sgl = (sgl); \ > > + int __nents = (nents); \ > > + unsigned int __len = (len); \ > > + const size_t __recsize = sizeof(*__tmp->rectype); \ > > + struct __kfifo *__kfifo = &__tmp->kfifo; \ > > + (__recsize) ? \ > > + __kfifo_dma_in_prepare_r(__kfifo, __sgl, __nents, __len, __recsize) : \ > > + __kfifo_dma_in_prepare(__kfifo,__sgl, __nents, __len); \ > > +}) > > Do the DMA functions have any callers? If not, why was all this stuff > added? > I have use for these too. The driver in question needs to retrieve an FPGA bitfile from userspace, setup a DMA, and trigger the (hardware) FPGA programming logic. The programming logic toggles the DMA pins appropriately until the FPGA bitfile is loaded into the FPGA. The driver isn't upstream, since it is only useful on my board, and no other hardware in existence. I didn't think anyone would care. This would remove my needs for a custom FIFO-ish type that I've invented on top of a scatterlist, all so I could use dma_map_sg(). I have another driver which periodically reads (via DMA) from the FPGA into a scatterlist. These scatterlists are buffered up until userspace reads them via a char device. The driver essentially goes: preallocate SG's interrupt -> DMA to SG -> SG to userspace I had to roll my own implementation to keep track of how much data had been copied to userspace out of the buffer. Speaking as a driver author, the new kfifo userspace and DMA API's would make my life just that much easier. Ira > > > > ... > > > > --- linux-2.6.33-rc5.dummy/kernel/kfifo.c 1970-01-01 01:00:00.000000000 +0100 > > +++ linux-2.6.33-rc5.new/kernel/kfifo.c 2010-01-27 13:33:56.456825816 +0100 > > @@ -0,0 +1,583 @@ > > +/* > > + * A generic kernel FIFO implementation > > + * > > + * Copyright (C) 2009/2010 Stefani Seibold > > + * > > + * This program is free software; you can redistribute it and/or modify > > + * it under the terms of the GNU General Public License as published by > > + * the Free Software Foundation; either version 2 of the License, or > > + * (at your option) any later version. > > + * > > + * This program is distributed in the hope that it will be useful, > > + * but WITHOUT ANY WARRANTY; without even the implied warranty of > > + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the > > + * GNU General Public License for more details. > > + * > > + * You should have received a copy of the GNU General Public License > > + * along with this program; if not, write to the Free Software > > + * Foundation, Inc., 675 Mass Ave, Cambridge, MA 02139, USA. > > + * > > + */ > > + > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > + > > +static unsigned int roundup_diff(unsigned int val, unsigned int size) > > +{ > > + if (size <= 1) > > + return size; > > + return (val + size - 1) / size; > > +} > > Would be useful to add a comment explaining why this exists. > > Can we not use existing general facilities? > > Should this be added to the existing general facilities? > > > +static inline unsigned int kfifo_unused(struct __kfifo *fifo) > > +{ > > + return (fifo->mask + 1) - (fifo->in - fifo->out); > > +} > > hm, how does this differ from kfifo_avail()? > > Should this use existing helpers such as kfifo_avail(), kfifo_len(), > etc? > > > +int __kfifo_alloc(struct __kfifo *fifo, unsigned int size, > > + size_t esize, gfp_t gfp_mask) > > +{ > > + /* > > + * round up to the next power of 2, since our 'let the indices > > + * wrap' technique works only in this case. > > + */ > > + if (!is_power_of_2(size)) > > + size = rounddown_pow_of_two(size); > > + > > + if (size < 2) > > + return -EINVAL; > > + > > + fifo->in = fifo->out = 0; > > fifo->in = 0; > fifo->out = 0; > > is usually preferred. > > > + fifo->data = kmalloc(size * esize, gfp_mask); > > + > > + if (!fifo->data) { > > + fifo->mask = 0; > > + return -ENOMEM; > > + } > > + > > + fifo->mask = size - 1; > > + fifo->esize = esize; > > + > > + return 0; > > +} > > +EXPORT_SYMBOL(__kfifo_alloc); > > + > > +void __kfifo_free(struct __kfifo *fifo) > > +{ > > + kfree(fifo); > > + fifo->data = 0; > > + fifo->mask = 0; > > + fifo->in = fifo->out = 0; > > +} > > +EXPORT_SYMBOL(__kfifo_free); > > err, no. This scribbles on memory after having returned it to slab! > > Which implies that this code wasn't tested with suitable kernel > debugging options enabled. Please absorb > Documentation/SubmitChecklist. > > Initialising memory to zero just prior to freeing it is lame anyway. > And it can hide bugs. Just remove this and return the fifo to slab > as-is. > > > + > > +int __kfifo_init(struct __kfifo *fifo, void *buffer, > > + unsigned int size, size_t esize) > > +{ > > + size /= esize; > > + > > + if (size) > > + size = rounddown_pow_of_two(size); > > + > > + if (size < 2) > > + return -EINVAL; > > + > > + fifo->data = buffer; > > + fifo->in = fifo->out = 0; > > + fifo->mask = size - 1; > > + fifo->esize = esize; > > + return 0; > > +} > > +EXPORT_SYMBOL(__kfifo_init); > > + > > +static void kfifo_copy_in(struct __kfifo *fifo, const void *src, > > + unsigned int len, unsigned int off) > > +{ > > + unsigned int size = fifo->mask + 1; > > + unsigned int esize = fifo->esize; > > + unsigned int l; > > + > > + off &= fifo->mask; > > + if (esize != 1) { > > + off *= esize; > > + size *= esize; > > + len *= esize; > > + } > > + l = min(len, size - off); > > + > > + memcpy(fifo->data + off, src, l); > > + memcpy(fifo->data, src + l, len - l); > > + smp_wmb(); > > It would be better if all the barriers in this code had comments > explaining why they're there. Looking at the above code it's quite > hard to determine which other code the barrier is protecting the data > against. > > > +} > > + > > > > ... > > > > +static unsigned long kfifo_copy_from_user(struct __kfifo *fifo, > > + const void *src, unsigned int len, unsigned int off, > > + unsigned int *copied) > > +{ > > + unsigned int size = fifo->mask + 1; > > + unsigned int esize = fifo->esize; > > + unsigned int l; > > + unsigned long ret; > > + > > + off &= fifo->mask; > > + if (esize != 1) { > > + off *= esize; > > + size *= esize; > > + len *= esize; > > + } > > + l = min(len, size - off); > > + > > + ret = copy_from_user(fifo->data + off, src, l); > > + if (unlikely(ret)) > > + ret = roundup_diff(ret + len - l, esize); > > + else { > > + ret = copy_from_user(fifo->data, src + l, len - l); > > + if (unlikely(ret)) > > + ret = roundup_diff(ret, esize); > > + } > > + smp_wmb(); > > + if (copied) > > Can `copied' ever be NULL? If not, remove this test. > > > + *copied = len - ret; > > + return ret; > > +} > > All pointers which refer to userspace should have the __user tag. And > the code should be checked with sparse, please. > > It's rather hard to work out the meaning of this function's return value. > > > +int __kfifo_from_user(struct __kfifo *fifo, const void __user *from, > > + unsigned long len, unsigned int *copied) > > +{ > > + unsigned int l; > > + unsigned long ret; > > + unsigned int esize = fifo->esize; > > + int err; > > + > > + if (esize != 1) > > + len /= esize; > > + > > + l = kfifo_unused(fifo); > > + if (len > l) > > + len = l; > > + > > + ret = kfifo_copy_from_user(fifo, from, len, fifo->in, copied); > > + if (unlikely(ret)) { > > + len -= ret; > > + err = -EFAULT; > > + } > > + else > > + err = 0; > > + fifo->in += len; > > + return err; > > +} > > +EXPORT_SYMBOL(__kfifo_from_user); > > + > > > > ... > > > > +static int setup_sgl_buf(struct scatterlist *sgl, void *buf, > > + int nents, unsigned int len) > > +{ > > + int n; > > + unsigned int l; > > + unsigned int off; > > + struct page *page; > > + > > + if (!nents) > > + return 0; > > + > > + if (!len) > > + return 0; > > + > > + n = 0; > > + page = virt_to_page(buf); > > + off = offset_in_page(buf); > > + l = 0; > > + > > + while(len >= l + PAGE_SIZE - off) { > > + struct page *npage; > > + > > + l += PAGE_SIZE; > > + buf += PAGE_SIZE; > > + npage = virt_to_page(buf); > > + if (page_to_phys(page) != page_to_phys(npage) - l) { > > + sgl->page_link = 0; > > + sg_set_page(sgl++, page, l - off, off); > > + if (++n == nents) > > + return n; > > + page = npage; > > + len -= l - off; > > + l = off = 0; > > + } > > + } > > + sgl->page_link = 0; > > + sg_set_page(sgl++, page, len, off); > > + return n + 1; > > +} > > Again, I just don't see why all this sg and dma stuff is here. Has it > been tested? > > > > > ... > > > > +/** > > + * __kfifo_peek_n internal helper function for determinate the length of > > + * the next record in the fifo > > + */ > > This comment purports to be kerneldoc ("/**") but it isn't. > > > +static unsigned int __kfifo_peek_n(struct __kfifo *fifo, size_t recsize) > > +{ > > + unsigned int l; > > + unsigned int mask = fifo->mask; > > + unsigned char *data = fifo->data; > > + > > + l = __KFIFO_PEEK(data, fifo->out, mask); > > + > > + if (--recsize) > > + l |= __KFIFO_PEEK(data, fifo->out + 1, mask) << 8; > > + > > + return l; > > +} > > + > > +#define __KFIFO_POKE(data, in, mask, val) \ > > + ( \ > > + (data)[(in) & (mask)] = (unsigned char)(val) \ > > + ) > > + > > +/** > > + * __kfifo_poke_n internal helper function for storeing the length of > > + * the record into the fifo > > + */ > > Please check all kerneldoc stuff. > > > > > ... > > > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/