From: Herbert Xu <herbert@gondor.apana.org.au>
Subject: Re: [v2 PATCH 1/16] crypto: skcipher - Add skcipher walk interface
Date: Tue, 15 Nov 2016 21:58:39 +0800
Message-ID: <20161115135839.GA20038@gondor.apana.org.au>
References: <20161113114354.GA8169@gondor.apana.org.au>
 <E1c5tE4-00028Y-BC@gondolin.me.apana.org.au>
 <20161114013548.GA4778@google.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Cc: Linux Crypto Mailing List <linux-crypto@vger.kernel.org>
To: Eric Biggers <ebiggers@google.com>
Content-Disposition: inline
In-Reply-To: <20161114013548.GA4778@google.com>
Sender: linux-crypto-owner@vger.kernel.org

Hi Eric:

On Sun, Nov 13, 2016 at 05:35:48PM -0800, Eric Biggers wrote:
> Hi Herbert,
> 
> On Sun, Nov 13, 2016 at 07:45:32PM +0800, Herbert Xu wrote:
> > +int skcipher_walk_done(struct skcipher_walk *walk, int err)
> > +{
> > +	unsigned int nbytes = 0;
> > +	unsigned int n = 0;
> > +
> > +	if (likely(err >= 0)) {
> > +		n = walk->nbytes - err;
> > +		nbytes = walk->total - n;
> > +	}
> > +
> > +	if (likely(!(walk->flags & (SKCIPHER_WALK_PHYS |
> > +				    SKCIPHER_WALK_SLOW |
> > +				    SKCIPHER_WALK_COPY |
> > +				    SKCIPHER_WALK_DIFF)))) {
> > +unmap_src:
> > +		skcipher_unmap_src(walk);
> 
> Isn't it wrong to unmap the buffers when err < 0?  They won't have been mapped
> yet, e.g. if the 'skcipher_walk_done(walk, -EINVAL)' case is hit.

Indeed.  The unmapping code needs to be inside the first if clause.
I'll rearrange it.

> > +	/* Short-circuit for the common/fast path. */
> > +	if (!(((unsigned long)walk->iv ^ (unsigned long)walk->oiv) |
> > +	     ((unsigned long)walk->buffer ^ (unsigned long)walk->page) |
> > +	     (unsigned long)walk->page))
> > +		goto out;
> 
> This is really saying that the IV wasn't copied and that walk->buffer and
> walk->page are both NULL.  But if walk->buffer is NULL then the IV wasn't
> copied.  So this can be simplified to:
> 
> 	if (!((unsigned long)walk->buffer | (unsigned long)walk->page))
> 		goto out;

Nice, I'll use this.

> > +int skcipher_walk_next(struct skcipher_walk *walk)
> > +{
> 
> Shouldn't this be static?  There's even a static declaration earlier in the
> file.

Yes, static added.

> > +static int skcipher_next_slow(struct skcipher_walk *walk)
> > +{
> > +	bool phys = walk->flags & SKCIPHER_WALK_PHYS;
> > +	unsigned alignmask = walk->alignmask;
> > +	unsigned bsize = walk->chunksize;
> ...
> > +	walk->nbytes = bsize;
> 
> skcipher_next_slow() is always setting up 'chunksize' bytes to be processed.
> Isn't this wrong in the 'blocksize < chunksize' case because fewer than
> 'chunksize' bytes may need to be processed?

Good catch.  This is supposed to be the bsize from the caller
which is capped by walk->total.

> > +static inline void *skcipher_map(struct scatter_walk *walk)
> > +{
> > +	struct page *page = scatterwalk_page(walk);
> > +
> > +	return (PageHighMem(page) ? kmap_atomic(page) : page_address(page)) +
> > +	       offset_in_page(walk->offset);
> > +}
> 
> Is the PageHighMem() optimization really worthwhile?  It will skip kmap_atomic()
> and hence preempt_disable() for non-highmem pages, so it may make it harder to
> find bugs where users are sleeping when they shouldn't be.  It also seems
> inconsistent that skcipher_map() will do this optimization but scatterwalk_map()
> won't.

It did make a big difference on my machine.  Because users such
as lrw will call the walker a lot more often after the conversion
to skcipher this is now worthwhile as an optimisation.  With the
old code the blkcipher walk was not called as often.

> > +		if (!walk->page) {
> > +			gfp_t gfp = skcipher_walk_gfp(walk);
> > +
> > +			walk->page = (void *)__get_free_page(gfp);
> > +			if (!walk->page)
> > +				goto slow_path;
> > +		}
> > +
> > +		walk->nbytes = min_t(unsigned, n,
> > +				     PAGE_SIZE - offset_in_page(walk->page));
> 
> walk->page will be page-aligned, so there's no need for offset_in_page().  Also,
> 'n' has already been clamped to at most one page.  So AFAICS this can simply be
> 'walk->nbytes = n;'.

This is valid for the sync walk, but not for the async case.  In
the async case, we will reuse the page if the previous copy did
not completely consume it.

> > +int skcipher_walk_done(struct skcipher_walk *walk, int err);
> ...
> > +void skcipher_walk_complete(struct skcipher_walk *walk, int err);
> 
> The naming is confusing: "done" vs. "complete".  They can easily be mixed up.
> Would it make more sense to have something with "async" in the name for the
> second one, like skcipher_walk_async_done()?

I see your point.  But I think async_done would also be confusing
because the async case needs to call both skcipher_walk_done and
skcipher_walk_async_done.

Thanks,
-- 
Email: Herbert Xu <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt