From: Joonsoo Kim <[email protected]>
Major changes from v1
o hold node lock instead of slab_mutex in kmem_cache_shrink()
o fix suspend-to-ram issue reported by Nishanth
o use synchronize_sched() instead of kick_all_cpus_sync()
Under concurrent allocation, SLAB can be contended heavily because it does
a lot of work while holding a lock. This patchset tries to shrink the
critical sections to reduce lock contention. The major changes are a
lockless decision to allocate more slabs and a lockless cpu cache refill
from the newly allocated slab.
Below are the results of the concurrent allocation/free test in the slab
allocation benchmark Christoph made a long time ago. I simplified the
output. The numbers are the cycle counts for alloc/free respectively, so
lower is better.
* Before
Kmalloc N*alloc N*free(32): Average=365/806
Kmalloc N*alloc N*free(64): Average=452/690
Kmalloc N*alloc N*free(128): Average=736/886
Kmalloc N*alloc N*free(256): Average=1167/985
Kmalloc N*alloc N*free(512): Average=2088/1125
Kmalloc N*alloc N*free(1024): Average=4115/1184
Kmalloc N*alloc N*free(2048): Average=8451/1748
Kmalloc N*alloc N*free(4096): Average=16024/2048
* After
Kmalloc N*alloc N*free(32): Average=344/792
Kmalloc N*alloc N*free(64): Average=347/882
Kmalloc N*alloc N*free(128): Average=390/959
Kmalloc N*alloc N*free(256): Average=393/1067
Kmalloc N*alloc N*free(512): Average=683/1229
Kmalloc N*alloc N*free(1024): Average=1295/1325
Kmalloc N*alloc N*free(2048): Average=2513/1664
Kmalloc N*alloc N*free(4096): Average=4742/2172
The results show that performance improves greatly (roughly by more than
50%) for object classes larger than 128 bytes.
Joonsoo Kim (11):
mm/slab: fix the theoretical race by holding proper lock
mm/slab: remove BAD_ALIEN_MAGIC again
mm/slab: drain the free slab as much as possible
mm/slab: factor out kmem_cache_node initialization code
mm/slab: clean-up kmem_cache_node setup
mm/slab: don't keep free slabs if free_objects exceeds free_limit
mm/slab: racy access/modify the slab color
mm/slab: make cache_grow() handle the page allocated on arbitrary node
mm/slab: separate cache_grow() to two parts
mm/slab: refill cpu cache through a new slab without holding a node
lock
mm/slab: lockless decision to grow cache
mm/slab.c | 562 +++++++++++++++++++++++++++++++++-----------------------------
1 file changed, 295 insertions(+), 267 deletions(-)
--
1.9.1
From: Joonsoo Kim <[email protected]>
The initial attempt to remove BAD_ALIEN_MAGIC was reverted by commit
edcad2509550 ("Revert "slab: remove BAD_ALIEN_MAGIC"") because it caused a
problem on m68k, which has many nodes but !CONFIG_NUMA. In that case, the
alien cache isn't used at all, but to cope with some initialization paths,
a garbage value is used, and that garbage value is BAD_ALIEN_MAGIC.
Now that this patch sets use_alien_caches to 0 when !CONFIG_NUMA, there is
no initialization path problem, so we don't need BAD_ALIEN_MAGIC at all.
Remove it.
Tested-by: Geert Uytterhoeven <[email protected]>
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 6 ++----
1 file changed, 2 insertions(+), 4 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index d8746c0..373b8be 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -421,8 +421,6 @@ static struct kmem_cache kmem_cache_boot = {
.name = "kmem_cache",
};
-#define BAD_ALIEN_MAGIC 0x01020304ul
-
static DEFINE_PER_CPU(struct delayed_work, slab_reap_work);
static inline struct array_cache *cpu_cache_get(struct kmem_cache *cachep)
@@ -637,7 +635,7 @@ static int transfer_objects(struct array_cache *to,
static inline struct alien_cache **alloc_alien_cache(int node,
int limit, gfp_t gfp)
{
- return (struct alien_cache **)BAD_ALIEN_MAGIC;
+ return NULL;
}
static inline void free_alien_cache(struct alien_cache **ac_ptr)
@@ -1205,7 +1203,7 @@ void __init kmem_cache_init(void)
sizeof(struct rcu_head));
kmem_cache = &kmem_cache_boot;
- if (num_possible_nodes() == 1)
+ if (!IS_ENABLED(CONFIG_NUMA) || num_possible_nodes() == 1)
use_alien_caches = 0;
for (i = 0; i < NUM_INIT_LISTS; i++)
--
1.9.1
From: Joonsoo Kim <[email protected]>
Under concurrent allocation, SLAB can be contended heavily because it does
a lot of work while holding a lock. This patchset tries to shrink the
critical sections to reduce lock contention. The major changes are a
lockless decision to allocate more slabs and a lockless cpu cache refill
from the newly allocated slab.
Below are the results of the concurrent allocation/free test in the slab
allocation benchmark Christoph made a long time ago. I simplified the
output. The numbers are the cycle counts for alloc/free respectively, so
lower is better.
* Before
Kmalloc N*alloc N*free(32): Average=365/806
Kmalloc N*alloc N*free(64): Average=452/690
Kmalloc N*alloc N*free(128): Average=736/886
Kmalloc N*alloc N*free(256): Average=1167/985
Kmalloc N*alloc N*free(512): Average=2088/1125
Kmalloc N*alloc N*free(1024): Average=4115/1184
Kmalloc N*alloc N*free(2048): Average=8451/1748
Kmalloc N*alloc N*free(4096): Average=16024/2048
* After
Kmalloc N*alloc N*free(32): Average=344/792
Kmalloc N*alloc N*free(64): Average=347/882
Kmalloc N*alloc N*free(128): Average=390/959
Kmalloc N*alloc N*free(256): Average=393/1067
Kmalloc N*alloc N*free(512): Average=683/1229
Kmalloc N*alloc N*free(1024): Average=1295/1325
Kmalloc N*alloc N*free(2048): Average=2513/1664
Kmalloc N*alloc N*free(4096): Average=4742/2172
The results show that performance improves greatly (roughly by more than
50%) for object classes larger than 128 bytes.
This patch (of 11):
If we hold neither the slab_mutex nor the node lock, the node's shared
array cache could be freed and re-populated. If __kmem_cache_shrink() is
called at the same time, it will call drain_array() with n->shared without
holding the node lock, so a problem can happen. This patch fixes the
situation by holding the node lock before trying to drain the shared
array.
In addition, add a debug check to confirm that no n->shared access race
exists.
v2:
o Hold the node lock instead of holding the slab_mutex (per Christoph)
o Add a debug check rather than a code comment (per Nikolay)
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 68 ++++++++++++++++++++++++++++++++++++++++++---------------------
1 file changed, 45 insertions(+), 23 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index a53a0f6..d8746c0 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -2173,6 +2173,11 @@ static void check_irq_on(void)
BUG_ON(irqs_disabled());
}
+static void check_mutex_acquired(void)
+{
+ BUG_ON(!mutex_is_locked(&slab_mutex));
+}
+
static void check_spinlock_acquired(struct kmem_cache *cachep)
{
#ifdef CONFIG_SMP
@@ -2192,13 +2197,27 @@ static void check_spinlock_acquired_node(struct kmem_cache *cachep, int node)
#else
#define check_irq_off() do { } while(0)
#define check_irq_on() do { } while(0)
+#define check_mutex_acquired() do { } while(0)
#define check_spinlock_acquired(x) do { } while(0)
#define check_spinlock_acquired_node(x, y) do { } while(0)
#endif
-static void drain_array(struct kmem_cache *cachep, struct kmem_cache_node *n,
- struct array_cache *ac,
- int force, int node);
+static void drain_array_locked(struct kmem_cache *cachep, struct array_cache *ac,
+ int node, bool free_all, struct list_head *list)
+{
+ int tofree;
+
+ if (!ac || !ac->avail)
+ return;
+
+ tofree = free_all ? ac->avail : (ac->limit + 4) / 5;
+ if (tofree > ac->avail)
+ tofree = (ac->avail + 1) / 2;
+
+ free_block(cachep, ac->entry, tofree, node, list);
+ ac->avail -= tofree;
+ memmove(ac->entry, &(ac->entry[tofree]), sizeof(void *) * ac->avail);
+}
static void do_drain(void *arg)
{
@@ -2222,6 +2241,7 @@ static void drain_cpu_caches(struct kmem_cache *cachep)
{
struct kmem_cache_node *n;
int node;
+ LIST_HEAD(list);
on_each_cpu(do_drain, cachep, 1);
check_irq_on();
@@ -2229,8 +2249,13 @@ static void drain_cpu_caches(struct kmem_cache *cachep)
if (n->alien)
drain_alien_cache(cachep, n->alien);
- for_each_kmem_cache_node(cachep, node, n)
- drain_array(cachep, n, n->shared, 1, node);
+ for_each_kmem_cache_node(cachep, node, n) {
+ spin_lock_irq(&n->list_lock);
+ drain_array_locked(cachep, n->shared, node, true, &list);
+ spin_unlock_irq(&n->list_lock);
+
+ slabs_destroy(cachep, &list);
+ }
}
/*
@@ -3873,29 +3898,26 @@ skip_setup:
* if drain_array() is used on the shared array.
*/
static void drain_array(struct kmem_cache *cachep, struct kmem_cache_node *n,
- struct array_cache *ac, int force, int node)
+ struct array_cache *ac, int node)
{
LIST_HEAD(list);
- int tofree;
+
+ /* ac from n->shared can be freed if we don't hold the slab_mutex. */
+ check_mutex_acquired();
if (!ac || !ac->avail)
return;
- if (ac->touched && !force) {
+
+ if (ac->touched) {
ac->touched = 0;
- } else {
- spin_lock_irq(&n->list_lock);
- if (ac->avail) {
- tofree = force ? ac->avail : (ac->limit + 4) / 5;
- if (tofree > ac->avail)
- tofree = (ac->avail + 1) / 2;
- free_block(cachep, ac->entry, tofree, node, &list);
- ac->avail -= tofree;
- memmove(ac->entry, &(ac->entry[tofree]),
- sizeof(void *) * ac->avail);
- }
- spin_unlock_irq(&n->list_lock);
- slabs_destroy(cachep, &list);
+ return;
}
+
+ spin_lock_irq(&n->list_lock);
+ drain_array_locked(cachep, ac, node, false, &list);
+ spin_unlock_irq(&n->list_lock);
+
+ slabs_destroy(cachep, &list);
}
/**
@@ -3933,7 +3955,7 @@ static void cache_reap(struct work_struct *w)
reap_alien(searchp, n);
- drain_array(searchp, n, cpu_cache_get(searchp), 0, node);
+ drain_array(searchp, n, cpu_cache_get(searchp), node);
/*
* These are racy checks but it does not matter
@@ -3944,7 +3966,7 @@ static void cache_reap(struct work_struct *w)
n->next_reap = jiffies + REAPTIMEOUT_NODE;
- drain_array(searchp, n, n->shared, 0, node);
+ drain_array(searchp, n, n->shared, node);
if (n->free_touched)
n->free_touched = 0;
--
1.9.1
From: Joonsoo Kim <[email protected]>
There is mostly the same code for setting up a kmem_cache_node in both
cpuup_prepare() and alloc_kmem_cache_node(). Factor it out and clean it up.
v2
o Rename setup_kmem_cache_node_node to setup_kmem_cache_nodes
o Fix suspend-to-ram issue reported by Nishanth
Tested-by: Nishanth Menon <[email protected]>
Tested-by: Jon Hunter <[email protected]>
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 168 +++++++++++++++++++++++++-------------------------------------
1 file changed, 68 insertions(+), 100 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index 49af685..27cb390 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -898,6 +898,63 @@ static int init_cache_node_node(int node)
return 0;
}
+static int setup_kmem_cache_node(struct kmem_cache *cachep,
+ int node, gfp_t gfp, bool force_change)
+{
+ int ret = -ENOMEM;
+ struct kmem_cache_node *n;
+ struct array_cache *old_shared = NULL;
+ struct array_cache *new_shared = NULL;
+ struct alien_cache **new_alien = NULL;
+ LIST_HEAD(list);
+
+ if (use_alien_caches) {
+ new_alien = alloc_alien_cache(node, cachep->limit, gfp);
+ if (!new_alien)
+ goto fail;
+ }
+
+ if (cachep->shared) {
+ new_shared = alloc_arraycache(node,
+ cachep->shared * cachep->batchcount, 0xbaadf00d, gfp);
+ if (!new_shared)
+ goto fail;
+ }
+
+ ret = init_cache_node(cachep, node, gfp);
+ if (ret)
+ goto fail;
+
+ n = get_node(cachep, node);
+ spin_lock_irq(&n->list_lock);
+ if (n->shared && force_change) {
+ free_block(cachep, n->shared->entry,
+ n->shared->avail, node, &list);
+ n->shared->avail = 0;
+ }
+
+ if (!n->shared || force_change) {
+ old_shared = n->shared;
+ n->shared = new_shared;
+ new_shared = NULL;
+ }
+
+ if (!n->alien) {
+ n->alien = new_alien;
+ new_alien = NULL;
+ }
+
+ spin_unlock_irq(&n->list_lock);
+ slabs_destroy(cachep, &list);
+
+fail:
+ kfree(old_shared);
+ kfree(new_shared);
+ free_alien_cache(new_alien);
+
+ return ret;
+}
+
static void cpuup_canceled(long cpu)
{
struct kmem_cache *cachep;
@@ -969,7 +1026,6 @@ free_slab:
static int cpuup_prepare(long cpu)
{
struct kmem_cache *cachep;
- struct kmem_cache_node *n = NULL;
int node = cpu_to_mem(cpu);
int err;
@@ -988,44 +1044,9 @@ static int cpuup_prepare(long cpu)
* array caches
*/
list_for_each_entry(cachep, &slab_caches, list) {
- struct array_cache *shared = NULL;
- struct alien_cache **alien = NULL;
-
- if (cachep->shared) {
- shared = alloc_arraycache(node,
- cachep->shared * cachep->batchcount,
- 0xbaadf00d, GFP_KERNEL);
- if (!shared)
- goto bad;
- }
- if (use_alien_caches) {
- alien = alloc_alien_cache(node, cachep->limit, GFP_KERNEL);
- if (!alien) {
- kfree(shared);
- goto bad;
- }
- }
- n = get_node(cachep, node);
- BUG_ON(!n);
-
- spin_lock_irq(&n->list_lock);
- if (!n->shared) {
- /*
- * We are serialised from CPU_DEAD or
- * CPU_UP_CANCELLED by the cpucontrol lock
- */
- n->shared = shared;
- shared = NULL;
- }
-#ifdef CONFIG_NUMA
- if (!n->alien) {
- n->alien = alien;
- alien = NULL;
- }
-#endif
- spin_unlock_irq(&n->list_lock);
- kfree(shared);
- free_alien_cache(alien);
+ err = setup_kmem_cache_node(cachep, node, GFP_KERNEL, false);
+ if (err)
+ goto bad;
}
return 0;
@@ -3676,72 +3697,19 @@ EXPORT_SYMBOL(kfree);
/*
* This initializes kmem_cache_node or resizes various caches for all nodes.
*/
-static int alloc_kmem_cache_node(struct kmem_cache *cachep, gfp_t gfp)
+static int setup_kmem_cache_nodes(struct kmem_cache *cachep, gfp_t gfp)
{
+ int ret;
int node;
struct kmem_cache_node *n;
- struct array_cache *new_shared;
- struct alien_cache **new_alien = NULL;
for_each_online_node(node) {
-
- if (use_alien_caches) {
- new_alien = alloc_alien_cache(node, cachep->limit, gfp);
- if (!new_alien)
- goto fail;
- }
-
- new_shared = NULL;
- if (cachep->shared) {
- new_shared = alloc_arraycache(node,
- cachep->shared*cachep->batchcount,
- 0xbaadf00d, gfp);
- if (!new_shared) {
- free_alien_cache(new_alien);
- goto fail;
- }
- }
-
- n = get_node(cachep, node);
- if (n) {
- struct array_cache *shared = n->shared;
- LIST_HEAD(list);
-
- spin_lock_irq(&n->list_lock);
-
- if (shared)
- free_block(cachep, shared->entry,
- shared->avail, node, &list);
-
- n->shared = new_shared;
- if (!n->alien) {
- n->alien = new_alien;
- new_alien = NULL;
- }
- n->free_limit = (1 + nr_cpus_node(node)) *
- cachep->batchcount + cachep->num;
- spin_unlock_irq(&n->list_lock);
- slabs_destroy(cachep, &list);
- kfree(shared);
- free_alien_cache(new_alien);
- continue;
- }
- n = kmalloc_node(sizeof(struct kmem_cache_node), gfp, node);
- if (!n) {
- free_alien_cache(new_alien);
- kfree(new_shared);
+ ret = setup_kmem_cache_node(cachep, node, gfp, true);
+ if (ret)
goto fail;
- }
- kmem_cache_node_init(n);
- n->next_reap = jiffies + REAPTIMEOUT_NODE +
- ((unsigned long)cachep) % REAPTIMEOUT_NODE;
- n->shared = new_shared;
- n->alien = new_alien;
- n->free_limit = (1 + nr_cpus_node(node)) *
- cachep->batchcount + cachep->num;
- cachep->node[node] = n;
}
+
return 0;
fail:
@@ -3783,7 +3751,7 @@ static int __do_tune_cpucache(struct kmem_cache *cachep, int limit,
cachep->shared = shared;
if (!prev)
- goto alloc_node;
+ goto setup_node;
for_each_online_cpu(cpu) {
LIST_HEAD(list);
@@ -3800,8 +3768,8 @@ static int __do_tune_cpucache(struct kmem_cache *cachep, int limit,
}
free_percpu(prev);
-alloc_node:
- return alloc_kmem_cache_node(cachep, gfp);
+setup_node:
+ return setup_kmem_cache_nodes(cachep, gfp);
}
static int do_tune_cpucache(struct kmem_cache *cachep, int limit,
--
1.9.1
From: Joonsoo Kim <[email protected]>
Currently, cache_grow() assumes that the allocated page's nodeid is the
same as the nodeid parameter used for the allocation request. If we
discard this assumption, we can handle the fallback_alloc() case
gracefully. So, this patch makes cache_grow() handle a page allocated on
an arbitrary node and cleans up the relevant code.
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 60 +++++++++++++++++++++---------------------------------------
1 file changed, 21 insertions(+), 39 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index a3422bc..1910589 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -2543,13 +2543,14 @@ static void slab_map_pages(struct kmem_cache *cache, struct page *page,
* Grow (by 1) the number of slabs within a cache. This is called by
* kmem_cache_alloc() when there are no active objs left in a cache.
*/
-static int cache_grow(struct kmem_cache *cachep,
- gfp_t flags, int nodeid, struct page *page)
+static int cache_grow(struct kmem_cache *cachep, gfp_t flags, int nodeid)
{
void *freelist;
size_t offset;
gfp_t local_flags;
+ int page_node;
struct kmem_cache_node *n;
+ struct page *page;
/*
* Be lazy and only check for valid flags here, keeping it out of the
@@ -2577,12 +2578,12 @@ static int cache_grow(struct kmem_cache *cachep,
* Get mem for the objs. Attempt to allocate a physical page from
* 'nodeid'.
*/
- if (!page)
- page = kmem_getpages(cachep, local_flags, nodeid);
+ page = kmem_getpages(cachep, local_flags, nodeid);
if (!page)
goto failed;
- n = get_node(cachep, nodeid);
+ page_node = page_to_nid(page);
+ n = get_node(cachep, page_node);
/* Get colour for the slab, and cal the next value. */
n->colour_next++;
@@ -2597,7 +2598,7 @@ static int cache_grow(struct kmem_cache *cachep,
/* Get slab management. */
freelist = alloc_slabmgmt(cachep, page, offset,
- local_flags & ~GFP_CONSTRAINT_MASK, nodeid);
+ local_flags & ~GFP_CONSTRAINT_MASK, page_node);
if (OFF_SLAB(cachep) && !freelist)
goto opps1;
@@ -2616,13 +2617,13 @@ static int cache_grow(struct kmem_cache *cachep,
STATS_INC_GROWN(cachep);
n->free_objects += cachep->num;
spin_unlock(&n->list_lock);
- return 1;
+ return page_node;
opps1:
kmem_freepages(cachep, page);
failed:
if (gfpflags_allow_blocking(local_flags))
local_irq_disable();
- return 0;
+ return -1;
}
#if DEBUG
@@ -2903,14 +2904,14 @@ alloc_done:
return obj;
}
- x = cache_grow(cachep, gfp_exact_node(flags), node, NULL);
+ x = cache_grow(cachep, gfp_exact_node(flags), node);
/* cache_grow can reenable interrupts, then ac could change. */
ac = cpu_cache_get(cachep);
node = numa_mem_id();
/* no objects in sight? abort */
- if (!x && ac->avail == 0)
+ if (x < 0 && ac->avail == 0)
return NULL;
if (!ac->avail) /* objects refilled by interrupt? */
@@ -3039,7 +3040,6 @@ static void *alternate_node_alloc(struct kmem_cache *cachep, gfp_t flags)
static void *fallback_alloc(struct kmem_cache *cache, gfp_t flags)
{
struct zonelist *zonelist;
- gfp_t local_flags;
struct zoneref *z;
struct zone *zone;
enum zone_type high_zoneidx = gfp_zone(flags);
@@ -3050,8 +3050,6 @@ static void *fallback_alloc(struct kmem_cache *cache, gfp_t flags)
if (flags & __GFP_THISNODE)
return NULL;
- local_flags = flags & (GFP_CONSTRAINT_MASK|GFP_RECLAIM_MASK);
-
retry_cpuset:
cpuset_mems_cookie = read_mems_allowed_begin();
zonelist = node_zonelist(mempolicy_slab_node(), flags);
@@ -3081,33 +3079,17 @@ retry:
* We may trigger various forms of reclaim on the allowed
* set and go into memory reserves if necessary.
*/
- struct page *page;
+ nid = cache_grow(cache, flags, numa_mem_id());
+ if (nid >= 0) {
+ obj = ____cache_alloc_node(cache,
+ gfp_exact_node(flags), nid);
- if (gfpflags_allow_blocking(local_flags))
- local_irq_enable();
- kmem_flagcheck(cache, flags);
- page = kmem_getpages(cache, local_flags, numa_mem_id());
- if (gfpflags_allow_blocking(local_flags))
- local_irq_disable();
- if (page) {
/*
- * Insert into the appropriate per node queues
+ * Another processor may allocate the objects in
+ * the slab since we are not holding any locks.
*/
- nid = page_to_nid(page);
- if (cache_grow(cache, flags, nid, page)) {
- obj = ____cache_alloc_node(cache,
- gfp_exact_node(flags), nid);
- if (!obj)
- /*
- * Another processor may allocate the
- * objects in the slab since we are
- * not holding any locks.
- */
- goto retry;
- } else {
- /* cache_grow already freed obj */
- obj = NULL;
- }
+ if (!obj)
+ goto retry;
}
}
@@ -3158,8 +3140,8 @@ retry:
must_grow:
spin_unlock(&n->list_lock);
- x = cache_grow(cachep, gfp_exact_node(flags), nodeid, NULL);
- if (x)
+ x = cache_grow(cachep, gfp_exact_node(flags), nodeid);
+ if (x >= 0)
goto retry;
return fallback_alloc(cachep, flags);
--
1.9.1
From: Joonsoo Kim <[email protected]>
Currently, the decision to free a slab is made each time a freed object is
put back into the slab. This has the following problem.
Assume free_limit = 10 and nr_free = 9.
Frees happen in the following sequence and nr_free changes as follows:
first free (slab becomes a free slab), second free (slab does not become a free slab)
nr_free: 9 -> 10 (after first free) -> 11 (after second free)
If we check whether we can free the current slab on each object free, we
can't free any slab in this situation, because the current slab isn't a
free slab at the moment nr_free exceeds free_limit (at the second free),
even though a free slab exists.
However, if we check at the end, we can free one free slab.
This problem can cause the slab subsystem to keep too much memory. This
patch fixes it by checking the number of free objects after all the free
work is done. If there are free slabs at that point, we free as many as
possible, so we keep the number of free slabs minimal.
v2: explain more about the problem
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 23 ++++++++++++++---------
1 file changed, 14 insertions(+), 9 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index 27cb390..6e61461 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -3283,6 +3283,9 @@ static void free_block(struct kmem_cache *cachep, void **objpp,
{
int i;
struct kmem_cache_node *n = get_node(cachep, node);
+ struct page *page;
+
+ n->free_objects += nr_objects;
for (i = 0; i < nr_objects; i++) {
void *objp;
@@ -3295,17 +3298,11 @@ static void free_block(struct kmem_cache *cachep, void **objpp,
check_spinlock_acquired_node(cachep, node);
slab_put_obj(cachep, page, objp);
STATS_DEC_ACTIVE(cachep);
- n->free_objects++;
/* fixup slab chains */
- if (page->active == 0) {
- if (n->free_objects > n->free_limit) {
- n->free_objects -= cachep->num;
- list_add_tail(&page->lru, list);
- } else {
- list_add(&page->lru, &n->slabs_free);
- }
- } else {
+ if (page->active == 0)
+ list_add(&page->lru, &n->slabs_free);
+ else {
/* Unconditionally move a slab to the end of the
* partial list on free - maximum time for the
* other objects to be freed, too.
@@ -3313,6 +3310,14 @@ static void free_block(struct kmem_cache *cachep, void **objpp,
list_add_tail(&page->lru, &n->slabs_partial);
}
}
+
+ while (n->free_objects > n->free_limit && !list_empty(&n->slabs_free)) {
+ n->free_objects -= cachep->num;
+
+ page = list_last_entry(&n->slabs_free, struct page, lru);
+ list_del(&page->lru);
+ list_add(&page->lru, list);
+ }
}
static void cache_flusharray(struct kmem_cache *cachep, struct array_cache *ac)
--
1.9.1
From: Joonsoo Kim <[email protected]>
This is a preparation step for implementing a lockless allocation path for
when there are no free objects in the kmem_cache. What we'd like to do
here is refill the cpu cache without holding the node lock. To accomplish
this, the refill should be done after the new slab is allocated but before
the slab is attached to the management list. So, this patch separates
cache_grow() into two parts, allocation and attaching to the list, in
order to add some code in between them in a following patch.
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 74 ++++++++++++++++++++++++++++++++++++++++++++-------------------
1 file changed, 52 insertions(+), 22 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index 1910589..2c28ad5 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -213,6 +213,11 @@ static void slabs_destroy(struct kmem_cache *cachep, struct list_head *list);
static int enable_cpucache(struct kmem_cache *cachep, gfp_t gfp);
static void cache_reap(struct work_struct *unused);
+static inline void fixup_objfreelist_debug(struct kmem_cache *cachep,
+ void **list);
+static inline void fixup_slab_list(struct kmem_cache *cachep,
+ struct kmem_cache_node *n, struct page *page,
+ void **list);
static int slab_early_init = 1;
#define INDEX_NODE kmalloc_index(sizeof(struct kmem_cache_node))
@@ -1797,7 +1802,7 @@ static size_t calculate_slab_order(struct kmem_cache *cachep,
/*
* Needed to avoid possible looping condition
- * in cache_grow()
+ * in cache_grow_begin()
*/
if (OFF_SLAB(freelist_cache))
continue;
@@ -2543,7 +2548,8 @@ static void slab_map_pages(struct kmem_cache *cache, struct page *page,
* Grow (by 1) the number of slabs within a cache. This is called by
* kmem_cache_alloc() when there are no active objs left in a cache.
*/
-static int cache_grow(struct kmem_cache *cachep, gfp_t flags, int nodeid)
+static struct page *cache_grow_begin(struct kmem_cache *cachep,
+ gfp_t flags, int nodeid)
{
void *freelist;
size_t offset;
@@ -2609,21 +2615,40 @@ static int cache_grow(struct kmem_cache *cachep, gfp_t flags, int nodeid)
if (gfpflags_allow_blocking(local_flags))
local_irq_disable();
- check_irq_off();
- spin_lock(&n->list_lock);
- /* Make slab active. */
- list_add_tail(&page->lru, &(n->slabs_free));
- STATS_INC_GROWN(cachep);
- n->free_objects += cachep->num;
- spin_unlock(&n->list_lock);
- return page_node;
+ return page;
+
opps1:
kmem_freepages(cachep, page);
failed:
if (gfpflags_allow_blocking(local_flags))
local_irq_disable();
- return -1;
+ return NULL;
+}
+
+static void cache_grow_end(struct kmem_cache *cachep, struct page *page)
+{
+ struct kmem_cache_node *n;
+ void *list = NULL;
+
+ check_irq_off();
+
+ if (!page)
+ return;
+
+ INIT_LIST_HEAD(&page->lru);
+ n = get_node(cachep, page_to_nid(page));
+
+ spin_lock(&n->list_lock);
+ if (!page->active)
+ list_add_tail(&page->lru, &(n->slabs_free));
+ else
+ fixup_slab_list(cachep, n, page, &list);
+ STATS_INC_GROWN(cachep);
+ n->free_objects += cachep->num - page->active;
+ spin_unlock(&n->list_lock);
+
+ fixup_objfreelist_debug(cachep, &list);
}
#if DEBUG
@@ -2834,6 +2859,7 @@ static void *cache_alloc_refill(struct kmem_cache *cachep, gfp_t flags)
struct array_cache *ac;
int node;
void *list = NULL;
+ struct page *page;
check_irq_off();
node = numa_mem_id();
@@ -2861,7 +2887,6 @@ retry:
}
while (batchcount > 0) {
- struct page *page;
/* Get slab alloc is to come from. */
page = get_first_slab(n, false);
if (!page)
@@ -2894,8 +2919,6 @@ alloc_done:
fixup_objfreelist_debug(cachep, &list);
if (unlikely(!ac->avail)) {
- int x;
-
/* Check if we can use obj in pfmemalloc slab */
if (sk_memalloc_socks()) {
void *obj = cache_alloc_pfmemalloc(cachep, n, flags);
@@ -2904,14 +2927,18 @@ alloc_done:
return obj;
}
- x = cache_grow(cachep, gfp_exact_node(flags), node);
+ page = cache_grow_begin(cachep, gfp_exact_node(flags), node);
+ cache_grow_end(cachep, page);
- /* cache_grow can reenable interrupts, then ac could change. */
+ /*
+ * cache_grow_begin() can reenable interrupts,
+ * then ac could change.
+ */
ac = cpu_cache_get(cachep);
node = numa_mem_id();
/* no objects in sight? abort */
- if (x < 0 && ac->avail == 0)
+ if (!page && ac->avail == 0)
return NULL;
if (!ac->avail) /* objects refilled by interrupt? */
@@ -3044,6 +3071,7 @@ static void *fallback_alloc(struct kmem_cache *cache, gfp_t flags)
struct zone *zone;
enum zone_type high_zoneidx = gfp_zone(flags);
void *obj = NULL;
+ struct page *page;
int nid;
unsigned int cpuset_mems_cookie;
@@ -3079,8 +3107,10 @@ retry:
* We may trigger various forms of reclaim on the allowed
* set and go into memory reserves if necessary.
*/
- nid = cache_grow(cache, flags, numa_mem_id());
- if (nid >= 0) {
+ page = cache_grow_begin(cache, flags, numa_mem_id());
+ cache_grow_end(cache, page);
+ if (page) {
+ nid = page_to_nid(page);
obj = ____cache_alloc_node(cache,
gfp_exact_node(flags), nid);
@@ -3108,7 +3138,6 @@ static void *____cache_alloc_node(struct kmem_cache *cachep, gfp_t flags,
struct kmem_cache_node *n;
void *obj;
void *list = NULL;
- int x;
VM_BUG_ON(nodeid < 0 || nodeid >= MAX_NUMNODES);
n = get_node(cachep, nodeid);
@@ -3140,8 +3169,9 @@ retry:
must_grow:
spin_unlock(&n->list_lock);
- x = cache_grow(cachep, gfp_exact_node(flags), nodeid);
- if (x >= 0)
+ page = cache_grow_begin(cachep, gfp_exact_node(flags), nodeid);
+ cache_grow_end(cachep, page);
+ if (page)
goto retry;
return fallback_alloc(cachep, flags);
--
1.9.1
From: Joonsoo Kim <[email protected]>
Until now, growing the cache puts a free slab on the node's slab list and
then we allocate free objects from it. This necessarily requires holding
the node lock, which is heavily contended. If we refill the cpu cache
before attaching the slab to the node's slab list, we can avoid holding
the node lock as much as possible, because the newly allocated slab is
only visible to the current task. This reduces lock contention.
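As an illustration only (a condensed sketch of the flow after this patch,
not a copy of the kernel code; refill_sketch() is a made-up name), the
refill slow path becomes roughly:

static void *refill_sketch(struct kmem_cache *cachep, gfp_t flags, int node)
{
	struct array_cache *ac = cpu_cache_get(cachep);
	struct page *page;

	/* 1. Allocate a new slab page; the node lock is not held here. */
	page = cache_grow_begin(cachep, gfp_exact_node(flags), node);

	/*
	 * 2. The new slab is still invisible to other CPUs, so the cpu
	 *    cache can be refilled from it without taking n->list_lock.
	 *    Re-read ac because cache_grow_begin() may re-enable irqs.
	 */
	ac = cpu_cache_get(cachep);
	if (page && !ac->avail)
		alloc_block(cachep, ac, page, ac->batchcount);

	/*
	 * 3. Only now is n->list_lock taken (inside cache_grow_end()) to
	 *    attach the slab to the node's slabs_partial/slabs_free list.
	 */
	cache_grow_end(cachep, page);

	return ac->avail ? ac->entry[--ac->avail] : NULL;
}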
Below are the results of the concurrent allocation/free test in the slab
allocation benchmark Christoph made a long time ago. I simplified the
output. The numbers are the cycle counts for alloc/free respectively, so
lower is better.
* Before
Kmalloc N*alloc N*free(32): Average=355/750
Kmalloc N*alloc N*free(64): Average=452/812
Kmalloc N*alloc N*free(128): Average=559/1070
Kmalloc N*alloc N*free(256): Average=1176/980
Kmalloc N*alloc N*free(512): Average=1939/1189
Kmalloc N*alloc N*free(1024): Average=3521/1278
Kmalloc N*alloc N*free(2048): Average=7152/1838
Kmalloc N*alloc N*free(4096): Average=13438/2013
* After
Kmalloc N*alloc N*free(32): Average=248/966
Kmalloc N*alloc N*free(64): Average=261/949
Kmalloc N*alloc N*free(128): Average=314/1016
Kmalloc N*alloc N*free(256): Average=741/1061
Kmalloc N*alloc N*free(512): Average=1246/1152
Kmalloc N*alloc N*free(1024): Average=2437/1259
Kmalloc N*alloc N*free(2048): Average=4980/1800
Kmalloc N*alloc N*free(4096): Average=9000/2078
The results show that contention is reduced for all object sizes and that
performance increases by 30~40%.
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 68 +++++++++++++++++++++++++++++++++------------------------------
1 file changed, 36 insertions(+), 32 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index 2c28ad5..cf12fbd 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -2852,6 +2852,30 @@ static noinline void *cache_alloc_pfmemalloc(struct kmem_cache *cachep,
return obj;
}
+/*
+ * Slab list should be fixed up by fixup_slab_list() for existing slab
+ * or cache_grow_end() for new slab
+ */
+static __always_inline int alloc_block(struct kmem_cache *cachep,
+ struct array_cache *ac, struct page *page, int batchcount)
+{
+ /*
+ * There must be at least one object available for
+ * allocation.
+ */
+ BUG_ON(page->active >= cachep->num);
+
+ while (page->active < cachep->num && batchcount--) {
+ STATS_INC_ALLOCED(cachep);
+ STATS_INC_ACTIVE(cachep);
+ STATS_SET_HIGH(cachep);
+
+ ac->entry[ac->avail++] = slab_get_obj(cachep, page);
+ }
+
+ return batchcount;
+}
+
static void *cache_alloc_refill(struct kmem_cache *cachep, gfp_t flags)
{
int batchcount;
@@ -2864,7 +2888,6 @@ static void *cache_alloc_refill(struct kmem_cache *cachep, gfp_t flags)
check_irq_off();
node = numa_mem_id();
-retry:
ac = cpu_cache_get(cachep);
batchcount = ac->batchcount;
if (!ac->touched && batchcount > BATCHREFILL_LIMIT) {
@@ -2894,21 +2917,7 @@ retry:
check_spinlock_acquired(cachep);
- /*
- * The slab was either on partial or free list so
- * there must be at least one object available for
- * allocation.
- */
- BUG_ON(page->active >= cachep->num);
-
- while (page->active < cachep->num && batchcount--) {
- STATS_INC_ALLOCED(cachep);
- STATS_INC_ACTIVE(cachep);
- STATS_SET_HIGH(cachep);
-
- ac->entry[ac->avail++] = slab_get_obj(cachep, page);
- }
-
+ batchcount = alloc_block(cachep, ac, page, batchcount);
fixup_slab_list(cachep, n, page, &list);
}
@@ -2928,21 +2937,18 @@ alloc_done:
}
page = cache_grow_begin(cachep, gfp_exact_node(flags), node);
- cache_grow_end(cachep, page);
/*
* cache_grow_begin() can reenable interrupts,
* then ac could change.
*/
ac = cpu_cache_get(cachep);
- node = numa_mem_id();
+ if (!ac->avail && page)
+ alloc_block(cachep, ac, page, batchcount);
+ cache_grow_end(cachep, page);
- /* no objects in sight? abort */
- if (!page && ac->avail == 0)
+ if (!ac->avail)
return NULL;
-
- if (!ac->avail) /* objects refilled by interrupt? */
- goto retry;
}
ac->touched = 1;
@@ -3136,14 +3142,13 @@ static void *____cache_alloc_node(struct kmem_cache *cachep, gfp_t flags,
{
struct page *page;
struct kmem_cache_node *n;
- void *obj;
+ void *obj = NULL;
void *list = NULL;
VM_BUG_ON(nodeid < 0 || nodeid >= MAX_NUMNODES);
n = get_node(cachep, nodeid);
BUG_ON(!n);
-retry:
check_irq_off();
spin_lock(&n->list_lock);
page = get_first_slab(n, false);
@@ -3165,19 +3170,18 @@ retry:
spin_unlock(&n->list_lock);
fixup_objfreelist_debug(cachep, &list);
- goto done;
+ return obj;
must_grow:
spin_unlock(&n->list_lock);
page = cache_grow_begin(cachep, gfp_exact_node(flags), nodeid);
+ if (page) {
+ /* This slab isn't counted yet so don't update free_objects */
+ obj = slab_get_obj(cachep, page);
+ }
cache_grow_end(cachep, page);
- if (page)
- goto retry;
- return fallback_alloc(cachep, flags);
-
-done:
- return obj;
+ return obj ? obj : fallback_alloc(cachep, flags);
}
static __always_inline void *
--
1.9.1
From: Joonsoo Kim <[email protected]>
To check precisely whether free objects exist or not, we need to grab a
lock. But accuracy isn't that important here, because the race window is
small and, if there are too many free objects, the cache reaper will reap
them. So, this patch makes the check for the existence of free objects
lockless. This reduces lock contention in the heavy-allocation case.
Note that, until now, n->shared could be freed by a write to slabinfo
while we are using it, but with a small trick in this patch we can safely
access it within an interrupt-disabled section.
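To illustrate the trick (a minimal sketch of the lifetime rule as I read
it, not the actual kernel code), the resize path frees the old shared
array only after synchronize_sched(), so a reader that sampled the pointer
with irqs disabled may keep using it until it re-enables irqs:

	/* Updater side (cache resize via slabinfo), sketch: */
	old_shared = n->shared;
	n->shared = new_shared;		/* publish the replacement */
	spin_unlock_irq(&n->list_lock);
	synchronize_sched();		/* wait for every irq-disabled
					 * (sched-RCU) section to finish */
	kfree(old_shared);		/* no reader can still see it */

	/* Reader side (allocation slow path), irqs already disabled: */
	shared = READ_ONCE(n->shared);	/* sample the pointer once */
	if (!n->free_objects && (!shared || !shared->avail))
		/* racy but safe: shared stays valid while irqs are off,
		 * so skip the node lock and grow the cache directly */
		goto direct_grow;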
Below are the results of the concurrent allocation/free test in the slab
allocation benchmark Christoph made a long time ago. I simplified the
output. The numbers are the cycle counts for alloc/free respectively, so
lower is better.
* Before
Kmalloc N*alloc N*free(32): Average=248/966
Kmalloc N*alloc N*free(64): Average=261/949
Kmalloc N*alloc N*free(128): Average=314/1016
Kmalloc N*alloc N*free(256): Average=741/1061
Kmalloc N*alloc N*free(512): Average=1246/1152
Kmalloc N*alloc N*free(1024): Average=2437/1259
Kmalloc N*alloc N*free(2048): Average=4980/1800
Kmalloc N*alloc N*free(4096): Average=9000/2078
* After
Kmalloc N*alloc N*free(32): Average=344/792
Kmalloc N*alloc N*free(64): Average=347/882
Kmalloc N*alloc N*free(128): Average=390/959
Kmalloc N*alloc N*free(256): Average=393/1067
Kmalloc N*alloc N*free(512): Average=683/1229
Kmalloc N*alloc N*free(1024): Average=1295/1325
Kmalloc N*alloc N*free(2048): Average=2513/1664
Kmalloc N*alloc N*free(4096): Average=4742/2172
The results show that allocation performance decreases for object sizes up
to 128 bytes, probably due to the extra checks in cache_alloc_refill().
But, considering the improvement in free performance, the net result looks
about the same. The results for the other size classes look very
promising: roughly a 50% performance improvement.
v2: replace kick_all_cpus_sync() with synchronize_sched().
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 21 ++++++++++++++++++---
1 file changed, 18 insertions(+), 3 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index cf12fbd..13e74aa 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -952,6 +952,15 @@ static int setup_kmem_cache_node(struct kmem_cache *cachep,
spin_unlock_irq(&n->list_lock);
slabs_destroy(cachep, &list);
+ /*
+ * To protect lockless access to n->shared during irq disabled context.
+ * If n->shared isn't NULL in irq disabled context, accessing to it is
+ * guaranteed to be valid until irq is re-enabled, because it will be
+ * freed after synchronize_sched().
+ */
+ if (force_change)
+ synchronize_sched();
+
fail:
kfree(old_shared);
kfree(new_shared);
@@ -2880,7 +2889,7 @@ static void *cache_alloc_refill(struct kmem_cache *cachep, gfp_t flags)
{
int batchcount;
struct kmem_cache_node *n;
- struct array_cache *ac;
+ struct array_cache *ac, *shared;
int node;
void *list = NULL;
struct page *page;
@@ -2901,11 +2910,16 @@ static void *cache_alloc_refill(struct kmem_cache *cachep, gfp_t flags)
n = get_node(cachep, node);
BUG_ON(ac->avail > 0 || !n);
+ shared = READ_ONCE(n->shared);
+ if (!n->free_objects && (!shared || !shared->avail))
+ goto direct_grow;
+
spin_lock(&n->list_lock);
+ shared = READ_ONCE(n->shared);
/* See if we can refill from the shared array */
- if (n->shared && transfer_objects(ac, n->shared, batchcount)) {
- n->shared->touched = 1;
+ if (shared && transfer_objects(ac, shared, batchcount)) {
+ shared->touched = 1;
goto alloc_done;
}
@@ -2927,6 +2941,7 @@ alloc_done:
spin_unlock(&n->list_lock);
fixup_objfreelist_debug(cachep, &list);
+direct_grow:
if (unlikely(!ac->avail)) {
/* Check if we can use obj in pfmemalloc slab */
if (sk_memalloc_socks()) {
--
1.9.1
From: Joonsoo Kim <[email protected]>
The slab color doesn't need to be updated strictly. Since locking just to
change the slab color would cause more lock contention, this patch makes
the access and update of the slab color racy. This is a preparation step
for implementing a lockless allocation path for when there are no free
objects in the kmem_cache.
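For illustration (my reading of the hunk below, not additional kernel
code): without the lock, two CPUs can both bump colour_next past
cachep->colour, so the locally computed offset is clamped on its own; the
worst case is two slabs getting the same colour, which only affects
cache-line spreading, not correctness.

	n->colour_next++;		/* racy increment, no list_lock held */
	if (n->colour_next >= cachep->colour)
		n->colour_next = 0;

	offset = n->colour_next;	/* racy read, may be out of range */
	if (offset >= cachep->colour)
		offset = 0;		/* clamp locally */
	offset *= cachep->colour_off;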
Below are the results of the concurrent allocation/free test in the slab
allocation benchmark Christoph made a long time ago. I simplified the
output. The numbers are the cycle counts for alloc/free respectively, so
lower is better.
* Before
Kmalloc N*alloc N*free(32): Average=365/806
Kmalloc N*alloc N*free(64): Average=452/690
Kmalloc N*alloc N*free(128): Average=736/886
Kmalloc N*alloc N*free(256): Average=1167/985
Kmalloc N*alloc N*free(512): Average=2088/1125
Kmalloc N*alloc N*free(1024): Average=4115/1184
Kmalloc N*alloc N*free(2048): Average=8451/1748
Kmalloc N*alloc N*free(4096): Average=16024/2048
* After
Kmalloc N*alloc N*free(32): Average=355/750
Kmalloc N*alloc N*free(64): Average=452/812
Kmalloc N*alloc N*free(128): Average=559/1070
Kmalloc N*alloc N*free(256): Average=1176/980
Kmalloc N*alloc N*free(512): Average=1939/1189
Kmalloc N*alloc N*free(1024): Average=3521/1278
Kmalloc N*alloc N*free(2048): Average=7152/1838
Kmalloc N*alloc N*free(4096): Average=13438/2013
The results show that contention is reduced for object sizes >= 1024 and
that performance increases by roughly 15%.
Acked-by: Christoph Lameter <[email protected]>
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 26 +++++++++++++-------------
1 file changed, 13 insertions(+), 13 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index 6e61461..a3422bc 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -2561,20 +2561,7 @@ static int cache_grow(struct kmem_cache *cachep,
}
local_flags = flags & (GFP_CONSTRAINT_MASK|GFP_RECLAIM_MASK);
- /* Take the node list lock to change the colour_next on this node */
check_irq_off();
- n = get_node(cachep, nodeid);
- spin_lock(&n->list_lock);
-
- /* Get colour for the slab, and cal the next value. */
- offset = n->colour_next;
- n->colour_next++;
- if (n->colour_next >= cachep->colour)
- n->colour_next = 0;
- spin_unlock(&n->list_lock);
-
- offset *= cachep->colour_off;
-
if (gfpflags_allow_blocking(local_flags))
local_irq_enable();
@@ -2595,6 +2582,19 @@ static int cache_grow(struct kmem_cache *cachep,
if (!page)
goto failed;
+ n = get_node(cachep, nodeid);
+
+ /* Get colour for the slab, and cal the next value. */
+ n->colour_next++;
+ if (n->colour_next >= cachep->colour)
+ n->colour_next = 0;
+
+ offset = n->colour_next;
+ if (offset >= cachep->colour)
+ offset = 0;
+
+ offset *= cachep->colour_off;
+
/* Get slab management. */
freelist = alloc_slabmgmt(cachep, page, offset,
local_flags & ~GFP_CONSTRAINT_MASK, nodeid);
--
1.9.1
From: Joonsoo Kim <[email protected]>
It can be reused elsewhere, so factor it out. A following patch will use
it.
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 68 ++++++++++++++++++++++++++++++++++++---------------------------
1 file changed, 39 insertions(+), 29 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index 5451929..49af685 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -841,6 +841,40 @@ static inline gfp_t gfp_exact_node(gfp_t flags)
}
#endif
+static int init_cache_node(struct kmem_cache *cachep, int node, gfp_t gfp)
+{
+ struct kmem_cache_node *n;
+
+ /*
+ * Set up the kmem_cache_node for cpu before we can
+ * begin anything. Make sure some other cpu on this
+ * node has not already allocated this
+ */
+ n = get_node(cachep, node);
+ if (n)
+ return 0;
+
+ n = kmalloc_node(sizeof(struct kmem_cache_node), gfp, node);
+ if (!n)
+ return -ENOMEM;
+
+ kmem_cache_node_init(n);
+ n->next_reap = jiffies + REAPTIMEOUT_NODE +
+ ((unsigned long)cachep) % REAPTIMEOUT_NODE;
+
+ n->free_limit =
+ (1 + nr_cpus_node(node)) * cachep->batchcount + cachep->num;
+
+ /*
+ * The kmem_cache_nodes don't come and go as CPUs
+ * come and go. slab_mutex is sufficient
+ * protection here.
+ */
+ cachep->node[node] = n;
+
+ return 0;
+}
+
/*
* Allocates and initializes node for a node on each slab cache, used for
* either memory or cpu hotplug. If memory is being hot-added, the kmem_cache_node
@@ -852,39 +886,15 @@ static inline gfp_t gfp_exact_node(gfp_t flags)
*/
static int init_cache_node_node(int node)
{
+ int ret;
struct kmem_cache *cachep;
- struct kmem_cache_node *n;
- const size_t memsize = sizeof(struct kmem_cache_node);
list_for_each_entry(cachep, &slab_caches, list) {
- /*
- * Set up the kmem_cache_node for cpu before we can
- * begin anything. Make sure some other cpu on this
- * node has not already allocated this
- */
- n = get_node(cachep, node);
- if (!n) {
- n = kmalloc_node(memsize, GFP_KERNEL, node);
- if (!n)
- return -ENOMEM;
- kmem_cache_node_init(n);
- n->next_reap = jiffies + REAPTIMEOUT_NODE +
- ((unsigned long)cachep) % REAPTIMEOUT_NODE;
-
- /*
- * The kmem_cache_nodes don't come and go as CPUs
- * come and go. slab_mutex is sufficient
- * protection here.
- */
- cachep->node[node] = n;
- }
-
- spin_lock_irq(&n->list_lock);
- n->free_limit =
- (1 + nr_cpus_node(node)) *
- cachep->batchcount + cachep->num;
- spin_unlock_irq(&n->list_lock);
+ ret = init_cache_node(cachep, node, GFP_KERNEL);
+ if (ret)
+ return ret;
}
+
return 0;
}
--
1.9.1
From: Joonsoo Kim <[email protected]>
slabs_tofree() means freeing all free slabs. We can do the same by simply
passing INT_MAX to drain_freelist().
Acked-by: Christoph Lameter <[email protected]>
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 12 +++---------
1 file changed, 3 insertions(+), 9 deletions(-)
diff --git a/mm/slab.c b/mm/slab.c
index 373b8be..5451929 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -888,12 +888,6 @@ static int init_cache_node_node(int node)
return 0;
}
-static inline int slabs_tofree(struct kmem_cache *cachep,
- struct kmem_cache_node *n)
-{
- return (n->free_objects + cachep->num - 1) / cachep->num;
-}
-
static void cpuup_canceled(long cpu)
{
struct kmem_cache *cachep;
@@ -958,7 +952,7 @@ free_slab:
n = get_node(cachep, node);
if (!n)
continue;
- drain_freelist(cachep, n, slabs_tofree(cachep, n));
+ drain_freelist(cachep, n, INT_MAX);
}
}
@@ -1110,7 +1104,7 @@ static int __meminit drain_cache_node_node(int node)
if (!n)
continue;
- drain_freelist(cachep, n, slabs_tofree(cachep, n));
+ drain_freelist(cachep, n, INT_MAX);
if (!list_empty(&n->slabs_full) ||
!list_empty(&n->slabs_partial)) {
@@ -2304,7 +2298,7 @@ int __kmem_cache_shrink(struct kmem_cache *cachep, bool deactivate)
check_irq_on();
for_each_kmem_cache_node(cachep, node, n) {
- drain_freelist(cachep, n, slabs_tofree(cachep, n));
+ drain_freelist(cachep, n, INT_MAX);
ret += !list_empty(&n->slabs_full) ||
!list_empty(&n->slabs_partial);
--
1.9.1
On Tue, 12 Apr 2016 13:51:06 +0900
[email protected] wrote:
> From: Joonsoo Kim <[email protected]>
>
> To check whther free objects exist or not precisely, we need to grab a
^^^^^^
(spelling)
> lock. But, accuracy isn't that important because race window would be
> even small and if there is too much free object, cache reaper would reap
> it. So, this patch makes the check for free object exisistence not to
^^^^^^^^^^^
(spelling)
> hold a lock. This will reduce lock contention in heavily allocation case.
>
> Note that until now, n->shared can be freed during the processing by
> writing slabinfo, but, with some trick in this patch, we can access it
> freely within interrupt disabled period.
>
> Below is the result of concurrent allocation/free in slab allocation
> benchmark made by Christoph a long time ago. I make the output simpler.
> The number shows cycle count during alloc/free respectively so less is
> better.
I cannot figure out which of Christoph's tests you are using. And I
even have a copy of his test here:
https://github.com/netoptimizer/prototype-kernel/blob/master/kernel/mm/slab_test.c
I think you need to describe the test a bit better...
After looking at the output on my own system for a long time, I guess you
are showing results from the "Concurrent allocs" test. Then it would be
relevant how many CPUs your system has.
It would also be relevant to mention that N=10000. And perhaps mention
what that means, e.g. all CPUs do N=10000 allocs concurrently, then
synchronize before doing N frees concurrently.
> * Before
> Kmalloc N*alloc N*free(32): Average=248/966
> Kmalloc N*alloc N*free(64): Average=261/949
> Kmalloc N*alloc N*free(128): Average=314/1016
> Kmalloc N*alloc N*free(256): Average=741/1061
> Kmalloc N*alloc N*free(512): Average=1246/1152
> Kmalloc N*alloc N*free(1024): Average=2437/1259
> Kmalloc N*alloc N*free(2048): Average=4980/1800
> Kmalloc N*alloc N*free(4096): Average=9000/2078
>
> * After
> Kmalloc N*alloc N*free(32): Average=344/792
> Kmalloc N*alloc N*free(64): Average=347/882
> Kmalloc N*alloc N*free(128): Average=390/959
> Kmalloc N*alloc N*free(256): Average=393/1067
> Kmalloc N*alloc N*free(512): Average=683/1229
> Kmalloc N*alloc N*free(1024): Average=1295/1325
> Kmalloc N*alloc N*free(2048): Average=2513/1664
> Kmalloc N*alloc N*free(4096): Average=4742/2172
>
> It shows that allocation performance decreases for the object size up to
> 128 and it may be due to extra checks in cache_alloc_refill(). But, with
> considering improvement of free performance, net result looks the same.
> Result for other size class looks very promising, roughly, 50% performance
> improvement.
Super nice performance boost. The numbers on my system are
significantly smaller, but this is a before/after test and the absolute
numbers are not that important.
Oh, maybe this was because I ran the test with SLUB... recompiling with
SLAB... and the results are comparable to your numbers (on my 8 core
i7-4790K CPU @ 4.00GHz)
--
Best regards,
Jesper Dangaard Brouer
MSc.CS, Principal Kernel Engineer at Red Hat
Author of http://www.iptv-analyzer.org
LinkedIn: http://www.linkedin.com/in/brouer
On Tue, Apr 12, 2016 at 09:24:34AM +0200, Jesper Dangaard Brouer wrote:
> On Tue, 12 Apr 2016 13:51:06 +0900
> [email protected] wrote:
>
> > From: Joonsoo Kim <[email protected]>
> >
> > To check whther free objects exist or not precisely, we need to grab a
> ^^^^^^
> (spelling)
Will fix.
> > lock. But, accuracy isn't that important because race window would be
> > even small and if there is too much free object, cache reaper would reap
> > it. So, this patch makes the check for free object exisistence not to
> ^^^^^^^^^^^
> (spelling)
Ditto.
>
> > hold a lock. This will reduce lock contention in heavily allocation case.
> >
> > Note that until now, n->shared can be freed during the processing by
> > writing slabinfo, but, with some trick in this patch, we can access it
> > freely within interrupt disabled period.
> >
> > Below is the result of concurrent allocation/free in slab allocation
> > benchmark made by Christoph a long time ago. I make the output simpler.
> > The number shows cycle count during alloc/free respectively so less is
> > better.
>
> I cannot figure out which if Christoph's tests you are using. And I
> even have a copy of his test here:
> https://github.com/netoptimizer/prototype-kernel/blob/master/kernel/mm/slab_test.c
I don't remember where I grabbed the source, but it's the same thing you
have. However, my version has some modifications for stable results: I run
each test 50 times and take the average.
> I think you need to describe the test a bit better...
Okay. I assumed that the relevant people (like Christoph or you) could
understand the results easily, but it seems not.
> Looking a long time at the output on my own system, I guess you are
> showing results from the "Concurrent allocs". Then it would be
> relevant how many CPUs your system have.
Right. I'm doing the test with my 8 core i7-3770 CPU @ 3.40GHz.
> It would also be relevant to mention that N=10000. And perhaps mention
> that it means, e.g all CPUs do N=10000 alloc concurrently, synchronize
> before doing N free concurrently.
I'm doing the test with N=100000.
>
> > * Before
> > Kmalloc N*alloc N*free(32): Average=248/966
> > Kmalloc N*alloc N*free(64): Average=261/949
> > Kmalloc N*alloc N*free(128): Average=314/1016
> > Kmalloc N*alloc N*free(256): Average=741/1061
> > Kmalloc N*alloc N*free(512): Average=1246/1152
> > Kmalloc N*alloc N*free(1024): Average=2437/1259
> > Kmalloc N*alloc N*free(2048): Average=4980/1800
> > Kmalloc N*alloc N*free(4096): Average=9000/2078
> >
> > * After
> > Kmalloc N*alloc N*free(32): Average=344/792
> > Kmalloc N*alloc N*free(64): Average=347/882
> > Kmalloc N*alloc N*free(128): Average=390/959
> > Kmalloc N*alloc N*free(256): Average=393/1067
> > Kmalloc N*alloc N*free(512): Average=683/1229
> > Kmalloc N*alloc N*free(1024): Average=1295/1325
> > Kmalloc N*alloc N*free(2048): Average=2513/1664
> > Kmalloc N*alloc N*free(4096): Average=4742/2172
> >
> > It shows that allocation performance decreases for the object size up to
> > 128 and it may be due to extra checks in cache_alloc_refill(). But, with
> > considering improvement of free performance, net result looks the same.
> > Result for other size class looks very promising, roughly, 50% performance
> > improvement.
>
> Super nice performance boost. The numbers on my system are
Thanks!
> significantly smaller, but this is a before/after test and the absolute
> numbers are not that important.
>
> Oh, maybe this was because I ran the test with SLUB... recompiling with
> SLAB... and the results are comparable to your numbers (on my 8 core
> i7-4790K CPU @ 4.00GHz)
Okay.
Thanks.
On Tue, 12 Apr 2016, [email protected] wrote:
> @@ -2222,6 +2241,7 @@ static void drain_cpu_caches(struct kmem_cache *cachep)
> {
> struct kmem_cache_node *n;
> int node;
> + LIST_HEAD(list);
>
> on_each_cpu(do_drain, cachep, 1);
> check_irq_on();
> @@ -2229,8 +2249,13 @@ static void drain_cpu_caches(struct kmem_cache *cachep)
> if (n->alien)
> drain_alien_cache(cachep, n->alien);
>
> - for_each_kmem_cache_node(cachep, node, n)
> - drain_array(cachep, n, n->shared, 1, node);
> + for_each_kmem_cache_node(cachep, node, n) {
> + spin_lock_irq(&n->list_lock);
> + drain_array_locked(cachep, n->shared, node, true, &list);
> + spin_unlock_irq(&n->list_lock);
> +
> + slabs_destroy(cachep, &list);
Can the slabs_destroy() call be moved outside of the loop? It may be
faster then?
Acked-by: Christoph Lameter <[email protected]>
Acked-by: Christoph Lameter <[email protected]>
Acked-by: Christoph Lameter <[email protected]>
On Tue, Apr 12, 2016 at 11:38:39AM -0500, Christoph Lameter wrote:
> On Tue, 12 Apr 2016, [email protected] wrote:
>
> > @@ -2222,6 +2241,7 @@ static void drain_cpu_caches(struct kmem_cache *cachep)
> > {
> > struct kmem_cache_node *n;
> > int node;
> > + LIST_HEAD(list);
> >
> > on_each_cpu(do_drain, cachep, 1);
> > check_irq_on();
> > @@ -2229,8 +2249,13 @@ static void drain_cpu_caches(struct kmem_cache *cachep)
> > if (n->alien)
> > drain_alien_cache(cachep, n->alien);
> >
> > - for_each_kmem_cache_node(cachep, node, n)
> > - drain_array(cachep, n, n->shared, 1, node);
> > + for_each_kmem_cache_node(cachep, node, n) {
> > + spin_lock_irq(&n->list_lock);
> > + drain_array_locked(cachep, n->shared, node, true, &list);
> > + spin_unlock_irq(&n->list_lock);
> > +
> > + slabs_destroy(cachep, &list);
>
> Can the slabs_destroy() call be moved outside of the loop? It may be
> faster then?
Yes, it can. But I'd prefer to call it for each node. That would be
better for the cache, although the benefit would be marginal.
Thanks.
On Tue, Apr 12, 2016 at 01:50:59PM +0900, [email protected] wrote:
> From: Joonsoo Kim <[email protected]>
>
> It can be reused on other place, so factor out it. Following patch will
> use it.
>
> Signed-off-by: Joonsoo Kim <[email protected]>
> ---
> mm/slab.c | 68 ++++++++++++++++++++++++++++++++++++---------------------------
> 1 file changed, 39 insertions(+), 29 deletions(-)
>
> diff --git a/mm/slab.c b/mm/slab.c
> index 5451929..49af685 100644
> --- a/mm/slab.c
> +++ b/mm/slab.c
> @@ -841,6 +841,40 @@ static inline gfp_t gfp_exact_node(gfp_t flags)
> }
> #endif
>
> +static int init_cache_node(struct kmem_cache *cachep, int node, gfp_t gfp)
> +{
> + struct kmem_cache_node *n;
> +
> + /*
> + * Set up the kmem_cache_node for cpu before we can
> + * begin anything. Make sure some other cpu on this
> + * node has not already allocated this
> + */
> + n = get_node(cachep, node);
> + if (n)
> + return 0;
> +
> + n = kmalloc_node(sizeof(struct kmem_cache_node), gfp, node);
> + if (!n)
> + return -ENOMEM;
> +
> + kmem_cache_node_init(n);
> + n->next_reap = jiffies + REAPTIMEOUT_NODE +
> + ((unsigned long)cachep) % REAPTIMEOUT_NODE;
> +
> + n->free_limit =
> + (1 + nr_cpus_node(node)) * cachep->batchcount + cachep->num;
> +
> + /*
> + * The kmem_cache_nodes don't come and go as CPUs
> + * come and go. slab_mutex is sufficient
> + * protection here.
> + */
> + cachep->node[node] = n;
> +
> + return 0;
> +}
> +
Hello, Andrew.
Could you apply the following fix for this patch to mmotm?
Thanks.
------>8-----------
Date: Thu, 14 Apr 2016 10:28:11 +0900
Subject: [PATCH] mm/slab: fix bug
n->free_limit is set once during boot-up, before the other cpus are
enabled, so it could be a very low value. If we don't re-set it when
another cpu comes up, it stays too low. Fix it.
Signed-off-by: Joonsoo Kim <[email protected]>
---
mm/slab.c | 8 +++++++-
1 file changed, 7 insertions(+), 1 deletion(-)
diff --git a/mm/slab.c b/mm/slab.c
index 13e74aa..59dd94a 100644
--- a/mm/slab.c
+++ b/mm/slab.c
@@ -856,8 +856,14 @@ static int init_cache_node(struct kmem_cache *cachep, int node, gfp_t gfp)
* node has not already allocated this
*/
n = get_node(cachep, node);
- if (n)
+ if (n) {
+ spin_lock_irq(&n->list_lock);
+ n->free_limit = (1 + nr_cpus_node(node)) * cachep->batchcount +
+ cachep->num;
+ spin_unlock_irq(&n->list_lock);
+
return 0;
+ }
n = kmalloc_node(sizeof(struct kmem_cache_node), gfp, node);
if (!n)
--
1.9.1