Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752999AbZIUGYA (ORCPT ); Mon, 21 Sep 2009 02:24:00 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752124AbZIUGX6 (ORCPT ); Mon, 21 Sep 2009 02:23:58 -0400 Received: from courier.cs.helsinki.fi ([128.214.9.1]:39544 "EHLO mail.cs.helsinki.fi" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751401AbZIUGX5 (ORCPT ); Mon, 21 Sep 2009 02:23:57 -0400 Subject: Re: [PATCH 1/3] slqb: Do not use DEFINE_PER_CPU for per-node data From: Pekka Enberg To: Tejun Heo Cc: Mel Gorman , Nick Piggin , Christoph Lameter , heiko.carstens@de.ibm.com, sachinp@in.ibm.com, linux-kernel@vger.kernel.org, linux-mm@kvack.org, Andrew Morton In-Reply-To: <4AB6508C.4070602@kernel.org> References: <1253302451-27740-1-git-send-email-mel@csn.ul.ie> <1253302451-27740-2-git-send-email-mel@csn.ul.ie> <84144f020909200145w74037ab9vb66dae65d3b8a048@mail.gmail.com> <4AB5FD4D.3070005@kernel.org> <4AB5FFF8.7000602@cs.helsinki.fi> <4AB6508C.4070602@kernel.org> Date: Mon, 21 Sep 2009 09:24:00 +0300 Message-Id: <1253514240.5216.3.camel@penberg-laptop> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 7bit X-Mailer: Evolution 2.24.3 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1874 Lines: 46 Hi Tejun, On Mon, 2009-09-21 at 00:55 +0900, Tejun Heo wrote: > Pekka Enberg wrote: > > Tejun Heo wrote: > >> Pekka Enberg wrote: > >>> On Fri, Sep 18, 2009 at 10:34 PM, Mel Gorman wrote: > >>>> SLQB used a seemingly nice hack to allocate per-node data for the > >>>> statically > >>>> initialised caches. Unfortunately, due to some unknown per-cpu > >>>> optimisation, these regions are being reused by something else as the > >>>> per-node data is getting randomly scrambled. This patch fixes the > >>>> problem but it's not fully understood *why* it fixes the problem at the > >>>> moment. > >>> Ouch, that sounds bad. I guess it's architecture specific bug as x86 > >>> works ok? Lets CC Tejun. > >> > >> Is the corruption being seen on ppc or s390? > > > > On ppc. > > Can you please post full dmesg showing the corruption? Also, if you > apply the attached patch, does the added BUG_ON() trigger? I don't have the affected machines, Sachin and Mel do. Pekka > diff --git a/include/linux/percpu.h b/include/linux/percpu.h > index 878836c..fb690d2 100644 > --- a/include/linux/percpu.h > +++ b/include/linux/percpu.h > @@ -127,7 +127,7 @@ extern int __init pcpu_page_first_chunk(size_t reserved_size, > * dynamically allocated. Non-atomic access to the current CPU's > * version should probably be combined with get_cpu()/put_cpu(). > */ > -#define per_cpu_ptr(ptr, cpu) SHIFT_PERCPU_PTR((ptr), per_cpu_offset((cpu))) > +#define per_cpu_ptr(ptr, cpu) ({ BUG_ON(!(ptr)); SHIFT_PERCPU_PTR((ptr), per_cpu_offset((cpu))); }) > > extern void *__alloc_reserved_percpu(size_t size, size_t align); -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/