Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933225AbZJFUeB (ORCPT ); Tue, 6 Oct 2009 16:34:01 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S933193AbZJFUeB (ORCPT ); Tue, 6 Oct 2009 16:34:01 -0400 Received: from mx1.redhat.com ([209.132.183.28]:33478 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933200AbZJFUeA (ORCPT ); Tue, 6 Oct 2009 16:34:00 -0400 Date: Tue, 6 Oct 2009 16:32:26 -0400 From: Jason Baron To: Steven Rostedt Cc: Justin Mattock , Ingo Molnar , Peter Zijlstra , Li Zefan , Frederic Weisbecker , Linux Kernel Mailing List Subject: Re: system gets stuck in a lock during boot Message-ID: <20091006203225.GC2631@redhat.com> References: <1251096925.7538.121.camel@twins> <4A9251EB.8040805@gmail.com> <20090825085919.GB14003@elte.hu> <4A94803A.5060408@gmail.com> <20090826073351.GE23435@elte.hu> <4A9549E5.5020002@gmail.com> <20091002211211.GA2633@redhat.com> <1254792249.13160.213.camel@gandalf.stny.rr.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1254792249.13160.213.camel@gandalf.stny.rr.com> User-Agent: Mutt/1.5.18 (2008-05-17) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 5035 Lines: 132 On Mon, Oct 05, 2009 at 09:24:09PM -0400, Steven Rostedt wrote: > On Fri, 2009-10-02 at 17:12 -0400, Jason Baron wrote: > > > hi Justin, > > > > I've been playing around with gcc '4.5' as well and hit a panic that > > looks very similar to what you've seen with stock 2.6.31 - I haven't > > seen it anywhere else. Anyways, it seems to be some sort of alignment > > issue with the 'struct ftrace_event_call'. I'm not sure yet if this is a > > compiler or kernel issue. But the following kernel patch fixes the issue > > for me. It would be interesting to verify if the patch also resolves the > > issue for you. > > > > thanks, > > > > -Jason > > > > > > diff --git a/include/asm-generic/vmlinux.lds.h b/include/asm-generic/vmlinux.lds.h > > index 6ad76bf..0029af4 100644 > > --- a/include/asm-generic/vmlinux.lds.h > > +++ b/include/asm-generic/vmlinux.lds.h > > @@ -164,6 +164,7 @@ > > LIKELY_PROFILE() \ > > BRANCH_PROFILE() \ > > TRACE_PRINTKS() \ > > + . = ALIGN(32); \ > > FTRACE_EVENTS() \ > > TRACE_SYSCALLS() > > > > diff --git a/include/linux/ftrace_event.h b/include/linux/ftrace_event.h > > index a81170d..43f9f1e 100644 > > --- a/include/linux/ftrace_event.h > > +++ b/include/linux/ftrace_event.h > > @@ -124,7 +124,7 @@ struct ftrace_event_call { > > atomic_t profile_count; > > int (*profile_enable)(struct ftrace_event_call *); > > void (*profile_disable)(struct ftrace_event_call *); > > -}; > > +} __attribute__((aligned(32))); > > > > #define MAX_FILTER_PRED 32 > > #define MAX_FILTER_STR_VAL 128 > > diff --git a/include/trace/ftrace.h b/include/trace/ftrace.h > > index f64fbaa..4697fb6 100644 > > --- a/include/trace/ftrace.h > > +++ b/include/trace/ftrace.h > > @@ -600,7 +600,7 @@ static int ftrace_raw_init_event_##call(void) \ > > } \ > > \ > > static struct ftrace_event_call __used \ > > -__attribute__((__aligned__(4))) \ > > +__attribute__((__aligned__(32))) \ > > __attribute__((section("_ftrace_events"))) event_##call = { \ > > .name = #call, \ > > .system = __stringify(TRACE_SYSTEM), \ > > Are all alignments needed? Or just adding one might help. Or removing > the one directly above? > > -- Steve > So the problem I'm seeing is an oops on boot caused by the call->system pointer deference in event_create_dir(). The 'call' variable is of type 'struct ftrace_event_call'. What's going on is that the 'struct ftrace_event_call' is of size 168 bytes (sizeof(struct ftrace_event_call)) = 168 = 0xA8. However, in memory the structures are 16-byte aligned. Thus, the stride for walking through the pointers needs to be 176 (0xB0), but instead its 168 causing the oops. I've only seen this issue while using gcc (GCC) 4.5.0 20090916, on a vanilla 2.6.31 kernel. That said, I'm not sure the compiler is doing the wrong thing here. The 'struct ftrace_event_call' contains an embedded 'struct list_head' which is 16 bytes. According to the gcc docs, the aligned attribute, 'specifies a minimum alignment for the variable or structure field, measured in bytes'. Thus, at least according to the docs, gcc can increase the alignment of the 'struct ftrace_event_call', from its original specification of 4, to 16. Even in the case where we are working corectly the structures are 8-byte aligned. Thus, I would reccommend the patch below as a preventive measure. Its the minimal patch I've found to resolve this issue. In general, if we are going to walk data structures embedded in a special elf section, I think the general rules needs to be to set the alignment to the power of two which is greater than or equal to the largest item in the structure. thanks, -Jason Signed-off-by: Jason Baron diff --git a/include/linux/ftrace_event.h b/include/linux/ftrace_event.h index a81170d..7182f03 100644 --- a/include/linux/ftrace_event.h +++ b/include/linux/ftrace_event.h @@ -124,7 +124,10 @@ struct ftrace_event_call { atomic_t profile_count; int (*profile_enable)(struct ftrace_event_call *); void (*profile_disable)(struct ftrace_event_call *); -}; +} __attribute__((aligned(16))); + +/* Align to the largest field in the data structure: + * sizeof(struct list_head) = 16 */ #define MAX_FILTER_PRED 32 #define MAX_FILTER_STR_VAL 128 diff --git a/include/trace/ftrace.h b/include/trace/ftrace.h index f64fbaa..e344e81 100644 --- a/include/trace/ftrace.h +++ b/include/trace/ftrace.h @@ -600,7 +600,6 @@ static int ftrace_raw_init_event_##call(void) \ } \ \ static struct ftrace_event_call __used \ -__attribute__((__aligned__(4))) \ __attribute__((section("_ftrace_events"))) event_##call = { \ .name = #call, \ .system = __stringify(TRACE_SYSTEM), \ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/