LinuxLists.cc - [BUG] perf: bogus correlation of kernel symbols

2011-05-12 14:48:51

by Stephane Eranian

Subject: [BUG] perf: bogus correlation of kernel symbols

Hi,

I think there is a serious problem with kernel symbol correlation
with the latest perf in 2.6.39-rc7-tip.

Here is a simple example with a stupid program that only
does open()/close() on /dev/null:

$ perf record -e cycles:k openclose
$ perf report --stdio

# Events: 2K cycles
#
# Overhead    Command     Shared Object           Symbol
# ........  .........  ................  ...............
#
    99.76%  openclose  [binfmt_misc]     [k] 0xffffffff81010fe6
     0.13%  openclose  libc-2.12.1.so    [.] __open_nocancel
     0.09%  openclose  libc-2.12.1.so    [.] __GI_close

The DSO (binfmt_misc) is bogus. That's not where time is spent.

But if I run the same test as root:

$ sudo perf record -e cycles:k openclose
$ sudo perf report --stdio

# Events: 2K cycles
#
# Overhead    Command      Shared Object                         Symbol
# ........  .........  .................  .............................
#
    17.13%  openclose  [kernel.kallsyms]  [k] __lock_acquire
    11.77%  openclose  [kernel.kallsyms]  [k] native_sched_clock
     7.36%  openclose  [kernel.kallsyms]  [k] sched_clock_local
     5.99%  openclose  [kernel.kallsyms]  [k] lock_release
     5.38%  openclose  [kernel.kallsyms]  [k] local_clock
     4.43%  openclose  [kernel.kallsyms]  [k] lock_acquired
     4.05%  openclose  [kernel.kallsyms]  [k] lock_acquire
     3.95%  openclose  [kernel.kallsyms]  [k] lock_is_held
     3.51%  openclose  [kernel.kallsyms]  [k] sched_clock_cpu
     3.24%  openclose  [kernel.kallsyms]  [k] trace_hardirqs_off_caller

This is much more meaningful.

This is not related to the paranoid level (1 for me).

Looking at perf report -D, the same kernel address is associated with a
different module depending on my permission level.

first perf.data:
416749738927 0x4210 [0x28]: PERF_RECORD_SAMPLE(IP, 1): 4886/4886:
0xffffffff8107c1d8 period: 2262681
... thread: openclose:4886
...... dso: /lib/modules/2.6.39-rc7-tip/kernel/fs/binfmt_misc.ko

second perf.data:
436879910722 0xc950 [0x28]: PERF_RECORD_SAMPLE(IP, 1): 4894/4894:
0xffffffff8107c1d8 period: 2280253
... thread: openclose:4894
...... dso: vmlinux

Same address, different mapping!

Every component of my path to vmlinux is accessible to me.

If there were permission problems, I would expect perf record or perf report
to tell me and not fall back to some bogus mappings.

2011-05-12 18:06:37

by David Miller

Subject: Re: [BUG] perf: bogus correlation of kernel symbols

From: Stephane Eranian <[email protected]>
Date: Thu, 12 May 2011 16:48:46 +0200

> I think there is a serious problem with kernel symbol correlation
> with the latest perf in 2.6.39-rc7-tip.

The behavior seems to be intentional, so that we don't expose internal
kernel addresses to userspace.

I hate this too, and I think it's absolutely ridiculous.

Also, like you, I lost an entire afternoon trying to figure out why
this started happening.

I wish we could revert this change.

2011-05-12 18:37:51

by Dave Jones

Subject: Re: [BUG] perf: bogus correlation of kernel symbols

On Thu, May 12, 2011 at 02:06:30PM -0400, David Miller wrote:
> From: Stephane Eranian <[email protected]>
> Date: Thu, 12 May 2011 16:48:46 +0200
>
> > I think there is a serious problem with kernel symbol correlation
> > with the latest perf in 2.6.39-rc7-tip.
>
> The behavior seems to be intentional, so that we don't expose internal
> kernel addresses to userspace.

Sounds like commit 9f36e2c448007b54851e7e4fa48da97d1477a175

> I hate this too, and I think it's absolutely ridiculous.
>
> Also, like you, I lost an entire afternoon trying to figure out why
> this started happening.
>
> I wish we could revert this change.

At least it can be permanently disabled..

echo kernel.kptr_restrict = 0 >> /etc/sysctl.conf

Dave
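
Before changing the setting, it can help to see what it currently is. The following is a minimal Python sketch, not part of the original mail. It assumes the value is readable from /proc/sys/kernel/kptr_restrict; the helper names and the wording of the descriptions are illustrative paraphrases and should be checked against the kernel's sysctl documentation for the running version.

```python
from pathlib import Path
from typing import Optional

# Path assumed from the sysctl name kernel.kptr_restrict.
KPTR_RESTRICT = Path("/proc/sys/kernel/kptr_restrict")

# Paraphrased descriptions for illustration only (an assumption, not kernel text).
MEANINGS = {
    0: "kernel pointers printed with %pK are shown as-is",
    1: "kernel pointers printed with %pK are zeroed for unprivileged readers",
    2: "kernel pointers printed with %pK are zeroed for everyone",
}


def describe_kptr_restrict(raw: str) -> str:
    """Turn the raw file contents into a readable one-line description."""
    value = int(raw.strip())
    return f"kptr_restrict={value}: {MEANINGS.get(value, 'unknown value')}"


def read_kptr_restrict(path: Path = KPTR_RESTRICT) -> Optional[str]:
    """Return the file contents, or None if the file cannot be read."""
    try:
        return path.read_text()
    except OSError:
        return None


if __name__ == "__main__":
    raw = read_kptr_restrict()
    if raw is None:
        print("kptr_restrict is not available on this system")
    else:
        print(describe_kptr_restrict(raw))
```

The parsing is kept separate from the file read so that it can be exercised on any machine, including ones without this sysctl.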

2011-05-12 19:01:45

by David Miller

Subject: Re: [BUG] perf: bogus correlation of kernel symbols

From: Dave Jones <[email protected]>
Date: Thu, 12 May 2011 14:37:41 -0400

> On Thu, May 12, 2011 at 02:06:30PM -0400, David Miller wrote:
> > I hate this too, and I think it's absolutely ridiculous.
> >
> > Also, like you, I lost an entire afternoon trying to figure out why
> > this started happening.
> >
> > I wish we could revert this change.
>
> At least it can be permanently disabled..
>
> echo kernel.kptr_restrict = 0 >> /etc/sysctl.conf

Regardless, what to do about all of the "perf is broken" reports?

First off, perf can find out whether this madness exists, and it
should by default print out a warning in this situation instead of
knowingly emitting garbage kernel event information.

"I'm going to knowingly give you bad data, and I'm not even going to
let you know about it."

It's really crazy that we give people these incredibly powerful tools
and they don't even work properly by default.

We've been exposing kernel pointers for 20 years, nobody's grandmother
died because of it.

This is very "Animal Farm" the way we're gradually losing little bits
of functionality, time and time again, over this "kernel pointer
exposure" issue.

Are we going to be like animals and just accept the totality of this,
or are we going to be outraged enough to push back on stuff like perf
actually working properly?
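
The kind of check asked for here (notice the situation and warn instead of emitting garbage) can be sketched roughly as follows. This is an illustrative Python sketch, not perf's code: it assumes that a restricted /proc/kallsyms shows every address as zero, and the function names and warning text are made up for the example.

```python
import sys


def kallsyms_looks_restricted(text: str) -> bool:
    """Return True if every address in kallsyms-style text is zero.

    Each line is expected to look like '<hex address> <type> <name> [module]'.
    Lines that do not parse are ignored.
    """
    addresses = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # blank or malformed line
        try:
            addresses.append(int(fields[0], 16))
        except ValueError:
            continue  # first field is not a hex address
    # An empty table is not treated as "restricted"; there is nothing to judge.
    return bool(addresses) and all(addr == 0 for addr in addresses)


def check_kallsyms(path: str = "/proc/kallsyms"):
    """Return a warning string if the symbol table looks hidden, else None."""
    try:
        with open(path) as f:
            text = f.read()
    except OSError as err:
        return f"warning: cannot read {path}: {err}"
    if kallsyms_looks_restricted(text):
        return ("warning: kernel addresses in " + path + " are all zero; "
                "kernel samples cannot be resolved to symbols")
    return None


if __name__ == "__main__":
    message = check_kallsyms()
    if message:
        print(message, file=sys.stderr)
```

A tool that ran a check like this before resolving samples could at least tell the user that kernel symbol information is unavailable.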

2011-05-12 19:58:55

by Pekka Enberg

Subject: Re: [BUG] perf: bogus correlation of kernel symbols

On Thu, May 12, 2011 at 10:01 PM, David Miller <[email protected]> wrote:
> From: Dave Jones <[email protected]>
> Date: Thu, 12 May 2011 14:37:41 -0400
>
>> On Thu, May 12, 2011 at 02:06:30PM -0400, David Miller wrote:
>> > I hate this too, and I think it's absolutely ridiculous.
>> >
>> > Also, like you, I lost an entire afternoon trying to figure out why
>> > this started happening.
>> >
>> > I wish we could revert this change.
>>
>> At least it can be permanently disabled..
>>
>> echo kernel.kptr_restrict = 0 >> /etc/sysctl.conf
>
> Regardless, what to do about all of the "perf is broken" reports?

Let's revert commit 9f36e2c448007b54851e7e4fa48da97d1477a175
("printk: use %pK for /proc/kallsyms and /proc/modules"), please! I
too have been wondering what's going on with perf reporting insane
symbols, and this should definitely not be enabled by default.

Pekka

2011-05-12 20:25:10

by Alexey Dobriyan

Subject: Re: [BUG] perf: bogus correlation of kernel symbols

On Thu, May 12, 2011 at 03:01:32PM -0400, David Miller wrote:
> From: Dave Jones <[email protected]>
> Date: Thu, 12 May 2011 14:37:41 -0400
>
> > On Thu, May 12, 2011 at 02:06:30PM -0400, David Miller wrote:
> > > I hate this too, and I think it's absolutely ridiculous.
> > >
> > > Also, like you, I lost an entire afternoon trying to figure out why
> > > this started happening.
> > >
> > > I wish we could revert this change.
> >
> > At least it can be permanently disabled..
> >
> > echo kernel.kptr_restrict = 0 >> /etc/sysctl.conf
>
> Regardless, what to do about all of the "perf is broken" reports?

The problem is that they turned it on by default.

int kptr_restrict = 1;

2011-05-12 20:32:33

by Linus Torvalds

Subject: Re: [BUG] perf: bogus correlation of kernel symbols

On Thu, May 12, 2011 at 7:48 AM, Stephane Eranian <[email protected]> wrote:
>
> I think there is a serious problem with kernel symbol correlation
> with the latest perf in 2.6.39-rc7-tip.

Yeah. It's annoying. It's a "perf" bug, though - triggered by
/proc/sys/kernel/kptr_restrict being set to 1.

The bug is that perf doesn't say "I can't match kernel symbols", but
instead does some crazy matching and gives total crap module
information (I think it just picks the one that shows up last in
/proc/kallsyms).

That said, I have considered just reverting the thing that makes
kptr_restrict be 1 by default. I do like the security implications of
restricting visibility into kernel pointers, but I also think that
security rules that make the system less usable are dubious. So I
dunno.

Linus
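
To illustrate why zeroed addresses could make everything land in one module, here is a small Python sketch of a nearest-symbol lookup. It is not perf's actual algorithm; the symbol names, start addresses, and the `resolve` helper are hypothetical, and it only shows how a "last entry wins" result falls out of a sorted table in which every start address is 0.

```python
import bisect


def resolve(symbols, addr):
    """Map addr to (name, dso) of the nearest symbol starting at or below it.

    symbols is a list of (start_address, name, dso) tuples in file order.
    """
    # sorted() is stable, so entries with equal addresses keep file order.
    table = sorted(symbols, key=lambda s: s[0])
    starts = [s[0] for s in table]
    i = bisect.bisect_right(starts, addr) - 1
    if i < 0:
        return None  # addr is below every known symbol
    _, name, dso = table[i]
    return name, dso


# Hypothetical table with real-looking addresses (values invented for the example).
visible = [
    (0xFFFFFFFF81000000, "core_func", "[kernel.kallsyms]"),
    (0xFFFFFFFFA0000000, "module_func", "[binfmt_misc]"),
]
# The same table as an unprivileged reader might see it: every address is 0.
hidden = [(0, name, dso) for _, name, dso in visible]

sample_ip = 0xFFFFFFFF8107C1D8

if __name__ == "__main__":
    print(resolve(visible, sample_ip))
    print(resolve(hidden, sample_ip))
```

With the visible table the sample resolves to the core kernel entry; with the zeroed table every sample resolves to whichever entry comes last, here the module.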

2011-05-12 20:44:07

by David Miller

Subject: Re: [BUG] perf: bogus correlation of kernel symbols

From: Linus Torvalds <[email protected]>
Date: Thu, 12 May 2011 13:31:37 -0700

> That said, I have considered just reverting the thing that makes
> kptr_restrict be 1 by default. I do like the security implications of
> restricting visibility into kernel pointers, but I also think that
> security rules that make the system less usable are dubious. So I
> dunno.

We don't have any firewalling or SELINUX rules installed by default,
even if those features are enabled in the kernel. Userspace asks for
it.

Many people would claim that use of such things is "essential" these
days.

I don't see a good reason to handle kptr_restrict any differently.
