Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751875AbZA1RNp (ORCPT ); Wed, 28 Jan 2009 12:13:45 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1750951AbZA1RNg (ORCPT ); Wed, 28 Jan 2009 12:13:36 -0500 Received: from tomts5-srv.bellnexxia.net ([209.226.175.25]:56039 "EHLO tomts5-srv.bellnexxia.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750899AbZA1RNf (ORCPT ); Wed, 28 Jan 2009 12:13:35 -0500 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AuwEAC0dgElMQWt2/2dsb2JhbACBbsszhUcG Date: Wed, 28 Jan 2009 12:13:31 -0500 From: Mathieu Desnoyers To: Masami Hiramatsu Cc: Nick Piggin , LKML , Ananth N Mavinakayanahalli , Jim Keniston , systemtap-ml , "Frank Ch. Eigler" Subject: Re: [BUG][kprobes][vunmap?]: kprobes may cause memory corruption Message-ID: <20090128171331.GA9006@Krystal> References: <497FC3B1.7050805@redhat.com> <497FE895.1080708@redhat.com> <20090128154824.GA6025@Krystal> <49808EEF.1020700@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Content-Disposition: inline In-Reply-To: <49808EEF.1020700@redhat.com> X-Editor: vi X-Info: http://krystal.dyndns.org:8080 X-Operating-System: Linux/2.6.21.3-grsec (i686) X-Uptime: 12:06:36 up 27 days, 17:04, 3 users, load average: 1.93, 1.61, 1.34 User-Agent: Mutt/1.5.16 (2007-06-11) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3925 Lines: 142 * Masami Hiramatsu (mhiramat@redhat.com) wrote: > Mathieu Desnoyers wrote: > > * Masami Hiramatsu (mhiramat@redhat.com) wrote: > >> Masami Hiramatsu wrote: > > Hi Masami, > > > > This would not surprise me if it came from bug in the new vmap() > > implementation done in this commit : > > > > http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=db64fe02258f1507e13fe5212a989922323685ce > > > > Especially because going from vmap -> vm_map_ram makes this behavior > > disappear. > > > > Looking at the commit, I notice that it delays vunmap so it's done in > > batch to minimize locking effect. I think it would be good to create a > > test case to try to isolate this, without any kprobes/text_poke > > involved, which does something like this : > > > > load module (this is also doing vmalloc, so it might be part of the > > problem) > > for i (i=0; i < 400; i++) { > > vmap() > > vfree() > ^^^^^ vunmap? yep. > > } > > unload module > > > > Another interesting test would be : > > > > for i (i=0; i < 400; i++) { > > vmalloc() > > vfree() > > } > > Hi Mathieu, > > Thank you for test ideas. > I made both of above two tests and run it. Both test modules > do NOT cause memory corruption... > OK > > All this called in a loop. This would help isolating the "vmap" part of > > the issue. If this test is not enough, then we should maybe try > > something like this in a kernel module (which does what text_poke does > > with vmalloc, more or less) in a loop : > > > > char somedata[PAGE_SIZE] __attribute__((aligned(PAGE_SIZE))); > > char copydata[PAGE_SIZE] __attribute__((aligned(PAGE_SIZE))); > > Should both of them have PAGE_SIZE*2? > Yes. > > > > void test_vmap(void) > > } > > struct page *pages[2]; > > char *vaddr; > > int i; > > > > for (i = 0; i < 2 * PAGE_SIZE; i++) > > copydata[i] = somedata[i]; > > page[0] = virt_to_page(&somedata); > > BUG_ON(!page[0]); > > page[1] = virt_to_page(&somedata + PAGE_SIZE); > > BUG_ON(!page[1]); > > vaddr = vmap(pages, 2, VM_MAP, PAGE_KERNEL); > > BUG_ON(!vaddr); > > > > for (i = 0; i < 2 * PAGE_SIZE; i++) > > vaddr[i] = copydata[i] + 1; > > > > vunmap(vaddr); > > > > for (i = 0; i < 2 * PAGE_SIZE; i++) > > BUG_ON(somedata[i] != copydata[i] + 1); > > } > > Hmm, when I ran above code, it hit the last BUG_ON(). > I checked that somedata[i] didn't updated. > Do you hit the BUG_ON after the first loop ? > > Given you don't seem to have hit the > > for (i = 0; i < len; i++) > > BUG_ON(((char *)addr)[i] != ((char *)opcode)[i]); > > test at the end of text_poke, > > However, when I ran kprobe-based test, it doesn't hit the BUG_ON() > in text_poke(). > The variable declarations should have been 2*PAGE_SIZE, hopefully you fixed them. There is also a sync_core() in text_poke. It should not matter, but maybe that could help ? > > I suspect the write through the vmapped > > area is correctly done, but that the problem may lay in the mm layer. > > Maybe it's running out of pre-allocated vmap areas or something like > > this ? > > I haven't seen vmalloc failure message on 2.6.29-rc2. > It could be because the available vmalloc space is slightly higher. Looking into the lazy vunmap threshold would be useful. You could also try with loop values higher than 400. Mathieu > Thank you again, > > > -- > Masami Hiramatsu > > Software Engineer > Hitachi Computer Products (America) Inc. > Software Solutions Division > > e-mail: mhiramat@redhat.com > -- Mathieu Desnoyers OpenPGP key fingerprint: 8CD5 52C3 8E3C 4140 715F BA06 3F25 A8FE 3BAE 9A68 -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/