Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754459AbZCLDVX (ORCPT ); Wed, 11 Mar 2009 23:21:23 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1750951AbZCLDVO (ORCPT ); Wed, 11 Mar 2009 23:21:14 -0400 Received: from mga03.intel.com ([143.182.124.21]:23562 "EHLO mga03.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750742AbZCLDVN convert rfc822-to-8bit (ORCPT ); Wed, 11 Mar 2009 23:21:13 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.38,347,1233561600"; d="scan'208";a="119392797" From: "Pallipadi, Venkatesh" To: "Pallipadi, Venkatesh" , Frans Pop CC: "mingo@elte.hu" , "thellstrom@vmware.com" , Linux kernel mailing list , "Siddha, Suresh B" , Nick Piggin , "ebiederm@xmission.com" Date: Wed, 11 Mar 2009 20:22:47 -0700 Subject: RE: [PATCH] VM, x86, PAT: Change implementation of is_linear_pfn_mapping Thread-Topic: [PATCH] VM, x86, PAT: Change implementation of is_linear_pfn_mapping Thread-Index: AcmiwdDCZO3unPFQQDqqTSq194F6bQ== Message-ID: <7E82351C108FA840AB1866AC776AEC464FF628B8@orsmsx505.amr.corp.intel.com> References: <498B5ADE.3090602@vmware.com> <498C062C.201@vmware.com> <20090304060857.GA18318@linux-os.sc.intel.com> <130CA3A191875048A0624FB523A55EC7075DA7CA@PA-EXMBX51.vmware.com> <20090310013953.GA11312@linux-os.sc.intel.com> <49B6232B.4050503@vmware.com> <1236706959.4529.56.camel@localhost.localdomain> <49B781B6.3060704@vmware.com> <20090311093338.GC12814@elte.hu> <20090311093338.GC12814@elte.hu> <20090311175452.GA1702@linux-os.sc.intel.com> <200903112309.40145.elendil@planet.nl> <1236817912.4529.76.camel@localhost.localdomain> In-Reply-To: <1236817912.4529.76.camel@localhost.localdomain> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 8BIT MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3860 Lines: 98 >-----Original Message----- >From: Pallipadi, Venkatesh [mailto:venkatesh.pallipadi@intel.com] >Sent: Wednesday, March 11, 2009 5:32 PM >To: Frans Pop >Cc: mingo@elte.hu; thellstrom@vmware.com; Linux kernel mailing >list; Siddha, Suresh B; Nick Piggin; ebiederm@xmission.com >Subject: Re: [PATCH] VM, x86, PAT: Change implementation of >is_linear_pfn_mapping > >On Wed, 2009-03-11 at 15:09 -0700, Frans Pop wrote: >> Pallipadi, Venkatesh wrote: >> > Use of vma->vm_pgoff to identify the pfnmaps that are fully >> > mapped at mmap time is broke. vm_pgoff is set by generic mmap >> > code even for cases where drivers are setting up the mappings >> > at the fault time. >> > >> > The problem was originally reported here. >> > http://marc.info/?l=linux-kernel&m=123383810628583&w=2 >> > >> > Change is_linear_pfn_mapping logic to overload VM_NONLINEAR >> > flag along with VM_PFNMAP to mean full PFNMAP setup at mmap >> > time. >> > >> > Acked-by: Thomas Hellstrom >> > Signed-off-by: Venkatesh Pallipadi >> > Signed-off-by: Suresh Siddha @intel.com> >> >> I've applied this patch on top of v2.6.29-rc7-143-g99adcd9 >[1] and since >> then I've had my system, or rather X/KDE, hang several >times. The last >> time the problem seems to have been KDE's kicker. I was >running a kernel >> compile in a konsole window and that just continued and >finished, but the >> keyboard was completely dead. >> I could still ssh in from another box. 'ps' would show the >top processes, >> but hang as well at some point (in the middle of listing KDE >processes. >> >> The hang was with pat enabled. I've now booted with nopat. > >Frans, > >Thanks for testing this. I don't seem to reproduce this on any of my >test systems with this patch on either tip or latest git. Do >you see the >hang on every boot or once in a while? Are things stable with nopat? > >> The log shows (full log attached): >> kernel: BUG: unable to handle kernel NULL pointer dereference at >> 0000000000000008 >> kernel: IP: [] prio_tree_remove+0x9c/0xcc >> kernel: PGD 7cab1067 PUD 7d644067 PMD 0 >> kernel: Oops: 0000 [#1] SMP >> kernel: last sysfs file: /sys/class/power_supply/C23D/charge_full >> kernel: CPU 1 >> kernel: Pid: 5415, comm: kicker Not tainted 2.6.29-rc7 #4 HP Comp >> aq 2510p Notebook PC >> kernel: RIP: 0010:[] [] prio >> _tree_remove+0x9c/0xcc >> [...] >> kernel: Call Trace: >> kernel: [] prio_tree_insert+0xab/0x22a >> kernel: [] vma_prio_tree_insert+0x23/0xc2 >> kernel: [] __vma_link_file+0x70/0x72 >> kernel: [] vma_link+0x7d/0xab >> kernel: [] mmap_region+0x313/0x479 >> kernel: [] do_mmap_pgoff+0x2f6/0x35c >> kernel: [] do_shmat+0x28a/0x36c >> kernel: [] sys_shmat+0x11/0x1c >> kernel: [] system_call_fastpath+0x16/0x1b >> >> From the symptoms I strongly suspect this patch to be the culprit. >> >> [1] Together with some other patches (mainly Rafael's latest >patchset >> for "Rework disabling of interrupts during suspend-resume"), >but I doubt >> any of those are related to this issue. >> > >Nothing obvious strikes me with this patch and above OOPs. Can you Thinking about it a bit more, the usage of VM_NONLINEAR flag in this patch may be conflicting with some expectation in mm code, that may be resulting in above oops. Let me spend some more time on this and get back to you. Thanks, Venki-- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/