Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1754617Ab1FJKBo (ORCPT ); Fri, 10 Jun 2011 06:01:44 -0400
Received: from smtp.ctxuk.citrix.com ([62.200.22.115]:39897 "EHLO
        SMTP.EU.CITRIX.COM" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
        with ESMTP id S1750769Ab1FJKBl (ORCPT );
        Fri, 10 Jun 2011 06:01:41 -0400
X-IronPort-AV: E=Sophos;i="4.65,345,1304294400"; d="scan'208";a="6208492"
Date: Fri, 10 Jun 2011 11:01:39 +0100
From: Tim Deegan
To: Igor Mammedov
CC: , Keir Fraser , Stefano Stabellini ,
        "containers@lists.linux-foundation.org" , Li Zefan ,
        "linux-kernel@vger.kernel.org" , Michal Hocko ,
        "linux-mm@kvack.org" , Keir Fraser ,
        "akpm@linux-foundation.org" , "balbir@linux.vnet.ibm.com" ,
        Paul Menage , KAMEZAWA Hiroyuki , Hiroyuki Kamezawa
Subject: Re: [Xen-devel] Possible shadow bug
Message-ID: <20110610100139.GG5098@whitby.uk.xensource.com>
References: <4DE66BEB.7040502@redhat.com> <4DE8D50F.1090406@redhat.com>
        <4DEE26E7.2060201@redhat.com>
        <20110608123527.479e6991.kamezawa.hiroyu@jp.fujitsu.com>
        <4DF0801F.9050908@redhat.com>
        <20110609150133.GF5098@whitby.uk.xensource.com>
        <4DF0F90D.4010900@redhat.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Disposition: inline
In-Reply-To: <4DF0F90D.4010900@redhat.com>
User-Agent: Mutt/1.5.21 (2010-09-15)
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org
Content-Length: 2054
Lines: 48

Hi,

At 18:47 +0200 on 09 Jun (1307645229), Igor Mammedov wrote:
> It's rhel5.6 xen. I've tried to test on SLES 11 that has 4.0.1 xen, however
> wasn't able to reproduce problem. (I'm not sure if hap was turned
> off in this case). More detailed info can be found at RHBZ#700565

The best way to be sure whether HAP is in use is to connect to the serial
line, hit ^A^A^A to switch input to Xen, and hit 'q' to dump per-domain state.
The printout for the guest domain should either say "paging assistance:
shadow refcounts translate external" or "paging assistance: hap
refcounts translate external".

(If you don't have serial you can get the same info by running "xm
debug-keys q" and then "xm dmesg" to read the output.)

> >you're willing to try recompiling Xen with some small patches that
> >disable the "cleverer" parts of the shadow pagetable code that might
> >indicate something. (Of course, it might just change the timing to
> >obscure a real linux bug too.)
> >
> Haven't got to this part yet. But looks like it's the only option left.

Actually, looking at the disassembly you posted, it looks more like it
might be an emulator bug in Xen; if Xen finds itself emulating the IMUL
instruction and either gets the logic wrong or does the memory access
wrong, it could cause that failure. And one reason that Xen emulates
instructions is if the memory operand is on a pagetable that's shadowed
(which might be a page that was recently a pagetable).

ISTR that even though the RHEL xen reports a 3.0.x version it has quite
a lot of backports in it. Does it have this patch?
http://hg.uk.xensource.com/xen-3.1-testing.hg/rev/e8fca4c42d05

Cheers,

Tim.

-- 
Tim Deegan
Principal Software Engineer, Xen Platform Team
Citrix Systems UK Ltd. (Company #02937203, SL9 0BG)
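
For anyone who would rather script the HAP check than read the dump by eye, here is a rough sketch. It only looks for the two "paging assistance:" strings quoted above. The function names are made up for illustration, the exact layout of the rest of the 'q' dump is not assumed, and read_xen_dump() assumes Python 3.7 or later is available in dom0 (on an older dom0, run the two xm commands by hand and pass the saved output to paging_modes()).

```python
import re
import subprocess

# Matches the "paging assistance:" line described in the message; the words
# after the colon are treated as a whitespace-separated list of flags.
_PAGING_RE = re.compile(r"paging assistance:\s*(.*)")


def paging_modes(dump_text):
    """Return 'hap', 'shadow' or 'unknown' for each 'paging assistance:' line.

    Hypothetical helper: it relies only on the two strings quoted in the
    message, not on any other detail of the debug-key output.
    """
    modes = []
    for line in dump_text.splitlines():
        match = _PAGING_RE.search(line)
        if not match:
            continue
        flags = match.group(1).split()
        if "hap" in flags:
            modes.append("hap")
        elif "shadow" in flags:
            modes.append("shadow")
        else:
            modes.append("unknown")
    return modes


def read_xen_dump():
    """Run the two commands from the message and return the console log.

    Needs to run in dom0 with enough privilege to use xm.
    """
    subprocess.run(["xm", "debug-keys", "q"], check=True)
    result = subprocess.run(
        ["xm", "dmesg"], check=True, capture_output=True, text=True
    )
    return result.stdout
```

Calling paging_modes(read_xen_dump()) in dom0 should give one entry per domain that prints such a line. The dump may still contain output from earlier 'q' presses, so the last entries are the ones to trust.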