Message-ID: <1349276431.650.156.camel@zakaz.uk.xensource.com>
Subject: Re: [Xen-devel] [PATCH] xen: point xen_start_info to a dummy struct for PV on HVM guests
From: Ian Campbell
To: Konrad Rzeszutek Wilk
CC: Stefano Stabellini, "xen-devel@lists.xensource.com", "linux-kernel@vger.kernel.org"
Date: Wed, 3 Oct 2012 16:00:31 +0100
In-Reply-To: <20121003141116.GA10633@phenom.dumpdata.com>
References: <1349272182.650.150.camel@zakaz.uk.xensource.com> <1349272482.650.151.camel@zakaz.uk.xensource.com> <20121003141116.GA10633@phenom.dumpdata.com>
Organization: Citrix Systems, Inc.

On Wed, 2012-10-03 at 15:11 +0100, Konrad Rzeszutek Wilk wrote:
> On Wed, Oct 03, 2012 at 02:54:42PM +0100, Ian Campbell wrote:
> > On Wed, 2012-10-03 at 14:51 +0100, Stefano Stabellini wrote:
> > > On Wed, 3 Oct 2012, Ian Campbell wrote:
> > > > On Wed, 2012-10-03 at 14:37 +0100, Stefano Stabellini wrote:
> > > > > PV on HVM guests don't have a start_info page mapped by Xen, so
> > > > > xen_start_info is just NULL for them.
> > > > > That is a problem because other parts of the code expect xen_start_info to
> > > > > point to something valid, for example xen_initial_domain() is defined as
> > > > > follows:
> > > > >
> > > > > #define xen_initial_domain() (xen_domain() && \
> > > > >         xen_start_info->flags & SIF_INITDOMAIN)
> > > >
> > > > But anyone who calls this before xen_start_info is set up is going to get
> > > > a bogus result, specifically in this case they will think they are domU
> > > > when in reality they are dom0 -- wouldn't it be better to fix those
> > > > callsites?
> > >
> > > That cannot be the case because setting up xen_start_info is the very
> > > first thing that is done, before even calling to C.
> >
> > On PV, yes, but you are trying to fix PVHVM here, no?
> >
> > Otherwise if this is always set before calling into C then what is the
> > purpose of this patch?
>
> to fix this - as PVHVM has it set to NULL and we end up de-referencing
> the xen_start_info and crashing. As so::
>

Right, so returning to my original point: the caller here is calling
xen_initial_domain() *before* start info is set up. This is bogus and is
your actual bug; all this patch does is hide that real issue.

With this "fix", the caller of xen_initial_domain() shown in this trace
now gets a rubbish result based on the content of a dummy shared info
instead of the real answer from the actual shared info.

The right fix is to fix the caller so that it does not call
xen_initial_domain() until after the shared info has been set up. Maybe
that means moving shinfo setup earlier, or maybe it means deferring this
call until later in the PVHVM case.

>
> Decompressing Linux... Parsing ELF... done.
> Booting the kernel.
> [    0.000000] Initializing cgroup subsys cpuset
> [    0.000000] Initializing cgroup subsys cpu
> [    0.000000] Linux version 3.6.0upstream-04121-g0313983 (konrad@build.dumpdata.com) (gcc version 4.4.4 20100503 (Red Hat 4.4.4-2) (GCC) ) #1 SMP Tue Oct 2 16:31:21 EDT 2012
> [    0.000000] Command line: initrd=initramf.gz console=ttyS0,115200 test=net nofb earlyprintk=serial,ttyS0,115200 BOOT_IMAGE=vmlinuz
> [    0.000000] e820: BIOS-provided physical RAM map:
> [    0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009dfff] usable
> [    0.000000] BIOS-e820: [mem 0x000000000009e000-0x000000000009ffff] reserved
> [    0.000000] BIOS-e820: [mem 0x00000000000e0000-0x00000000000fffff] reserved
> [    0.000000] BIOS-e820: [mem 0x0000000000100000-0x000000007fffffff] usable
> [    0.000000] BIOS-e820: [mem 0x00000000fc000000-0x00000000ffffffff] reserved
> [    0.000000] bootconsole [earlyser0] enabled
> [    0.000000] NX (Execute Disable) protection: active
> [    0.000000] DMI 2.4 present.
> [    0.000000] Hypervisor detected: Xen HVM
> [    0.000000] Xen version 4.1.
> [    0.000000] Netfront and the Xen platform PCI driver have been compiled for this kernel: unplug emulated NICs.
> [    0.000000] Blkfront and the Xen platform PCI driver have been compiled for this kernel: unplug emulated disks.
> [    0.000000] You might have to change the root device
> [    0.000000] from /dev/hd[a-d] to /dev/xvd[a-d]
> [    0.000000] in your root= kernel command line option
> [    0.000000] No AGP bridge found
> [    0.000000] e820: last_pfn = 0x80000 max_arch_pfn = 0x400000000
> [    0.000000] x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
> [    0.000000] found SMP MP-table at [mem 0x000fbc90-0x000fbc9f] mapped at [ffff8800000fbc90]
> [    0.000000] init_memory_mapping: [mem 0x00000000-0x7fffffff]
> [    0.000000] RAMDISK: [mem 0x7abeb000-0x7ffdefff]
> [    0.000000] ACPI: RSDP 00000000000ea020 00024 (v02 Xen)
> [    0.000000] ACPI: XSDT 00000000fc00f2b0 00034 (v01 Xen HVM 00000000 HVML 00000000)
> [    0.000000] ACPI: FACP 00000000fc00f0d0 000F4 (v04 Xen HVM 00000000 HVML 00000000)
> [    0.000000] ACPI: DSDT 00000000fc003440 0BC09 (v02 Xen HVM 00000000 INTL 20100528)
> [    0.000000] ACPI: FACS 00000000fc003400 00040
> [    0.000000] ACPI: APIC 00000000fc00f1d0 000D8 (v02 Xen HVM 00000000 HVML 00000000)
> [    0.000000] No NUMA configuration found
> [    0.000000] Faking a node at [mem 0x0000000000000000-0x000000007fffffff]
> [    0.000000] Initmem setup node 0 [mem 0x00000000-0x7fffffff]
> [    0.000000] NODE_DATA [mem 0x7fffc000-0x7fffffff]
> [    0.000000] Zone ranges:
> [    0.000000] DMA [mem 0x00010000-0x00ffffff]
> [    0.000000] DMA32 [mem 0x01000000-0xffffffff]
> [    0.000000] Normal empty
> [    0.000000] Movable zone start for each node
> [    0.000000] Early memory node ranges
> [    0.000000] node 0: [mem 0x00010000-0x0009dfff]
> [    0.000000] node 0: [mem 0x00100000-0x7fffffff]
> [    0.000000] ACPI: PM-Timer IO Port: 0xb008
> [    0.000000] ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x02] enabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x04] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x06] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x08] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x05] lapic_id[0x0a] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x06] lapic_id[0x0c] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x07] lapic_id[0x0e] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x08] lapic_id[0x10] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x09] lapic_id[0x12] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0a] lapic_id[0x14] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0b] lapic_id[0x16] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0c] lapic_id[0x18] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0d] lapic_id[0x1a] disabled)
> [    0.000000] ACPI: LAPIC (acpi_id[0x0e] lapic_id[0x1c] disabled)
> [    0.000000] ACPI: IOAPIC (id[0x01] address[0xfec00000] gsi_base[0])
> [    0.000000] IOAPIC[0]: apic_id 1, version 17, address 0xfec00000, GSI 0-47
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 5 global_irq 5 low level)
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 10 global_irq 10 low level)
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 11 global_irq 11 low level)
> [    0.000000] Using ACPI (MADT) for SMP configuration information
> [    0.000000] smpboot: Allowing 15 CPUs, 13 hotplug CPUs
> [    0.000000] PM: Registered nosave memory: 000000000009e000 - 00000000000a0000
> [    0.000000] PM: Registered nosave memory: 00000000000a0000 - 00000000000e0000
> [    0.000000] PM: Registered nosave memory: 00000000000e0000 - 0000000000100000
> [    0.000000] e820: [mem 0x80000000-0xfbffffff] available for PCI devices
> [    0.000000] Booting paravirtualized kernel on Xen HVM
> [    0.000000] setup_percpu: NR_CPUS:512 nr_cpumask_bits:512 nr_cpu_ids:15 nr_node_ids:1
> [    0.000000] PERCPU: Embedded 28 pages/cpu @ffff88007a800000 s84352 r8192 d22144 u131072
> [    0.000000] Built 1 zonelists in Node order, mobility grouping on. Total pages: 517000
> [    0.000000] Policy zone: DMA32
> [    0.000000] Kernel command line: initrd=initramf.gz console=ttyS0,115200 test=net nofb earlyprintk=serial,ttyS0,115200 BOOT_IMAGE=vmlinuz
> [    0.000000] PID hash table entries: 4096 (order: 3, 32768 bytes)
> [    0.000000] __ex_table already sorted, skipping sort
> [    0.000000] Checking aperture...
> [    0.000000] No AGP bridge found
> [    0.000000] Memory: 1967336k/2097152k available (6368k kernel code, 456k absent, 129360k reserved, 4525k data, 752k init)
> [    0.000000] Hierarchical RCU implementation.
> [    0.000000] RCU restricting CPUs from NR_CPUS=512 to nr_cpu_ids=15.
> [    0.000000] NR_IRQS:33024 nr_irqs:1208 16
> [    0.000000] Xen HVM callback vector for event delivery is enabled
> [    0.000000] Console: colour VGA+ 80x25
> [    0.000000] BUG: unable to handle kernel NULL pointer dereference at 0000000000000030
> [    0.000000] IP: [] xen_cons_init+0x1e/0x60
> [    0.000000] PGD 0
> [    0.000000] Oops: 0000 [#1] SMP
> [    0.000000] Modules linked in:
> [    0.000000] CPU 0
> [    0.000000] Pid: 0, comm: swapper/0 Not tainted 3.6.0upstream-04121-g0313983 #1 Xen HVM domU
> [    0.000000] RIP: 0010:[] [] xen_cons_init+0x1e/0x60
> [    0.000000] RSP: 0000:ffffffff81a01ef8 EFLAGS: 00010202
> [    0.000000] RAX: 0000000000000000 RBX: ffffffff81b3be60 RCX: 0000000000000002
> [    0.000000] RDX: ffffffff81a59c40 RSI: ffffffff81a59b01 RDI: ffffffff81ba7e81
> [    0.000000] RBP: ffffffff81a01ef8 R08: 00000000000003fd R09: 0000000000000020
> [    0.000000] R10: 0000000000000000 R11: 000000000000000d R12: ffffffff81b008e0
> [    0.000000] R13: ffffffff81b092e0 R14: 0000000000000000 R15: 0000000000026bf0
> [    0.000000] FS: 0000000000000000(0000) GS:ffff88007a800000(0000) knlGS:0000000000000000
> [    0.000000] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> [    0.000000] CR2: 0000000000000030 CR3: 0000000001a0b000 CR4: 00000000000006b0
> [    0.000000] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> [    0.000000] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> [    0.000000] Process swapper/0 (pid: 0, threadinfo ffffffff81a00000, task ffffffff81a13420)
> [    0.000000] Stack:
> [    0.000000] ffffffff81a01f18 ffffffff81aeb9fb ffffffff81b008e0 ffffffffffffffff
> [    0.000000] ffffffff81a01f68 ffffffff81abac39 ffffffff81aba80d 0000000000026bf0
> [    0.000000] ffffffff81a01f58 ffffffff81b092e0 0000000001000000 0000000001c72000
> [    0.000000] Call Trace:
> [    0.000000] [] console_init+0x19/0x2a
> [    0.000000] [] start_kernel+0x24a/0x3a3
> [    0.000000] [] ? kernel_init+0x1e8/0x1e8
> [    0.000000] [] x86_64_start_reservations+0x131/0x136
> [    0.000000] [] x86_64_start_kernel+0x103/0x112
> [    0.000000] Code: 66 66 66 66 2e 0f 1f 84 00 00 00 00 00 8b 0d 5a 2e 7c 00 55 31 c0 48 89 e5 85 c9 74 37 48 8b 05 51 2e 7c 00 48 c7 c2 40 9c a5 81 40 30 02 75 15 83 f9 02 74 27 e8 52 fc ff ff 85 c0 78 15 48
> [    0.000000] RIP [] xen_cons_init+0x1e/0x60
> [    0.000000] RSP
> [    0.000000] CR2: 0000000000000030
> [    0.000000] ---[ end trace 5cb378039a20e088 ]---
> [    0.000000] Kernel panic - not syncing: Attempted to kill the idle task!
>
> > > > Perhaps turn this into a static inline with a BUG_ON(!xen_start_info) to
> > > > make catching these cases easier?
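
For illustration only, the "static inline with a BUG_ON" idea from the
quoted text might look roughly like the user-space sketch below. It is
not kernel code and not the patch under discussion: the struct is a
simplified stand-in for the leading fields of struct start_info, and the
names start_info_sketch, start_info_ptr, xen_domain_sketch() and
xen_initial_domain_checked() are hypothetical. assert() stands in for
BUG_ON(). Under the assumed 64-bit layout, flags sits at offset 0x30,
which would be consistent with the faulting address (CR2) in the oops
above if the pointer was NULL.

```c
/* Illustrative user-space sketch; all *_sketch names are hypothetical. */
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Bit value matching SIF_INITDOMAIN in Xen's public interface header. */
#define SIF_INITDOMAIN (1u << 1)

/*
 * Simplified stand-in for the first fields of struct start_info.
 * The two uint64_t members model "unsigned long" on a 64-bit kernel.
 */
struct start_info_sketch {
	char     magic[32];
	uint64_t nr_pages;
	uint64_t shared_info;
	uint32_t flags;
};

/* Stays NULL until some setup code points it at a real structure. */
static struct start_info_sketch *start_info_ptr;

/* Stand-in for xen_domain(); always true in this sketch. */
static int xen_domain_sketch(void)
{
	return 1;
}

/*
 * Checked variant of the macro: fail loudly on a NULL pointer instead
 * of dereferencing it (assert() models BUG_ON(!xen_start_info)).
 */
static inline int xen_initial_domain_checked(void)
{
	assert(start_info_ptr != NULL);
	return xen_domain_sketch() &&
	       (start_info_ptr->flags & SIF_INITDOMAIN) != 0;
}
```

A caller would first point start_info_ptr at a populated structure and
then call xen_initial_domain_checked(); calling it earlier trips the
assertion at the call site instead of faulting on the flags load, which
is the "easier to catch" property the suggestion is after.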