Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp3918144yba; Wed, 17 Apr 2019 00:11:23 -0700 (PDT) X-Google-Smtp-Source: APXvYqwFq7dtJzlh9XV2Ridev7+tM9kkxpLV3fsEAFgGtQqpLbGiFXbusoNxnyT6A3HcNNNzhzH3 X-Received: by 2002:a65:4689:: with SMTP id h9mr82772896pgr.295.1555485083476; Wed, 17 Apr 2019 00:11:23 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1555485083; cv=none; d=google.com; s=arc-20160816; b=KYXrl9zO959qEhosLpX7flNuu09OXUZAYJlXl2dsU9nWWT5nQFEELLSqBdxJOzGLGc ecS9QSCwgClUY5VGvfZznkXBw5ZyjMMPRC3LD6zaunuNtQy/EOThpcmNJ8zi8sSn6F1W H+Y11b2dEBa56KdzqpVXmKVFKbfrL8wPAl2YsnlYxjwuCFej3JxsazB4Hw1e4eAVm1Tv LYzkWooT699i9Xilf9Utm66DsO0h8vhydHTQ2HaQsDBT005TXXgHpKAnUgdczRII9zxs v+ynR9ALuAvlLAvEg9EZoTAmbtSihz4/0Q2m+ivZmgMfb07CHtTpBWZXNNC1IxEhmGnC 3xmw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=RJNGrd6S3r2nH06fJUrTB4M9SMfRvDIV4IBq8GfjyPI=; b=d/HcptrAUuCUa/DO2TO/s3fWhiBA1fqjZ6OZFFizOQ8s8NkIw0w898gUd3kTE/+3BK 38eHJmT8KLP/pxNB53sXwevfhxGJnGVV3d74cAT3uBDvZtrQ6IOQSd3XrdU3tpa1a7qA ghCBTh0kGccSIvNlX9n2qmenWTV4nzN7Pa+9Xu5z1JXEDgtdMZPXis92HQRCSUht27Pb MK7FQjAyFhH9aqUu1mOGncdiXQF84y8KWdkjzFSMThtWHxkH9/eb+ssSa4gp/YXfddVM 77SY04YvgJMyjDQibqPVkOrApjvmlBNdH3GGxiDKfM2oJOMBRY1/iLwzVGt5AhV7Yd88 MI9Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id b9si43438111pfd.79.2019.04.17.00.11.08; Wed, 17 Apr 2019 00:11:23 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1731141AbfDQHI2 (ORCPT + 99 others); Wed, 17 Apr 2019 03:08:28 -0400 Received: from mx1.redhat.com ([209.132.183.28]:35472 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728261AbfDQHI2 (ORCPT ); Wed, 17 Apr 2019 03:08:28 -0400 Received: from smtp.corp.redhat.com (int-mx07.intmail.prod.int.phx2.redhat.com [10.5.11.22]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id AA8BD3199362; Wed, 17 Apr 2019 07:08:27 +0000 (UTC) Received: from dhcp-128-65.nay.redhat.com (ovpn-12-44.pek2.redhat.com [10.72.12.44]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 738C61001DF3; Wed, 17 Apr 2019 07:08:22 +0000 (UTC) Date: Wed, 17 Apr 2019 15:08:18 +0800 From: Dave Young To: Kairui Song Cc: Borislav Petkov , Junichi Nomura , Chao Fan , Baoquan He , "x86@kernel.org" , "kexec@lists.infradead.org" , "linux-kernel@vger.kernel.org" Subject: Re: [PATCH] x86/boot: Use efi_setup_data for searching RSDP on kexec-ed kernels Message-ID: <20190417070818.GE8411@dhcp-128-65.nay.redhat.com> References: <20190415090717.GA29317@zn.tnic> <20190415102525.GB29317@zn.tnic> <23309b73-d135-a207-564b-6003cee39184@ce.jp.nec.com> <20190416094024.GE27892@zn.tnic> <20190416095209.GG27892@zn.tnic> <20190416114133.GA7541@dhcp-128-65.nay.redhat.com> <20190416132253.GE31772@zn.tnic> <20190417013838.GA8411@dhcp-128-65.nay.redhat.com> <20190417045447.GB8411@dhcp-128-65.nay.redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.11.3 (2019-02-01) X-Scanned-By: MIMEDefang 2.84 on 10.5.11.22 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.41]); Wed, 17 Apr 2019 07:08:27 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 04/17/19 at 02:00pm, Kairui Song wrote: > On Wed, Apr 17, 2019 at 12:57 PM Dave Young wrote: > > > > On 04/17/19 at 09:38am, Dave Young wrote: > > > On 04/16/19 at 03:22pm, Borislav Petkov wrote: > > > > On Tue, Apr 16, 2019 at 07:41:33PM +0800, Dave Young wrote: > > > > > On 04/16/19 at 11:52am, Borislav Petkov wrote: > > > > > > I'll queue the below in the next days if there are no more complaints: > > > > > > > > > > As for the kexec breakage, even with the V3 patch, kexec still hangs on > > > > > a Lenovo T420 laptop. Kairui also reproduced the problem. So can we > > > > > wait a few days see if we can make some progress to find the cause? > > > > > > > > How is applying this patch going to change anything? > > > > > > > > I was told that the breakage is there even without it... > > > > > > Without this patch, the bug happens in the efi_get_rsdp.. function, this > > > patch tries to fix that by adding kexec_get.. but the new introduced > > > kexec_* function does not work on some laptops, so it is not a 100% good > > > fix, I hoped we can get it working for all known issues. But if we can > > > not do it eg. within one week we can go with this version and leave the > > > laptop issue as a known issue. > > > > > > > Latest debugging status: > > > > Kexec boot works with commenting out some code like below, so the guid > > cmp (memcmp) caused a system reset), still need to find out why: > > > > diff --git a/arch/x86/boot/compressed/acpi.c b/arch/x86/boot/compressed/acpi.c > > index d9f9abd63c68..13e7a23ae94c 100644 > > --- a/arch/x86/boot/compressed/acpi.c > > +++ b/arch/x86/boot/compressed/acpi.c > > @@ -95,10 +95,12 @@ __efi_get_rsdp_addr(unsigned long config_tables, unsigned int nr_tables, > > table = tbl->table; > > } > > > > +/* > > if (!(efi_guidcmp(guid, ACPI_TABLE_GUID))) > > rsdp_addr = table; > > else if (!(efi_guidcmp(guid, ACPI_20_TABLE_GUID))) > > return table; > > +*/ > > } > > > > return rsdp_addr; > > @@ -291,9 +293,10 @@ acpi_physical_address get_rsdp_addr(void) > > if (!pa) > > pa = kexec_get_rsdp_addr(); > > > > +/* > > if (!pa) > > pa = efi_get_rsdp_addr(); > > - > > +*/ > > if (!pa) > > pa = bios_get_rsdp_addr(); > > > > > > Hi Dave, for this case I think it's just because GCC will found the > loop does nothing, and optimize out the whole loop in > __efi_get_rsdp_addr and will no longer read the actual nr_table value. > > I can fix the boot error on T420 with your patch, but if I add > anything, like a hardcode value assignment with the right value for > acpi_rsdp in the loop, it will reset the machine. But set acpi_rsdp > with a right initial value out side the loop works fine. > If the loop condition is false, then there should be no difference > between just comment out the line you mentioned and add an assignment. > Else it just assign the value multiple times, not very reasonable but > shouldn't fail. > > And, I inspected the generated ASM code also suggest the same thing. > So still, access the systab memory is the cause of the system reset on > certain machines. Makse sense, my previous debug also point to some systab accessing. Probably some early pg table mess up. Thanks Dave