Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp5013347imm; Sun, 22 Jul 2018 10:58:47 -0700 (PDT) X-Google-Smtp-Source: AAOMgpeIipdrz/qSmLSgKytGSctDpy/vpD9QeL/p36EyRfVtW2jlehZ2h50A80Ol7j09LzrgLAaN X-Received: by 2002:a62:6003:: with SMTP id u3-v6mr10227398pfb.114.1532282327905; Sun, 22 Jul 2018 10:58:47 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1532282327; cv=none; d=google.com; s=arc-20160816; b=XxQSFpbZ01/SKLqsDjPF/tPMOdAoiIqEvwObQK4Pb/3b53FwAiq5jXzFCTQHSpxUNr hi21Qh/g7J2sA/+qT7LlQ0XDQYySNxkYZ+HsRLcHHmltsm+OgcROjQ/0qawXzFxUUhwD tk8B+3hI9ksx6hyhjwxyKLzN+HcEd4O+YEnR7szl43/UTn4SYsLtv/6/ZB40OwU9rbce vjZ4S8V6olC4378YLse8MP5Lylb5uzDLoQkLHmRbLjg+w8xx6/m+fR2MYjqbyAMylMAZ +FhVIyr7eSrm2cyBBR1hpSPsYwkXnx75Ak7gKB5LXNpj51DPvY3d93ES1i0kfifJGPLg ixqA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :references:in-reply-to:mime-version:dkim-signature :arc-authentication-results; bh=H8TKIFQy0SzA9IztsJN0dFR2NVrf+j5lVzxNNKf85zU=; b=wMGtsdTeb/MVcx6X/fnDVm4U0qLADUZQGgtkBxtnmO+OaWvg0EhSV15odDngkO5aBw 5KBkgFJxNFLAC8H/yczpIgVq65w3mxW0/DxrOoabBtKJ45WA5TM+CV6kZBQBf26jqQ1/ TYbB0Z0AiqD7+Hx4QQb2xyLTRDYFT0AS68OkAsLBZ4loXuTaJKKmjlx7raXcukRZSe6+ /siNFDbl1wS7bKxFeiCJ6Wz5Ca8I85xDxI5uJYxgjAzwJD/HbdVcX30UyR76lorANPr8 km/UnIg3PVWpQ8YbFSKb8mkiqWJkT48GWOJDs5N61hxZfhj9uz3dfVFCjUhd+pvUho6J MnKQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=HhL616ma; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t1-v6si7269779plj.334.2018.07.22.10.58.33; Sun, 22 Jul 2018 10:58:47 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=HhL616ma; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730334AbeGVSyX (ORCPT + 99 others); Sun, 22 Jul 2018 14:54:23 -0400 Received: from mail.kernel.org ([198.145.29.99]:57724 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730122AbeGVSyW (ORCPT ); Sun, 22 Jul 2018 14:54:22 -0400 Received: from mail-wr1-f53.google.com (mail-wr1-f53.google.com [209.85.221.53]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 0428320893 for ; Sun, 22 Jul 2018 17:56:54 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1532282214; bh=JqqWVY2NOKDjeW+YktpM51WOLWKanwYfG4+S4fxooI8=; h=In-Reply-To:References:From:Date:Subject:To:Cc:From; b=HhL616magqNDJSCTMW3O1WP/1JfOMnjQ4Xa5Z6wiAHLDqPY5kqWYQWhjVJyYR7xQ2 hRusdQC4Yw7h2BTNO/lu/tyvu/DMOBGE4F98MrjMeqJHpHo6SolnVIpuSXAmp3aOiH 5zP9T1bUElFl+oZzGmH8A92cg6LLGn3i6XbyCM1Q= Received: by mail-wr1-f53.google.com with SMTP id j5-v6so15684137wrr.8 for ; Sun, 22 Jul 2018 10:56:53 -0700 (PDT) X-Gm-Message-State: AOUpUlFVZFN+hUtijAJB/fS/nHCM82qZ4oY22rCPNvLsQNKW83l0WwXH grTDvvQeVaUSu549uSyvytAxIU+oIgwc3Io6gO5+yg== X-Received: by 2002:adf:fe42:: with SMTP id m2-v6mr6248648wrs.171.1532282212367; Sun, 22 Jul 2018 10:56:52 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:a1c:d548:0:0:0:0:0 with HTTP; Sun, 22 Jul 2018 10:56:31 -0700 (PDT) In-Reply-To: <74f29b48-7522-c4f4-c2e7-d8e41fd88873@runbox.com> References: <20180721194909.23903-1-m.v.b@runbox.com> <74f29b48-7522-c4f4-c2e7-d8e41fd88873@runbox.com> From: Andy Lutomirski Date: Sun, 22 Jul 2018 10:56:31 -0700 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH 1/2] x86/entry/64: Do not clear %rbx under Xen To: "M. Vefa Bicakci" Cc: Andy Lutomirski , LKML , Dominik Brodowski , Ingo Molnar , "H. Peter Anvin" , Thomas Gleixner , Boris Ostrovsky , Juergen Gross , xen-devel@lists.xenproject.org, X86 ML , stable , Dan Williams , Andi Kleen Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, Jul 21, 2018 at 6:20 PM, M. Vefa Bicakci wrote: > On 07/21/2018 07:37 PM, M. Vefa Bicakci wrote: >> >> On 07/21/2018 07:30 PM, Andy Lutomirski wrote: >>> >>> On Sat, Jul 21, 2018 at 4:19 PM, M. Vefa Bicakci >>> wrote: >>>> >>>> On 07/21/2018 05:45 PM, Andy Lutomirski wrote: >>>>> >>>>> >>>>> On Sat, Jul 21, 2018 at 12:49 PM, M. Vefa Bicakci >>>>> wrote: >>>>>> >>>>>> >>>>>> Commit 3ac6d8c787b8 ("x86/entry/64: Clear registers for >>>>>> exceptions/interrupts, to reduce speculation attack surface") >>>>>> unintendedly >>>>>> broke Xen PV virtual machines by clearing the %rbx register at the end >>>>>> of >>>>>> xen_failsafe_callback before the latter jumps to error_exit. >>>>>> error_exit expects the %rbx register to be a flag indicating whether >>>>>> there should be a return to kernel mode. >>>>>> >>>>>> This commit makes sure that the %rbx register is not cleared by >>>>>> the PUSH_AND_CLEAR_REGS macro, when the macro in question is >>>>>> instantiated >>>>>> by xen_failsafe_callback, to avoid the issue. >>>>> >>>>> >>>>> >>>>> Seems like a genuine problem, but: >>>>> >>>>>> diff --git a/arch/x86/entry/entry_64.S b/arch/x86/entry/entry_64.S >>>>>> index c7449f377a77..96e8ff34129e 100644 >>>>>> --- a/arch/x86/entry/entry_64.S >>>>>> +++ b/arch/x86/entry/entry_64.S >>>>>> @@ -1129,7 +1129,7 @@ ENTRY(xen_failsafe_callback) >>>>>> addq $0x30, %rsp >>>>>> UNWIND_HINT_IRET_REGS >>>>>> pushq $-1 /* orig_ax = -1 => not a system call */ >>>>>> - PUSH_AND_CLEAR_REGS >>>>>> + PUSH_AND_CLEAR_REGS clear_rbx=0 >>>>>> ENCODE_FRAME_POINTER >>>>>> jmp error_exit >>>>> >>>>> >>>>> >>>>> The old code first set RBX to zero then, if frame pointers are on, >>>>> sets it to some special non-zero value, then crosses its fingers and >>>>> hopes for the best. Your patched code just skips the zeroing part, so >>>>> RBX either contains the ENCODE_FRAME_POINTER result or is >>>>> uninitialized. >>>>> >>>>> How about actually initializing rbx to something sensible like, say, 1? >>>> >>>> >>>> Hello Andy, >>>> >>>> Thank you for the review! Apparently, I have not done my homework fully. >>>> I will test your suggestion and report back, most likely in a few hours. >>>> >>>> I have been testing with the next/linux-next tree's master branch >>>> (dated 20180720), and I noticed that ENCODE_FRAME_POINTER changes the >>>> frame pointer (i.e., RBP) register, as opposed to the RBX register, >>>> which the patch aims to avoid changing before jumping to error_exit. >>>> It is possible that I am missing something though -- I am not sure about >>>> the connection between the RBP and RBX registers. >>> >>> >>> Sorry, brain fart on my part. >> >> >> No problem! :-) >> >>>> The change introduced by commit 3ac6d8c787b8 is in the excerpt below. >>>> Would >>>> it >>>> be valid to state that the original code had the same issue that you >>>> referred >>>> to (i.e., leaving the RBX register uninitialized)? >>> >>> >>> Presumably. >>> >>> I would propose a rather different fix: >>> >>> >>> https://git.kernel.org/pub/scm/linux/kernel/git/luto/linux.git/commit/?h=x86/pti&id=bb3d76b50c3bc78b67d79cf90d328f38a435c793 >>> >>> Any chance you could test that and see if it fixes your problem? >> >> >> Of course; I will report back with the result in a few hours. > > > Hello Andy, > > I confirm that the commit at [1] resolves the issue in question as well. > > To test, I first reverted my commit, applied your commit and verified that > the bug cannot be reproduced. Afterwards, I reverted your commit and > verified that the bug is reproducible. > > I am not sure about the best way to document the bug I encountered in your > commit message, but in case you plan to have your commit merged, please > feel free to add a "Reported-and-tested-by: M. Vefa Bicakci > " > tag to the commit message, possibly with a link to this e-mail thread. > > Finally, as I had mentioned in my commit message, this bug exists in all > kernel versions 4.14 and greater, so it would be nice if you could > carbon-copy > "stable@vger.kernel.org" in the commit message as well. > > Thank you, I'm curious why the sigreturn_64 selftest didn't catch this bug. Do you happen to know why? The xen_failsafe_callback mechanism is a bit mysterious to me. > > Vefa > > [1] > https://git.kernel.org/pub/scm/linux/kernel/git/luto/linux.git/commit/?h=x86/pti&id=bb3d76b50c3bc78b67d79cf90d328f38a435c793 >