Date: Tue, 7 May 2019 10:57:24 -0400
From: Steven Rostedt
To: David Laight
Cc: 'Peter Zijlstra', Linus Torvalds, Andy Lutomirski,
 "Linux List Kernel Mailing", Ingo Molnar, Andrew Morton,
 "Andy Lutomirski", Nicolai Stange, "Thomas Gleixner", Ingo Molnar,
 "Borislav Petkov", "H. Peter Anvin", "the arch/x86 maintainers",
 Josh Poimboeuf, "Jiri Kosina", Miroslav Benes, Petr Mladek,
 Joe Lawrence, Shuah Khan, Konrad Rzeszutek Wilk, Tim Chen,
 Sebastian Andrzej Siewior, Mimi Zohar, Juergen Gross,
 Nick Desaulniers, Nayna Jain, Masahiro Yamada, "Joerg Roedel",
 "open list:KERNEL SELFTEST FRAMEWORK", stable
Subject: Re: [RFC][PATCH 1/2] x86: Allow breakpoints to emulate call functions
Message-ID: <20190507105724.02abe6f6@gandalf.local.home>
References: <20190502185225.0cdfc8bc@gandalf.local.home>
 <20190502193129.664c5b2e@gandalf.local.home>
 <20190502195052.0af473cf@gandalf.local.home>
 <20190503092959.GB2623@hirez.programming.kicks-ass.net>
 <20190503092247.20cc1ff0@gandalf.local.home>
 <2045370D-38D8-406C-9E94-C1D483E232C9@amacapital.net>
 <20190506081951.GJ2606@hirez.programming.kicks-ass.net>
 <20190507085753.GO2606@hirez.programming.kicks-ass.net>
 <20190507113050.GR2606@hirez.programming.kicks-ass.net>
 <20190507091403.556daba7@gandalf.local.home>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, 7 May 2019 14:50:26 +0000
David Laight wrote:

> From: Steven Rostedt
> > Sent: 07 May 2019 14:14
> > On Tue, 7 May 2019 12:57:15 +0000
> > David Laight wrote:
> The 'user' (ie the kernel code that needs to emulate the call) doesn't
> write the data to the stack, just to some per-cpu location.
> (Actually it could be on the stack at the other end of pt-regs.)
> So you get to the 'register restore and iret' code with the stack unaltered.
> It is then a SMOP to replace the %flags saved by the int3 with the %ip
> saved by the int3, the %ip with the address of the function to call,
> restore the flags (push and popf) and issue a ret.f to remove the %ip and %cs.

How would you handle NMIs doing the same thing?
Yes, the NMI handlers have breakpoints that will need to emulate calls
as well.

>
> (Actually you need to add 4 to the callers %ip address to allow for the
> difference between the size of int3 (hopefully 0xcc, not 0xcd 0x3).)
>
> > > > For 32bit 'the gap' happens naturally when building a 5 entry frame. Yes
> > > > it is possible to build a 5 entry frame on top of the old 3 entry one,
> > > > but why bother...
> > >
> > > Presumably there is 'horrid' code to generate the gap in 64bit mode?
> > > (less horrid than 32bit, but still horrid?)
> > > Or does it copy the entire pt_regs into a local stack frame and use
> > > that for the iret?
> >
> > On x86_64, the gap is only done for int3 and nothing else, thus it is
> > much less horrid. That's because x86_64 has a sane pt_regs storage for
> > all exceptions.
>
> Well, in particular, it always loads %sp as part of the iret.
> So you can create a gap and the cpu will remove it for you.
>
> In 64bit mode you could overwrite the %ss with the return address
> to the caller restore %eax and %flags, push the function address
> and use ret.n to jump to the function subtracting the right amount
> from %esp.
>
> Actually that means you can do the following in both modes:
>     if not emulated_call_address then pop %ax; iret else
>     # assume kernel<->kernel return
>     push emulated_call_address;
>     push flags_saved_by_int3
>     load %ax, return_address_from_iret
>     add %ax,#4
>     store %ax, first_stack_location_written_by_int3
>     load %ax, value_saved_by_int3_entry
>     popf
>     ret.n
>
> The ret.n discards everything from the %ax to the required return address.
> So 'n' is the size of the int3 frame, so 12 for i386 and 40 for amd64.
>
> If the register restore (done just before this code) finished with
> 'add %sp, sizeof *pt_regs' then the emulated_call_address can be
> loaded in %ax from the other end of pt_regs.
>
> This all reminds me of fixing up the in-kernel faults that happen
> when loading the user segment registers during 'return to user'
> fault in kernel space.

This all sounds much more complex and fragile than the proposed solution.
Why would we do this over what is being proposed?

-- Steve
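
For reference, the numbers quoted in the thread (a 3 entry frame of 12 bytes on i386, a 5 entry frame of 40 bytes on amd64, and the "add 4" adjustment to the saved %ip) can be checked with a little arithmetic. The sketch below is only an illustration: the names `I386_FRAME`, `AMD64_FRAME`, `frame_bytes` and `emulated_return_address` are made up for this example and are not kernel symbols. It assumes a kernel-to-kernel int3 (no privilege change on 32bit) and a 5-byte `call rel32` being replaced by the single-byte 0xcc int3.

```python
# Hypothetical names throughout; this only models the arithmetic from the thread.

# Words the CPU pushes when int3 traps from kernel mode.
I386_FRAME = ("eip", "cs", "eflags")                # 3 entries, no stack switch
AMD64_FRAME = ("rip", "cs", "rflags", "rsp", "ss")  # 5 entries, always pushed

CALL_REL32_LEN = 5  # opcode 0xe8 plus a 32-bit displacement
INT3_LEN = 1        # single-byte 0xcc encoding (not the two-byte 0xcd 0x03)


def frame_bytes(frame, word_size):
    """Size in bytes of the exception frame, i.e. the 'n' in ret.n."""
    return len(frame) * word_size


def emulated_return_address(saved_ip):
    """Return address a real call would have pushed.

    saved_ip is the %ip saved by int3, which points just past the
    one-byte breakpoint; the original call instruction was 5 bytes long.
    """
    return saved_ip + (CALL_REL32_LEN - INT3_LEN)


if __name__ == "__main__":
    print(frame_bytes(I386_FRAME, 4))
    print(frame_bytes(AMD64_FRAME, 8))
    print(hex(emulated_return_address(0x1001)))
```

The difference in frame layout is why 32bit needs a gap to be created before a return address can be pushed, while on 64bit the saved stack pointer is always part of the frame and is reloaded by iret.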