Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1760378AbYFPIUa (ORCPT ); Mon, 16 Jun 2008 04:20:30 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1753261AbYFPIUV (ORCPT ); Mon, 16 Jun 2008 04:20:21 -0400 Received: from lec.cs.unibo.it ([130.136.1.103]:33419 "EHLO lec.cs.unibo.it" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753250AbYFPIUT (ORCPT ); Mon, 16 Jun 2008 04:20:19 -0400 Date: Mon, 16 Jun 2008 09:58:04 +0200 To: LKML , Jeff Dike , Roland McGrath Subject: [PATCH 0/1] ptrace_vm: let us simplify the code for ptrace and add useful features for VM Message-ID: <20080616075804.GA6950@cs.unibo.it> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.5.13 (2006-08-11) From: renzo@cs.unibo.it (Renzo Davoli) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 5168 Lines: 107 Proposal: let us simplify PTRACE_SYSCALL/PTRACE_SINGLESTEP/PTRACE_SYSEMU/PTRACE_SYSEMU_SINGLESTEP, and now PTRACE_BLOCKSTEP (which will require soon a PTRACE_SYSEMU_BLOCKSTEP), my PTRACE_SYSVM...etc. etc. Summary of the solution: Use tags in the "addr" parameter of existing PTRACE_SYSCALL/PTRACE_SINGLESTEP/PTRACE_CONT/PTRACE_BLOCKSTEP calls to skip the current call (PTRACE_VM_SKIPCALL) or skip the second upcall to the VM/debugger after the syscall execution (PTRACE_VM_SKIPEXIT). Note: The patch is against linux-2.6.26-rc6, it applies with some line offset warnings to git2, too. Motivation: The ptrace tag PTRACE_SYSEMU is a feature mainly used for User-Mode Linux, or at most for other virtual machines aiming to virtualize *all* the syscalls (total virtual machines). In fact: ptrace(PTRACE_SYSEMU, pid, 0, 0) means that the *next* system call will not be executed. PTRACE_SYSEMU AFAIK has been implemented only for x86_32. I already proposed some time ago a different tag: PTRACE_SYSVM (and I maintain a patch for it) where: ptrace(PTRACE_SYSVM, pid, XXX, 0) 1* is the same as PTRACE_SYSCALL when XXX==0, 2* skips the call (and stops before entering the next syscall) when PTRACE_VM_SKIPCALL | PTRACE_VM_SKIPEXIT 3* skips the ptrace call after the system call if PTRACE_VM_SKIPEXIT. PTRACE_SYSVM has been implemented for x86_32, powerpc_32, um+x86_32. (x86_64 and ppc64 exist too, but are less tested). The main difference between SYSEMU and SYSVM is that with SYSVM it is possible to decide if *this* system call should be executed or not (instead of the next one). SYSVM can be used also for partial virtual machines (some syscall gets virtualized and some others do not), like our umview. PTRACE_SYSVM above can be used instead of PTRACE_SYSEMU in user-mode linux and in all the others total virtual machines. In fact, provided user-mode linux skips *all* the syscalls it does not matter if the upcall happens just after (SYSEMU) or just before (SYSVM) having skipped the syscall. Briefly I would like to unify SYSCALL, SYSEMU and SYSVM. We don't need three different tags (and all their "variations", SINGLESTEP->SYSEMU_SINGLESTEP etc). We could keep PTRACE_SYSCALL, using the addr parameter as in PTRACE_SYSVM. In this case all the code I have seen (user-mode linux, strace, umview and googling around) use 0 or 1 for addr (being defined unused). defining PTRACE_VM_SKIPCALL=4 and PTRACE_VM_SKIPEXIT=2 (i.e. by ignoring the lsb) everything previously coded using PTRACE_SYSCALL should continue to work. In the same way PTRACE_SINGLESTEP, PTRACE_CONT and PTRACE_BLOCKSTEP can use the same tags restarting after a SYSCALL. This change would eventually simplify both the kernel code (reducing tags and exceptions) and even user-mode linux and umview. The skip-exit feature can be implemented in a arch-independent manner, while for skip_call some simple changes are needed (the entry assembly code should process the return value of the syscall tracing function call, like in arch/x86/kernel/Entry_32.S). Motivation summary: 1) (eventually) Reduce the number of PTRACE tags. The proposed patch does not add any tag. On the contrary after a period of deprecation SYSEMU* tags can be eliminated. 2) Backward compatible with existing software (existing UML kernels, strace already tested). Only software using strange "addr" values (currently ignored) could have portability problems. 3) (eventually) simplify kernel code. SYSEMU support is a bit messy and x86/32 only. These new PTRACE_VM tags for the addr parameter will allow to get rid of SYSEMU code. 4) It is simple to be ported across the architecture. This patch already support PTRACE_VM_SKIPEXIT for all architectures and PTRACE_VM_SKIPCALL for x86_32/64 (incl. x86_64 emu32), powerpc32/64, UML. (to be honest I have tested the code on all the architectures above but powerpc64. It does not mean that it is bug free even on the others, but at least I have tried some UML kernels and some umview runs). 5) It is more powerful than PTRACE_SYSEMU. It provides an optimized support for partial virtualization (some syscalls gets virtualized some other do not) while keeping support for total virtualization a' la UML. 6) Software currently using PTRACE_SYSEMU can be easily ported to this new support. The porting for UML (client side) is already in the patch. All the calls like: ptrace(PTRACE_SYSEMU, pid, 0, 0) can be converted into ptrace(PTRACE_SYSCALL, pid, PTRACE_VM_SKIPCALL, 0) (but the first PTRACE_SYSCALL, the one which starts up the emulation. In practice it is possible to set PTRACE_VM_SKIPCALL for the first call, too. The "addr" tag is ignored being no syscalls pending). The same feature has been implemented also against the new ptrace running on McGrath's utrace support. This specific patch can be found here: http://view-os.svn.sourceforge.net/viewvc/view-os/trunk/kmview-kernel-module/kernel_patches/ renzo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/