Received: by 2002:a05:6a10:22f:0:0:0:0 with SMTP id 15csp702963pxk; Thu, 24 Sep 2020 16:45:29 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyc0ABvq1vMhmGapCSaHX4wo8g0iUZo6DI/0FkPovZbGhQJ1LUnwxCXaA3NQzID1bIanb12 X-Received: by 2002:a17:906:4553:: with SMTP id s19mr47862ejq.475.1600991129170; Thu, 24 Sep 2020 16:45:29 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1600991129; cv=none; d=google.com; s=arc-20160816; b=bG2269jqAqGg5dPgCOL+FxxTohZRP7AsxbLBNCYOAcwUHBv1wB1eBLi4WM2xdCT3Ms jYBTOddhInmefYIWJHF9v3HfFaCsEdXmW0p7YPPVWeNXebhJbxhN7mZm8tV5KAdra5v4 4359rxJ36prunbLTKVEBA7QHAn51lID/5uDT7hsjOi8CabGmqVX+hjdhhIX3P+a6QmKB twfksZMAGdmPN0KbgXkAtrRi1xqGg6AIhYPJOxcV2gYfEhxoRZMiFrdHv+ff9SNcWAnq u9kYXn4E4AlCA0HZcQecUAnByzOuj3IRw8OZ06gg3CeTQ+VN8W1mLqSPWTYeWOJMS1pq OCNw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:date:from:sender:dkim-signature; bh=+T95pP0uGxIm5Hz5QWTAjFs5Yw5r7ZYnLnu32ktfF3g=; b=00rL7ogbC64C9xuTwXECNWQol6WFfSybBSRVyPkSlUk5I8vBQqZAmzaA8hWFf668zw JSEv73HSo05Y8TRkmeySSEcAk2ncIvS7iLDm8zYQO+Yo4nHHXareKDkZ+A1K5CMnYLnY 4xJsxwWeZcyYAtd7h3rPZ8pWNTUSJ8F1t0OPtf0N1MDCVL61/5uojWaon+E8r55bxnRF /ildr5UwS1BciCKISCTpw4hK2yqohXYBCgC9dqLGoNUFjnkQ7w/kOQduE3Wvyqa6rj4w xX86ID+9x9N/KyQj3pjAGBwJGvKWkcw739t+Hki5izzZyCbb/z2GJH6zIchD0sz+/lCR JOow== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=SRS+3ztW; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id cx6si626729edb.437.2020.09.24.16.45.04; Thu, 24 Sep 2020 16:45:29 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=SRS+3ztW; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726807AbgIXXnv (ORCPT + 99 others); Thu, 24 Sep 2020 19:43:51 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:58652 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726397AbgIXXnv (ORCPT ); Thu, 24 Sep 2020 19:43:51 -0400 Received: from mail-qt1-x841.google.com (mail-qt1-x841.google.com [IPv6:2607:f8b0:4864:20::841]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 370CEC0613CE; Thu, 24 Sep 2020 16:43:51 -0700 (PDT) Received: by mail-qt1-x841.google.com with SMTP id 19so584963qtp.1; Thu, 24 Sep 2020 16:43:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=sender:from:date:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=+T95pP0uGxIm5Hz5QWTAjFs5Yw5r7ZYnLnu32ktfF3g=; b=SRS+3ztWImIJz5ak5a8R2YUyAT3cOcKWla/DfSAbwA5EsQXf6lTgaBbOhFfu5VRAMx EBRH7dn0NSJYoCVohXXTd3AMFraMiAcZml1GyPB3z58QN07wqG9zoJivG2DgcQaQXRMb pWz1g3b2BP0i/X6v4wphLB+1NOv2S5Y2UonuWaD/OVTEuBAvqkQRTMfKw/959ClbM9aG aBdpSP0+xY/OxvAAtmGW3n2mOviHfCjFeRdI2E3xoJCVzhoPhpxVTCaqKyS10a3SpZZs WFJE9UifKsyVMPXz0QZXmvh6FqqjyMtTPWKuNF6a/pmAJT3YNgidThcGIy1lLmEKypUh CFyQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:from:date:to:cc:subject:message-id :references:mime-version:content-disposition:in-reply-to; bh=+T95pP0uGxIm5Hz5QWTAjFs5Yw5r7ZYnLnu32ktfF3g=; b=KooTI82TBVY6vX671Yc+EuI+zv2gh4DhneSd45fqygTjvkiOZAIG4ycnAEs+Xvsqpp l+/3oSPLls5NceeqM8bu4PZRONu3EZgRaqCpclYKYuCjvEYDzWgNF2dwnFGOvnhfDz/o bQsUYqqLg7xg5Jg/Xr/qvNZkeTmawp+07dHuE4PqGaNn94HxeDxCMeOviyDFuApC3YGI 5y/7MPS6Ezjb8FUuMPSgTswVuPZoWV+mrYnL9SAlBtE2G0RK6Uxk+07n9bct0nfhZixQ kTVPbHDfUmpdqCjHGfgDGblIA4PUR0582WgqQD4r/BJeV2zHbGFND7VQ1C7xOGJxJtv2 w9qA== X-Gm-Message-State: AOAM531Kp2cDnxWaDN74q26UwdgE7HsyBl1df7QB1Te98E0llSP+8Ley IS2m/O92XzSngD03/lLFOvk= X-Received: by 2002:aed:2e26:: with SMTP id j35mr1773028qtd.377.1600991030270; Thu, 24 Sep 2020 16:43:50 -0700 (PDT) Received: from rani.riverdale.lan ([2001:470:1f07:5f3::b55f]) by smtp.gmail.com with ESMTPSA id w18sm793791qtt.19.2020.09.24.16.43.48 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 24 Sep 2020 16:43:49 -0700 (PDT) Sender: Arvind Sankar From: Arvind Sankar X-Google-Original-From: Arvind Sankar Date: Thu, 24 Sep 2020 19:43:47 -0400 To: "Madhavan T. Venkataraman" Cc: Arvind Sankar , Florian Weimer , kernel-hardening@lists.openwall.com, linux-api@vger.kernel.org, linux-arm-kernel@lists.infradead.org, linux-fsdevel@vger.kernel.org, linux-integrity@vger.kernel.org, linux-kernel@vger.kernel.org, linux-security-module@vger.kernel.org, oleg@redhat.com, x86@kernel.org, libffi-discuss@sourceware.org, luto@kernel.org, David.Laight@ACULAB.COM, mark.rutland@arm.com, mic@digikod.net, pavel@ucw.cz Subject: Re: [PATCH v2 0/4] [RFC] Implement Trampoline File Descriptor Message-ID: <20200924234347.GA341645@rani.riverdale.lan> References: <20200916150826.5990-1-madvenka@linux.microsoft.com> <87v9gdz01h.fsf@mid.deneb.enyo.de> <96ea02df-4154-5888-1669-f3beeed60b33@linux.microsoft.com> <20200923014616.GA1216401@rani.riverdale.lan> <20200923091125.GB1240819@rani.riverdale.lan> <20200923195147.GA1358246@rani.riverdale.lan> <2ed2becd-49b5-7e76-9836-6a43707f539f@linux.microsoft.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <2ed2becd-49b5-7e76-9836-6a43707f539f@linux.microsoft.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Sep 24, 2020 at 03:23:52PM -0500, Madhavan T. Venkataraman wrote: > > > > Which ISA does not support PIC objects? You mentioned i386 below, but > > i386 does support them, it just needs to copy the PC into a GPR first > > (see below). > > Position Independent Code needs PC-relative branches. I was referring > to PC-relative data references. Like RIP-relative data references in > X64. i386 ISA does not support this. I was talking about PC-relative data references too: they are a requirement for PIC code that wants to access any global data. They can be implemented easily on i386 even though it doesn't have an addressing mode that uses the PC. > Otherwise, using an ABI quirk or a calling convention side effect to load the > PC into a GPR is, IMO, non-standard or non-compliant or non-approved or > whatever you want to call it. I would be conservative and not use it. Who knows > what incompatibility there will be with some future software or hardware > features? > > For instance, in the i386 example, we do a call without a matching return. > Also, we use a pop to undo the call. Can anyone tell me if this kind of use > is an ABI approved one? This doesn't have anything to do with the ABI, since what happened here isn't visible to any caller or callee. Any machine instruction sequence that has the effect of copying the PC into a GPR is acceptable, but this is basically the only possible solution on i386. If you don't like the call/pop mismatch (though that's supported by the hardware, and is what clang likes to use), you can use the slightly different technique used in my example, which copies the top of stack into a GPR after a call. This is how all i386 PIC code has always worked. > Standard API for all userland for all architectures > --------------------------------------------------- > > The next advantage in using the kernel is standardization. > > If the kernel supplies this, then all applications and libraries can use > it for all architectures with one single, simple API. Without this, each > application/library has to roll its own solution for every architecture-ABI > combo it wants to support. But you can get even more standardization out of a userspace library, because that can work even on non-linux OS's, as well as versions of linux where the new syscall isn't available. > > Furthermore, if this work gets accepted, I plan to add a glibc wrapper for > the kernel API. The glibc API would look something like this: > > Allocate a trampoline > --------------------- > > tramp = alloc_tramp(); > > Set trampoline parameters > ------------------------- > > init_tramp(tramp, code, data); > > Free the trampoline > ------------------- > > free_tramp(tramp); > > glibc will allocate and manage the code and data tables, handle kernel API > details and manage the trampoline table. glibc could do this already if it wants, even without the syscall, because this can be done in userspace already. > > Secure vs Performant trampoline > ------------------------------- > > If you recall, in version 1, I presented a trampoline type that is > implemented in the kernel. When an application invokes the trampoline, > it traps into the kernel and the kernel performs the work of the trampoline. > > The disadvantage is that a trip to the kernel is needed. That can be > expensive. > > The advantage is that the kernel can add security checks before doing the > work. Mainly, I am looking at checks that might prevent the trampoline > from being used in an ROP/BOP chain. Some half-baked ideas: > > - Check that the invocation is at the starting point of the > trampoline > > - Check if the trampoline is jumping to an allowed PC > > - Check if the trampoline is being invoked from an allowed > calling PC or PC range > > Allowed PCs can be input using the trampfd API mentioned in version 1. > Basically, an array of PCs is written into trampfd. The source PC will generally not be available if the compiler decided to tail-call optimize the call to the trampoline into a jump. What's special about these trampolines anyway? Any indirect function call could have these same problems -- an attacker could have overwritten the pointer the same way, whether it's supposed to point to a normal function or it is the target of this trampoline. For making them a bit safer, userspace could just map the page holding the data pointers/destination address(es) as read-only after initialization. > > Suggestions for other checks are most welcome! > > I would like to implement an option in the trampfd API. The user can > choose a secure trampoline or a performant trampoline. For a performant > trampoline, the kernel will generate the code. For a secure trampoline, > the kernel will do the work itself. > > In order to address the FFI_REGISTER ABI in libffi, we could use the secure > trampoline. In FFI_REGISTER, the data is pushed on the stack and the code > is jumped to without using any registers. > > As outlined in version 1, the kernel can push the data address on the stack > and write the code address into the PC and return to userland. > > For doing all of this, we need trampfd. We don't need this for FFI_REGISTER. I presented a solution that works in userspace. Even if you want to use a trampoline created by the kernel, there's no reason it needs to trap into the kernel at trampoline execution time. libffi's trampolines already handle this case today. > > Permitting the use of trampfd > ----------------------------- > > An "exectramp" setting can be implemented in SELinux to selectively allow the > use of trampfd for applications. > > Madhavan Applications can use their own userspace trampolines regardless of this setting, so it doesn't provide any additional security benefit by preventing usage of trampfd.