Received: by 2002:a05:6602:2086:0:0:0:0 with SMTP id a6csp3764338ioa; Tue, 26 Apr 2022 09:29:23 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxGA8Oz2knCKwbfgvT6Xxrwh9NnzqzfYajbtRCpoqLdYfPQeQaKuZT+VbpS6kMVQHu+NFt0 X-Received: by 2002:a17:907:1b10:b0:6e4:bac5:f080 with SMTP id mp16-20020a1709071b1000b006e4bac5f080mr22490042ejc.24.1650990562791; Tue, 26 Apr 2022 09:29:22 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1650990562; cv=none; d=google.com; s=arc-20160816; b=EMjUkOUb0Qx74LpE3VjtSPGoAr4vpnr2+elT0QX3THHLl0GpO7Q+PwfMw8jzNBX/6Q R/EI70jziWDalEHVp0Mue8gRFA718fUS2C8xnh/gxjjrJkevPZCMjKK7WnnEyO6G6Ew4 H97Aw8Prr7n2dmmx3HwJKkQprCamJR1+DK1z9f5fGNEKxyO+1eXlhAWeh2PSsoU0eHby uOwhnR04FMjPTXv86l/KGbHsmkQ+MELR0yhrFeZ8GexQvitEKpdgvcFBxUkOchB563hq pgWlsL5vysWRHiTCDHi/tWyCbWIAlpTLJmt99k8KZpzU9HlV8hyLW9wSwIWDYpXSwavv NP1w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=wvANiMO3qAcyNKfc+bri5b5b+fmNZvyQPnZyk3RNVCc=; b=akQOdnWqecxsf9rPX9uppOKCG1fdPh7lA2lT22+IQSWanGjHz81Vt8/yPDg4MtKoBE +SvCmRAgKPLcKpmMtqPWgpyEBD8uLtQ0f67Xozg+92hff3Qji0LNvXjMTO5PhAJe3vj+ ehQRhCIcObqcw6N17WxQiSX+Z4jdS+6820PuyZaqbmabJFFUZFTjPRLe3u7+4lyPEkq2 IK3h74y7jdFYDcHUnunTVXunjBr05kgz2/9CR3xQp9PIrVSXesA80ld/uzVg2FhZs+LC x+WlRx5S+tIw30f1l7IBiq3TFpNYtFNNunPs9du4l6cFiY0ypkKG83P53qud4gO72NKz 6Qqw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id vs23-20020a170907139700b006df76385f31si15027999ejb.977.2022.04.26.09.28.57; Tue, 26 Apr 2022 09:29:22 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1349474AbiDZLTT (ORCPT + 99 others); Tue, 26 Apr 2022 07:19:19 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:38306 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1349499AbiDZLS6 (ORCPT ); Tue, 26 Apr 2022 07:18:58 -0400 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 83326DF4 for ; Tue, 26 Apr 2022 04:15:49 -0700 (PDT) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 47BF7ED1; Tue, 26 Apr 2022 04:15:49 -0700 (PDT) Received: from FVFF77S0Q05N (unknown [10.57.76.208]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id DBF313F5A1; Tue, 26 Apr 2022 04:15:47 -0700 (PDT) Date: Tue, 26 Apr 2022 12:15:40 +0100 From: Mark Rutland To: Kees Cook Cc: linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, akpm@linux-foundation.org, alex.popov@linux.com, catalin.marinas@arm.com, luto@kernel.org, will@kernel.org Subject: Re: [PATCH 0/8] stackleak: fixes and rework Message-ID: References: <20220425115603.781311-1-mark.rutland@arm.com> <202204251551.0CFE01DF4@keescook> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-6.9 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Apr 26, 2022 at 11:37:47AM +0100, Mark Rutland wrote: > On Tue, Apr 26, 2022 at 11:10:52AM +0100, Mark Rutland wrote: > > On Mon, Apr 25, 2022 at 03:54:00PM -0700, Kees Cook wrote: > > > On Mon, Apr 25, 2022 at 12:55:55PM +0100, Mark Rutland wrote: > > > > This series reworks the stackleak code. The first patch fixes some > > > > latent issues on arm64, and the subsequent patches improve the code to > > > > improve clarity and permit better code generation. > > > > > > This looks nice; thanks! I'll put this through build testing and get it > > > applied shortly... > > > > Thanks! > > > > Patch 1 is liable to conflict with come other stacktrace bits that may go in > > for v5.19, so it'd be good if either that could be queued as a fix for > > v5.1-rc4, or we'll have to figure out how to deal with conflicts later. > > > > > > While the improvement is small, I think the improvement to clarity and > > > > code generation is a win regardless. > > > > > > Agreed. I also want to manually inspect the resulting memory just to > > > make sure things didn't accidentally regress. There's also an LKDTM test > > > for basic functionality. > > > > I assume that's the STACKLEAK_ERASING test? > > > > I gave that a spin, but on arm64 that test is flaky even on baseline v5.18-rc1. > > On x86_64 it seems consistent after 100s of runs. I'll go dig into that now. > > I hacked in some debug, and it looks like the sp used in the test is far above > the current lowest_sp. The test is slightly wrong since it grabs the address of > a local variable rather than using current_stack_pointer, but the offset I see > is much larger: > > # echo STACKLEAK_ERASING > /sys/kernel/debug/provoke-crash/DIRECT > [ 27.665221] lkdtm: Performing direct entry STACKLEAK_ERASING > [ 27.665986] lkdtm: FAIL: lowest_stack 0xffff8000083a39e0 is lower than test sp 0xffff8000083a3c80 > [ 27.667530] lkdtm: FAIL: the thread stack is NOT properly erased! > > That's off by 0x2a0 (AKA 672) bytes, and it seems to be consistent from run to > run. > > I note that an interrupt occuring could cause similar (since on arm64 those are > taken/triaged on the task stack before moving to the irq stack, and the irq > regs alone will take 300+ bytes), but that doesn't seem to be the problem here > given this is consistent, and it appears some prior function consumed a lot of > stack. > > I *think* the same irq problem would apply to x86, but maybe that initial > triage happens on a trampoline stack. > > I'll dig a bit more into the arm64 side... That offset above seems to be due to the earlier logic in direct_entry(), which I guess is running out-of-line. With that hacked to: ---------------- diff --git a/drivers/misc/lkdtm/core.c b/drivers/misc/lkdtm/core.c index e2228b6fc09bb..53f3027e8202d 100644 --- a/drivers/misc/lkdtm/core.c +++ b/drivers/misc/lkdtm/core.c @@ -378,8 +378,9 @@ static ssize_t direct_entry(struct file *f, const char __user *user_buf, size_t count, loff_t *off) { const struct crashtype *crashtype; - char *buf; + char *buf = "STACKLEAK_ERASING"; +#if 0 if (count >= PAGE_SIZE) return -EINVAL; if (count < 1) @@ -395,13 +396,17 @@ static ssize_t direct_entry(struct file *f, const char __user *user_buf, /* NULL-terminate and remove enter */ buf[count] = '\0'; strim(buf); +#endif crashtype = find_crashtype(buf); + +#if 0 free_page((unsigned long) buf); if (!crashtype) return -EINVAL; +#endif - pr_info("Performing direct entry %s\n", crashtype->name); + // pr_info("Performing direct entry %s\n", crashtype->name); lkdtm_do_action(crashtype); *off += count; ---------------- ... the SP check doesn't fail, but I still see intermittent bad value failures. Those might be due to interrupt frames. Thanks, Mark.