Received: by 2002:a05:6a10:1d13:0:0:0:0 with SMTP id pp19csp597790pxb; Wed, 1 Sep 2021 06:10:00 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxM3Lf0lV9B2Lg0UFMecFzJLZPe7SOOwbIfHZyo8C9QWOOvlqMwto/GEwStNUuJU1F8GuWf X-Received: by 2002:a2e:b703:: with SMTP id j3mr24900358ljo.63.1630501799897; Wed, 01 Sep 2021 06:09:59 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1630501799; cv=none; d=google.com; s=arc-20160816; b=xTjTqSI4SLd9iANo/k2wJwMX/kJgtTAOls89cgrUjfnb69OHoB6BPLiRZydiymuYzS Os/DTAGBwLpUZDBr03rilUsXWg/61nYF41gOjtvAna8trP1S3VKZ9RwdQzii10+TJcOz JgzbGxjVGbWVHmczqfnvxss3ejthp5TjDhwWP1NZMHqCIS2ufyPJh0jRs/zm7fX3jDxe Rklq2SMAAvCesr0KxByiBPJenosPyFTcaiWQfLoujYyB4m+8myZ/1daV8UbgxFB9k1EM VSVevLUH2nbSGtHDn0CAX33YQN8/Ur/9tdfeeNgCNoGZrpk7rNGOOxl1lL3tn62Pyrjh AQmA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=rmuIuK2FpCv0DEu4TLFx/eI7rk7UX7fPyKr+Xpl6u6Y=; b=BAX4lHyQkQRjNQ0gbCfIZMRcyK+AHPW1TVV2V7wJjZFT3C6+pLUwPUt1SZQ+J+XJsX KHYhLtk9gjwbHZE6EIbRrN8+RHPqbiX9Lpv1hwWVD8iqEmHwJoD1V1LCmpO5JDt2UkTY 6qHEZk7LTFSNavj28mDzp62mtkM0DkgZz4Aj3ipuNI5Plu7DToK0Amh39TF78hNehiN/ LMdJM0m794Zl1aL0HqX5ZwjrXtTTQQ7clVxf0hoZcU3+KLKof46kOs8zESq3pRBFy/oD 9at5YkgKlva308WbkoZMDcNzIeVEkTMdIZtIP8OQedTIYCmlkyuqr8zRqetZJ8BQOxs/ HH+w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@alien8.de header.s=dkim header.b=r6jEqxA1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=alien8.de Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id e1si909433edl.348.2021.09.01.06.09.23; Wed, 01 Sep 2021 06:09:59 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@alien8.de header.s=dkim header.b=r6jEqxA1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=alien8.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1344184AbhIANDB (ORCPT + 99 others); Wed, 1 Sep 2021 09:03:01 -0400 Received: from mail.skyhub.de ([5.9.137.197]:60240 "EHLO mail.skyhub.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1345667AbhIANAf (ORCPT ); Wed, 1 Sep 2021 09:00:35 -0400 Received: from zn.tnic (p200300ec2f0f3000a727e3aff00b12e4.dip0.t-ipconnect.de [IPv6:2003:ec:2f0f:3000:a727:e3af:f00b:12e4]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.skyhub.de (SuperMail on ZX Spectrum 128k) with ESMTPSA id A1A921EC01A9; Wed, 1 Sep 2021 14:59:32 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=alien8.de; s=dkim; t=1630501172; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:in-reply-to:in-reply-to: references:references; bh=rmuIuK2FpCv0DEu4TLFx/eI7rk7UX7fPyKr+Xpl6u6Y=; b=r6jEqxA1kBmuK19FN0TZsUfbKoQ7P8WADTQ/AuwKBFXRf0vuABalTI+NeMLZzvHfQhU0lz JXg1YfbZcebXpZsQS93mOTHnX6cbN/oIqd0/Trgkwk2YknnwEPwDH+M/mOd99wNtlfUDXz Ptadg5YGr8MMIdz7HKbk7vTpO7zRv9g= Date: Wed, 1 Sep 2021 15:00:07 +0200 From: Borislav Petkov To: Dave Hansen Cc: "Yu, Yu-cheng" , x86@kernel.org, "H. Peter Anvin" , Thomas Gleixner , Ingo Molnar , linux-kernel@vger.kernel.org, linux-doc@vger.kernel.org, linux-mm@kvack.org, linux-arch@vger.kernel.org, linux-api@vger.kernel.org, Arnd Bergmann , Andy Lutomirski , Balbir Singh , Cyrill Gorcunov , Dave Hansen , Eugene Syromiatnikov , Florian Weimer , "H.J. Lu" , Jann Horn , Jonathan Corbet , Kees Cook , Mike Kravetz , Nadav Amit , Oleg Nesterov , Pavel Machek , Peter Zijlstra , Randy Dunlap , "Ravi V. Shankar" , Dave Martin , Weijiang Yang , Pengfei Xu , Haitao Huang , Rick P Edgecombe Subject: Re: [PATCH v29 23/32] x86/cet/shstk: Add user-mode shadow stack support Message-ID: References: <20210820181201.31490-1-yu-cheng.yu@intel.com> <20210820181201.31490-24-yu-cheng.yu@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org First of all, thanks a lot Dave for taking the time to communicate properly with me! On Fri, Aug 27, 2021 at 01:25:29PM -0700, Dave Hansen wrote: > I don't think this has anything to do with context-switching, really. > > The code lands in shstk_setup() which wants to make sure that the new > MSR values are set before the task goes out to userspace. If > TIF_NEED_FPU_LOAD was set, it could do that by going out to the XSAVE > buffer and setting the MSR state in the buffer. Before returning to > userspace, it would be XRSTOR'd. A WRMSR by itself would not be > persistent because that XRSTOR would overwrite it. > > But, if TIF_NEED_FPU_LOAD is *clear* it means the XSAVE buffer is > out-of-date and the registers are live. WRMSR can be used and there > will be a XSAVE* to the task buffer during a context switch. > > So, this code takes the coward's way out: it *forces* TIF_NEED_FPU_LOAD > to be clear by making the registers live with fpregs_restore_userregs(). > That lets it just use WRMSR instead of dealing with the XSAVE buffer > directly. If it didn't do this with the *WHOLE* set of user FPU state, > we'd need more fine-granted "NEED_*_LOAD" tracking than our one FPU bit. > > This is also *only* safe because the task is newly-exec()'d and the FPU > state was just reset. Otherwise, we might have had to worry that the > non-PL3 SSPs have garbage or that non-SHSTK bits are set in MSR_IA32_U_CET. > > That said, after staring at it, I *think* this code is functionally > correct and OK performance-wise. Right, except that that is being done in setup_signal_shadow_stack()/restore_signal_shadow_stack() too, for the restore token. Which means, a potential XRSTOR each time just for a single MSR. That means, twice per signal in the worst case. Which means, shadow stack should be pretty noticeable in signal-heavy benchmarks... > I suspect that the (very blunt) XRSTOR inside of > start_update_msrs()->fpregs_restore_userregs() is quite rare because > TIF_NEED_FPU_LOAD will usually be clear due to the proximity to > execve(). So, adding direct XSAVE buffer manipulation would probably > only make it more error prone. @Yu-cheng: please take Dave's explanation as is and stick it over start_update_msrs() so that it is clear what that thing is doing. Thx. -- Regards/Gruss, Boris. https://people.kernel.org/tglx/notes-about-netiquette