Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp1994524ybl; Thu, 19 Dec 2019 06:24:45 -0800 (PST) X-Google-Smtp-Source: APXvYqwzWGJj+HZepMG9/x8UTf7uB//59tuDCzomV/R+1SiRxwzvbi26svqwKMkCf7WadzJ4BNVc X-Received: by 2002:a05:6830:1c85:: with SMTP id v5mr9564149otf.35.1576765485770; Thu, 19 Dec 2019 06:24:45 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1576765485; cv=none; d=google.com; s=arc-20160816; b=w4IV18OKJals2t5SKxqGNs5DSTDfsMybNO9k9CNlJgF8SY9wSgRegSao0Kd0pqOlyu lnfikiGNxv01YAm1RUSYdRP/L4jMzoJSU4RTrTCYFZPZzpvrUgwEU0+o76JkMw/3Qbmi J7xY+xl/dTaRi8WCLMBfFOcNL4J1eTbsouenmqelfqlHdq9JA4ybAYYk10ztzNGwJqbb LaoLymT74fmgwu/rD8WRC06uAiFGpOuf4f7VhfnYXJHeSoIWvDl3A8/kmpzTWxqzRpT7 njYJBOd6V70WXuRoZmIBD5hgPLAfRv36+oK2KpRVJx1lRryH96s8kKn8qNr2knDKQgKk 9zKQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=rOb4F7v6NzOkO94AvO9f81Kbs/T8wrmeZnUR4ih7okg=; b=OM9S9WFL50NrpngbRVbVTu3ymaZrW/L80KFLW6KHTFmIbTHQOJ9pdpM/Ajcn+/SVHa N7shWEQhxii4eNeFUDmdFcbcydhBrWhCzWI08A7NkwRWsWeDTrfq/INGQttLXs3YmMVQ uN8SR/HkRL74IR57lSRJCUd/vzlo25YZcOj/AFiDDO+PcIRo3jV4wdQCl/8tcBUAthcu SooroseXFFNVPgwoOjwkpfBZVx14rWM2lXi0X07CpJvbiLfYyuDeiWLmOgYzb7Rs4j0K rbYcD49+e2I3so2w0T4NaWPND6cD4NOlWjej88Wv+hQoTEt+voraV1FrZ0OvJyKSNpHv g5Gg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r12si3283250otq.156.2019.12.19.06.24.32; Thu, 19 Dec 2019 06:24:45 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726928AbfLSOW0 (ORCPT + 99 others); Thu, 19 Dec 2019 09:22:26 -0500 Received: from Galois.linutronix.de ([193.142.43.55]:60103 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726701AbfLSOWZ (ORCPT ); Thu, 19 Dec 2019 09:22:25 -0500 Received: from bigeasy by Galois.linutronix.de with local (Exim 4.80) (envelope-from ) id 1ihwgz-0005lY-97; Thu, 19 Dec 2019 15:22:17 +0100 Date: Thu, 19 Dec 2019 15:22:17 +0100 From: Sebastian Andrzej Siewior To: Yu-cheng Yu Cc: linux-kernel@vger.kernel.org, x86@kernel.org, "H. Peter Anvin" , Thomas Gleixner , Ingo Molnar , Dave Hansen , Tony Luck , Andy Lutomirski , Borislav Petkov , Rik van Riel , "Ravi V. Shankar" , Fenghua Yu , Peter Zijlstra Subject: Re: [PATCH v2 3/3] x86/fpu/xstate: Invalidate fpregs when __fpu_restore_sig() fails Message-ID: <20191219142217.axgpqlb7zzluoxnf@linutronix.de> References: <20191212210855.19260-1-yu-cheng.yu@intel.com> <20191212210855.19260-4-yu-cheng.yu@intel.com> <20191218155449.sk4gjabtynh67jqb@linutronix.de> <587463c4e5fa82dff8748e5f753890ac390e993e.camel@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <587463c4e5fa82dff8748e5f753890ac390e993e.camel@intel.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019-12-18 12:53:59 [-0800], Yu-cheng Yu wrote: > I could have explained this better, sorry! I will explain the first > case below; other cases are similar. > > In copy_user_to_fpregs_zeroing(), we have: > > if (user_xsave()) { > ... > if (unlikely(init_bv)) > copy_kernel_to_xregs(&init_fpstate.xsave, init_bv); > return copy_user_to_xregs(buf, xbv); > ... > } > > The copy_user_to_xregs() may fail, and when that happens, before going to > the slow path, there is fpregs_unlock() and context switches may happen. The context switch may only happen after fpregs_unlock(). > However, at this point, fpu_fpregs_owner_ctx has not been changed; it could > still be another task's FPU. TIF_NEED_FPU_LOAD is set for the task in __fpu__restore_sig() and its context (__fpu_invalidate_fpregs_state()) has been invalidated. So the FPU register may contain another task's content and fpu_fpregs_owner_ctx points to another context. > For this to happen and to be detected, the user > stack page needs to be non-present, fpu_fpregs_owner_ctx need to be another task, > and that other task needs to be able to detect its registers are modified. > The last factor is not easy to reproduce, and a CET control-protection fault > helps. So far everything is legal. However. If there is a context switch before fpregs_lock() then this is bad before we don't account for that. So that: diff --git a/arch/x86/kernel/fpu/signal.c b/arch/x86/kernel/fpu/signal.c --- a/arch/x86/kernel/fpu/signal.c +++ b/arch/x86/kernel/fpu/signal.c @@ -352,6 +352,7 @@ static int __fpu__restore_sig(void __user *buf, void __user *buf_fx, int size) fpregs_unlock(); return 0; } + fpregs_deactivate(fpu); fpregs_unlock(); } @@ -403,6 +404,8 @@ static int __fpu__restore_sig(void __user *buf, void __user *buf_fx, int size) } if (!ret) fpregs_mark_activate(); + else + fpregs_deactivate(fpu); fpregs_unlock(); err_out: Should be enough. > > Can you tell me which glibc test? I would like to reproduce this. > > > > > The introduction of supervisor xstates and CET, while not contributing to > > > the problem, makes it more detectable. After init_fpstate and the Shadow > > > Stack pointer have been restored to xregs, the XRSTOR from user stack > > > fails and fpu_fpregs_owner_ctx is not updated. The task currently owning > > > fpregs then uses the corrupted Shadow Stack pointer and triggers a control- > > > protection fault. > > > > So I don't need new HW with supervisor and CET? A plain KVM box with > > SSE2 and so should be enough? > > What I do is, clone the whole glibc source, and run mutiple copies of > "make check". In about 40 minutes or so, there are unexplained seg faults, > or a few control-protection faults (if you enable CET). Please let me > know if more clarification is needed. Okay. Can you please try the above and if not, I try that glibc thing myself. > Thanks, > Yu-cheng Sebastian