Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758651AbcJQUKq (ORCPT ); Mon, 17 Oct 2016 16:10:46 -0400 Received: from shelob.surriel.com ([74.92.59.67]:60228 "EHLO shelob.surriel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1757555AbcJQUJv (ORCPT ); Mon, 17 Oct 2016 16:09:51 -0400 From: riel@redhat.com To: linux-kernel@vger.kernel.org Cc: mingo@kernel.org, bp@alien8.de, torvalds@linux-foundation.org, luto@kernel.org, dave.hansen@intel.linux.com, tglx@linutronix.de, hpa@zytor.com Subject: [PATCH RFC 0/3] x86/fpu: defer FPU state loading until return to userspace Date: Mon, 17 Oct 2016 16:09:41 -0400 Message-Id: <1476734984-13839-1-git-send-email-riel@redhat.com> X-Mailer: git-send-email 2.7.4 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1040 Lines: 24 These patches defer FPU state loading until return to userspace. This has the advantage of not clobbering the FPU state of one task with that of another, when that other task only stays in kernel mode. It also allows us to skip the FPU restore in kernel_fpu_end(), which will help tasks that do multiple invokations of kernel_fpu_begin/end without returning to userspace, for example KVM VCPU tasks. We could also skip the restore of the KVM VCPU guest FPU state at guest entry time, if it is still valid, but I have not implemented that yet. The code that loads FPU context directly into registers from user space memory, or saves directly to user space memory, is wrapped in a retry loop, that ensures the FPU state is correctly set up at the start, and verifies that it is still valid at the end. I have stress tested these patches with various FPU test programs, and things seem to survive. However, I have not found any good test suites that mix FPU use and signal handlers. Close scrutiny of these patches would be appreciated.