Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp5322672imm; Tue, 21 Aug 2018 09:45:39 -0700 (PDT) X-Google-Smtp-Source: AA+uWPwxZqW60U9h7nXURDk+tFvgRENrAp+Yc5/hlDYxr/N/T3klh8z/qEnpphft9FYPQCK8YDyZ X-Received: by 2002:a63:790b:: with SMTP id u11-v6mr48038171pgc.111.1534869939551; Tue, 21 Aug 2018 09:45:39 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1534869939; cv=none; d=google.com; s=arc-20160816; b=IxqQErJ7Ac8/DHFh+mBDORG6rgerCgh5xxsUBO8qB8jclEYoFuDHzyfy517i9fzhq7 rI3/3qkX+D8XBCs+pK1etZjSI3SacOf+7sOIf7F9qwbagQDdVoXYifHe5QKMH+/96a/n Z9lvj3Ty7ONlPT1q+XT7vFHdZQbno0J+5YTRvJUyjvISFMv3ZguSdCvHjuk+QItsoKA7 C18vru6EqZPiIzigIKWPYiNvkeaMBM1pbS+5Eq97ZagmLfOJQ1VFeB3QnK9zR3Edr4xp bHsjjcy+KqpObjdK2RuGgLrkYgvbHpJ0c70imSWrNzkDBekhdobsivz1w3JX63bHeRW7 580w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-disposition :content-transfer-encoding:mime-version:in-reply-to:references :subject:cc:to:from:date:message-id:arc-authentication-results; bh=Pp862XbULXc8LfCRoSC66wCeJjpapa1sYj48GQLaRFg=; b=ZekPurcpfJKatcpreZOOEOTx+GmHvW2oPEOKeIOOk6jY22GbIHv/JjRCmQ7M4Z3sfZ ZpYKgUfXUPS99vbOJNkIj1aF5i4ns+rYrLPLmGwQRaoXsfayU3jEXlrW8/3mRZHPGtx0 zM8E2mqMS2q8FPZI9/ed4LdbJIqllyuTADE/2hS5GfeFx65zGSSRfYpUrx5pBcGHQUN7 SxC5gdFK4FjhQhiFcdd3s8Kiwr/MZihckFkBN+XQpGcQvyD6hgWBFUaLplDWSeHNsB4w dp8fG0NgDfuJXsdl0crl3U+NGhW6zeMf+RI7+2QPTIeBXW6d6fJaNKO+M8tpJcS7NXVj h+5g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id p14-v6si12246616plo.357.2018.08.21.09.45.24; Tue, 21 Aug 2018 09:45:39 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728114AbeHUTS0 convert rfc822-to-8bit (ORCPT + 99 others); Tue, 21 Aug 2018 15:18:26 -0400 Received: from prv1-mh.provo.novell.com ([137.65.248.33]:51099 "EHLO prv1-mh.provo.novell.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726967AbeHUTSZ (ORCPT ); Tue, 21 Aug 2018 15:18:25 -0400 Received: from INET-PRV1-MTA by prv1-mh.provo.novell.com with Novell_GroupWise; Tue, 21 Aug 2018 09:57:43 -0600 Message-Id: <5B7C367002000078001E0987@prv1-mh.provo.novell.com> X-Mailer: Novell GroupWise Internet Agent 18.0.2 Date: Tue, 21 Aug 2018 09:57:36 -0600 From: "Jan Beulich" To: "Juergen Gross" Cc: "the arch/x86 maintainers" , , "xen-devel" , "Boris Ostrovsky" , , , Subject: Re: [Xen-devel] [PATCH v2 2/2] x86/pae: use 64 bit atomic xchg function in native_ptep_get_and_clear References: <20180821153755.30462-1-jgross@suse.com> <20180821153755.30462-3-jgross@suse.com> In-Reply-To: <20180821153755.30462-3-jgross@suse.com> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 8BIT Content-Disposition: inline Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org >>> On 21.08.18 at 17:37, wrote: > Using only 32-bit writes for the pte will result in an intermediate > L1TF vulnerable PTE. When running as a Xen PV guest this will at once > switch the guest to shadow mode resulting in a loss of performance. > > Use arch_atomic64_xchg() instead which will perform the requested > operation atomically with all 64 bits. > > Some performance considerations according to: > > https://software.intel.com/sites/default/files/managed/ad/dc/Intel-Xeon-Scal > able-Processor-throughput-latency.pdf > > The main number should be the latency, as there is no tight loop around > native_ptep_get_and_clear(). > > "lock cmpxchg8b" has a latency of 20 cycles, while "lock xchg" (with a > memory operand) isn't mentioned in that document. "lock xadd" (with xadd > having 3 cycles less latency than xchg) has a latency of 11, so we can > assume a latency of 14 for "lock xchg". > > Signed-off-by: Juergen Gross Reviewed-by: Jan Beulich with one further remark: > @@ -150,10 +152,7 @@ static inline pte_t native_ptep_get_and_clear(pte_t *ptep) > { > pte_t res; > > - /* xchg acts as a barrier before the setting of the high bits */ > - res.pte_low = xchg(&ptep->pte_low, 0); > - res.pte_high = ptep->pte_high; > - ptep->pte_high = 0; > + res.pte = (pteval_t)arch_atomic64_xchg((atomic64_t *)ptep, 0); Is the cast on the return value really needed here? Jan