Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751105AbdFSRLm (ORCPT ); Mon, 19 Jun 2017 13:11:42 -0400 Received: from mail-pg0-f66.google.com ([74.125.83.66]:36379 "EHLO mail-pg0-f66.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750844AbdFSRLj (ORCPT ); Mon, 19 Jun 2017 13:11:39 -0400 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 10.3 \(3273\)) Subject: Re: [PATCHv2 1/3] x86/mm: Provide pmdp_establish() helper From: Nadav Amit In-Reply-To: <20170615145224.66200-2-kirill.shutemov@linux.intel.com> Date: Mon, 19 Jun 2017 10:11:35 -0700 Cc: Andrew Morton , Vlastimil Babka , Vineet Gupta , Russell King , Will Deacon , Catalin Marinas , Ralf Baechle , "David S. Miller" , "Aneesh Kumar K . V" , Martin Schwidefsky , Heiko Carstens , Andrea Arcangeli , linux-arch@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, Ingo Molnar , "H . Peter Anvin" , Thomas Gleixner Message-Id: References: <20170615145224.66200-1-kirill.shutemov@linux.intel.com> <20170615145224.66200-2-kirill.shutemov@linux.intel.com> To: "Kirill A. Shutemov" X-Mailer: Apple Mail (2.3273) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by mail.home.local id v5JHBqgZ001764 Content-Length: 989 Lines: 30 Kirill A. Shutemov wrote: > We need an atomic way to setup pmd page table entry, avoiding races with > CPU setting dirty/accessed bits. This is required to implement > pmdp_invalidate() that doesn't loose these bits. > > On PAE we have to use cmpxchg8b as we cannot assume what is value of new pmd and > setting it up half-by-half can expose broken corrupted entry to CPU. ... > > +#ifndef pmdp_establish > +#define pmdp_establish pmdp_establish > +static inline pmd_t pmdp_establish(pmd_t *pmdp, pmd_t pmd) > +{ > + if (IS_ENABLED(CONFIG_SMP)) { > + return xchg(pmdp, pmd); > + } else { > + pmd_t old = *pmdp; > + *pmdp = pmd; I think you may want to use WRITE_ONCE() here - otherwise nobody guarantees that the compiler will not split writes to *pmdp. Although the kernel uses similar code to setting PTEs and PMDs, I think that it is best to start fixing it. Obviously, you might need a different code path for 32-bit kernels. Regards, Nadav