Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933079AbaJXQWp (ORCPT ); Fri, 24 Oct 2014 12:22:45 -0400 Received: from bedivere.hansenpartnership.com ([66.63.167.143]:58498 "EHLO bedivere.hansenpartnership.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751510AbaJXQWn (ORCPT ); Fri, 24 Oct 2014 12:22:43 -0400 Message-ID: <1414167761.19984.17.camel@jarvis.lan> Subject: Re: [PATCH V2 1/2] mm: Update generic gup implementation to handle hugepage directory From: James Bottomley To: Benjamin Herrenschmidt Cc: David Miller , akpm@linux-foundation.org, aneesh.kumar@linux.vnet.ibm.com, steve.capper@linaro.org, aarcange@redhat.com, mpe@ellerman.id.au, linux-mm@kvack.org, linux-kernel@vger.kernel.org, linuxppc-dev@lists.ozlabs.org, linux-arch@vger.kernel.org, hannes@cmpxchg.org Date: Fri, 24 Oct 2014 09:22:41 -0700 In-Reply-To: <1414107635.364.91.camel@pasglop> References: <1413520687-31729-1-git-send-email-aneesh.kumar@linux.vnet.ibm.com> <20141022160224.9c2268795e55d5a2eff5b94d@linux-foundation.org> <20141023.184035.388557314666522484.davem@davemloft.net> <1414107635.364.91.camel@pasglop> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.10.4 Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 2014-10-24 at 10:40 +1100, Benjamin Herrenschmidt wrote: > On Thu, 2014-10-23 at 18:40 -0400, David Miller wrote: > > Hey guys, was looking over the generic GUP while working on a sparc64 > > issue and I noticed that you guys do speculative page gets, and after > > talking with Johannes Weiner (CC:'d) about this we don't see how it > > could be necessary. > > > > If interrupts are disabled during the page table scan (which they > > are), no IPI tlb flushes can arrive. Therefore any removal from the > > page tables is guarded by interrupts being re-enabled. And as a > > result, page counts of pages we see in the page tables must always > > have a count > 0. > > > > x86 does direct atomic_add() on &page->_count because of this > > invariant and I would rather see the generic version do this too. > > This is of course only true of archs who use IPIs for TLB flushes, so if > we are going down the path of not being speculative, powerpc would have > to go back to doing its own since our broadcast TLB flush means we > aren't protected (we are only protected vs. the page tables themselves > being freed since we do that via sched RCU). > > AFAIK, ARM also broadcasts TLB flushes... Parisc does this. As soon as one CPU issues a TLB purge, it's broadcast to all the CPUs on the inter-CPU bus. The next instruction isn't executed until they respond. But this is only for our CPU TLB. There's no other external consequence, so removal from the page tables isn't effected by this TLB flush, therefore the theory on which Dave bases the change to atomic_add() should work for us (of course, atomic_add is lock add unlock on our CPU, so it's not going to be of much benefit). James > Another option would be to make the generic code use something defined > by the arch to decide whether to use speculative get or > not. I like the idea of keeping the bulk of that code generic... > > Cheers, > Ben. > > > -- > > To unsubscribe, send a message with 'unsubscribe linux-mm' in > > the body to majordomo@kvack.org. For more info on Linux MM, > > see: http://www.linux-mm.org/ . > > Don't email: email@kvack.org > > > -- > To unsubscribe, send a message with 'unsubscribe linux-mm' in > the body to majordomo@kvack.org. For more info on Linux MM, > see: http://www.linux-mm.org/ . > Don't email: email@kvack.org > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/