Date: Tue, 20 Dec 2016 12:31:13 +1000
From: Nicholas Piggin
To: Dave Hansen
Cc: Linus Torvalds, Bob Peterson, Linux Kernel Mailing List, swhiteho@redhat.com, luto@kernel.org, agruenba@redhat.com, peterz@infradead.org, mgorman@techsingularity.net, linux-mm@kvack.org
Subject: Re: [RFC][PATCH] make global bitlock waitqueues per-node
Message-ID: <20161220123113.1e1de7b0@roar.ozlabs.ibm.com>
In-Reply-To: <156a5b34-ad3b-d0aa-83c9-109b366c1bdf@linux.intel.com>
References: <20161219225826.F8CB356F@viggo.jf.intel.com> <156a5b34-ad3b-d0aa-83c9-109b366c1bdf@linux.intel.com>
Organization: IBM

On Mon, 19 Dec 2016 16:20:05 -0800
Dave Hansen wrote:

> On 12/19/2016 03:07 PM, Linus Torvalds wrote:
> > +wait_queue_head_t *bit_waitqueue(void *word, int bit)
> > +{
> > +	const int __maybe_unused nid = page_to_nid(virt_to_page(word));
> > +
> > +	return __bit_waitqueue(word, bit, nid);
> >
> > No can do. Part of the problem with the old code was that it did that
> > virt_to_page() crud. That doesn't work with the virtually mapped stack.
>
> Ahhh, got it.
>
> So, what did you have in mind?  Just redirect bit_waitqueue() to the
> "first_online_node" waitqueues?
>
> wait_queue_head_t *bit_waitqueue(void *word, int bit)
> {
> 	return __bit_waitqueue(word, bit, first_online_node);
> }
>
> We could do some fancy stuff like only do virt_to_page() for things in
> the linear map, but I'm not sure we'll see much of a gain for it.  None
> of the other waitqueue users look as pathological as the 'struct page'
> ones.  Maybe:
>
> wait_queue_head_t *bit_waitqueue(void *word, int bit)
> {
> 	int nid;
> 	if (word >= VMALLOC_START) /* all addrs not in linear map */
> 		nid = first_online_node;
> 	else
> 		nid = page_to_nid(virt_to_page(word));
> 	return __bit_waitqueue(word, bit, nid);
> }

I think he meant just make the page_waitqueue do the per-node thing
and leave bit_waitqueue as the global bit.

It would be cool if CPUs had an instruction that translates an address
though. You could avoid all that lookup and just do it with the TLB :)

Thanks,
Nick
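
For illustration, the split suggested above (per-node tables for
page_waitqueue, one global table for bit_waitqueue) could look roughly
like the userspace model below. This is only a sketch and not kernel
code: the *_model names, NR_NODES, WAIT_TABLE_BITS, the stand-in struct
definitions and the hash are all assumptions made for the example. The
point it shows is that the page path can take the node from the page
itself, while the bit path never needs virt_to_page() and so does not
care whether the word lives on a virtually mapped stack.

```c
/* Userspace model only; every name here is hypothetical, not a kernel API. */
#include <assert.h>
#include <stdint.h>

#define NR_NODES        2                     /* assumed node count */
#define WAIT_TABLE_BITS 4                     /* assumed: 16 queues per table */
#define WAIT_TABLE_SIZE (1u << WAIT_TABLE_BITS)

struct wait_queue_head { int unused; };       /* stand-in for wait_queue_head_t */
struct page { int nid; };                     /* stand-in; stores its node directly */

/* One table per node for page waiters, one global table for everything else. */
static struct wait_queue_head node_wait_table[NR_NODES][WAIT_TABLE_SIZE];
static struct wait_queue_head global_wait_table[WAIT_TABLE_SIZE];

/* Multiplicative hash: keep the top WAIT_TABLE_BITS bits of the product. */
unsigned int hash_val_model(uint64_t val)
{
	return (unsigned int)((val * 0x9E3779B97F4A7C15ull) >> (64 - WAIT_TABLE_BITS));
}

/* Page waiters: the node comes from the page, so no address translation. */
struct wait_queue_head *page_waitqueue_model(const struct page *page)
{
	assert(page->nid >= 0 && page->nid < NR_NODES);
	return &node_wait_table[page->nid][hash_val_model((uintptr_t)page)];
}

/* Generic bit waiters: always the global table, whatever mapping 'word' is in. */
struct wait_queue_head *bit_waitqueue_model(const void *word, int bit)
{
	uint64_t val = ((uint64_t)(uintptr_t)word << 6) | (uint64_t)(bit & 63);

	return &global_wait_table[hash_val_model(val)];
}
```

With this shape, only the 'struct page' users pay for (and benefit
from) the per-node lookup; all other callers keep hashing into a single
shared table.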