Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1754041AbcKRX1w (ORCPT );
        Fri, 18 Nov 2016 18:27:52 -0500
Received: from mail-pg0-f51.google.com ([74.125.83.51]:33775 "EHLO
        mail-pg0-f51.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
        with ESMTP id S1752842AbcKRX1S (ORCPT );
        Fri, 18 Nov 2016 18:27:18 -0500
Date: Fri, 18 Nov 2016 15:27:10 -0800 (PST)
From: Hugh Dickins
X-X-Sender: hugh@eggly.anvils
To: Boris Ostrovsky
cc: Hugh Dickins , Mel Gorman , david.vrabel@citrix.com, jgross@suse.com,
        xen-devel@lists.xenproject.org, linux-kernel@vger.kernel.org,
        linux-mm@kvack.org, olaf@aepfle.de
Subject: Re: [PATCH v3 (re-send)] xen/gntdev: Use mempolicy instead of VM_IO
 flag to avoid NUMA balancing
In-Reply-To: <05c24d23-0298-5b58-d0e8-095ba64cdf9b@oracle.com>
Message-ID:
References: <1479413404-27332-1-git-send-email-boris.ostrovsky@oracle.com>
        <2bf041f3-8918-3c6f-8afb-c9edcc03dcd9@oracle.com>
        <05c24d23-0298-5b58-d0e8-095ba64cdf9b@oracle.com>
User-Agent: Alpine 2.11 (LSU 23 2013-08-11)
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org
Content-Length: 2157
Lines: 46

On Fri, 18 Nov 2016, Boris Ostrovsky wrote:
> On 11/18/2016 05:27 PM, Hugh Dickins wrote:
> > On Fri, 18 Nov 2016, Boris Ostrovsky wrote:
> >> On 11/18/2016 04:51 PM, Hugh Dickins wrote:
> >>> Hmm, sorry, but this seems overcomplicated to me: ingenious, but an
> >>> unusual use of the ->get_policy method, which is a little worrying,
> >>> since it has only been used for shmem (+ shm and kernfs) until now.
> >>>
> >>> Maybe I'm wrong, but wouldn't substituting VM_MIXEDMAP for VM_IO
> >>> solve the problem more simply?
> >> It would indeed. I didn't want to use it because it has specific meaning
> >> ("Can contain "struct page" and pure PFN pages") and that didn't seem
> >> like the right flag to describe this vma.
> > It is okay if it contains 0 pure PFN pages; and no worse than VM_IO was.
> > A comment on why VM_MIXEDMAP is being used there would certainly be good.
> > But I do find its use preferable to enlisting an unusual ->get_policy.
>
> OK, I'll set VM_MIXEDMAP then.

Thanks, if it accomplishes what you need, then please do use it.

>
> I am still curious though why you feel get_policy is not appropriate
> here (beside the fact that so far it had limited use). It is essentially
> trying to say that the only policy to be consulted (in vma_policy_mof())
> is of the vma itself and not of the task.

I agree that get_policy is explicitly about NUMA, and so relevant to
the matter of (discouraging) NUMA balancing, without any apology needed.

But there are no other examples of its use that way, it's been something
private to shmem (hence shm and kernfs) up until now: the complement of
set_policy, which implements the mbind() syscall on shmem objects.

Introduce an exceptional new usage, and we're likely to introduce bugs
(not to mention the long history of bugs in mpol_dup() that you also use).
Perhaps I'd find one already if I took the time to study your patch.

Full disclosure: I'm also contemplating a change to its interface,
to handle a possible NUMA interleave issue, so I do need to keep an
eye on all its callers.

If we have to choose between two less-than-ideal solutions,
please let's choose the simplest.

Hugh