Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp7467776imu; Mon, 3 Dec 2018 13:26:56 -0800 (PST) X-Google-Smtp-Source: AFSGD/UUojBDtKTpjJIeN60iokT2d4z1BQsjTT9BaUleTVzBKy32VMrBmYHOF8iguY7MJ1CEWRUc X-Received: by 2002:a17:902:f44:: with SMTP id 62mr17735583ply.38.1543872416509; Mon, 03 Dec 2018 13:26:56 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543872416; cv=none; d=google.com; s=arc-20160816; b=DHDER2ebie1KOgsxcx5OtnqjRWqJ/m6duXqwNQqCdvzfq1xwVl2RQcfXTDYPXOFkHN LKNxyMkxyHGsVP0h1dsBxeZEGJBcCFHtsJCIKNaNrL68ONzJ0SJ8k60w5CQX2DJE3r5P HgNp9+kqQFVqD79qB8FnHu0UzCYYe/Da1qoRVF3uBTuwnTHfe+7YjRrfXbaOCC2mvB8m 8CK83FkDuWf/h4SqGIyMZoHqaMKzHEcMiA6at2FxWUG7bCkJ4gJngjFkkaoBFkjM6K71 Uj7rh8mYrWXz6/4eNvrtHWJa/zJff3pY4lnXPzxFOrXvTuE5QmUFwkVHkuGKL0eq/RVF EQWA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=b588KdWmD0QNUzACJ4IDZ/2nwdRanor8adhooX0Qluk=; b=pIh/KOp4aoyU0VXh5a+iN8kNy3dz3p6Q7c/1wseBVvae4ofuL9izMlvHauPqc19JJV JNe1YXhbMT3RHZSZ36Uj8XA07UOPfcinwn8uNBmg4lzeY/d6mnQzJF7o1Nt7tEKHOemq fLYvcWntiyaMek/4Zc30JMHlpB3H5wMUdAWEBY/k8Ux6n207APjF7ZyVmPmgepV9Rc04 YVfuROuUpNGH7i/VOdUyoGDKBV5EM/39V/paGQ3p+E0qXnUN4Ie/i3AGWwWWCdfNR3/7 hLiBr9ZNAETfWgWSyUJoGtiF1iUEJStYP0gn/eVAWkWpPIFO7dfGAbJu4X8y3PT64GzE 7tZg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x1si15606474plb.366.2018.12.03.13.26.40; Mon, 03 Dec 2018 13:26:56 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1725999AbeLCVZo (ORCPT + 99 others); Mon, 3 Dec 2018 16:25:44 -0500 Received: from mx2.suse.de ([195.135.220.15]:38686 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725903AbeLCVZn (ORCPT ); Mon, 3 Dec 2018 16:25:43 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id CA236B059; Mon, 3 Dec 2018 21:25:41 +0000 (UTC) Date: Mon, 3 Dec 2018 22:25:39 +0100 From: Michal Hocko To: David Rientjes Cc: Linus Torvalds , ying.huang@intel.com, Andrea Arcangeli , s.priebe@profihost.ag, mgorman@techsingularity.net, Linux List Kernel Mailing , alex.williamson@redhat.com, lkp@01.org, kirill@shutemov.name, Andrew Morton , zi.yan@cs.rutgers.edu, Vlastimil Babka Subject: Re: [LKP] [mm] ac5b2c1891: vm-scalability.throughput -61.3% regression Message-ID: <20181203212539.GR31738@dhcp22.suse.cz> References: <20181127205737.GI16136@redhat.com> <87tvk1yjkp.fsf@yhuang-dev.intel.com> <20181203181456.GK31738@dhcp22.suse.cz> <20181203183050.GL31738@dhcp22.suse.cz> <20181203185954.GM31738@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon 03-12-18 12:39:34, David Rientjes wrote: > On Mon, 3 Dec 2018, Michal Hocko wrote: > > > I have merely said that a better THP locality needs more work and during > > the review discussion I have even volunteered to work on that. There > > are other reclaim related fixes under work right now. All I am saying > > is that MADV_TRANSHUGE having numa locality implications cannot satisfy > > all the usecases and it is particurarly KVM that suffers from it. > > I think extending functionality so thp can be allocated remotely if truly > desired is worthwhile This is a complete NUMA policy antipatern that we have for all other user memory allocations. So far you have to be explicit for your numa requirements. You are trying to conflate NUMA api with MADV and that is just conflating two orthogonal things and that is just wrong. Let's put the __GFP_THISNODE issue aside. I do not remember you confirming that __GFP_COMPACT_ONLY patch is OK for you (sorry it might got lost in the emails storm from back then) but if that is the only agreeable solution for now then I can live with that. __GFP_NORETRY hack was shown to not work properly by Mel AFAIR. Again if I misremember then I am sorry and I can live with that. But conflating MADV_TRANSHUGE with an implicit numa placement policy and/or adding an opt-in for remote NUMA placing is completely backwards and a broken API which will likely bites us later. I sincerely hope we are not going to repeat mistakes from the past. -- Michal Hocko SUSE Labs