Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp229204imu; Tue, 27 Nov 2018 11:28:58 -0800 (PST) X-Google-Smtp-Source: AFSGD/VMBG48HXj6ODiEfwPLj/5BYojrAQ+pVPq+b9tgvQszU7dvo0qREfFfeEw+gGwNEqXwIS++ X-Received: by 2002:a63:ff62:: with SMTP id s34mr30549054pgk.325.1543346938910; Tue, 27 Nov 2018 11:28:58 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543346938; cv=none; d=google.com; s=arc-20160816; b=Rg2P+BrwsHrf/WbxB4YfaJraEfVg9xXpNFGNj3qINT+YKbtm4daL//7VHUpgeNucqv XvFvbmE76DDfCmAFvg7ywBx/T52Tv2VWDrZ0Y/HMMuaEfoM+Dp7sKoBskdvuePeFnloK 1vZGA5nDSTxwKC/qjhhJYJ7BeqOeFW5pQgR5XeUunGVzBerhFMWczOecprF4D4qe42OB xbQ8sNebpx5LqpI1A5svRsUCCDeui5p8a2ddF8qLC5Gf++w+OJES5Z+lsLWcmKX1Vwes cfXYE0CcSOV8J+VGxWCLcmAtcL8323xrTh9mgYHgib+y59uIFXAU5w02kAI4LbDs/1Wu dajg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=+knI/GcDMDon3/4HwnDhCQ0nU9jWcQ4J6twFQWFeFvo=; b=G7Oo/D83BrkA4QxYElKrU8Y11jxin/XVlxgYPOuOmvILaV4iRT0IXkWYdxjuLd6J9N oep1azpJThKbqJ5lhiw3W4ls7ij4cjchNtIeuxwNfqPjYopAKDSYvZ8wqjsiMiGJsiyp NaE4I6UG+ZKz2b5PuOwv1SraByQ7lHYmBVhuJvl8IVjkqWAAuKUx4lWd3jpJz/1yG86l SvKmvfax6Q5b4QTmfPoT5KPJqPwVwIkcvF4LIRnFtoWeGpBihfq/hntyRXQC/tGIOlS4 E4moMmKpCcdi1O9DOkbLyc7/hznADFNFhftX0sMrBg+UY4/llwLwrX1d/eZn+ZVZBaUr 0DNA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 26si4458193pgq.402.2018.11.27.11.28.43; Tue, 27 Nov 2018 11:28:58 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726780AbeK1FQP (ORCPT + 99 others); Wed, 28 Nov 2018 00:16:15 -0500 Received: from mx2.suse.de ([195.135.220.15]:49492 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725740AbeK1FQP (ORCPT ); Wed, 28 Nov 2018 00:16:15 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 771EDAFF6; Tue, 27 Nov 2018 18:17:29 +0000 (UTC) Date: Tue, 27 Nov 2018 19:17:27 +0100 From: Michal Hocko To: Linus Torvalds Cc: rong.a.chen@intel.com, Andrea Arcangeli , s.priebe@profihost.ag, alex.williamson@redhat.com, mgorman@techsingularity.net, zi.yan@cs.rutgers.edu, Vlastimil Babka , rientjes@google.com, kirill@shutemov.name, Andrew Morton , Linux List Kernel Mailing , lkp@01.org Subject: Re: [LKP] [mm] ac5b2c1891: vm-scalability.throughput -61.3% regression Message-ID: <20181127181727.GD6923@dhcp22.suse.cz> References: <20181127062503.GH6163@shao2-debian> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 27-11-18 09:08:50, Linus Torvalds wrote: > On Mon, Nov 26, 2018 at 10:24 PM kernel test robot > wrote: > > > > FYI, we noticed a -61.3% regression of vm-scalability.throughput due > > to commit ac5b2c18911f ("mm: thp: relax __GFP_THISNODE for > > MADV_HUGEPAGE mappings") > > Well, that's certainly noticeable and not good. > > Andrea, I suspect it might be causing fights with auto numa migration.. > > Lots more system time, but also look at this: > > > 1122389 ? 9% +17.2% 1315380 ? 4% proc-vmstat.numa_hit > > 214722 ? 5% +21.6% 261076 ? 3% proc-vmstat.numa_huge_pte_updates > > 1108142 ? 9% +17.4% 1300857 ? 4% proc-vmstat.numa_local > > 145368 ? 48% +63.1% 237050 ? 17% proc-vmstat.numa_miss > > 159615 ? 44% +57.6% 251573 ? 16% proc-vmstat.numa_other > > 185.50 ? 81% +8278.6% 15542 ? 40% proc-vmstat.numa_pages_migrated > > Should the commit be reverted? Or perhaps at least modified? Well, the commit is trying to revert to the behavior before 5265047ac301 because there are real usecases that suffered from that change and bug reports as a result of that. will-it-scale is certainly worth considering but it is an artificial testcase. A higher NUMA miss rate is an expected side effect of the patch because the fallback to a different NUMA node is more likely. The __GFP_THISNODE side effect is basically introducing node-reclaim behavior for THPages. Another thing is that there is no good behavior for everybody. Reclaim locally vs. THP on a remote node is hard to tell by default. We have discussed that at length and there were some conclusions. One of them is that we need a numa policy to tell whether a expensive localility is preferred over remote allocation. Also we definitely need a better pro-active defragmentation to allow larger pages on a local node. This is a work in progress and this patch is a stop gap fix. -- Michal Hocko SUSE Labs