Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp230014imu; Tue, 27 Nov 2018 11:29:44 -0800 (PST) X-Google-Smtp-Source: AFSGD/WBmWZzGPw1HSYvf+GCRCNrnfXuxAjzkXN6wgpnY5PS3rG/sVwVD9lsEekg3l8sS5Uy/xVL X-Received: by 2002:a17:902:6b09:: with SMTP id o9mr1200141plk.208.1543346984516; Tue, 27 Nov 2018 11:29:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543346984; cv=none; d=google.com; s=arc-20160816; b=hjygUAEeiCk7Iq8X+prCbcHwQ7Fy8r0vY5YPUi4ZK//m/A2WD2skBxFFd6mn5ZFb+b UJGkRThpMdb76+PTnIKAgH3HaWX/3Dk8Do6KZYTMOAmS4wftDpWsJOLCLA9JkQ9atR8w L1K4zAQ5LOs07zhRw7zLO5ozHPqY8jq+gm7WqnnqS5ZDFNWB600s9mk15+XALEMpfGbf nhwQH9rlqgBDxlwyxrBTsBmHfiRIy2xiqXirOIS8zrckGwZh8CHvqGsQOqcqatve+scN R7IjIPNF2I+YBxEvASciz/Bno36x1jWAuKYNAWzyT2loPx3/TNq1tr5Bcjp/l+G12nGm 5yOg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=Z8TqIIxRUohcciK3sE/u2VHIVbMn4lSB8XSq+hiCfHw=; b=ldou/EB+E9avV2m28Mb6s0M8u83u66UUx4wIQra8u3BDVPbqm36HQ4JmczXuWpKgQ2 1d05+/G9f2N9pBDM+Nz/r+G1H8QttfP8XUWfTowrlpt3bEf2jcId5HQ8CWr5p+/H0KJ3 3OUBML6rkDBipdu35qfSzdDkUjfiO/C4BLHrQrUXmmPcF/CuIl4zfVAtzManhP2dOYSI 3qGC0BQTY6EvLymi93A0QQ0froOaoRaHzDoAUhO3KjqF6PUYq9/gEQ4vLmTClMU9mQuE ygf76W/UaMyUZt4szac5L8xUPkWcrAwbHCvCHjbT4cQFCw5+KQl7XQFl9jrlyhWvoWz4 NwGw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r197si5398265pfr.192.2018.11.27.11.29.29; Tue, 27 Nov 2018 11:29:44 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732029AbeK1FUY (ORCPT + 99 others); Wed, 28 Nov 2018 00:20:24 -0500 Received: from mx2.suse.de ([195.135.220.15]:50376 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726234AbeK1FUY (ORCPT ); Wed, 28 Nov 2018 00:20:24 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 4D8C6AFF6; Tue, 27 Nov 2018 18:21:38 +0000 (UTC) Date: Tue, 27 Nov 2018 19:21:37 +0100 From: Michal Hocko To: Linus Torvalds Cc: rong.a.chen@intel.com, Andrea Arcangeli , s.priebe@profihost.ag, alex.williamson@redhat.com, mgorman@techsingularity.net, zi.yan@cs.rutgers.edu, Vlastimil Babka , rientjes@google.com, kirill@shutemov.name, Andrew Morton , Linux List Kernel Mailing , lkp@01.org Subject: Re: [LKP] [mm] ac5b2c1891: vm-scalability.throughput -61.3% regression Message-ID: <20181127182137.GE6923@dhcp22.suse.cz> References: <20181127062503.GH6163@shao2-debian> <20181127181727.GD6923@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20181127181727.GD6923@dhcp22.suse.cz> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 27-11-18 19:17:27, Michal Hocko wrote: > On Tue 27-11-18 09:08:50, Linus Torvalds wrote: > > On Mon, Nov 26, 2018 at 10:24 PM kernel test robot > > wrote: > > > > > > FYI, we noticed a -61.3% regression of vm-scalability.throughput due > > > to commit ac5b2c18911f ("mm: thp: relax __GFP_THISNODE for > > > MADV_HUGEPAGE mappings") > > > > Well, that's certainly noticeable and not good. > > > > Andrea, I suspect it might be causing fights with auto numa migration.. > > > > Lots more system time, but also look at this: > > > > > 1122389 ? 9% +17.2% 1315380 ? 4% proc-vmstat.numa_hit > > > 214722 ? 5% +21.6% 261076 ? 3% proc-vmstat.numa_huge_pte_updates > > > 1108142 ? 9% +17.4% 1300857 ? 4% proc-vmstat.numa_local > > > 145368 ? 48% +63.1% 237050 ? 17% proc-vmstat.numa_miss > > > 159615 ? 44% +57.6% 251573 ? 16% proc-vmstat.numa_other > > > 185.50 ? 81% +8278.6% 15542 ? 40% proc-vmstat.numa_pages_migrated > > > > Should the commit be reverted? Or perhaps at least modified? > > Well, the commit is trying to revert to the behavior before > 5265047ac301 because there are real usecases that suffered from that > change and bug reports as a result of that. > > will-it-scale is certainly worth considering but it is an artificial > testcase. A higher NUMA miss rate is an expected side effect of the > patch because the fallback to a different NUMA node is more likely. The > __GFP_THISNODE side effect is basically introducing node-reclaim > behavior for THPages. Another thing is that there is no good behavior > for everybody. Reclaim locally vs. THP on a remote node is hard to > tell by default. We have discussed that at length and there were some > conclusions. One of them is that we need a numa policy to tell whether > a expensive localility is preferred over remote allocation. Also we > definitely need a better pro-active defragmentation to allow larger > pages on a local node. This is a work in progress and this patch is a > stop gap fix. Btw. the associated discussion is http://lkml.kernel.org/r/20180925120326.24392-1-mhocko@kernel.org -- Michal Hocko SUSE Labs