Received: by 2002:a25:7ec1:0:0:0:0:0 with SMTP id z184csp1359735ybc; Tue, 12 Nov 2019 19:44:56 -0800 (PST) X-Google-Smtp-Source: APXvYqxB2hWbPd0Tgwz+WrpK4gaSZCxtYui+jVgM90rEKQ86X5SJsnYex2E94Z/EcTo4hWkP1Ccy X-Received: by 2002:aa7:d908:: with SMTP id a8mr1273926edr.173.1573616696269; Tue, 12 Nov 2019 19:44:56 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1573616696; cv=none; d=google.com; s=arc-20160816; b=CO7fDFMcHZ5XTqhy7MN4UViqGd1cGXda5+IurxLRFo7+qiy7Piub/tQebLo+tdf3N8 R21/gR9B1D8JiYrWqPkzfR/LaK8HTpOf0mJgoZ1l+naENcxpOhdEPbrWJr+tbfq3TDXE QQpQLD11ACl+GMOiyo8jfHWHV+Bl2UUkxdRG0L2YasTeyhWbWdeyTI35IsWtsSeLfidy Z2X6f9UCVVIY+hT5/OU+YH5Ap+sJURf8LYbzmQlR5X5sv6pmKYYnK27CtHuTlCrnavXt yNZ66kHfVpPwz+Fc0CqbHxIMacXdamifyhgYpY1hVnK7k2uA3JjwlMFT2W59wEbh/F4F kxtQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:mime-version:user-agent:date:message-id:subject :from:to; bh=mhFOQrs52gNehsadgmrnN84NgZUItbCxCVf7dGI6vcU=; b=tWHZP8/DTYDTu9aTkFfD6Q6eyWUp4SBxvIS+QhtIWj1KP9PacUQHXSjq7I1+2qOsqy g4rpyKmv0+nU5etD0nLnpwaANpzj9uRtLjexhueHMIXfqp1n8ZJGpWJOGmJnexGgldrt ZCGc/pEnhipj5MVZ8kUkrAqXSlePnf2nIAip0iBirogonzRYG6plPmfJYZSqCjqunjUZ IkbNC7l1yY8Y9/zEtToke4bmeJlPBqhfM6ag8HiXTtCZvoiLJ1IpMrFqJuSMDOvb8Q9Z h9wv45a6MINiVpvG+oP/QePTE0MRWdaNvx36fjlnlnen6F4UfUvg9l4KtFq0lSlVM/Dq 3CJQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id b10si464773eds.63.2019.11.12.19.44.31; Tue, 12 Nov 2019 19:44:56 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727528AbfKMDno (ORCPT + 99 others); Tue, 12 Nov 2019 22:43:44 -0500 Received: from out30-130.freemail.mail.aliyun.com ([115.124.30.130]:55702 "EHLO out30-130.freemail.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726994AbfKMDno (ORCPT ); Tue, 12 Nov 2019 22:43:44 -0500 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R131e4;CH=green;DM=||false|;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01e04423;MF=yun.wang@linux.alibaba.com;NM=1;PH=DS;RN=16;SR=0;TI=SMTPD_---0ThxeMyX_1573616617; Received: from testdeMacBook-Pro.local(mailfrom:yun.wang@linux.alibaba.com fp:SMTPD_---0ThxeMyX_1573616617) by smtp.aliyun-inc.com(127.0.0.1); Wed, 13 Nov 2019 11:43:38 +0800 To: Ingo Molnar , Peter Zijlstra , Juri Lelli , Vincent Guittot , Dietmar Eggemann , Steven Rostedt , Ben Segall , Mel Gorman , Luis Chamberlain , Kees Cook , Iurii Zaikin , =?UTF-8?Q?Michal_Koutn=c3=bd?= , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, linux-doc@vger.kernel.org, "Paul E. McKenney" From: =?UTF-8?B?546L6LSH?= Subject: [PATCH 0/3] sched/numa: introduce advanced numa statistic Message-ID: <743eecad-9556-a241-546b-c8a66339840e@linux.alibaba.com> Date: Wed, 13 Nov 2019 11:43:37 +0800 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:60.0) Gecko/20100101 Thunderbird/60.9.0 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Modern production environment could use hundreds of cgroup to control the resources for different workloads, along with the complicated resource binding. On NUMA platforms where we have multiple nodes, things become even more complicated, we hope there are more local memory access to improve the performance, and NUMA Balancing keep working hard to achieve that, however, wrong memory policy or node binding could easily waste the effort, result a lot of remote page accessing. We need to perceive such problems, then we got chance to fix it before there are too much damages, however, there are no good approach yet to help catch the mouse who introduced the remote access. This patch set is trying to fill in the missing pieces, by introduce the per-cgroup NUMA locality/exectime statistics, and expose the per-task page migration failure counter, with these statistics, we could achieve the daily monitoring on NUMA efficiency, to give warning when things going too wrong. Please check the third patch for more details. Thanks to Peter, Mel and Michal for the good advices :-) Michael Wang (3): sched/numa: advanced per-cgroup numa statistic sched/numa: expose per-task pages-migration-failure counter sched/numa: documentation for per-cgroup numa stat Documentation/admin-guide/cg-numa-stat.rst | 161 ++++++++++++++++++++++++ Documentation/admin-guide/kernel-parameters.txt | 4 + Documentation/admin-guide/sysctl/kernel.rst | 9 ++ include/linux/sched.h | 18 ++- include/linux/sched/sysctl.h | 6 + init/Kconfig | 9 ++ kernel/sched/core.c | 91 ++++++++++++++ kernel/sched/debug.c | 1 + kernel/sched/fair.c | 33 +++++ kernel/sched/sched.h | 17 +++ kernel/sysctl.c | 11 ++ 11 files changed, 359 insertions(+), 1 deletion(-) create mode 100644 Documentation/admin-guide/cg-numa-stat.rst -- 2.14.4.44.g2045bb6