Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp1065258ybv; Sat, 22 Feb 2020 22:13:51 -0800 (PST) X-Google-Smtp-Source: APXvYqxFayj25gWNRbA3+M4nYY84eM5h2tsSWziGgaMtSrXj7BGGTsLtlTdWPyB2GrE7CbRep8s/ X-Received: by 2002:aca:aa0e:: with SMTP id t14mr8647626oie.149.1582438431136; Sat, 22 Feb 2020 22:13:51 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1582438431; cv=none; d=google.com; s=arc-20160816; b=Y2nfQ3PLK86Bkh93oHImzsNHyjL/tUBbaDqLUv39cOOanub/yWF53NbXnGAXIsqMpr XohsEcymAiM3eIN4xMJTSsoW+7Q96dFNs/5RBpggTedeqfRRJnuwNcErohpwEzyEu21x ZVz4KtqtRIzhz1bh+iAp3MDsBGPD99FLO2dR9yIj716UHx+eszSTQJ+5PS+B5nbu4MBJ MnMtmzoelAg7+3mqL8K6c0lToUXb7CSrcHRwm6SHD3d7rSZqybR2qWNqvDCMF2ICQj4+ WvjeFXpiSQPkfCtX7Y+vqQwlfhZ7PjMVuZrXoa8bNFYwaKN5P1yeoJDVdaKcKMkQZsX3 L+uw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date:from :references:cc:to:subject; bh=ArGyf8KcgZE6hhMHtXTumUa/G/FdQoPAWN1ucAvrydc=; b=nFTsdL6rtl5Im/P07my9oTHC/Kzau6qjqEF8C/QmqVlm38ZYr6sS7J5e4vBZhYzk2I /1M0Ut1yct+lT1m7F97cAEYEnq0gBOfmiBsiVVZQF3xr9IMB4e/CCBrf/R159FiTSDUS YmdidpuWTA8b0up9EozkI46a9fTEsCI0a2LAR7Wzsdg00uE+VhDnVznMJiRSt1iJbOBU jjyeu6aBcsKzCESpMGSNaniWYrgnm3EN4Jos/YdTrcrP3KT1AowDwaGVoIECpzB32hTr FBvczemMk7UB+8rYK6TZCn0nQoPc1CmOdGQ3wqrEAurLhGG/hWD+URG3gwmQpZv4NzzS vW/g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x12si4014637oto.194.2020.02.22.22.13.07; Sat, 22 Feb 2020 22:13:51 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726277AbgBWGIj (ORCPT + 99 others); Sun, 23 Feb 2020 01:08:39 -0500 Received: from mx0a-001b2d01.pphosted.com ([148.163.156.1]:26753 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725978AbgBWGIj (ORCPT ); Sun, 23 Feb 2020 01:08:39 -0500 Received: from pps.filterd (m0098393.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id 01N64rkU074664 for ; Sun, 23 Feb 2020 01:08:38 -0500 Received: from e06smtp02.uk.ibm.com (e06smtp02.uk.ibm.com [195.75.94.98]) by mx0a-001b2d01.pphosted.com with ESMTP id 2yax36dbeg-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Sun, 23 Feb 2020 01:08:37 -0500 Received: from localhost by e06smtp02.uk.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Sun, 23 Feb 2020 06:08:35 -0000 Received: from b06cxnps3074.portsmouth.uk.ibm.com (9.149.109.194) by e06smtp02.uk.ibm.com (192.168.101.132) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Sun, 23 Feb 2020 06:08:30 -0000 Received: from d06av22.portsmouth.uk.ibm.com (d06av22.portsmouth.uk.ibm.com [9.149.105.58]) by b06cxnps3074.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id 01N68TJm40173634 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Sun, 23 Feb 2020 06:08:29 GMT Received: from d06av22.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 4BE604C052; Sun, 23 Feb 2020 06:08:29 +0000 (GMT) Received: from d06av22.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 760224C044; Sun, 23 Feb 2020 06:08:24 +0000 (GMT) Received: from localhost.localdomain (unknown [9.102.3.81]) by d06av22.portsmouth.uk.ibm.com (Postfix) with ESMTP; Sun, 23 Feb 2020 06:08:24 +0000 (GMT) Subject: Re: [PATCH v4 0/5] remove runnable_load_avg and improve group_classify To: Vincent Guittot , mingo@redhat.com, peterz@infradead.org, juri.lelli@redhat.com, dietmar.eggemann@arm.com, rostedt@goodmis.org, bsegall@google.com, mgorman@suse.de, linux-kernel@vger.kernel.org Cc: pauld@redhat.com, valentin.schneider@arm.com, hdanton@sina.com References: <20200221132715.20648-1-vincent.guittot@linaro.org> From: Parth Shah Date: Sun, 23 Feb 2020 11:38:23 +0530 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.4.0 MIME-Version: 1.0 In-Reply-To: <20200221132715.20648-1-vincent.guittot@linaro.org> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit X-TM-AS-GCONF: 00 x-cbid: 20022306-0008-0000-0000-000003559290 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 20022306-0009-0000-0000-00004A76A812 Message-Id: X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.138,18.0.572 definitions=2020-02-22_08:2020-02-21,2020-02-22 signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 mlxscore=0 impostorscore=0 bulkscore=0 spamscore=0 suspectscore=0 mlxlogscore=999 priorityscore=1501 lowpriorityscore=0 clxscore=1015 phishscore=0 malwarescore=0 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2001150001 definitions=main-2002230052 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, On 2/21/20 6:57 PM, Vincent Guittot wrote: > This new version stays quite close to the previous one and should > replace without problems the previous one that part of Mel's patchset: > https://lkml.org/lkml/2020/2/14/156 > > NUMA load balancing is the last remaining piece of code that uses the > runnable_load_avg of PELT to balance tasks between nodes. The normal > load_balance has replaced it by a better description of the current state > of the group of cpus. The same policy can be applied to the numa > balancing. > > Once unused, runnable_load_avg can be replaced by a simpler runnable_avg > signal that tracks the waiting time of tasks on rq. Currently, the state > of a group of CPUs is defined thanks to the number of running task and the > level of utilization of rq. But the utilization can be temporarly low > after the migration of a task whereas the rq is still overloaded with > tasks. In such case where tasks were competing for the rq, the > runnable_avg will stay high after the migration. > > Some hackbench results: > > - small arm64 dual quad cores system > hackbench -l (2560/#grp) -g #grp > > grp tip/sched/core +patchset improvement > 1 1,327(+/-10,06 %) 1,247(+/-5,45 %) 5,97 % > 4 1,250(+/- 2,55 %) 1,207(+/-2,12 %) 3,42 % > 8 1,189(+/- 1,47 %) 1,179(+/-1,93 %) 0,90 % > 16 1,221(+/- 3,25 %) 1,219(+/-2,44 %) 0,16 % > > - large arm64 2 nodes / 224 cores system > hackbench -l (256000/#grp) -g #grp > > grp tip/sched/core +patchset improvement > 1 14,197(+/- 2,73 %) 13,917(+/- 2,19 %) 1,98 % > 4 6,817(+/- 1,27 %) 6,523(+/-11,96 %) 4,31 % > 16 2,930(+/- 1,07 %) 2,911(+/- 1,08 %) 0,66 % > 32 2,735(+/- 1,71 %) 2,725(+/- 1,53 %) 0,37 % > 64 2,702(+/- 0,32 %) 2,717(+/- 1,07 %) -0,53 % > 128 3,533(+/-14,66 %) 3,123(+/-12,47 %) 11,59 % > 256 3,918(+/-19,93 %) 3,390(+/- 5,93 %) 13,47 % [...] I performed similar experiment on IBM POWER9 system with 2 nodes 44 cores system (22 per node) - hackbench -l (256000/#grp) -g #grp +-----+----------------+-------+ | grp | tip/sched/core | v4 | +-----+----------------+-------+ | 1 | 76.97 | 76.31 | | 4 | 56.56 | 56.86 | | 8 | 54.23 | 54.25 | | 16 | 53.94 | 53.24 | | 32 | 54.10 | 54.01 | | 64 | 54.38 | 54.35 | | 128 | 55.11 | 55.08 | | 256 | 55.97 | 56.04 | | 512 | 54.81 | 55.5 | +-----+----------------+-------+ - deviation in the result is very marginal ( < 1%) The results shows no changes with respect to the hackbench. I will do further benchmarking to see if any observable changes occurs. - Parth