Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp5522185ybi; Tue, 30 Jul 2019 22:40:57 -0700 (PDT) X-Google-Smtp-Source: APXvYqydrHLefCjKhtKFnn5nd1L4Y5xLOwuMQMnPnAKJ1cD0mxz+y7EmLvZBCcLwAdSLrapZCoag X-Received: by 2002:a17:902:42d:: with SMTP id 42mr113250942ple.228.1564551657726; Tue, 30 Jul 2019 22:40:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1564551657; cv=none; d=google.com; s=arc-20160816; b=dzd3nH+B7LGwD6IbNyOLjGYSk3mxw4VitSkMXmcYWYjEsu8q5bRo/vtjsTk8hqzI0j SKhtEw/487X5sE+9x8yIf4pBjhJ0mvpfaIooCE1DV1A8ucfwqo05BgMxnwojBf+hd1cg OVWVnO4NlGvsQnFItVNrD+iBARn44+oMvVe5o2nLm4tSq0YduF3bXG+Kfz9tqx1iOZLB 0ftWdfPz9ZMcEN0r2jkKjBYUDS8DZqpWOTiN0HjrKQhzTi/9D4j00j9SEi84mdmxm6kr SIUHuiihSG2pBjUhHOhFc+JmHptcTjooWnenBei0+91XKKuxt57mhPETGMwb5hAAIoft RhRQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=5sElAW/ZHxWZQpZNUm78L6hw0vnYJqfHGzsrzIzjN6Y=; b=C7fARAtcSXQZemvQ91VjxZ+yDYWXmC/pP0hXjdQXa2hnm+EUIkf9WvQl7Q3cr7EC4n w6x0mIbYy4zwQEdPSU8M5nIdHgw/z2Vq0lXJ5LUmnxiltaKgmY2VrnDm0R6witlveyzX kCJLcv1ELc/gPqjE7r3WFSXP72b3H3KYJ15iYwVwqUT0VJ7342rpfLbMPI5QJ2oHxmpV aGhZZ7jSHsubMnJK+ZppTOpOKKMNqlhDKDIZBC5kOE5ugvQj3A169bf9kj6Z6hnFezZ6 xnHrwudOjcPQHJ+b1Vj9F0LgceQtZlDLPlwwn/SySg6xXRIjB6cB66z7+DShbQjdcE4w JGow== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id bi3si26916842plb.226.2019.07.30.22.40.43; Tue, 30 Jul 2019 22:40:57 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727875AbfGaCmU (ORCPT + 99 others); Tue, 30 Jul 2019 22:42:20 -0400 Received: from mga05.intel.com ([192.55.52.43]:7466 "EHLO mga05.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727714AbfGaCmU (ORCPT ); Tue, 30 Jul 2019 22:42:20 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga003.fm.intel.com ([10.253.24.29]) by fmsmga105.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 30 Jul 2019 19:42:19 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.64,328,1559545200"; d="scan'208";a="180086832" Received: from cli6-desk1.ccr.corp.intel.com (HELO [10.239.161.118]) ([10.239.161.118]) by FMSMGA003.fm.intel.com with ESMTP; 30 Jul 2019 19:42:16 -0700 Subject: Re: [RFC PATCH v3 00/16] Core scheduling v3 To: Julien Desfossez , Aaron Lu Cc: Aubrey Li , Subhra Mazumdar , Vineeth Remanan Pillai , Nishanth Aravamudan , Peter Zijlstra , Tim Chen , Ingo Molnar , Thomas Gleixner , Paul Turner , Linus Torvalds , Linux List Kernel Mailing , =?UTF-8?B?RnLDqWTDqXJpYyBXZWlzYmVja2Vy?= , Kees Cook , Greg Kerr , Phil Auld , Valentin Schneider , Mel Gorman , Pawan Gupta , Paolo Bonzini References: <20190531210816.GA24027@sinkpad> <20190606152637.GA5703@sinkpad> <20190612163345.GB26997@sinkpad> <635c01b0-d8f3-561b-5396-10c75ed03712@oracle.com> <20190613032246.GA17752@sinkpad> <20190619183302.GA6775@sinkpad> <20190718100714.GA469@aaronlu> <20190725143003.GA992@aaronlu> <20190726152101.GA27884@sinkpad> From: "Li, Aubrey" Message-ID: <7dc86e3c-aa3f-905f-3745-01181a3b0dac@linux.intel.com> Date: Wed, 31 Jul 2019 10:42:15 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.1.1 MIME-Version: 1.0 In-Reply-To: <20190726152101.GA27884@sinkpad> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/7/26 23:21, Julien Desfossez wrote: > On 25-Jul-2019 10:30:03 PM, Aaron Lu wrote: >> >> I tried a different approach based on vruntime with 3 patches following. > [...] > > We have experimented with this new patchset and indeed the fairness is > now much better. Interactive tasks with v3 were complete starving when > there were cpu-intensive tasks running, now they can run consistently. Yeah, the fairness is much better now. For two cgroups created, I limited the cpuset to be one core(two siblings) of these two cgroups, I still run gemmbench and sysbench-mysql, and here is the mysql result: Latency: .----------------------------------------------------------------------------------------------. |NA/AVX vanilla-SMT [std% / sem%] cpu% |coresched-SMT [std% / sem%] +/- cpu% | |----------------------------------------------------------------------------------------------| | 1/1 6.7 [13.8%/ 1.4%] 2.1% | 6.4 [14.6%/ 1.5%] 4.0% 2.0% | | 2/2 9.1 [ 5.0%/ 0.5%] 4.0% | 11.4 [ 6.8%/ 0.7%] -24.9% 3.9% | '----------------------------------------------------------------------------------------------' Throughput: .----------------------------------------------------------------------------------------------. |NA/AVX vanilla-SMT [std% / sem%] cpu% |coresched-SMT [std% / sem%] +/- cpu% | |----------------------------------------------------------------------------------------------| | 1/1 310.2 [ 4.1%/ 0.4%] 2.1% | 296.2 [ 5.0%/ 0.5%] -4.5% 2.0% | | 2/2 547.7 [ 3.6%/ 0.4%] 4.0% | 368.3 [ 4.8%/ 0.5%] -32.8% 3.9% | '----------------------------------------------------------------------------------------------' Note: 2/2 case means 4 threads run on one core, which is overloaded.(cpu% is overall system report) Though the latency/throughput has regressions, but standard deviation is much better now. > With my initial test of TPC-C running in large VMs with a lot of > background noise VMs, the results are pretty similar to v3, I will run > more thorough tests and report the results back here. I see something similar. I guess task placement could be another problem. We don't check cookie matching in load balance and task wakeup, so - if tasks with different cookie happen to be dispatched onto different cores, The result should be good - if tasks with different cookie are unfortunately dispatched onto the same core, the result should be bad. This problem is bypassed in my testing setup above, but may be one cause of my other scenarios, need a while to sort out. Thanks, -Aubrey