Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp3169509ybv; Mon, 24 Feb 2020 19:45:24 -0800 (PST) X-Google-Smtp-Source: APXvYqzt0YuDOF2PXFg9nfW9B6V0um3vr6A3I8pUfRq6tKDsDhKZcjNAJd08ce6PCoXrKwf43y2v X-Received: by 2002:a05:6830:2102:: with SMTP id i2mr41912753otc.123.1582602324750; Mon, 24 Feb 2020 19:45:24 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1582602324; cv=none; d=google.com; s=arc-20160816; b=sSwzFG4fjY1z4L8fITjaiogrF8fCc267Lo0K7bCk2x3iGY82B1gjesyAxcL0wsjAIQ Q0lDGt4yicx/iH1X/EyjC7EIJpXObgSUZgzrhS+hHo30ETgBqaetSXJvGD5qDuYq5w4B hHVKfsyEs3ZnRTbLENyx/W/cuSGyWSUmbzCl3tNH51sY2bs5y0R1Iyr/R6HStTYJkmrU cY4pYykAA14VECB0SrUdf9lSRBXOk/nIlty2bCrAJ1bw3xhDIpeAKlVUPDlderoSAXG/ yZYXq3KNO994B6uF2gdfqgdcdZyVc1ZXng2RsuIikTcFD7h+DRxwOltP1HCwEewMln/p 1VGg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=iKIwG2W+z0cryGHjYQGewymShZk7YQHTShR2retuQ1I=; b=ESHm+gftzlK08aBiF4wJIQ03qbSSb/VsZtR86jpHqidC0DNHJ88BJcaCAXWpKh0ikh SXyY91NUVAYYec+naVVVq9DjIK/+5GhYzxKUDPbP8iIGGR2Sr/5I2xxyEN0HwRBQXPKe IGTO6AEy+2lSaE6sNIz9BNdrCpQb0qiAp67Fs/qvEA1gmVEleaqO6XevKyVYP1fIQXi6 KehIJ9vC+wWjIoYl0uZmtj47l/xtextFMYv0ubqZsfnzUxSOfDroK2H5YTFISy3pigLR 44KCnN2CJqUQLNU4Ug59D6CPNkaohpQm799DKGGj5928yGKBRLQv8l2mN3haxxOoTmmK jlIQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=jRm7Atxw; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o23si4647170oic.127.2020.02.24.19.45.11; Mon, 24 Feb 2020 19:45:24 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=jRm7Atxw; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728776AbgBYDos (ORCPT + 99 others); Mon, 24 Feb 2020 22:44:48 -0500 Received: from mail-pj1-f66.google.com ([209.85.216.66]:36935 "EHLO mail-pj1-f66.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726962AbgBYDos (ORCPT ); Mon, 24 Feb 2020 22:44:48 -0500 Received: by mail-pj1-f66.google.com with SMTP id m13so682216pjb.2 for ; Mon, 24 Feb 2020 19:44:48 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=iKIwG2W+z0cryGHjYQGewymShZk7YQHTShR2retuQ1I=; b=jRm7AtxwnjKep/goXIg5XTWa2/32AFuCOhpGCTfEd8sLFgrK9hVpGj+KkZNynqvPTN LBgjaVSkCIfTc1F2TET0JWQec65b4QgAgNugn+LD0bTWH2phmUDhL1eF1D8kxGxN+9VO eDhV7g+MRyfs0T2HcTqEhASfHrc1dABKZx8ewx90TPx0C7ukjaEb23THARseb3Yn/cIF x6DpmgM1LduRD3O9DFjVYujA9SKU8rE00w00hH9qg1gA6GbznIxs1LIyHC/T6bm73qWL VEhprIM1vFqnKS+0uU5+8uwYaLKl5zROnnLjfQg5RKK4Pf+miq4xeRr5KvbjoVFlyL9H z+Lg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=iKIwG2W+z0cryGHjYQGewymShZk7YQHTShR2retuQ1I=; b=OfwfMEpct9KjoAiam38AwCtNFtJkPpOJd5p912zxb8v9YrEf5rEoBfo0xjTMABw6BP ltSHgqVlpvX60HjBOME3WoZ0uLR0ylLcdfT/qZ3URCTRp95tcE46hS00WyMlym5rcSE0 Ik5U8PueerTh9dVtHs2sru8Rx54TP6n1Z/rjsQlSLk3YFLMle+wk2ji0dSGzoFQXS4tf aAS0tLZhLSTBR7BZMbO0DThAJrNZ74PMEQzQozM9CFKz5eYqL41b/RQOVxuGFhyMQ5qt r0qUmYyed70+3yn+5kgESO1V9rwkJvbtW6ySrxdn4bI5BXqDJ+DPxWuopafH1CVfGmjL QkXg== X-Gm-Message-State: APjAAAVDMY5iH+aS2IpicYKBx07UOSZBAExLVUKdi9MonC0SAEw69J1V MCeqBNOg+3e4faQWaKxzFZg= X-Received: by 2002:a17:90a:8a98:: with SMTP id x24mr2843092pjn.113.1582602287660; Mon, 24 Feb 2020 19:44:47 -0800 (PST) Received: from ziqianlu-desktop.localdomain ([47.89.83.64]) by smtp.gmail.com with ESMTPSA id h22sm5081758pgn.57.2020.02.24.19.44.42 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 24 Feb 2020 19:44:46 -0800 (PST) Date: Tue, 25 Feb 2020 11:44:38 +0800 From: Aaron Lu To: Aubrey Li Cc: Tim Chen , Julien Desfossez , Vineeth Remanan Pillai , Nishanth Aravamudan , Peter Zijlstra , Ingo Molnar , Thomas Gleixner , Paul Turner , Linus Torvalds , Linux List Kernel Mailing , Dario Faggioli , =?iso-8859-1?Q?Fr=E9d=E9ric?= Weisbecker , Kees Cook , Greg Kerr , Phil Auld , Valentin Schneider , Mel Gorman , Pawan Gupta , Paolo Bonzini Subject: Re: [RFC PATCH v4 00/19] Core scheduling v4 Message-ID: <20200225034438.GA617271@ziqianlu-desktop.localdomain> References: <5e3cea14-28d1-bf1e-cabe-fb5b48fdeadc@linux.intel.com> <3c3c56c1-b8dc-652c-535e-74f6dcf45560@linux.intel.com> <20200212230705.GA25315@sinkpad> <29d43466-1e18-6b42-d4d0-20ccde20ff07@linux.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Feb 14, 2020 at 02:10:53PM +0800, Aubrey Li wrote: > On Fri, Feb 14, 2020 at 2:37 AM Tim Chen wrote: > > > > On 2/12/20 3:07 PM, Julien Desfossez wrote: > > > > >> > > >> Have you guys been able to make progress on the issues with I/O intensive workload? > > > > > > I finally have some results with the following branch: > > > https://github.com/digitalocean/linux-coresched/tree/coresched/v4-v5.5.y > > > > > > > > > So the main conclusion is that for all the test cases we have studied, > > > core scheduling performs better than nosmt ! This is different than what > > > we tested a while back, so it's looking really good ! > > > > Thanks for the data. They look really encouraging. > > > > Aubrey is working on updating his patches so it will load balance > > to the idle cores a bit better. We are testing those and will post > > the update soon. > > I added a helper to check task and cpu cookie match, including the > entire core idle case. The refined patchset updated at here: > https://github.com/aubreyli/linux/tree/coresched_v4-v5.5.2 > > This branch also includes Tim's patchset. According to our testing > result, the performance data looks on par with the previous version. > A good news is, v5.4.y stability issue on our 8 numa node machine > is gone on this v5.5.2 branch. One problem I have when testing this branch: the weight of the croup seems to be ignored. On a 2sockets/16cores/32threads VM, I grouped 8 sysbench(cpu mode) threads into one cgroup(cgA) and another 16 sysbench(cpu mode) threads into another cgroup(cgB). cgA and cgB's cpusets are set to the same socket's 8 cores/16 CPUs and cgA's cpu.shares is set to 10240 while cgB's cpu.shares is set to 2(so consider cgB as noise workload and cgA as the real workload). I had expected cgA to occupy 8 cpus(with each cpu on a different core) most of the time since it has way more weight than cgB, while cgB should occupy almost no CPUs since: - when cgB's task is in the same CPU queue as cgA's task, then cgB's task is given very little CPU due to its small weight; - when cgB's task is in a CPU queue whose sibling's queue has cgA's task, cgB's task should be forced idle(again, due to its small weight). But testing shows cgA occupies only 2 cpus during the entire run while cgB enjoys the remaining 14 cpus. As a comparison, when coresched is off, cgA can occupy 8 cpus during its run. I haven't taken a look at the patches, but would like to raise the problem first. My gut feeling is that, we didn't make the CPU's load balanced. P.S. it's not that I care about VM's performance, it's just easier to test kernel stuff using a VM than on a bare metal. Its CPU setup might seem weird, I just set it up to be the same as my host setup.