Received: by 2002:a05:6358:bb9e:b0:b9:5105:a5b4 with SMTP id df30csp3930369rwb; Mon, 5 Sep 2022 22:52:31 -0700 (PDT) X-Google-Smtp-Source: AA6agR4Fj8k7UL0Oss1GEWjir9ygYQc/s6HrlEPPNo5RyVXoWZ1tFpbm6Xfkjql/J4GYYiXVatpO X-Received: by 2002:a17:90b:4d81:b0:1fb:5e0c:6800 with SMTP id oj1-20020a17090b4d8100b001fb5e0c6800mr23674446pjb.171.1662443551563; Mon, 05 Sep 2022 22:52:31 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1662443551; cv=pass; d=google.com; s=arc-20160816; b=XfAoi/QyvZLGRJCEluYy4HvckaxxH8gmSzjnuK+xRsDlVWSTycQUqVwLU83MCy8dsY mIYdmYI+Zdv1Ke90u5szRM6jz4DUf/l+d633WK6fD7sHh0qYxAAGkTTu6NOuEpXlvach la7zKw8hQPY6Thjubh3r0xhmKSqrypt/rheH/rHS93WgZ25m8BZStzT1Obj0fhTpoCaa 6Lq24B2kb61RuwXEdfHOCgPpwdmG3Slt18SHUdksMW2bWS9QNWKKdfvtNGdMzdxAUBIN VpIupIqtEjfZLMPCCJEy9tjesDz555vP1mFxh6n5pKdOkTRTf5nmJ7t5/d53K0SrbrOb thUw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:content-transfer-encoding :in-reply-to:from:references:cc:to:content-language:subject :user-agent:date:message-id:dkim-signature; bh=QgU4BhoqScJgUMaAKt3YjS0RSsQS8ttfalJaxVt0v9o=; b=xcIeAXHo0JM2ePqtQW7kVkCdVKU8CyrosZ8raVHTZgSNhTfm2qIpxGK+gTJ7Zjb/3v 83W377tLX6meq9rZPQp3ED/bh1UjAa7wuM3M3Txl0136WC0gLFq5fCb0faodr//NTi4C 7bS14uYhjbXWf1RnOdjj8LnMTYmHbjQlEdYHpem1wywHccq0A1opwe0mRyV/18tRT6y+ xsUllFP6uq/FwudgCaHlcITH/GM4oraF3Iy1V+XcWj5s9lzRYMLHpIcwXmBRQiCsmHPO FmVw6gnOqlGTSq8VucDTSI9aKk3LIPqiPy3V44xyg6rohpe3YPmiN8X3J67Mjpajo1vs GHjw== ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@amd.com header.s=selector1 header.b=wyvBcY4X; arc=pass (i=1 spf=pass spfdomain=amd.com dkim=pass dkdomain=amd.com dmarc=pass fromdomain=amd.com); spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=amd.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id b29-20020a63715d000000b0042c87a10af7si12654265pgn.400.2022.09.05.22.52.20; Mon, 05 Sep 2022 22:52:31 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@amd.com header.s=selector1 header.b=wyvBcY4X; arc=pass (i=1 spf=pass spfdomain=amd.com dkim=pass dkdomain=amd.com dmarc=pass fromdomain=amd.com); spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=amd.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232761AbiIFF3E (ORCPT + 99 others); Tue, 6 Sep 2022 01:29:04 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59832 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231664AbiIFF3B (ORCPT ); Tue, 6 Sep 2022 01:29:01 -0400 Received: from NAM02-DM3-obe.outbound.protection.outlook.com (mail-dm3nam02on2083.outbound.protection.outlook.com [40.107.95.83]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B30295FAF6 for ; Mon, 5 Sep 2022 22:28:59 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=HJKz0SKEb32Hypu4Rkci/rqzUo1nFnenXnPMnSz5pLUOSwmQKxGOsS42fa9Xsm5rOe5MkJ4TKenwgJsGmC3f25jwcMDDNpMnlkMKF4ecW3G/z4Q1jOg9DT7ZSMPibDeWaD34H5AsbscTui+tYhejWO0TfoIwHQ+9P32Q+hu4REkw96kdiJvJzmHllf2J4h+5MwWmgFQGQdmYdev74zkywSIFJWKD7aurjqxTBouD/hFBAJYiIiPOeDzC/t1BNvsDR4pIc65oLIIgJMtuHZUqxdDbTcHWCrHMSgT/XIx+xdNy29NQya5b8k0u7FAHehR59nHVbRLvc0wLjCnCk2SiEQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=QgU4BhoqScJgUMaAKt3YjS0RSsQS8ttfalJaxVt0v9o=; b=Xr6Ca6anscJRpwC9l20juE6XoJORQ1O+XmWi5ONY3RRLLcwSfgLhKbyqAvPX+B1voZi5M+qlvIJernscG+vp0nnEx0UlNxU5PkZrXEyILw/5FbLMLmNKi+xgKUQKHxi/nRrRx+71IiXWUaxPZsu3FS0cKPoinmjbr5Ea8eYyyN+1V+qO9RGwSB7Z0cCSvYr/tcAVVFALh9Ft449xaINWccceoKOyNrU8056nc01CnfelR6oOSdsA2vd8uhLa9TIMTGY9dfALwl2s3jCpJwTxUohA3KGga5DqxEdw6xtVIlYz8TiV+//K5+Pr8K+uS6gbgTHeOQr+wxUAJi1HV/8/gQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=amd.com; dmarc=pass action=none header.from=amd.com; dkim=pass header.d=amd.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amd.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=QgU4BhoqScJgUMaAKt3YjS0RSsQS8ttfalJaxVt0v9o=; b=wyvBcY4X9eibQG+8MwmVD4dfkSD8bLFRDJC23UnBwxUnZ3dWMgV+NSV95lOz+HrKV9K93XaccackbLSC7M53nmvXsqmuFsNsBzILSGzoLEFI9FkUsPB8I3MZGFOYQ9Ne86i7ZMcDitTij1lmxown90o2IF49FVuujisyemrh+VY= Authentication-Results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=amd.com; Received: from MW2PR12MB2379.namprd12.prod.outlook.com (2603:10b6:907:9::24) by SJ0PR12MB6710.namprd12.prod.outlook.com (2603:10b6:a03:44c::7) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.5588.10; Tue, 6 Sep 2022 05:28:57 +0000 Received: from MW2PR12MB2379.namprd12.prod.outlook.com ([fe80::4472:b345:3db2:bcd]) by MW2PR12MB2379.namprd12.prod.outlook.com ([fe80::4472:b345:3db2:bcd%6]) with mapi id 15.20.5588.017; Tue, 6 Sep 2022 05:28:56 +0000 Message-ID: Date: Tue, 6 Sep 2022 10:58:40 +0530 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.13.0 Subject: Re: [PATCH v7 0/2] sched/fair: Scan cluster before scanning LLC in wake-up path Content-Language: en-US To: Yicong Yang , peterz@infradead.org, mingo@redhat.com, juri.lelli@redhat.com, vincent.guittot@linaro.org, tim.c.chen@linux.intel.com, gautham.shenoy@amd.com, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org Cc: dietmar.eggemann@arm.com, rostedt@goodmis.org, bsegall@google.com, bristot@redhat.com, prime.zeng@huawei.com, yangyicong@hisilicon.com, jonathan.cameron@huawei.com, ego@linux.vnet.ibm.com, srikar@linux.vnet.ibm.com, linuxarm@huawei.com, 21cnbao@gmail.com, guodong.xu@linaro.org, hesham.almatary@huawei.com, john.garry@huawei.com, shenyang39@huawei.com, yu.c.chen@intel.com, wuyun.abel@bytedance.com References: <20220822073610.27205-1-yangyicong@huawei.com> From: K Prateek Nayak In-Reply-To: <20220822073610.27205-1-yangyicong@huawei.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-ClientProxiedBy: PN3PR01CA0135.INDPRD01.PROD.OUTLOOK.COM (2603:1096:c01:bf::6) To MW2PR12MB2379.namprd12.prod.outlook.com (2603:10b6:907:9::24) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-Office365-Filtering-Correlation-Id: c2a3c475-68d0-4ae8-203b-08da8fc8b148 X-MS-TrafficTypeDiagnostic: SJ0PR12MB6710:EE_ X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: 6QJHrVQzciFn6gWJvroeRZa+P6I39ShyOmA3CImam+FYbc7sYzfKOKm1Uub//UP4u3Tz1uUP1vLcbBMVPa8hcxl7dyR9TecFRSGYmxg5+g4UfcU9gkcjT8Rhenm/UTxEoFURlyfe1Ewe9tCIBMJWKDOt2RRH8Uczy/rBdqbFInhptybzaqVGAVriWSxjogcm8W+WsymbbkPu/xXCSIEV5urXRUcb9LFNGCT9K9ZXyBvFipqV4HXbePAPYPdHk6Z8IkujYZHIL0agA2hjWXfutfqJdfF38JNV21lfihsuSqP8FZtSym4bj3JBv8YWEI4SUcVI1fRopROVUOexW+JHodKc2U0XD6ksIcpwzUCbwcAkvK27F5nZDgp9go8J6SULQnnqi+d+UUd7XtG+e/9ZhxMnG1pq+7w7+UyxmfHJsBktK3CzmWxOUGKJnMFFxRBl3DdIsKtBKpXgqtZrjkpmIcDYcHG9EWINNIiEna8aF+KkzggM6vgqOL6AT6jxSDfuDrNDEudu3cVWjAsp/qI3lfBva9VVgLUJzolpS7MDGAIK7ZXa9hAApxOb1Ti/qfq06fKJGXzj64/uRzLOY0IA9hplfjKbbD0Ohikl/uzUhi+NILHw/oZrW9Xgk1zPuCErnapxDNfO9jcSA4VlgXW//G0ZHDLKhGxPHW5PDJTt+f5H9XXAqeaFMUpV8ixKAHtK9UYDJRud2rtYevcI1xZIIBovhPG6iGDX4R2Qp0xftlEeodYQZ5cjVCUog58jwLp+jNZpKZb8HOFV7whzIIw+d6LYt64ggBRLqE0tDqFWpecDh/TLJymjWgnfZFBwun0rHXsCHhcGY6crifrQ7XiSSw== X-Forefront-Antispam-Report: CIP:255.255.255.255;CTRY:;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:MW2PR12MB2379.namprd12.prod.outlook.com;PTR:;CAT:NONE;SFS:(13230016)(4636009)(136003)(39860400002)(396003)(376002)(346002)(366004)(53546011)(2906002)(6666004)(6512007)(8676002)(66556008)(6506007)(66946007)(66476007)(316002)(36756003)(478600001)(2616005)(41300700001)(26005)(4326008)(31696002)(86362001)(6486002)(83380400001)(8936002)(5660300002)(7416002)(31686004)(186003)(38100700002)(45980500001)(43740500002);DIR:OUT;SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?B?MnkrcnUyRlplVHlxdjBsT2RYUXExN3laK245T1pBaHc3dk5EbndpYVpER0Yz?= =?utf-8?B?V2Z1eHRDY0tHNk1XZ0ozWGszOXltWFFMNWQrbDFrbTN4Umo1R05UTjN4WXZE?= =?utf-8?B?NWxHK2xZL2wxR2ZnN29JN1FENDNJS2RCOW1ObnZ1UTg2b3NCRi9YMHZwdWYy?= =?utf-8?B?M1ZMQlVyZ3lQUHNsTWtKMXBPak01bXgxT3Nudkh2aElhamQrQURveDFLdFg0?= =?utf-8?B?U1crYW1MamorMjNISTk5SEFCeitiV21rOFJhRUZHbnVTYmxyRDgrRHNpZUJU?= =?utf-8?B?dlZhaXUxeHFYVDI5M01MNnBicWlOZlQveitlWXFTNmpLT3ljUXR3L1N1THM0?= =?utf-8?B?Y3BsQmhTNmk1UXRHNmMvMWpqSVFlM0pUWHdwd1RFZFJSMWExQ3NPZkZCUFhP?= =?utf-8?B?aDZxdE1DUG1uVUo5QklHNHdxMzdubTRjaXFuV1ZTdk1SNTBUdjdmWE44WVda?= =?utf-8?B?L2Y4RFNGU2srZ3RxYm5yZ2JrWVRHelg4NndVYUxNaVpha043eWRpakpINStQ?= =?utf-8?B?ZTBZa0Eycmgzb3FCeTI5djV2aTNuOHJSdHhwQjFxbGNvUXJCZnlzckxpVXpN?= =?utf-8?B?bnRydEdhZG9iVWpRWUp2ZWxkTjhuSlZ4QmFUb3RVMnJ6bXlXSHd0dVA4UjB4?= =?utf-8?B?TkViMXBaa1JoZ3RRdExLL1VWcWhTNGxhOVZackJGUnRLelZTRUQ2NGJrblND?= =?utf-8?B?cGN0Q09XUnR2Tkt1SEx2SkR0TmJqaG5CRnB0VmljUmp2VWdKOGxoTjJ6WUZ3?= =?utf-8?B?d0g5ZFV5MkY1dHRNSFpFY2xnaTVMT2hiSUFDUnBvQXkyeE5zZG1nRExwVWFK?= =?utf-8?B?T1pkaTl4d3ovcHF3Vi9NM0VoNkFLSmxxTUxKNWFJQ1h4L0RlMXB0VmtDTlRs?= =?utf-8?B?QkNEME82NC9oSUt4T0lENVNtYnlJM1dWenpTbEZEalFmOUFMY1htWXliQnBC?= =?utf-8?B?SVVhbjhaSGJsbDMwejQ2ZnlrRjBIdlN2a1NXNDVSOUQvajE2L2Fad29wNXMx?= =?utf-8?B?M3A3Yk12bzdUOEZyMlJkUk02Q3BYNENCSmo0WVlUbkVBeHFJWFU4T0RvSXJN?= =?utf-8?B?NEpESnBCZnBMd1dKRHN4ZkZvbjhiU3FmZFlzelBCUHdIaXZ6dmJvNmtOVlJw?= =?utf-8?B?cklud0dWMXRDZGw1eWxkYy9PTzJFaDRPUko0dVJPVG5xOVViL2srUEx4RHZX?= =?utf-8?B?QVhyUkt1c2xoK0F6WmlWemo4T0tJd1BxN2huSU1rNlQyNG9NNGdLSkhudTJS?= =?utf-8?B?akUwc0ZLWjU5bXNNdnRLb0prTnBMWTJOWWlhTWt2ZU9sbzhEWHd3ck5QbDgx?= =?utf-8?B?U1ZsMjlWbmtTY0xFU3IvRlQ4UnFmL05VenpFMXVxSkxqekJKM0p2T3B6dkpU?= =?utf-8?B?Kzd5QTZlVTIxcW1nV0ZhQ3FpRkpwT3FhdGxBWGhYdHY4aFo4eTRZRVU2dkcx?= =?utf-8?B?TEZPbVpvYjllVTl0RjUrSjdONUJONTNkVFNYS0NXNE5RL2JKYVFwd2wrQWY2?= =?utf-8?B?V2VCd0tJSWZKYVg4Tk50RmFqTEdmWEUzQ1lHZ0g1cTczZjh1NlJoM01MSEJ1?= =?utf-8?B?WmYxWFRta05zWUxUdm94UEFJbW9DbVJ5MDlhSUxySEFlVk16WUdiQXRSSCs0?= =?utf-8?B?RHRyL3E4aG04ci83dmRQOVhYYXNnNktyYmV0Tjh4RlBlUGhBZ2xXZFluSTk2?= =?utf-8?B?dHR1MHpzWTk3RFdaellkRE9oRGJqQ05qY3o2NXlNSTlnalBFUVZVcUJuM1lD?= =?utf-8?B?R2hERTVlbzB6a1E5cmJETkRQbnVUUXVCZzdobWN6ZCtjZWpFR2pOU1dpdEF3?= =?utf-8?B?UlFnYVoyalNFbDBkamxtN1dzNGtUbnFCd2o5VmFDdkdHV0pKQkdKbTdQOHF6?= =?utf-8?B?Mm5FYkRXbEN6NDJWWklkZHVTNTR6ZWxzd0RsK2Q3QWwrMEV2VGhvazJva0hz?= =?utf-8?B?RklzUzAycVFpQnlsMERGN1psZlVHU2RUaksxVlZpSUlINVNFK1lRVDR6K2pp?= =?utf-8?B?ZlQ4dXpmWXc5OStMQU9NSVNVQ2ZudFFjcHNqTUNDeUU2NE9neUxkV0E0L2xj?= =?utf-8?B?OEd5amhxMlNvcExBZk1ONkRYMUowaWo5cEl4dlgyQ08ybVkvTWhHY3VGNkF3?= =?utf-8?Q?nDwWiO/sNqaqgnWSomGaq3T5D?= X-OriginatorOrg: amd.com X-MS-Exchange-CrossTenant-Network-Message-Id: c2a3c475-68d0-4ae8-203b-08da8fc8b148 X-MS-Exchange-CrossTenant-AuthSource: MW2PR12MB2379.namprd12.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 06 Sep 2022 05:28:56.3433 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: 3dd8961f-e488-4e60-8e11-a82d994e183d X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: 7vStszpQIgFcoA2ccLRJf50wPOS6HmmHEGqKQPdWx2vgBKJx6NSMIYmvzFCcObiTYppCA4exbOnqqHCr3qczDg== X-MS-Exchange-Transport-CrossTenantHeadersStamped: SJ0PR12MB6710 X-Spam-Status: No, score=-3.8 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2,SPF_HELO_PASS,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hello Yicong, We've tested the series on a dual socket Zen3 system (2 x 64C/128T). tl;dr - The results look good and the changes do not affect the Zen3 machine which doesn't contain any sched domain with SD_CLUSTER flag set. - With the latest BIOS, I don't see any regression due to the addition of the new per CPU variables. We had observed a regression in tbench previously when testing the v4 of the series on the system with a slightly outdated BIOS (https://lore.kernel.org/lkml/e000b124-afd4-28e1-fde2-393b0e38ce19@amd.com/) but that doesn't seem to be the case with the latest BIOS :) Detailed results from the standard benchmarks are reported below. On 8/22/2022 1:06 PM, Yicong Yang wrote: > From: Yicong Yang > > This is the follow-up work to support cluster scheduler. Previously > we have added cluster level in the scheduler for both ARM64[1] and > X86[2] to support load balance between clusters to bring more memory > bandwidth and decrease cache contention. This patchset, on the other > hand, takes care of wake-up path by giving CPUs within the same cluster > a try before scanning the whole LLC to benefit those tasks communicating > with each other. > > [1] 778c558f49a2 ("sched: Add cluster scheduler level in core and related Kconfig for ARM64") > [2] 66558b730f25 ("sched: Add cluster scheduler level for x86") > Discussed below are the results from running standard benchmarks on a dual socket Zen3 (2 x 64C/128T) machine configured in different NPS modes. NPS Modes are used to logically divide single socket into multiple NUMA region. Following is the NUMA configuration for each NPS mode on the system: NPS1: Each socket is a NUMA node. Total 2 NUMA nodes in the dual socket machine. Node 0: 0-63, 128-191 Node 1: 64-127, 192-255 NPS2: Each socket is further logically divided into 2 NUMA regions. Total 4 NUMA nodes exist over 2 socket. Node 0: 0-31, 128-159 Node 1: 32-63, 160-191 Node 2: 64-95, 192-223 Node 3: 96-127, 223-255 NPS4: Each socket is logically divided into 4 NUMA regions. Total 8 NUMA nodes exist over 2 socket. Node 0: 0-15, 128-143 Node 1: 16-31, 144-159 Node 2: 32-47, 160-175 Node 3: 48-63, 176-191 Node 4: 64-79, 192-207 Node 5: 80-95, 208-223 Node 6: 96-111, 223-231 Node 7: 112-127, 232-255 Benchmark Results: Kernel versions: - tip: 5.19.0 tip sched/core - cluster: 5.19.0 tip sched/core + both the patches of the series When we started testing, the tip was at: commit: 5531ecffa4b9 "sched: Add update_current_exec_runtime helper" ~~~~~~~~~~~~~ ~ hackbench ~ ~~~~~~~~~~~~~ NPS1 Test: tip cluster 1-groups: 4.31 (0.00 pct) 4.31 (0.00 pct) 2-groups: 4.93 (0.00 pct) 4.86 (1.41 pct) 4-groups: 5.38 (0.00 pct) 5.36 (0.37 pct) 8-groups: 5.59 (0.00 pct) 5.54 (0.89 pct) 16-groups: 7.18 (0.00 pct) 7.47 (-4.03 pct) NPS2 Test: tip cluster 1-groups: 4.25 (0.00 pct) 4.40 (-3.52 pct) 2-groups: 4.83 (0.00 pct) 4.73 (2.07 pct) 4-groups: 5.25 (0.00 pct) 5.18 (1.33 pct) 8-groups: 5.56 (0.00 pct) 5.45 (1.97 pct) 16-groups: 6.72 (0.00 pct) 6.63 (1.33 pct) NPS4 Test: tip cluster 1-groups: 4.24 (0.00 pct) 4.23 (0.23 pct) 2-groups: 4.88 (0.00 pct) 4.78 (2.04 pct) 4-groups: 5.30 (0.00 pct) 5.25 (0.94 pct) 8-groups: 5.66 (0.00 pct) 5.61 (0.88 pct) 16-groups: 6.79 (0.00 pct) 7.05 (-3.82 pct) ~~~~~~~~~~~~ ~ schbench ~ ~~~~~~~~~~~~ NPS1 #workers: tip cluster 1: 37.00 (0.00 pct) 22.00 (40.54 pct) 2: 39.00 (0.00 pct) 23.00 (41.02 pct) 4: 41.00 (0.00 pct) 30.00 (26.82 pct) 8: 53.00 (0.00 pct) 47.00 (11.32 pct) 16: 73.00 (0.00 pct) 73.00 (0.00 pct) 32: 116.00 (0.00 pct) 117.00 (-0.86 pct) 64: 217.00 (0.00 pct) 221.00 (-1.84 pct) 128: 477.00 (0.00 pct) 444.00 (6.91 pct) 256: 1062.00 (0.00 pct) 1050.00 (1.12 pct) 512: 47552.00 (0.00 pct) 48576.00 (-2.15 pct) NPS2 #workers: tip cluster 1: 20.00 (0.00 pct) 20.00 (0.00 pct) 2: 22.00 (0.00 pct) 23.00 (-4.54 pct) 4: 30.00 (0.00 pct) 31.00 (-3.33 pct) 8: 46.00 (0.00 pct) 49.00 (-6.52 pct) 16: 70.00 (0.00 pct) 72.00 (-2.85 pct) 32: 120.00 (0.00 pct) 118.00 (1.66 pct) 64: 215.00 (0.00 pct) 216.00 (-0.46 pct) 128: 482.00 (0.00 pct) 449.00 (6.84 pct) 256: 1042.00 (0.00 pct) 995.00 (4.51 pct) 512: 47552.00 (0.00 pct) 47296.00 (0.53 pct) NPS4 #workers: tip cluster 1: 18.00 (0.00 pct) 20.00 (-11.11 pct) 2: 23.00 (0.00 pct) 22.00 (4.34 pct) 4: 27.00 (0.00 pct) 30.00 (-11.11 pct) 8: 57.00 (0.00 pct) 60.00 (-5.26 pct) 16: 76.00 (0.00 pct) 84.00 (-10.52 pct) 32: 120.00 (0.00 pct) 115.00 (4.16 pct) 64: 219.00 (0.00 pct) 212.00 (3.19 pct) 128: 459.00 (0.00 pct) 442.00 (3.70 pct) 256: 1078.00 (0.00 pct) 983.00 (8.81 pct) 512: 47040.00 (0.00 pct) 48192.00 (-2.44 pct) Note: schbench displays lot of run to run variance for low worker count. This behavior is due to the timing of new-idle balance which is not consistent across runs. ~~~~~~~~~~ ~ tbench ~ ~~~~~~~~~~ NPS1 Clients: tip cluster 1 573.26 (0.00 pct) 572.61 (-0.11 pct) 2 1131.19 (0.00 pct) 1122.41 (-0.77 pct) 4 2100.07 (0.00 pct) 2081.74 (-0.87 pct) 8 3809.88 (0.00 pct) 3732.14 (-2.04 pct) 16 6560.72 (0.00 pct) 6289.22 (-4.13 pct) 32 12203.23 (0.00 pct) 11811.74 (-3.20 pct) 64 22389.81 (0.00 pct) 21587.79 (-3.58 pct) 128 32449.37 (0.00 pct) 32967.15 (1.59 pct) 256 58962.40 (0.00 pct) 56604.63 (-3.99 pct) 512 59608.71 (0.00 pct) 56529.95 (-5.16 pct) * (Machine Overloaded) 512 57925.05 (0.00 pct) 56697.38 (-2.11 pct) [Verification Run] 1024 58037.02 (0.00 pct) 55751.53 (-3.93 pct) NPS2 Clients: tip cluster 1 574.20 (0.00 pct) 572.49 (-0.29 pct) 2 1131.56 (0.00 pct) 1149.53 (1.58 pct) 4 2132.26 (0.00 pct) 2084.18 (-2.25 pct) 8 3812.20 (0.00 pct) 3683.04 (-3.38 pct) 16 6457.61 (0.00 pct) 6340.70 (-1.81 pct) 32 12263.82 (0.00 pct) 11714.15 (-4.48 pct) 64 22224.11 (0.00 pct) 21226.34 (-4.48 pct) 128 33040.38 (0.00 pct) 32478.99 (-1.69 pct) 256 56547.25 (0.00 pct) 52915.71 (-6.42 pct) * (Machine Overloaded) 256 55631.80 (0.00 pct) 52905.99 (-4.89 pct) [Verification Run] 512 56220.67 (0.00 pct) 54735.69 (-2.64 pct) 1024 56048.88 (0.00 pct) 54426.63 (-2.89 pct) NPS4 Clients: tip cluster 1 575.50 (0.00 pct) 570.65 (-0.84 pct) 2 1138.70 (0.00 pct) 1137.75 (-0.08 pct) 4 2070.66 (0.00 pct) 2103.18 (1.57 pct) 8 3811.70 (0.00 pct) 3573.52 (-6.24 pct) * 8 3769.53 (0.00 pct) 3653.05 (-3.09 pct) [Verification Run] 16 6312.80 (0.00 pct) 6212.41 (-1.59 pct) 32 11418.14 (0.00 pct) 11721.01 (2.65 pct) 64 19671.16 (0.00 pct) 20053.77 (1.94 pct) 128 30258.53 (0.00 pct) 32585.15 (7.68 pct) 256 55838.10 (0.00 pct) 51318.64 (-8.09 pct) * (Machine Overloaded) 256 54291.03 (0.00 pct) 54379.80 (0.16 pct) [Verification Run] 512 55586.44 (0.00 pct) 51538.93 (-7.28 pct) * (Machine Overloaded) 512 54190.04 (0.00 pct) 54096.16 (-0.17 pct) [Verification Run] 1024 56370.35 (0.00 pct) 50768.68 (-9.93 pct) * (Machine Overloaded) 1024 56498.36 (0.00 pct) 54661.85 (-3.25 pct) [Verification Run] ~~~~~~~~~~ ~ stream ~ ~~~~~~~~~~ NPS1 - 10 Runs: Test: tip cluster Copy: 332237.51 (0.00 pct) 338085.24 (1.76 pct) Scale: 215236.94 (0.00 pct) 214179.72 (-0.49 pct) Add: 250753.67 (0.00 pct) 251181.86 (0.17 pct) Triad: 259467.60 (0.00 pct) 262541.92 (1.18 pct) - 100 Runs: Test: tip cluster Copy: 329320.65 (0.00 pct) 336947.39 (2.31 pct) Scale: 218102.78 (0.00 pct) 219617.85 (0.69 pct) Add: 251283.30 (0.00 pct) 251918.03 (0.25 pct) Triad: 258044.33 (0.00 pct) 261512.99 (1.34 pct) NPS2 - 10 Runs: Test: tip cluster Copy: 336926.24 (0.00 pct) 324310.01 (-3.74 pct) Scale: 220120.41 (0.00 pct) 212795.43 (-3.32 pct) Add: 252428.34 (0.00 pct) 254355.80 (0.76 pct) Triad: 274268.23 (0.00 pct) 261777.03 (-4.55 pct) - 100 Runs: Test: tip cluster Copy: 338126.49 (0.00 pct) 338947.03 (0.24 pct) Scale: 230229.59 (0.00 pct) 229991.65 (-0.10 pct) Add: 253964.25 (0.00 pct) 264374.57 (4.09 pct) Triad: 272176.19 (0.00 pct) 274587.35 (0.88 pct) NPS4 - 10 Runs: Test: tip cluster Copy: 367144.56 (0.00 pct) 375452.26 (2.26 pct) Scale: 246928.04 (0.00 pct) 243651.53 (-1.32 pct) Add: 272096.30 (0.00 pct) 272845.33 (0.27 pct) Triad: 286644.55 (0.00 pct) 290925.20 (1.49 pct) - 100 Runs: Test: tip cluster Copy: 351980.15 (0.00 pct) 375854.72 (6.78 pct) Scale: 254918.41 (0.00 pct) 255904.90 (0.38 pct) Add: 272722.89 (0.00 pct) 274075.11 (0.49 pct) Triad: 283340.94 (0.00 pct) 287608.77 (1.50 pct) ~~~~~~~~~~~~~~~~~~~~ ~ Additional notes ~ ~~~~~~~~~~~~~~~~~~~~ - schbench is know to have a noticeable run-to-run variation for lower worker counts and any improvements or regression observed can be safely ignored. The results are included to make sure there are no unnecessarily large regressions as a result of task pileup. - tbench shows slight run to run variation with larger number of clients on both tip and patched kernel. This is expected as the machine is overloaded at that point (equivalent of two or more tasks per CPU). "Verification Run" shows none of these regressions are persistent. > > [..snip..] > Overall, the changes look good and doesn't affect system without a SD_CLUSTER domain like the Zen3 system used during testing. Tested-by: K Prateek Nayak -- Thanks and Regards, Prateek