Received: by 2002:a05:7412:a9a2:b0:e2:908c:2ebd with SMTP id o34csp322376rdh; Thu, 26 Oct 2023 03:20:17 -0700 (PDT) X-Google-Smtp-Source: AGHT+IFGxIxk8CeoDa3zXCqMfH3wvyu4C9clEg9K1oQlOFmBMshZt+8yt9drI2LGNbI6LBYwiEmq X-Received: by 2002:a0d:e743:0:b0:5a7:e445:fad9 with SMTP id q64-20020a0de743000000b005a7e445fad9mr16316543ywe.35.1698315617382; Thu, 26 Oct 2023 03:20:17 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1698315617; cv=none; d=google.com; s=arc-20160816; b=nCHa0Ion55S4c4VavG2r48uqXzwjYgFf8OJ0Dv3qsMxLLhFQFnUGEpV0xpQLgH/56k BwfDR4HBKXXrUGmUD8zvQCnaf133VoffiJirRkVk1De1jrCdCpVU3oCJFi0Noy+uPhkg 9fkJdCGucmGJsHH5+73DrK3HLalyenXWYYNfnFmpkrPnvYPvrII+F1m/Dhd9TzkoM37U a/PyVCG1ISNXMWFgIHJzTgLhpl9eK1JzwUR6uaLZAai+42zxrjT6wJPywBJ5oyhOvRrU ww4sH/UpAeombaX4BFacakEEU3ycT8hmrseauM7kkyfGLAxdogN8fczDrL/+L8mmC9am ZaJQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=ovM917uH+87OQ0k9Dg80qKD8TwUEd4qrT2eob+aCDVU=; fh=u4XIdY0DYjvtTTjUYLO7QbzqkOE6loFVRUNK9Cnal3k=; b=I6trJQ9HZLt2TZG83RpknAGS1tYf8tx3hPCJDKaw2zQZm0N9aT121jHK4xhGkHp+1t 18OiaWpwEtJeRlx4MzvTCbOxHLl+kT4T4pNTHXoSYNkQinJFGQfNnKgrx/YeIQ5hNX4/ 5FjneG2j+T6NG3ZdLkqVdMKtBEUvG+bZg3hnjBJqejUDoyDyEIr3DmQ36QCuaYR1ovv1 /lxBQfYqFalF6nTdeeQUIAHB0gx7nyCg8/+8FrwiavlQXRVLlsvQTeWhexbEZabKBqWB 00JXef247r3GGVGp9a8aS/Mpd3OwLRQxD5B5SGN8ShQqaQH7Vvm+bS4nk8N4iOOOz1TV rjoA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ibm.com header.s=pp1 header.b=EWwuFy0C; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.37 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from snail.vger.email (snail.vger.email. [23.128.96.37]) by mx.google.com with ESMTPS id x72-20020a81a04b000000b005a4d69af753si13761493ywg.328.2023.10.26.03.20.16 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 26 Oct 2023 03:20:17 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.37 as permitted sender) client-ip=23.128.96.37; Authentication-Results: mx.google.com; dkim=pass header.i=@ibm.com header.s=pp1 header.b=EWwuFy0C; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.37 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=NONE dis=NONE) header.from=ibm.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by snail.vger.email (Postfix) with ESMTP id 0A9D482164CA; Thu, 26 Oct 2023 03:20:16 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at snail.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231311AbjJZKUI (ORCPT + 99 others); Thu, 26 Oct 2023 06:20:08 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:47248 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231239AbjJZKUB (ORCPT ); Thu, 26 Oct 2023 06:20:01 -0400 Received: from mx0b-001b2d01.pphosted.com (mx0b-001b2d01.pphosted.com [148.163.158.5]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 9C703199 for ; Thu, 26 Oct 2023 03:19:58 -0700 (PDT) Received: from pps.filterd (m0360072.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.17.1.19/8.17.1.19) with ESMTP id 39QAF9f3023246; Thu, 26 Oct 2023 10:19:38 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ibm.com; h=from : to : cc : subject : date : message-id : in-reply-to : references : mime-version : content-transfer-encoding; s=pp1; bh=ovM917uH+87OQ0k9Dg80qKD8TwUEd4qrT2eob+aCDVU=; b=EWwuFy0Ctcj4OV6v/Z8tDzN/6zqFqh3f0JC/N44x6eG3GpXeGRb7Ck04bmk2J0h6Jwi2 u8KQPDo+WDExkPIyQfQgy3PubSEHoSdyWDdi6GaK3jxMyub7NcMV9McZGsx8QC9nSbg5 KcY0gwNDuM8zAQZJpP3GA697mUbsEeb7GKegUb9cVHfArzpSJffsCosBIni/JjXX28rY ZsT41VbFUhjz2E4L0DkWMiSVr+eCIwwFdQX3JHU2lNnJ6PFtDe8faXPhKPhC420EXstA MgJcacgnL9oehpFfX+Fu5tgf7DV0BEnRTIU66zrqtdUL0TVhQbCNVjNsaAUabyI+9EjV Dw== Received: from pps.reinject (localhost [127.0.0.1]) by mx0a-001b2d01.pphosted.com (PPS) with ESMTPS id 3typ6qg3xy-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Thu, 26 Oct 2023 10:19:37 +0000 Received: from m0360072.ppops.net (m0360072.ppops.net [127.0.0.1]) by pps.reinject (8.17.1.5/8.17.1.5) with ESMTP id 39QAFGC6024458; Thu, 26 Oct 2023 10:19:37 GMT Received: from ppma13.dal12v.mail.ibm.com (dd.9e.1632.ip4.static.sl-reverse.com [50.22.158.221]) by mx0a-001b2d01.pphosted.com (PPS) with ESMTPS id 3typ6qg3xk-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Thu, 26 Oct 2023 10:19:37 +0000 Received: from pps.filterd (ppma13.dal12v.mail.ibm.com [127.0.0.1]) by ppma13.dal12v.mail.ibm.com (8.17.1.19/8.17.1.19) with ESMTP id 39QAFijh024403; Thu, 26 Oct 2023 10:19:36 GMT Received: from smtprelay01.fra02v.mail.ibm.com ([9.218.2.227]) by ppma13.dal12v.mail.ibm.com (PPS) with ESMTPS id 3tvu6kd137-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Thu, 26 Oct 2023 10:19:36 +0000 Received: from smtpav04.fra02v.mail.ibm.com (smtpav04.fra02v.mail.ibm.com [10.20.54.103]) by smtprelay01.fra02v.mail.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id 39QAJYnv15008352 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Thu, 26 Oct 2023 10:19:34 GMT Received: from smtpav04.fra02v.mail.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 96EDC20063; Thu, 26 Oct 2023 10:19:34 +0000 (GMT) Received: from smtpav04.fra02v.mail.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 39FE62007C; Thu, 26 Oct 2023 10:19:32 +0000 (GMT) Received: from sapthagiri.in.ibm.com (unknown [9.109.198.113]) by smtpav04.fra02v.mail.ibm.com (Postfix) with ESMTP; Thu, 26 Oct 2023 10:19:32 +0000 (GMT) From: Srikar Dronamraju To: Michael Ellerman , Nicholas Piggin , Christophe Leroy Cc: linuxppc-dev , Srikar Dronamraju , linux-kernel@vger.kernel.org, Mark Rutland , "ndesaulniers@google.com" , "Paul E. McKenney" , "Peter Zijlstra (Intel)" , Rohan McLure , Valentin Schneider Subject: [PATCH v3 1/5] powerpc/smp: Enable Asym packing for cores on shared processor Date: Thu, 26 Oct 2023 15:48:36 +0530 Message-ID: <20231026101843.56784-2-srikar@linux.vnet.ibm.com> X-Mailer: git-send-email 2.41.0 In-Reply-To: <20231026101843.56784-1-srikar@linux.vnet.ibm.com> References: <20231026101843.56784-1-srikar@linux.vnet.ibm.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-TM-AS-GCONF: 00 X-Proofpoint-GUID: b1_Afhd855bKBcsXoZutgBbOx5d4_kg0 X-Proofpoint-ORIG-GUID: aL2h6E4vKUQo0rgGMIpbgHT8PY5JSp1Z X-Proofpoint-Virus-Version: vendor=baseguard engine=ICAP:2.0.272,Aquarius:18.0.980,Hydra:6.0.619,FMLib:17.11.176.26 definitions=2023-10-26_07,2023-10-25_01,2023-05-22_02 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 mlxscore=0 lowpriorityscore=0 priorityscore=1501 impostorscore=0 mlxlogscore=999 adultscore=0 suspectscore=0 malwarescore=0 clxscore=1011 bulkscore=0 spamscore=0 phishscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2310170001 definitions=main-2310260086 X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_EF,RCVD_IN_DNSWL_BLOCKED,RCVD_IN_MSPIKE_H4, RCVD_IN_MSPIKE_WL,SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (snail.vger.email [0.0.0.0]); Thu, 26 Oct 2023 03:20:16 -0700 (PDT) If there are shared processor LPARs, underlying Hypervisor can have more virtual cores to handle than actual physical cores. Starting with Power 9, a big core (aka SMT8 core) has 2 nearly independent thread groups. On a shared processors LPARs, it helps to pack threads to lesser number of cores so that the overall system performance and utilization improves. PowerVM schedules at a big core level. Hence packing to fewer cores helps. For example: Lets says there are two 8-core Shared LPARs that are actually sharing a 8 Core shared physical pool, each running 8 threads each. Then Consolidating 8 threads to 4 cores on each LPAR would help them to perform better. This is because each of the LPAR will get 100% time to run applications and there will no switching required by the Hypervisor. To achieve this, enable SD_ASYM_PACKING flag at CACHE, MC and DIE level when the system is running in shared processor mode and has big cores. Signed-off-by: Srikar Dronamraju --- Changelog: v1->v2: Using Jump label instead of a variable. v2 -> v3: - Handle comments on commit message (Michael Ellerman) - Rework using existing cpu_has_features static key (Michael Ellerman) - Added a comment on why we do asym_packing at core (Peter Zijlstra) arch/powerpc/kernel/smp.c | 32 ++++++++++++++++++++++++-------- 1 file changed, 24 insertions(+), 8 deletions(-) diff --git a/arch/powerpc/kernel/smp.c b/arch/powerpc/kernel/smp.c index 5826f5108a12..dbf0a584804b 100644 --- a/arch/powerpc/kernel/smp.c +++ b/arch/powerpc/kernel/smp.c @@ -988,18 +988,22 @@ static int __init init_thread_group_cache_map(int cpu, int cache_property) } static bool shared_caches; +/* + * On shared processor LPARs scheduled on a big core (which has two or more + * independent thread groups per core), prefer lower numbered CPUs, so + * that workload consolidates to lesser number of cores. + */ +static __ro_after_init DEFINE_STATIC_KEY_FALSE(splpar_asym_pack); #ifdef CONFIG_SCHED_SMT /* cpumask of CPUs with asymmetric SMT dependency */ static int powerpc_smt_flags(void) { - int flags = SD_SHARE_CPUCAPACITY | SD_SHARE_PKG_RESOURCES; + if (cpu_has_feature(CPU_FTR_ASYM_SMT) || + static_branch_unlikely(&splpar_asym_pack)) + return SD_SHARE_CPUCAPACITY | SD_SHARE_PKG_RESOURCES | SD_ASYM_PACKING; - if (cpu_has_feature(CPU_FTR_ASYM_SMT)) { - printk_once(KERN_INFO "Enabling Asymmetric SMT scheduling\n"); - flags |= SD_ASYM_PACKING; - } - return flags; + return SD_SHARE_CPUCAPACITY | SD_SHARE_PKG_RESOURCES; } #endif @@ -1011,9 +1015,20 @@ static int powerpc_smt_flags(void) */ static int powerpc_shared_cache_flags(void) { + if (static_branch_unlikely(&splpar_asym_pack)) + return SD_SHARE_PKG_RESOURCES | SD_ASYM_PACKING; + return SD_SHARE_PKG_RESOURCES; } +static int powerpc_shared_proc_flags(void) +{ + if (static_branch_unlikely(&splpar_asym_pack)) + return SD_ASYM_PACKING; + + return 0; +} + /* * We can't just pass cpu_l2_cache_mask() directly because * returns a non-const pointer and the compiler barfs on that. @@ -1050,8 +1065,8 @@ static struct sched_domain_topology_level powerpc_topology[] = { { cpu_smt_mask, powerpc_smt_flags, SD_INIT_NAME(SMT) }, #endif { shared_cache_mask, powerpc_shared_cache_flags, SD_INIT_NAME(CACHE) }, - { cpu_mc_mask, SD_INIT_NAME(MC) }, - { cpu_cpu_mask, SD_INIT_NAME(DIE) }, + { cpu_mc_mask, powerpc_shared_proc_flags, SD_INIT_NAME(MC) }, + { cpu_cpu_mask, powerpc_shared_proc_flags, SD_INIT_NAME(DIE) }, { NULL, }, }; @@ -1686,7 +1701,13 @@ static void __init fixup_topology(void) { int i; + if (is_shared_processor() && has_big_cores) + static_branch_enable(&splpar_asym_pack); + #ifdef CONFIG_SCHED_SMT + if (cpu_has_feature(CPU_FTR_ASYM_SMT)) + pr_info_once("Enabling Asymmetric SMT scheduling\n"); + if (has_big_cores) { pr_info("Big cores detected but using small core scheduling\n"); powerpc_topology[smt_idx].mask = smallcore_smt_mask; -- 2.31.1