Date: Wed, 18 Aug 2021 16:31:55 +0100
From: Mel Gorman
To: Linus Torvalds
Cc: kernel test robot, Sandeep Patil, Michael Kerrisk, LKML,
    lkp@lists.01.org, kernel test robot, "Huang, Ying", Feng Tang,
    Zhengjun Xing
Subject: Re: [pipe] 3a34b13a88: hackbench.throughput -12.6% regression
Message-ID: <20210818153155.GL6464@techsingularity.net>
References: <20210802024945.GA8372@xsang-OptiPlex-9020>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, Aug 02, 2021 at 10:14:14AM -0700, Linus Torvalds wrote:
> On Mon, Aug 2, 2021 at 10:06 AM Linus Torvalds wrote:
> >
> > Sandeep, does something like the attached patch (written to be on top
> > of the existing one) work for you?
>
> .. and by "attached patch" I obviously meant the patch that wasn't attached.
>
> It's here.
>

Hi Linus,

Our own test systems also detected the regression and automatically
bisected to 3a34b13a88 but it is not universal. For example, it's not
visible on a single-socket 8-CPU Skylake machine but it gets progressively
worse the more CPUs there are and potentially the more LLCs there are.
I say LLCs because the number of interrupts generated was significantly
higher which *might* be more IPIs being sent to tasks running on CPUs
that do not share target with the waker.

Your "attached" patch alleviates the problem. The worst results were on
a Zen3 machine (256 CPUs, lots of last level caches).

hackbench-process-pipes
                        5.14.0-rc3             5.14.0-rc3             5.14.0-rc3
                           vanilla        alwayswake-v1r1       specialcase-v1r1
Amean     1      0.3800 (   0.00%)      0.4520 * -18.95%*      0.3580 (   5.79%)
Amean     4      0.5107 (   0.00%)      0.6660 * -30.42%*      0.5080 (   0.52%)
Amean     7      0.5717 (   0.00%)      0.6387 * -11.72%*      0.5310 *   7.11%*
Amean    12      0.5853 (   0.00%)      0.8563 * -46.30%*      0.5457 (   6.78%)
Amean    21      0.7693 (   0.00%)      1.3150 * -70.93%*      0.7297 (   5.16%)
Amean    30      0.9667 (   0.00%)      1.8530 * -91.69%*      0.8547 (  11.59%)
Amean    48      1.3100 (   0.00%)      2.5027 * -91.04%*      1.3143 (  -0.33%)
Amean    79      2.0633 (   0.00%)      3.4983 * -69.55%*      2.1180 (  -2.65%)
Amean   110      2.8080 (   0.00%)      3.9927 * -42.19%*      2.7850 (   0.82%)
Amean   141      3.6237 (   0.00%)      4.6810 * -29.18%*      3.6663 (  -1.18%)
Amean   172      4.6480 (   0.00%)      5.6707 * -22.00%*      4.8003 (  -3.28%)
Amean   203      5.2630 (   0.00%)      6.5000 * -23.50%*      5.6100 (  -6.59%)
Amean   234      6.3290 (   0.00%)      7.1473 * -12.93%*      6.1570 (   2.72%)
Amean   265      7.0907 (   0.00%)      7.9200 * -11.70%*      7.1417 (  -0.72%)
Amean   296      7.7550 (   0.00%)      8.2590 *  -6.50%*      7.8167 (  -0.80%)

alwayswake-v1r1 is commit 3a34b13a88 cherry-picked on top of 5.14-rc3.
specialcase-v1r1 is your suggested patch on top.
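The percentage columns in the table appear to be the gain relative to the
vanilla kernel for a lower-is-better metric (hackbench run time), so a
negative value means the kernel took longer. The sketch below is only an
illustration of that arithmetic under this assumption; the function name
relative_gain and the choice of rows are hypothetical and are not taken from
the tooling that produced the report. Because the published means are rounded
to four decimal places, recomputed values can differ from the table in the
last digit.

```python
def relative_gain(baseline: float, candidate: float) -> float:
    """Percentage gain of candidate over baseline when lower is better.

    Assumed formula (not confirmed by the report): a positive result means
    the candidate ran faster than the baseline, a negative one slower.
    """
    return (baseline - candidate) / baseline * 100.0


# A few rows copied from the table: groups -> (vanilla, alwayswake, specialcase)
rows = {
    1: (0.3800, 0.4520, 0.3580),
    30: (0.9667, 1.8530, 0.8547),
    296: (7.7550, 8.2590, 7.8167),
}

for groups, (vanilla, alwayswake, specialcase) in rows.items():
    print(
        f"Amean {groups:4d}  "
        f"alwayswake {relative_gain(vanilla, alwayswake):8.2f}%  "
        f"specialcase {relative_gain(vanilla, specialcase):8.2f}%"
    )
```

Under this reading, the alwayswake-v1r1 column is the slowdown caused by
commit 3a34b13a88 and the specialcase-v1r1 column shows how much of it the
suggested patch recovers.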
The * marks results that are likely statistically significant, so with
your suggested patch most of the results are in the noise.

While this is Zen3, the same problem exists on at least 2-socket
cascadelake, 2-socket skylake and 2-socket broadwell systems, although
not to the same degree (different CPU counts and topology).

I know hackbench is not particularly interesting but it's used often
enough when comparing kernels and patches that commit 3a34b13a88 will
be a regression magnet.

-- 
Mel Gorman
SUSE Labs