Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id BC1C1C7EE23 for ; Mon, 27 Feb 2023 16:50:24 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230220AbjB0QuX (ORCPT ); Mon, 27 Feb 2023 11:50:23 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:58846 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229515AbjB0QuU (ORCPT ); Mon, 27 Feb 2023 11:50:20 -0500 Received: from mail-yw1-x1135.google.com (mail-yw1-x1135.google.com [IPv6:2607:f8b0:4864:20::1135]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 22FC61042B for ; Mon, 27 Feb 2023 08:50:19 -0800 (PST) Received: by mail-yw1-x1135.google.com with SMTP id 00721157ae682-536be69eadfso193902727b3.1 for ; Mon, 27 Feb 2023 08:50:19 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=yZeTsz2LgmEqTFu2D1y6H+6n8BeVxT4dDCxIvwbJYZY=; b=jMbv2r0SjzyLfd/109qTg+Zt8ATsSAGsZlaSIpfCNwq5MIkuMOmLkPrv5s0SX2Niqc MW13m5uCGpMy+WActWyp/aBFjIsABYD7rOJZxYz+L8ev/at3VpEsOl1aSGD/fg9fAxua eJ06FHs9y2Gd2u2RJdup1zDKoxef+VlmA1id7W+V1aNeMurRTMCoKUeuxbOl/2FJMH+K 69ocsLyGP1GZMP839JOIlS0Gn2tFsDBARF/ntfFuAjaNjJAF8gfgd+ru+qMGlRREUcd8 PoQCRdH5cvAnSFFw/+0AoDeKZqRUsNif+iN7CSIiwJgxom+C+kB7bPml9a7g3HDM5xhG H+dg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=yZeTsz2LgmEqTFu2D1y6H+6n8BeVxT4dDCxIvwbJYZY=; b=sGiF4em1NRX579QsHl3uwLk337bfYW5tTAwxitGm/M/ygqrRlqAFEcdwGEk8SKo/0k iBshKcRrZl7olvjhhI7z3n8c7kROJy6LgxsnCm0bMVY/FMObPM3zVDWVTi3w9jcN/sc3 LkFI6p7N3CMhvJ3Qs+E7WY0EgC1uWifUMoLVioKZhsLod8i4ZcRPx4UGJ7+A6IGEJGqq BVKu1G+APUQzwBg/ov+2N9jEU0y29GBxH2iRxcrLQz2EGAFSswkVgq8XBQqiTQkRkoUw dv09DCgVdkWY/LevhqwGtBeaV6PwqTJBy1r1KAVWPZgPZsJFb9kJn67ylMPEK99do/S3 qbUA== X-Gm-Message-State: AO0yUKUSWMlduPVGKgx9hG5SyKnDvYH8mDNF+/Sb7rBxvsbpiPf6bW7i myvkte2DuI+b60hhauJY3oMkGbJdBTeMONqDsyHv8w== X-Google-Smtp-Source: AK7set8lTXaiez4ACEuOb2wNkB4Dm/JTTL0C1kWWQrm108DPBrKq3P5Ta7PTFE2EKJWgLgv4skNklgr9Jbg7qnJNPeI= X-Received: by 2002:a81:ae52:0:b0:536:155a:b73d with SMTP id g18-20020a81ae52000000b00536155ab73dmr10845609ywk.2.1677516618212; Mon, 27 Feb 2023 08:50:18 -0800 (PST) MIME-Version: 1.0 References: <202301301057.e55dad5b-oliver.sang@intel.com> <20230131052352.5qnqegzwmt7akk7t@google.com> <20230131055743.tsilxx5vfl6gx4dj@google.com> In-Reply-To: From: Shakeel Butt Date: Mon, 27 Feb 2023 08:50:07 -0800 Message-ID: Subject: Re: [linus:master] [mm] f1a7941243: unixbench.score -19.2% regression To: "Yin, Fengwei" Cc: "willy@infradead.org" , "tj@kernel.org" , "dennis@kernel.org" , "akpm@linux-foundation.org" , lkp , "zhengjun.xing@linux.intel.com" , "Huang, Ying" , "linux-mm@kvack.org" , "Tang, Feng" , "linux-kernel@vger.kernel.org" , "m.szyprowski@samsung.com" , "Sang, Oliver" , "linux-trace-kernel@vger.kernel.org" , "oe-lkp@lists.linux.dev" Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Feb 26, 2023 at 10:35=E2=80=AFPM Yin, Fengwei wrote: > > Hi Andrew, Shakeel, > > On Tue, 2023-01-31 at 10:26 -0800, Shakeel Butt wrote: > > +per-cpu memory maintainers for FYI. > > > > Thread started at > > https://lore.kernel.org/linux-mm/202301301057.e55dad5b-oliver.sang@inte= l.com/ > > > > On Mon, Jan 30, 2023 at 9:57 PM Shakeel Butt > > wrote: > > > > > [...] > > > > > > > > We could cut down the number of calls to pcpu_alloc() by a factor > > > > of 4 > > > > by having a pcpu_alloc_bulk() that would allocate all four RSS > > > > counters > > > > at once. > > > > > > > > Just throwing out ideas ... > > > > > > Thanks, I will take a stab at pcpu_alloc_bulk() and will share the > > > result tomorrow. > > > > > > > OK, not a one day effort. > > > > Andrew, which option do you prefer? > > > > 1. Keep the patches as the test workload (fork ping pong) is not a > > representative of real world workload and work on improving > > pcpu_alloc() for 6.4+. > > > > 2. Revert the patches for now, improve pcpu_alloc() and re-introduce > > the patches once we confirm that fork-ping-pong is not regressed > > anymore. > This performance regression still can be reproduced on latest master > branch. So we took option1 here. Right? Thanks. > Yes unless some real workload reports regression. Shakeel