Received: by 2002:a05:6a10:9afc:0:0:0:0 with SMTP id t28csp735637pxm; Fri, 25 Feb 2022 18:43:34 -0800 (PST) X-Google-Smtp-Source: ABdhPJwIy09vlArsjOb5H4BdWGQPnYQr1/beo6/A/MSCrPDLg9uSFlALpkERnJxh5+Ol9tQIsJTy X-Received: by 2002:a05:6871:78b:b0:d4:2636:b26 with SMTP id o11-20020a056871078b00b000d426360b26mr2865662oap.14.1645843414597; Fri, 25 Feb 2022 18:43:34 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1645843414; cv=none; d=google.com; s=arc-20160816; b=V4cB1s6oL3QVKndn9+ImBOXJSsEXDxU2fx9mf9DwgJruATE+doQza0mraXNke86p5J LXC8L0juubogJ+4E5aI2SLF1CwNO5qcLI9TqTYu7jme/H2FwssObipfc+lLPf30eUI0/ 4btOGg5kA0KrPbXvvGreuE8Z13vytBL0RG/VmAomEMhtoOWVwodQtKMAsC5fhZZ78Gzb VMbmCES/Yv74OUnZi465Ci5XneLoe+uf+QvhjK6NPOTNmtIhnPY7tinDgayleJ6TLgS5 MP1jgFNRDsK7eP2LPkQWg8E+Rx/8WUxrLLz9yo3wdwVc5XSTxO25n0wKNQ5wvYbzUptB mlOg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:to:from:date :dkim-signature; bh=RGt8DQEifPBrHzvfkid/O+zzG/dPgSOS/WseA1w/v1Q=; b=k6nMsOeaU4lodWTraOg6FvqlcIRpAY3y/7lC3gs6SDFvRcPMZxBFr57Y29tOB/YQk2 e2hMc7SySUDDbb8kW2oZFfT/n0MaSamntMfwI3kwE31mWsAMvd1R+Woxnj053Z0/qCs+ qr+uFUTnerUXJcesuwgmlQSJFRoq+ifhm8kKqyWBjoQEttNj899L3/jejMw+1oPiR9/Z bRIK8lhGxTqlTMThoCWSIu5WeRP2miKp6EvS9DlB1TvJhuSImMxbGgCMT2KJuiguEQU/ kXa/WamuVY+oUOwAyt2q2stFXnm9rV/Cgo4yRJ1CmYdwhCbNf5gu8f7nC7oSIzxaWF+a Lgig== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@linux-foundation.org header.s=korg header.b=zmWl8r1z; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id q17-20020a05683022d100b005ad51f38f26si2352110otc.108.2022.02.25.18.43.33 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 25 Feb 2022 18:43:34 -0800 (PST) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=fail header.i=@linux-foundation.org header.s=korg header.b=zmWl8r1z; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id C9F4C2404D9; Fri, 25 Feb 2022 18:07:14 -0800 (PST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233033AbiBZBU7 (ORCPT + 99 others); Fri, 25 Feb 2022 20:20:59 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37388 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229757AbiBZBU5 (ORCPT ); Fri, 25 Feb 2022 20:20:57 -0500 Received: from ams.source.kernel.org (ams.source.kernel.org [IPv6:2604:1380:4601:e00::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3C9082261CD; Fri, 25 Feb 2022 17:20:24 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id EBD8FB833C1; Sat, 26 Feb 2022 01:20:22 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 2C31EC340E7; Sat, 26 Feb 2022 01:20:21 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linux-foundation.org; s=korg; t=1645838421; bh=MNZX72o3nFMbB/Wwvsuo/JcaX7p+1wDXXsOIDM8WRLU=; h=Date:From:To:Subject:In-Reply-To:References:From; b=zmWl8r1zxiQ0jrS8v8GX2dMS2IbwAeJT3rJc4OKcvYDvfgriT/2Uh+pJMTxjwkmga dec2ncVjw75/YQsfaJ41+6Vkm0UlnLtet/OAZkUCcbZEBLuq/+WccgxsVMq89VFcHB 3RaCOTxB6U4AZBiA/00lgniy6doBCdZfikWpFUHY= Date: Fri, 25 Feb 2022 17:20:20 -0800 From: Andrew Morton To: Shakeel Butt , =?ISO-8859-1?Q? "Michal_Koutn=FD" ?= , Johannes Weiner , Michal Hocko , Roman Gushchin , Ivan Babrou , cgroups@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, Daniel Dao , stable@vger.kernel.org Subject: Re: [PATCH] memcg: async flush memcg stats from perf sensitive codepaths Message-Id: <20220225172020.b3e59e11a0a3dd15e0d34141@linux-foundation.org> In-Reply-To: <20220225165842.561d3a475310aeab86a2d653@linux-foundation.org> References: <20220226002412.113819-1-shakeelb@google.com> <20220225165842.561d3a475310aeab86a2d653@linux-foundation.org> X-Mailer: Sylpheed 3.7.0 (GTK+ 2.24.33; x86_64-redhat-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-1.7 required=5.0 tests=BAYES_00,DKIM_INVALID, DKIM_SIGNED,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, NICE_REPLY_A,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 25 Feb 2022 16:58:42 -0800 Andrew Morton wrote: > On Fri, 25 Feb 2022 16:24:12 -0800 Shakeel Butt wrote: > > > Daniel Dao has reported [1] a regression on workloads that may trigger > > a lot of refaults (anon and file). The underlying issue is that flushing > > rstat is expensive. Although rstat flush are batched with (nr_cpus * > > MEMCG_BATCH) stat updates, it seems like there are workloads which > > genuinely do stat updates larger than batch value within short amount of > > time. Since the rstat flush can happen in the performance critical > > codepaths like page faults, such workload can suffer greatly. > > > > The easiest fix for now is for performance critical codepaths trigger > > the rstat flush asynchronously. This patch converts the refault codepath > > to use async rstat flush. In addition, this patch has premptively > > converted mem_cgroup_wb_stats and shrink_node to also use the async > > rstat flush as they may also similar performance regressions. > > Gee we do this trick a lot and gee I don't like it :( > > a) if we're doing too much work then we're doing too much work. > Punting that work over to a different CPU or thread doesn't alter > that - it in fact adds more work. > > b) there's an assumption here that the flusher is able to keep up > with the producer. What happens if that isn't the case? Do we > simply wind up the deferred items until the system goes oom? > > What happens if there's a producer running on each CPU? Can the > flushers keep up? > > Pathologically, what happens if the producer is running > task_is_realtime() on a single-CPU system? Or if there's a > task_is_realtime() producer running on every CPU? The flusher never > gets to run and we're dead? Not some theoretical thing, btw. See how __read_swap_cache_async() just got its sins exposed by real-time tasks: https://lkml.kernel.org/r/20220221111749.1928222-1-cgel.zte@gmail.com