Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 310C8C433EF for ; Thu, 9 Dec 2021 17:46:09 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237853AbhLIRtm (ORCPT ); Thu, 9 Dec 2021 12:49:42 -0500 Received: from us-smtp-delivery-124.mimecast.com ([170.10.129.124]:25538 "EHLO us-smtp-delivery-124.mimecast.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S237989AbhLIRtf (ORCPT ); Thu, 9 Dec 2021 12:49:35 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1639071960; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=4Lo6dpXi15zAMgWKrGSNjO6+8EwzMzVM9I8nYMM1AGg=; b=il/2CTWhQuTcWsKqPFchhClRgcvRg/j1HzmkLAJKmDeOZsQs3kzpmMeiOtshXY3cfWw6d1 qp8xQ5zEpkI0fX6NNJM3WPFdPGwhelzsXQa5L/7JrdmqNYTY7YoJaoEWCn37uxFLpcyo9v UcRPz2o3WXB1MZKEIDUHk5PReDhotxI= Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-455-Wz9cGJ3mO7-GzAulEl4VkQ-1; Thu, 09 Dec 2021 12:45:56 -0500 X-MC-Unique: Wz9cGJ3mO7-GzAulEl4VkQ-1 Received: from smtp.corp.redhat.com (int-mx02.intmail.prod.int.phx2.redhat.com [10.5.11.12]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx01.redhat.com (Postfix) with ESMTPS id 77650190B2A1; Thu, 9 Dec 2021 17:45:54 +0000 (UTC) Received: from fuller.cnet (ovpn-112-3.gru2.redhat.com [10.97.112.3]) by smtp.corp.redhat.com (Postfix) with ESMTPS id EE2C060BF1; Thu, 9 Dec 2021 17:45:53 +0000 (UTC) Received: by fuller.cnet (Postfix, from userid 1000) id 9D445415EE65; Thu, 9 Dec 2021 14:45:35 -0300 (-03) Date: Thu, 9 Dec 2021 14:45:35 -0300 From: Marcelo Tosatti To: Mel Gorman Cc: Nicolas Saenz Julienne , akpm@linux-foundation.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, frederic@kernel.org, tglx@linutronix.de, peterz@infradead.org, nilal@redhat.com, linux-rt-users@vger.kernel.org, vbabka@suse.cz, cl@linux.com, ppandit@redhat.com Subject: Re: [PATCH v2 3/3] mm/page_alloc: Remotely drain per-cpu lists Message-ID: <20211209174535.GA70283@fuller.cnet> References: <20211103170512.2745765-1-nsaenzju@redhat.com> <20211103170512.2745765-4-nsaenzju@redhat.com> <20211203141306.GG3301@suse.de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20211203141306.GG3301@suse.de> User-Agent: Mutt/1.10.1 (2018-07-13) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.12 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Dec 03, 2021 at 02:13:06PM +0000, Mel Gorman wrote: > On Wed, Nov 03, 2021 at 06:05:12PM +0100, Nicolas Saenz Julienne wrote: > > Some setups, notably NOHZ_FULL CPUs, are too busy to handle the per-cpu > > drain work queued by __drain_all_pages(). So introduce new a mechanism > > to remotely drain the per-cpu lists. It is made possible by remotely > > locking 'struct per_cpu_pages' new per-cpu spinlocks. A benefit of this > > new scheme is that drain operations are now migration safe. > > > > There was no observed performance degradation vs. the previous scheme. > > Both netperf and hackbench were run in parallel to triggering the > > __drain_all_pages(NULL, true) code path around ~100 times per second. > > The new scheme performs a bit better (~5%), although the important point > > here is there are no performance regressions vs. the previous mechanism. > > Per-cpu lists draining happens only in slow paths. > > > > netperf and hackbench are not great indicators of page allocator > performance as IIRC they are more slab-intensive than page allocator > intensive. I ran the series through a few benchmarks and can confirm > that there was negligible difference to netperf and hackbench. > > However, on Page Fault Test (pft in mmtests), it is noticable. On a > 2-socket cascadelake machine I get > > pft timings > 5.16.0-rc1 5.16.0-rc1 > vanilla mm-remotedrain-v2r1 > Amean system-1 27.48 ( 0.00%) 27.85 * -1.35%* > Amean system-4 28.65 ( 0.00%) 30.84 * -7.65%* > Amean system-7 28.70 ( 0.00%) 32.43 * -13.00%* > Amean system-12 30.33 ( 0.00%) 34.21 * -12.80%* > Amean system-21 37.14 ( 0.00%) 41.51 * -11.76%* > Amean system-30 36.79 ( 0.00%) 46.15 * -25.43%* > Amean system-48 58.95 ( 0.00%) 65.28 * -10.73%* > Amean system-79 111.61 ( 0.00%) 114.78 * -2.84%* > Amean system-80 113.59 ( 0.00%) 116.73 * -2.77%* > Amean elapsed-1 32.83 ( 0.00%) 33.12 * -0.88%* > Amean elapsed-4 8.60 ( 0.00%) 9.17 * -6.66%* > Amean elapsed-7 4.97 ( 0.00%) 5.53 * -11.30%* > Amean elapsed-12 3.08 ( 0.00%) 3.43 * -11.41%* > Amean elapsed-21 2.19 ( 0.00%) 2.41 * -10.06%* > Amean elapsed-30 1.73 ( 0.00%) 2.04 * -17.87%* > Amean elapsed-48 1.73 ( 0.00%) 2.03 * -17.77%* > Amean elapsed-79 1.61 ( 0.00%) 1.64 * -1.90%* > Amean elapsed-80 1.60 ( 0.00%) 1.64 * -2.50%* > > It's not specific to cascade lake, I see varying size regressions on > different Intel and AMD chips, some better and worse than this result. > The smallest regression was on a single CPU skylake machine with a 2-6% > hit. Worst was Zen1 with a 3-107% hit. > > I didn't profile it to establish why but in all cases the system CPU > usage was much higher. It *might* be because the spinlock in > per_cpu_pages crosses a new cache line and it might be cold although the > penalty seems a bit high for that to be the only factor. > > Code-wise, the patches look fine but the apparent penalty for PFT is > too severe. Mel, Have you read Nicolas RCU patches? Date: Fri, 8 Oct 2021 18:19:19 +0200 From: Nicolas Saenz Julienne Subject: [RFC 0/3] mm/page_alloc: Remote per-cpu lists drain support RCU seems like a natural fit, we were wondering whether people see any fundamental problem with this approach.