Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp3831803pxb; Tue, 26 Jan 2021 06:04:21 -0800 (PST) X-Google-Smtp-Source: ABdhPJzr3D4HixeftO8P/WNjffuIUKXDNvrVi/OZjhpAUFEeGr2iWLwQVEv90hU9DkKNX2yeqp7H X-Received: by 2002:a05:6402:558:: with SMTP id i24mr4710866edx.141.1611669861030; Tue, 26 Jan 2021 06:04:21 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1611669861; cv=none; d=google.com; s=arc-20160816; b=qedZf9eJFlvfs1F9lyv0p8+1y8FIROqFZRmhZzRq+Ahc2aH3YZpDcORdjfEMF/DzGo VewKC/cZpNUr5twEIbOjKSedaysgYmZXifvA0uwx8v4nR+FfelYBfF4xYk5onKEF/6Nq IysEO03JJDCnJBDgk2uM/oRwIykBR5EBUtO58GXlIq6qfv3qkQTkRTQQjvADEV0TrruL wwNsbDFGS3S6NBrBszH67FTEj/R8/XLOv3Y5JuS4U5Jiw0UMOUkGSUgzYt8WzOM7l1WG 7GFFo2Ik0FgzAR4fKLCXAZXWxJV9Ks35uShPSbbc+KG7bQtafq9swKikXZuoYLBLXUKd tVvA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=JCAbd7z4tqnsZ/4DW7kFHtVQefHj8K36GoZRlW3XnSo=; b=SIPG2nXNlp49nb3mUbFqHPUpZXno7i9322ZpX/HfBQSTqe/b9rklBXUvqiidNReJxU Q529l8AZGJ+4ynm6nMkFr/HGxuIL37Qjp9oZKsdI3Omklx2j8si4R2vQX4BJvfq1/hay fwS/z7N5bNCnbkIPW1QCM18BB9vJcKigIxG1E/Kh2JXl6jUww/MQhvYjNW1xdTzcpRVK 4E0yFXuB7LWgyBk+NtKMdFKJ5Tn6ki0kjQsmsGXiUGiaQLJPYbV8elhN3arbK+GPZXcz mGpxBmgyqHHCpzyoCJtSDuHnI2cqUCXlPjHtwq+5n1dRrBW1J1BDgGouUwiVKZgaDFhr 9OTA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=ZVLCr3UY; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=NONE dis=NONE) header.from=suse.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id se3si7004854ejb.303.2021.01.26.06.03.50; Tue, 26 Jan 2021 06:04:21 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=ZVLCr3UY; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=NONE dis=NONE) header.from=suse.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2405629AbhAZOBb (ORCPT + 99 others); Tue, 26 Jan 2021 09:01:31 -0500 Received: from mx2.suse.de ([195.135.220.15]:55868 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2404105AbhAZOAJ (ORCPT ); Tue, 26 Jan 2021 09:00:09 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=susede1; t=1611669562; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=JCAbd7z4tqnsZ/4DW7kFHtVQefHj8K36GoZRlW3XnSo=; b=ZVLCr3UY2N0PhZ0VDIfebHpRaQ/hMHCTcYTjnRQBQG3Yag66LWNdyr/5XdFhp6daJM2I3E x+5YvjS6MlrU8aDPbmWDJT3dE0/o5r5HoFgftOyRjNeyTPYIOud0km9wUY65Sk/ak27xq+ 1HTlodXIwUGChEPqDjTdqe0j0DyWiA4= Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id C8F26AB9F; Tue, 26 Jan 2021 13:59:21 +0000 (UTC) Date: Tue, 26 Jan 2021 14:59:18 +0100 From: Michal Hocko To: Vincent Guittot Cc: Vlastimil Babka , Christoph Lameter , Bharata B Rao , linux-kernel , linux-mm@kvack.org, David Rientjes , Joonsoo Kim , Andrew Morton , guro@fb.com, Shakeel Butt , Johannes Weiner , aneesh.kumar@linux.ibm.com, Jann Horn Subject: Re: [RFC PATCH v0] mm/slub: Let number of online CPUs determine the slub page order Message-ID: <20210126135918.GQ827@dhcp22.suse.cz> References: <20201118082759.1413056-1-bharata@linux.ibm.com> <20210121053003.GB2587010@in.ibm.com> <20210126085243.GE827@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 26-01-21 14:38:14, Vincent Guittot wrote: > On Tue, 26 Jan 2021 at 09:52, Michal Hocko wrote: > > > > On Thu 21-01-21 19:19:21, Vlastimil Babka wrote: > > [...] > > > We could also start questioning the very assumption that number of cpus should > > > affect slab page size in the first place. Should it? After all, each CPU will > > > have one or more slab pages privately cached, as we discuss in the other > > > thread... So why make the slab pages also larger? > > > > I do agree. What is the acutal justification for this scaling? > > /* > > * Attempt to find best configuration for a slab. This > > * works by first attempting to generate a layout with > > * the best configuration and backing off gradually. > > * > > * First we increase the acceptable waste in a slab. Then > > * we reduce the minimum objects required in a slab. > > */ > > > > doesn't speak about CPUs. 9b2cd506e5f2 ("slub: Calculate min_objects > > based on number of processors.") does talk about hackbench "This has > > been shown to address the performance issues in hackbench on 16p etc." > > but it doesn't give any more details to tell actually _why_ that works. > > > > This thread shows that this is still somehow related to performance but > > the real reason is not clear. I believe we should be focusing on the > > actual reasons for the performance impact than playing with some fancy > > math and tuning for a benchmark on a particular machine which doesn't > > work for others due to subtle initialization timing issues. > > > > Fundamentally why should higher number of CPUs imply the size of slab in > > the first place? > > A 1st answer is that the activity and the number of threads involved > scales with the number of CPUs. Regarding the hackbench benchmark as > an example, the number of group/threads raise to a higher level on the > server than on the small system which doesn't seem unreasonable. > > On 8 CPUs, I run hackbench with up to 16 groups which means 16*40 > threads. But I raise up to 256 groups, which means 256*40 threads, on > the 224 CPUs system. In fact, hackbench -g 1 (with 1 group) doesn't > regress on the 224 CPUs system. The next test with 4 groups starts > to regress by -7%. But the next one: hackbench -g 16 regresses by 187% > (duration is almost 3 times longer). It seems reasonable to assume > that the number of running threads and resources scale with the number > of CPUs because we want to run more stuff. OK, I do understand that more jobs scale with the number of CPUs but I would also expect that higher order pages are generally more expensive to get so this is not really a clear cut especially under some more demand on the memory where allocations are smooth. So the question really is whether this is not just optimizing for artificial conditions. -- Michal Hocko SUSE Labs