Received: by 2002:a05:6902:102b:0:0:0:0 with SMTP id x11csp1458508ybt; Thu, 9 Jul 2020 07:33:22 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwDECoLzue0/CL6qlA/cUTjUHCGcnvTp97EIKT/sSY1k2R+qOHnjCjwR1DvKnfoJPjOBrOi X-Received: by 2002:a05:6402:134e:: with SMTP id y14mr73028654edw.4.1594305202473; Thu, 09 Jul 2020 07:33:22 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1594305202; cv=none; d=google.com; s=arc-20160816; b=ssv+h0zxRkGkgMn20HyLplkFqEYL97hsySIbQY9ir5q6CJcYaGL68dZ5cu/llN3Ol1 tY0UgCEaoAuZQgJZdGpwhg5VFYigDgF2X0lXADOT9YlZhI604kKZ8geDklpVbZLH4c0g k78VfpJiOd9y1dcC7IyYdCLoGgLPvXlPeLMC7kPVzYXJZ3nG1gqiOPVHy6eqfuWc3iJQ obmc1OTbQn6n9OWYR/u5ycNyRpYwUdYOHYDyxOUemnNcJUeuYl2opYqauuHXmERc4/8s X5I9d2ngHiGO+pynL7oXQiQdMITXRf4PqDeHmmyYxdNxgabgHnNGDzi1x+k5FMoOlNZi r6GA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:references :message-id:in-reply-to:subject:cc:to:from:date; bh=lVTgb/uwS75Q/rgLpDWBdC1Ng61spnTYEUQtrs+xYEY=; b=o0ij/RUagYrCDx9RMLg+pz2yt3QXklP9k5IZoBjMrPwFbAawIOcEi//O4HussRwGvJ NQy9i13+VJV6D6p3Sr9qcGd5w3+qa8cux1+A+lGLmepQUkMSNz379pK9nm46jWHm576Z 1rIfZuAhimKhsNDjIu8GLzt0qHEKQJVTIK9SD/YpOA+eTvazy+R5te/gH08k8QoMTdkD 156S0VAB2/jMfbx+OEO7qQRg1PBKvwpnuJkQTBChBeg5y8FAL173eyc8nP6Gkr+Fh/JM LwqKNq+RdaGYNiQapp/tcwLnC8wazntvNNkSU3SfgcNr9mW2DXKCQSXtuJOVbCj97pFZ geAw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id dg8si1999877edb.184.2020.07.09.07.32.58; Thu, 09 Jul 2020 07:33:22 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726617AbgGIOcg (ORCPT + 99 others); Thu, 9 Jul 2020 10:32:36 -0400 Received: from gentwo.org ([3.19.106.255]:51906 "EHLO gentwo.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726353AbgGIOcg (ORCPT ); Thu, 9 Jul 2020 10:32:36 -0400 Received: by gentwo.org (Postfix, from userid 1002) id 026D63F1F2; Thu, 9 Jul 2020 14:32:33 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by gentwo.org (Postfix) with ESMTP id 000053F1EC; Thu, 9 Jul 2020 14:32:33 +0000 (UTC) Date: Thu, 9 Jul 2020 14:32:33 +0000 (UTC) From: Christopher Lameter X-X-Sender: cl@www.lameter.com To: Pekka Enberg cc: Xunlei Pang , Andrew Morton , Wen Yang , Yang Shi , Roman Gushchin , "linux-mm@kvack.org" , LKML Subject: Re: [PATCH 1/2] mm/slub: Introduce two counters for the partial objects In-Reply-To: Message-ID: References: <1593678728-128358-1-git-send-email-xlpang@linux.alibaba.com> <7374a9fd-460b-1a51-1ab4-25170337e5f2@linux.alibaba.com> User-Agent: Alpine 2.22 (DEB 394 2020-01-19) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 7 Jul 2020, Pekka Enberg wrote: > On Fri, Jul 3, 2020 at 12:38 PM xunlei wrote: > > > > On 2020/7/2 PM 7:59, Pekka Enberg wrote: > > > On Thu, Jul 2, 2020 at 11:32 AM Xunlei Pang wrote: > > >> The node list_lock in count_partial() spend long time iterating > > >> in case of large amount of partial page lists, which can cause > > >> thunder herd effect to the list_lock contention, e.g. it cause > > >> business response-time jitters when accessing "/proc/slabinfo" > > >> in our production environments. > > > > > > Would you have any numbers to share to quantify this jitter? I have no > > > > We have HSF RT(High-speed Service Framework Response-Time) monitors, the > > RT figures fluctuated randomly, then we deployed a tool detecting "irq > > off" and "preempt off" to dump the culprit's calltrace, capturing the > > list_lock cost up to 100ms with irq off issued by "ss", this also caused > > network timeouts. > > Thanks for the follow up. This sounds like a good enough motivation > for this patch, but please include it in the changelog. Well this is access via sysfs causing a holdoff. Another way of access to the same information without adding atomics and counters would be best. > > I also have no idea what's the standard SLUB benchmark for the > > regression test, any specific suggestion? > > I don't know what people use these days. When I did benchmarking in > the past, hackbench and netperf were known to be slab-allocation > intensive macro-benchmarks. Christoph also had some SLUB > micro-benchmarks, but I don't think we ever merged them into the tree. They are still where they have been for the last decade or so. In my git tree on kernel.org. They were also reworked a couple of times and posted to linux-mm. There are historical posts going back over the years where individuals have modified them and used them to create multiple other tests.