Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp2029302pxa; Mon, 24 Aug 2020 03:05:28 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxRPjPcTwpCx9aj2+psu6k3NwkCQTyqrtbQHwxERB757yA3e7k5NTJGeWbiGGmHYzFqfBgr X-Received: by 2002:a17:906:c7cd:: with SMTP id dc13mr4721563ejb.446.1598263528556; Mon, 24 Aug 2020 03:05:28 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1598263528; cv=none; d=google.com; s=arc-20160816; b=XJwzlPqPpQbT3K/Z+PFFld1Rv0kdUfeQTeLRes7XjZo7Hu8DpvZ3GAsBqurJREfHQC qiZAvbxBXjYKVDIbwVidZl7H++/y+iUplZN3gmoZXhL50ebM2cCcURZGkQrwyA/JPZBS dKek6uxiitCKASkbBO1KRzKHsEySIKqTN1o0vKpt/lfuAgSi7rUS8Nkpn7QLv9CdI6hg 8CYXr+INFMcdXIDQi/W+6XWf7WgJjEvU+NdA6IqZiVozGywoUp/QlDDbdlugJ8mybUWJ n9wDkY3/8qXHyyaToKEgv6u+28HbWFMxS9JexeRT7KHqsEv+BGBkSRUbBr5qyxAdpCwF 42eg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:from:references:cc:to :subject:reply-to; bh=u0vkW9HJSi3T2XER8FG7cIsdLR4/mDeBNWuVVFHs2yQ=; b=Kv4RmihjIN0lfTcSaubh8oYLSyESbUj2TKf8jl4yEYExsxtavpKubSM2BAKQtO+qeI 0BK7yXWqtKbiibCIHTt8CJjNDcCMO6IZ5Kiacsq1ie8yAHRVL4OM75RZIPPokACwtnXr 8qd8biRIKyHiQsjvUoMe+hxBwI88IXmx2L9FauB7QVF5sR+K3Ngk4UHnGzboknmYzl9P aZBNReJing5kHMIcmAY8rEvtb1q8vYPZ98HHpMjUOkxI2aF4T8tFJWkQZefE7ojd8rtw TR9etxYkDbN+P/vc1U2P3ZTMSl63xkIjlba+o6F4YVEfkJ2+csDJ76PDnU85kErzDNUp MgfQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id k21si6247780eje.480.2020.08.24.03.05.04; Mon, 24 Aug 2020 03:05:28 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726593AbgHXKEa (ORCPT + 99 others); Mon, 24 Aug 2020 06:04:30 -0400 Received: from out30-133.freemail.mail.aliyun.com ([115.124.30.133]:36598 "EHLO out30-133.freemail.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726086AbgHXKE3 (ORCPT ); Mon, 24 Aug 2020 06:04:29 -0400 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R741e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01f04427;MF=xlpang@linux.alibaba.com;NM=1;PH=DS;RN=9;SR=0;TI=SMTPD_---0U6giGpr_1598263465; Received: from xunleideMacBook-Pro.local(mailfrom:xlpang@linux.alibaba.com fp:SMTPD_---0U6giGpr_1598263465) by smtp.aliyun-inc.com(127.0.0.1); Mon, 24 Aug 2020 18:04:25 +0800 Reply-To: xlpang@linux.alibaba.com Subject: Re: [PATCH v2 0/3] mm/slub: Fix count_partial() problem To: Pekka Enberg Cc: Vlastimil Babka , Christoph Lameter , Wen Yang , Roman Gushchin , Konstantin Khlebnikov , David Rientjes , LKML , "linux-mm@kvack.org" References: <1597061872-58724-1-git-send-email-xlpang@linux.alibaba.com> From: xunlei Message-ID: Date: Mon, 24 Aug 2020 18:04:25 +0800 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:68.0) Gecko/20100101 Thunderbird/68.11.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020/8/20 下午10:02, Pekka Enberg wrote: > On Mon, Aug 10, 2020 at 3:18 PM Xunlei Pang wrote: >> >> v1->v2: >> - Improved changelog and variable naming for PATCH 1~2. >> - PATCH3 adds per-cpu counter to avoid performance regression >> in concurrent __slab_free(). >> >> [Testing] >> On my 32-cpu 2-socket physical machine: >> Intel(R) Xeon(R) CPU E5-2650 v2 @ 2.60GHz >> perf stat --null --repeat 10 -- hackbench 20 thread 20000 >> >> == original, no patched >> 19.211637055 seconds time elapsed ( +- 0.57% ) >> >> == patched with patch1~2 >> Performance counter stats for 'hackbench 20 thread 20000' (10 runs): >> >> 21.731833146 seconds time elapsed ( +- 0.17% ) >> >> == patched with patch1~3 >> Performance counter stats for 'hackbench 20 thread 20000' (10 runs): >> >> 19.112106847 seconds time elapsed ( +- 0.64% ) >> >> >> Xunlei Pang (3): >> mm/slub: Introduce two counters for partial objects >> mm/slub: Get rid of count_partial() >> mm/slub: Use percpu partial free counter >> >> mm/slab.h | 2 + >> mm/slub.c | 124 +++++++++++++++++++++++++++++++++++++++++++------------------- >> 2 files changed, 89 insertions(+), 37 deletions(-) > > We probably need to wrap the counters under CONFIG_SLUB_DEBUG because > AFAICT all the code that uses them is also wrapped under it. /sys/kernel/slab/***/partial sysfs also uses it, I can wrap it with CONFIG_SLUB_DEBUG or CONFIG_SYSFS for backward compatibility. > > An alternative approach for this patch would be to somehow make the > lock in count_partial() more granular, but I don't know how feasible > that actually is. > > Anyway, I am OK with this approach: > > Reviewed-by: Pekka Enberg Thanks! > > You still need to convince Christoph, though, because he had > objections over this approach. Christoph, what do you think, or any better suggestion to address this *in production* issue? > > - Pekka >