Received: by 2002:ac0:a5a6:0:0:0:0:0 with SMTP id m35-v6csp232020imm; Thu, 6 Sep 2018 01:18:53 -0700 (PDT) X-Google-Smtp-Source: ANB0VdZvsnxqcXh4R5ZJSpAXiYdhxeIw5V90XHRAJ1yTFHeldwsPKjlO8yZ7GJebNnNW77i5HMeY X-Received: by 2002:a62:3001:: with SMTP id w1-v6mr1660343pfw.19.1536221933825; Thu, 06 Sep 2018 01:18:53 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1536221933; cv=none; d=google.com; s=arc-20160816; b=Pbcmed6TIDAhBrzJMVNBNJqsLF3t17pwlVIgl96wVI2GYLMkVui/Z7wLLLhBLmY4Ey YtcoEDAWhTFL0RCcM5LROMMewLF0GM3VIpcaOx39cisgMrRMxG6HzWTMWiFILIpSn9+Y ZIaQfXpPv7bpHnf5gfcGkcSlaNsXduxsrgb7iZY4Zdj3O9WrS0GayZKnwyWLZY5wPIni 2j0XFyRRkiXb57ibHLCeG2Qa0uIyT5XQ5ltx+rS5xch+56wrhhU0mJkrhDYA0ZLOS/8v 9X4Lo5kBz0UnfpTdziauuKBSfPjFGfBjgw64l19vkAg45+ssp/wjE8QISs/GGpqGG+G0 dLZA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :references:in-reply-to:mime-version; bh=EmrzoUzfT7zxPZ19pX+8kD6VAmZ9M/X6zaRNO2uMEAs=; b=iUDlKhJHv4mcOZREGPoNjy80gE6Z/d+Fhbt6VLF0AxLCrY8zmQqbKPrLHlSHG4k9dx DOwE/T5ReW9W2DeG69rT4BmOOIGl1piylAXf0InS/xpYlSgsoV5y3ohMa7B5ngRyiHo2 j/rGMwZerlzYlkfrbxeJ0rzQIRuaAoAqOe1USE7TishEQ8EAsMH9wIa8+Sn9NC8zef9y 0cwTr4wR7JZa9VJIChsJQbFCrkgWZpxuh3qIAtoCFiYSzLMXHy10QGWZ2BRHRK6vI+z0 NkvtO6GP4ItJyrD9azeBZBkAVKAt2IQQqIpyeN+p6fzbEOnQuIDrBRQMz/zCdiZls7eD 8Lzg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v17-v6si4447044pfl.233.2018.09.06.01.18.37; Thu, 06 Sep 2018 01:18:53 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727993AbeIFMup (ORCPT + 99 others); Thu, 6 Sep 2018 08:50:45 -0400 Received: from mail-oi0-f66.google.com ([209.85.218.66]:45350 "EHLO mail-oi0-f66.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725819AbeIFMup (ORCPT ); Thu, 6 Sep 2018 08:50:45 -0400 Received: by mail-oi0-f66.google.com with SMTP id t68-v6so18859609oie.12 for ; Thu, 06 Sep 2018 01:16:29 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=EmrzoUzfT7zxPZ19pX+8kD6VAmZ9M/X6zaRNO2uMEAs=; b=orXGqDZ/i2ff8z7eRKCN58NghOfY4/pne4BEGMxQYv+SbCqaEj53plJfxDHy9XPvFJ XK+9gW0vuX0lJhNJ3gXPCZoTVStv1NpeGnDMCWtTivjsYWy2BIBk3k3V+souMFUqQsmv fobSHX0LM5PVvWGL2z8vog2+zhKkbhq3r30AQ+cZOEhKdmtsXqN+cyYkRzWWNAGL4Gnk j7OV+B02jbhTLx0jKQWekL3bLodafVw3h8OEnb9ZoA0leJwoez9yqFXknJBFtGhAcC9w LW6jb14qx1CdvX6GPe+kHJJCgHcc9DjZRkOcrpXFw63qRyt/4u6OdVc2ynLhmn3pF6eH 45Ag== X-Gm-Message-State: APzg51CP0qm/58TLl//8DhsXTxr/LEeMKGsPcIEjMhm5dWMk6ymfqEbk qJE4nOncQQmXntb0Krtkr/7+ldHM1EWrfsDib1+VlvxLnVk= X-Received: by 2002:aca:2cd3:: with SMTP id s202-v6mr1540591ois.253.1536221789187; Thu, 06 Sep 2018 01:16:29 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:a4a:55cf:0:0:0:0:0 with HTTP; Thu, 6 Sep 2018 01:16:28 -0700 (PDT) In-Reply-To: References: <20180627084954.73ucqvac62v5gje4@techsingularity.net> <20180717100329.yfy7igdsrpk5ujf4@techsingularity.net> <20180904090053.GB1719@techsingularity.net> From: Jirka Hladky Date: Thu, 6 Sep 2018 10:16:28 +0200 Message-ID: Subject: Re: [4.17 regression] Performance drop on kernel-4.17 visible on Stream, Linpack and NAS parallel benchmarks To: Mel Gorman Cc: Kamil Kolakowski , Jakub Racek , linux-kernel , "Rafael J. Wysocki" , Len Brown , linux-acpi@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Mel, we have results with 2d4056fafa196e1ab4e7161bae4df76f9602d56d reverted. * Compared to 4.18, there is still performance regression - especially with NAS (sp_C_x subtest) and SPECjvm2008. On 4 NUMA systems, regression is around 10-15% * Compared to 4.19rc1 there is a clear gain across all benchmarks around 20% While reverting 2d4056fafa196e1ab4e7161bae4df76f9602d56d has helped a lot there is another issue as well. Could you please recommend some commit prior to 2d4056fafa196e1ab4e7161bae4df76f9602d56d to try? Regarding the current results, how do we proceed? Could you please contact Srikar and ask for the advice or should we contact him directly? Thanks a lot! Jirka On Tue, Sep 4, 2018 at 12:07 PM, Jirka Hladky wrote: > Hi Mel, > > thanks for sharing the background information! We will check if > 2d4056fafa196e1ab4e7161bae4df76f9602d56d is causing the current > regression in 4.19 rc1 and let you know the outcome. > > Jirka > > On Tue, Sep 4, 2018 at 11:00 AM, Mel Gorman wrote: >> On Mon, Sep 03, 2018 at 05:07:15PM +0200, Jirka Hladky wrote: >>> Resending in the plain text mode. >>> >>> > My own testing completed and the results are within expectations and I >>> > saw no red flags. Unfortunately, I consider it unlikely they'll be merged >>> > for 4.18. Srikar Dronamraju's series is likely to need another update >>> > and I would need to rebase my patches on top of that. Given the scope >>> > and complexity, I find it unlikely they would be accepted for an -rc, >>> > particularly this late of an rc. Whether we hit the 4.19 merge window or >>> > not will depend on when Srikar's series gets updated. >>> >>> >>> Hi Mel, >>> >>> we have collaborated back in July on the scheduler patch, improving >>> the performance by allowing faster memory migration. You came up with >>> the "sched-numa-fast-crossnode-v1r12" series here: >>> >>> https://git.kernel.org/pub/scm/linux/kernel/git/mel/linux.git >>> >>> which has shown good performance results both in your and our testing. >>> >> >> I remember. >> >>> Do you have some update on the latest status? Is there any plan to >>> merge this series into 4.19 kernel? We have just tested 4.19.0-0.rc1.1 >>> and based on the results it seems that the patch is not included (and >>> I don't see it listed in git shortlog v4.18..v4.19-rc1 >>> ./kernel/sched) >>> >> >> Srikar's series that mine depended upon was only partially merged due to >> a review bottleneck. He posted a v2 but it was during the merge window >> and likely will need a v3 to avoid falling through the cracks. When it >> is merged, I'll rebase my series on top and post it. While I didn't >> check against 4.19-rc1, I did find that rebasing on top of the partial >> series in 4.18 did not have as big an improvement. >> >>> With 4.19rc1 we see performance drop >>> * up to 40% (NAS bench) relatively to 4.18 + sched-numa-fast-crossnode-v1r12 >>> * up to 20% (NAS, Stream, SPECjbb2005, SPECjvm2008) relatively to 4.18 vanilla >>> The performance is dropping. It's quite unclear what are the next >>> steps - should we wait for "sched-numa-fast-crossnode-v1r12" to be >>> merged or should we start looking at what has caused the drop in >>> performance going from 4.19rc1 to 4.18? >>> >> >> Both are valid options. If you take the latter option, I suggest looking >> at whether 2d4056fafa196e1ab4e7161bae4df76f9602d56d is the source of the >> issue as at least one auto-bisection found that it may be problematic. >> Whether it is an issue or not depends heavily on the number of threads >> relative to a socket size. >> >> -- >> Mel Gorman >> SUSE Labs