Received: by 2002:ac0:a5b6:0:0:0:0:0 with SMTP id m51-v6csp2026562imm; Thu, 7 Jun 2018 04:21:12 -0700 (PDT) X-Google-Smtp-Source: ADUXVKKh9usFoE6F/VebWyVM3FW++b2XPhiYTPC+qwKjr24vnk0XsvWMm/0F3paQYZEB6o3IlarH X-Received: by 2002:a62:b612:: with SMTP id j18-v6mr1402871pff.199.1528370472697; Thu, 07 Jun 2018 04:21:12 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1528370472; cv=none; d=google.com; s=arc-20160816; b=At/oke7C5cx1aI3a2+H8hp3lzhawir/shLqIoS8jx01PMV/dxtw25NBmWl1vI0IfHB 2rg6/2FqJcCJjATREYU3L8jjYqEu29dWv1kS5FmhN83+quKLt8Wx+7cuJvws/m+2Olta UMuqMKyz++hc5SQROiNWwlqeKpqDnI9wb0GpjrNHFehfSYkQJb6+zGvH7Y/NFpkZ4RMS kPx2flL7XpBQP+iFqPJy+mpwUMnBtO5esYcjE5eP2/uaj8XK9zD9Bzzi9w0f6dOeSKRK ElRExepMh1CwVKPqv+woco2RHzPjBDOSS0tJhQEAfO3MFcFi2pKnIH+sgK42+NVc2yf2 D30w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=8LODUr+uLTsjYkELqid5qa4R7M9MPhrz+zwVA/UYTQk=; b=MgMY10RiiMLplEsg4XWv0p7T6Q1MkH3Uu+s3TvNUx7GIncnFt6O9rY+KBPmLpXecDT boCfrVIIo2UQP1ZVP6kkTMYQj3KxBjZZfcYQ7Taa6gjZz8sYZssnGZlvT8Y13SU0eSF3 o1DvicFGRsAkKziOKbLcUcx2VRpqqNbZJbMzyNAmJLHvhgo3/r0Zakh91rp61/z7Jw9p hbkx6mle7yZA9PQrpI7QaaCcOPRrH+HKutZLtgLus3ut9RiR+3Ndv3HDTWXJ2dD0i3vv dl9QDkx1VvkB06y+NOZnQ4MqXE07pHTgnU7bN1O3modYbZe2dD/uTsatywMQzGnkM+5Z +U0Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id s194-v6si27681745pgc.602.2018.06.07.04.20.57; Thu, 07 Jun 2018 04:21:12 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753583AbeFGLTb (ORCPT + 99 others); Thu, 7 Jun 2018 07:19:31 -0400 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:42096 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1753325AbeFGLTa (ORCPT ); Thu, 7 Jun 2018 07:19:30 -0400 Received: from smtp.corp.redhat.com (int-mx06.intmail.prod.int.rdu2.redhat.com [10.11.54.6]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 9E59D35C1; Thu, 7 Jun 2018 11:19:29 +0000 (UTC) Received: from [10.40.204.234] (ovpn-204-234.brq.redhat.com [10.40.204.234]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 2C182217B409; Thu, 7 Jun 2018 11:19:27 +0000 (UTC) Subject: Re: [4.17 regression] Performance drop on kernel-4.17 visible on Stream, Linpack and NAS parallel benchmarks To: Michal Hocko Cc: linux-kernel@vger.kernel.org, "Rafael J. Wysocki" , Len Brown , linux-acpi@vger.kernel.org, Mel Gorman , linux-mm@kvack.org References: <20180606122731.GB27707@jra-laptop.brq.redhat.com> <20180607110713.GJ32433@dhcp22.suse.cz> From: =?UTF-8?Q?Jakub_Ra=c4=8dek?= Message-ID: Date: Thu, 7 Jun 2018 13:19:26 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0 MIME-Version: 1.0 In-Reply-To: <20180607110713.GJ32433@dhcp22.suse.cz> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.78 on 10.11.54.6 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.2]); Thu, 07 Jun 2018 11:19:29 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.2]); Thu, 07 Jun 2018 11:19:29 +0000 (UTC) for IP:'10.11.54.6' DOMAIN:'int-mx06.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'jracek@redhat.com' RCPT:'' Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, On 06/07/2018 01:07 PM, Michal Hocko wrote: > [CCing Mel and MM mailing list] > > On Wed 06-06-18 14:27:32, Jakub Racek wrote: >> Hi, >> >> There is a huge performance regression on the 2 and 4 NUMA node systems on >> stream benchmark with 4.17 kernel compared to 4.16 kernel. Stream, Linpack >> and NAS parallel benchmarks show upto 50% performance drop. >> >> When running for example 20 stream processes in parallel, we see the following behavior: >> >> * all processes are started at NODE #1 >> * memory is also allocated on NODE #1 >> * roughly half of the processes are moved to the NODE #0 very quickly. * >> however, memory is not moved to NODE #0 and stays allocated on NODE #1 >> >> As the result, half of the processes are running on NODE#0 with memory being >> still allocated on NODE#1. This leads to non-local memory accesses >> on the high Remote-To-Local Memory Access Ratio on the numatop charts. >> >> So it seems that 4.17 is not doing a good job to move the memory to the right NUMA >> node after the process has been moved. >> >> ----8<---- >> >> The above is an excerpt from performance testing on 4.16 and 4.17 kernels. >> >> For now I'm merely making sure the problem is reported. > > Do you have numa balancing enabled? > Yes. The relevant settings are: kernel.numa_balancing = 1 kernel.numa_balancing_scan_delay_ms = 1000 kernel.numa_balancing_scan_period_max_ms = 60000 kernel.numa_balancing_scan_period_min_ms = 1000 kernel.numa_balancing_scan_size_mb = 256 -- Best regards, Jakub Racek FMK