Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756879AbcK3T7B (ORCPT ); Wed, 30 Nov 2016 14:59:01 -0500 Received: from smtp.gentoo.org ([140.211.166.183]:57366 "EHLO smtp.gentoo.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754505AbcK3T6w (ORCPT ); Wed, 30 Nov 2016 14:58:52 -0500 Date: Wed, 30 Nov 2016 19:58:37 +0000 From: "Robin H. Johnson" To: Michal Hocko Cc: Michal Nazarewicz , "Robin H. Johnson" , linux-kernel@vger.kernel.org, robbat2@gentoo.org, linux-mm@kvack.org Subject: Re: PROBLEM-PERSISTS: dmesg spam: alloc_contig_range: [XX, YY) PFNs busy Message-ID: References: <20161130092239.GD18437@dhcp22.suse.cz> <20161130132848.GG18432@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20161130132848.GG18432@dhcp22.suse.cz> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2042 Lines: 55 (I'm going to respond directly to this email with the stack trace.) On Wed, Nov 30, 2016 at 02:28:49PM +0100, Michal Hocko wrote: > > On the other hand, if this didn’t happen and now happens all the time, > > this indicates a regression in CMA’s capability to allocate pages so > > just rate limiting the output would hide the potential actual issue. > > Or there might be just a much larger demand on those large blocks, no? > But seriously, dumping those message again and again into the low (see > the 2.5_GB_/h to the log is just insane. So there really should be some > throttling. > > Does the following help you Robin. At least to not get swamped by those > message. Here's what I whipped up based on that, to ensure that dump_stack got rate-limited at the same pass as PFNs-busy. It dropped the dmesg spew to ~25MB/hour (and is suppressing ~43 entries/second right now). commit 6ad4037e18ec2199f8755274d8a745a9904241a1 Author: Robin H. Johnson Date: Wed Nov 30 10:32:57 2016 -0800 mm: ratelimit & trace PFNs busy. Signed-off-by: Robin H. Johnson diff --git a/mm/page_alloc.c b/mm/page_alloc.c index 6de9440e3ae2..3c28ec3d18f8 100644 --- a/mm/page_alloc.c +++ b/mm/page_alloc.c @@ -7289,8 +7289,15 @@ int alloc_contig_range(unsigned long start, unsigned long end, /* Make sure the range is really isolated. */ if (test_pages_isolated(outer_start, end, false)) { - pr_info("%s: [%lx, %lx) PFNs busy\n", - __func__, outer_start, end); + static DEFINE_RATELIMIT_STATE(ratelimit_pfn_busy, + DEFAULT_RATELIMIT_INTERVAL, + DEFAULT_RATELIMIT_BURST); + if (__ratelimit(&ratelimit_pfn_busy)) { + pr_info("%s: [%lx, %lx) PFNs busy\n", + __func__, outer_start, end); + dump_stack(); + } + ret = -EBUSY; goto done; } -- Robin Hugh Johnson Gentoo Linux: Dev, Infra Lead, Foundation Trustee & Treasurer E-Mail : robbat2@gentoo.org GnuPG FP : 11ACBA4F 4778E3F6 E4EDF38E B27B944E 34884E85 GnuPG FP : 7D0B3CEB E9B85B1F 825BCECF EE05E6F6 A48F6136