Received: by 10.192.165.148 with SMTP id m20csp3577795imm; Mon, 30 Apr 2018 02:43:54 -0700 (PDT) X-Google-Smtp-Source: AB8JxZq4G9odipDvzHn1HpJhUo1NDu/3hkQCBuysMi/Go6XXyf/1HH9xuHkkXqSJVQjGgLKoMpBD X-Received: by 2002:a63:7945:: with SMTP id u66-v6mr9471325pgc.376.1525081434000; Mon, 30 Apr 2018 02:43:54 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1525081433; cv=none; d=google.com; s=arc-20160816; b=AlCsXjPhbdgtBj3/BgypLHrFeVeIMWwmSaJUKpCujTjQq5JWD3LUDQnqoP8GaSeGbf SsdbcD8ac9FiBc4MyP5ejNqJnP0fMaCQxWweyBvYC02l+BkOJ5hA5sbcIGEtZOkhcos2 5kJpzBEg+vPjimEs8iIXVNa3TkdoLj2jjlFnW25maGkQrG36n80sjRFMTXMp7Zu9Il40 mY1a33V+khkPTs3hx4n4dqTL6ZlAa6m4YSE5a04BFF37+CVkXgjUdEX9aUXCeNP3ol2Q 7KXfUO9q0x9nqriRxhOe3nU4WKWUht3xEtvrMxK3z/4cLqwoDtBJ3krIYEzfmMJHbmxP U1Rw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from:arc-authentication-results; bh=Y8fMAfSHritLnWkT0WvH7RFx/qlMA9hp8NNej6VUNds=; b=fhO8hq+zfL2uxty1JJ3I7m7Z8Mdw3aKlynIDL2J+Iwld7NIpDw9jg4ngMr/sg1KK/1 3Y+JxB47YKL2n5RSQwucmeYQYhPXUntmUi1yZ+MB7iuvziFjimAJH+34tF8CHlDHgJfd JfADanKzytJlFLec0pU4yu8kda8/zI2SZuBpxC9ro8Rrj7bkSsb1F3eXqsO2nd/1nM5u NMA2c4AsdiJJ9JC1emScnF1BomH+45HAO+CCC4cHdM3t2pn1WCwoQW3LyL1Im8RycgmN XJRaEE+dRNdTIBfChCZsUeaIU53Ue+pmU5ciikKmwt0qYaTcTmVhfnIkhfXl00UW/xGi +Pxg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 90-v6si7230913plf.340.2018.04.30.02.43.40; Mon, 30 Apr 2018 02:43:53 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753415AbeD3JnC (ORCPT + 99 others); Mon, 30 Apr 2018 05:43:02 -0400 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:45614 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1753003AbeD3Jm7 (ORCPT ); Mon, 30 Apr 2018 05:42:59 -0400 Received: from smtp.corp.redhat.com (int-mx04.intmail.prod.int.rdu2.redhat.com [10.11.54.4]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id E5FA0406C74C; Mon, 30 Apr 2018 09:42:58 +0000 (UTC) Received: from t460s.redhat.com (ovpn-117-122.ams2.redhat.com [10.36.117.122]) by smtp.corp.redhat.com (Postfix) with ESMTP id 5C9B42023499; Mon, 30 Apr 2018 09:42:56 +0000 (UTC) From: David Hildenbrand To: linux-mm@kvack.org Cc: linux-kernel@vger.kernel.org, David Hildenbrand , Benjamin Herrenschmidt , Paul Mackerras , Michael Ellerman , Greg Kroah-Hartman , Rashmica Gupta , Balbir Singh , Andrew Morton , Michal Hocko , Vlastimil Babka , Dan Williams , Joonsoo Kim , Pavel Tatashin , Reza Arbab , Thomas Gleixner Subject: [PATCH RCFv2 6/7] mm/memory_hotplug: teach offline_pages() to not try forever Date: Mon, 30 Apr 2018 11:42:35 +0200 Message-Id: <20180430094236.29056-7-david@redhat.com> In-Reply-To: <20180430094236.29056-1-david@redhat.com> References: <20180430094236.29056-1-david@redhat.com> X-Scanned-By: MIMEDefang 2.78 on 10.11.54.4 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.5]); Mon, 30 Apr 2018 09:42:59 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.5]); Mon, 30 Apr 2018 09:42:59 +0000 (UTC) for IP:'10.11.54.4' DOMAIN:'int-mx04.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'david@redhat.com' RCPT:'' Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org It can easily happen that we get stuck forever trying to offline pages - e.g. on persistent errors. Let's add a way to change this behavior and fail fast. This is interesting if offline_pages() is called from a driver and we just want to find some block to offline. Cc: Benjamin Herrenschmidt Cc: Paul Mackerras Cc: Michael Ellerman Cc: Greg Kroah-Hartman Cc: Rashmica Gupta Cc: Balbir Singh Cc: Andrew Morton Cc: Michal Hocko Cc: Vlastimil Babka Cc: Dan Williams Cc: Joonsoo Kim Cc: Pavel Tatashin Cc: Reza Arbab Cc: Thomas Gleixner Signed-off-by: David Hildenbrand --- arch/powerpc/platforms/powernv/memtrace.c | 2 +- drivers/base/memory.c | 2 +- include/linux/memory_hotplug.h | 8 ++++---- mm/memory_hotplug.c | 14 ++++++++++---- 4 files changed, 16 insertions(+), 10 deletions(-) diff --git a/arch/powerpc/platforms/powernv/memtrace.c b/arch/powerpc/platforms/powernv/memtrace.c index fc222a0c2ac4..8ce71f7e1558 100644 --- a/arch/powerpc/platforms/powernv/memtrace.c +++ b/arch/powerpc/platforms/powernv/memtrace.c @@ -110,7 +110,7 @@ static bool memtrace_offline_pages(u32 nid, u64 start_pfn, u64 nr_pages) walk_memory_range(start_pfn, end_pfn, (void *)MEM_GOING_OFFLINE, change_memblock_state); - if (offline_pages(start_pfn, nr_pages)) { + if (offline_pages(start_pfn, nr_pages, true)) { walk_memory_range(start_pfn, end_pfn, (void *)MEM_ONLINE, change_memblock_state); return false; diff --git a/drivers/base/memory.c b/drivers/base/memory.c index 3b8616551561..c785e4c01b23 100644 --- a/drivers/base/memory.c +++ b/drivers/base/memory.c @@ -248,7 +248,7 @@ memory_block_action(struct memory_block *mem, unsigned long action) ret = online_pages(start_pfn, nr_pages, mem->online_type); break; case MEM_OFFLINE: - ret = offline_pages(start_pfn, nr_pages); + ret = offline_pages(start_pfn, nr_pages, true); break; default: WARN(1, KERN_WARNING "%s(%ld, %ld) unknown action: " diff --git a/include/linux/memory_hotplug.h b/include/linux/memory_hotplug.h index 497e28f5b000..ae53017b54df 100644 --- a/include/linux/memory_hotplug.h +++ b/include/linux/memory_hotplug.h @@ -303,7 +303,8 @@ static inline void pgdat_resize_init(struct pglist_data *pgdat) {} extern bool is_mem_section_removable(unsigned long pfn, unsigned long nr_pages); extern void try_offline_node(int nid); -extern int offline_pages(unsigned long start_pfn, unsigned long nr_pages); +extern int offline_pages(unsigned long start_pfn, unsigned long nr_pages, + bool retry_forever); extern void remove_memory(int nid, u64 start, u64 size); #else @@ -315,7 +316,8 @@ static inline bool is_mem_section_removable(unsigned long pfn, static inline void try_offline_node(int nid) {} -static inline int offline_pages(unsigned long start_pfn, unsigned long nr_pages) +static inline int offline_pages(unsigned long start_pfn, unsigned long nr_pages, + bool retry_forever) { return -EINVAL; } @@ -333,9 +335,7 @@ extern int arch_add_memory(int nid, u64 start, u64 size, struct vmem_altmap *altmap, bool want_memblock); extern void move_pfn_range_to_zone(struct zone *zone, unsigned long start_pfn, unsigned long nr_pages, struct vmem_altmap *altmap); -extern int offline_pages(unsigned long start_pfn, unsigned long nr_pages); extern bool is_memblock_offlined(struct memory_block *mem); -extern void remove_memory(int nid, u64 start, u64 size); extern int sparse_add_one_section(struct pglist_data *pgdat, unsigned long start_pfn, struct vmem_altmap *altmap); extern void sparse_remove_one_section(struct zone *zone, struct mem_section *ms, diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index d8f127754c2e..c47cc68341fc 100644 --- a/mm/memory_hotplug.c +++ b/mm/memory_hotplug.c @@ -1618,8 +1618,8 @@ static void node_states_clear_node(int node, struct memory_notify *arg) node_clear_state(node, N_MEMORY); } -static int __ref __offline_pages(unsigned long start_pfn, - unsigned long end_pfn) +static int __ref __offline_pages(unsigned long start_pfn, unsigned long end_pfn, + bool retry_forever) { unsigned long pfn, nr_pages; long offlined_pages; @@ -1671,6 +1671,10 @@ static int __ref __offline_pages(unsigned long start_pfn, pfn = scan_movable_pages(start_pfn, end_pfn); if (pfn) { /* We have movable pages */ ret = do_migrate_range(pfn, end_pfn); + if (ret && !retry_forever) { + ret = -EBUSY; + goto failed_removal; + } goto repeat; } @@ -1737,6 +1741,7 @@ static int __ref __offline_pages(unsigned long start_pfn, * offline_pages - offline pages in a given range (that are currently online) * @start_pfn: start pfn of the memory range * @nr_pages: the number of pages + * @retry_forever: weather to retry (possibly) forever * * This function tries to offline the given pages. The alignment/size that * can be used is given by offline_nr_pages. @@ -1749,9 +1754,10 @@ static int __ref __offline_pages(unsigned long start_pfn, * * Must be protected by mem_hotplug_begin() or a device_lock */ -int offline_pages(unsigned long start_pfn, unsigned long nr_pages) +int offline_pages(unsigned long start_pfn, unsigned long nr_pages, + bool retry_forever) { - return __offline_pages(start_pfn, start_pfn + nr_pages); + return __offline_pages(start_pfn, start_pfn + nr_pages, retry_forever); } #endif /* CONFIG_MEMORY_HOTREMOVE */ -- 2.14.3