Received: by 2002:a25:c205:0:0:0:0:0 with SMTP id s5csp2292650ybf; Mon, 2 Mar 2020 05:53:40 -0800 (PST) X-Google-Smtp-Source: APXvYqyxjD+eN1t0srBLR/Sa4bFHyKxKsuNJ2NnBTigpM/+sxTuhKiqGJsH4G+i46c2+iOTW5CXq X-Received: by 2002:a05:6830:13d9:: with SMTP id e25mr13020675otq.134.1583157220084; Mon, 02 Mar 2020 05:53:40 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1583157220; cv=none; d=google.com; s=arc-20160816; b=jh4+Q80iHeJ9c61JZdO/7FiGoGgxPnbgwJeAFVlJho7yhjJ+bKA7nGXaaa4fSgTS4+ e57+DIPib2abGG+XPKGchRs4GC/Ji82ngD3Nvqu1Ula/2SdNUawBz35RtYeggdtfQvyo 707KprgkJGio/VfxWBNBk5qJ8r6L0DMzjl4nS2xvqQt94MUsMPnBYfKT/POkrQ94cO70 bqiOM+av5Gore56AtJ50gx0+vVvQh5ddIK65cgIisHIRv/oK1rTuob41YPE8qIR3IawV biifK5TnVGmbTxob8YWlDflsEH2GwmasKt5kcWuCfhORvqgcMVhXRBj0bqYSYbeYJJg0 7sEQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=R9YlCAxZTLeLO624BWfRaiPaRVXS4Jlz4OonzSFN6/Y=; b=cQ9sSlx36ROPrkJR7rXzos6n76xQWFGO6plRcD9fCULBRvkh4cB2EF36tIuowM/vFl MVlShvHdIBgQ6hy/9WzSTJSftFiLZa7ogSqzLSvfknKb1I9VbFJMXF713JleHDnSmal9 gvtsCFTkUvWyMYKIy/G3c02LQWM7Roy2Y3ZYOePd/7zN8/0q/EINGs0u58VhJVENGnNi q1A0Yi1cpjc2uUw09qexBnskbTRMRV1MHwrCIxeA4Or0eN4e5TuctxIAbEJdTQMqVbNR A1BTj4rAEbu2e6WvuMhI9TDtE+CsYKeBJgVkvDNgMT0nIA36nZSpQNrP+hb/gUBTWgyG 2tcg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=Deq6zRlb; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 45si6261077otg.7.2020.03.02.05.53.28; Mon, 02 Mar 2020 05:53:40 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=Deq6zRlb; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727250AbgCBNxD (ORCPT + 99 others); Mon, 2 Mar 2020 08:53:03 -0500 Received: from us-smtp-1.mimecast.com ([205.139.110.61]:56291 "EHLO us-smtp-delivery-1.mimecast.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726740AbgCBNxD (ORCPT ); Mon, 2 Mar 2020 08:53:03 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1583157182; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=R9YlCAxZTLeLO624BWfRaiPaRVXS4Jlz4OonzSFN6/Y=; b=Deq6zRlbK3o/7eTUUnyXociahIwxTSiR/dHQOmQPmeixDTRZRWqNbZr2Hpru5dl8eqXmRZ 1jeAepxsmmEjjY3GTT70L94q/q203GguXvfHv+tqhrLfrcYvSEUr1TXnoGOd3f6kLczfK3 FbPOOVNhT6GQune/tnj5NjJ6NAeq/JU= Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-40-QkfpYAOAPTGAmQMiM64pbg-1; Mon, 02 Mar 2020 08:53:00 -0500 X-MC-Unique: QkfpYAOAPTGAmQMiM64pbg-1 Received: from smtp.corp.redhat.com (int-mx08.intmail.prod.int.phx2.redhat.com [10.5.11.23]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx01.redhat.com (Postfix) with ESMTPS id 4E674802563; Mon, 2 Mar 2020 13:52:58 +0000 (UTC) Received: from t480s.redhat.com (ovpn-116-114.ams2.redhat.com [10.36.116.114]) by smtp.corp.redhat.com (Postfix) with ESMTP id 7D9EC397; Mon, 2 Mar 2020 13:52:46 +0000 (UTC) From: David Hildenbrand To: linux-kernel@vger.kernel.org Cc: linux-mm@kvack.org, virtio-dev@lists.oasis-open.org, virtualization@lists.linux-foundation.org, kvm@vger.kernel.org, Michal Hocko , Andrew Morton , "Michael S . Tsirkin" , David Hildenbrand , Oscar Salvador , Michal Hocko , Pavel Tatashin , Wei Yang , Dan Williams , Qian Cai Subject: [PATCH v1 08/11] mm/memory_hotplug: Introduce offline_and_remove_memory() Date: Mon, 2 Mar 2020 14:49:38 +0100 Message-Id: <20200302134941.315212-9-david@redhat.com> In-Reply-To: <20200302134941.315212-1-david@redhat.com> References: <20200302134941.315212-1-david@redhat.com> MIME-Version: 1.0 X-Scanned-By: MIMEDefang 2.84 on 10.5.11.23 Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org virtio-mem wants to offline and remove a memory block once it unplugged all subblocks (e.g., using alloc_contig_range()). Let's provide an interface to do that from a driver. virtio-mem already supports to offline partially unplugged memory blocks. Offlining a fully unplugged memory block will not require to migrate any pages. All unplugged subblocks are PageOffline() and have a reference count of 0 - so offlining code will simply skip them. All we need is an interface to offline and remove the memory from kernel module context, where we don't have access to the memory block devices (esp. find_memory_block() and device_offline()) and the device hotplug lock. To keep things simple, allow to only work on a single memory block. Cc: Andrew Morton Cc: David Hildenbrand Cc: Oscar Salvador Cc: Michal Hocko Cc: Pavel Tatashin Cc: Wei Yang Cc: Dan Williams Cc: Qian Cai Signed-off-by: David Hildenbrand --- include/linux/memory_hotplug.h | 1 + mm/memory_hotplug.c | 37 ++++++++++++++++++++++++++++++++++ 2 files changed, 38 insertions(+) diff --git a/include/linux/memory_hotplug.h b/include/linux/memory_hotplu= g.h index f4d59155f3d4..a98aa16dbfa1 100644 --- a/include/linux/memory_hotplug.h +++ b/include/linux/memory_hotplug.h @@ -311,6 +311,7 @@ extern void try_offline_node(int nid); extern int offline_pages(unsigned long start_pfn, unsigned long nr_pages= ); extern int remove_memory(int nid, u64 start, u64 size); extern void __remove_memory(int nid, u64 start, u64 size); +extern int offline_and_remove_memory(int nid, u64 start, u64 size); =20 #else static inline bool is_mem_section_removable(unsigned long pfn, diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index ab1c31e67fd1..d0d337918a15 100644 --- a/mm/memory_hotplug.c +++ b/mm/memory_hotplug.c @@ -1818,4 +1818,41 @@ int remove_memory(int nid, u64 start, u64 size) return rc; } EXPORT_SYMBOL_GPL(remove_memory); + +/* + * Try to offline and remove a memory block. Might take a long time to + * finish in case memory is still in use. Primarily useful for memory de= vices + * that logically unplugged all memory (so it's no longer in use) and wa= nt to + * offline + remove the memory block. + */ +int offline_and_remove_memory(int nid, u64 start, u64 size) +{ + struct memory_block *mem; + int rc =3D -EINVAL; + + if (!IS_ALIGNED(start, memory_block_size_bytes()) || + size !=3D memory_block_size_bytes()) + return rc; + + lock_device_hotplug(); + mem =3D find_memory_block(__pfn_to_section(PFN_DOWN(start))); + if (mem) + rc =3D device_offline(&mem->dev); + /* Ignore if the device is already offline. */ + if (rc > 0) + rc =3D 0; + + /* + * In case we succeeded to offline the memory block, remove it. + * This cannot fail as it cannot get onlined in the meantime. + */ + if (!rc) { + rc =3D try_remove_memory(nid, start, size); + WARN_ON_ONCE(rc); + } + unlock_device_hotplug(); + + return rc; +} +EXPORT_SYMBOL_GPL(offline_and_remove_memory); #endif /* CONFIG_MEMORY_HOTREMOVE */ --=20 2.24.1