From: Oscar Salvador
To: akpm@linux-foundation.org
Cc: mhocko@suse.com, dan.j.williams@intel.com, yasu.isimatu@gmail.com,
	rppt@linux.vnet.ibm.com, malat@debian.org,
	linux-kernel@vger.kernel.org, pavel.tatashin@microsoft.com,
	jglisse@redhat.com, Jonathan.Cameron@huawei.com, rafael@kernel.org,
	david@redhat.com, dave.jiang@intel.com, linux-mm@kvack.org,
	alexander.h.duyck@linux.intel.com, Oscar Salvador
Subject: [PATCH 5/5] mm/memory-hotplug: Rework unregister_mem_sect_under_nodes
Date: Mon, 15 Oct 2018 17:30:34 +0200
Message-Id: <20181015153034.32203-6-osalvador@techadventures.net>
X-Mailer: git-send-email 2.13.6
In-Reply-To: <20181015153034.32203-1-osalvador@techadventures.net>
References: <20181015153034.32203-1-osalvador@techadventures.net>
X-Mailing-List: linux-kernel@vger.kernel.org

From: Oscar Salvador

This addresses another case of accessing uninitialized pages.

Jonathan reported a problem [1] where we can access stale pages when
memory is hot-removed without having been onlined first.

This time the problem is in unregister_mem_sect_under_nodes().
This function gets the nid from the pfn and then removes the symlinks
between mem_blk and nid in both directions.

Since we already know the nid in remove_memory(), we can pass it down
the chain to unregister_mem_sect_under_nodes().
There we can just remove the symlinks without having to look into the
pages.
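To illustrate the reasoning above, here is a small userspace model, not
kernel code. Every name in it (the model_* functions, the struct, the
two MODEL_* constants) is hypothetical and exists only for this sketch;
a single boolean per node stands in for both sysfs symlinks, and a
small integer array stands in for the per-page nid that the old code
read through the pfn. It assumes the stale per-page value happens to
name a different node than the one the block is linked to.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MODEL_MAX_NODES		4	/* hypothetical, model only */
#define MODEL_PAGES_PER_SECTION	8	/* hypothetical, far below the real value */

/* Toy stand-in for a memory block and the pages it covers. */
struct model_mem_blk {
	bool node_link[MODEL_MAX_NODES];	/* stands in for both symlinks */
	int page_nid[MODEL_PAGES_PER_SECTION];	/* per-page nid, may be stale */
};

/* Old shape: derive the nid from per-page data, then drop the links. */
static void model_unregister_by_scan(struct model_mem_blk *blk)
{
	for (size_t i = 0; i < MODEL_PAGES_PER_SECTION; i++) {
		int nid = blk->page_nid[i];

		if (nid < 0 || nid >= MODEL_MAX_NODES)
			continue;
		blk->node_link[nid] = false;
	}
}

/* New shape: the caller passes the nid, so no page data is read. */
static void model_unregister_by_nid(int nid, struct model_mem_blk *blk)
{
	assert(nid >= 0 && nid < MODEL_MAX_NODES);
	blk->node_link[nid] = false;
}

/* Build a block linked to nid whose pages still hold stale_nid. */
static struct model_mem_blk model_never_onlined_blk(int nid, int stale_nid)
{
	struct model_mem_blk blk = { 0 };

	assert(nid >= 0 && nid < MODEL_MAX_NODES);
	blk.node_link[nid] = true;
	for (size_t i = 0; i < MODEL_PAGES_PER_SECTION; i++)
		blk.page_nid[i] = stale_nid;
	return blk;
}
```

In this model, a block built with model_never_onlined_blk(1, 3) keeps
its link to node 1 after model_unregister_by_scan(), because the scan
only ever sees the stale value 3, while model_unregister_by_nid(1, ...)
clears it without touching page_nid at all.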
[1] https://www.spinics.net/lists/linux-mm/msg161316.html

Signed-off-by: Oscar Salvador
---
 drivers/base/memory.c  |  9 ++++-----
 drivers/base/node.c    | 38 +++++++-------------------------------
 include/linux/memory.h |  2 +-
 include/linux/node.h   |  9 ++++-----
 mm/memory_hotplug.c    |  2 +-
 5 files changed, 17 insertions(+), 43 deletions(-)

diff --git a/drivers/base/memory.c b/drivers/base/memory.c
index 0e5985682642..3d8c65d84bea 100644
--- a/drivers/base/memory.c
+++ b/drivers/base/memory.c
@@ -744,8 +744,7 @@ unregister_memory(struct memory_block *memory)
 	device_unregister(&memory->dev);
 }
 
-static int remove_memory_section(unsigned long node_id,
-			       struct mem_section *section, int phys_device)
+static int remove_memory_section(unsigned long nid, struct mem_section *section)
 {
 	struct memory_block *mem;
 
@@ -759,7 +758,7 @@ static int remove_memory_section(unsigned long node_id,
 	if (!mem)
 		goto out_unlock;
 
-	unregister_mem_sect_under_nodes(mem, __section_nr(section));
+	unregister_mem_sect_under_nodes(nid, mem);
 
 	mem->section_count--;
 	if (mem->section_count == 0)
@@ -772,12 +771,12 @@ static int remove_memory_section(unsigned long node_id,
 	return 0;
 }
 
-int unregister_memory_section(struct mem_section *section)
+int unregister_memory_section(int nid, struct mem_section *section)
 {
 	if (!present_section(section))
 		return -EINVAL;
 
-	return remove_memory_section(0, section, 0);
+	return remove_memory_section(nid, section);
 }
 #endif /* CONFIG_MEMORY_HOTREMOVE */
 
diff --git a/drivers/base/node.c b/drivers/base/node.c
index 86d6cd92ce3d..65bc5920bd3d 100644
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -453,40 +453,16 @@ int register_mem_sect_under_node(struct memory_block *mem_blk, void *arg)
 	return 0;
 }
 
-/* unregister memory section under all nodes that it spans */
-int unregister_mem_sect_under_nodes(struct memory_block *mem_blk,
-				    unsigned long phys_index)
+/*
+ * This mem_blk is going to be removed, so let us remove the link
+ * to the node and vice versa
+ */
+void unregister_mem_sect_under_nodes(int nid, struct memory_block *mem_blk)
 {
-	NODEMASK_ALLOC(nodemask_t, unlinked_nodes, GFP_KERNEL);
-	unsigned long pfn, sect_start_pfn, sect_end_pfn;
-
-	if (!mem_blk) {
-		NODEMASK_FREE(unlinked_nodes);
-		return -EFAULT;
-	}
-	if (!unlinked_nodes)
-		return -ENOMEM;
-	nodes_clear(*unlinked_nodes);
-
-	sect_start_pfn = section_nr_to_pfn(phys_index);
-	sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
-	for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
-		int nid;
-
-		nid = get_nid_for_pfn(pfn);
-		if (nid < 0)
-			continue;
-		if (!node_online(nid))
-			continue;
-		if (node_test_and_set(nid, *unlinked_nodes))
-			continue;
-		sysfs_remove_link(&node_devices[nid]->dev.kobj,
+	sysfs_remove_link(&node_devices[nid]->dev.kobj,
 			 kobject_name(&mem_blk->dev.kobj));
-		sysfs_remove_link(&mem_blk->dev.kobj,
+	sysfs_remove_link(&mem_blk->dev.kobj,
 			 kobject_name(&node_devices[nid]->dev.kobj));
-	}
-	NODEMASK_FREE(unlinked_nodes);
-	return 0;
 }
 
 int link_mem_sections(int nid, unsigned long start_pfn, unsigned long end_pfn)
diff --git a/include/linux/memory.h b/include/linux/memory.h
index a6ddefc60517..d75ec88ca09d 100644
--- a/include/linux/memory.h
+++ b/include/linux/memory.h
@@ -113,7 +113,7 @@ extern int register_memory_isolate_notifier(struct notifier_block *nb);
 extern void unregister_memory_isolate_notifier(struct notifier_block *nb);
 int hotplug_memory_register(int nid, struct mem_section *section);
 #ifdef CONFIG_MEMORY_HOTREMOVE
-extern int unregister_memory_section(struct mem_section *);
+extern int unregister_memory_section(int nid, struct mem_section *);
 #endif
 extern int memory_dev_init(void);
 extern int memory_notify(unsigned long val, void *v);
diff --git a/include/linux/node.h b/include/linux/node.h
index 257bb3d6d014..dddead9937ab 100644
--- a/include/linux/node.h
+++ b/include/linux/node.h
@@ -72,8 +72,8 @@ extern int register_cpu_under_node(unsigned int cpu, unsigned int nid);
 extern int unregister_cpu_under_node(unsigned int cpu, unsigned int nid);
 extern int register_mem_sect_under_node(struct memory_block *mem_blk,
 						void *arg);
-extern int unregister_mem_sect_under_nodes(struct memory_block *mem_blk,
-					   unsigned long phys_index);
+extern void unregister_mem_sect_under_nodes(int nid,
+					    struct memory_block *mem_blk);
 
 #ifdef CONFIG_HUGETLBFS
 extern void register_hugetlbfs_with_node(node_registration_func_t doregister,
@@ -105,10 +105,9 @@ static inline int register_mem_sect_under_node(struct memory_block *mem_blk,
 {
 	return 0;
 }
-static inline int unregister_mem_sect_under_nodes(struct memory_block *mem_blk,
-						  unsigned long phys_index)
+static inline void unregister_mem_sect_under_nodes(int nid,
+						   struct memory_block *mem_blk)
 {
-	return 0;
 }
 
 static inline void register_hugetlbfs_with_node(node_registration_func_t reg,
diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c
index 6b98321aa52f..66ccbb5b8a88 100644
--- a/mm/memory_hotplug.c
+++ b/mm/memory_hotplug.c
@@ -528,7 +528,7 @@ static int __remove_section(int nid, struct mem_section *ms,
 	if (!valid_section(ms))
 		return ret;
 
-	ret = unregister_memory_section(ms);
+	ret = unregister_memory_section(nid, ms);
 	if (ret)
 		return ret;
 
-- 
2.13.6