Received: by 10.192.165.148 with SMTP id m20csp135258imm; Thu, 3 May 2018 16:32:35 -0700 (PDT) X-Google-Smtp-Source: AB8JxZqTn6isb66/MMsXAHOFCWMfJiUIxDXL1TaqC03HQBj2SkwN1m535ctWOnsoX65GNbSC08dh X-Received: by 2002:a63:5f82:: with SMTP id t124-v6mr20775839pgb.150.1525390355306; Thu, 03 May 2018 16:32:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1525390355; cv=none; d=google.com; s=arc-20160816; b=WrI+cFARCZo47ouGyRABpjEWccT2VwjGGxz9t9Ivloh5ipu5bKgfZQRuLEOwkjME6a 8p0xqD5YSo/FHLcVXIO5kjJMKW/fNoS0N3TlNLKXbaeOu/KHXQayaNdrkyznKH5mMbEK pblaeygPVYkxoReYrdjr7P/iUDfjEdaH5ZzoTavYrwP0JrXd3PA9lk0F6up2Pfy9JO3G 5Y42lswYdEmPq3wugS4bfXwfa8RSjmNBAKnSnrFqXkaBdhNkGiefP8wiBOPaf9N5mB1y mHInXKs9nx5lCst2Ygv15Lv06U+1iGQQCtg/0p7x6OaC1XQw5iLjvlMsc/lAaI36NLJv SuOw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from:dkim-signature:arc-authentication-results; bh=jUk0lM3XJWD9spuqYF4ecKo7JAU6MZLjoNWbamcEEtM=; b=LgeTvsCs48+m8r1avhixCTp+JvI+R+/Rwj5z+No2rGb985wRwsx7Z66fFcK1jXgoou zO1cn1e7D6Dg/+Fbj6c8tUvlkuMkEsydC5mmdtqmvfb5V28GAKCJ3UmTIsJNp8VvWm3A sPJ4J/Mvx359QGBZhMSZ7NFDAoT3DDim+JEPuPgVxvAUnVkVMnEEardmwtoSLTPJTmja AP++wq4DPxVU0bCtDWXwra8DM7HU825jUsVpaZSKqLpM9AJ+xziaFb+omYmoifNIUecx nDOfTS7N8XRhawcHST2uPap/5PiriyJZvzinDdMNyC9vPJXisBK9u2m/8jMWa3vSHZYO 9sbA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2017-10-26 header.b=i6z6PMqP; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id l186si15372418pfl.155.2018.05.03.16.32.21; Thu, 03 May 2018 16:32:35 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2017-10-26 header.b=i6z6PMqP; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751311AbeECXaa (ORCPT + 99 others); Thu, 3 May 2018 19:30:30 -0400 Received: from aserp2120.oracle.com ([141.146.126.78]:42842 "EHLO aserp2120.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750930AbeECXa2 (ORCPT ); Thu, 3 May 2018 19:30:28 -0400 Received: from pps.filterd (aserp2120.oracle.com [127.0.0.1]) by aserp2120.oracle.com (8.16.0.22/8.16.0.22) with SMTP id w43NM2o0103561; Thu, 3 May 2018 23:29:49 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h=from : to : cc : subject : date : message-id : in-reply-to : references; s=corp-2017-10-26; bh=jUk0lM3XJWD9spuqYF4ecKo7JAU6MZLjoNWbamcEEtM=; b=i6z6PMqPSnDCmcH4AL8a39C9NNUltg8QBseCImjeYzxsn/iNwy0fMriFfj0COsGImHDu mexyUUVyDSugAMI9Pb6Wtcq3CiP/j5yMP22qQk8A2ZTdSzt5IWnsFFsJAFUkwceNRcW6 T4LV9DxaiNADmj0SXzBdq0ka3xOzrWfkEgAU+U6iq+sX5Lw1jgTKjxwDfuI9Hg4U6+MI MqMIQjellSe8kYwp+Z4G25uGwTcBoaQhpcra7uY4zv5PzN1qYdqB8rwqNzIk9rrBJv8b yM88aOVAZ6bBXcwv/T/9yhjkq7wqluSgaTYtO6MdnFYvlGJeIqKqNqFLUBmMp3vU5YIZ Dg== Received: from aserv0022.oracle.com (aserv0022.oracle.com [141.146.126.234]) by aserp2120.oracle.com with ESMTP id 2hmgxg3xkf-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Thu, 03 May 2018 23:29:49 +0000 Received: from userv0122.oracle.com (userv0122.oracle.com [156.151.31.75]) by aserv0022.oracle.com (8.14.4/8.14.4) with ESMTP id w43NTmOA023052 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Thu, 3 May 2018 23:29:48 GMT Received: from abhmp0019.oracle.com (abhmp0019.oracle.com [141.146.116.25]) by userv0122.oracle.com (8.14.4/8.14.4) with ESMTP id w43NTlph030995; Thu, 3 May 2018 23:29:47 GMT Received: from monkey.oracle.com (/50.38.38.67) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Thu, 03 May 2018 16:29:47 -0700 From: Mike Kravetz To: linux-mm@kvack.org, linux-kernel@vger.kernel.org, linux-api@vger.kernel.org Cc: Reinette Chatre , Michal Hocko , Christopher Lameter , Guy Shattah , Anshuman Khandual , Michal Nazarewicz , Vlastimil Babka , David Nellans , Laura Abbott , Pavel Machek , Dave Hansen , Andrew Morton , Mike Kravetz Subject: [PATCH v2 2/4] mm: check for proper migrate type during isolation Date: Thu, 3 May 2018 16:29:33 -0700 Message-Id: <20180503232935.22539-3-mike.kravetz@oracle.com> X-Mailer: git-send-email 2.13.6 In-Reply-To: <20180503232935.22539-1-mike.kravetz@oracle.com> References: <20180503232935.22539-1-mike.kravetz@oracle.com> X-Proofpoint-Virus-Version: vendor=nai engine=5900 definitions=8882 signatures=668698 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 suspectscore=2 malwarescore=0 phishscore=0 bulkscore=0 spamscore=0 mlxscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1711220000 definitions=main-1805030204 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org The routine start_isolate_page_range and alloc_contig_range have comments saying that migratetype must be either MIGRATE_MOVABLE or MIGRATE_CMA. However, this is not enforced. What is important is that that all pageblocks in the range are of type migratetype. This is because blocks will be set to migratetype on error. Add a boolean argument enforce_migratetype to the routine start_isolate_page_range. If set, it will check that all pageblocks in the range have the passed migratetype. Return -EINVAL is pageblock is wrong type is found in range. A boolean is used for enforce_migratetype as there are two primary users. Contiguous range allocation which wants to enforce migration type checking. Memory offline (hotplug) which is not concerned about type checking. Signed-off-by: Mike Kravetz --- include/linux/page-isolation.h | 8 +++----- mm/memory_hotplug.c | 2 +- mm/page_alloc.c | 17 +++++++++-------- mm/page_isolation.c | 40 ++++++++++++++++++++++++++++++---------- 4 files changed, 43 insertions(+), 24 deletions(-) diff --git a/include/linux/page-isolation.h b/include/linux/page-isolation.h index 4ae347cbc36d..2ab7e5a399ac 100644 --- a/include/linux/page-isolation.h +++ b/include/linux/page-isolation.h @@ -38,8 +38,6 @@ int move_freepages_block(struct zone *zone, struct page *page, /* * Changes migrate type in [start_pfn, end_pfn) to be MIGRATE_ISOLATE. - * If specified range includes migrate types other than MOVABLE or CMA, - * this will fail with -EBUSY. * * For isolating all pages in the range finally, the caller have to * free all pages in the range. test_page_isolated() can be used for @@ -47,11 +45,11 @@ int move_freepages_block(struct zone *zone, struct page *page, */ int start_isolate_page_range(unsigned long start_pfn, unsigned long end_pfn, - unsigned migratetype, bool skip_hwpoisoned_pages); + unsigned migratetype, bool skip_hwpoisoned_pages, + bool enforce_migratetype); /* - * Changes MIGRATE_ISOLATE to MIGRATE_MOVABLE. - * target range is [start_pfn, end_pfn) + * Changes MIGRATE_ISOLATE to migratetype for range [start_pfn, end_pfn) */ int undo_isolate_page_range(unsigned long start_pfn, unsigned long end_pfn, diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index f74826cdceea..ebc1c8c330e2 100644 --- a/mm/memory_hotplug.c +++ b/mm/memory_hotplug.c @@ -1601,7 +1601,7 @@ static int __ref __offline_pages(unsigned long start_pfn, /* set above range as isolated */ ret = start_isolate_page_range(start_pfn, end_pfn, - MIGRATE_MOVABLE, true); + MIGRATE_MOVABLE, true, false); if (ret) return ret; diff --git a/mm/page_alloc.c b/mm/page_alloc.c index 0fd5e8e2456e..cb1a5e0be6ee 100644 --- a/mm/page_alloc.c +++ b/mm/page_alloc.c @@ -7787,9 +7787,10 @@ static int __alloc_contig_migrate_range(struct compact_control *cc, * alloc_contig_range() -- tries to allocate given range of pages * @start: start PFN to allocate * @end: one-past-the-last PFN to allocate - * @migratetype: migratetype of the underlaying pageblocks (either - * #MIGRATE_MOVABLE or #MIGRATE_CMA). All pageblocks - * in range must have the same migratetype and it must + * @migratetype: migratetype of the underlaying pageblocks. All + * pageblocks in range must have the same migratetype. + * migratetype is typically MIGRATE_MOVABLE or + * MIGRATE_CMA, but this is not a requirement. * be either of the two. * @gfp_mask: GFP mask to use during compaction * @@ -7840,15 +7841,15 @@ int alloc_contig_range(unsigned long start, unsigned long end, * allocator removing them from the buddy system. This way * page allocator will never consider using them. * - * This lets us mark the pageblocks back as - * MIGRATE_CMA/MIGRATE_MOVABLE so that free pages in the - * aligned range but not in the unaligned, original range are - * put back to page allocator so that buddy can use them. + * This lets us mark the pageblocks back as their original + * migrate type so that free pages in the aligned range but + * not in the unaligned, original range are put back to page + * allocator so that buddy can use them. */ ret = start_isolate_page_range(pfn_max_align_down(start), pfn_max_align_up(end), migratetype, - false); + false, true); if (ret) return ret; diff --git a/mm/page_isolation.c b/mm/page_isolation.c index 43e085608846..472191cc1909 100644 --- a/mm/page_isolation.c +++ b/mm/page_isolation.c @@ -16,7 +16,8 @@ #include static int set_migratetype_isolate(struct page *page, int migratetype, - bool skip_hwpoisoned_pages) + bool skip_hwpoisoned_pages, + bool enforce_migratetype) { struct zone *zone; unsigned long flags, pfn; @@ -36,6 +37,17 @@ static int set_migratetype_isolate(struct page *page, int migratetype, if (is_migrate_isolate_page(page)) goto out; + /* + * If requested, check migration type of pageblock and make sure + * it matches migratetype + */ + if (enforce_migratetype) { + if (get_pageblock_migratetype(page) != migratetype) { + ret = -EINVAL; + goto out; + } + } + pfn = page_to_pfn(page); arg.start_pfn = pfn; arg.nr_pages = pageblock_nr_pages; @@ -167,14 +179,16 @@ __first_valid_page(unsigned long pfn, unsigned long nr_pages) * to be MIGRATE_ISOLATE. * @start_pfn: The lower PFN of the range to be isolated. * @end_pfn: The upper PFN of the range to be isolated. - * @migratetype: migrate type to set in error recovery. + * @migratetype: migrate type of all blocks in range. * * Making page-allocation-type to be MIGRATE_ISOLATE means free pages in * the range will never be allocated. Any free pages and pages freed in the * future will not be allocated again. * * start_pfn/end_pfn must be aligned to pageblock_order. - * Return 0 on success and -EBUSY if any part of range cannot be isolated. + * Return 0 on success or error returned by set_migratetype_isolate. Typical + * errors are -EBUSY if any part of range cannot be isolated or -EINVAL if + * any page block is not of migratetype. * * There is no high level synchronization mechanism that prevents two threads * from trying to isolate overlapping ranges. If this happens, one thread @@ -185,11 +199,13 @@ __first_valid_page(unsigned long pfn, unsigned long nr_pages) * prevents two threads from simultaneously working on overlapping ranges. */ int start_isolate_page_range(unsigned long start_pfn, unsigned long end_pfn, - unsigned migratetype, bool skip_hwpoisoned_pages) + unsigned migratetype, bool skip_hwpoisoned_pages, + bool enforce_migratetype) { unsigned long pfn; unsigned long undo_pfn; struct page *page; + int ret = 0; BUG_ON(!IS_ALIGNED(start_pfn, pageblock_nr_pages)); BUG_ON(!IS_ALIGNED(end_pfn, pageblock_nr_pages)); @@ -198,13 +214,17 @@ int start_isolate_page_range(unsigned long start_pfn, unsigned long end_pfn, pfn < end_pfn; pfn += pageblock_nr_pages) { page = __first_valid_page(pfn, pageblock_nr_pages); - if (page && - set_migratetype_isolate(page, migratetype, skip_hwpoisoned_pages)) { - undo_pfn = pfn; - goto undo; + if (page) { + ret = set_migratetype_isolate(page, migratetype, + skip_hwpoisoned_pages, + enforce_migratetype); + if (ret) { + undo_pfn = pfn; + goto undo; + } } } - return 0; + return ret; undo: for (pfn = start_pfn; pfn < undo_pfn; @@ -215,7 +235,7 @@ int start_isolate_page_range(unsigned long start_pfn, unsigned long end_pfn, unset_migratetype_isolate(page, migratetype); } - return -EBUSY; + return ret; } /* -- 2.13.6