Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp4097129ybl; Mon, 27 Jan 2020 16:36:05 -0800 (PST) X-Google-Smtp-Source: APXvYqwo626Cm3iJD7YcqKKNMA1ozQXaWE4GDW5A35321oHrJXLiRIFcLxwZkBkTg1I8KSN/WUfn X-Received: by 2002:aca:f584:: with SMTP id t126mr1239119oih.132.1580171765385; Mon, 27 Jan 2020 16:36:05 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1580171765; cv=none; d=google.com; s=arc-20160816; b=X5Ot7i9iJoRf6um8gJ6woRb6XtwGbK/ClHbvuWMB9Yk8IUlzHeeW08miO7pntcXYFO B1aQOOhrMUeKzq9HLtUVqEx1oX7YQBZYMGXgzqzwZDSkmIlwpKsXIzK77v0x1ZYPgdEi zXmHPWkEX8OzBf9GcR5uwGT6I36U6C6/0peJPzlvBEpT7MhayLK+sjroogn/G0emoRSI tDJnM7q/HjnL23/oHsBwleIrqfCnPFGqEJgP626TX2+WsOmZ1dLqe+vYxR7dwPbwd/nv NsGj9SDI7E9ETHOoMN0pJckgoXTR1yeqvpcthbsj2IycVpVLREwVQkeubNgZKsW53ydF GYBQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:reply-to:message-id :subject:cc:to:from:date; bh=zBkO8OqEySP/tPjcXAHDsthLp3x1073lq1U3H8WB3J8=; b=iRkqWh2NAUt1//M02p5huMrG+B7b49HTQcn32Rz9PXbPhSQfhE+gzwwj/c0G78GqzP mHw5WYhdfFUt7ROwe9dEME9cnC/gMSizgCpLIfPtRbhaFvAA0oVK8vJAi13i9AzmBUm4 9DFEnjMYU+kbq7vB0zIcKBH9RQPWhn3pGovZrmNa79I9IPqMoacoYEx/Ys4eSLndx6is tqpR3o6IHS+0WKJ4KDC+Ir0LX7ZjaJk6ek9LmOJKlX5FOo28VftvkDinuMSQ3b8+GuI/ IOjaVyBPeAvExyWEvJASwNZ9YPSLtALuoSid6RcR0p7MjEWbqHq3q4wnMbDVL+TpCwdQ tBlA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r8si2244500ota.288.2020.01.27.16.35.47; Mon, 27 Jan 2020 16:36:05 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726191AbgA1Aeb (ORCPT + 99 others); Mon, 27 Jan 2020 19:34:31 -0500 Received: from mga11.intel.com ([192.55.52.93]:59246 "EHLO mga11.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725955AbgA1Aeb (ORCPT ); Mon, 27 Jan 2020 19:34:31 -0500 X-Amp-Result: UNKNOWN X-Amp-Original-Verdict: FILE UNKNOWN X-Amp-File-Uploaded: False Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by fmsmga102.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 27 Jan 2020 16:34:29 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.70,371,1574150400"; d="scan'208";a="261263777" Received: from richard.sh.intel.com (HELO localhost) ([10.239.159.54]) by fmsmga002.fm.intel.com with ESMTP; 27 Jan 2020 16:34:27 -0800 Date: Tue, 28 Jan 2020 08:34:40 +0800 From: Wei Yang To: Yang Shi Cc: mhocko@suse.com, richardw.yang@linux.intel.com, willy@infradead.org, akpm@linux-foundation.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, stable@vger.kernel.org Subject: Re: [v4 PATCH] mm: move_pages: report the number of non-attempted pages Message-ID: <20200128003440.GB20624@richard> Reply-To: Wei Yang References: <1580160527-109104-1-git-send-email-yang.shi@linux.alibaba.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1580160527-109104-1-git-send-email-yang.shi@linux.alibaba.com> User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Jan 28, 2020 at 05:28:47AM +0800, Yang Shi wrote: >Since commit a49bd4d71637 ("mm, numa: rework do_pages_move"), >the semantic of move_pages() has changed to return the number of >non-migrated pages if they were result of a non-fatal reasons (usually a >busy page). This was an unintentional change that hasn't been noticed >except for LTP tests which checked for the documented behavior. > >There are two ways to go around this change. We can even get back to the >original behavior and return -EAGAIN whenever migrate_pages is not able >to migrate pages due to non-fatal reasons. Another option would be to >simply continue with the changed semantic and extend move_pages >documentation to clarify that -errno is returned on an invalid input or >when migration simply cannot succeed (e.g. -ENOMEM, -EBUSY) or the >number of pages that couldn't have been migrated due to ephemeral >reasons (e.g. page is pinned or locked for other reasons). > >This patch implements the second option because this behavior is in >place for some time without anybody complaining and possibly new users >depending on it. Also it allows to have a slightly easier error handling >as the caller knows that it is worth to retry when err > 0. > >But since the new semantic would be aborted immediately if migration is >failed due to ephemeral reasons, need include the number of non-attempted >pages in the return value too. > >Fixes: a49bd4d71637 ("mm, numa: rework do_pages_move") >Suggested-by: Michal Hocko >Acked-by: Michal Hocko >Cc: Wei Yang >Cc: [4.17+] >Signed-off-by: Yang Shi Reviewed-by: Wei Yang >--- >v4: Fixed some typo and grammar errors caught by Willy >v3: Rephrased the commit log per Michal and added Michal's Acked-by >v2: Rebased on top of the latest mainline kernel per Andrew > > mm/migrate.c | 25 +++++++++++++++++++++++-- > 1 file changed, 23 insertions(+), 2 deletions(-) > >diff --git a/mm/migrate.c b/mm/migrate.c >index 86873b6..2530860 100644 >--- a/mm/migrate.c >+++ b/mm/migrate.c >@@ -1627,8 +1627,19 @@ static int do_pages_move(struct mm_struct *mm, nodemask_t task_nodes, > start = i; > } else if (node != current_node) { > err = do_move_pages_to_node(mm, &pagelist, current_node); >- if (err) >+ if (err) { >+ /* >+ * Positive err means the number of failed >+ * pages to migrate. Since we are going to >+ * abort and return the number of non-migrated >+ * pages, so need to incude the rest of the >+ * nr_pages that have not been attempted as >+ * well. >+ */ >+ if (err > 0) >+ err += nr_pages - i - 1; > goto out; >+ } > err = store_status(status, start, current_node, i - start); > if (err) > goto out; >@@ -1659,8 +1670,11 @@ static int do_pages_move(struct mm_struct *mm, nodemask_t task_nodes, > goto out_flush; > > err = do_move_pages_to_node(mm, &pagelist, current_node); >- if (err) >+ if (err) { >+ if (err > 0) >+ err += nr_pages - i - 1; > goto out; >+ } > if (i > start) { > err = store_status(status, start, current_node, i - start); > if (err) >@@ -1674,6 +1688,13 @@ static int do_pages_move(struct mm_struct *mm, nodemask_t task_nodes, > > /* Make sure we do not overwrite the existing error */ > err1 = do_move_pages_to_node(mm, &pagelist, current_node); >+ /* >+ * Don't have to report non-attempted pages here since: >+ * - If the above loop is done gracefully all pages have been >+ * attempted. >+ * - If the above loop is aborted it means a fatal error >+ * happened, should return ret. >+ */ > if (!err1) > err1 = store_status(status, start, current_node, i - start); > if (!err) >-- >1.8.3.1 -- Wei Yang Help you, Help me