From: Yang Shi
To: mhocko@suse.com, richardw.yang@linux.intel.com, willy@infradead.org, akpm@linux-foundation.org
Cc: yang.shi@linux.alibaba.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org, stable@vger.kernel.org
Subject: [v4 PATCH] mm: move_pages: report the number of non-attempted pages
Date: Tue, 28 Jan 2020 05:28:47 +0800
Message-Id: <1580160527-109104-1-git-send-email-yang.shi@linux.alibaba.com>
X-Mailing-List: linux-kernel@vger.kernel.org

Since commit a49bd4d71637 ("mm, numa: rework do_pages_move"),
the semantics of move_pages() have changed to return
the number of non-migrated pages if the failure was due to non-fatal
reasons (usually a busy page).

This was an unintentional change that went unnoticed except by the LTP
tests, which check for the documented behavior.

There are two ways to deal with this change.  We could go back to the
original behavior and return -EAGAIN whenever migrate_pages() is not
able to migrate pages due to non-fatal reasons.  Alternatively, we could
keep the changed semantics and extend the move_pages() documentation to
clarify that it returns -errno on invalid input or when migration simply
cannot succeed (e.g. -ENOMEM, -EBUSY), and otherwise the number of pages
that could not be migrated due to ephemeral reasons (e.g. the page is
pinned or locked for other reasons).

This patch implements the second option because this behavior has been
in place for some time without anybody complaining, and new users may
already depend on it.  It also makes error handling slightly easier,
since the caller knows that it is worth retrying when err > 0.

However, with the new semantics the operation is aborted immediately
when migration fails due to ephemeral reasons, so the number of
non-attempted pages needs to be included in the return value too.

Fixes: a49bd4d71637 ("mm, numa: rework do_pages_move")
Suggested-by: Michal Hocko
Acked-by: Michal Hocko
Cc: Wei Yang
Cc: [4.17+]
Signed-off-by: Yang Shi
---
v4: Fixed some typo and grammar errors caught by Willy
v3: Rephrased the commit log per Michal and added Michal's Acked-by
v2: Rebased on top of the latest mainline kernel per Andrew

 mm/migrate.c | 25 +++++++++++++++++++++++--
 1 file changed, 23 insertions(+), 2 deletions(-)

diff --git a/mm/migrate.c b/mm/migrate.c
index 86873b6..2530860 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -1627,8 +1627,19 @@ static int do_pages_move(struct mm_struct *mm, nodemask_t task_nodes,
 			start = i;
 		} else if (node != current_node) {
 			err = do_move_pages_to_node(mm, &pagelist, current_node);
-			if (err)
+			if (err) {
+				/*
+				 * Positive err means the number of failed
+				 * pages to migrate.  Since we are going to
+				 * abort and return the number of non-migrated
+				 * pages, so need to include the rest of the
+				 * nr_pages that have not been attempted as
+				 * well.
+				 */
+				if (err > 0)
+					err += nr_pages - i - 1;
 				goto out;
+			}
 			err = store_status(status, start, current_node, i - start);
 			if (err)
 				goto out;
@@ -1659,8 +1670,11 @@ static int do_pages_move(struct mm_struct *mm, nodemask_t task_nodes,
 			goto out_flush;
 
 		err = do_move_pages_to_node(mm, &pagelist, current_node);
-		if (err)
+		if (err) {
+			if (err > 0)
+				err += nr_pages - i - 1;
 			goto out;
+		}
 		if (i > start) {
 			err = store_status(status, start, current_node, i - start);
 			if (err)
@@ -1674,6 +1688,13 @@ static int do_pages_move(struct mm_struct *mm, nodemask_t task_nodes,
 
 	/* Make sure we do not overwrite the existing error */
 	err1 = do_move_pages_to_node(mm, &pagelist, current_node);
+	/*
+	 * Don't have to report non-attempted pages here since:
+	 *     - If the above loop is done gracefully all pages have been
+	 *       attempted.
+	 *     - If the above loop is aborted it means a fatal error
+	 *       happened, should return ret.
+	 */
 	if (!err1)
 		err1 = store_status(status, start, current_node, i - start);
 	if (!err)
-- 
1.8.3.1
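
For illustration only, the return-value accounting in the patch can be
modelled in userspace roughly as below.  This is a sketch, not kernel
code: report_unmigrated() is a hypothetical helper name that does not
exist in mm/migrate.c, and its parameters simply stand in for the err,
nr_pages and i variables of do_pages_move().

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical userspace model of the accounting added by the patch.
 *
 * err      - result of migrating the current batch: a negative errno,
 *            0 on success, or the number of pages that failed to migrate
 * nr_pages - total number of pages passed to move_pages()
 * i        - loop index at the point where the batch was flushed
 */
long report_unmigrated(long err, size_t nr_pages, size_t i)
{
	assert(i < nr_pages);

	/* Fatal errors (negative) and success (0) pass through unchanged. */
	if (err > 0)
		err += (long)(nr_pages - i - 1);	/* same expression as the patch */

	return err;
}
```

With nr_pages = 10, a batch flushed at i = 4 and 2 pages failing to
migrate, the sketch returns 2 + (10 - 4 - 1) = 7, while a fatal error
such as -ENOMEM is returned as is.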