Date: Thu, 23 Jan 2020 11:27:36 +0800
From: Wei Yang
To: Yang Shi
Cc: mhocko@suse.com, richardw.yang@linux.intel.com,
 akpm@linux-foundation.org, linux-mm@kvack.org,
 linux-kernel@vger.kernel.org, stable@vger.kernel.org
Subject: Re: [v2 PATCH] mm: move_pages: report the number of non-attempted pages
Message-ID: <20200123032736.GA22196@richard>
Reply-To: Wei Yang
References: <1579736331-85494-1-git-send-email-yang.shi@linux.alibaba.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <1579736331-85494-1-git-send-email-yang.shi@linux.alibaba.com>
User-Agent:
 Mutt/1.9.4 (2018-02-28)
X-Mailing-List: linux-kernel@vger.kernel.org

On Thu, Jan 23, 2020 at 07:38:51AM +0800, Yang Shi wrote:
>Since commit a49bd4d71637 ("mm, numa: rework do_pages_move"),
>the semantic of move_pages() was changed to return the number of
>non-migrated pages (failed to migration) and the call would be aborted
>immediately if migrate_pages() returns positive value.  But it didn't
>report the number of pages that we even haven't attempted to migrate.
>So, fix it by including non-attempted pages in the return value.
>

First, do we want to change the semantics of move_pages(2), so that the
return value indicates the number of pages we didn't manage to migrate?

Second, the return value from migrate_pages() doesn't mean the number of
pages we failed to migrate. For example, if -ENOMEM is returned on the
first page, migrate_pages() would return 1, but actually no page was
successfully migrated.

Third, even if migrate_pages() returned the exact number of non-migrated
pages, we can't be sure those non-migrated pages are at the tail of the
list, because in the last case in migrate_pages() it just removes the
page from the list. It could be a page in the middle of the list. Then,
in userspace, how can the return value be used to determine which status
entries are valid? Any page in the list could be the victim.

Sounds like we need to think about this carefully.
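The second and third points can be illustrated with a small userspace model. This is only a sketch of the scenario as described in this reply, not the code in mm/migrate.c: toy_migrate_pages(), struct toy_stats and toy_patched_return() are hypothetical names, and the assumption that an -ENOMEM abort is reported as a count of 1 is taken from the example above rather than checked against the kernel source.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Hypothetical result of a toy migration pass; not a kernel structure. */
struct toy_stats {
	long reported;	/* value the toy migrate_pages() would return */
	long migrated;	/* pages that really moved */
};

/*
 * Toy model of the accounting described in the reply (an assumption, not
 * the kernel implementation): outcome[i] is 0 for a page that migrates
 * and a negative errno for one that fails.  A failed page is counted and
 * the walk continues, except that -ENOMEM stops the walk right away.
 */
struct toy_stats toy_migrate_pages(const int *outcome, size_t nr)
{
	struct toy_stats st = { 0, 0 };

	assert(outcome != NULL || nr == 0);
	for (size_t i = 0; i < nr; i++) {
		if (outcome[i] == 0) {
			st.migrated++;
			continue;
		}
		st.reported++;
		if (outcome[i] == -ENOMEM)
			break;	/* pages after i are never attempted */
	}
	return st;
}

/*
 * The adjustment as written in the quoted patch: a positive err from a
 * batch is topped up with the pages of the request after index i.
 */
long toy_patched_return(long err, long nr_pages, long i)
{
	if (err > 0)
		err += nr_pages - i - 1;
	return err;
}
```

With outcomes { -ENOMEM, 0, 0, 0 } the model reports 1 although none of the four pages moved. With { 0, -EBUSY, 0, 0 } it also reports 1, but the failed page sits in the middle of the list. The return value alone cannot tell these two cases apart, which is the concern raised for userspace.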
>Fixes: a49bd4d71637 ("mm, numa: rework do_pages_move")
>Suggested-by: Michal Hocko
>Cc: Wei Yang
>Cc: [4.17+]
>Signed-off-by: Yang Shi
>---
>v2: Rebased on top of the latest mainline kernel per Andrew
>
> mm/migrate.c | 24 ++++++++++++++++++++++--
> 1 file changed, 22 insertions(+), 2 deletions(-)
>
>diff --git a/mm/migrate.c b/mm/migrate.c
>index 86873b6..9b8eb5d 100644
>--- a/mm/migrate.c
>+++ b/mm/migrate.c
>@@ -1627,8 +1627,18 @@ static int do_pages_move(struct mm_struct *mm, nodemask_t task_nodes,
> 			start = i;
> 		} else if (node != current_node) {
> 			err = do_move_pages_to_node(mm, &pagelist, current_node);
>-			if (err)
>+			if (err) {
>+				/*
>+				 * Positive err means the number of failed
>+				 * pages to migrate.  Since we are going to
>+				 * abort and return the number of non-migrated
>+				 * pages, so need incude the rest of the
>+				 * nr_pages that have not attempted as well.
>+				 */
>+				if (err > 0)
>+					err += nr_pages - i - 1;
> 				goto out;
>+			}
> 			err = store_status(status, start, current_node, i - start);
> 			if (err)
> 				goto out;
>@@ -1659,8 +1669,11 @@ static int do_pages_move(struct mm_struct *mm, nodemask_t task_nodes,
> 			goto out_flush;
>
> 		err = do_move_pages_to_node(mm, &pagelist, current_node);
>-		if (err)
>+		if (err) {
>+			if (err > 0)
>+				err += nr_pages - i - 1;
> 			goto out;
>+		}
> 		if (i > start) {
> 			err = store_status(status, start, current_node, i - start);
> 			if (err)
>@@ -1674,6 +1687,13 @@ static int do_pages_move(struct mm_struct *mm, nodemask_t task_nodes,
>
> 	/* Make sure we do not overwrite the existing error */
> 	err1 = do_move_pages_to_node(mm, &pagelist, current_node);
>+	/*
>+	 * Don't have to report non-attempted pages here since:
>+	 *     - If the above loop is done gracefully there is not non-attempted
>+	 *       page.
>+	 *     - If the above loop is aborted to it means more fatal error
>+	 *       happened, should return err.
>+	 */
> 	if (!err1)
> 		err1 = store_status(status, start, current_node, i - start);
> 	if (!err)
>--
>1.8.3.1

--
Wei Yang
Help you, Help me