Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id D2E4DC64ED6 for ; Wed, 1 Mar 2023 06:19:58 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229748AbjCAGT5 (ORCPT ); Wed, 1 Mar 2023 01:19:57 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50462 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229548AbjCAGTz (ORCPT ); Wed, 1 Mar 2023 01:19:55 -0500 Received: from mga12.intel.com (mga12.intel.com [192.55.52.136]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id AB64B83ED for ; Tue, 28 Feb 2023 22:19:53 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1677651593; x=1709187593; h=from:to:cc:subject:references:date:in-reply-to: message-id:mime-version; bh=f1EvyqOI7iFrXy5SrYy2Qs5gMHTyapwHJlT7PO8ZAYY=; b=kQagKBOF9hI33/l7Vzv2qyADqYDIlvIINo018a/5D2wW7TjLaLzu45pg D/01BmcF/nt4cP1VErvXJ0Kwm9La0y8PbIKFfKOU7nIkFcDnf+SvoW4mC OfqjSsgoxsYUI09e6ToTO8sdG5JHl/rGAhQVRCkH8tuZUqJv8wnuu/kZ+ n7SIK0jFPos5ZrY0aJk6jxXt0U/h/2h00mvlk5Fb/YyDdKSXfeqFP9S5c Fq5+himA4Rcv82I7kEYYo+E1zRsm/nnhDRZkEMgAeg4WUJbziNqvGVpxW LheJSo0us4KVpWpLoev9SIgtjkh45XG5FMGfxTC+fwpoj2Vr30nH2Q0ic A==; X-IronPort-AV: E=McAfee;i="6500,9779,10635"; a="314001789" X-IronPort-AV: E=Sophos;i="5.98,224,1673942400"; d="scan'208";a="314001789" Received: from fmsmga007.fm.intel.com ([10.253.24.52]) by fmsmga106.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 28 Feb 2023 22:19:51 -0800 X-IronPort-AV: E=McAfee;i="6500,9779,10635"; a="676632503" X-IronPort-AV: E=Sophos;i="5.98,224,1673942400"; d="scan'208";a="676632503" Received: from yhuang6-desk2.sh.intel.com (HELO yhuang6-desk2.ccr.corp.intel.com) ([10.238.208.55]) by fmsmga007-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 28 Feb 2023 22:19:47 -0800 From: "Huang, Ying" To: Baolin Wang Cc: Andrew Morton , , , Hugh Dickins , "Xu, Pengfei" , Christoph Hellwig , Stefan Roesch , Tejun Heo , Xin Hao , Zi Yan , Yang Shi , Matthew Wilcox , Mike Kravetz Subject: Re: [PATCH 3/3] migrate_pages: try migrate in batch asynchronously firstly References: <20230224141145.96814-1-ying.huang@intel.com> <20230224141145.96814-4-ying.huang@intel.com> Date: Wed, 01 Mar 2023 14:18:51 +0800 In-Reply-To: (Baolin Wang's message of "Wed, 1 Mar 2023 11:08:26 +0800") Message-ID: <87zg8x9epg.fsf@yhuang6-desk2.ccr.corp.intel.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=ascii Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Baolin Wang writes: > On 2/24/2023 10:11 PM, Huang Ying wrote: >> When we have locked more than one folios, we cannot wait the lock or >> bit (e.g., page lock, buffer head lock, writeback bit) synchronously. >> Otherwise deadlock may be triggered. This make it hard to batch the >> synchronous migration directly. >> This patch re-enables batching synchronous migration via trying to >> migrate in batch asynchronously firstly. And any folios that are >> failed to be migrated asynchronously will be migrated synchronously >> one by one. >> Test shows that this can restore the TLB flushing batching >> performance >> for synchronous migration effectively. >> Signed-off-by: "Huang, Ying" >> Cc: Hugh Dickins >> Cc: "Xu, Pengfei" >> Cc: Christoph Hellwig >> Cc: Stefan Roesch >> Cc: Tejun Heo >> Cc: Xin Hao >> Cc: Zi Yan >> Cc: Yang Shi >> Cc: Baolin Wang >> Cc: Matthew Wilcox >> Cc: Mike Kravetz >> --- >> mm/migrate.c | 65 ++++++++++++++++++++++++++++++++++++++++++++-------- >> 1 file changed, 55 insertions(+), 10 deletions(-) >> diff --git a/mm/migrate.c b/mm/migrate.c >> index 91198b487e49..c17ce5ee8d92 100644 >> --- a/mm/migrate.c >> +++ b/mm/migrate.c >> @@ -1843,6 +1843,51 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page, >> return rc; >> } >> +static int migrate_pages_sync(struct list_head *from, new_page_t >> get_new_page, >> + free_page_t put_new_page, unsigned long private, >> + enum migrate_mode mode, int reason, struct list_head *ret_folios, >> + struct list_head *split_folios, struct migrate_pages_stats *stats) >> +{ >> + int rc, nr_failed = 0; >> + LIST_HEAD(folios); >> + struct migrate_pages_stats astats; >> + >> + memset(&astats, 0, sizeof(astats)); >> + /* Try to migrate in batch with MIGRATE_ASYNC mode firstly */ >> + rc = migrate_pages_batch(from, get_new_page, put_new_page, private, MIGRATE_ASYNC, >> + reason, &folios, split_folios, &astats, >> + NR_MAX_MIGRATE_PAGES_RETRY); >> + stats->nr_succeeded += astats.nr_succeeded; >> + stats->nr_thp_succeeded += astats.nr_thp_succeeded; >> + stats->nr_thp_split += astats.nr_thp_split; >> + if (rc < 0) { >> + stats->nr_failed_pages += astats.nr_failed_pages; >> + stats->nr_thp_failed += astats.nr_thp_failed; >> + list_splice_tail(&folios, ret_folios); >> + return rc; >> + } >> + stats->nr_thp_failed += astats.nr_thp_split; >> + nr_failed += astats.nr_thp_split; >> + /* >> + * Fall back to migrate all failed folios one by one synchronously. All >> + * failed folios except split THPs will be retried, so their failure >> + * isn't counted >> + */ >> + list_splice_tail_init(&folios, from); >> + while (!list_empty(from)) { >> + list_move(from->next, &folios); >> + rc = migrate_pages_batch(&folios, get_new_page, put_new_page, >> + private, mode, reason, ret_folios, >> + split_folios, stats, NR_MAX_MIGRATE_PAGES_RETRY); >> + list_splice_tail_init(&folios, ret_folios); >> + if (rc < 0) >> + return rc; >> + nr_failed += rc; >> + } >> + >> + return nr_failed; >> +} >> + >> /* >> * migrate_pages - migrate the folios specified in a list, to the free folios >> * supplied as the target for the page migration >> @@ -1874,7 +1919,7 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page, >> enum migrate_mode mode, int reason, unsigned int *ret_succeeded) >> { >> int rc, rc_gather; >> - int nr_pages, batch; >> + int nr_pages; >> struct folio *folio, *folio2; >> LIST_HEAD(folios); >> LIST_HEAD(ret_folios); >> @@ -1890,10 +1935,6 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page, >> if (rc_gather < 0) >> goto out; >> - if (mode == MIGRATE_ASYNC) >> - batch = NR_MAX_BATCHED_MIGRATION; >> - else >> - batch = 1; >> again: >> nr_pages = 0; >> list_for_each_entry_safe(folio, folio2, from, lru) { >> @@ -1904,16 +1945,20 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page, >> } >> nr_pages += folio_nr_pages(folio); >> - if (nr_pages >= batch) >> + if (nr_pages >= NR_MAX_BATCHED_MIGRATION) >> break; >> } >> - if (nr_pages >= batch) >> + if (nr_pages >= NR_MAX_BATCHED_MIGRATION) >> list_cut_before(&folios, from, &folio2->lru); >> else >> list_splice_init(from, &folios); >> - rc = migrate_pages_batch(&folios, get_new_page, put_new_page, private, >> - mode, reason, &ret_folios, &split_folios, &stats, >> - NR_MAX_MIGRATE_PAGES_RETRY); >> + if (mode == MIGRATE_ASYNC) >> + rc = migrate_pages_batch(&folios, get_new_page, put_new_page, private, >> + mode, reason, &ret_folios, &split_folios, &stats, >> + NR_MAX_MIGRATE_PAGES_RETRY); >> + else >> + rc = migrate_pages_sync(&folios, get_new_page, put_new_page, private, >> + mode, reason, &ret_folios, &split_folios, &stats); > > For split folios, it seems also reasonable to use migrate_pages_sync() > instead of always using fixed MIGRATE_ASYNC mode? For split folios, we only try to migrate them with minimal effort. Previously, we decrease the retry number from 10 to 1. Now, I think that it's reasonable to change the migration mode to MIGRATE_ASYNC to reduce latency. They have been counted as failure anyway. >> list_splice_tail_init(&folios, &ret_folios); >> if (rc < 0) { >> rc_gather = rc; Best Regards, Huang, Ying