Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752742AbcJNIiS (ORCPT ); Fri, 14 Oct 2016 04:38:18 -0400 Received: from sci-ig2.spreadtrum.com ([222.66.158.135]:16945 "EHLO SHSQR01.spreadtrum.com" rhost-flags-OK-FAIL-OK-OK) by vger.kernel.org with ESMTP id S1752437AbcJNIiA (ORCPT ); Fri, 14 Oct 2016 04:38:00 -0400 Date: Fri, 14 Oct 2016 16:32:19 +0800 From: Ming Ling To: Michal Hocko CC: , , , , , , , , , , , , , , , , , , , , , Subject: Re: [PATCH v2] mm: exclude isolated non-lru pages from NR_ISOLATED_ANON or NR_ISOLATED_FILE. Message-ID: <20161014083219.GA20260@spreadtrum.com> References: <1476340749-13281-1-git-send-email-ming.ling@spreadtrum.com> <20161013080936.GG21678@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20161013080936.GG21678@dhcp22.suse.cz> User-Agent: Mutt/1.5.21 (2010-09-15) X-MAIL: SHSQR01.spreadtrum.com u9E8bQFw086620 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 7073 Lines: 183 On 四, 10月 13, 2016 at 10:09:37上午 +0200, Michal Hocko wrote: Hello, > On Thu 13-10-16 14:39:09, ming.ling wrote: > > From: Ming Ling > > > > Non-lru pages don't belong to any lru, so counting them to > > NR_ISOLATED_ANON or NR_ISOLATED_FILE doesn't make any sense. > > It may misguide functions such as pgdat_reclaimable_pages and > > too_many_isolated. > > That doesn't make much sense to me. I guess you wanted to say something > like > " > Accounting non-lru pages isolated for migration during pfn walk to > NR_ISOLATED_{ANON,FILE} doesn't make any sense and it can misguide > heuristics based on those counters such as pgdat_reclaimable_pages resp. > too_many_isolated. Note that __alloc_contig_migrate_range can isolate > a lot of pages at once. > " Yes,your understanding is right, and your description is clearer than mine. Do your mind if i borrow it as a comment of this patch in next version? > > On mobile devices such as 512M ram android Phone, it may use > > a big zram swap. In some cases zram(zsmalloc) uses too many > > non-lru pages, such as: > > MemTotal: 468148 kB > > Normal free:5620kB > > Free swap:4736kB > > Total swap:409596kB > > ZRAM: 164616kB(zsmalloc non-lru pages) > > active_anon:60700kB > > inactive_anon:60744kB > > active_file:34420kB > > inactive_file:37532kB > > I assume those zsmalloc pages are migrateable and that is the problem? > Please state that explicitly so that even people not familiar with > zsmalloc understand the motivation. Yes, since Minchan Kim had committed ‘mm: migrate: support non-lru movable page migration’, those zsmalloc pages are migrateable now. And i will state that explicitly in next version. > > > More non-lru pages which used by zram for swap, it influences > > pgdat_reclaimable_pages and too_many_isolated more. > > It would be good to mention what would be a visible effect of this. > "If the NR_ISOLATED_* is too large then the direct reclaim might get > throttled prematurely inducing longer allocation latencies without any > strong reason." > I will detail the effect of counting so many non-lru pages into NR_ISOLATED_{ANON,FILE} such as: 'In function shrink_inactive_list, if there are too many isolated pages,it will wait for a moment. So If we miscounting large number non-lru pages into NR_ISOLATED_{ANON,FILE}, direct reclaim might getthrottled prematurely inducing longer allocation latencies without any strong reason. Actually there is no need to take non-lru pages into account in shrink_inactive_list which just deals with lru pages. In function pgdat_reclaimable_pages, you had considered isolated pages in zone_reclaimable_pages. So miscounting non-lru pages into NR_ISOLATED_{ANON,FILE} also larger zone_reclaimable_pages and will lead to a more optimistic zone_reclaimable judgement. ' > > This patch excludes isolated non-lru pages from NR_ISOLATED_ANON > > or NR_ISOLATED_FILE to ensure their counts are right. > > But this patch doesn't do that. It just relies on __PageMovable. It is > true that all LRU pages should be movable (well except for > NR_UNEVICTABLE in certain configurations) but is it true that all > movable pages are on the LRU list? > I don't think so. In commit bda807d4 'mm: migrate: support non-lru movable page migration', Minchan Kim point out : 'For testing of non-lru movable page, VM supports __PageMovable function. However, it doesn't guarantee to identify non-lru movable page because page->mapping field is unified with other variables in struct page. As well, if driver releases the page after isolation by VM, page->mapping doesn't have stable value although it has PAGE_MAPPING_MOVABLE (Look at __ClearPageMovable). But __PageMovable is cheap to catch whether page is LRU or non-lru movable once the page has been isolated. Because LRU pages never can have PAGE_MAPPING_MOVABLE in page->mapping. It is also good for just peeking to test non-lru movable pages before more expensive checking with lock_page in pfn scanning to select victim.'. And he uses __PageMovable to judge whether a isolated page is a lru page such as: void putback_movable_pages(struct list_head *l) { ...... /* * We isolated non-lru movable page so here we can use * __PageMovable because LRU page's mapping cannot have * PAGE_MAPPING_MOVABLE. */ if (unlikely(__PageMovable(page))) { VM_BUG_ON_PAGE(!PageIsolated(page), page); lock_page(page); if (PageMovable(page)) putback_movable_page(page); else __ClearPageIsolated(page); unlock_page(page); put_page(page); } else { putback_lru_page(page); } } > Why don't you simply mimic what shrink_inactive_list does? Aka count the > number of isolated pages and then account them when appropriate? > I think i am correcting clearly wrong part. So, there is no need to describe it too detailed. It's a misunderstanding, and i will add more comments as you suggest. I am looking forward to more suggestions from you. Thank you very much. > > Signed-off-by: Ming ling > > --- > > mm/compaction.c | 6 ++++-- > > mm/migrate.c | 9 +++++---- > > 2 files changed, 9 insertions(+), 6 deletions(-) > > > > diff --git a/mm/compaction.c b/mm/compaction.c > > index 0409a4a..ed4c553 100644 > > --- a/mm/compaction.c > > +++ b/mm/compaction.c > > @@ -643,8 +643,10 @@ static void acct_isolated(struct zone *zone, struct compact_control *cc) > > if (list_empty(&cc->migratepages)) > > return; > > > > - list_for_each_entry(page, &cc->migratepages, lru) > > - count[!!page_is_file_cache(page)]++; > > + list_for_each_entry(page, &cc->migratepages, lru) { > > + if (likely(!__PageMovable(page))) > > + count[!!page_is_file_cache(page)]++; > > + } > > > > mod_node_page_state(zone->zone_pgdat, NR_ISOLATED_ANON, count[0]); > > mod_node_page_state(zone->zone_pgdat, NR_ISOLATED_FILE, count[1]); > > diff --git a/mm/migrate.c b/mm/migrate.c > > index 99250ae..abe48cc 100644 > > --- a/mm/migrate.c > > +++ b/mm/migrate.c > > @@ -168,8 +168,6 @@ void putback_movable_pages(struct list_head *l) > > continue; > > } > > list_del(&page->lru); > > - dec_node_page_state(page, NR_ISOLATED_ANON + > > - page_is_file_cache(page)); > > /* > > * We isolated non-lru movable page so here we can use > > * __PageMovable because LRU page's mapping cannot have > > @@ -185,6 +183,8 @@ void putback_movable_pages(struct list_head *l) > > unlock_page(page); > > put_page(page); > > } else { > > + dec_node_page_state(page, NR_ISOLATED_ANON + > > + page_is_file_cache(page)); > > putback_lru_page(page); > > } > > } > > @@ -1121,8 +1121,9 @@ static ICE_noinline int unmap_and_move(new_page_t get_new_page, > > * restored. > > */ > > list_del(&page->lru); > > - dec_node_page_state(page, NR_ISOLATED_ANON + > > - page_is_file_cache(page)); > > + if (likely(!__PageMovable(page))) > > + dec_node_page_state(page, NR_ISOLATED_ANON + > > + page_is_file_cache(page)); > > } > > > > /* > > -- > > 1.9.1 > > -- > Michal Hocko > SUSE Labs