Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752634AbcD2JQv (ORCPT ); Fri, 29 Apr 2016 05:16:51 -0400 Received: from mx2.suse.de ([195.135.220.15]:56920 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751169AbcD2JQs (ORCPT ); Fri, 29 Apr 2016 05:16:48 -0400 Subject: Re: [PATCH 09/14] mm: use compaction feedback for thp backoff conditions To: Michal Hocko References: <1461181647-8039-1-git-send-email-mhocko@kernel.org> <1461181647-8039-10-git-send-email-mhocko@kernel.org> <5721CF7E.9020106@suse.cz> <20160428123545.GG31489@dhcp22.suse.cz> Cc: Andrew Morton , Linus Torvalds , Johannes Weiner , Mel Gorman , David Rientjes , Tetsuo Handa , Joonsoo Kim , Hillf Danton , linux-mm@kvack.org, LKML , Andrea Arcangeli From: Vlastimil Babka Message-ID: <5723267C.1050903@suse.cz> Date: Fri, 29 Apr 2016 11:16:44 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.7.2 MIME-Version: 1.0 In-Reply-To: <20160428123545.GG31489@dhcp22.suse.cz> Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3436 Lines: 68 On 04/28/2016 02:35 PM, Michal Hocko wrote: > On Thu 28-04-16 10:53:18, Vlastimil Babka wrote: >> On 04/20/2016 09:47 PM, Michal Hocko wrote: >>> From: Michal Hocko >>> >>> THP requests skip the direct reclaim if the compaction is either >>> deferred or contended to reduce stalls which wouldn't help the >>> allocation success anyway. These checks are ignoring other potential >>> feedback modes which we have available now. >>> >>> It clearly doesn't make much sense to go and reclaim few pages if the >>> previous compaction has failed. >>> >>> We can also simplify the check by using compaction_withdrawn which >>> checks for both COMPACT_CONTENDED and COMPACT_DEFERRED. This check >>> is however covering more reasons why the compaction was withdrawn. >>> None of them should be a problem for the THP case though. >>> >>> It is safe to back of if we see COMPACT_SKIPPED because that means >>> that compaction_suitable failed and a single round of the reclaim is >>> unlikely to make any difference here. We would have to be close to Hmm this is actually incorrect, as should_continue_reclaim() will keep shrink_zone() going as much as needed for compaction to become enabled, so it doesn't reclaim just SWAP_CLUSTER_MAX. >>> the low watermark to reclaim enough and even then there is no guarantee >>> that the compaction would make any progress while the direct reclaim >>> would have caused the stall. >>> >>> COMPACT_PARTIAL_SKIPPED is slightly different because that means that we >>> have only seen a part of the zone so a retry would make some sense. But >>> it would be a compaction retry not a reclaim retry to perform. We are >>> not doing that and that might indeed lead to situations where THP fails >>> but this should happen only rarely and it would be really hard to >>> measure. >>> >>> Signed-off-by: Michal Hocko >> >> THP's don't compact by default in page fault path anymore, so we don't need >> to restrict them even more. And hopefully we'll replace the >> is_thp_gfp_mask() hack with something better soon, so this might be just >> extra code churn. But I don't feel strongly enough to nack it. > > My main point was to simplify the code and get rid of as much compaction > specific hacks as possible. We might very well drop this later on but it > would be at least less code to grasp through. I do not have any problem > with dropping this but I think this shouldn't collide with other patches > much so reducing the number of lines is worth it. I just realized it also affects khugepaged, and not just THP page faults, so it may potentially cripple THP's completely. My main issue is that the reasons to bail out includes COMPACT_SKIPPED, and for a wrong reason (see the comment above). It also goes against the comment below the noretry label: * High-order allocations do not necessarily loop after direct reclaim * and reclaim/compaction depends on compaction being called after * reclaim so call directly if necessary. Given that THP's are large, I expect reclaim would indeed be quite often necessary before compaction, and the first optimistic async compaction attempt will just return SKIPPED. After this patch, there will be no more reclaim/compaction attempts for THP's, including khugepaged. And given the change of THP page fault defaults, even crippling that path should no longer be necessary. So I would just drop this for now indeed.