Date: Mon, 12 Dec 2016 10:55:52 -0500
From: Johannes Weiner
To: Vlastimil Babka
Cc: Andrew Morton, Mel Gorman, linux-mm@kvack.org,
    linux-kernel@vger.kernel.org, kernel-team@fb.com
Subject: Re: [PATCH] mm: fadvise: avoid expensive remote LRU cache draining after FADV_DONTNEED
Message-ID: <20161212155552.GA7148@cmpxchg.org>
References: <20161210172658.5182-1-hannes@cmpxchg.org> <5cc0eb6f-bede-a34a-522b-e30d06723ffa@suse.cz>
In-Reply-To: <5cc0eb6f-bede-a34a-522b-e30d06723ffa@suse.cz>

On Mon, Dec 12, 2016 at 10:21:24AM +0100, Vlastimil Babka wrote:
> On 12/10/2016 06:26 PM, Johannes Weiner wrote:
> > When FADV_DONTNEED cannot drop all pages in the range, it observes
> > that some pages might still be on per-cpu LRU caches after recent
> > instantiation and so initiates remote calls to all CPUs to flush their
> > local caches. However, in most cases, the fadvise happens from the
> > same context that instantiated the pages, and any pre-LRU pages in the
> > specified range are most likely sitting on the local CPU's LRU cache,
> > and so in many cases this results in unnecessary remote calls, which,
> > in a loaded system, can hold up the fadvise() call significantly.
>
> Got any numbers for this part?

I didn't record it in the extreme case we observed, unfortunately.
We had a slow-to-respond system and noticed it spending seconds in
lru_add_drain_all() after fadvise calls, and this patch came out of
thinking about the code and how we commonly call FADV_DONTNEED.

FWIW, I wrote a silly directory tree walker/searcher that recurses
through /usr to read and FADV_DONTNEED each file it finds. On a 2
socket 40 ht machine, over 1% is spent in lru_add_drain_all(). With
the patch, that cost is gone; the local drain cost shows at 0.09%.

> > Try to avoid the remote call by flushing the local LRU cache before
> > even attempting to invalidate anything. It's a cheap operation, and
> > the local LRU cache is the most likely to hold any pre-LRU pages in
> > the specified fadvise range.
>
> Anyway it looks like things can't be worse after this patch, so...
>
> > Signed-off-by: Johannes Weiner
>
> Acked-by: Vlastimil Babka

Thanks!
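
The walker test described above can be approximated in userspace along
the following lines. This is only a minimal sketch, not the tool that
produced the numbers in this thread: it assumes Linux and Python 3, and
the names read_and_drop() and walk_and_drop() are hypothetical. It reads
each regular file under a root directory and then issues
posix_fadvise(POSIX_FADV_DONTNEED) from the same thread that read it,
which is the calling pattern the patch is aimed at.

```python
import os
import stat


def read_and_drop(path, chunk_size=1 << 20):
    """Read a file to the end, then ask the kernel to drop its cached pages.

    Returns the number of bytes read, or -1 if the file could not be
    opened, read, or advised.
    """
    try:
        fd = os.open(path, os.O_RDONLY)
    except OSError:
        return -1
    total = 0
    try:
        while True:
            chunk = os.read(fd, chunk_size)
            if not chunk:
                break
            total += len(chunk)
        # offset=0 and len=0 cover the whole file. The advice is issued
        # from the same context that just instantiated the pages.
        os.posix_fadvise(fd, 0, 0, os.POSIX_FADV_DONTNEED)
    except OSError:
        return -1
    finally:
        os.close(fd)
    return total


def walk_and_drop(root):
    """Recurse through root; return (files_processed, bytes_read)."""
    files = 0
    nbytes = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                st = os.lstat(path)
            except OSError:
                continue
            # Skip symlinks, FIFOs, sockets and device nodes.
            if not stat.S_ISREG(st.st_mode):
                continue
            n = read_and_drop(path)
            if n >= 0:
                files += 1
                nbytes += n
    return files, nbytes
```

Running walk_and_drop("/usr") under a profiler would be one way to look
for lru_add_drain_all() in the profile; the percentages will depend on
the machine and kernel.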