Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp1091521ybl; Tue, 13 Aug 2019 07:15:30 -0700 (PDT) X-Google-Smtp-Source: APXvYqx0YuNCDG58293+p4PT0vE95YJu7cRzLiaRzzhoGDD8zXpLC8GjXCUdAmTveVhJyiJlSHxb X-Received: by 2002:a63:c118:: with SMTP id w24mr35196361pgf.347.1565705730281; Tue, 13 Aug 2019 07:15:30 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1565705730; cv=none; d=google.com; s=arc-20160816; b=Ecfg16HRADsi++qoetBf4P3qPvcIWx5GgSnBueloudaVUf8G6SykvWpfKHsDorymNS 51iEG3ohkySSv6d18+hsq5cGi60r4mZz2ZY8FVgu558ARTzM99DIBB53D1wwkzdv6RN4 ShI6lk0gMzg6qZPRhQoRps8FTLfXEblhi+p6NE8CAojiyu2oGqyYcfRIAsDWnqsjWj5q vl4Tb/ZHB29SrqW4CAzTbmERhskjSBZ+YDnVXdc+7iaCixnESwytBfvaxtK1jAKfnh/e oVBHdXZPOFxGQkm6bgOIVQ+38oSBFDQq98ltG7b2mhsm+XCnQxoOqcjjv2vNR4r/7BNF ELqw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=vuv+bpJjnpDF0pJiBa4mjxHiSQFmoTFcU68PEfUXl0I=; b=XRXobHP969Iu5TmHFdw10B2qYL8KzKofy4q5olxtktiRh5Sa6h5jAdvPdbQRPUZlj/ 9E22r1wZq8R/YFEBnAhVKtu5HDrZ1cLYtckVdaMOqMlD0doRqyJJN5evgg2bh/N31JM3 e7Szxni7tj74JoI0ay/kzpTSDFZqzMPsc8oa/iUg7j1ljWcV4RkZDJs3cjBJxsM1Gnql LsSPboiNBfN0mFq0+Ho+rZgoa8WCkxONgaqoHRKljvM9tO167wWyy90BEqNbFZURYkyO B06w/DYBQGLina0o9tTxD35bBKKSpYBORSo6fVkxu/6QKTWkWkJ6yoLaW8/5UAKUkmvk 3J0g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id l2si66149531pff.221.2019.08.13.07.15.12; Tue, 13 Aug 2019 07:15:30 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729476AbfHMOOh (ORCPT + 99 others); Tue, 13 Aug 2019 10:14:37 -0400 Received: from mx2.suse.de ([195.135.220.15]:39100 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1727682AbfHMOOh (ORCPT ); Tue, 13 Aug 2019 10:14:37 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 3F341AC26; Tue, 13 Aug 2019 14:14:34 +0000 (UTC) Date: Tue, 13 Aug 2019 16:14:32 +0200 From: Michal Hocko To: Joel Fernandes Cc: Andrew Morton , linux-kernel@vger.kernel.org, Alexey Dobriyan , Borislav Petkov , Brendan Gregg , Catalin Marinas , Christian Hansen , dancol@google.com, fmayer@google.com, "H. Peter Anvin" , Ingo Molnar , Jonathan Corbet , Kees Cook , kernel-team@android.com, linux-api@vger.kernel.org, linux-doc@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-mm@kvack.org, Mike Rapoport , minchan@kernel.org, namhyung@google.com, paulmck@linux.ibm.com, Robin Murphy , Roman Gushchin , Stephen Rothwell , surenb@google.com, Thomas Gleixner , tkjos@google.com, Vladimir Davydov , Vlastimil Babka , Will Deacon Subject: Re: [PATCH v5 1/6] mm/page_idle: Add per-pid idle page tracking using virtual index Message-ID: <20190813141432.GL17933@dhcp22.suse.cz> References: <20190807171559.182301-1-joel@joelfernandes.org> <20190807130402.49c9ea8bf144d2f83bfeb353@linux-foundation.org> <20190807204530.GB90900@google.com> <20190807135840.92b852e980a9593fe91fbf59@linux-foundation.org> <20190807213105.GA14622@google.com> <20190808080044.GA18351@dhcp22.suse.cz> <20190812145620.GB224541@google.com> <20190813091430.GE17933@dhcp22.suse.cz> <20190813135152.GC258732@google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190813135152.GC258732@google.com> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 13-08-19 09:51:52, Joel Fernandes wrote: > On Tue, Aug 13, 2019 at 11:14:30AM +0200, Michal Hocko wrote: > > On Mon 12-08-19 10:56:20, Joel Fernandes wrote: > > > On Thu, Aug 08, 2019 at 10:00:44AM +0200, Michal Hocko wrote: > > > > On Wed 07-08-19 17:31:05, Joel Fernandes wrote: > > > > > On Wed, Aug 07, 2019 at 01:58:40PM -0700, Andrew Morton wrote: > > > > > > On Wed, 7 Aug 2019 16:45:30 -0400 Joel Fernandes wrote: > > > > > > > > > > > > > On Wed, Aug 07, 2019 at 01:04:02PM -0700, Andrew Morton wrote: > > > > > > > > On Wed, 7 Aug 2019 13:15:54 -0400 "Joel Fernandes (Google)" wrote: > > > > > > > > > > > > > > > > > In Android, we are using this for the heap profiler (heapprofd) which > > > > > > > > > profiles and pin points code paths which allocates and leaves memory > > > > > > > > > idle for long periods of time. This method solves the security issue > > > > > > > > > with userspace learning the PFN, and while at it is also shown to yield > > > > > > > > > better results than the pagemap lookup, the theory being that the window > > > > > > > > > where the address space can change is reduced by eliminating the > > > > > > > > > intermediate pagemap look up stage. In virtual address indexing, the > > > > > > > > > process's mmap_sem is held for the duration of the access. > > > > > > > > > > > > > > > > So is heapprofd a developer-only thing? Is heapprofd included in > > > > > > > > end-user android loads? If not then, again, wouldn't it be better to > > > > > > > > make the feature Kconfigurable so that Android developers can enable it > > > > > > > > during development then disable it for production kernels? > > > > > > > > > > > > > > Almost all of this code is already configurable with > > > > > > > CONFIG_IDLE_PAGE_TRACKING. If you disable it, then all of this code gets > > > > > > > disabled. > > > > > > > > > > > > > > Or are you referring to something else that needs to be made configurable? > > > > > > > > > > > > Yes - the 300+ lines of code which this patchset adds! > > > > > > > > > > > > The impacted people will be those who use the existing > > > > > > idle-page-tracking feature but who will not use the new feature. I > > > > > > guess we can assume this set is small... > > > > > > > > > > Yes, I think this set should be small. The code size increase of page_idle.o > > > > > is from ~1KB to ~2KB. Most of the extra space is consumed by > > > > > page_idle_proc_generic() function which this patch adds. I don't think adding > > > > > another CONFIG option to disable this while keeping existing > > > > > CONFIG_IDLE_PAGE_TRACKING enabled, is worthwhile but I am open to the > > > > > addition of such an option if anyone feels strongly about it. I believe that > > > > > once this patch is merged, most like this new interface being added is what > > > > > will be used more than the old interface (for some of the usecases) so it > > > > > makes sense to keep it alive with CONFIG_IDLE_PAGE_TRACKING. > > > > > > > > I would tend to agree with Joel here. The functionality falls into an > > > > existing IDLE_PAGE_TRACKING config option quite nicely. If there really > > > > are users who want to save some space and this is standing in the way > > > > then they can easily add a new config option with some justification so > > > > the savings are clear. Without that an additional config simply adds to > > > > the already existing configurability complexity and balkanization. > > > > > > Michal, Andrew, Minchan, > > > > > > Would you have any other review comments on the v5 series? This is just a new > > > interface that does not disrupt existing users of the older page-idle > > > tracking, so as such it is a safe change (as in, doesn't change existing > > > functionality except for the draining bug fix). > > > > I hope to find some more time to finish the review but let me point out > > that "it's new it is regression safe" is not really a great argument for > > a new user visible API. > > Actually, I think you misunderstood me and took it out of context. I never > intended to say "it is regression safe". I meant to say it is "low risk", as > in that in all likelihood should not be hurting *existing users* of the *old > interface*. Also as you know, it has been tested. Yeah, misreading on my end. > > If the API is flawed then this is likely going > > to kick us later and will be hard to fix. I am still not convinced about > > the swap part of the thing TBH. > > Ok, then let us discuss it. As I mentioned before, without this we lose the > access information due to MADVISE or swapping. Minchan and Konstantin both > suggested it that's why I also added it (other than me also realizing that it > is neeed). I have described my concerns about the general idle bit behavior after unmapping pointing to discrepancy with !anon pages. And I believe those haven't been addressed yet. Besides that I am still not seeing any description of the usecase that would suffer from the lack of the functionality in changelogs. -- Michal Hocko SUSE Labs