Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp3956470yba; Tue, 7 May 2019 09:40:20 -0700 (PDT) X-Google-Smtp-Source: APXvYqxHg4UV6B3jOeSMakv7EzR4SuC0cuqhqJR2htPkxrZwZAtcNdNLA0gbt9Rym3Ky30lq124y X-Received: by 2002:a17:902:4624:: with SMTP id o33mr41120328pld.191.1557247220174; Tue, 07 May 2019 09:40:20 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1557247220; cv=none; d=google.com; s=arc-20160816; b=xZkfH8IaY9oaBYy4z/6KJ9XaFRe4uwHLBKeKIBa/USMj3x15Yye6qvi+cCEter7vZS 111dmHnDPRJqL6royC5cE3pPrDQqKk090lix8BpZavhdbuGyMjimW0wxs9sNErwRqvqi DZjLgpNzzVXzK7d7FLCcfK6Wndcyfkfr5WKbYSuI0aWNERlJUPB2ynrczpHtILnK3ATu qQmBSVGmt1yK0auKKQ5sFsmA/qiZH9c5FjAEiXoYebt/WTvfnyzqDWNWSgyvFMuxrDIR 1aFiM33IBV0YYr3yjyda28Gd9HkRouZnnAgC3xq7UfaRhF0/EcUxg80Gn5I20bli53cG xXdg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=L5mTg3IJOrwjhS+4cPOuegpeW4KFCpGlZhmjVpA3VdM=; b=vdSiFvx+YysfqDm8JEMaHwBk5Qm78ppPcZ2bUmKIKQpEikdlTevCAZEWjU9nblNuhk a55bxC2L0XATnHGX6cpZeB+fQIQjgWQgvppMSrP0v7FHIR1bCZkiL7CZlcxjZ+UkgPNV +OPNxd6EQYcPJF7qiZkF8i7XZU+dzDBbquC1cTrK/a0eiRpHrx7PUW5n5HbOF390XZoD l6WFuwHoqOT9Oq8uSjnrMZPSS8zU+jhELUNdZQWz6aMwcAhR6b5ucDgUWBEkOPMQfGJD PsQrZkV+CWHIjYukTWrYOe+O4cBZiXOJnAI4sF9lDrgYlrCWC7E4JWq26IG2P+CBPrKv hBMQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@brauner.io header.s=google header.b=TZqcIJFT; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f13si147203pga.385.2019.05.07.09.40.03; Tue, 07 May 2019 09:40:20 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@brauner.io header.s=google header.b=TZqcIJFT; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726853AbfEGQir (ORCPT + 99 others); Tue, 7 May 2019 12:38:47 -0400 Received: from mail-ed1-f67.google.com ([209.85.208.67]:45747 "EHLO mail-ed1-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726494AbfEGQir (ORCPT ); Tue, 7 May 2019 12:38:47 -0400 Received: by mail-ed1-f67.google.com with SMTP id g57so19304395edc.12 for ; Tue, 07 May 2019 09:38:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=brauner.io; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:content-transfer-encoding:in-reply-to :user-agent; bh=L5mTg3IJOrwjhS+4cPOuegpeW4KFCpGlZhmjVpA3VdM=; b=TZqcIJFTbUSo9Ua8ldXyHOWmYelZuh+ci+tG/uK+Auxsogv2R0GzFbnxM2ohTHrxkk BLJit9pSJmlGzSX7jOKNlYqSax5frQHIo5rZrO7H/spEV59QpPRFaHIdhGNOdsHChlKL PPGgWoaQB2X8a18GyvrnwTqKS77YLA0aTxpL4YziGX+jOGOwDy/oVxm7OSBhSv8lFf3M 81UM8L+ulbjik4XbCKiDwnkOUH16HJMKcjJ1HBS51KbV35uZw18+6R5tdQayAKcKAq5b kvVslTyy4hgs6RdBUSCb7yoc5aGfnIvnd5/JP4YILmOZM8Qg/ef3RgJunaMicjznP1u+ iuNg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:content-transfer-encoding :in-reply-to:user-agent; bh=L5mTg3IJOrwjhS+4cPOuegpeW4KFCpGlZhmjVpA3VdM=; b=mZWjrgs2PstLh9he4SqM1DeEpkBB7WUE4o/VgIgS7Se/R3hxDZqAfq0BSH72boLh5a C3E7ynTXJcvgIWQsn1MkeL6tGTCvVIPKcjxDrfYKrr6m/gMxYxlReYOxK4EJbfbQA/t7 7kmybOkrSafRkkhExWDACuh9Nyi0ESIGqemYG3qS913GkUnVZTSOXMDFb1rEXxXm64ED 4PpVcJuuhd924af/BFK+3aSdyitjZv1ShVr85UHCZjq7aIhmP8TpexJS5tonDKQt2BJy bEjS/u/4kxCj1GR6oCEOG/WBHHwTaXcWsokd7vRvwjPpqZYTCiJ/S+G7rlawhd1A9mjL ll/Q== X-Gm-Message-State: APjAAAWqi0mLNRtcEFsCsVkz9uEScQdP8gtflbPjPLZZkJ7JpIC3+eGG dPVI7lyCpkxhYMovTgxCFvjYPg== X-Received: by 2002:a17:906:7c0b:: with SMTP id t11mr4919748ejo.100.1557247124621; Tue, 07 May 2019 09:38:44 -0700 (PDT) Received: from brauner.io ([212.91.227.56]) by smtp.gmail.com with ESMTPSA id n21sm260895eju.63.2019.05.07.09.38.43 (version=TLS1_3 cipher=AEAD-AES256-GCM-SHA384 bits=256/256); Tue, 07 May 2019 09:38:44 -0700 (PDT) Date: Tue, 7 May 2019 18:38:42 +0200 From: Christian Brauner To: Suren Baghdasaryan Cc: Sultan Alsawaf , Greg Kroah-Hartman , "open list:ANDROID DRIVERS" , Daniel Colascione , Todd Kjos , Kees Cook , Peter Zijlstra , Martijn Coenen , LKML , Tim Murray , Michal Hocko , linux-mm , Arve =?utf-8?B?SGrDuG5uZXbDpWc=?= , Ingo Molnar , Steven Rostedt , Oleg Nesterov , Joel Fernandes , Andy Lutomirski , kernel-team Subject: Re: [RFC] simple_lmk: Introduce Simple Low Memory Killer for Android Message-ID: <20190507163841.45v4ym63ug2ni7pb@brauner.io> References: <20190319231020.tdcttojlbmx57gke@brauner.io> <20190320015249.GC129907@google.com> <20190507021622.GA27300@sultan-box.localdomain> <20190507070430.GA24150@kroah.com> <20190507072721.GA4364@sultan-box.localdomain> <20190507074334.GB26478@kroah.com> <20190507081236.GA1531@sultan-box.localdomain> <20190507105826.oi6vah6x5brt257h@brauner.io> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: User-Agent: NeoMutt/20180716 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, May 07, 2019 at 09:28:47AM -0700, Suren Baghdasaryan wrote: > From: Christian Brauner > Date: Tue, May 7, 2019 at 3:58 AM > To: Sultan Alsawaf > Cc: Greg Kroah-Hartman, open list:ANDROID DRIVERS, Daniel Colascione, > Todd Kjos, Kees Cook, Peter Zijlstra, Martijn Coenen, LKML, Tim > Murray, Michal Hocko, Suren Baghdasaryan, linux-mm, Arve Hjønnevåg, > Ingo Molnar, Steven Rostedt, Oleg Nesterov, Joel Fernandes, Andy > Lutomirski, kernel-team > > > On Tue, May 07, 2019 at 01:12:36AM -0700, Sultan Alsawaf wrote: > > > On Tue, May 07, 2019 at 09:43:34AM +0200, Greg Kroah-Hartman wrote: > > > > Given that any "new" android device that gets shipped "soon" should be > > > > using 4.9.y or newer, is this a real issue? > > > > > > It's certainly a real issue for those who can't buy brand new Android devices > > > without software bugs every six months :) > > > > > Hi Sultan, > Looks like you are posting this patch for devices that do not use > userspace LMKD solution due to them using older kernels or due to > their vendors sticking to in-kernel solution. If so, I see couple > logistical issues with this patch. I don't see it being adopted in > upstream kernel 5.x since it re-implements a deprecated mechanism even > though vendors still use it. Vendors on the other hand, will not adopt > it until you show evidence that it works way better than what > lowmemorykilled driver does now. You would have to provide measurable > data and explain your tests before they would consider spending time > on this. > On the implementation side I'm not convinced at all that this would > work better on all devices and in all circumstances. We had cases when > a new mechanism would show very good results until one usecase > completely broke it. Bulk killing of processes that you are doing in > your patch was a very good example of such a decision which later on > we had to rethink. That's why baking these policies into kernel is > very problematic. Another problem I see with the implementation that > it ties process killing with the reclaim scan depth. It's very similar > to how vmpressure works and vmpressure in my experience is very > unpredictable. > > > > > And if it is, I'm sure that asking for those patches to be backported to > > > > 4.4.y would be just fine, have you asked? > > > > > > > > Note that I know of Android Go devices, running 3.18.y kernels, do NOT > > > > use the in-kernel memory killer, but instead use the userspace solution > > > > today. So trying to get another in-kernel memory killer solution added > > > > anywhere seems quite odd. > > > > > > It's even more odd that although a userspace solution is touted as the proper > > > way to go on LKML, almost no Android OEMs are using it, and even in that commit > > > > That's probably because without proper kernel changes this is rather > > tricky to use safely (see below). > > > > > I linked in the previous message, Google made a rather large set of > > > modifications to the supposedly-defunct lowmemorykiller.c not one month ago. > > > What's going on? > > If you look into that commit, it adds ability to report kill stats. If > that was a change in how that driver works it would be rejected. > > > > > > > Qualcomm still uses lowmemorykiller.c [1] on the Snapdragon 845. If PSI were > > > backported to 4.4, or even 3.18, would it really be used? I don't really > > > understand the aversion to an in-kernel memory killer on LKML despite the rest > > > of the industry's attraction to it. Perhaps there's some inherently great cost > > > in using the userspace solution that I'm unaware of? > > Vendors are cautious about adopting userspace solution and it is a > process to address all concerns but we are getting there. > > > > Regardless, even if PSI were backported, a full-fledged LMKD using it has yet to > > > be made, so it wouldn't be of much use now. > > > > This is work that is ongoing and requires kernel changes to make it > > feasible. One of the things that I have been working on for quite a > > while is the whole file descriptor for processes thing that is important > > for LMKD (Even though I never thought about this use-case when I started > > pitching this.). Joel and Daniel have joined in and are working on > > making LMKD possible. > > What I find odd is that every couple of weeks different solutions to the > > low memory problem are pitched. There is simple_lkml, there is LMKD, and > > there was a patchset that wanted to speed up memory reclaim at process > > kill-time by adding a new flag to the new pidfd_send_signal() syscall. > > That all seems - though related - rather uncoordinated. > > I'm not sure why pidfd_wait and expedited reclaim is seen as > uncoordinated effort. All of them are done to improve userspace LMKD. If so that wasn't very obvious and there was some disagreement there as well whether this would be the right solution. In any case, the point is that LMKD seems to be the way forward and with all of the arguments brought forward here this patchset seems like it's going in the wrong direction. Christian > > > Now granted, > > coordinated is usually not how kernel development necessarily works but > > it would probably be good to have some sort of direction and from what I > > have seen LMKD seems to be the most coordinated effort. But that might > > just be my impression. > > > > Christian > > Thanks, > Suren.