Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751612AbdCMNTl (ORCPT ); Mon, 13 Mar 2017 09:19:41 -0400 Received: from mx2.suse.de ([195.135.220.15]:33915 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750772AbdCMNTc (ORCPT ); Mon, 13 Mar 2017 09:19:32 -0400 Date: Mon, 13 Mar 2017 14:19:25 +0100 From: Michal Hocko To: Vitaly Kuznetsov Cc: Igor Mammedov , Heiko Carstens , linux-mm@kvack.org, Andrew Morton , Greg KH , "K. Y. Srinivasan" , David Rientjes , Daniel Kiper , linux-api@vger.kernel.org, LKML , linux-s390@vger.kernel.org, xen-devel@lists.xenproject.org, linux-acpi@vger.kernel.org, qiuxishi@huawei.com, toshi.kani@hpe.com, xieyisheng1@huawei.com, slaoub@gmail.com, iamjoonsoo.kim@lge.com, vbabka@suse.cz Subject: Re: [RFC PATCH] mm, hotplug: get rid of auto_online_blocks Message-ID: <20170313131924.GP31518@dhcp22.suse.cz> References: <20170302142816.GK1404@dhcp22.suse.cz> <20170302180315.78975d4b@nial.brq.redhat.com> <20170303082723.GB31499@dhcp22.suse.cz> <20170303183422.6358ee8f@nial.brq.redhat.com> <20170306145417.GG27953@dhcp22.suse.cz> <20170307134004.58343e14@nial.brq.redhat.com> <20170309125400.GI11592@dhcp22.suse.cz> <20170313115554.41d16b1f@nial.brq.redhat.com> <20170313122825.GO31518@dhcp22.suse.cz> <87a88pgwv0.fsf@vitty.brq.redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <87a88pgwv0.fsf@vitty.brq.redhat.com> User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3074 Lines: 64 On Mon 13-03-17 13:54:59, Vitaly Kuznetsov wrote: > Michal Hocko writes: > > > On Mon 13-03-17 11:55:54, Igor Mammedov wrote: > >> > > > >> > > - suggested RFC is not acceptable from virt point of view > >> > > as it regresses guests on top of x86 kvm/vmware which > >> > > both use ACPI based memory hotplug. > >> > > > >> > > - udev/userspace solution doesn't work in practice as it's > >> > > too slow and unreliable when system is under load which > >> > > is quite common in virt usecase. That's why auto online > >> > > has been introduced in the first place. > >> > > >> > Please try to be more specific why "too slow" is a problem. Also how > >> > much slower are we talking about? > >> > >> In virt case on host with lots VMs, userspace handler > >> processing could be scheduled late enough to trigger a race > >> between (guest memory going away/OOM handler) and memory > >> coming online. > > > > Either you are mixing two things together or this doesn't really make > > much sense. So is this a balloning based on memory hotplug (aka active > > memory hotadd initiated between guest and host automatically) or a guest > > asking for additional memory by other means (pay more for memory etc.)? > > Because if this is an administrative operation then I seriously question > > this reasoning. > > I'm probably repeating myself but it seems this point was lost: > > This is not really a 'ballooning', it is just a pure memory > hotplug. People may have any tools monitoring their VM memory usage and > when a VM is running low on memory they may want to hotplug more memory > to it. What is the API those guests ask for the memory? And who is actually responsible to ask for that memory? Is it a kernel or userspace solution? > With udev-style memory onlining they should be aware of page > tables and other in-kernel structures which require allocation so they > need to add memory slowly and gradually or they risk running into OOM > (at least getting some processes killed and these processes may be > important). With in-kernel memory hotplug everything happens > synchronously and no 'slowly and gradually' algorithm is required in > all tools which may trigger memory hotplug. What prevents those APIs being used reasonably and only asks so much memory as they can afford? I mean 1.5% available memory necessary for the hotplug is not all that much. Or more precisely what prevents to ask for this additional memory in a synchronous way? And just to prevent from a further solution, I can see why _in-kernel_ hotplug based 'ballooning (or whatever you call an on demand memory hotplug based on the memory pressure)' want's to be synchronous and that is why my patch changed those onlined the memory explicitly. I am questioning memory hotplug requested by admin/user space component to need any special from kernel assistance becuase it is only a shortcut which can be implemented from the userspace. I hope I've made myself clear finally. -- Michal Hocko SUSE Labs