Received: by 10.192.165.156 with SMTP id m28csp575601imm; Fri, 13 Apr 2018 04:22:07 -0700 (PDT) X-Google-Smtp-Source: AIpwx4+nj6UclRdZZ33Xt2ICb0aBS/uvykKkn9dgs+CSBmnsYCFWFBdC70m04hLl/bv6BDrXxLD8 X-Received: by 10.99.167.6 with SMTP id d6mr3716583pgf.287.1523618527713; Fri, 13 Apr 2018 04:22:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1523618527; cv=none; d=google.com; s=arc-20160816; b=elwk6aRoaRG4DEldUp1J+Hle7IRMCoh1nUoTqCfQp0oBEN2c6Gmej975Hbibc1oHKv PEefDJcG6ZlZfcZx3LvrjnVkeHs9gT3w5HM4a90UCow+L30vxwisiMfC/xCG+Dl/Z2SA xNldlS2AjToK8Nv5iSiOlOS5M+f02Hv8F07jNFmVvBIw3stRVB2402OOk67dBtTRRxub lKQehv6GCuzla4qIQNRlZYoT5CKUhL6UhjX/y9lZcfZ6cDLlM1f36V9tWiRmwONcB+7O zI0bu7HxZfsURtarq6o7+Bb/Ev/WxHYOwEZDBiHgpb1bGMo9LXx8fyXmP9xZMwD6cTtv fE5Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=ikC6mJkvWo/iqrFRNAWlTpoud/ACLP67zPAA3i8y46o=; b=e/K1p3zR3zsmhT2GYxrvFDo/F/6FEa+9L9K3Vq5pGuWjNJiNLgwSVUaATEeQF4ZtWm SsC+ZuQZXkKwK6CsPKWtQ0X60R1u2HYFFb7udtlKfXvPGXRnDr0pIsjOEsEQS3zwC8jb gCqIqsp4RkwSRTMrNprXkTiNtV6qaFVh8gvI0EsVbFZVDAIh7CvpmmcyP/3mGksxxHL9 t+OFZ4Iixoo2FTIhA87JUEzDc3236g6M3WZNjGw/AQYL27yFKYX1ILVJEERbSF8k4mdA 4TwkH5P77RogJyrBChs8Fb4RBApdAXM3gdG588Ux1VjRI7rehPatiWsKmejk6R5l+1tx zL3A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id n189si4372641pfn.356.2018.04.13.04.21.53; Fri, 13 Apr 2018 04:22:07 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754206AbeDMLUk (ORCPT + 99 others); Fri, 13 Apr 2018 07:20:40 -0400 Received: from mx2.suse.de ([195.135.220.15]:38622 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750960AbeDMLUi (ORCPT ); Fri, 13 Apr 2018 07:20:38 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (charybdis-ext.suse.de [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id 38A8FAC3B; Fri, 13 Apr 2018 11:20:37 +0000 (UTC) Date: Fri, 13 Apr 2018 13:20:36 +0200 From: Michal Hocko To: Kirill Tkhai Cc: akpm@linux-foundation.org, hannes@cmpxchg.org, vdavydov.dev@gmail.com, cgroups@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH] memcg: Remove memcg_cgroup::id from IDR on mem_cgroup_css_alloc() failure Message-ID: <20180413112036.GH17484@dhcp22.suse.cz> References: <152354470916.22460.14397070748001974638.stgit@localhost.localdomain> <20180413085553.GF17484@dhcp22.suse.cz> <20180413110200.GG17484@dhcp22.suse.cz> <06931a83-91d2-3dcf-31cf-0b98d82e957f@virtuozzo.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <06931a83-91d2-3dcf-31cf-0b98d82e957f@virtuozzo.com> User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri 13-04-18 14:06:40, Kirill Tkhai wrote: > On 13.04.2018 14:02, Michal Hocko wrote: > > On Fri 13-04-18 12:35:22, Kirill Tkhai wrote: > >> On 13.04.2018 11:55, Michal Hocko wrote: > >>> On Thu 12-04-18 17:52:04, Kirill Tkhai wrote: > >>> [...] > >>>> @@ -4471,6 +4477,7 @@ mem_cgroup_css_alloc(struct cgroup_subsys_state *parent_css) > >>>> > >>>> return &memcg->css; > >>>> fail: > >>>> + mem_cgroup_id_remove(memcg); > >>>> mem_cgroup_free(memcg); > >>>> return ERR_PTR(-ENOMEM); > >>>> } > >>> > >>> The only path which jumps to fail: here (in the current mmotm tree) is > >>> error = memcg_online_kmem(memcg); > >>> if (error) > >>> goto fail; > >>> > >>> AFAICS and the only failure path in memcg_online_kmem > >>> memcg_id = memcg_alloc_cache_id(); > >>> if (memcg_id < 0) > >>> return memcg_id; > >>> > >>> I am not entirely clear on memcg_alloc_cache_id but it seems we do clean > >>> up properly. Or am I missing something? > >> > >> memcg_alloc_cache_id() may allocate a lot of memory, in case of the system reached > >> memcg_nr_cache_ids cgroups. In this case it iterates over all LRU lists, and double > >> size of every of them. In case of memory pressure it can fail. If this occurs, > >> mem_cgroup::id is not unhashed from IDR and we leak this id. > > > > OK, my bad I was looking at the bad code path. So you want to clean up > > after mem_cgroup_alloc not memcg_online_kmem. Now it makes much more > > sense. Sorry for the confusion on my end. > > > > Anyway, shouldn't we do the thing in mem_cgroup_free() to be symmetric > > to mem_cgroup_alloc? > > We can't, since it's called from mem_cgroup_css_free(), which doesn't have a deal > with idr freeing. All the asymmetry, we see, is because of the trick to unhash ID > earlier, then from mem_cgroup_css_free(). Are you sure. It's been some time since I've looked at the quite complex cgroup tear down code but from what I remember, css_free is called on the css release (aka when the reference count drops to zero). mem_cgroup_id_put_many seems to unpin the css reference so we should have idr_remove by the time when css_free is called. Or am I still wrong and should go over the brain hurting cgroup removal code again? -- Michal Hocko SUSE Labs