Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp1774093imu; Thu, 10 Jan 2019 02:49:57 -0800 (PST) X-Google-Smtp-Source: ALg8bN5bxTYX5Ygp+LWVDA9SFBqXkMc89O+RNhrXZG2JnJq3oLrQmMEESEouSUzjWL3JM4G/LdTU X-Received: by 2002:a17:902:7588:: with SMTP id j8mr9962083pll.215.1547117397027; Thu, 10 Jan 2019 02:49:57 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1547117396; cv=none; d=google.com; s=arc-20160816; b=qW7zqectT/T2hDWL4qzOP+0xXlgJK/dmAaBgPoLbLXtK+CsrGZoAl8ehBrpImIgM03 cLHCvKgr59EV/6udkNyCXN3YlSXHo1UWS+TtEDCGPHT6dae0T8L2SXycVgOEQnaCYVQz vSX1/npiSDcQk3YSqUKGOjvISVaYf/FyZDui8vd2+/Xx7ZNp8uJdWWLpu7XnZA8htkJd CHTo7K+Vniwz4ggWpeqXRknP3zFCwTFP2Wkj58noBvaIRfbWO5fl1T7ZqJ93z/eBpNVJ mS1tTZ18gCWtQnodV3dsArCjccZLnHLHfQdy0gpEFzA7MR5yO/qNlAA/7ABXMS/AdjJ2 RTFg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:references:cc:to:from:subject; bh=sKMNj0BsqEbTzBlCYQG0FZTr7b4JbjIlQ5rtarnp7cE=; b=FdgnNcTzMQqbMB9BBF8sQl1yOIQ420K+gzr1V3q569EyOeXDYohLMmtYQ/uRYHArVw 6P85cHOnBbiv9U6xbKSrhHWuKHuL/nUC7rkwtjbpmY3nnqnXs4WPxV1eYVAG8tciwUvU C4foDHQsWZQAiwHWxPZ4dx5H9wdI7DtukhpX5o5BWgaqkZLT2/NzWZ9VzU+lCRLsLiiA RnqQKqgFx2i1evsH5n5W6Yl2SkvAgqQH1uv0hvqOWTBHeT3KbTWJp8XKdCubfkE8+Cxf e0FdjM2Qp0KQ8vtglsbXPCifnzqfChRLxmUjHYytalFgrHMNbAJH9eY0YbeDksJoUm7/ zlxg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=virtuozzo.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id bc7si5273641plb.120.2019.01.10.02.49.42; Thu, 10 Jan 2019 02:49:56 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=virtuozzo.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727841AbfAJJWP (ORCPT + 99 others); Thu, 10 Jan 2019 04:22:15 -0500 Received: from relay.sw.ru ([185.231.240.75]:54628 "EHLO relay.sw.ru" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726255AbfAJJWP (ORCPT ); Thu, 10 Jan 2019 04:22:15 -0500 Received: from [172.16.25.169] by relay.sw.ru with esmtp (Exim 4.91) (envelope-from ) id 1ghWXR-0002yv-T5; Thu, 10 Jan 2019 12:22:10 +0300 Subject: Re: [PATCH v2] netfilter: account ebt_table_info to kmemcg From: Kirill Tkhai To: Shakeel Butt , Michal Hocko , Andrew Morton , Florian Westphal Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, syzbot+7713f3aa67be76b1552c@syzkaller.appspotmail.com, Pablo Neira Ayuso , Jozsef Kadlecsik , Roopa Prabhu , Nikolay Aleksandrov , netfilter-devel@vger.kernel.org, coreteam@netfilter.org, bridge@lists.linux-foundation.org References: <20190103031431.247970-1-shakeelb@google.com> <5cc8efad-9d3d-3136-3ddc-1f8a640cb1f8@virtuozzo.com> Message-ID: <2d8f28cb-8620-be05-21bc-dcf3009b2774@virtuozzo.com> Date: Thu, 10 Jan 2019 12:22:09 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.3.1 MIME-Version: 1.0 In-Reply-To: <5cc8efad-9d3d-3136-3ddc-1f8a640cb1f8@virtuozzo.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 06.01.2019 14:00, Kirill Tkhai wrote: > On 03.01.2019 06:14, Shakeel Butt wrote: >> The [ip,ip6,arp]_tables use x_tables_info internally and the underlying >> memory is already accounted to kmemcg. Do the same for ebtables. The >> syzbot, by using setsockopt(EBT_SO_SET_ENTRIES), was able to OOM the >> whole system from a restricted memcg, a potential DoS. >> >> By accounting the ebt_table_info, the memory used for ebt_table_info can >> be contained within the memcg of the allocating process. However the >> lifetime of ebt_table_info is independent of the allocating process and >> is tied to the network namespace. So, the oom-killer will not be able to >> relieve the memory pressure due to ebt_table_info memory. The memory for >> ebt_table_info is allocated through vmalloc. Currently vmalloc does not >> handle the oom-killed allocating process correctly and one large >> allocation can bypass memcg limit enforcement. So, with this patch, >> at least the small allocations will be contained. For large allocations, >> we need to fix vmalloc. >> >> Reported-by: syzbot+7713f3aa67be76b1552c@syzkaller.appspotmail.com >> Signed-off-by: Shakeel Butt >> Cc: Florian Westphal >> Cc: Michal Hocko >> Cc: Kirill Tkhai >> Cc: Pablo Neira Ayuso >> Cc: Jozsef Kadlecsik >> Cc: Roopa Prabhu >> Cc: Nikolay Aleksandrov >> Cc: Andrew Morton >> Cc: Linux MM >> Cc: netfilter-devel@vger.kernel.org >> Cc: coreteam@netfilter.org >> Cc: bridge@lists.linux-foundation.org >> Cc: LKML >> --- >> Changelog since v1: >> - More descriptive commit message. > > Reviewed-by: Kirill Tkhai > >> >> net/bridge/netfilter/ebtables.c | 6 ++++-- >> 1 file changed, 4 insertions(+), 2 deletions(-) >> >> diff --git a/net/bridge/netfilter/ebtables.c b/net/bridge/netfilter/ebtables.c >> index 491828713e0b..5e55cef0cec3 100644 >> --- a/net/bridge/netfilter/ebtables.c >> +++ b/net/bridge/netfilter/ebtables.c >> @@ -1137,14 +1137,16 @@ static int do_replace(struct net *net, const void __user *user, >> tmp.name[sizeof(tmp.name) - 1] = 0; >> >> countersize = COUNTER_OFFSET(tmp.nentries) * nr_cpu_ids; >> - newinfo = vmalloc(sizeof(*newinfo) + countersize); >> + newinfo = __vmalloc(sizeof(*newinfo) + countersize, GFP_KERNEL_ACCOUNT, >> + PAGE_KERNEL); Do we need GFP_HIGHMEM here? >> if (!newinfo) >> return -ENOMEM; >> >> if (countersize) >> memset(newinfo->counters, 0, countersize); >> >> - newinfo->entries = vmalloc(tmp.entries_size); >> + newinfo->entries = __vmalloc(tmp.entries_size, GFP_KERNEL_ACCOUNT, >> + PAGE_KERNEL); >> if (!newinfo->entries) { >> ret = -ENOMEM; >> goto free_newinfo; >> >