Received: by 2002:a05:6a10:5bc5:0:0:0:0 with SMTP id os5csp162190pxb; Mon, 18 Oct 2021 23:33:26 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxBZMM06m2CeYtjezxehgsHEKsvMJ5C5IjkY+MQgIZjsYyrWQeool+1WbAe5chwFEbd6jYJ X-Received: by 2002:aa7:ccc1:: with SMTP id y1mr52489694edt.177.1634625206577; Mon, 18 Oct 2021 23:33:26 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1634625206; cv=none; d=google.com; s=arc-20160816; b=AOaJ0ia8gyJl31c2yW0HgIVrmef7l2XFw+unTgk9PbI2Iumoe9KGsXK8YtvYWypFJl ZzIp6ck93C2Ipl6Lj+nw9aAZq165WtlNTFS/zn0KL7Prdb9hjgyBI4l7ZXD56XXd8pWO WJfKZ5psmNWo58DtHL7Juac67RoSl+Hh8UdLMvRVJqJD6jSXbGILiiFxiUHJeIrhHkKd HGN1UvpO8c+6BU+ll4tk1nHL7qGQ9XEn1vxA2S3O+3HkflZhArwa5atqKOfEi0wqjyRi dGFe+DIL5e6OfN15pKI/P0yL33vKOEdOHuzW7pVxgUsojk+tgFio/v73mZUqQ3mGcBlk E4rQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:dkim-signature; bh=tI9gj9p8cg4CnX2QaPSuoyzqKweCoJS55yHjYE7xcKM=; b=Fmqju/pquI8JmWnJt89JHxbImS4IIpUs8eRSYEe11CWWVUcryJbnqKXHgAc1K9r/OX ePo7yJ8sia3aHRwW21TOxqpr5Rd9u2JI5yXezIA8nztc/CWRtfq85dnOIoBwnHRNpLo0 P8486T4ApN6zHMJ+niVYgJ/8HnWheimlQO9D2xAlRo5OVCYyimb2nIK4eLt2NBCCrISk SqW/8kKS+gNVSQACCdn9xwEk7IrAz4T5V4VVBcET0OkpFev8CT/UMWkfGiUOTbgxzmZg y+9QG+SYgELvbOl1eU9impuij9HXJbeW4fmwZRZAjwuM9l1AQDQffw3+Yv+EDRuji+qa M4cg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@virtuozzo.com header.s=relay header.b=is7WHURE; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=virtuozzo.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id mp5si20808380ejc.271.2021.10.18.23.33.02; Mon, 18 Oct 2021 23:33:26 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@virtuozzo.com header.s=relay header.b=is7WHURE; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=virtuozzo.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234213AbhJSGc7 (ORCPT + 99 others); Tue, 19 Oct 2021 02:32:59 -0400 Received: from relay.sw.ru ([185.231.240.75]:36512 "EHLO relay.sw.ru" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229755AbhJSGc5 (ORCPT ); Tue, 19 Oct 2021 02:32:57 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=virtuozzo.com; s=relay; h=Content-Type:MIME-Version:Date:Message-ID:From: Subject; bh=tI9gj9p8cg4CnX2QaPSuoyzqKweCoJS55yHjYE7xcKM=; b=is7WHUREKxBUK1eox a3i8e7lEICkLoC40q5DdZ9O1639miZT0QpjLzmTbey1KH61kzqfqNE46f3xVo3z+vEfv84cDtb3BC 5uvyqf7GIL1gRY7JUEs8Zy0etcbnAGPB4cxnlKeeFvMMW0HehqKkIbzMNocCK8KtCM4Z+5ti9g7d8 =; Received: from [172.29.1.17] by relay.sw.ru with esmtp (Exim 4.94.2) (envelope-from ) id 1mcie0-006Qqh-4l; Tue, 19 Oct 2021 09:30:40 +0300 Subject: Re: [PATCH memcg 0/1] false global OOM triggered by memcg-limited task To: Michal Hocko Cc: Johannes Weiner , Vladimir Davydov , Andrew Morton , Roman Gushchin , Uladzislau Rezki , Vlastimil Babka , Shakeel Butt , Mel Gorman , cgroups@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, kernel@openvz.org References: <9d10df01-0127-fb40-81c3-cc53c9733c3e@virtuozzo.com> <6b751abe-aa52-d1d8-2631-ec471975cc3a@virtuozzo.com> From: Vasily Averin Message-ID: <339ae4b5-6efd-8fc2-33f1-2eb3aee71cb2@virtuozzo.com> Date: Tue, 19 Oct 2021 09:30:18 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.13.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 18.10.2021 14:53, Michal Hocko wrote: > On Mon 18-10-21 13:05:35, Vasily Averin wrote: >> On 18.10.2021 12:04, Michal Hocko wrote: >>> On Mon 18-10-21 11:13:52, Vasily Averin wrote: >>> [...] >>>> How could this happen? >>>> >>>> User-space task inside the memcg-limited container generated a page fault, >>>> its handler do_user_addr_fault() called handle_mm_fault which could not >>>> allocate the page due to exceeding the memcg limit and returned VM_FAULT_OOM. >>>> Then do_user_addr_fault() called pagefault_out_of_memory() which executed >>>> out_of_memory() without set of memcg. >>>> >>>> Partially this problem depends on one of my recent patches, disabled unlimited >>>> memory allocation for dying tasks. However I think the problem can happen >>>> on non-killed tasks too, for example because of kmem limit. >>> >>> Could you be more specific on how this can happen without your patch? I >>> have to say I haven't realized this side effect when discussing it. >> If required I can try to search how try_charge_memcg() can reject page allocation >> of non-dying task too. > > Yes. Now I think that such failure was very unlikely (w/o my patch and kmem limit). I cannot exclude it completely, because I did not finished this review and perhaps I missed something, but I checked most part of code and found nothing. With my patch ("memcg: prohibit unconditional exceeding the limit of dying tasks") try_charge_memcg() can fail: a) due to fatal signal b) when mem_cgroup_oom -> mem_cgroup_out_of_memory -> out_of_memory() returns false (when select_bad_process() found nothing) To handle a) we can follow to your suggestion and skip excution of out_of_memory() in pagefault_out_of memory() To handle b) we can go to retry: if mem_cgroup_oom() return OOM_FAILED. However all these cases can be successfully handled by my new patch "memcg: prevent false global OOM triggered by memcg limited task" and I think it is better solution. Thank you, Vasily Averin