Received: by 2002:a05:6a10:6006:0:0:0:0 with SMTP id w6csp62063pxa; Wed, 26 Aug 2020 04:59:03 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxdtwMb1SLsx1qL6Ut8voSOiWUBKaziqs5t21wiALpmM3LeZHAt3Zv6o26oJZyKAX0A0fH7 X-Received: by 2002:a17:906:e87:: with SMTP id p7mr14665274ejf.547.1598443143113; Wed, 26 Aug 2020 04:59:03 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1598443143; cv=none; d=google.com; s=arc-20160816; b=QxSUZXUGmbJatzl2D7CUp1kDsrxj4fWzGFzGUSyFW7rO/eP4kLFld4su2+gRe+C0ko AVyTYnTMkxsAHoDMY+eiPn8myUdWoOyMZ16CEQ6rBZWC1mW2fxFRYXJUPP7I4Uuxn7Dt 7DN76JBEQsWwn/bCtgrHsvcuFYSsiJJFSMS30ISJRBJ4VCecpzoGTqLA4XVGkg8xN8nl ZoaEnVDTahyvA+my4zGMFsk7NG49BqSANB23R9b0qal8N2NgdKlK4C4NK5UzhmhUSpjM /+Budk7VABTKfuGMQTCAmcvIQuHt/9ZSL1iZ4xKPGj3H/iELcWDGdTqYhgdgg5FJgjGy xLQA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:references:cc:to:from :subject:reply-to; bh=6nWi5sOZx1/TrSkicXjcBR/B1FYQQBk/JLZ5scLKlWA=; b=Z9YVq4fXZWFSrIopwt+MJqY+YwVtoN0XNcdhz+0c9YCJL1quTbI913KwTpYGPFgJxh taGxcN9yBifKPsIQbxNrsM2AFJapu0vp/+MfITyczj/QHYZSdWVQvaoo2UPkN1iu+qZU MC3xRK+mtVsIstymi0lp37TIFZB78H3QAKEe8Kf7s9hdcniJYpEaoexZ4kOjzEGDYB0O X8smfqHRV7ZokZ3H4LZODjOaDLCRPwExOAur/AtVAi66jtCXsQqTx9vTGRx5Xd8/zqqt PmrMyPEf3e5Du3qdkrsZSHB2seeDuTnj5cJHv4rbvVsctbS6flHddsSHC6tKJ8boSZW9 +40g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id ch15si1407878ejb.539.2020.08.26.04.58.39; Wed, 26 Aug 2020 04:59:03 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729279AbgHZLzB (ORCPT + 99 others); Wed, 26 Aug 2020 07:55:01 -0400 Received: from out30-54.freemail.mail.aliyun.com ([115.124.30.54]:41029 "EHLO out30-54.freemail.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729195AbgHZLye (ORCPT ); Wed, 26 Aug 2020 07:54:34 -0400 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R431e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01f04427;MF=xlpang@linux.alibaba.com;NM=1;PH=DS;RN=6;SR=0;TI=SMTPD_---0U6vV5Ro_1598442870; Received: from xunleideMacBook-Pro.local(mailfrom:xlpang@linux.alibaba.com fp:SMTPD_---0U6vV5Ro_1598442870) by smtp.aliyun-inc.com(127.0.0.1); Wed, 26 Aug 2020 19:54:31 +0800 Reply-To: xlpang@linux.alibaba.com Subject: Re: [PATCH] mm: memcg: Fix memcg reclaim soft lockup From: Xunlei Pang To: Michal Hocko Cc: Johannes Weiner , Andrew Morton , Vladimir Davydov , linux-kernel@vger.kernel.org, linux-mm@kvack.org References: <1598426822-93737-1-git-send-email-xlpang@linux.alibaba.com> <20200826081102.GM22869@dhcp22.suse.cz> <99efed0e-050a-e313-46ab-8fe6228839d5@linux.alibaba.com> <20200826110015.GO22869@dhcp22.suse.cz> <5b22890d-190f-be1d-3be8-995765dbb957@linux.alibaba.com> Message-ID: <0cf41b39-f039-7e53-ea80-9d5d0c784e73@linux.alibaba.com> Date: Wed, 26 Aug 2020 19:54:30 +0800 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:68.0) Gecko/20100101 Thunderbird/68.11.0 MIME-Version: 1.0 In-Reply-To: <5b22890d-190f-be1d-3be8-995765dbb957@linux.alibaba.com> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020/8/26 下午7:45, xunlei wrote: > On 2020/8/26 下午7:00, Michal Hocko wrote: >> On Wed 26-08-20 18:41:18, xunlei wrote: >>> On 2020/8/26 下午4:11, Michal Hocko wrote: >>>> On Wed 26-08-20 15:27:02, Xunlei Pang wrote: >>>>> We've met softlockup with "CONFIG_PREEMPT_NONE=y", when >>>>> the target memcg doesn't have any reclaimable memory. >>>> >>>> Do you have any scenario when this happens or is this some sort of a >>>> test case? >>> >>> It can happen on tiny guest scenarios. >> >> OK, you made me more curious. If this is a tiny guest and this is a hard >> limit reclaim path then we should trigger an oom killer which should >> kill the offender and that in turn bail out from the try_charge lopp >> (see should_force_charge). So how come this repeats enough in your setup >> that it causes soft lockups? >> > > oom_status = mem_cgroup_oom(mem_over_limit, gfp_mask, > get_order(nr_pages * PAGE_SIZE)); > switch (oom_status) { > case OOM_SUCCESS: > nr_retries = MAX_RECLAIM_RETRIES; Actually we can add "cond_resched()" here, but I think it's better to have one at the memcg reclaim path to avoid other unexpected issues. > goto retry; > > It retries here endlessly, because oom reaper has no cpu to schedule. >