Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp5208256imm; Tue, 31 Jul 2018 07:15:57 -0700 (PDT) X-Google-Smtp-Source: AAOMgpfXLXp8ODZXj2u1xj56mZw2yq3z7dU8+jt4oRc5sFMoC6CtwJvJpkyuT+ffLjmitj7aDw7p X-Received: by 2002:a63:3c4a:: with SMTP id i10-v6mr20621598pgn.415.1533046557166; Tue, 31 Jul 2018 07:15:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533046557; cv=none; d=google.com; s=arc-20160816; b=u0X8Ms3rhOac7n/KHXBFDBRy5eE+yzjY1s3/UtzZkqu9kmlCvqtHkkE4wqf8Ub2/qj hpcTCKyu9nigl0vheoBUsR21OZ5pT4uvDM5/sjub7i/aNtqTMxLTOwhHreau5M3Tpdy7 V44vAoPOwmQcI8BHQ2L9U3fbQiXAgnS73UpZ9ePp9iE8tVR3fhrLEDYOo1c5yylgHK7V 7/fSiTgx1sB0lYSPjlar02smfClRERAEQinELoZbew5hQo/91sSaDcBfubFqkb+qsa4d SNvdF1VLm2UvMf7l7oFNC6QCZk+rNNUpWKekY0SHxHOkNOI6mopQAVTdeUDQ7/hwE9T4 T3OQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:references:cc:to:from:subject:arc-authentication-results; bh=g7l+li8qe/nna3TIYa/wIEF07RbY1c0hdM0qgbMR90A=; b=GtJICbRdPf14eOjwYX19D7Z1NyXzQR7MGVJ6YDqaRnPaCcivTCEusBbOTKBbM55kIV XbUFy0iID+Y3s4t4rGE4hmnu3m9zytn7EP2ASU+mBIGzpVOnLejgild79Sy5U120K9rw grlMcl2F2x+qEH4VBmLFKEb6u1Qy8s52UONV8vcadUMGyAEmbo4TltDpiFdUffiHsAal bX6Bi145dhAVcViX+HnaaQg3UW5QoYEDviF0/EZsSpmazSe+yPotE/lJ38oNrsvzyv4v xGX1TD5nwgDt8UUSV4Re3BAuDLfsYLUSglzLtN67hHUEk2R1OMpId+qS2Ue6Fx3rdu+s T8ig== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t80-v6si14479066pfk.228.2018.07.31.07.15.39; Tue, 31 Jul 2018 07:15:57 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732339AbeGaPyj (ORCPT + 99 others); Tue, 31 Jul 2018 11:54:39 -0400 Received: from www262.sakura.ne.jp ([202.181.97.72]:23945 "EHLO www262.sakura.ne.jp" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730230AbeGaPyi (ORCPT ); Tue, 31 Jul 2018 11:54:38 -0400 Received: from fsav304.sakura.ne.jp (fsav304.sakura.ne.jp [153.120.85.135]) by www262.sakura.ne.jp (8.15.2/8.15.2) with ESMTP id w6VEE5Nw096683; Tue, 31 Jul 2018 23:14:05 +0900 (JST) (envelope-from penguin-kernel@i-love.sakura.ne.jp) Received: from www262.sakura.ne.jp (202.181.97.72) by fsav304.sakura.ne.jp (F-Secure/fsigk_smtp/530/fsav304.sakura.ne.jp); Tue, 31 Jul 2018 23:14:05 +0900 (JST) X-Virus-Status: clean(F-Secure/fsigk_smtp/530/fsav304.sakura.ne.jp) Received: from [192.168.1.8] (softbank126074194044.bbtec.net [126.74.194.44]) (authenticated bits=0) by www262.sakura.ne.jp (8.15.2/8.15.2) with ESMTPSA id w6VEE4Ni096677 (version=TLSv1.2 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Tue, 31 Jul 2018 23:14:05 +0900 (JST) (envelope-from penguin-kernel@i-love.sakura.ne.jp) Subject: Re: [PATCH v13 0/7] cgroup-aware OOM killer From: Tetsuo Handa To: Roman Gushchin , Andrew Morton Cc: David Rientjes , Michal Hocko , linux-mm@vger.kernel.org, Vladimir Davydov , Johannes Weiner , Tejun Heo , kernel-team@fb.com, cgroups@vger.kernel.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org References: <0d018c7e-a3de-a23a-3996-bed8b28b1e4a@i-love.sakura.ne.jp> <20180716220918.GA3898@castle.DHCP.thefacebook.com> <201807170055.w6H0tHn5075670@www262.sakura.ne.jp> Message-ID: Date: Tue, 31 Jul 2018 23:14:01 +0900 User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <201807170055.w6H0tHn5075670@www262.sakura.ne.jp> Content-Type: text/plain; charset=iso-2022-jp Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2018/07/17 9:55, Tetsuo Handa wrote: >> I don't get, why it's necessary to drop the cgroup oom killer to merge your fix? >> I'm happy to help with rebasing and everything else. > > Yes, I wish you rebase your series on top of OOM lockup (CVE-2016-10723) mitigation > patch ( https://marc.info/?l=linux-mm&m=153112243424285&w=4 ). It is a trivial change > and easy to cleanly backport (if applied before your series). > > Also, I expect you to check whether my cleanup patch which removes "abort" path > ( [PATCH 1/2] at https://marc.info/?l=linux-mm&m=153119509215026&w=4 ) helps > simplifying your series. I don't know detailed behavior of your series, but I > assume that your series do not kill threads which current thread should not wait > for MMF_OOM_SKIP. syzbot is hitting WARN(1) due to mem_cgroup_out_of_memory() == false. https://syzkaller.appspot.com/bug?id=ea8c7912757d253537375e981b61749b2da69258 I can't tell what change is triggering this race. Maybe removal of oom_lock from the oom reaper made more likely to hit. But anyway I suspect that static bool oom_kill_memcg_victim(struct oom_control *oc) { if (oc->chosen_memcg == NULL || oc->chosen_memcg == INFLIGHT_VICTIM) return oc->chosen_memcg; // <= This line is still broken because /* We have one or more terminating processes at this point. */ oc->chosen_task = INFLIGHT_VICTIM; is not called. Also, that patch is causing confusion by reviving schedule_timeout_killable(1) with oom_lock held. Can we temporarily drop cgroup-aware OOM killer from linux-next.git and apply my cleanup patch? Since the merge window is approaching, I really want to see how next -rc1 would look like...