Date: Tue, 9 Oct 2018 21:11:48 -0700 (PDT)
From: David Rientjes
To: Tetsuo Handa
Cc: syzbot, hannes@cmpxchg.org, mhocko@kernel.org, akpm@linux-foundation.org, guro@fb.com, kirill.shutemov@linux.intel.com, linux-kernel@vger.kernel.org, linux-mm@kvack.org, syzkaller-bugs@googlegroups.com, yang.s@alibaba-inc.com
Subject: Re: INFO: rcu detected stall in shmem_fault
In-Reply-To: <201810100012.w9A0Cjtn047782@www262.sakura.ne.jp>
References: <000000000000dc48d40577d4a587@google.com> <201810100012.w9A0Cjtn047782@www262.sakura.ne.jp>

On Wed, 10 Oct 2018, Tetsuo Handa wrote:

> syzbot is hitting RCU stall due to memcg-OOM event.
> https://syzkaller.appspot.com/bug?id=4ae3fff7fcf4c33a47c1192d2d62d2e03efffa64
>
> What should we do if memcg-OOM found no killable task because the allocating task
> was oom_score_adj == -1000 ?
> Flooding printk() until RCU stall watchdog fires
> (which seems to be caused by commit 3100dab2aa09dc6e ("mm: memcontrol: print proper
> OOM header when no eligible victim left") because syzbot was terminating the test
> upon WARN(1) removed by that commit) is not a good behavior.
>

Not printing anything would be the obvious solution, but the ideal solution
would probably involve

 - adding feedback to the memcg oom killer that there are no killable
   processes,
 - adding complete coverage for memcg_oom_recover() in all uncharge paths
   where the oom memcg's page_counter is decremented, and
 - having all processes stall until memcg_oom_recover() is called so
   looping back into try_charge() has a reasonable expectation to succeed.
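
To make the three points above concrete, here is a rough userspace toy
model in Python. It is not kernel code and not a proposed patch: ToyMemcg,
Task, and the "charged"/"stalled"/"killed" return values are hypothetical
names invented for this sketch, and only try_charge, uncharge, and
oom_recover loosely mirror the kernel function names mentioned above. It
only shows the intended control flow: the oom killer reports whether it
found a victim, every uncharge triggers recovery, and a task that cannot
make progress is parked instead of looping.

```python
from dataclasses import dataclass, field

OOM_SCORE_ADJ_MIN = -1000  # value at which a task is exempt from OOM kill


@dataclass(eq=False)  # compare tasks by identity, not by field values
class Task:
    name: str
    oom_score_adj: int = 0
    pages: int = 0

    @property
    def killable(self):
        return self.oom_score_adj != OOM_SCORE_ADJ_MIN


@dataclass
class ToyMemcg:
    """Hypothetical stand-in for a memcg with a hard limit (in pages)."""
    limit: int
    usage: int = 0
    tasks: list = field(default_factory=list)
    stalled: list = field(default_factory=list)

    def oom_recover(self):
        # Wake every parked task; the caller is expected to retry them.
        woken, self.stalled = self.stalled, []
        return woken

    def uncharge(self, task, pages):
        # Point 2: every path that lowers usage also runs recovery.
        task.pages -= pages
        self.usage -= pages
        return self.oom_recover()

    def oom_kill(self):
        # Point 1: return the victim, or None when nothing is killable.
        candidates = [t for t in self.tasks if t.killable and t.pages > 0]
        if not candidates:
            return None
        victim = max(candidates, key=lambda t: t.pages)
        self.tasks.remove(victim)
        self.uncharge(victim, victim.pages)
        return victim

    def try_charge(self, task, pages):
        while True:
            if self.usage + pages <= self.limit:
                self.usage += pages
                task.pages += pages
                return "charged"
            victim = self.oom_kill()
            if victim is None:
                # Point 3: park the task until oom_recover() runs,
                # rather than retrying (and printing) in a tight loop.
                if task not in self.stalled:
                    self.stalled.append(task)
                return "stalled"
            if victim is task:
                return "killed"
            # A victim was killed and memory was freed; loop and retry.
```

In this model a task with oom_score_adj == -1000 that fills the memcg by
itself gets "stalled" back from try_charge() and is only retried after
some uncharge() has made room, which is the behavior the list above is
asking for. How the real stall and wakeup would be implemented in the
kernel is left open here.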