Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp540377imm; Wed, 18 Jul 2018 06:39:33 -0700 (PDT) X-Google-Smtp-Source: AAOMgpfnjHNunkq2pDnz/E9wR6JMVUeeokMU/eq/ESZ901suIreMJ2eN4Tc3lEtMygAwfzRAZzth X-Received: by 2002:a63:ea4e:: with SMTP id l14-v6mr5633146pgk.94.1531921173413; Wed, 18 Jul 2018 06:39:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1531921173; cv=none; d=google.com; s=arc-20160816; b=inqDEzIlVaJfCtwqbLjsKX05WkoHvVa+uqatnJ0rKSo2N7al/D6awMi5G4Sv+57jvd 8is4x4s2OauI7VWkhWFa7RD6I4VK/cYCBTauBSbeojfsqiO9yOk11LJoPm/vWvNd914r gZ8JdSjfqrJqQnTgFAavzwRp2rMdozv8ZHXbK653sBVZ+7yWNvWpra6AcWY/ZTvhp/Ce GkMr1jocVxfKBuncLYm5wpPuNEx837qznb9kBY8iRG5qTQpUX83jFe4KjLEIYHQm6P7S MahRZOn0IZczthM/ebA+LFCRvwxxrqAaPsBYup/54QsYRhyuUrIKNPP4zlIA6AeXOIaK 71ow== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=Ewy++WSOli85FuNtMU7CN8o0pVGjeE4iaIrVq4j/2F8=; b=l0Cmr04dnp4LSlWEY4o7Q2ZlOirG9IZH8qlwP49bnMqP6lA/92aXt1PaxaG16GnnoX qUdP4lEm8pgM0qIp+mKOkou0K3QcNFNIxCk5UPQyvw8xCm7bNjm9KMdpW/iZt5KIpuL1 Agzq4J4mAS3HCqFp8AzKf2Kb2IAfgwCn4hNln6embV2TlOMH3jrIcQnYnmZP54oh43Wr 5kvzGFH/gWJ8ocfHB/f3YF3y/nFNJ0jtOutSaxFKJmJT8BqiQZ+TlgnCTUnXEF+kMa8H kQPY1HMvRppis54qy6qrZTc2+Jb4cBqggbBYkLa7Qsck4BgQbcuqC0a7ETq+oJPxvkyJ 5X8w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w13-v6si2965128plp.51.2018.07.18.06.39.18; Wed, 18 Jul 2018 06:39:33 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1731524AbeGROO1 (ORCPT + 99 others); Wed, 18 Jul 2018 10:14:27 -0400 Received: from www262.sakura.ne.jp ([202.181.97.72]:11322 "EHLO www262.sakura.ne.jp" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730234AbeGROO1 (ORCPT ); Wed, 18 Jul 2018 10:14:27 -0400 Received: from fsav301.sakura.ne.jp (fsav301.sakura.ne.jp [153.120.85.132]) by www262.sakura.ne.jp (8.15.2/8.15.2) with ESMTP id w6IDZdD1028800; Wed, 18 Jul 2018 22:35:39 +0900 (JST) (envelope-from penguin-kernel@i-love.sakura.ne.jp) Received: from www262.sakura.ne.jp (202.181.97.72) by fsav301.sakura.ne.jp (F-Secure/fsigk_smtp/530/fsav301.sakura.ne.jp); Wed, 18 Jul 2018 22:35:39 +0900 (JST) X-Virus-Status: clean(F-Secure/fsigk_smtp/530/fsav301.sakura.ne.jp) Received: from [192.168.1.8] (softbank126074194044.bbtec.net [126.74.194.44]) (authenticated bits=0) by www262.sakura.ne.jp (8.15.2/8.15.2) with ESMTPSA id w6IDZdP7028793 (version=TLSv1.2 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Wed, 18 Jul 2018 22:35:39 +0900 (JST) (envelope-from penguin-kernel@i-love.sakura.ne.jp) Subject: Re: INFO: task hung in grab_super To: Dmitry Vyukov Cc: Eric Van Hensbergen , Ron Minnich , Latchesar Ionkov , v9fs-developer@lists.sourceforge.net, syzbot , linux-fsdevel , LKML , syzkaller-bugs , Al Viro References: <0000000000002f5541057143a85e@google.com> <0adc592b-d4a3-f6da-3c5c-22490f641eb9@i-love.sakura.ne.jp> From: Tetsuo Handa Message-ID: Date: Wed, 18 Jul 2018 22:35:35 +0900 User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2018/07/18 22:04, Dmitry Vyukov wrote: > On Wed, Jul 18, 2018 at 2:53 PM, Tetsuo Handa > wrote: >> On 2018/07/18 20:41, Dmitry Vyukov wrote: >>> This seems to be related to 9p. After rerunning the log I got: >>> >>> root@syzkaller:~# ps afxu | grep syz >>> root 18253 0.0 0.0 0 0 ttyS0 Zl 10:16 0:00 \_ >>> [syz-executor] >>> root@syzkaller:~# cat /proc/18253/task/*/stack >>> [<0>] p9_client_rpc+0x3a2/0x1400 >>> [<0>] p9_client_flush+0x134/0x2a0 >>> [<0>] p9_client_rpc+0x122c/0x1400 >>> [<0>] p9_client_create+0xc56/0x16af >>> [<0>] v9fs_session_init+0x21a/0x1a80 >>> [<0>] v9fs_mount+0x7c/0x900 >>> [<0>] mount_fs+0xae/0x328 >>> [<0>] vfs_kern_mount.part.34+0xdc/0x4e0 >>> [<0>] do_mount+0x581/0x30e0 >>> [<0>] ksys_mount+0x12d/0x140 >>> [<0>] __x64_sys_mount+0xbe/0x150 >>> [<0>] do_syscall_64+0x1b9/0x820 >>> [<0>] entry_SYSCALL_64_after_hwframe+0x49/0xbe >>> [<0>] 0xffffffffffffffff >>> >>> There is a bunch of hangs in 9p, so let's do: >>> >>> #syz dup: INFO: task hung in flush_work >>> >> Then, is dumping all threads when khungtaskd fires a candidate >> for CONFIG_DEBUG_AID_FOR_SYZBOT=y path? > > Perhaps would be useful. But maybe only tasks that are blocked for > more than timeout/2? and/or unkillable tasks? killable tasks are not a > problem. TASK_KILLABLE waiters are not reported by khungtaskd, are they? /* use "==" to skip the TASK_KILLABLE tasks waiting on NFS */ if (t->state == TASK_UNINTERRUPTIBLE) check_hung_task(t, timeout); And TASK_KILLABLE waiters can become a problem because > > Btw, I see that p9_client_rpc uses wait_event_killable, why wasn't it > killed along with the whole process? > wait_event_killable() would return -ERESTARTSYS if got SIGKILL. But if (c->status == Connected) && (type == P9_TFLUSH) is also true, it ignores SIGKILL by retrying the loop... again: err = wait_event_killable(*req->wq, req->status >= REQ_STATUS_RCVD); if ((err == -ERESTARTSYS) && (c->status == Connected) && (type == P9_TFLUSH)) { sigpending = 1; clear_thread_flag(TIF_SIGPENDING); goto again; } I wish they don't ignore SIGKILL (by e.g. offloading operations to a kernel thread).