Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp583377imm; Wed, 18 Jul 2018 07:20:24 -0700 (PDT) X-Google-Smtp-Source: AAOMgpeYCJy7LPvkv6Jq+B6gZm80nlGJhkn0ISF/mV92+PIatYKtZ/Inalyxa2iNRbP85UDSqCPB X-Received: by 2002:a63:b605:: with SMTP id j5-v6mr6074505pgf.437.1531923624856; Wed, 18 Jul 2018 07:20:24 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1531923624; cv=none; d=google.com; s=arc-20160816; b=eQo//nb1IwB2NqbirBbS9dGznmo2oB6UxDOm48lAaIzxtBh4LgQidZvesUf47t4ug9 HnabUMMMWpV2r0FwRFO7+CDEaiBlhDK7PjCNjjRXVHAqtdDpE4WN29muNxlCB/uXZ1LE LrihMZFMaxaBhbOvH8NH52gK1q+/BNWti9pEUaJ23bUWmH0xpFAVCV451r8qjeW8dqjQ EFDIiKgbxcjSI/D3Fe1Sdp8mZUsoIDcBszcasLAkELt1q/qniRNStiGMwOKusrnZjRNm sDwtUlI5i2IlbWwCn8G0s2e1cyUw2MqJwwQjOsvQV1s1X/mc9tErXFd9FLNKri/shlKM NCjA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=mQhmiC7prQavquoFQnwRmJge76plqxyysgfxYQpsKR0=; b=kPgRmqEdaziHKE3eMHQGXk4UBK1l9dX9BwBQ1WAdgODeWTmKj0l+/dnOaNXIzhppW8 u8UIu2yw6lyhPb2EbjLXiaW2865WQnU/6Al4+QH5saD5J2X/EUtIH985O67/cPXCsmWJ cmjQ3y43LKdpYOtlclgAU86HXB4mhQsRnsB10rtYHQveCu56npK0LGtb4SO+56EyvIrp KVe2Qo6q5pLK/Y3aQQ/aHrwGsBkafLOJH8zJNSkiVVoRbTmg3R1cZXYer5iP+0gX66d0 MY1GEhGe2wsCeV2wC7sK/VQsreX2Od5/OtEiq7ziJMBZNqZkouF7fsZZqJFLJ6hbP8uD Dn9A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id l9-v6si3441627pfc.121.2018.07.18.07.20.09; Wed, 18 Jul 2018 07:20:24 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730845AbeGRO5A (ORCPT + 99 others); Wed, 18 Jul 2018 10:57:00 -0400 Received: from www262.sakura.ne.jp ([202.181.97.72]:29129 "EHLO www262.sakura.ne.jp" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728861AbeGRO5A (ORCPT ); Wed, 18 Jul 2018 10:57:00 -0400 Received: from fsav405.sakura.ne.jp (fsav405.sakura.ne.jp [133.242.250.104]) by www262.sakura.ne.jp (8.15.2/8.15.2) with ESMTP id w6IEHwWq060723; Wed, 18 Jul 2018 23:17:59 +0900 (JST) (envelope-from penguin-kernel@i-love.sakura.ne.jp) Received: from www262.sakura.ne.jp (202.181.97.72) by fsav405.sakura.ne.jp (F-Secure/fsigk_smtp/530/fsav405.sakura.ne.jp); Wed, 18 Jul 2018 23:17:58 +0900 (JST) X-Virus-Status: clean(F-Secure/fsigk_smtp/530/fsav405.sakura.ne.jp) Received: from [192.168.1.8] (softbank126074194044.bbtec.net [126.74.194.44]) (authenticated bits=0) by www262.sakura.ne.jp (8.15.2/8.15.2) with ESMTPSA id w6IEHwZn060697 (version=TLSv1.2 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Wed, 18 Jul 2018 23:17:58 +0900 (JST) (envelope-from penguin-kernel@i-love.sakura.ne.jp) Subject: Re: INFO: task hung in grab_super To: Dmitry Vyukov Cc: Eric Van Hensbergen , Ron Minnich , Latchesar Ionkov , v9fs-developer@lists.sourceforge.net, syzbot , linux-fsdevel , LKML , syzkaller-bugs , Al Viro References: <0000000000002f5541057143a85e@google.com> <0adc592b-d4a3-f6da-3c5c-22490f641eb9@i-love.sakura.ne.jp> From: Tetsuo Handa Message-ID: <727110bb-0154-e5df-4b2f-e965e3b98c62@i-love.sakura.ne.jp> Date: Wed, 18 Jul 2018 23:17:54 +0900 User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2018/07/18 23:11, Dmitry Vyukov wrote: > On Wed, Jul 18, 2018 at 3:35 PM, Tetsuo Handa > wrote: >>>>> This seems to be related to 9p. After rerunning the log I got: >>>>> >>>>> root@syzkaller:~# ps afxu | grep syz >>>>> root 18253 0.0 0.0 0 0 ttyS0 Zl 10:16 0:00 \_ >>>>> [syz-executor] >>>>> root@syzkaller:~# cat /proc/18253/task/*/stack >>>>> [<0>] p9_client_rpc+0x3a2/0x1400 >>>>> [<0>] p9_client_flush+0x134/0x2a0 >>>>> [<0>] p9_client_rpc+0x122c/0x1400 >>>>> [<0>] p9_client_create+0xc56/0x16af >>>>> [<0>] v9fs_session_init+0x21a/0x1a80 >>>>> [<0>] v9fs_mount+0x7c/0x900 >>>>> [<0>] mount_fs+0xae/0x328 >>>>> [<0>] vfs_kern_mount.part.34+0xdc/0x4e0 >>>>> [<0>] do_mount+0x581/0x30e0 >>>>> [<0>] ksys_mount+0x12d/0x140 >>>>> [<0>] __x64_sys_mount+0xbe/0x150 >>>>> [<0>] do_syscall_64+0x1b9/0x820 >>>>> [<0>] entry_SYSCALL_64_after_hwframe+0x49/0xbe >>>>> [<0>] 0xffffffffffffffff >>>>> >>>>> There is a bunch of hangs in 9p, so let's do: >>>>> >>>>> #syz dup: INFO: task hung in flush_work >>>>> >>>> Then, is dumping all threads when khungtaskd fires a candidate >>>> for CONFIG_DEBUG_AID_FOR_SYZBOT=y path? >>> >>> Perhaps would be useful. But maybe only tasks that are blocked for >>> more than timeout/2? and/or unkillable tasks? killable tasks are not a >>> problem. >> >> TASK_KILLABLE waiters are not reported by khungtaskd, are they? >> >> /* use "==" to skip the TASK_KILLABLE tasks waiting on NFS */ >> if (t->state == TASK_UNINTERRUPTIBLE) >> check_hung_task(t, timeout); >> >> And TASK_KILLABLE waiters can become a problem because >> >>> >>> Btw, I see that p9_client_rpc uses wait_event_killable, why wasn't it >>> killed along with the whole process? >>> >> >> wait_event_killable() would return -ERESTARTSYS if got SIGKILL. >> But if (c->status == Connected) && (type == P9_TFLUSH) is also true, >> it ignores SIGKILL by retrying the loop... >> >> again: >> err = wait_event_killable(*req->wq, req->status >= REQ_STATUS_RCVD); >> if ((err == -ERESTARTSYS) && (c->status == Connected) && (type == P9_TFLUSH)) { >> sigpending = 1; >> clear_thread_flag(TIF_SIGPENDING); >> goto again; >> } >> >> I wish they don't ignore SIGKILL (by e.g. offloading operations to a kernel thread). > > > I guess that's the problem, right? SIGKILL-ed task must not ignore > SIGKILL and hang in infinite loop. This would explain a bunch of hangs > in 9p. Did you check /proc/18253/task/*/stack after manually sending SIGKILL? I mean, who (i.e. you or syzkaller programs) is sending a signal (not limited to SIGKILL but any signal) that makes TASK_KILLABLE waiters to wake up?