From: Dmitry Vyukov
Date: Sun, 13 May 2018 16:35:31 +0200
Subject: Re: BUG: workqueue lockup (2)
To: Tetsuo Handa
Cc: Eric Biggers, syzbot, Peter Hurley, Greg Kroah-Hartman, Kate Stewart, LKML, Linux-MM, Philippe Ombredanne, syzkaller-bugs, Thomas Gleixner
In-Reply-To: <201805132329.CEB90134.OFFSMHOFtVJQLO@I-love.SAKURA.ne.jp>
X-Mailing-List: linux-kernel@vger.kernel.org

On Sun, May 13, 2018 at 4:29 PM, Tetsuo Handa wrote:
> Eric Biggers wrote:
>> Generally it's best to close syzbot bug reports once the original cause is
>> fixed, so that syzbot can continue to report other bugs with the same signature.
>
> That's difficult to judge. Closing as soon as the original cause is fixed allows
> syzbot to try to report different reproducer for different bugs.
> But at the same time,
> different/similar bugs which were reported in that report (or comments in the discussion
> for that report) will become almost invisible from users (because users unlikely check
> other reports in already fixed bugs).
>
> An example is
>
> general protection fault in kernfs_kill_sb (2)
> https://syzkaller.appspot.com/bug?id=903af3e08fc7ec60e57d9c9b93b035f4fb038d9a
>
> where the cause of above report was already pointed out in the discussion for
> the below report.
>
> general protection fault in kernfs_kill_sb
> https://syzkaller.appspot.com/bug?id=d7db6ecf34f099248e4ff404cd381a19a4075653
>
> Since the latter is marked as "fixed on May 08 18:30", I worry that quite few
> users would check the relationship.
>
>> Note also that a "workqueue lockup" can be caused by almost anything in the
>> kernel, I think. This one for example is probably in the sound subsystem:
>> https://syzkaller.appspot.com/text?tag=CrashReport&x=1767232b800000
>>
>
> Right. Maybe we should not stop the test upon "workqueue lockup" message, for
> it is likely that the cause of lockup is that somebody is busy looping which
> should have been reported shortly as "rcu detected stall".
>
> Of course, there is possibility that "workqueue lockup" is reported because
> cond_resched() was used when explicit schedule_timeout_*() is required, which
> was the reason commit 82607adcf9cdf40f ("workqueue: implement lockup detector")
> was added.
>
> If we stop the test upon "workqueue lockup" message, maybe longer timeout (e.g.
> 300 seconds) is better so that rcu stall or hung task messages are reported
> if rcu stall or hung task is occurring.

Yes, we need to order the different stalls/lockups/hangs/etc. according to
what can trigger what. E.g. an rcu stall can trigger a hung task and a
workqueue lockup, but not the other way around.
There is https://github.com/google/syzkaller/issues/516 to track this.
But I have not yet had time to figure out all the required changes.
If you have additional details, please add them there.
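
To make the ordering idea concrete, here is a rough sketch in Go of how a
report could be attributed to the most likely root cause when several
stall-like messages show up in the same console log. This is not what
syzkaller does today. The Kind names, the RootCause helper and the marker
substrings are all made up for illustration, and the only ordering taken from
the discussion above is that an rcu stall comes before a hung task and a
workqueue lockup. The relative order of the latter two is an arbitrary guess.

```go
package main

import (
	"fmt"
	"strings"
)

// Kind is a class of stall-like kernel report (hypothetical type).
type Kind int

// Listed from most likely root cause to most likely symptom. Only
// "rcu stall before the other two" comes from the thread; the order of
// HungTask versus WorkqueueLockup is an assumption.
const (
	RCUStall Kind = iota
	HungTask
	WorkqueueLockup
	None
)

// markers holds illustrative console substrings, not syzkaller's real patterns.
var markers = map[Kind]string{
	RCUStall:        "detected stall",
	HungTask:        "blocked for more than",
	WorkqueueLockup: "workqueue lockup",
}

// String returns a human-readable name for the kind.
func (k Kind) String() string {
	return [...]string{"rcu stall", "hung task", "workqueue lockup", "none"}[k]
}

// RootCause returns the highest-priority kind whose marker appears in log,
// or None if no marker is found.
func RootCause(log string) Kind {
	for k := RCUStall; k < None; k++ {
		if strings.Contains(log, markers[k]) {
			return k
		}
	}
	return None
}

func main() {
	// The workqueue lockup is printed first, but the rcu stall is reported
	// because it may have caused the lockup and not the other way around.
	log := "BUG: workqueue lockup - pool cpus=0\n" +
		"INFO: rcu_sched detected stalls on CPUs/tasks\n"
	fmt.Println(RootCause(log)) // prints "rcu stall"
}
```

With a scheme like this, a longer timeout after "workqueue lockup" only
matters as a way to let the higher-priority messages appear in the log before
the report is classified.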