Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp4185357ybi; Mon, 27 May 2019 12:54:52 -0700 (PDT) X-Google-Smtp-Source: APXvYqwEH5nqyVGlZuuFCRQtPhDTkXYlRZS/DKxvPAt7YWb0FXXomHjIf79spuqMwO6bhErnVoyU X-Received: by 2002:a17:90a:3569:: with SMTP id q96mr700240pjb.14.1558986892245; Mon, 27 May 2019 12:54:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1558986892; cv=none; d=google.com; s=arc-20160816; b=mdTf1xNCS95bAXh4wRIb7N6fBCbnmIEl19FZAFS4pB2XVs7sIunP0q+J3sb8wJ7nwI 9blv63GCgbKvAle5vTeUSPEoyMB5y8ZdDrZ79Ylgiku2KXw39PhOasTgQvkru6T2pJIh 0/1VpXEWWJaU0zda1ZU+/jR8b26mntoOQ+o4i/m/XH1koC140zs8IVLhbyXToxdIv4h2 nIcPB24gqxcgdd0hcg18aSMJhHMnUE3RBasqkUoR3O6J1K3+VsgOSR0E7d9dmQE8MlT6 HKCfdYdKGaUcXbAgWKpqZin24SlPRHDFC4jMT7NwiI81be0OdNDcs7DvQTQmmezmzCz7 z1Yw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:mail-followup-to :message-id:subject:to:from:date:dkim-signature; bh=AaAQ54esFK5LFGU1nOQFOnmsw5Vkl3PDTa/ucRPbo5E=; b=g9VxYSH3s6mC9H2qIawBc+r7SrHKldX+2RLI2bynAnmkAI73ZiSquRhFzX0maItwNv MShb1NpbV37NSKXJu4zLIla7RwVIyE/1/x5grt/IgZeUH7C5pJPcMohKu8g0kLMcMgdf w9YskK92qSWH1s0+0hKBPooO31H+4n2U6ESL8cNxZjv/MGhx/XWQ+ZmG5ryUVwmOOk3+ 3HwY6/AVLNTOF0VoBg18JXSr3ICMQDOuEyccxx8VWgU5YOIVuVLVXMSDyASI6cdqfQ5Q jz3sUcZhig9uoZlkYUWjZUSkbFtcrGmo3I2atFYZr7ETOvv0RVUqPXh02nV1TKwj8Lyp O0TA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@megous.com header.s=mail header.b=MiChMqP4; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=megous.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e9si15613969pgb.142.2019.05.27.12.54.36; Mon, 27 May 2019 12:54:52 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@megous.com header.s=mail header.b=MiChMqP4; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=megous.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726937AbfE0Txd (ORCPT + 99 others); Mon, 27 May 2019 15:53:33 -0400 Received: from vps.xff.cz ([195.181.215.36]:54828 "EHLO vps.xff.cz" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726346AbfE0Txd (ORCPT ); Mon, 27 May 2019 15:53:33 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=megous.com; s=mail; t=1558986810; bh=TQm1z7XV9OcXC3zOVs+6uPifYNf994Ug5KfUvl6jkw8=; h=Date:From:To:Subject:References:In-Reply-To:From; b=MiChMqP4U9HE88aPO9vEKcSp8Jkx39Bk5JYJZZyrLUPRveB4GmNseExAgdpO1fx5y LTYKRGqy2jEcgC8v6Z3aeFiISGWD+syP8+yqb0uHMXdKmXh8dw7gt/kq2CEFqTd1Eu 7RqduuUIm5Qw2K+82XUblNp+fMCH0WN6HnaYW3wI= Date: Mon, 27 May 2019 21:53:30 +0200 From: =?utf-8?Q?Ond=C5=99ej?= Jirman To: =?utf-8?B?Q2zDqW1lbnQgUMOpcm9u?= , Mauro Carvalho Chehab , Rob Herring , Mark Rutland , Maxime Ripard , Chen-Yu Tsai , devicetree , linux-kernel , linux-sunxi , linux-arm-kernel , linux-media@vger.kernel.org Subject: Re: [PATCH v2 00/10] Allwinner A64/H6 IR support Message-ID: <20190527195330.pugb7ypvnyv32fug@core.my.home> Mail-Followup-To: =?utf-8?B?Q2zDqW1lbnQgUMOpcm9u?= , Mauro Carvalho Chehab , Rob Herring , Mark Rutland , Maxime Ripard , Chen-Yu Tsai , devicetree , linux-kernel , linux-sunxi , linux-arm-kernel , linux-media@vger.kernel.org References: <20190526222536.10917-1-peron.clem@gmail.com> <20190527134805.j7t4ffstrnhdml47@core.my.home> <20190527163117.hpealt6cttqzqdxz@core.my.home> <20190527172337.5qxh5qeqnul55gsb@core.my.home> <20190527193016.yxngu5grsqnctx3z@core.my.home> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20190527193016.yxngu5grsqnctx3z@core.my.home> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Cl?ment, On Mon, May 27, 2019 at 09:30:16PM +0200, verejna wrote: > Hi Cl?ment, > > On Mon, May 27, 2019 at 08:49:59PM +0200, Cl?ment P?ron wrote: > > Hi Ondrej, > > > > > > > > I'm testing on Orange Pi 3. > > > > > > With your patches, I get kernel lockup after ~1 minute of use (ssh stops > > > responding/serial console stops responding). I don't have RC controller to test > > > the CIR. But just enabling the CIR causes kernel to hang shortly after boot. > > > > > > I tried booting multiple times. Other results: > > > > > > boot 2: > > > > > > - ssh hangs even before connecting (ethernet crashes/is reset) > > > > > > INFO: rcu_sched detected stalls on CPUs/tasks: > > > rcu: 0-....: (1 GPs behind) idle=64a/0/0x3 softirq=4091/4091 fqs=2437 > > > dwmac-sun8i 5020000.ethernet eth0: Reset adapter. > > > rcu: INFO: rcu_sched detected expedited stalls on CPUs/tasks: { 0-... } 5696 jiffies s: 81 root: 0x1/. > > > rcu: blocking rcu_node structures: > > > rcu: INFO: rcu_sched detected stalls on CPUs/tasks: > > > rcu: 0-....: (1 GPs behind) idle=64a/0/0x3 softirq=4091/4091 fqs=9714 > > > rcu: INFO: rcu_sched detected expedited stalls on CPUs/tasks: { 0-... } 21568 jiffies s: 81 root: 0x1/. > > > rcu: blocking rcu_node structures: > > > rcu: INFO: rcu_sched detected stalls on CPUs/tasks: > > > rcu: 0-....: (1 GPs behind) idle=64a/0/0x3 softirq=4091/4091 fqs=17203 > > > > > > above messages appear regularly. > > > > > > boot 3: > > > > > > rcu: INFO: rcu_sched detected stalls on CPUs/tasks: > > > rcu: 0-....: (9 GPs behind) idle=992/0/0x3 softirq=6123/6123 fqs=2600 > > > > > > > > > Sometimes serial console keeps working. Sometimes it locks up too (but not > > > frequently). Storage locks up always (any program that was not run before > > > the crash can't be started and lock up the kernel hard, programs that > > > were executed prior, can be run again). > > > > > > > > > Exactly the same kernel build on H5 seems to work (or at least I was not able to > > > trigger the crash). So this seems to be limited to H6 for now. > > > > > > I suspect that the crash occurs sooner if I vary the light (turn on/off the table > > > lamp light). > > > > > > Without your patches, everything works fine on H6, and I never see > > > crashes/lockups. > > > > > > I tired physically covering the IR receiver, and that helps preventing the > > > crash. As soon as I uncover it, the crash happens again in 1s or so: > > > > > > rcu: INFO: rcu_sched detected stalls on CPUs/tasks: > > > rcu: 0-....: (1 GPs behind) idle=4ea/0/0x3 softirq=4483/4484 fqs=2444 > > > rcu: INFO: rcu_sched detected stalls on CPUs/tasks: > > > rcu: 0-....: (1 GPs behind) idle=4ea/0/0x3 softirq=4483/4484 fqs=9777 > > > > > > This time I got the hung task and reboot: (probably not directly related) > > > > > > INFO: task find:560 blocked for more than 120 seconds. > > > Not tainted 5.2.0-rc2+ #7 > > > "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. > > > find D 0 560 551 0x00000000 > > > Call trace: > > > __switch_to+0x6c/0x90 > > > __schedule+0x1f4/0x578 > > > schedule+0x28/0xa8 > > > io_schedule+0x18/0x38 > > > __lock_page+0x12c/0x208 > > > pagecache_get_page+0x238/0x2e8 > > > __get_node_page+0x6c/0x310 > > > f2fs_get_node_page+0x14/0x20 > > > f2fs_iget+0x70/0xc60 > > > f2fs_lookup+0xcc/0x218 > > > __lookup_slow+0x78/0x160 > > > lookup_slow+0x3c/0x60 > > > walk_component+0x1e4/0x2e0 > > > path_lookupat.isra.13+0x5c/0x1e0 > > > filename_lookup.part.23+0x6c/0xe8 > > > user_path_at_empty+0x4c/0x60 > > > vfs_statx+0x78/0xd8 > > > __se_sys_newfstatat+0x24/0x48 > > > __arm64_sys_newfstatat+0x18/0x20 > > > el0_svc_handler+0x9c/0x170 > > > el0_svc+0x8/0xc > > > Kernel panic - not syncing: hung_task: blocked tasks > > > CPU: 1 PID: 34 Comm: khungtaskd Not tainted 5.2.0-rc2+ #7 > > > Hardware name: OrangePi 3 (DT) > > > Call trace: > > > dump_backtrace+0x0/0xf8 > > > show_stack+0x14/0x20 > > > dump_stack+0xa8/0xcc > > > panic+0x124/0x2dc > > > proc_dohung_task_timeout_secs+0x0/0x40 > > > kthread+0x120/0x128 > > > ret_from_fork+0x10/0x18 > > > SMP: stopping secondary CPUs > > > Kernel Offset: disabled > > > CPU features: 0x0002,20002000 > > > Memory Limit: none > > > Rebooting in 3 seconds.. > > > > > > > > > Meanwhile H5 based board now runs for 15 minutes without issues. > > > > > > So to sum up: > > > > > > - these crashes are definitely H6 IR related > > > - the same kernel, on H5 works > > > - covering the sensor prevents the crashes on H6 > > > > > > So we should probably hold on with the series, until this is figured out. > > > > Thanks for testing, but I think it's more hardware related. > > It seems that your IR is flooded or misconfigured for your board. > > Could you add a simple print in the "sunxi_ir_irq" > > Yes, I get flood of IRQs with status = 0x30. (after I turn on the lamp, > but it persists even after I turn it off and cover the IR sensor). Interestingly, status also contains RAC, and it's 0 in this case. So the interrupt if firing with "No available data in RX FIFO" repeatedly. Regardless of input. So there's something else up. regards, o. > That's weird, because on H6 in CIR_RXSTA, bit 5 is undefined but corresponding > bit in CIR_RXINT is DRQ_EN (RX FIFO DMA Enable) > > So I'm not sure what it could be flooded with and why IRQs keep being > fired, even with no sensor input after the FIFO is read. > > regards, > o. > > > If it's confirmed, maybe tweak the threshold configuration or > > implement the new active_threshold will help. > > > > With my hardware Beelink GS1 and on Jernej's board (A64) there is no issue. > > > > I will disable all the other H6 boards until someone test it. > > > > Regards, > > Cl?ment > > _______________________________________________ > linux-arm-kernel mailing list > linux-arm-kernel@lists.infradead.org > http://lists.infradead.org/mailman/listinfo/linux-arm-kernel