Received: by 2002:a05:6a10:7420:0:0:0:0 with SMTP id hk32csp995770pxb; Wed, 16 Feb 2022 08:41:33 -0800 (PST) X-Google-Smtp-Source: ABdhPJy8ikq2Z6grO0sgqw9h8TNejGAT2xps9Pbkukt2KbOw/a50/mrLQtaOXsf8KZagy0HQMFjB X-Received: by 2002:a17:906:16cc:b0:6ce:e607:ff02 with SMTP id t12-20020a17090616cc00b006cee607ff02mr2865141ejd.418.1645029693037; Wed, 16 Feb 2022 08:41:33 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1645029693; cv=none; d=google.com; s=arc-20160816; b=sOiEXS8SyFU7r8R5+ONZFD9h8cDC7Ws6bzRMjKOS1zy4vuJ9VidAlJgiuLFKcqcCVf 31PVpC9Y8hGrElvPuZHK5Z4Etf+LYxYDGYmP+MwU/rJLw+gQ69dwwesP2xJo8e/O8uhh B/yeW4V+LNc38w90lR7tiJnzJo18sUw7BC+BIlhPquzx2UjEfYIivqaN9YK1y4UlFggm gHS+L8X4BUJ9F6MSHAvFgYtRPLxyueTq9vavRfrWY43WWqXkLjvanKT9MEbim2ZEdj7I be3xxFUVj9pKJeAzdqP6E8ZDLM3ICmK/tQ4dA0ebg78ydbDWVGXR+UyE0sKfI9+PXxZy zVpA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=Q/am0HDxpBYA40FHZaIrVn8nXi2cQpq4EMCF+My6Irk=; b=M6fMzAu2h8qHiZ+Z5l+BsVZIptmJL1c/2b2qDeZwjqTN8X23GlmUH4qjnP94oZ7Xlf hIqKt2qw8LoynWgzOLmtKVqZzKL8qxZwVZy8TIGn5XuiFbD9dKez+ToaDav+KTdhzU1B +kDRc1KGDJiwIinF2yuHjbBiATPApOkAGoHYeiPts8+sKX/woD0prurQJ73DjLHkU9Rh vPRINd1RheFzmgGOuevr2ouByq5nXQIEBWyUJ67aTF/e93Suh9Vs/M2yQ+ZBUpIyLIci TTHDJ7UIdz8tKB5R+QFIyHAnwRCjQxDDxB7WSPjzGDZkFWYO+9Pg5uAc9dyQDGiC6QGR QjYA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=NtvZYC1A; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id j23si137514eje.624.2022.02.16.08.41.09; Wed, 16 Feb 2022 08:41:33 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=NtvZYC1A; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236242AbiBPQRR (ORCPT + 99 others); Wed, 16 Feb 2022 11:17:17 -0500 Received: from mxb-00190b01.gslb.pphosted.com ([23.128.96.19]:41680 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236138AbiBPQRQ (ORCPT ); Wed, 16 Feb 2022 11:17:16 -0500 Received: from mail-il1-x12c.google.com (mail-il1-x12c.google.com [IPv6:2607:f8b0:4864:20::12c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 7F77D1CC287 for ; Wed, 16 Feb 2022 08:17:02 -0800 (PST) Received: by mail-il1-x12c.google.com with SMTP id m8so145163ilg.7 for ; Wed, 16 Feb 2022 08:17:02 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=Q/am0HDxpBYA40FHZaIrVn8nXi2cQpq4EMCF+My6Irk=; b=NtvZYC1AXgVz959qRsuGJurUXhPS4HOehNbNlRawXOrzvQ6FV9JAel/aIqZu4j5tOe ARRoSb3laykvZtmYJ6G7zYl0hT66XXYCL7t563wkKS8/eQawUAI0cHV/kAFXIcWzmFE1 65Wmi/+YKQDZjLXHTp5GG7JwCR3HjiRGv3sX+0EmjGHl1deohObtRCBJlSodzDjZwKXC Qq1NJJtyttLjiMaYdze63OowaswwpdPPyNLxnvcF+uxw+otohltXnHTtuEmrzOraOegu 46EZHH02T+0+JN/0rnQlzoB2slZbFlAJS7WuOyGNU9Yv9SVWk4NmWWYDV4BGYveQHaXF 0Rwg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=Q/am0HDxpBYA40FHZaIrVn8nXi2cQpq4EMCF+My6Irk=; b=fkpb5RA1cAUVlHvBWc5QOal0CUD3rOsDbQxfvKG6dvyDPZMLmbeqyleM2VpBLYYtJE vopFpWpXOp2uYMqpgF278YxoOMHomnPo7zVHwZ4Wvf6Xm/yQiF7uExDwe9Wt/FKlsP9m D+L9o6bL6CWd0/LfoaMhQgL2hoF8vxExaGralky7bgb10oD4hKCQ1RxQ8lTVuOyN5eY8 N8kHDCXuK48u3fMVDk3HD65SWbQKsDhj8Obe46f/oA/mUrctdrU14aXO8Ru4+SZNFNB2 X+35ndwmtieiNOG4Ucgq/E2oVamy88o8pARtoGw4YRNK+F4hRVNao7ORy8N/dHF4DdK6 ZfVQ== X-Gm-Message-State: AOAM531L1e/8vTREZiXnqkPtEIBiCniF5e1yHJSo3USigbuY94l6jNqt NbMPK0OctdCFZ9sxZf+GI1Vpwj5B0bwy1fNWvG34yiUCmGXjp/pC X-Received: by 2002:a92:c241:0:b0:2b9:80f9:e2a with SMTP id k1-20020a92c241000000b002b980f90e2amr2303305ilo.208.1645028221681; Wed, 16 Feb 2022 08:17:01 -0800 (PST) MIME-Version: 1.0 References: <00000000000038779505d5d8b372@google.com> In-Reply-To: From: Aleksandr Nogikh Date: Wed, 16 Feb 2022 17:16:50 +0100 Message-ID: Subject: Re: [syzbot] riscv/fixes boot error: can't ssh into the instance To: Dmitry Vyukov Cc: Alexandre Ghiti , Alexandre Ghiti , linux-riscv@lists.infradead.org, kasan-dev , Palmer Dabbelt , syzbot , LKML , syzkaller-bugs@googlegroups.com Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-17.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL,USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org If I use just defconfig + DEBUG_VIRTUAL, without any KASAN, it begins to boot, but overwhelms me with tons of `virt_to_phys used for non-linear address:` errors. Like that [ 2.701271] virt_to_phys used for non-linear address: 00000000b59e31b6 (0xffffffff806c2000) [ 2.701727] WARNING: CPU: 0 PID: 1 at arch/riscv/mm/physaddr.c:16 __virt_to_phys+0x7e/0x86 [ 2.702207] Modules linked in: [ 2.702393] CPU: 0 PID: 1 Comm: swapper/0 Tainted: G W 5.17.0-rc1 #1 [ 2.702806] Hardware name: riscv-virtio,qemu (DT) [ 2.703051] epc : __virt_to_phys+0x7e/0x86 [ 2.703298] ra : __virt_to_phys+0x7e/0x86 [ 2.703547] epc : ffffffff80008448 ra : ffffffff80008448 sp : ffff8f800021bde0 [ 2.703977] gp : ffffffff80ed9b30 tp : ffffaf8001230000 t0 : ffffffff80eea56f [ 2.704704] t1 : ffffffff80eea560 t2 : 0000000000000000 s0 : ffff8f800021be00 [ 2.705153] s1 : ffffffff806c2000 a0 : 000000000000004f a1 : ffffffff80e723d8 [ 2.705555] a2 : 0000000000000010 a3 : fffffffffffffffe a4 : 0000000000000000 [ 2.706027] a5 : 0000000000000000 a6 : 0000000000000005 a7 : ffffffffffffffff [ 2.706474] s2 : ffffffff80b80b08 s3 : 00000000000000c2 s4 : ffffffff806c2000 [ 2.706891] s5 : ffffffff80edba10 s6 : ffffffff80edb960 s7 : 0000000000000001 [ 2.707290] s8 : 00000000000000ff s9 : ffffffff80b80b40 s10: 00000000000000cc [ 2.707689] s11: ffffaf807e1fcf00 t3 : 0000000000000076 t4 : ffffffffffffffff [ 2.708092] t5 : 00000000000001f2 t6 : ffff8f800021bb48 [ 2.708433] status: 0000000000000120 badaddr: 0000000000000000 cause: 0000000000000003 [ 2.708919] [] free_reserved_area+0x72/0x19a [ 2.709296] [] free_initmem+0x6c/0x7c [ 2.709648] [] kernel_init+0x3a/0x10a [ 2.709993] [] ret_from_exception+0x0/0xc [ 2.710310] ---[ end trace 0000000000000000 ]--- On Wed, Feb 16, 2022 at 5:09 PM Aleksandr Nogikh wrote: > > On Wed, Feb 16, 2022 at 12:56 PM Dmitry Vyukov wrote: > > > > On Wed, 16 Feb 2022 at 12:47, Aleksandr Nogikh wrote: > > > > > > On Wed, Feb 16, 2022 at 11:37 AM Aleksandr Nogikh wrote: > > > > > > > > Hi Alex, > > > > > > > > On Wed, Feb 16, 2022 at 5:14 AM Alexandre Ghiti wrote: > > > > > > > > > > Hi Dmitry, > > > > > > > > > > On 2/15/22 18:12, Dmitry Vyukov wrote: > > > > > > On Wed, 2 Feb 2022 at 14:18, Alexandre Ghiti > > > > > > wrote: > > > > > >> Hi Aleksandr, > > > > > >> > > > > > >> On Wed, Feb 2, 2022 at 12:08 PM Aleksandr Nogikh wrote: > > > > > >>> Hello, > > > > > >>> > > > > > >>> syzbot has already not been able to fuzz its RISC-V instance for 97 > > > > > >> That's a longtime, I'll take a look more regularly. > > > > > >> > > > > > >>> days now because the compiled kernel cannot boot. I bisected the issue > > > > > >>> to the following commit: > > > > > >>> > > > > > >>> commit 54c5639d8f507ebefa814f574cb6f763033a72a5 > > > > > >>> Author: Alexandre Ghiti > > > > > >>> Date: Fri Oct 29 06:59:27 2021 +0200 > > > > > >>> > > > > > >>> riscv: Fix asan-stack clang build > > > > > >>> > > > > > >>> Apparently, the problem appears on GCC-built RISC-V kernels with KASAN > > > > > >>> enabled. In the previous message syzbot mentions > > > > > >>> "riscv64-linux-gnu-gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU > > > > > >>> Binutils for Debian) 2.35.2", but the issue also reproduces finely on > > > > > >>> a newer GCC compiler: "riscv64-linux-gnu-gcc (Debian 11.2.0-10) > > > > > >>> 11.2.0, GNU ld (GNU Binutils for Debian) 2.37". > > > > > >>> For convenience, I also duplicate the .config file from the bot's > > > > > >>> message: https://syzkaller.appspot.com/x/.config?x=522544a2e0ef2a7d > > > > > >>> > > > > > >>> Can someone with KASAN and RISC-V expertise please take a look? > > > > > >> I'll take a look at that today. > > > > > >> > > > > > >> Thanks for reporting the issue, > > > > > > > > > > > > > > > > I took a quick look, not enough to fix it but I know the issue comes > > > > > from the inline instrumentation, I have no problem with the outline > > > > > instrumentation. I need to find some cycles to work on this, my goal is > > > > > to fix this for 5.17. > > > > > > > > Thanks for the update! > > > > > > > > Can you please share the .config with which you tested the outline > > > > instrumentation? > > > > I updated the syzbot config to use KASAN_OUTLINE instead of KASAN_INLINE, > > > > but it still does not boot :( > > > > > > > > Here's what I used: > > > > https://gist.github.com/a-nogikh/279c85c2d24f47efcc3e865c08844138 > > > > > > Update: it doesn't boot with that big config, but boots if I generate > > > a simple one with KASAN_OUTLINE: > > > > > > make defconfig ARCH=riscv CROSS_COMPILE=riscv64-linux-gnu- > > > ./scripts/config -e KASAN -e KASAN_OUTLINE > > > make olddefconfig ARCH=riscv CROSS_COMPILE=riscv64-linux-gnu- > > > > > > And it indeed doesn't work if I use KASAN_INLINE. > > > > It may be an issue with code size. Full syzbot config + KASAN + KCOV > > produce hugely massive .text. It may be hitting some limitation in the > > bootloader/kernel bootstrap code. > > I bisected the difference between the config we use on syzbot and the > simple one that was generated like I described above. > Turns out that it's the DEBUG_VIRTUAL config that makes the difference. > > make defconfig ARCH=riscv CROSS_COMPILE=riscv64-linux-gnu- > ./scripts/config -e KASAN -e KASAN_OUTLINE -e DEBUG_VIRTUAL > make olddefconfig ARCH=riscv CROSS_COMPILE=riscv64-linux-gnu- > > And the resulting kernel does not boot. > My env: the `riscv/fixes` branch, commit > 6df2a016c0c8a3d0933ef33dd192ea6606b115e3, qemu 6.2.0.