Received: by 2002:a05:6358:16cc:b0:ea:6187:17c9 with SMTP id r12csp8075969rwl; Tue, 10 Jan 2023 08:46:27 -0800 (PST) X-Google-Smtp-Source: AMrXdXuNw8jQ6lku7/jopZ+tPc8V0MKSF7Jtj40uwDzFxvB+kgezXmY3ef9EhBAZuU7gBYB822+3 X-Received: by 2002:a05:6a20:b91c:b0:b2:4327:fa6c with SMTP id fe28-20020a056a20b91c00b000b24327fa6cmr69026762pzb.18.1673369187270; Tue, 10 Jan 2023 08:46:27 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1673369187; cv=none; d=google.com; s=arc-20160816; b=lhzx3863hwhkAqj6UkLA0899KUQF6DM1UHFCT44iFlO07h9VY6m8w5QAsTSSfyauQx MvhezCrPW/WfdcqHdvo/8QCs//BUIJraJoUgefzB9g/Qoarr964Bbvdk+rxweuUx5txx dwJgQrzMIkFTaC9NpAkvl57pt51ZvUCNgWQccgZ77+ChwvMpBWDwqdURguPEsKzMAoya HV6KDDRfTGXAnH7u2id80ZftEwF5dg/YXvVy2mid8DBc/RnTtMuyEz/sgrKpVJ8fYga1 zWQj+Y2PD41HQ2ulmEjFPhMHjG7LBwl4gPAeTyIB1dXl/lKOvom28qFErbGZXp4FYIxi yZpA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:subject:cc:to:from:date:references:in-reply-to :message-id:mime-version:user-agent:feedback-id:dkim-signature :dkim-signature; bh=BQbRuk/s7xiQujbEKJZAedL9CmNeLHCNVqnm1mI9eH8=; b=w4d5ml68T9D5HeFaksHfG2EV2seSdNX5zsc0DtNpqHJ+u/Gsr3AnVcfAzHd7tcC+QS n5pAHhBado5CBARWJCQPF858znPojYNGWh6GaPUBBY1BrQ6YJ8jIPtPDDppFvdv1BvAR Bhi8WZobnfzp6bUoSdjHVDrgZqyIE9Zv3xnwCZpdoDyre0e7K2F/HGtkJGUfsO8gyMGP +7K6QzEoMm21XeX2R8up/qe60RStFp0+l2Kge2OisNXRGAsBujW/HQj21wVlTPIVtRaH +KrH6SnQu/URtXiBf5ng0Rz+jsRhvVL0O++ll68KB0BEwW6CItTMXgqbAC3DM5xgQLp7 ucXA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@arndb.de header.s=fm2 header.b=MG29sti5; dkim=pass header.i=@messagingengine.com header.s=fm3 header.b=M3IrA3iS; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id r17-20020a632051000000b00477ae8def63si12028193pgm.495.2023.01.10.08.46.20; Tue, 10 Jan 2023 08:46:27 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@arndb.de header.s=fm2 header.b=MG29sti5; dkim=pass header.i=@messagingengine.com header.s=fm3 header.b=M3IrA3iS; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238928AbjAJQls (ORCPT + 53 others); Tue, 10 Jan 2023 11:41:48 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55636 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238871AbjAJQlf (ORCPT ); Tue, 10 Jan 2023 11:41:35 -0500 Received: from out3-smtp.messagingengine.com (out3-smtp.messagingengine.com [66.111.4.27]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 8789D78E85; Tue, 10 Jan 2023 08:41:32 -0800 (PST) Received: from compute6.internal (compute6.nyi.internal [10.202.2.47]) by mailout.nyi.internal (Postfix) with ESMTP id 246775C00C6; Tue, 10 Jan 2023 11:41:30 -0500 (EST) Received: from imap51 ([10.202.2.101]) by compute6.internal (MEProxy); Tue, 10 Jan 2023 11:41:30 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=arndb.de; h=cc :cc:content-type:date:date:from:from:in-reply-to:in-reply-to :message-id:mime-version:references:reply-to:sender:subject :subject:to:to; s=fm2; t=1673368890; x=1673455290; bh=BQbRuk/s7x iQujbEKJZAedL9CmNeLHCNVqnm1mI9eH8=; b=MG29sti5l3eVz+M3xptJxz0NC1 IE/qvGYdM5ehaCmzthzjydWfuT8MzV4zLv1gLa2hfxy1VXTntIRezdknmalF797p 0uBvxcXVdpEyv3TroYxwgy1kion+HyYNZHRxyC/f0o1QVYqMg86ueUKXDJQbdysY IalayF8dxEy1AW/PYZUGHOK05+goJmMJJsNFS2MDxrx86Z1y71ZnAP5lMZp89ia9 E2wEpFbszvlM/gqJ5L3Av3RvMq4VPHNI0yC/tPokJ7y7/X68B+ffporoGHBHQIej +2N4aI8P+PGfvB90Oc9vhU7uFr7jo+OsiulI03gT4a1S1bwcBIGWMN6CXOkA== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:cc:content-type:date:date:feedback-id :feedback-id:from:from:in-reply-to:in-reply-to:message-id :mime-version:references:reply-to:sender:subject:subject:to:to :x-me-proxy:x-me-proxy:x-me-sender:x-me-sender:x-sasl-enc; s= fm3; t=1673368890; x=1673455290; bh=BQbRuk/s7xiQujbEKJZAedL9CmNe LHCNVqnm1mI9eH8=; b=M3IrA3iSERaPm0UxD3uqwyQpI/Jph0h9b4Y52R/Ejhft OiO5SfeTJOEe68m4R+PZLzR2MZ5g3xCl8xfuawkMxu2cgOh5t3R3dWV5JE0Z+3mQ SlUzJ1jdTvGTDKOXEKEgjWaeIfFW8gmrrlnYka+O+GSgG7Uw1LP0xcbDJbna6/Rd VwtM9RDQISi1iaFutZvuLLCgdnp5pSxnSAiYxtfxo5fQT6PPHAT/98ogsOZdNhgu tjvEMnTaokq4bIaXxMKbMXqv9dx98Y8Ir2IYKgFl4Hf91GjhrPQfN6RdreZdINeg mUOUWyxpya/lwNU2brNxj8NwlOniMknjrlVQFiaJzQ== X-ME-Sender: X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgedvhedrledvgddvjecutefuodetggdotefrodftvf curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu uegrihhlohhuthemuceftddtnecusecvtfgvtghiphhivghnthhsucdlqddutddtmdenuc fjughrpefofgggkfgjfhffhffvvefutgesthdtredtreertdenucfhrhhomhepfdetrhhn ugcuuegvrhhgmhgrnhhnfdcuoegrrhhnugesrghrnhgusgdruggvqeenucggtffrrghtth gvrhhnpeffheeugeetiefhgeethfejgfdtuefggeejleehjeeutefhfeeggefhkedtkeet ffenucevlhhushhtvghrufhiiigvpedtnecurfgrrhgrmhepmhgrihhlfhhrohhmpegrrh hnugesrghrnhgusgdruggv X-ME-Proxy: Feedback-ID: i56a14606:Fastmail Received: by mailuser.nyi.internal (Postfix, from userid 501) id CFBF0B60086; Tue, 10 Jan 2023 11:41:29 -0500 (EST) X-Mailer: MessagingEngine.com Webmail Interface User-Agent: Cyrus-JMAP/3.7.0-alpha0-1185-g841157300a-fm-20221208.002-g84115730 Mime-Version: 1.0 Message-Id: <77342621-d67f-4d47-a33b-6f721576bf9d@app.fastmail.com> In-Reply-To: References: Date: Tue, 10 Jan 2023 17:41:08 +0100 From: "Arnd Bergmann" To: "Naresh Kamboju" , linux-next , "open list" , dri-devel@lists.freedesktop.org, "Linux ARM" , lkft-triage@lists.linaro.org, regressions@lists.linux.dev Cc: "Mark Brown" , "Will Deacon" , "Catalin Marinas" , "Mark Rutland" , "Aishwarya TCV" , "Anders Roxell" , "Ard Biesheuvel" , "Neil Armstrong" , linux-amlogic@lists.infradead.org Subject: Re: next-20230110: arm64: defconfig+kselftest config boot failed - Unable to handle kernel paging request at virtual address fffffffffffffff8 Content-Type: text/plain X-Spam-Status: No, score=-2.8 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_LOW, RCVD_IN_MSPIKE_H2,SPF_HELO_PASS,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Jan 10, 2023, at 17:14, Naresh Kamboju wrote: > [ please ignore this email if this regression already reported ] > > Today's Linux next tag next-20230110 boot passes with defconfig but > boot fails with > defconfig + kselftest merge config on arm64 devices and qemu-arm64. > > Reported-by: Linux Kernel Functional Testing > > We are bisecting this problem and get back to you shortly. > > GOOD: next-20230109 (defconfig + kselftests configs) > BAD: next-20230110 (defconfig + kselftests configs) > > kernel crash log [1]: > > [ 15.302140] Unable to handle kernel paging request at virtual > address fffffffffffffff8 > [ 15.309906] Mem abort info: > [ 15.312659] ESR = 0x0000000096000004 > [ 15.316365] EC = 0x25: DABT (current EL), IL = 32 bits > [ 15.321626] SET = 0, FnV = 0 > [ 15.324644] EA = 0, S1PTW = 0 > [ 15.327744] FSC = 0x04: level 0 translation fault > [ 15.332619] Data abort info: > [ 15.335422] ISV = 0, ISS = 0x00000004 > [ 15.339226] CM = 0, WnR = 0 > [ 15.342154] swapper pgtable: 4k pages, 48-bit VAs, pgdp=000000001496c000 > [ 15.348795] [fffffffffffffff8] pgd=0000000000000000, p4d=0000000000000000 > [ 15.355524] Internal error: Oops: 0000000096000004 [#1] PREEMPT SMP > [ 15.361729] Modules linked in: meson_gxl dwmac_generic > snd_soc_meson_gx_sound_card snd_soc_meson_card_utils lima gpu_sched > drm_shmem_helper meson_drm drm_dma_helper crct10dif_ce meson_ir > rc_core meson_dw_hdmi dw_hdmi meson_canvas dwmac_meson8b > stmmac_platform meson_rng stmmac rng_core cec meson_gxbb_wdt > drm_display_helper snd_soc_meson_aiu snd_soc_meson_codec_glue pcs_xpcs > snd_soc_meson_t9015 amlogic_gxl_crypto crypto_engine display_connector > snd_soc_simple_amplifier drm_kms_helper drm nvmem_meson_efuse > [ 15.405976] CPU: 1 PID: 9 Comm: kworker/u8:0 Not tainted > 6.2.0-rc3-next-20230110 #1 > [ 15.413563] Hardware name: Libre Computer AML-S905X-CC (DT) > [ 15.419086] Workqueue: events_unbound deferred_probe_work_func > [ 15.424863] pstate: 00000005 (nzcv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--) > [ 15.431762] pc : of_drm_find_bridge+0x38/0x70 [drm] > [ 15.436594] lr : of_drm_find_bridge+0x20/0x70 [drm] The line is drivers/gpu/drm/drm_bridge.c:1310: if (bridge->of_node == np) { The list_head here is a NULL pointer, so ->of_node points to address negative 8, i.e. fffffffffffffff8 This is linked list corruption, which typically happens as part of a use-after-free, and could be the result of a failed registration causing an object to be freed after it is added to the list. Unfortunately, there are no patches to this file between next-20230109 and next-20230110, so the bug probably is not actually in this file. > [ 15.515426] Call trace: > [ 15.517863] Insufficient stack space to handle exception! > [ 15.517867] ESR: 0x0000000096000047 -- DABT (current EL) > [ 15.517871] FAR: 0xffff80000a047ff0 > [ 15.517873] Task stack: [0xffff80000a048000..0xffff80000a04c000] > [ 15.517877] IRQ stack: [0xffff800008008000..0xffff80000800c000] > [ 15.517880] Overflow stack: [0xffff00007d9c1320..0xffff00007d9c2320] > [ 15.517884] CPU: 1 PID: 9 Comm: kworker/u8:0 Not tainted > 6.2.0-rc3-next-20230110 #1 > [ 15.517890] Hardware name: Libre Computer AML-S905X-CC (DT) > [ 15.517895] Workqueue: events_unbound deferred_probe_work_func > [ 15.517915] pstate: 800003c5 (Nzcv DAIF -PAN -UAO -TCO -DIT -SSBS BTYPE=--) > [ 15.517923] pc : el1_abort+0x4/0x5c > [ 15.517932] lr : el1h_64_sync_handler+0x60/0xac > [ 15.517939] sp : ffff80000a048020 Not sure about the missing stack trace: I can see that the stack pointer is on a task stack, which is reported as having overflown, but I don't see why it's unable to print the stack while running from the overflow stack. A stack overflow is often caused by unbounded recursion, which can happen when a device driver binds itself to a device that it has just created. The log does look a bit suspicious here, with multiple registrations for c883a000.hdmi-tx: 986 08:02:56.487871 [ 15.141218] meson-drm d0100000.vpu: Queued 2 outputs on vpu 987 08:02:56.493572 [ 15.141615] meson8b-dwmac c9410000.ethernet: Ring mode enabled 988 08:02:56.504769 [ 15.150744] meson-drm d0100000.vpu: bound c883a000.hdmi-tx (ops meson_dw_hdmi_ops [meson_dw_hdmi]) 989 08:02:56.515743 [ 15.154970] meson8b-dwmac c9410000.ethernet: Enable RX Mitigation via HW Watchdog Timer 990 08:02:56.521531 [ 15.159175] lima d00c0000.gpu: pp0 - mali450 version major 0 minor 0 991 08:02:56.526718 [ 15.161436] meson-drm d0100000.vpu: Failed to find HDMI transceiver bridge 992 08:02:56.532417 [ 15.168933] lima d00c0000.gpu: pp1 - mali450 version major 0 minor 0 993 08:02:56.537747 [ 15.206102] meson-drm d0100000.vpu: Queued 2 outputs on vpu 994 08:02:56.543435 [ 15.209608] lima d00c0000.gpu: pp2 - mali450 version major 0 minor 0 995 08:02:56.554307 [ 15.217027] meson-drm d0100000.vpu: bound c883a000.hdmi-tx (ops meson_dw_hdmi_ops [meson_dw_hdmi]) 996 08:02:56.559929 [ 15.221169] lima d00c0000.gpu: l2 cache 8K, 4-way, 64byte cache line, 128bit external bus 997 08:02:56.565229 [ 15.231561] meson-drm d0100000.vpu: Failed to find HDMI transceiver bridge 998 08:02:56.576490 [ 15.238133] lima d00c0000.gpu: l2 cache 64K, 4-way, 64byte cache line, 128bit external bus 999 08:02:56.581759 [ 15.253879] lima d00c0000.gpu: bus rate = 166666667 1000 08:02:56.587248 [ 15.257128] lima d00c0000.gpu: mod rate = 24000000 1001 08:02:56.593344 [ 15.261862] lima d00c0000.gpu: error -ENODEV: _opp_set_regulators: no regulator (mali) found 1002 08:02:56.612809 [ 15.286143] [drm] Initialized lima 1.1.0 20191231 for d00c0000.gpu on minor 1 1003 08:02:56.618428 [ 15.292259] meson-drm d0100000.vpu: Queued 2 outputs on vpu 1004 08:02:56.623866 [ 15.293715] meson-drm d0100000.vpu: bound c883a000.hdmi-tx (ops meson_dw_hdmi_ops [meson_dw_hdmi]) 1005 08:02:56.634771 [ 15.302140] Unable to handle kernel paging request at virtual address fffffffffffffff8 Again, I don't see any recent changes in that driver or the corresponding dts file. Arnd