Received: by 2002:a05:6358:16cc:b0:ea:6187:17c9 with SMTP id r12csp9219053rwl; Wed, 11 Jan 2023 03:05:48 -0800 (PST) X-Google-Smtp-Source: AMrXdXuEGiRqdYPC7m3RQ4GBmQ7HKhfkdkYl5xjthZiZLn8Z5lZokjS52HXS+6Wr5HO3VNb5yt6h X-Received: by 2002:a05:6a20:94a1:b0:ad:c694:3fbb with SMTP id hs33-20020a056a2094a100b000adc6943fbbmr72186758pzb.25.1673435147880; Wed, 11 Jan 2023 03:05:47 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1673435147; cv=none; d=google.com; s=arc-20160816; b=z8FZwYJno0Fu2weR+a8xITWWAWq5YtGIRaGMew7+K5tGtqYpfBoI9cdkLuEbsJ4uuR MYpyRnZ0VGV4LAX+Ymy0IXadPAcOAjXidDbWNtjdvPzEovmmqUyjHjsYQOx7FB8rhON3 S1j/msR6+ADJ4a9ItvDbfJ7WzfiQsLnaoBwZ4SolYmRYkxSZ0gpLAFsJseBCrSVlqKEq snVY0Gh79OweknBqb5QYSeYL0MCVCd9kznHertZlepN9y4+IOvabePPoVM1j+r4QbFjv tRO+Zuhjb76E26zanoKK2O8gMV9mE13KyruAVRsPAt3IPhrzxYYE/bJrVXmKZsEAZCuC Hvjg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to :organization:references:cc:to:content-language:subject:reply-to :from:user-agent:mime-version:date:message-id:dkim-signature; bh=s3KqGObhsHrRbnSdUxruE0y4nmn0w6bjS57+0xuPZcw=; b=K3YVU77hB2N5kr58mir6o/23J0O4FtzDayTbnl1F7OaVxAgeZ7Frnau2vYUsXoshxh 8PNYZbjQ47R3779bXRu0l4N/h26TcKDtSyoSDrB4IKRJGURtzC2Z5h27oxR+sLoexCyj Ay+Nl5oQ5r+Evwz8jAAz37DnU7ad0tBn1QBgAVn6SMj/2dMWTSQJcYIgWATy9ySh5YO9 zG5ENDqXLhDK8SUdsrr/l/o25jdTwIFl7ka/3jKbw962kliHY0mbTSerLQXdhlX5BSAD /6cTkhLZEghi8140Y7tVIyo2H6JYGvAvIZTIjaopetZwwc0DufnDwJQBgrkJyS3rL7bk KSNA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=nGcDPPgC; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id a2-20020a63e402000000b004a2f26de947si14469576pgi.224.2023.01.11.03.05.40; Wed, 11 Jan 2023 03:05:47 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=nGcDPPgC; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238400AbjAKKfF (ORCPT + 53 others); Wed, 11 Jan 2023 05:35:05 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54372 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233520AbjAKKet (ORCPT ); Wed, 11 Jan 2023 05:34:49 -0500 Received: from mail-wm1-x32f.google.com (mail-wm1-x32f.google.com [IPv6:2a00:1450:4864:20::32f]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id EDFE410059 for ; Wed, 11 Jan 2023 02:34:43 -0800 (PST) Received: by mail-wm1-x32f.google.com with SMTP id m26-20020a05600c3b1a00b003d9811fcaafso12264812wms.5 for ; Wed, 11 Jan 2023 02:34:43 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=content-transfer-encoding:in-reply-to:organization:references:cc:to :content-language:subject:reply-to:from:user-agent:mime-version:date :message-id:from:to:cc:subject:date:message-id:reply-to; bh=s3KqGObhsHrRbnSdUxruE0y4nmn0w6bjS57+0xuPZcw=; b=nGcDPPgC2WNOGZZX/rmF0U0iF7x/ue7sd1UMOIHUj1uob1tYj7ae2czwU/aivZKVgX IPIeb1sYJVOH+GBZc53rnGhZ98KHkGnJCjQZLswPLSyZR1q66lGzwNETb3tII+ZGZFQ4 /V4ZPz+uwruLwi2sll7x7vFuPNuIec0MYiGCriDXfiQHgUdOJWVSUzHxvGYQ4DLv63pC 1Vwlbgp0Z3v52gJLbVaP5fbwtGMjOjt62iOUEj323zu6uX45OGXisPUgNxsJSQdJTzEk hqUzwMxsuVh0xJz+aseAYFCe6L2pdUefKpb+oAwgL4D3VgaPro/O25YRfIKWirwtz/o9 ltLg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:in-reply-to:organization:references:cc:to :content-language:subject:reply-to:from:user-agent:mime-version:date :message-id:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=s3KqGObhsHrRbnSdUxruE0y4nmn0w6bjS57+0xuPZcw=; b=12nR87wkv9oKlp5BrF88AGyuqXeI6HDRryRYTgAJaM/Pyp6hdbiYrM/wMNKFTxTWrO ask9F2gmmWSpL/tL2Z8QCm7MrCRhAMlBVfQGJ0hb2ocCLd6W5X1qxxNC3OEpm9CMwpZR 1wLEy15p9tF6l1ONTRP8KCWSZ03MS3vqkcKpga/nYcMKkDKPdRDQr3K/T2NkYWi7YSSA 7cyUR/uuhYLqWdSfB5D8J9qCcMdEQ7DSFCM8ETHv6uZHwYqc9N+bvsJD7pHe69Z78wtP dyODn0uO0YPgfp5XQNQ0HsfsRmY1JT8gSPcGIUT4diq8Omhvfck0vtd+om42ujQlDfv9 vJAw== X-Gm-Message-State: AFqh2koFn2tLUMHJzd8YjhzvwWU6yAf9mRjT0RG3Rv6GVKnwf/Cfp1HP Rq2CEpbKVzHC+YqHEUgZW/9+hg== X-Received: by 2002:a05:600c:3506:b0:3cf:803b:d7cc with SMTP id h6-20020a05600c350600b003cf803bd7ccmr55084313wmq.33.1673433282477; Wed, 11 Jan 2023 02:34:42 -0800 (PST) Received: from ?IPV6:2a01:e0a:982:cbb0:d95d:43b7:d6a9:39a5? ([2a01:e0a:982:cbb0:d95d:43b7:d6a9:39a5]) by smtp.gmail.com with ESMTPSA id n14-20020a05600c3b8e00b003b49bd61b19sm25394292wms.15.2023.01.11.02.34.41 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 11 Jan 2023 02:34:42 -0800 (PST) Message-ID: <19423616-79d4-9478-c6df-4f226f87a27f@linaro.org> Date: Wed, 11 Jan 2023 11:34:41 +0100 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.6.0 From: Neil Armstrong Reply-To: neil.armstrong@linaro.org Subject: Re: next-20230110: arm64: defconfig+kselftest config boot failed - Unable to handle kernel paging request at virtual address fffffffffffffff8 Content-Language: en-US To: Arnd Bergmann , Naresh Kamboju , linux-next , open list , dri-devel@lists.freedesktop.org, Linux ARM , lkft-triage@lists.linaro.org, regressions@lists.linux.dev Cc: Mark Brown , Will Deacon , Catalin Marinas , Mark Rutland , Aishwarya TCV , Anders Roxell , Ard Biesheuvel , linux-amlogic@lists.infradead.org References: <77342621-d67f-4d47-a33b-6f721576bf9d@app.fastmail.com> Organization: Linaro Developer Services In-Reply-To: <77342621-d67f-4d47-a33b-6f721576bf9d@app.fastmail.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, On 10/01/2023 17:41, Arnd Bergmann wrote: > On Tue, Jan 10, 2023, at 17:14, Naresh Kamboju wrote: >> [ please ignore this email if this regression already reported ] >> >> Today's Linux next tag next-20230110 boot passes with defconfig but >> boot fails with >> defconfig + kselftest merge config on arm64 devices and qemu-arm64. >> >> Reported-by: Linux Kernel Functional Testing >> >> We are bisecting this problem and get back to you shortly. >> >> GOOD: next-20230109 (defconfig + kselftests configs) >> BAD: next-20230110 (defconfig + kselftests configs) >> >> kernel crash log [1]: >> >> [ 15.302140] Unable to handle kernel paging request at virtual >> address fffffffffffffff8 >> [ 15.309906] Mem abort info: >> [ 15.312659] ESR = 0x0000000096000004 >> [ 15.316365] EC = 0x25: DABT (current EL), IL = 32 bits >> [ 15.321626] SET = 0, FnV = 0 >> [ 15.324644] EA = 0, S1PTW = 0 >> [ 15.327744] FSC = 0x04: level 0 translation fault >> [ 15.332619] Data abort info: >> [ 15.335422] ISV = 0, ISS = 0x00000004 >> [ 15.339226] CM = 0, WnR = 0 >> [ 15.342154] swapper pgtable: 4k pages, 48-bit VAs, pgdp=000000001496c000 >> [ 15.348795] [fffffffffffffff8] pgd=0000000000000000, p4d=0000000000000000 >> [ 15.355524] Internal error: Oops: 0000000096000004 [#1] PREEMPT SMP >> [ 15.361729] Modules linked in: meson_gxl dwmac_generic >> snd_soc_meson_gx_sound_card snd_soc_meson_card_utils lima gpu_sched >> drm_shmem_helper meson_drm drm_dma_helper crct10dif_ce meson_ir >> rc_core meson_dw_hdmi dw_hdmi meson_canvas dwmac_meson8b >> stmmac_platform meson_rng stmmac rng_core cec meson_gxbb_wdt >> drm_display_helper snd_soc_meson_aiu snd_soc_meson_codec_glue pcs_xpcs >> snd_soc_meson_t9015 amlogic_gxl_crypto crypto_engine display_connector >> snd_soc_simple_amplifier drm_kms_helper drm nvmem_meson_efuse >> [ 15.405976] CPU: 1 PID: 9 Comm: kworker/u8:0 Not tainted >> 6.2.0-rc3-next-20230110 #1 >> [ 15.413563] Hardware name: Libre Computer AML-S905X-CC (DT) >> [ 15.419086] Workqueue: events_unbound deferred_probe_work_func >> [ 15.424863] pstate: 00000005 (nzcv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--) >> [ 15.431762] pc : of_drm_find_bridge+0x38/0x70 [drm] >> [ 15.436594] lr : of_drm_find_bridge+0x20/0x70 [drm] > > The line is > > drivers/gpu/drm/drm_bridge.c:1310: if (bridge->of_node == np) { > > The list_head here is a NULL pointer, so ->of_node points > to address negative 8, i.e. fffffffffffffff8 > > This is linked list corruption, which typically happens as > part of a use-after-free, and could be the result of a > failed registration causing an object to be freed after > it is added to the list. > > Unfortunately, there are no patches to this file between > next-20230109 and next-20230110, so the bug probably is > not actually in this file. > >> [ 15.515426] Call trace: >> [ 15.517863] Insufficient stack space to handle exception! >> [ 15.517867] ESR: 0x0000000096000047 -- DABT (current EL) >> [ 15.517871] FAR: 0xffff80000a047ff0 >> [ 15.517873] Task stack: [0xffff80000a048000..0xffff80000a04c000] >> [ 15.517877] IRQ stack: [0xffff800008008000..0xffff80000800c000] >> [ 15.517880] Overflow stack: [0xffff00007d9c1320..0xffff00007d9c2320] >> [ 15.517884] CPU: 1 PID: 9 Comm: kworker/u8:0 Not tainted >> 6.2.0-rc3-next-20230110 #1 >> [ 15.517890] Hardware name: Libre Computer AML-S905X-CC (DT) >> [ 15.517895] Workqueue: events_unbound deferred_probe_work_func >> [ 15.517915] pstate: 800003c5 (Nzcv DAIF -PAN -UAO -TCO -DIT -SSBS BTYPE=--) >> [ 15.517923] pc : el1_abort+0x4/0x5c >> [ 15.517932] lr : el1h_64_sync_handler+0x60/0xac >> [ 15.517939] sp : ffff80000a048020 > > Not sure about the missing stack trace: I can see that the stack > pointer is on a task stack, which is reported as having overflown, > but I don't see why it's unable to print the stack while running > from the overflow stack. > > A stack overflow is often caused by unbounded recursion, which > can happen when a device driver binds itself to a device that it > has just created. The log does look a bit suspicious here, > with multiple registrations for c883a000.hdmi-tx: > > 986 08:02:56.487871 [ 15.141218] meson-drm d0100000.vpu: Queued 2 outputs on vpu > 987 08:02:56.493572 [ 15.141615] meson8b-dwmac c9410000.ethernet: Ring mode enabled > 988 08:02:56.504769 [ 15.150744] meson-drm d0100000.vpu: bound c883a000.hdmi-tx (ops meson_dw_hdmi_ops [meson_dw_hdmi]) > 989 08:02:56.515743 [ 15.154970] meson8b-dwmac c9410000.ethernet: Enable RX Mitigation via HW Watchdog Timer > 990 08:02:56.521531 [ 15.159175] lima d00c0000.gpu: pp0 - mali450 version major 0 minor 0 > 991 08:02:56.526718 [ 15.161436] meson-drm d0100000.vpu: Failed to find HDMI transceiver bridge > 992 08:02:56.532417 [ 15.168933] lima d00c0000.gpu: pp1 - mali450 version major 0 minor 0 > 993 08:02:56.537747 [ 15.206102] meson-drm d0100000.vpu: Queued 2 outputs on vpu > 994 08:02:56.543435 [ 15.209608] lima d00c0000.gpu: pp2 - mali450 version major 0 minor 0 > 995 08:02:56.554307 [ 15.217027] meson-drm d0100000.vpu: bound c883a000.hdmi-tx (ops meson_dw_hdmi_ops [meson_dw_hdmi]) > 996 08:02:56.559929 [ 15.221169] lima d00c0000.gpu: l2 cache 8K, 4-way, 64byte cache line, 128bit external bus > 997 08:02:56.565229 [ 15.231561] meson-drm d0100000.vpu: Failed to find HDMI transceiver bridge > 998 08:02:56.576490 [ 15.238133] lima d00c0000.gpu: l2 cache 64K, 4-way, 64byte cache line, 128bit external bus > 999 08:02:56.581759 [ 15.253879] lima d00c0000.gpu: bus rate = 166666667 > 1000 08:02:56.587248 [ 15.257128] lima d00c0000.gpu: mod rate = 24000000 > 1001 08:02:56.593344 [ 15.261862] lima d00c0000.gpu: error -ENODEV: _opp_set_regulators: no regulator (mali) found > 1002 08:02:56.612809 [ 15.286143] [drm] Initialized lima 1.1.0 20191231 for d00c0000.gpu on minor 1 > 1003 08:02:56.618428 [ 15.292259] meson-drm d0100000.vpu: Queued 2 outputs on vpu > 1004 08:02:56.623866 [ 15.293715] meson-drm d0100000.vpu: bound c883a000.hdmi-tx (ops meson_dw_hdmi_ops [meson_dw_hdmi]) > 1005 08:02:56.634771 [ 15.302140] Unable to handle kernel paging request at virtual address fffffffffffffff8 > > Again, I don't see any recent changes in that driver or the > corresponding dts file. I merged a fix that could be related: https://lore.kernel.org/all/20230109220033.31202-1-m.szyprowski@samsung.com/ This could make the driver to return from probe while not totally probed, and explain such error. Neil > > Arnd