Received: by 2002:a05:7412:251c:b0:e2:908c:2ebd with SMTP id w28csp2806914rda; Wed, 25 Oct 2023 12:54:24 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEabup6K172I5X8BOCNlXb4eeUH/zvW0FCB+ATaPwLlXm8uL4hHeP3lhacYo9MNgt6+C+G4 X-Received: by 2002:a25:b85:0:b0:d9a:ba4b:44ab with SMTP id 127-20020a250b85000000b00d9aba4b44abmr14976227ybl.61.1698263663702; Wed, 25 Oct 2023 12:54:23 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1698263663; cv=none; d=google.com; s=arc-20160816; b=pjr4/+yFHB2vYzqaVr2B0KIjjgk7B8B3Zxuv0rEzaQPOc8I29g3TOrdzkB3cVOrVhU kEuv80L00z4dt79oJRL+dEKsZIbWTBKRXinbQopsmzXtHakVFF9RVokx3bexAEbG+FAe xoBHm/vg/9HbXChCQx5Azqg/feaj6xf5ZOQvvSEJdh4sqnXuaLUjQ6M7Yb1yKFNHEeCB LcwzVKi2CuD/MqWggcpkMDqh8/ccvtARFtorl1L8795GHDIf6F1IwX4qCkzzwNGxP8mr gBmvSToKkFS969W/cjjKhCLXphuIU2BskDlwu6Ddh+pdLSnHFl9x/1Bp+Eb7ZSaKSWWf c77Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version; bh=joXCcwcOg2a/mAZq6O1MXXLpBFDXwWdDmgfsLWOEBlI=; fh=cpSecrw2yxdN4H5g/ibL4pPsoTX+KDiLaqdCkkDro/o=; b=YPCZFn1MJms2+DP/vdeZv8qXciTE88bZs4i9xQaZVakbshugnF5j67xNJTHZBuPhSq 8mK53TcxtxGPB9fGYeNJXdJVbhpDobZcLPlNboUxSNJSV5sr/U7d27oh6tc5U8UEW57W tLWxo++9hT+hKkt+3lDXqs7N+to4G4/uE9MhP7Jli/cUY1kCF000gF8gjwyE7JcQ1+Sv m/KczyJXA+/5XFCkR+NgCcxvRmFcJFhG5mzFzO27xjDUngtAZ5gjVtsTYb7mo2wdCVJ3 O/vddMxOwWgzu7Q7PFOO0jcoYMA2joXdYw+AYm1soxQaIOTMCXCm9jL0kUQt5TYJAHph yWLw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from lipwig.vger.email (lipwig.vger.email. [2620:137:e000::3:3]) by mx.google.com with ESMTPS id t9-20020a25f609000000b00d81456a151asi12054136ybd.626.2023.10.25.12.54.23 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 25 Oct 2023 12:54:23 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) client-ip=2620:137:e000::3:3; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by lipwig.vger.email (Postfix) with ESMTP id 56661817C1F3; Wed, 25 Oct 2023 12:54:20 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at lipwig.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234772AbjJYTx4 convert rfc822-to-8bit (ORCPT + 99 others); Wed, 25 Oct 2023 15:53:56 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:57858 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234337AbjJYTxy (ORCPT ); Wed, 25 Oct 2023 15:53:54 -0400 Received: from mail-yw1-f181.google.com (mail-yw1-f181.google.com [209.85.128.181]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D5A53C4; Wed, 25 Oct 2023 12:53:51 -0700 (PDT) Received: by mail-yw1-f181.google.com with SMTP id 00721157ae682-5a7c011e113so928687b3.1; Wed, 25 Oct 2023 12:53:51 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1698263631; x=1698868431; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=Xa8+C/PTH21PNEpO4wV0awBQL0qs+5FqYNr/HCsqJnA=; b=L+OuTd/D6cFOtexBBCj0LrhEum7/0nBnRvPGxr4V1IHO+2iZJe5lA8Ggl/aKoay9Oj 0Y97E+m7oaWkhWhQUOvCipIb9ZLogvVgRp5+dDBD5k6CZ8kiSY56iXZeVp8r6yPH683/ olC+GeByX1iC8hRLbNM/7u6psh+NL3pYSD73Lu6smge3U+00IaiFHwpcoAkmUgqbpJSo cYCuq9Q17lZiUrZAUfU4Hpnb+jq9ESxv6RA/1fJyeaL7xLAk+Mbs7JBcODhLxGgNspnq Cz547iRqbYjf5QARC20MtCJ/OQGW7jJCboY6KVRjcbaFagzsRFc8hgeJke223FcT7U5d PpyA== X-Gm-Message-State: AOJu0YwwsOXnVvHANuGxul3k0rgptPVHrM5yV+bIMjgiZPd0E2sqopcp lubGo8Nx1+nwVdanKTbzqs49jqfr9Qjb0w== X-Received: by 2002:a25:d38d:0:b0:d9c:2a9c:3f4f with SMTP id e135-20020a25d38d000000b00d9c2a9c3f4fmr17010723ybf.62.1698263630704; Wed, 25 Oct 2023 12:53:50 -0700 (PDT) Received: from mail-yb1-f181.google.com (mail-yb1-f181.google.com. [209.85.219.181]) by smtp.gmail.com with ESMTPSA id f132-20020a25518a000000b00d9a36ded1besm4707536ybb.6.2023.10.25.12.53.49 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 25 Oct 2023 12:53:49 -0700 (PDT) Received: by mail-yb1-f181.google.com with SMTP id 3f1490d57ef6-d857c8a1d50so74214276.3; Wed, 25 Oct 2023 12:53:49 -0700 (PDT) X-Received: by 2002:a5b:c42:0:b0:da0:48df:cafa with SMTP id d2-20020a5b0c42000000b00da048dfcafamr5279836ybr.16.1698263629623; Wed, 25 Oct 2023 12:53:49 -0700 (PDT) MIME-Version: 1.0 References: <20231009130126.697995596@linuxfoundation.org> <2023101057-runny-pellet-8952@gregkh> <7d7a5a15-3349-adce-02cd-82b6cb4bebde@roeck-us.net> In-Reply-To: <7d7a5a15-3349-adce-02cd-82b6cb4bebde@roeck-us.net> From: Geert Uytterhoeven Date: Wed, 25 Oct 2023 21:53:37 +0200 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: renesas_sdhi problems in 5.10-stable was Re: [PATCH 5.10 000/226] 5.10.198-rc1 review To: Guenter Roeck Cc: Pavel Machek , Wolfram Sang , Ulf Hansson , Greg Kroah-Hartman , niklas.soderlund+renesas@ragnatech.se, yoshihiro.shimoda.uh@renesas.com, biju.das.jz@bp.renesas.com, Chris.Paterson2@renesas.com, stable@vger.kernel.org, patches@lists.linux.dev, linux-kernel@vger.kernel.org, torvalds@linux-foundation.org, akpm@linux-foundation.org, shuah@kernel.org, patches@kernelci.org, lkft-triage@lists.linaro.org, jonathanh@nvidia.com, f.fainelli@gmail.com, sudipm.mukherjee@gmail.com, srw@sladewatkins.net, rwarsow@gmx.de, conor@kernel.org, linux-reneas-soc@vger.kernel.org, Linux MMC List Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8BIT X-Spam-Status: No, score=-0.8 required=5.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lipwig.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (lipwig.vger.email [0.0.0.0]); Wed, 25 Oct 2023 12:54:20 -0700 (PDT) Hi Günter, On Wed, Oct 25, 2023 at 8:39 PM Guenter Roeck wrote: > On 10/25/23 10:05, Geert Uytterhoeven wrote: > > On Wed, Oct 25, 2023 at 2:35 PM Geert Uytterhoeven wrote: > >> On Wed, Oct 25, 2023 at 12:53 PM Geert Uytterhoeven > >> wrote: > >>> On Wed, Oct 25, 2023 at 12:47 PM Geert Uytterhoeven > >>> wrote: > >>>> On Tue, Oct 24, 2023 at 9:22 PM Pavel Machek wrote: > >>>>> But we still have failures on Renesas with 5.10.199-rc2: > >>>>> > >>>>> https://gitlab.com/cip-project/cip-testing/linux-stable-rc-ci/-/pipelines/1047368849 > >>>>> > >>>>> And they still happed during MMC init: > >>>>> > >>>>> 2.638013] renesas_sdhi_internal_dmac ee100000.mmc: Got CD GPIO > >>>>> [ 2.638846] INFO: trying to register non-static key. > >>>>> [ 2.644192] ledtrig-cpu: registered to indicate activity on CPUs > >>>>> [ 2.649066] The code is fine but needs lockdep annotation, or maybe > >>>>> [ 2.649069] you didn't initialize this object before use? > >>>>> [ 2.649071] turning off the locking correctness validator. > >>>>> [ 2.649080] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 5.10.199-rc2-arm64-renesas-ge31b6513c43d #1 > >>>>> [ 2.649082] Hardware name: HopeRun HiHope RZ/G2M with sub board (DT) > >>>>> [ 2.649086] Call trace: > >>>>> [ 2.655106] SMCCC: SOC_ID: ARCH_SOC_ID not implemented, skipping .... > >>>>> [ 2.661354] dump_backtrace+0x0/0x194 > >>>>> [ 2.661361] show_stack+0x14/0x20 > >>>>> [ 2.667430] usbcore: registered new interface driver usbhid > >>>>> [ 2.672230] dump_stack+0xe8/0x130 > >>>>> [ 2.672238] register_lock_class+0x480/0x514 > >>>>> [ 2.672244] __lock_acquire+0x74/0x20ec > >>>>> [ 2.681113] usbhid: USB HID core driver > >>>>> [ 2.687450] lock_acquire+0x218/0x350 > >>>>> [ 2.687456] _raw_spin_lock+0x58/0x80 > >>>>> [ 2.687464] tmio_mmc_irq+0x410/0x9ac > >>>>> [ 2.688556] renesas_sdhi_internal_dmac ee160000.mmc: mmc0 base at 0x00000000ee160000, max clock rate 200 MHz > >>>>> [ 2.744936] __handle_irq_event_percpu+0xbc/0x340 > >>>>> [ 2.749635] handle_irq_event+0x60/0x100 > >>>>> [ 2.753553] handle_fasteoi_irq+0xa0/0x1ec > >>>>> [ 2.757644] __handle_domain_irq+0x7c/0xdc > >>>>> [ 2.761736] efi_header_end+0x4c/0xd0 > >>>>> [ 2.765393] el1_irq+0xcc/0x180 > >>>>> [ 2.768530] arch_cpu_idle+0x14/0x2c > >>>>> [ 2.772100] default_idle_call+0x58/0xe4 > >>>>> [ 2.776019] do_idle+0x244/0x2c0 > >>>>> [ 2.779242] cpu_startup_entry+0x20/0x6c > >>>>> [ 2.783160] rest_init+0x164/0x28c > >>>>> [ 2.786561] arch_call_rest_init+0xc/0x14 > >>>>> [ 2.790565] start_kernel+0x4c4/0x4f8 > >>>>> [ 2.794233] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000014 > >>>>> [ 2.803011] Mem abort info: > >>>>> > >>>>> from https://lava.ciplatform.org/scheduler/job/1025535 > >>>>> from > >>>>> https://gitlab.com/cip-project/cip-testing/linux-stable-rc-ci/-/jobs/5360973735 . > >>>>> > >>>>> Is there something else missing? > >>>> > >>>> I don't have a HopeRun HiHope RZ/G2M, but both v5.10.198 and v5.10.199 > >>>> seem to work fine on Salvator-XS with R-Car H3 ES2.0 and Salvator-X > >>>> with R-Car M3-W ES1.0, using a config based on latest renesas_defconfig. > >>> > >>> Sorry, I looked at the wrong log on R-Car M3-W. > >>> I do see the issue with v5.10.198, but not with v5.10.199. > >> > >> It seems to be an intermittent issue. Investigating... > > > > After spending too much time on bisecting, the bad guy turns out to > > be commit 6d3745bbc3341d3b ("mmc: renesas_sdhi: register irqs before > > registering controller") in v5.10.198. > > > > Adding debug information shows the lock is mmc_host.lock. > > > > It is definitely initialized: > > > > renesas_sdhi_probe() > > { > > ... > > tmio_mmc_host_alloc() > > mmc_alloc_host > > spin_lock_init(&host->lock); > > ... > > devm_request_irq() > > -> tmio_mmc_irq > > tmio_mmc_cmd_irq() > > spin_lock(&host->lock); > > ... > > } > > > > That leaves us with a missing lockdep annotation? > > Is it possible that the lock initialization is overwritten ? > I seem to recall a recent case where this happens. > > Also, there is > spin_lock_init(&_host->lock); > in tmio_mmc_host_probe(), and tmio_mmc_host_probe() is called after > devm_request_irq(). Unless I am missing something, that is initializing tmio_mmc_host.lock, which is a different lock than mmc_host.lock? > Also, how would lockdep annotation help with "Unable to handle > kernel NULL pointer dereference at virtual address 0000000000000014" > in the log above ? For the log from v5.10.198-rc1-g18c65c1b4996, that happened because it lacked commit 1e3d016a95067ab3 ("mmc: renesas_sdhi: only reset SCC when its pointer is populated"), according to earlier messages in this thread. For the NULL pointer dereference in 5.10.199-rc2, I'm not sure. I didn't see that on R-Car M3-W... According to my logs, I never saw this lockdep issue in MMC on mainline before, so it's a bit hard to guess what's missing... Gr{oetje,eeting}s, Geert -- Geert Uytterhoeven -- There's lots of Linux beyond ia32 -- geert@linux-m68k.org In personal conversations with technical people, I call myself a hacker. But when I'm talking to journalists I just say "programmer" or something like that. -- Linus Torvalds