References: <20201218031703.3053753-1-saravanak@google.com> <20201218031703.3053753-6-saravanak@google.com> <86db7747ea6d48eebbf40a5855240d14@kernel.org>
From: Saravana Kannan
Date: Tue, 19 Jan 2021 13:50:26 -0800
Subject: Re: [PATCH v1 5/5] driver core: Set fw_devlink=on by default
To: Geert Uytterhoeven
Cc: Marc Zyngier, Greg Kroah-Hartman, "Rafael J. Wysocki", Android Kernel Team, Linux Kernel Mailing List, Jisheng Zhang, Kevin Hilman, John Stultz, Nicolas Saenz Julienne, Yoshihiro Shimoda, Linux-Renesas
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, Jan 19, 2021 at 10:08 AM Saravana Kannan wrote:
>
> On Tue, Jan 19, 2021 at 1:05 AM Geert Uytterhoeven wrote:
> >
> > Hi Saravana,
> >
> > On Mon, Jan 18, 2021 at 10:19 PM Saravana Kannan wrote:
> > > On Mon, Jan 18, 2021 at 11:16 AM Geert Uytterhoeven
> > > wrote:
> > > > On Mon, Jan 18, 2021 at 6:59 PM Marc Zyngier wrote:
> > > > > On 2021-01-18 17:39, Geert Uytterhoeven wrote:
> > > > > > On Fri, Dec 18, 2020 at 4:34 AM Saravana Kannan
> > > > > > wrote:
> > > > > >> Cyclic dependencies in some firmware was one of the last remaining
> > > > > >> reasons fw_devlink=on couldn't be set by default.
> > > > > >> Now that cyclic
> > > > > >> dependencies don't block probing, set fw_devlink=on by default.
> > > > > >>
> > > > > >> Setting fw_devlink=on by default brings a bunch of benefits
> > > > > >> (currently, only for systems with device tree firmware):
> > > > > >> * Significantly cuts down deferred probes.
> > > > > >> * Device probe is effectively attempted in graph order.
> > > > > >> * Makes it much easier to load drivers as modules without having to
> > > > > >>   worry about functional dependencies between modules (depmod is still
> > > > > >>   needed for symbol dependencies).
> > > > > >>
> > > > > >> If this patch prevents some devices from probing, it's very likely due
> > > > > >> to the system having one or more device drivers that "probe"/set up a
> > > > > >> device (DT node with compatible property) without creating a struct
> > > > > >> device for it. If we hit such cases, the device drivers need to be
> > > > > >> fixed so that they populate struct devices and probe them like normal
> > > > > >> device drivers so that the driver core is aware of the devices and
> > > > > >> their status. See [1] for an example of such a case.
> > > > > >>
> > > > > >> [1] -
> > > > > >> https://lore.kernel.org/lkml/CAGETcx9PiX==mLxB9PO8Myyk6u2vhPVwTMsA5NkD-ywH5xhusw@mail.gmail.com/
> > > > > >> Signed-off-by: Saravana Kannan
> > > > > >
> > > > > > Shimoda-san reported that next-20210111 and later fail to boot
> > > > > > on Renesas R-Car Gen3 platforms. No output is seen, unless earlycon
> > > > > > is enabled.
> > > > > >
> > > > > > I have bisected this to commit e590474768f1cc04 ("driver core: Set
> > > > > > fw_devlink=on by default").
> > > > >
> > > > > There is a tentative patch from Saravana here[1], which works around
> > > > > some issues on my RK3399 platform, and it'd be interesting to find
> > > > > out whether that helps on your system.
> > > > >
> > > > > Thanks,
> > > > >
> > > > > M.
> > > > >
> > > > > [1]
> > > > > https://lore.kernel.org/r/20210116011412.3211292-1-saravanak@google.com
> > > >
> > > > Thanks for the suggestion, but given no devices probe (incl. GPIO
> > > > providers), I'm afraid it won't help. [testing] Indeed.
> > > >
> > > > With the debug prints in device_links_check_suppliers enabled, and
> > > > some postprocessing, I get:
> > > >
> > > >     255 supplier e6180000.system-controller not ready
> > > >       9 supplier fe990000.iommu not ready
> > > >       9 supplier fe980000.iommu not ready
> > > >       6 supplier febd0000.iommu not ready
> > > >       6 supplier ec670000.iommu not ready
> > > >       3 supplier febe0000.iommu not ready
> > > >       3 supplier e7740000.iommu not ready
> > > >       3 supplier e6740000.iommu not ready
> > > >       3 supplier e65ee000.usb-phy not ready
> > > >       3 supplier e6570000.iommu not ready
> > > >       3 supplier e6054000.gpio not ready
> > > >       3 supplier e6053000.gpio not ready
> > > >
> > > > As everything is part of a PM Domain, the (lack of the) system controller
> > > > must be the culprit. What's wrong with it? It is registered very early in
> > > > the boot:
> > > >
> > > > [    0.142096] rcar_sysc_pd_init:442: of_genpd_add_provider_onecell() returned 0
> >
> > > Looks like you found the important logs. Can you please enable all
> > > these logs and send the early con logs as an attachment (so I don't
> > > need to deal with lines getting wrapped)?
> > > 1. The ones in device_links_check_suppliers()
> > > 2. The ones in device_link_add()
> > > 3. initcall_debug=1
> >
> > I have attached[*] the requested log.
> >
> > > That should help us figure out what's going on. Also, what's the DT
> > > that corresponds to one of the boards that see this issue?
> >
> > arch/arm64/boot/dts/renesas/r8a77951-salvator-xs.dts
> >
> > > Lastly, can you please pick up these 3 patches (some need clean up
> > > before they merge) to make sure it's not an issue being worked on from
> > > other bug reports?
> > > https://lore.kernel.org/lkml/20210116011412.3211292-1-saravanak@google.com/
> > > https://lore.kernel.org/lkml/20210115210159.3090203-1-saravanak@google.com/
> > > https://lore.kernel.org/lkml/20201218210750.3455872-1-saravanak@google.com/
> > >
> > > I have a strong hunch the 2nd one will fix your issues. fw_devlink can
> > > handle cyclic dependencies now (it basically reverts to
> > > fw_devlink=permissive mode for devices in the cycle), but it needs to
> > > "see" all the dependencies to know there's a cycle. So want to make
> > > sure it "sees" the "gpios" binding used all over some of the Renesas
> > > DT files.
> >
> > These patches don't help.
> > The 2nd one actually introduces a new failure:
> >
> >     OF: /soc/i2c@e66d8000/gpio@20/pcie-sata-switch-hog: could not get
> >     #gpio-cells for /cpus/cpu@102
> >
> > Note that my issues don't seem to be GPIO-related at all.
> >
> > BTW, you are aware IOMMUs and DMA controllers are optional?
> > I.e. device drivers with iommus and/or dmas DT properties where the
> > targets of these properties do not have a driver should still be probed,
> > eventually. But if the IOMMU or DMA drivers are present, they should be
> > probed first, so the device drivers can make use of them.
>
> Thanks for the logs and details.
>
> Yeah, this is going to be a problem then. How is this handled in
> static kernels today? Do we just try to make sure the iommus driver
> probes the iommu device before the consumers? And then the consumers
> simply don't defer probe on failure to get iommu?
>
> I can make this work if modules are not enabled (needs some code
> changes), but it's not going to work when there are modules. There's
> no way to tell if an iommu module won't be loaded soon. Also, device
> links doing this behavior only for iommu/dma is probably not a good
> idea. So, whatever we do will have to be common behavior.
> :(
>
> Another intermediate option I was thinking was having a
> CONFIG_FW_DEVLINK_OFF/PERMISSIVE/ON and defaulting it to ON for ARM64
> and turning it off in the defconfig for boards for which this doesn't
> work. That way, we can incrementally enable fw_devlink.
>
> This week is a very hectic week for me. So, please bear with slow
> responses from me for rest of this week. Let me think about this a bit
> to see if I can come up with a better solution than what I have in
> mind.
>
> Also, can you try deleting "iommu" and "dma" parsing in
> of_supplier_bindings[] in driver/of/property.c and see if it helps?
> Then we'd know this is the reason for things not working in your case.

Hi Geert,

I took a look at your logs. It looks like your guess is right. It's at
least one of the issues.

You'll need to convert drivers/soc/renesas/rcar-sysc.c into a platform
driver. You already have a platform device created for it. So just go
ahead and probe it with a platform driver. See what Marek did here [1].

You probably had to implement it as an "initcall based driver" because
you had to play initcall chicken to make sure the PD hardware was
initialized before the consumers. With fw_devlink=on you won't have to
worry about that.

As an added benefit of implementing a proper platform driver, you can
actually implement runtime PM now, your suspend/resume would be more
robust, etc.

[1] - https://lore.kernel.org/lkml/20210113110320.13149-1-m.szyprowski@samsung.com/

-Saravana

>
> > Thanks!
> >
> > [*] Although attaching means people like myself cannot read and comment
> > on the log easily, without saving the attachment first.
> > That's also the reason why patches should be submitted inline...
>
> Yeah, I see your concern. If you want to add comments to logs when
> sending them, yeah, please go ahead and put it inline. Or if someone
> wants to add comments to what you attached, they could copy paste the
> relevant sections and add comments.
>
> Thanks,
> Saravana
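
For readers following the thread, here is a rough user-space sketch of why a supplier that never binds to a driver can hold back every consumer, as described above for the system controller. It is a toy model, not kernel code: `ToyDevice`, `probe_all` and `build` are hypothetical names, the rule "defer until every supplier is bound" is a simplification of what fw_devlink=on enforces, and the dependency edges between the three devices are assumed for illustration (only the device names come from the log tally quoted in the thread).

```python
from dataclasses import dataclass, field


@dataclass
class ToyDevice:
    """Toy stand-in for a DT device; not the kernel's struct device."""
    name: str
    has_driver: bool = True      # a driver exists that can bind to this device
    suppliers: list = field(default_factory=list)
    bound: bool = False          # set once the simulated probe has succeeded


def probe_all(devices):
    """Run probe passes until nothing new binds; return names left unbound."""
    progress = True
    while progress:
        progress = False
        for dev in devices:
            if dev.bound or not dev.has_driver:
                continue
            # Simplified rule: defer the consumer until every supplier is bound.
            if all(s.bound for s in dev.suppliers):
                dev.bound = True
                progress = True
    return [d.name for d in devices if not d.bound]


def build(sysc_has_driver):
    """Build a three-device graph; the edges are assumptions for illustration."""
    sysc = ToyDevice("e6180000.system-controller", has_driver=sysc_has_driver)
    gpio = ToyDevice("e6053000.gpio", suppliers=[sysc])
    phy = ToyDevice("e65ee000.usb-phy", suppliers=[sysc, gpio])
    # Deliberately listed consumers-first to show that order does not matter.
    return [phy, gpio, sysc]


if __name__ == "__main__":
    print("initcall-only sysc, left unbound:", probe_all(build(False)))
    print("platform-driver sysc, left unbound:", probe_all(build(True)))
```

With `build(False)` the system controller is set up outside the driver model, so in this model it never counts as bound and everything downstream stays deferred. With `build(True)` it binds first and the consumers follow in dependency order, whatever order they were registered in.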