Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp978616ybi; Fri, 24 May 2019 14:53:16 -0700 (PDT) X-Google-Smtp-Source: APXvYqzbWa1Ozb+9GCn6itX02k/RgrT87c4dfXfJq1631QLn2ZMnct3oKpQwqW7D18I6dpUqxNPU X-Received: by 2002:a63:fa4a:: with SMTP id g10mr108188056pgk.147.1558734796758; Fri, 24 May 2019 14:53:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1558734796; cv=none; d=google.com; s=arc-20160816; b=H/oYcwO1aJSJtK8IkvmOimi7cmRKhdQW/7q75rWSHBbZV08Zf2BcKJc5KM60h0Vwzk tSMn8Q4gSZi4Bxd03gz0HjIUUPEzLo1LAy+U404Ri69aXz3A0KJ65XnEXLu5uhG+Rdso I4DPrwm0U6mG44wx7ISuYc0Y8psnfDrIsTSpazi0cjXxr8FIC2Ba6oSTt1I/u9YB9Th+ sn2pTYp4rI2lMphu8EBBzvNQYKez9ldlW7h8oKc/Wk06Gs3hMTY9FxOqa8XYhaClEaKt LaPLHYI4NjPEHrBXdOS13xCbSNTJREeMULL9Kx5dmKFQQS+XRvN1f5tVTsoSe7evRSRx Y+ig== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=/3kgNn6NL4JYVxHLJQOHqTYeAocP9fpuKYr/bgz9xX4=; b=ulbrMHA6VXjbd4Rdzx7coGdPF8K+E051o+Lx2XrOPs26iPq3R4kFQ/BHqe/1ydTbLz ckGsLCqD3hclnEJAFPxYE077KqINLRxQ2UvbK4eQ//mcnYXWcctC4tosKG2bsAtmCp4j RUtPjjTpcns2sNHXJ5voKmbMjeZ0YJNi9KrfL9yJi0Cz/5DGZawpS8mtNZQDJYskVbsi 6VdCb69zADmXlNCJdmOtqfkMNkhv/Rw6hgp0TLB/MaSUyin3w3nagX42BymouiOPToQP THWl0hxhkJT7vdTNBscrvJu40+ZWSxfJYvD+b9EjUQk3S5SJ3A9PrjnDKCGajDxHOAtZ jIRA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=YJg0ERBg; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 76si6176896pfv.233.2019.05.24.14.53.01; Fri, 24 May 2019 14:53:16 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=YJg0ERBg; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2404330AbfEXVvw (ORCPT + 99 others); Fri, 24 May 2019 17:51:52 -0400 Received: from mail-oi1-f193.google.com ([209.85.167.193]:42658 "EHLO mail-oi1-f193.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2404176AbfEXVvw (ORCPT ); Fri, 24 May 2019 17:51:52 -0400 Received: by mail-oi1-f193.google.com with SMTP id w9so8088310oic.9 for ; Fri, 24 May 2019 14:51:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=/3kgNn6NL4JYVxHLJQOHqTYeAocP9fpuKYr/bgz9xX4=; b=YJg0ERBgRmU8/PhbbxteKlV3jmW1XQ/qYJ+UMfFuYI07medwvIBJTkYAoT/cj0AqQc rtCAp7LU6Op9rb/cgu58oSr7RO5Tw0/hEf5swzeTpaHuqwu21WXzG4+4oUlBsdZzrqD6 sd6KD+WCyYPNt3M7BXKCvSTXvLzEf4iRXBqHORZOPGwq+jxtovbyZ8l2aouj6NbbIivr Xes+QPwYHBvG781JN9HvxjYPL24HEOezBclZXNWTAyfF/3VKhNOTHe/7p8FxpN4D02mk B+sDG1AdTOLfXYh/DBCyx9apCh43/Cvap3gknlPVNTpS1ufvCx9q0GPYFxR+FP315aFQ JHHg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=/3kgNn6NL4JYVxHLJQOHqTYeAocP9fpuKYr/bgz9xX4=; b=E18viYuTIGTs3enHGXsM6QNR6zW1uxoBN/M/LS8mO0euatXhm5YlwOjw450htavhev +x2+oJZJNTGDIcEWiF2CyB5aLN2oxuszgQSpac87Y7pWIoWv7IboDty/JiL9mG84X1CG 3XAKJy/dbgqyICpCdKulBPc6Bxfvoh/Z4x4oE5eldUCbP1mAK3cB0mKUgLxBQCIjHfIE qguFnQOJMIXbv45ZDUTg7wwAYFJWhOtG8x2qn+28IOA+wz4YwFG7oj+Kjooo2QGBRu40 ZR0g3cNXZFef64FZhtSa0lqRJwAPoywa7maU1Q/6Qg3PwApEZetNhrlfI7wcUbgTSl7g fpQg== X-Gm-Message-State: APjAAAXt3vTf2eh4J5UvJXqoXDC/vkC1ubIGNRXxZhvJqFq0RECmKxfx 5wPHycdLpcyLAUA026T2hz9YBzEXjodEf6FqdtJziA== X-Received: by 2002:aca:ec0f:: with SMTP id k15mr7306013oih.43.1558734710979; Fri, 24 May 2019 14:51:50 -0700 (PDT) MIME-Version: 1.0 References: <20190524010117.225219-1-saravanak@google.com> In-Reply-To: From: Saravana Kannan Date: Fri, 24 May 2019 14:51:15 -0700 Message-ID: Subject: Re: [PATCH v1 0/5] Solve postboot supplier cleanup and optimize probe ordering To: Rob Herring Cc: Mark Rutland , Greg Kroah-Hartman , "Rafael J. Wysocki" , Frank Rowand , devicetree@vger.kernel.org, "linux-kernel@vger.kernel.org" , Android Kernel Team Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Before I address all the comments, a friendly reminder: Whatever solution we come up with needs to work on a system with loadable modules and shouldn't depend on userspace for correctness. On Fri, May 24, 2019 at 6:04 AM Rob Herring wrote: > > On Thu, May 23, 2019 at 8:01 PM Saravana Kannan wrote: > > > > Add a generic "depends-on" property that allows specifying mandatory > > functional dependencies between devices. Add device-links after the > > devices are created (but before they are probed) by looking at this > > "depends-on" property. > > The DT already has dependency information. A node with 'clocks' > property has its dependency right there. We should use that. We don't > need to duplicate the information. So, are you saying all clock bindings are mandatory for a device for all possible users of DT + Linux? Do you think if we have a patch that makes all clock bindings mandatory dependencies, no one would object to that? > > This property is used instead of existing DT properties that specify > > phandles of other devices (Eg: clocks, pinctrl, regulators, etc). This > > is because not all resources referred to by existing DT properties are > > mandatory functional dependencies. Some devices/drivers might be able > > to operate with reduced functionality when some of the resources > > aren't available. For example, a device could operate in polling mode > > if no IRQ is available, a device could skip doing power management if > > clock or voltage control isn't available and they are left on, etc. > > Yeah, but none of these examples are typically what you'd want to > happen. These cases are a property of the OS, not the DT. For example, > until recently, If you added pinctrl bindings to your DT, the kernel > would no longer boot because it would be looking for pinctrl driver. > That's wrong because the DT should not be coupled to the OS like that. > Adding this property will cause the same problem. Isn't the a perfect example of the pinctrl being an optional dependency in that specific case? The kernel still booted if pinctrl wasn't available? I don't agree that the dependency is purely a property of the OS. If there's no clock to clock the hardware core, then the hardware just can't work. There's no question about that. However, there can be clock bindings that aren't mandatory for functionality but are needed just for performance/power control. Another perfect example are clock providers. Clock providers often get input clocks from multiple other clock providers and even have cyclic clock bindings. But only some of them are mandatory for the clock provider to work. For example, clock provider A has input clocks from clock providers B and C, but it only needs B to function (provides root clock to all clocks). Not having C would only affect 4 (out of 100s of clocks) from clock provider A and those 4 are clocks depend on an input clock from C (basically clock from C going to A to have some clock gates and dividers added and sent back to C). This isn't even a made up scenario -- there are SoCs that actually have this. The OS could still choose to not probe the device unless full functionality is available or it could assume all clocks are left on by the bootloader and provide basic functionality. THAT would be the property of the OS. But that doesn't remove the fact that some of the resources are absolutely mandatory for the hardware to function. I'm proposing the depends-on to capture the true hardware dependency -- not what the SW chooses to do with it. > > So, adding mandatory functional dependency links between devices by > > looking at referred phandles in DT properties won't work as it would > > prevent probing devices that could be probed. By having an explicit > > depends-on property, we can handle these cases correctly. > > > > Having functional dependencies explicitly called out in DT and > > automatically added before the devices are probed, provides the > > following benefits: > > > > - Optimizes device probe order and avoids the useless work of > > attempting probes of devices that will not probe successfully > > (because their suppliers aren't present or haven't probed yet). > > > > For example, in a commonly available mobile SoC, registering just > > one consumer device's driver at an initcall level earlier than the > > supplier device's driver causes 11 failed probe attempts before the > > consumer device probes successfully. This was with a kernel with all > > the drivers statically compiled in. This problem gets a lot worse if > > all the drivers are loaded as modules without direct symbol > > dependencies. > > Do you have data on how much time is spent. Past 'smarter probing' > attempts have not shown a significant difference. "avoids the useless work attempting probes of devices that will not probe successfully" -- I never claimed to save boot up time. Your argument about having to save wall clock time is a moot point as a ton of kernel features that optimize code won't save wall clock time (the CPU would just run faster to make up for the inefficiency). Those features just make the kernel less resource hungry and more efficient. I'd understand your argument if this patch series is insanely complex -- but that's not the case here. > > - Supplier devices like clock providers, regulators providers, etc > > need to keep the resources they provide active and at a particular > > state(s) during boot up even if their current set of consumers don't > > request the resource to be active. This is because the rest of the > > consumers might not have probed yet and turning off the resource > > before all the consumers have probed could lead to a hang or > > undesired user experience. > > We already know generally what devices are dependencies because you > just listed them. Why don't we make the kernel smarter by > instantiating these core devices/drivers first instead of relying on > initcall and link order. That's what this patch series is -- it makes the kernel smarter by just using the data from DT instead of relying on manual tweaking of initcall and link order. > > Some frameworks (Eg: regulator) handle this today by turning off > > "unused" resources at late_initcall_sync and hoping all the devices > > have probed by then. This is not a valid assumption for systems with > > loadable modules. Other frameworks (Eg: clock) just don't handle > > this due to the lack of a clear signal for when they can turn off > > resources. This leads to downstream hacks to handle cases like this > > that can easily be solved in the upstream kernel. > > IMO, we should get rid of this auto disabling. Well, you need to back that opinion with reasoning. IMO we should disable unused resources so that we don't waste power -- especially on devices operating on batteries. Also, I explicitly said "need to keep the resources they provide active and at a particular state(s) during boot up". So it's not even about auto disabling. For example, in the case of a voltage regulator supplying multiple devices, if the first device probes and says it only need the lowest voltage level, you can't just drop the voltage. Because the other devices in the same voltage rail haven't probed yet and you can crash the system if you just drop the voltage. You need to wait for all the devices to be probed and then you can let the voltage regulator operate normally. And you can't depend on late_initcall because it falls apart on systems with modules. -Saravana