Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp1017601ybb; Wed, 1 Apr 2020 14:04:01 -0700 (PDT) X-Google-Smtp-Source: APiQypLwTxynjtZewt0r7UHYEJ5tQdGdsIRthN1AplIjtb55YsO5rR1PZvA1+ukfg7G8DvQSc2XZ X-Received: by 2002:aca:c488:: with SMTP id u130mr239896oif.154.1585775041622; Wed, 01 Apr 2020 14:04:01 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1585775041; cv=none; d=google.com; s=arc-20160816; b=Z3QFQ72amqEyRdl9QZfnQHBkRYuTJd0uoMkNBYA4O5KFUVJYSLCkY3iPqp9QrF7KI/ FwMi8IMQpoDVCoUTXXYBTbWg56nVzQVfMbHmqIClWSvO4KZCGfU22e1Rbrq9a0w0wqZE H2HzoESWDv1xLA9vABKanzBTyXfVPT+/ptnCjgim1+4vC74NNOjKrFrCmIbkgwijxlfK HPEcECFyqysslUFkbq4JaAd43jHl62Zks2LM5G/CuztRYKk9vIsuKRGDqlFlXl1Ay7O6 s7l1oMs9MDcEa7PLgRRVgTA+hNn6BsZ9bSgNBb2uqPGbVEMjOKJcrETDnyj3MastAr0s 4ecg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=kZZLw38OvfX0LdWZiTiDgV/9bUny0StnlwiaNs/2Ufc=; b=sn9PzyTXYqaJ545/RnhZKjgHSmM+RJLz6bKp6ZIY84Hs7Q3WegZuL/C6+BEThGEUoI w/JUdsOzMHrq2Qx1mRaNespm7pRLyDoOO0UF9V2hCm9TLIAZGl75JpItwBK8rtuBGfvK MGSIH++1Z1NO2CCN/C9Wq/+c+IHM2tHkYrEKyrK0WYHKpZHRUkq4I0f97x9n8aounRGZ TpYB86q+5cR0mRW8+5NmMMQzdHYBTUG5fWk5rtdxXe+9yaBbkVjXhZn2Qgx8vCVPg3JY Yy/CQPWEUlwdd6alZAb5U2EUmPC8JKrx02gzgoKS59jsrhMAXBntl79xEWmHW4Am2Qfc uJgQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@sifive.com header.s=google header.b=aly7ulvt; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u64si1376384oig.238.2020.04.01.14.03.47; Wed, 01 Apr 2020 14:04:01 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@sifive.com header.s=google header.b=aly7ulvt; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732742AbgDAVBx (ORCPT + 99 others); Wed, 1 Apr 2020 17:01:53 -0400 Received: from mail-lj1-f193.google.com ([209.85.208.193]:42123 "EHLO mail-lj1-f193.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1732441AbgDAVBx (ORCPT ); Wed, 1 Apr 2020 17:01:53 -0400 Received: by mail-lj1-f193.google.com with SMTP id q19so944768ljp.9 for ; Wed, 01 Apr 2020 14:01:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sifive.com; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=kZZLw38OvfX0LdWZiTiDgV/9bUny0StnlwiaNs/2Ufc=; b=aly7ulvt+vfPUTikm4mltTkCbIZstebjkVao345asWRzgWwNr8HXzV9jvOV7qGfvlv UShoBmXhYLIDKE25KPNXRT3UYhD9ms32o+X2UsiGs7IVWDcUlmAveM5vTn3hox+wRzrj +6g3z+A2EulG+uY85yfZmgAuVmP8aMoXv7Av3iExS/HEjdAxDOu8N03VSkGIV2DsrPPN /aPw2Y6KLq1biHJFmaxinaPrVzFEBYkVcvRsTSg6B1skCt92JIZrn8HBKG3aj40G1f+g XMRw/JUp5To7tjRdK/WcREvMxKWN5WVPrYT53f+O6xhXut+ZBK8KmQCrycPFZ4Da0X+Q FAzQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=kZZLw38OvfX0LdWZiTiDgV/9bUny0StnlwiaNs/2Ufc=; b=P/Fg+feY+6XIrddpLCF4tGmRe9tJz6+TibS0HNNBiOLWyUu+GoN67MKpf1TSCPD5LB W7YojX6AaYoYcmzbzLxYgH4wG6GIm2kRHwuu9zCxr9FdAP6owR0t/GGTk96a9KsWLJ9a Yfo1Nft/d70vQV+VLoF7pyYUyqvSwmw/726eNsFCRQZEgSOS8E8qpr3NYYFGkVFAVThB A8pdVsGBfGhLpnqDSFP12Xysf4NL/025cvGralOj5pBslmDk0rvA/svfgxkjhknQl3z6 FZ1mHz0REs+s0z3aKQLSNEczSOJzXkJh3grV3km35uzR4DURersPfbVnqwEvWpmCaHsG Y/SA== X-Gm-Message-State: AGi0PuboQ94sl7dtBeR8w0rwjldVTnKH/U40i5Z+Pid5E8MLD+k914OP VCoOHa0NS8KeSGz1pK4kW0ZbYBUVScUUZgpAHV+6RA== X-Received: by 2002:a2e:8410:: with SMTP id z16mr75114ljg.197.1585774907722; Wed, 01 Apr 2020 14:01:47 -0700 (PDT) MIME-Version: 1.0 References: <20200401202937.GA130497@google.com> In-Reply-To: <20200401202937.GA130497@google.com> From: Alan Mikhak Date: Wed, 1 Apr 2020 14:01:36 -0700 Message-ID: Subject: Re: [PATCH] PCI: Warn about MEM resource size being too big To: Bjorn Helgaas Cc: linux-kernel@vger.kernel.org, linux-pci , Jingoo Han , Gustavo Pimentel , lorenzo.pieralisi@arm.com, amurray@thegoodpenguin.co.uk, Kishon Vijay Abraham I , Paul Walmsley Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Apr 1, 2020 at 1:29 PM Bjorn Helgaas wrote: > > On Tue, Mar 31, 2020 at 01:36:04PM -0700, Alan Mikhak wrote: > > On Tue, Mar 31, 2020 at 1:12 PM Bjorn Helgaas wrote: > > > On Mon, Mar 30, 2020 at 05:19:47PM -0700, Alan Mikhak wrote: > > > > Output a warning for MEM resource size with > > > > non-zero upper 32-bits. > > > > > > > > ATU programming functions limit the size of > > > > the translated region to 4GB by using a u32 size > > > > parameter. Function dw_pcie_prog_outbound_atu() > > > > does not program the upper 32-bit ATU limit > > > > register. This may result in undefined behavior > > > > for resource sizes with non-zero upper 32-bits. > > > > > > > > For example, a 128GB address space starting at > > > > physical CPU address of 0x2000000000 with size of > > > > 0x2000000000 needs the following values programmed > > > > into the lower and upper 32-bit limit registers: > > > > 0x3fffffff in the upper 32-bit limit register > > > > 0xffffffff in the lower 32-bit limit register > > > > > > > > Currently, only the lower 32-bit limit register is > > > > programmed with a value of 0xffffffff but the upper > > > > 32-bit limit register is not being programmed. > > > > As a result, the upper 32-bit limit register remains > > > > at its default value after reset of 0x0. This would > > > > be a problem for a 128GB PCIe space because in > > > > effect its size gets reduced to 4GB. > > > > > > > > ATU programming functions can be changed to > > > > specify a u64 size parameter for the translated > > > > region. Along with this change, the internal > > > > calculation of the limit address, the address of > > > > the last byte in the translated region, needs to > > > > change such that both the lower 32-bit and upper > > > > 32-bit limit registers can be programmed correctly. > > > > > > > > Changing the ATU programming functions is high > > > > impact. Without change, this issue can go > > > > unnoticed. A warning may prompt the user to > > > > look into possible issues. > > > > > > So this is basically a warning, and we could actually *fix* the > > > problem with more effort? I vote for the fix. > > > > The fix would impact all PCIe drivers that depend on dwc. > > Is that another way of saying "the fix would *fix* all the drivers > that depend on dwc"? Thanks Bjorn for your comments. Not at all. I'm not suggesting that. I'm just stating the dilemma. One option is, as you may be alluding, the *fix* would include modification to all drivers that depend on dwc to at least not break the build.. Whoever embarks on such a *fix* would have to take that on before submitting the patch. Another option is to produce an alternate ATU programming API for the Linux PCI sub-system to support u64 size. That way individual driver owners can choose if and when to migrate their drivers to the new API on their own timeline. Such an alternative API can also be generic to support not only Designware PCIe controllers but others. > > > I would have no way of validating such a fix without > > breaking it for everyone let alone the bandwidth it needs. > > All drivers that depend on dwc seem to be currently happy > > with the u32 size limit. I suggest we add the warning but > > keep this issue in mind for a solution that allows existing > > PCe drivers to phase into the fix on their own individual > > schedules, if they need to. > > Obviously it would *nice* to test all the drivers that depend on dwc, > but if you're fixing a problem, you verify the fix on your system, and > the relevant people review it, I don't think exhaustive testing is a > hard requirement, and I certainly wouldn't expect you to do it. That is a relief for whoever commits to take this on. > > If we want to live with a 32-bit limit, I think we should change the > relevant interfaces to use u32 so there's not a way to venture into > this region of undefined behavior. I don't think "warning + undefined > behavior" is a very maintainable situation. I cannot live with the 32-bit limit. I need a 64-bit solution. I had to implement a solution that suits my needs. I have worked out some of the issue. It is generic enough for my use with PCIe controllers from more than one vendor. But, it requires pulling a lot of code from Designware layer into a separate framework which I believe can become common for Linux PCI subsystem. If it gets in, others who need 64-bit can migrate over to it without being migrated involuntarily. > > > > > This limitation also means that multiple ATUs > > > > would need to be used to map larger regions. > > > > > > > > Signed-off-by: Alan Mikhak > > > > --- > > > > drivers/pci/controller/dwc/pcie-designware-host.c | 6 +++++- > > > > 1 file changed, 5 insertions(+), 1 deletion(-) > > > > > > > > diff --git a/drivers/pci/controller/dwc/pcie-designware-host.c b/drivers/pci/controller/dwc/pcie-designware-host.c > > > > index 395feb8ca051..37a8c71ef89a 100644 > > > > --- a/drivers/pci/controller/dwc/pcie-designware-host.c > > > > +++ b/drivers/pci/controller/dwc/pcie-designware-host.c > > > > @@ -325,6 +325,7 @@ int dw_pcie_host_init(struct pcie_port *pp) > > > > struct pci_bus *child; > > > > struct pci_host_bridge *bridge; > > > > struct resource *cfg_res; > > > > + resource_size_t mem_size; > > > > u32 hdr_type; > > > > int ret; > > > > > > > > @@ -362,7 +363,10 @@ int dw_pcie_host_init(struct pcie_port *pp) > > > > case IORESOURCE_MEM: > > > > pp->mem = win->res; > > > > pp->mem->name = "MEM"; > > > > - pp->mem_size = resource_size(pp->mem); > > > > + mem_size = resource_size(pp->mem); > > > > + if (upper_32_bits(mem_size)) > > > > + dev_warn(dev, "MEM resource size too big\n"); > > > > + pp->mem_size = mem_size; > > > > pp->mem_bus_addr = pp->mem->start - win->offset; > > > > break; > > > > case 0: > > > > -- > > > > 2.7.4 > > > >