Received: by 2002:a17:90a:202a:0:0:0:0 with SMTP id n39csp3553640pjc; Sun, 2 Jun 2019 13:01:51 -0700 (PDT) X-Google-Smtp-Source: APXvYqwsiMk+Eo0OIvpMeVQ8YitFm8uLJxuHhnzDdXs8QKFJuMUTdukS9/6Tgu5jnK+wdd5FY4Pb X-Received: by 2002:a63:1f04:: with SMTP id f4mr24542389pgf.423.1559505710895; Sun, 02 Jun 2019 13:01:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1559505710; cv=none; d=google.com; s=arc-20160816; b=rjATzi8OAZDuJcnNECycKg2Q6RUMz6bd4HbzWQ1gtOhAvHh7nKb5uBkVu+p+++40Fk 0TYiy/7O72lXofiYi31HqMKigqYO7GLhN7Nh5gUxrDxG4wv5vRl06KRga3m5x11j0RZU 7ccdQm1ajJ4d6N+okGXcR94few+Nh2EOReFYGpD0VaUgrhSdhgLGQ8fMr9K0eJsPdUlr dmUju92EtTEYcX0N8LjYKQ/fezoyR6E1cco2Wyr5Di2Moz63Jf4O244LqYSMUQFkacpq 8EK8vikyefqVe7JVbI4BFe+cxeuDQgWVaL5SF0+D2eufD9UNpW8Zh07/qTFzKSZ2rqUB B+Rw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=aaBnd3Mz67/mbpVkdYT/2BN5oFrupn5X4M/t44T3/Vg=; b=tft82Zutw2Lz6zTQK7iSX9XhObZsD25M3hLmPYVhO/bD1Twi/FDeFG/uqctIG0Vrsm H6nnYXPREjNrn2YVcWxgLu7kyxq4G9bX0VZaosuq87obm+lgyAARWLADUvFV7nKACzht JAagUW39n3SBJW+f6QI2XcO10Af/rnw71W88STNYQ4O4EdbqyVTUp2jLC8BClF/qVTx9 sDCTp7I4eUe/rKGMgU578I3JDSWrJarG5VNZ/aaA0iTE/dgun7WoXQbZzlDewse5qMZ8 dg2REyINp5PyEdz+iR0EUkDs9mIMU7na0fpnawCoTi0maIn5mKH4vcYwipENYcR8afBe 4f3A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=lmYArfef; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j14si14879937pll.251.2019.06.02.12.56.14; Sun, 02 Jun 2019 13:01:50 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=lmYArfef; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726779AbfFBTIw (ORCPT + 99 others); Sun, 2 Jun 2019 15:08:52 -0400 Received: from mail-ed1-f67.google.com ([209.85.208.67]:44595 "EHLO mail-ed1-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726270AbfFBTIv (ORCPT ); Sun, 2 Jun 2019 15:08:51 -0400 Received: by mail-ed1-f67.google.com with SMTP id b8so23552493edm.11; Sun, 02 Jun 2019 12:08:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=aaBnd3Mz67/mbpVkdYT/2BN5oFrupn5X4M/t44T3/Vg=; b=lmYArfefuVc8Zcozg9TjAwVGWcJeWBwFEZt1Fu7iVxDfd9QlQfGvf15pyROvXVtkbr vtxL8nTDUZcILwufR1Qw3fo+0aTowXMqdZfhd+B3abRCeajFApVy4xbvk2hfVrghQN48 4wy3JdFRXmOPBtV3UpZU5ZXNsfo2zom26TRBTDg6sy94Cfml/InFLs+K+d4h1rXh6kf0 vQi1gkIbm40CGUhO/yEx3gCYDEQUctT2tJcVlErKU+PHqnthauTeWbxsuL4ns7+y1KaL 1uMjCn7d1c5il2JAu8NTBVgdgWvS5Rorn/p6TUEPPSWg3stj+lKrKpzfqaKXA7wdobEV fjsQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=aaBnd3Mz67/mbpVkdYT/2BN5oFrupn5X4M/t44T3/Vg=; b=iA0lx+tYJR0dy7O7XH103xaznyBaGa2jP6zSO3e2AZnvX4FoQSQshSO34tCSi+LMu8 hNcQ5al2Y8VN8ARSJMYriBkP2wg+L7kIc0dIoCcJV4i1qTsrikaAEOFVAtdChMrL5CQV V51VObsxle30UmUIqAAQNxQ8sphZThOXnXZL3nkMT4DUPOvIVHN4Iwehoc9ZJqmOfRMH tEcseBfmsKQ/hu5urtk2uFoYzdB0rFg0KAvAqPzli7NhzMcdTuIqxHB+mW/OgUDN8sdi nCa0Ah8rppiN01sTfy8MaJDM4EZRltjM1DpTQtE2LW32lJ4QLIANw4VssUqZgf+Wuyh1 pcOA== X-Gm-Message-State: APjAAAWfZgHFBnWW7i1XRSm9zz0ZoJXsJrLg0r9EIEFkzCcicd2P4nD8 zk1NvYFQePIiTdZFybOoWM7h5883v/sJ2pKh82Ww5gMT X-Received: by 2002:aa7:cdd7:: with SMTP id h23mr8124859edw.264.1559502528659; Sun, 02 Jun 2019 12:08:48 -0700 (PDT) MIME-Version: 1.0 References: <20181201165348.24140-1-robdclark@gmail.com> In-Reply-To: From: Rob Clark Date: Sun, 2 Jun 2019 12:08:33 -0700 Message-ID: Subject: Re: [PATCH] of/device: add blacklist for iommu dma_ops To: Rob Herring Cc: Linux IOMMU , dri-devel , linux-arm-msm , Vivek Gautam , Tomasz Figa , Christoph Hellwig , Robin Murphy , Will Deacon , David Airlie , freedreno , Archit Taneja , Sean Paul , Doug Anderson , Daniel Vetter , Frank Rowand , devicetree@vger.kernel.org, "linux-kernel@vger.kernel.org" Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, May 10, 2019 at 7:35 AM Rob Clark wrote: > > On Tue, Dec 4, 2018 at 2:29 PM Rob Herring wrote: > > > > On Sat, Dec 1, 2018 at 10:54 AM Rob Clark wrote: > > > > > > This solves a problem we see with drm/msm, caused by getting > > > iommu_dma_ops while we attach our own domain and manage it directly at > > > the iommu API level: > > > > > > [0000000000000038] user address but active_mm is swapper > > > Internal error: Oops: 96000005 [#1] PREEMPT SMP > > > Modules linked in: > > > CPU: 7 PID: 70 Comm: kworker/7:1 Tainted: G W 4.19.3 #90 > > > Hardware name: xxx (DT) > > > Workqueue: events deferred_probe_work_func > > > pstate: 80c00009 (Nzcv daif +PAN +UAO) > > > pc : iommu_dma_map_sg+0x7c/0x2c8 > > > lr : iommu_dma_map_sg+0x40/0x2c8 > > > sp : ffffff80095eb4f0 > > > x29: ffffff80095eb4f0 x28: 0000000000000000 > > > x27: ffffffc0f9431578 x26: 0000000000000000 > > > x25: 00000000ffffffff x24: 0000000000000003 > > > x23: 0000000000000001 x22: ffffffc0fa9ac010 > > > x21: 0000000000000000 x20: ffffffc0fab40980 > > > x19: ffffffc0fab40980 x18: 0000000000000003 > > > x17: 00000000000001c4 x16: 0000000000000007 > > > x15: 000000000000000e x14: ffffffffffffffff > > > x13: ffff000000000000 x12: 0000000000000028 > > > x11: 0101010101010101 x10: 7f7f7f7f7f7f7f7f > > > x9 : 0000000000000000 x8 : ffffffc0fab409a0 > > > x7 : 0000000000000000 x6 : 0000000000000002 > > > x5 : 0000000100000000 x4 : 0000000000000000 > > > x3 : 0000000000000001 x2 : 0000000000000002 > > > x1 : ffffffc0f9431578 x0 : 0000000000000000 > > > Process kworker/7:1 (pid: 70, stack limit = 0x0000000017d08ffb) > > > Call trace: > > > iommu_dma_map_sg+0x7c/0x2c8 > > > __iommu_map_sg_attrs+0x70/0x84 > > > get_pages+0x170/0x1e8 > > > msm_gem_get_iova+0x8c/0x128 > > > _msm_gem_kernel_new+0x6c/0xc8 > > > msm_gem_kernel_new+0x4c/0x58 > > > dsi_tx_buf_alloc_6g+0x4c/0x8c > > > msm_dsi_host_modeset_init+0xc8/0x108 > > > msm_dsi_modeset_init+0x54/0x18c > > > _dpu_kms_drm_obj_init+0x430/0x474 > > > dpu_kms_hw_init+0x5f8/0x6b4 > > > msm_drm_bind+0x360/0x6c8 > > > try_to_bring_up_master.part.7+0x28/0x70 > > > component_master_add_with_match+0xe8/0x124 > > > msm_pdev_probe+0x294/0x2b4 > > > platform_drv_probe+0x58/0xa4 > > > really_probe+0x150/0x294 > > > driver_probe_device+0xac/0xe8 > > > __device_attach_driver+0xa4/0xb4 > > > bus_for_each_drv+0x98/0xc8 > > > __device_attach+0xac/0x12c > > > device_initial_probe+0x24/0x30 > > > bus_probe_device+0x38/0x98 > > > deferred_probe_work_func+0x78/0xa4 > > > process_one_work+0x24c/0x3dc > > > worker_thread+0x280/0x360 > > > kthread+0x134/0x13c > > > ret_from_fork+0x10/0x18 > > > Code: d2800004 91000725 6b17039f 5400048a (f9401f40) > > > ---[ end trace f22dda57f3648e2c ]--- > > > Kernel panic - not syncing: Fatal exception > > > SMP: stopping secondary CPUs > > > Kernel Offset: disabled > > > CPU features: 0x0,22802a18 > > > Memory Limit: none > > > > > > The problem is that when drm/msm does it's own iommu_attach_device(), > > > now the domain returned by iommu_get_domain_for_dev() is drm/msm's > > > domain, and it doesn't have domain->iova_cookie. > > > > > > We kind of avoided this problem prior to sdm845/dpu because the iommu > > > was attached to the mdp node in dt, which is a child of the toplevel > > > mdss node (which corresponds to the dev passed in dma_map_sg()). But > > > with sdm845, now the iommu is attached at the mdss level so we hit the > > > iommu_dma_ops in dma_map_sg(). > > > > > > But auto allocating/attaching a domain before the driver is probed was > > > already a blocking problem for enabling per-context pagetables for the > > > GPU. This problem is also now solved with this patch. > > > > > > Fixes: 97890ba9289c dma-mapping: detect and configure IOMMU in of_dma_configure > > > Tested-by: Douglas Anderson > > > Signed-off-by: Rob Clark > > > --- > > > This is an alternative/replacement for [1]. What it lacks in elegance > > > it makes up for in practicality ;-) > > > > > > [1] https://patchwork.freedesktop.org/patch/264930/ > > > > > > drivers/of/device.c | 22 ++++++++++++++++++++++ > > > 1 file changed, 22 insertions(+) > > > > > > diff --git a/drivers/of/device.c b/drivers/of/device.c > > > index 5957cd4fa262..15ffee00fb22 100644 > > > --- a/drivers/of/device.c > > > +++ b/drivers/of/device.c > > > @@ -72,6 +72,14 @@ int of_device_add(struct platform_device *ofdev) > > > return device_add(&ofdev->dev); > > > } > > > > > > +static const struct of_device_id iommu_blacklist[] = { > > > + { .compatible = "qcom,mdp4" }, > > > + { .compatible = "qcom,mdss" }, > > > + { .compatible = "qcom,sdm845-mdss" }, > > > + { .compatible = "qcom,adreno" }, > > > + {} > > > +}; > > > > Not completely clear to whether this is still needed or not, but this > > really won't scale. Why can't the driver for these devices override > > whatever has been setup by default? > > > > fwiw, at the moment it is not needed, but it will become needed again > to implement per-context pagetables (although I suppose for this we > only need to blacklist qcom,adreno and not also the display nodes). So, another case I've come across, on the display side.. I'm working on handling the case where bootloader enables display (and takes iommu out of reset).. as soon as DMA domain gets attached we get iommu faults, because bootloader has already configured display for scanout. Unfortunately this all happens before actual driver is probed and has a chance to intervene. It's rather unfortunate that we tried to be clever rather than just making drivers call some function to opt-in to the hookup of dma iommu ops :-( BR, -R > > The reason is that in the current state the core code creates the > first domain before the driver has a chance to intervene and tell it > not to. And this results that driver ends up using a different > context bank on the iommu than what the firmware expects. > > I guess the alternative is to put some property in DT.. but that > doesn't really feel right. I guess there aren't really many (or any?) > other drivers that have this specific problem, so I don't really > expect it to be a scaling problem. > > Yeah, it's a bit ugly, but I'll take a small ugly working hack, over > elegant but non-working any day ;-)... but if someone has a better > idea then I'm all ears. > > BR, > -R