Received: by 2002:a05:6a10:c604:0:0:0:0 with SMTP id y4csp2899939pxt; Mon, 9 Aug 2021 11:27:55 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwv9KoQ9lgOMk/MUa58vGQHEC9TcVhlOEVa4XUaX+1l0oVl4igH4FDV1ZKWdIgH/MxMDhiE X-Received: by 2002:a92:c68a:: with SMTP id o10mr14068ilg.163.1628533675712; Mon, 09 Aug 2021 11:27:55 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1628533675; cv=none; d=google.com; s=arc-20160816; b=TafzkDGfCi3ThTQmToB+nTwuJSM5kVLki5YtaTk082b1HmimIY1g7ugJdrYzzGYwtF ewHBUg9Yd/q0mKH8yVyT/aU/fmx0eUgxYs7ACN8ILQ0lAxihpBUkqqyEIdN6LNuK08a3 gOa1KHJ+s2S8SnEuAjKBw2e3XkmlP+qjLdWEoUXlsOsobUUloWuNhBoxoTNDdPFrUHWT +GEFDTABBRurwerHd+8rghUdAD4jhhXIxt5PpTmzlnjBBD9j5kpCzLkiw8cZuS6bwKkM xI6jN69tquroTyqoxMEgVbE+L/ndHG5ixqsPxTUn40EuNP/mui9G1mvtEngIGjLTKYv1 PoXA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=I6cFLsVYbYRFP0hZKX95CD3+fYy/muCKans4k1cHBIU=; b=EWJcwtVU2HMALdaS7cyo20fVgbVQM9JUfQeHXjdQXrDrufK6SZ1Pl6dYg3POnAaFTi uSvSgzKWoFp6nGsjdU4X4FyIt6B2PgL9Unrz0h6v//YjqsYBhBTXHcLUCLRRWplBcup0 1kMvYrkS+M3XbHG3LRu2rEqE1ylngZvozv8Q0/xF0NFzeqdEsYi5ZnUfL8VPTiOW9quQ MMITA7t+3scRYMlc0gEbKGwujqQtrtrnQSGjs4elPibQWDmmk4lWVTKzKkEg2wwfZ4NX fpdBLMiZ7NwL+WHkesWUffYSUr0lVtC7WZRHTAGDhYYNKX12oXo/xhswfnRULVp/78nX DNBQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=OgtCpEGx; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id e17si12547129iot.103.2021.08.09.11.27.43; Mon, 09 Aug 2021 11:27:55 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=OgtCpEGx; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234350AbhHIS0x (ORCPT + 99 others); Mon, 9 Aug 2021 14:26:53 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43530 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230039AbhHIS0w (ORCPT ); Mon, 9 Aug 2021 14:26:52 -0400 Received: from mail-wm1-x332.google.com (mail-wm1-x332.google.com [IPv6:2a00:1450:4864:20::332]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 8965FC0613D3; Mon, 9 Aug 2021 11:26:31 -0700 (PDT) Received: by mail-wm1-x332.google.com with SMTP id o7-20020a05600c5107b0290257f956e02dso70063wms.1; Mon, 09 Aug 2021 11:26:31 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=I6cFLsVYbYRFP0hZKX95CD3+fYy/muCKans4k1cHBIU=; b=OgtCpEGxPyPutM9eJ0tbgC7trfe/M5qWa+iBNNwGaevKXJQ8jK5XUWAdZwcOrqei+u 0optc10CksMLo2bI2MEgkwVJinsfYZuGi4iWHYqy3MV7pOKRxznVwY5wvIDCnK24g7S8 gWIpI1ANvks9UW73/GMMoy7HtDbdlDz2LXH7cJNum+sfgHtCPavmAIDzUwPPjgh9YCmy 0Y7S38dALrm4olvcdNC+41LPV3Fc881BbPdVGcAJAyzVg94z1qUPvdVfmyhxxIdw7ZYj 7JLURiKwX9lsJDoLK5k4PdgIz1ZsSA/7BnrQELq+3C9SCEjxzpNorGgYYDA2ECDBkzH5 IIAg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=I6cFLsVYbYRFP0hZKX95CD3+fYy/muCKans4k1cHBIU=; b=PskFmr01CxDxF4/B7jOOpuxZUVn68Xy0JrUpvjd1ao2EOTazVvXG3Ids+RrZAXq/9/ bGOTmQ/O9ySNze0m3OTaCpjpfFtEPst/NhFFVroPkj5+mhnayV4fW4zNbUqokVHMJhSj r7MmrAM9De3TRv+JUZOAsA0KopD7S0t3e9ryN5cyxckAkQnw/HqSgX0fMdC70WEsklhV KUG5klkO8i47LWp2exfpgrf1jSVCG7jaeG4GTHBWejqkj95cJeA8mDUpSCex73kBKp5W za1BZo3JLjwNnxgvre0L+tt86k22FNbAmHUIqgfNl+k5lgDHPVhRr+294g80WyPpPzLC KfIw== X-Gm-Message-State: AOAM531E1pRkIfgx32BLbcyyG3B1iOXC3oWulmNWh119AXBWk4PNtTIW ixCoU6E0qmKWBJKSWkpDOjOcRgQ33pSivMoa04M= X-Received: by 2002:a1c:f414:: with SMTP id z20mr504283wma.94.1628533590124; Mon, 09 Aug 2021 11:26:30 -0700 (PDT) MIME-Version: 1.0 References: <20210728140052.GB22887@mms-0441> <8b2742c8891abe4fec3664730717a089@codeaurora.org> <20210802105544.GA27657@willie-the-truck> <20210802151409.GE28735@willie-the-truck> <20210809145651.GC1458@willie-the-truck> <20210809170508.GB1589@willie-the-truck> <20210809174022.GA1840@willie-the-truck> <76bfd0b4248148dfbf9d174ddcb4c2a2@codeaurora.org> <8e5edd6886a0c3a5f6c8cb4dff517224@codeaurora.org> In-Reply-To: <8e5edd6886a0c3a5f6c8cb4dff517224@codeaurora.org> From: Rob Clark Date: Mon, 9 Aug 2021 11:30:45 -0700 Message-ID: Subject: Re: [Freedreno] [PATCH 0/3] iommu/drm/msm: Allow non-coherent masters to use system cache To: Sai Prakash Ranjan Cc: Will Deacon , Georgi Djakov , "Isaac J. Manjarres" , David Airlie , Akhil P Oommen , "list@263.net:IOMMU DRIVERS , Joerg Roedel ," , Linux Kernel Mailing List , Sean Paul , Jordan Crouse , Kristian H Kristensen , dri-devel , Daniel Vetter , linux-arm-msm , freedreno , Robin Murphy , "moderated list:ARM/FREESCALE IMX / MXC ARM ARCHITECTURE" Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Aug 9, 2021 at 11:11 AM Sai Prakash Ranjan wrote: > > On 2021-08-09 23:37, Rob Clark wrote: > > On Mon, Aug 9, 2021 at 10:47 AM Sai Prakash Ranjan > > wrote: > >> > >> On 2021-08-09 23:10, Will Deacon wrote: > >> > On Mon, Aug 09, 2021 at 10:18:21AM -0700, Rob Clark wrote: > >> >> On Mon, Aug 9, 2021 at 10:05 AM Will Deacon wrote: > >> >> > > >> >> > On Mon, Aug 09, 2021 at 09:57:08AM -0700, Rob Clark wrote: > >> >> > > On Mon, Aug 9, 2021 at 7:56 AM Will Deacon wrote: > >> >> > > > On Mon, Aug 02, 2021 at 06:36:04PM -0700, Rob Clark wrote: > >> >> > > > > On Mon, Aug 2, 2021 at 8:14 AM Will Deacon wrote: > >> >> > > > > > On Mon, Aug 02, 2021 at 08:08:07AM -0700, Rob Clark wrote: > >> >> > > > > > > On Mon, Aug 2, 2021 at 3:55 AM Will Deacon wrote: > >> >> > > > > > > > On Thu, Jul 29, 2021 at 10:08:22AM +0530, Sai Prakash Ranjan wrote: > >> >> > > > > > > > > On 2021-07-28 19:30, Georgi Djakov wrote: > >> >> > > > > > > > > > On Mon, Jan 11, 2021 at 07:45:02PM +0530, Sai Prakash Ranjan wrote: > >> >> > > > > > > > > > > commit ecd7274fb4cd ("iommu: Remove unused IOMMU_SYS_CACHE_ONLY flag") > >> >> > > > > > > > > > > removed unused IOMMU_SYS_CACHE_ONLY prot flag and along with it went > >> >> > > > > > > > > > > the memory type setting required for the non-coherent masters to use > >> >> > > > > > > > > > > system cache. Now that system cache support for GPU is added, we will > >> >> > > > > > > > > > > need to set the right PTE attribute for GPU buffers to be sys cached. > >> >> > > > > > > > > > > Without this, the system cache lines are not allocated for GPU. > >> >> > > > > > > > > > > > >> >> > > > > > > > > > > So the patches in this series introduces a new prot flag IOMMU_LLC, > >> >> > > > > > > > > > > renames IO_PGTABLE_QUIRK_ARM_OUTER_WBWA to IO_PGTABLE_QUIRK_PTW_LLC > >> >> > > > > > > > > > > and makes GPU the user of this protection flag. > >> >> > > > > > > > > > > >> >> > > > > > > > > > Thank you for the patchset! Are you planning to refresh it, as it does > >> >> > > > > > > > > > not apply anymore? > >> >> > > > > > > > > > > >> >> > > > > > > > > > >> >> > > > > > > > > I was waiting on Will's reply [1]. If there are no changes needed, then > >> >> > > > > > > > > I can repost the patch. > >> >> > > > > > > > > >> >> > > > > > > > I still think you need to handle the mismatched alias, no? You're adding > >> >> > > > > > > > a new memory type to the SMMU which doesn't exist on the CPU side. That > >> >> > > > > > > > can't be right. > >> >> > > > > > > > > >> >> > > > > > > > >> >> > > > > > > Just curious, and maybe this is a dumb question, but what is your > >> >> > > > > > > concern about mismatched aliases? I mean the cache hierarchy on the > >> >> > > > > > > GPU device side (anything beyond the LLC) is pretty different and > >> >> > > > > > > doesn't really care about the smmu pgtable attributes.. > >> >> > > > > > > >> >> > > > > > If the CPU accesses a shared buffer with different attributes to those which > >> >> > > > > > the device is using then you fall into the "mismatched memory attributes" > >> >> > > > > > part of the Arm architecture. It's reasonably unforgiving (you should go and > >> >> > > > > > read it) and in some cases can apply to speculative accesses as well, but > >> >> > > > > > the end result is typically loss of coherency. > >> >> > > > > > >> >> > > > > Ok, I might have a few other sections to read first to decipher the > >> >> > > > > terminology.. > >> >> > > > > > >> >> > > > > But my understanding of LLC is that it looks just like system memory > >> >> > > > > to the CPU and GPU (I think that would make it "the point of > >> >> > > > > coherence" between the GPU and CPU?) If that is true, shouldn't it be > >> >> > > > > invisible from the point of view of different CPU mapping options? > >> >> > > > > >> >> > > > You could certainly build a system where mismatched attributes don't cause > >> >> > > > loss of coherence, but as it's not guaranteed by the architecture and the > >> >> > > > changes proposed here affect APIs which are exposed across SoCs, then I > >> >> > > > don't think it helps much. > >> >> > > > > >> >> > > > >> >> > > Hmm, the description of the new mapping flag is that it applies only > >> >> > > to transparent outer level cache: > >> >> > > > >> >> > > +/* > >> >> > > + * Non-coherent masters can use this page protection flag to set cacheable > >> >> > > + * memory attributes for only a transparent outer level of cache, also known as > >> >> > > + * the last-level or system cache. > >> >> > > + */ > >> >> > > +#define IOMMU_LLC (1 << 6) > >> >> > > > >> >> > > But I suppose we could call it instead IOMMU_QCOM_LLC or something > >> >> > > like that to make it more clear that it is not necessarily something > >> >> > > that would work with a different outer level cache implementation? > >> >> > > >> >> > ... or we could just deal with the problem so that other people can reuse > >> >> > the code. I haven't really understood the reluctance to solve this properly. > >> >> > > >> >> > Am I missing some reason this isn't solvable? > >> >> > >> >> Oh, was there another way to solve it (other than foregoing setting > >> >> INC_OCACHE in the pgtables)? Maybe I misunderstood, is there a > >> >> corresponding setting on the MMU pgtables side of things? > >> > > >> > Right -- we just need to program the CPU's MMU with the matching memory > >> > attributes! It's a bit more fiddly if you're just using ioremap_wc() > >> > though, as it's usually the DMA API which handles the attributes under > >> > the > >> > hood. > >> > > >> > Anyway, sorry, I should've said that explicitly earlier on. We've done > >> > this > >> > sort of thing in the Android tree so I assumed Sai knew what needed to > >> > be > >> > done and then I didn't think to explain to you :( > >> > > >> > >> Right I was aware of that but even in the android tree there is no > >> user > >> :) > >> I think we can't have a new memory type without any user right in > >> upstream > >> like android tree? > >> > >> @Rob, I think you already tried adding a new MT and used > >> pgprot_syscached() > >> in GPU driver but it was crashing? > > > > Correct, but IIRC there were some differences in the code for memory > > types compared to the android tree.. I couldn't figure out the > > necessary patches to cherry-pick to get the android patch to apply > > cleanly, so I tried re-implementing it without having much of a clue > > about how that code works (which was probably the issue) ;-) > > > > Hehe no, even I get the same crash after porting/modifying the required > patches from android ;) and I think crashes would be seen in android as > well, its just that they don't have any user exercising that code. > > Thing is I can't make head and tail of the GPU crash logs, maybe you > know > how to decode those errors, if not I can start a thread with QC GPU team > and ask them to decode? > If you have a gpu devcore dump, I can take a look at it with crashdec.. otherwise I can try to find the branch where I had that patch backported. I'm more familiar with using crashdec to figure out mesa bugs, but maybe I could spot something where what the GPU is seeing disagrees with what the CPU expects it to be seeing. BR, -R