Received: by 2002:a05:6a10:c604:0:0:0:0 with SMTP id y4csp2869501pxt; Mon, 9 Aug 2021 10:43:49 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyGYv0lco05nmPDTyaNX47lNo9yijvfa0Cy8p+4KcjMoJIguz19LVxiPy/2vICyOMTnG2cT X-Received: by 2002:a05:6402:b99:: with SMTP id cf25mr31237252edb.130.1628531029074; Mon, 09 Aug 2021 10:43:49 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1628531029; cv=none; d=google.com; s=arc-20160816; b=XJtYcL4uFjjylCWKM9eVY30NY8EUg3Jipro+QCEIuwAvBBSlwRO2cP/kYRSZcydSXP Na3qgac2a7bgQYyI3vpS1o4tX0voIauGmJ+cUWhRO5ldgPEz6RMQEjUl/v8X/3zDMzDm SkrL2MbtONuiLBWGlz/iXOsKqLbjp85Q2TS63DKSSMXC1FcAyTehFhPqwAT0XfsnkJfK 1oOOgADNEVh+RwUVrXaCMm4zcotnao+il5LnKqVCDyqlWQeaPsALFKjlJvk3Z4FLYUU0 UL79ls65xN4jmQmIgkN+fzR89HgnWt2TfQ9sZEaCX0VHtyCHzLx6bDQXofSt8g7aB3Wu tMqg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=tsa8pzXrHj5AGYUrj29mmLTcfAzxvuczycE3wcOo1pY=; b=qItZGakptYXwlVnwSEpv581Xah5x6Wb5ESNepkYgiv9s05XasA3y9psGu/M/6dFPFn i804kxwe1Rcd8hxoCWiUHbw4kHKsRNlvTWt9uJC4rTuOUW8z39eJlh/kzu2ogbbQv8NE 7l6hjOLs5NWagTanOwcSvGl5NZLJef8ZEERogZOE4U4FRYFotQNlmMa+GQQNIHyGc3Mx nSuaB7q31GCQHZyVx3AEKe9O/Zqc+9hnynQ1vcH8UiuY3FDl1KxJMBcI6mdpWQkReHt2 NO7LzSTN6AvWDTKzuh/JNcuAKV3y9QVK5MU4bCm2OneCLcO8NcSo3qX4okwEzfNn2+lL k7WQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=YxJy11mi; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id gf2si17906041ejb.500.2021.08.09.10.43.23; Mon, 09 Aug 2021 10:43:49 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=YxJy11mi; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234547AbhHIRku (ORCPT + 99 others); Mon, 9 Aug 2021 13:40:50 -0400 Received: from mail.kernel.org ([198.145.29.99]:41590 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234473AbhHIRku (ORCPT ); Mon, 9 Aug 2021 13:40:50 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 6D5C56056B; Mon, 9 Aug 2021 17:40:26 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1628530829; bh=BIKbIviH9SpdRUOz4zp7Qr9LpfMy8guxskktS3QxA2c=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=YxJy11mivVEo2UaBSB2xiRgRmjERNhE4SAb9XXON7MqwSwaegBla8YhtlQ0M97H/v ddzB4KQPMUy8eR4Nb+Ylch/pCyZbDfeE3tGaPChCILwJK/2q0RRkhNYQkbcGH70re0 9Jwq1wnQL+fAiRQ1GbJvQoxnvA+jTbbqC/Nd4gK2Qhesw5nRwk5NMHprVCI7KuITvJ iDnF0DF25umEYeh0IQv92pi7+OWU3t4wP1GICbH63L7+b7XS1hwOThYj7eGVchmdYo uHA2MZxJs2VNT6mQ+E1jMEQXw4pTXfYdIvhUYkqHYqBPjE2DyaXcU/E23ebgOtBlwh 7X7g2G6tevPug== Date: Mon, 9 Aug 2021 18:40:23 +0100 From: Will Deacon To: Rob Clark Cc: Sai Prakash Ranjan , Georgi Djakov , "Isaac J. Manjarres" , David Airlie , Akhil P Oommen , "list@263.net:IOMMU DRIVERS , Joerg Roedel ," , Linux Kernel Mailing List , Sean Paul , Jordan Crouse , Kristian H Kristensen , dri-devel , Daniel Vetter , linux-arm-msm , freedreno , Robin Murphy , "moderated list:ARM/FREESCALE IMX / MXC ARM ARCHITECTURE" Subject: Re: [Freedreno] [PATCH 0/3] iommu/drm/msm: Allow non-coherent masters to use system cache Message-ID: <20210809174022.GA1840@willie-the-truck> References: <20210728140052.GB22887@mms-0441> <8b2742c8891abe4fec3664730717a089@codeaurora.org> <20210802105544.GA27657@willie-the-truck> <20210802151409.GE28735@willie-the-truck> <20210809145651.GC1458@willie-the-truck> <20210809170508.GB1589@willie-the-truck> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Aug 09, 2021 at 10:18:21AM -0700, Rob Clark wrote: > On Mon, Aug 9, 2021 at 10:05 AM Will Deacon wrote: > > > > On Mon, Aug 09, 2021 at 09:57:08AM -0700, Rob Clark wrote: > > > On Mon, Aug 9, 2021 at 7:56 AM Will Deacon wrote: > > > > On Mon, Aug 02, 2021 at 06:36:04PM -0700, Rob Clark wrote: > > > > > On Mon, Aug 2, 2021 at 8:14 AM Will Deacon wrote: > > > > > > On Mon, Aug 02, 2021 at 08:08:07AM -0700, Rob Clark wrote: > > > > > > > On Mon, Aug 2, 2021 at 3:55 AM Will Deacon wrote: > > > > > > > > On Thu, Jul 29, 2021 at 10:08:22AM +0530, Sai Prakash Ranjan wrote: > > > > > > > > > On 2021-07-28 19:30, Georgi Djakov wrote: > > > > > > > > > > On Mon, Jan 11, 2021 at 07:45:02PM +0530, Sai Prakash Ranjan wrote: > > > > > > > > > > > commit ecd7274fb4cd ("iommu: Remove unused IOMMU_SYS_CACHE_ONLY flag") > > > > > > > > > > > removed unused IOMMU_SYS_CACHE_ONLY prot flag and along with it went > > > > > > > > > > > the memory type setting required for the non-coherent masters to use > > > > > > > > > > > system cache. Now that system cache support for GPU is added, we will > > > > > > > > > > > need to set the right PTE attribute for GPU buffers to be sys cached. > > > > > > > > > > > Without this, the system cache lines are not allocated for GPU. > > > > > > > > > > > > > > > > > > > > > > So the patches in this series introduces a new prot flag IOMMU_LLC, > > > > > > > > > > > renames IO_PGTABLE_QUIRK_ARM_OUTER_WBWA to IO_PGTABLE_QUIRK_PTW_LLC > > > > > > > > > > > and makes GPU the user of this protection flag. > > > > > > > > > > > > > > > > > > > > Thank you for the patchset! Are you planning to refresh it, as it does > > > > > > > > > > not apply anymore? > > > > > > > > > > > > > > > > > > > > > > > > > > > > I was waiting on Will's reply [1]. If there are no changes needed, then > > > > > > > > > I can repost the patch. > > > > > > > > > > > > > > > > I still think you need to handle the mismatched alias, no? You're adding > > > > > > > > a new memory type to the SMMU which doesn't exist on the CPU side. That > > > > > > > > can't be right. > > > > > > > > > > > > > > > > > > > > > > Just curious, and maybe this is a dumb question, but what is your > > > > > > > concern about mismatched aliases? I mean the cache hierarchy on the > > > > > > > GPU device side (anything beyond the LLC) is pretty different and > > > > > > > doesn't really care about the smmu pgtable attributes.. > > > > > > > > > > > > If the CPU accesses a shared buffer with different attributes to those which > > > > > > the device is using then you fall into the "mismatched memory attributes" > > > > > > part of the Arm architecture. It's reasonably unforgiving (you should go and > > > > > > read it) and in some cases can apply to speculative accesses as well, but > > > > > > the end result is typically loss of coherency. > > > > > > > > > > Ok, I might have a few other sections to read first to decipher the > > > > > terminology.. > > > > > > > > > > But my understanding of LLC is that it looks just like system memory > > > > > to the CPU and GPU (I think that would make it "the point of > > > > > coherence" between the GPU and CPU?) If that is true, shouldn't it be > > > > > invisible from the point of view of different CPU mapping options? > > > > > > > > You could certainly build a system where mismatched attributes don't cause > > > > loss of coherence, but as it's not guaranteed by the architecture and the > > > > changes proposed here affect APIs which are exposed across SoCs, then I > > > > don't think it helps much. > > > > > > > > > > Hmm, the description of the new mapping flag is that it applies only > > > to transparent outer level cache: > > > > > > +/* > > > + * Non-coherent masters can use this page protection flag to set cacheable > > > + * memory attributes for only a transparent outer level of cache, also known as > > > + * the last-level or system cache. > > > + */ > > > +#define IOMMU_LLC (1 << 6) > > > > > > But I suppose we could call it instead IOMMU_QCOM_LLC or something > > > like that to make it more clear that it is not necessarily something > > > that would work with a different outer level cache implementation? > > > > ... or we could just deal with the problem so that other people can reuse > > the code. I haven't really understood the reluctance to solve this properly. > > > > Am I missing some reason this isn't solvable? > > Oh, was there another way to solve it (other than foregoing setting > INC_OCACHE in the pgtables)? Maybe I misunderstood, is there a > corresponding setting on the MMU pgtables side of things? Right -- we just need to program the CPU's MMU with the matching memory attributes! It's a bit more fiddly if you're just using ioremap_wc() though, as it's usually the DMA API which handles the attributes under the hood. Anyway, sorry, I should've said that explicitly earlier on. We've done this sort of thing in the Android tree so I assumed Sai knew what needed to be done and then I didn't think to explain to you :( Will