Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp1267112rwd; Wed, 31 May 2023 11:31:27 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ4MmxQSO9IuOCZSA3ZxXWcksotBtWzEF8MAbiwGrwtEXNAKjW4KqBn8JlPCQoKEhB5jUmEp X-Received: by 2002:a17:902:c642:b0:1ae:89a:a4 with SMTP id s2-20020a170902c64200b001ae089a00a4mr5720479pls.8.1685557887112; Wed, 31 May 2023 11:31:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1685557887; cv=none; d=google.com; s=arc-20160816; b=uma1YxOHW2N7ShYGiGc1WoZmFlE4+8gXuOyEKK1RgvsNzaA0M3FK4apkVyaRlJslK7 pJUi/+4rvwd14WKEy5rhMDNhogJVF1orYJTj0Z9Ni6VXzgYqixykne/n//oPtib1fcUU srWQodFwW9GNwxKfJM0/S3H4arxEqTFOmJCJrBQO6oS1UNp5qDyAf3f/GRc9NusqZwKG RbjzhO0/1dnYdpyjV+GeZS4myUc8y4oD8YHgO1DfgO2e2LvV18LkQJw3uI+/c6d+8EkX F4eJVV2HbOsKn+nKKQwhEWS1n5gpPSyHyn8sSFhofMjzvUUpj/td+8fWCFosdFJQMU60 Q43Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:user-agent :content-transfer-encoding:references:in-reply-to:date:cc:to:from :subject:message-id; bh=xDOgsAT/aFSaaslf4tz0bVVsnHojBWshS7RNH50NUZQ=; b=o1sxtBg6YAmv88WRKs4XaURBwiNrVNh/OijwAsU++D7uH+/5CShR3jDaMbo2C7JPCR f2TLm6iDKATTyTQ5BCI1v792w+AWMCfIn7kOta0MXoJnLLTP6TbdAO9twINuDkpyfrCe /qCNjFGUM8k2/3qcI/GYDLCr/8F7SyWHq8YqYuqPTInonM3RIulqIqmQSB0HLvx55l3D 2pTEXhqx+7F4SPPnuSHXngNiLY+xRi8GixhP54LXfndbtrpKD8Hn5apBWSJSLFP3VpcA L6RNaqcA1tXyppj/TdCh+bLMbihc6CsDZO0+MLlJgYMEWB+DgEx5p3z3RVkaMZcqll5R xwrA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id v11-20020a1709029a0b00b001afac49e04dsi1225753plp.642.2023.05.31.11.30.54; Wed, 31 May 2023 11:31:27 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230267AbjEaRxb convert rfc822-to-8bit (ORCPT + 99 others); Wed, 31 May 2023 13:53:31 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43984 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230040AbjEaRx3 (ORCPT ); Wed, 31 May 2023 13:53:29 -0400 Received: from metis.ext.pengutronix.de (metis.ext.pengutronix.de [IPv6:2001:67c:670:201:290:27ff:fe1d:cc33]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 9E44E125 for ; Wed, 31 May 2023 10:53:27 -0700 (PDT) Received: from ptz.office.stw.pengutronix.de ([2a0a:edc0:0:900:1d::77] helo=[IPv6:::1]) by metis.ext.pengutronix.de with esmtps (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1q4Q0g-0000LC-TQ; Wed, 31 May 2023 19:53:22 +0200 Message-ID: Subject: Re: [PATCH v6 6/6] drm/etnaviv: allow usperspace create cached coherent bo From: Lucas Stach To: Sui Jingfeng , Russell King , Christian Gmeiner , David Airlie , Daniel Vetter Cc: linux-kernel@vger.kernel.org, etnaviv@lists.freedesktop.org, dri-devel@lists.freedesktop.org, loongson-kernel@lists.loongnix.cn Date: Wed, 31 May 2023 19:53:19 +0200 In-Reply-To: <5c2faf7e-002c-dad0-c4fe-63aab04f7e87@loongson.cn> References: <20230530160643.2344551-1-suijingfeng@loongson.cn> <20230530160643.2344551-7-suijingfeng@loongson.cn> <35c15c0912b4a9372b9c2194a46b518ce515ce3d.camel@pengutronix.de> <5c2faf7e-002c-dad0-c4fe-63aab04f7e87@loongson.cn> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8BIT User-Agent: Evolution 3.46.4 (3.46.4-1.fc37) MIME-Version: 1.0 X-SA-Exim-Connect-IP: 2a0a:edc0:0:900:1d::77 X-SA-Exim-Mail-From: l.stach@pengutronix.de X-SA-Exim-Scanned: No (on metis.ext.pengutronix.de); SAEximRunCond expanded to false X-PTX-Original-Recipient: linux-kernel@vger.kernel.org X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_MED, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Am Donnerstag, dem 01.06.2023 um 01:29 +0800 schrieb Sui Jingfeng: > Hi, > > On 2023/6/1 00:33, Lucas Stach wrote: > > Hi Sui Jingfeng, > > > > Am Mittwoch, dem 31.05.2023 um 00:06 +0800 schrieb Sui Jingfeng: > > > cached system RAM is coherent on loongson CPUs, and the GPU and DC allways > > > snoop the CPU's cache. write-combine caching property is not suitiable for > > > us. > > > > > As previously mentioned in the Mesa MR, I don't think this is the right > > approach. > > > > ETNA_BO_CACHED already looks coherent to userspace, as all accesses are > > bracketed via the ETNAVIV_GEM_CPU_PREP and ETNAVIV_GEM_CPU_FINI ioctls, > > which will do the necessary cache maintenance on platforms where device > > coherence isn't enforced by the hardware, so there is no need for a > > separate ETNA_BO_CACHED_COHERENT. > > As far as I can see,  ETNA_BO_CACHED_COHERENT could probably help to > bypass the overhead of > > dma_sync_sgtable_for_cpu() and dma_sync_sgtable_for_device() brings to us. > > > I have tested long time ago, there no need call this function on our > platform. > > The glmark2 works as before if I comment out thoes two function. > > Are you serious, sir? > The dma_sync* functions are more or less no-ops when the device is marked as being coherent. As the mapping is done via the DRM device instance, you might need to propagate the coherent property from the GPU core device to the virtual DRM device, along the lines of how we propagate other DMA properties from the GPU device to the DRM device in etnaviv_pdev_probe. Other than that things should just work with minimal overhead. > > Instead we just need a new ETNAVIV_PARAM to inform userspace about > > hardware cache coherence being available for a specific GPU core, > > Ok, let me think about for a while how to implement this. > Simple: add new ETNAVIV_PARAM_GPU_COHERENT to include/uapi/drm/etnaviv_drm.h, return the result from dev_is_dma_coherent in etnaviv_gpu_get_param(). > But How about we merge this first, I create another patch to improve it > > with a roughly working base first? I'm just asking if the answer is No :-) > The answer is a firm no. This impacts UAPI, so there is no chance to ever get rid of any wrong decisions here, as any added UAPI needs to be supported indefinitely. I'm not signing up for maintaining something I believe is implemented upside down. Please don't take this the wrong way: I'm pretty excited to see etnaviv used on more architectures and outside of the proven platform device paths, so I'm happy to assist in working out the design and help you get things merged in both kernel and Mesa. But I think we are still quite a few steps away from having things worked out enough to even think about merging those patchsets. Also please allow me to comment on the other patches of the series, so I can get a better understanding of your platform/integration, before sending another revision of those patches. Regards, Lucas > > in > > which case the userspace driver should switch to preferring > > ETNA_BO_CACHED over ETNA_BO_WC. > > Yeah,  ETNA_BO_CACHED is enough. > > ETNA_BO_CACHED_COHERENT is actually a special case of ETNA_BO_CACHED. > > > Regards, > > Lucas > > > > > Signed-off-by: Sui Jingfeng > > > --- > > > drivers/gpu/drm/etnaviv/etnaviv_drv.c | 2 +- > > > drivers/gpu/drm/etnaviv/etnaviv_gem.c | 22 +++++++++++++++++++-- > > > drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c | 9 ++++++++- > > > include/uapi/drm/etnaviv_drm.h | 11 ++++++----- > > > 4 files changed, 35 insertions(+), 9 deletions(-) > > > > > > diff --git a/drivers/gpu/drm/etnaviv/etnaviv_drv.c b/drivers/gpu/drm/etnaviv/etnaviv_drv.c > > > index 052f745cecc0..2816c654c023 100644 > > > --- a/drivers/gpu/drm/etnaviv/etnaviv_drv.c > > > +++ b/drivers/gpu/drm/etnaviv/etnaviv_drv.c > > > @@ -274,7 +274,7 @@ static int etnaviv_ioctl_gem_new(struct drm_device *dev, void *data, > > > struct drm_etnaviv_gem_new *args = data; > > > > > > if (args->flags & ~(ETNA_BO_CACHED | ETNA_BO_WC | ETNA_BO_UNCACHED | > > > - ETNA_BO_FORCE_MMU)) > > > + ETNA_BO_CACHED_COHERENT | ETNA_BO_FORCE_MMU)) > > > return -EINVAL; > > > > > > return etnaviv_gem_new_handle(dev, file, args->size, > > > diff --git a/drivers/gpu/drm/etnaviv/etnaviv_gem.c b/drivers/gpu/drm/etnaviv/etnaviv_gem.c > > > index b5f73502e3dd..d8b559bd33d3 100644 > > > --- a/drivers/gpu/drm/etnaviv/etnaviv_gem.c > > > +++ b/drivers/gpu/drm/etnaviv/etnaviv_gem.c > > > @@ -343,6 +343,7 @@ void *etnaviv_gem_vmap(struct drm_gem_object *obj) > > > static void *etnaviv_gem_vmap_impl(struct etnaviv_gem_object *obj) > > > { > > > struct page **pages; > > > + pgprot_t prot; > > > > > > lockdep_assert_held(&obj->lock); > > > > > > @@ -350,8 +351,20 @@ static void *etnaviv_gem_vmap_impl(struct etnaviv_gem_object *obj) > > > if (IS_ERR(pages)) > > > return NULL; > > > > > > - return vmap(pages, obj->base.size >> PAGE_SHIFT, > > > - VM_MAP, pgprot_writecombine(PAGE_KERNEL)); > > > + switch (obj->flags) { > > > + case ETNA_BO_CACHED_COHERENT: > > > + case ETNA_BO_CACHED: > > > + prot = PAGE_KERNEL; > > > + break; > > > + case ETNA_BO_UNCACHED: > > > + prot = pgprot_noncached(PAGE_KERNEL); > > > + break; > > > + case ETNA_BO_WC: > > > + default: > > > + prot = pgprot_writecombine(PAGE_KERNEL); > > > + } > > > + > > > + return vmap(pages, obj->base.size >> PAGE_SHIFT, VM_MAP, prot); > > > } > > > > > > static inline enum dma_data_direction etnaviv_op_to_dma_dir(u32 op) > > > @@ -545,6 +558,7 @@ static const struct drm_gem_object_funcs etnaviv_gem_object_funcs = { > > > static int etnaviv_gem_new_impl(struct drm_device *dev, u32 size, u32 flags, > > > const struct etnaviv_gem_ops *ops, struct drm_gem_object **obj) > > > { > > > + struct etnaviv_drm_private *priv = dev->dev_private; > > > struct etnaviv_gem_object *etnaviv_obj; > > > unsigned sz = sizeof(*etnaviv_obj); > > > bool valid = true; > > > @@ -555,6 +569,10 @@ static int etnaviv_gem_new_impl(struct drm_device *dev, u32 size, u32 flags, > > > case ETNA_BO_CACHED: > > > case ETNA_BO_WC: > > > break; > > > + case ETNA_BO_CACHED_COHERENT: > > > + if (priv->has_cached_coherent) > > > + break; > > > + fallthrough; > > > default: > > > valid = false; > > > } > > > diff --git a/drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c b/drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c > > > index 3524b5811682..671d91d8f1c6 100644 > > > --- a/drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c > > > +++ b/drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c > > > @@ -112,11 +112,18 @@ static const struct etnaviv_gem_ops etnaviv_gem_prime_ops = { > > > struct drm_gem_object *etnaviv_gem_prime_import_sg_table(struct drm_device *dev, > > > struct dma_buf_attachment *attach, struct sg_table *sgt) > > > { > > > + struct etnaviv_drm_private *priv = dev->dev_private; > > > struct etnaviv_gem_object *etnaviv_obj; > > > size_t size = PAGE_ALIGN(attach->dmabuf->size); > > > + u32 cache_flags; > > > int ret, npages; > > > > > > - ret = etnaviv_gem_new_private(dev, size, ETNA_BO_WC, > > > + if (priv->has_cached_coherent) > > > + cache_flags = ETNA_BO_CACHED_COHERENT; > > > + else > > > + cache_flags = ETNA_BO_WC; > > > + > > > + ret = etnaviv_gem_new_private(dev, size, cache_flags, > > > &etnaviv_gem_prime_ops, &etnaviv_obj); > > > if (ret < 0) > > > return ERR_PTR(ret); > > > diff --git a/include/uapi/drm/etnaviv_drm.h b/include/uapi/drm/etnaviv_drm.h > > > index af024d90453d..474b0db286de 100644 > > > --- a/include/uapi/drm/etnaviv_drm.h > > > +++ b/include/uapi/drm/etnaviv_drm.h > > > @@ -90,13 +90,14 @@ struct drm_etnaviv_param { > > > * GEM buffers: > > > */ > > > > > > -#define ETNA_BO_CACHE_MASK 0x000f0000 > > > +#define ETNA_BO_CACHE_MASK 0x000f0000 > > > /* cache modes */ > > > -#define ETNA_BO_CACHED 0x00010000 > > > -#define ETNA_BO_WC 0x00020000 > > > -#define ETNA_BO_UNCACHED 0x00040000 > > > +#define ETNA_BO_CACHED 0x00010000 > > > +#define ETNA_BO_WC 0x00020000 > > > +#define ETNA_BO_UNCACHED 0x00040000 > > > +#define ETNA_BO_CACHED_COHERENT 0x00080000 > > > /* map flags */ > > > -#define ETNA_BO_FORCE_MMU 0x00100000 > > > +#define ETNA_BO_FORCE_MMU 0x00100000 > > > > > > struct drm_etnaviv_gem_new { > > > __u64 size; /* in */ >