Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp519907rwd; Thu, 1 Jun 2023 03:28:54 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ5gkwEpqKrvJe9y7ML+QigjkiqPEWDz6WPV6Mo8OKrz/nYZQpRkSDPNbkuCsAxlgmHKVZ93 X-Received: by 2002:a05:6a00:b50:b0:64c:a554:f577 with SMTP id p16-20020a056a000b5000b0064ca554f577mr11237073pfo.11.1685615333902; Thu, 01 Jun 2023 03:28:53 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1685615333; cv=none; d=google.com; s=arc-20160816; b=LzMpenrE1H5Rc8ptjeHXdIXv2S9S2y0KWCt2lkODz5K2yxgmcWchS0ap518OatO1g5 nBLmO4CdvWl8D6ijE43nEGOgSUhtI1DDWGZaEWi6sETmH5lr907+e5hztAZcKKHFDRyi stVT4UNk2Xr2iPdCLkguLyfi98ryzjE6bkN0JxrtN3wUCINi5ZeVzqo4VBL2NPadyI2e U687l6P2sNRq2OHA2+/AMMP5ADU2mtmv572DfBZDBbkTTKQEDCefhKg3uOEG/2D8LijA VEZjU7lt0nKBAWpuyeM4eWTlv4//yhWMNhpqNaxoKuRmRMJWzfijQXeCf9i0dAfgU/pQ mUfg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to :organization:from:references:cc:to:content-language:subject :user-agent:mime-version:date:message-id; bh=ku5ds84HBnIWBF9H6NNcspOV4qew2V1CzVsYPQwK+bk=; b=ZxODv7iheE9Q8l6Oh3ZMtiyJ1nCMKS+KtRPfvxgtevxFhtGYIh5lQqe11fScen1tzi 03ilvyl9ebOU0daDKAY1L0P9Oop4gVRhqm6OPMpA6k7P3PA/jupKyd72ypQ5CJXaN5ci Re0tJTaDDS65eeFrEdn4dfpmFaUMhIkiiyy4M5d4xwV+g3EWCWNovVfPN4Yr8YOe/KlX bh86em6vXVRKa2I68c/1BFMlfCxS01WxoDi0FMX8t43MCaOSuclypqIiYsPH44iaVvq4 ladvmBwzAknYv4kTlw4neSPu0UUzH5FUJdiX2y/IF+z59bE2yfffPexiJulT5sjhRTOJ bi6Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id b3-20020aa79503000000b0063d2b810fc6si5229026pfp.305.2023.06.01.03.28.36; Thu, 01 Jun 2023 03:28:53 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233422AbjFAKVM (ORCPT + 99 others); Thu, 1 Jun 2023 06:21:12 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54696 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233468AbjFAKTf (ORCPT ); Thu, 1 Jun 2023 06:19:35 -0400 Received: from mail.loongson.cn (mail.loongson.cn [114.242.206.163]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 37FE41994 for ; Thu, 1 Jun 2023 03:16:22 -0700 (PDT) Received: from loongson.cn (unknown [10.20.42.43]) by gateway (Coremail) with SMTP id _____8AxQ_A_b3hk5T8DAA--.7183S3; Thu, 01 Jun 2023 18:13:19 +0800 (CST) Received: from [10.20.42.43] (unknown [10.20.42.43]) by localhost.localdomain (Coremail) with SMTP id AQAAf8Bx8a8+b3hkAw2EAA--.18413S3; Thu, 01 Jun 2023 18:13:18 +0800 (CST) Message-ID: Date: Thu, 1 Jun 2023 18:13:18 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.11.0 Subject: Re: [PATCH v6 6/6] drm/etnaviv: allow usperspace create cached coherent bo Content-Language: en-US To: Lucas Stach , Russell King , Christian Gmeiner , David Airlie , Daniel Vetter Cc: linux-kernel@vger.kernel.org, etnaviv@lists.freedesktop.org, dri-devel@lists.freedesktop.org, loongson-kernel@lists.loongnix.cn References: <20230530160643.2344551-1-suijingfeng@loongson.cn> <20230530160643.2344551-7-suijingfeng@loongson.cn> <35c15c0912b4a9372b9c2194a46b518ce515ce3d.camel@pengutronix.de> <5c2faf7e-002c-dad0-c4fe-63aab04f7e87@loongson.cn> From: Sui Jingfeng Organization: Loongson In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-CM-TRANSID: AQAAf8Bx8a8+b3hkAw2EAA--.18413S3 X-CM-SenderInfo: xvxlyxpqjiv03j6o00pqjv00gofq/ X-Coremail-Antispam: 1Uk129KBjvJXoW3Xw48JFyDKF13Xr4xAr1DZFb_yoWfJw4xpF W7AFyYkrW8ZrWjkw1Iv3Z8A34fKw12qFWvk34ktw1q9398tFsrKr18KFW5Crn5Ar1fCr1a qr1jyry3uF1UArJanT9S1TB71UUUUjUqnTZGkaVYY2UrUUUUj1kv1TuYvTs0mT0YCTnIWj qI5I8CrVACY4xI64kE6c02F40Ex7xfYxn0WfASr-VFAUDa7-sFnT9fnUUIcSsGvfJTRUUU bqAYFVCjjxCrM7AC8VAFwI0_Jr0_Gr1l1xkIjI8I6I8E6xAIw20EY4v20xvaj40_Wr0E3s 1l1IIY67AEw4v_Jrv_JF1l8cAvFVAK0II2c7xJM28CjxkF64kEwVA0rcxSw2x7M28EF7xv wVC0I7IYx2IY67AKxVW5JVW7JwA2z4x0Y4vE2Ix0cI8IcVCY1x0267AKxVW8JVWxJwA2z4 x0Y4vEx4A2jsIE14v26r4UJVWxJr1l84ACjcxK6I8E87Iv6xkF7I0E14v26F4UJVW0owAa w2AFwI0_Jrv_JF1le2I262IYc4CY6c8Ij28IcVAaY2xG8wAqjxCEc2xF0cIa020Ex4CE44 I27wAqx4xG64xvF2IEw4CE5I8CrVC2j2WlYx0E2Ix0cI8IcVAFwI0_Jw0_WrylYx0Ex4A2 jsIE14v26F4j6r4UJwAm72CE4IkC6x0Yz7v_Jr0_Gr1lF7xvr2IY64vIr41lc7I2V7IY0V AS07AlzVAYIcxG8wCY1x0262kKe7AKxVWUAVWUtwCF04k20xvY0x0EwIxGrwCFx2IqxVCF s4IE7xkEbVWUJVW8JwCFI7km07C267AKxVWUXVWUAwC20s026c02F40E14v26r1j6r18MI 8I3I0E7480Y4vE14v26r106r1rMI8E67AF67kF1VAFwI0_Jw0_GFylIxkGc2Ij64vIr41l IxAIcVC0I7IYx2IY67AKxVW8JVW5JwCI42IY6xIIjxv20xvEc7CjxVAFwI0_Gr0_Cr1lIx AIcVCF04k26cxKx2IYs7xG6r1j6r1xMIIF0xvEx4A2jsIE14v26F4j6r4UJwCI42IY6I8E 87Iv6xkF7I0E14v26r4j6r4UJbIYCTnIWIevJa73UjIFyTuYvjxU2JKsUUUUU X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,NICE_REPLY_A, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, On 2023/6/1 01:53, Lucas Stach wrote: > Am Donnerstag, dem 01.06.2023 um 01:29 +0800 schrieb Sui Jingfeng: >> Hi, >> >> On 2023/6/1 00:33, Lucas Stach wrote: >>> Hi Sui Jingfeng, >>> >>> Am Mittwoch, dem 31.05.2023 um 00:06 +0800 schrieb Sui Jingfeng: >>>> cached system RAM is coherent on loongson CPUs, and the GPU and DC allways >>>> snoop the CPU's cache. write-combine caching property is not suitiable for >>>> us. >>>> >>> As previously mentioned in the Mesa MR, I don't think this is the right >>> approach. >>> >>> ETNA_BO_CACHED already looks coherent to userspace, as all accesses are >>> bracketed via the ETNAVIV_GEM_CPU_PREP and ETNAVIV_GEM_CPU_FINI ioctls, >>> which will do the necessary cache maintenance on platforms where device >>> coherence isn't enforced by the hardware, so there is no need for a >>> separate ETNA_BO_CACHED_COHERENT. >> As far as I can see,  ETNA_BO_CACHED_COHERENT could probably help to >> bypass the overhead of >> >> dma_sync_sgtable_for_cpu() and dma_sync_sgtable_for_device() brings to us. >> >> >> I have tested long time ago, there no need call this function on our >> platform. >> >> The glmark2 works as before if I comment out thoes two function. >> >> Are you serious, sir? >> > The dma_sync* functions are more or less no-ops when the device is > marked as being coherent. ce > instance, you might need to propagate the coherent property from the > GPU core device to the virtual DRM device, along the lines of how we > propagate other DMA properties from the GPU device to the DRM device in > etnaviv_pdev_probe. > > Other than that things should just work with minimal overhead. >>> Instead we just need a new ETNAVIV_PARAM to inform userspace about >>> hardware cache coherence being available for a specific GPU core, >> Ok, let me think about for a while how to implement this. >> > Simple: add new ETNAVIV_PARAM_GPU_COHERENT to > include/uapi/drm/etnaviv_drm.h, return the result from > dev_is_dma_coherent in etnaviv_gpu_get_param(). Okay, agree >> But How about we merge this first, I create another patch to improve it >> >> with a roughly working base first? I'm just asking if the answer is No :-) >> > The answer is a firm no. > > This impacts UAPI, so there is no chance to ever get rid of any wrong > decisions here, as any added UAPI needs to be supported indefinitely. > I'm not signing up for maintaining something I believe is implemented > upside down. > > Please don't take this the wrong way: I'm pretty excited to see etnaviv > used on more architectures and outside of the proven platform device > paths, so I'm happy to assist in working out the design and help you > get things merged in both kernel and Mesa. But I think we are still > quite a few steps away from having things worked out enough to even > think about merging those patchsets. > > Also please allow me to comment on the other patches of the series, so > I can get a better understanding of your platform/integration, before > sending another revision of those patches. I go to sleep yesterday. Okay, this sound fine. > Regards, > Lucas > >>> in >>> which case the userspace driver should switch to preferring >>> ETNA_BO_CACHED over ETNA_BO_WC. >> Yeah,  ETNA_BO_CACHED is enough. >> >> ETNA_BO_CACHED_COHERENT is actually a special case of ETNA_BO_CACHED. >> >>> Regards, >>> Lucas >>> >>>> Signed-off-by: Sui Jingfeng >>>> --- >>>> drivers/gpu/drm/etnaviv/etnaviv_drv.c | 2 +- >>>> drivers/gpu/drm/etnaviv/etnaviv_gem.c | 22 +++++++++++++++++++-- >>>> drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c | 9 ++++++++- >>>> include/uapi/drm/etnaviv_drm.h | 11 ++++++----- >>>> 4 files changed, 35 insertions(+), 9 deletions(-) >>>> >>>> diff --git a/drivers/gpu/drm/etnaviv/etnaviv_drv.c b/drivers/gpu/drm/etnaviv/etnaviv_drv.c >>>> index 052f745cecc0..2816c654c023 100644 >>>> --- a/drivers/gpu/drm/etnaviv/etnaviv_drv.c >>>> +++ b/drivers/gpu/drm/etnaviv/etnaviv_drv.c >>>> @@ -274,7 +274,7 @@ static int etnaviv_ioctl_gem_new(struct drm_device *dev, void *data, >>>> struct drm_etnaviv_gem_new *args = data; >>>> >>>> if (args->flags & ~(ETNA_BO_CACHED | ETNA_BO_WC | ETNA_BO_UNCACHED | >>>> - ETNA_BO_FORCE_MMU)) >>>> + ETNA_BO_CACHED_COHERENT | ETNA_BO_FORCE_MMU)) >>>> return -EINVAL; >>>> >>>> return etnaviv_gem_new_handle(dev, file, args->size, >>>> diff --git a/drivers/gpu/drm/etnaviv/etnaviv_gem.c b/drivers/gpu/drm/etnaviv/etnaviv_gem.c >>>> index b5f73502e3dd..d8b559bd33d3 100644 >>>> --- a/drivers/gpu/drm/etnaviv/etnaviv_gem.c >>>> +++ b/drivers/gpu/drm/etnaviv/etnaviv_gem.c >>>> @@ -343,6 +343,7 @@ void *etnaviv_gem_vmap(struct drm_gem_object *obj) >>>> static void *etnaviv_gem_vmap_impl(struct etnaviv_gem_object *obj) >>>> { >>>> struct page **pages; >>>> + pgprot_t prot; >>>> >>>> lockdep_assert_held(&obj->lock); >>>> >>>> @@ -350,8 +351,20 @@ static void *etnaviv_gem_vmap_impl(struct etnaviv_gem_object *obj) >>>> if (IS_ERR(pages)) >>>> return NULL; >>>> >>>> - return vmap(pages, obj->base.size >> PAGE_SHIFT, >>>> - VM_MAP, pgprot_writecombine(PAGE_KERNEL)); >>>> + switch (obj->flags) { >>>> + case ETNA_BO_CACHED_COHERENT: >>>> + case ETNA_BO_CACHED: >>>> + prot = PAGE_KERNEL; >>>> + break; >>>> + case ETNA_BO_UNCACHED: >>>> + prot = pgprot_noncached(PAGE_KERNEL); >>>> + break; >>>> + case ETNA_BO_WC: >>>> + default: >>>> + prot = pgprot_writecombine(PAGE_KERNEL); >>>> + } >>>> + >>>> + return vmap(pages, obj->base.size >> PAGE_SHIFT, VM_MAP, prot); >>>> } >>>> >>>> static inline enum dma_data_direction etnaviv_op_to_dma_dir(u32 op) >>>> @@ -545,6 +558,7 @@ static const struct drm_gem_object_funcs etnaviv_gem_object_funcs = { >>>> static int etnaviv_gem_new_impl(struct drm_device *dev, u32 size, u32 flags, >>>> const struct etnaviv_gem_ops *ops, struct drm_gem_object **obj) >>>> { >>>> + struct etnaviv_drm_private *priv = dev->dev_private; >>>> struct etnaviv_gem_object *etnaviv_obj; >>>> unsigned sz = sizeof(*etnaviv_obj); >>>> bool valid = true; >>>> @@ -555,6 +569,10 @@ static int etnaviv_gem_new_impl(struct drm_device *dev, u32 size, u32 flags, >>>> case ETNA_BO_CACHED: >>>> case ETNA_BO_WC: >>>> break; >>>> + case ETNA_BO_CACHED_COHERENT: >>>> + if (priv->has_cached_coherent) >>>> + break; >>>> + fallthrough; >>>> default: >>>> valid = false; >>>> } >>>> diff --git a/drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c b/drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c >>>> index 3524b5811682..671d91d8f1c6 100644 >>>> --- a/drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c >>>> +++ b/drivers/gpu/drm/etnaviv/etnaviv_gem_prime.c >>>> @@ -112,11 +112,18 @@ static const struct etnaviv_gem_ops etnaviv_gem_prime_ops = { >>>> struct drm_gem_object *etnaviv_gem_prime_import_sg_table(struct drm_device *dev, >>>> struct dma_buf_attachment *attach, struct sg_table *sgt) >>>> { >>>> + struct etnaviv_drm_private *priv = dev->dev_private; >>>> struct etnaviv_gem_object *etnaviv_obj; >>>> size_t size = PAGE_ALIGN(attach->dmabuf->size); >>>> + u32 cache_flags; >>>> int ret, npages; >>>> >>>> - ret = etnaviv_gem_new_private(dev, size, ETNA_BO_WC, >>>> + if (priv->has_cached_coherent) >>>> + cache_flags = ETNA_BO_CACHED_COHERENT; >>>> + else >>>> + cache_flags = ETNA_BO_WC; >>>> + >>>> + ret = etnaviv_gem_new_private(dev, size, cache_flags, >>>> &etnaviv_gem_prime_ops, &etnaviv_obj); >>>> if (ret < 0) >>>> return ERR_PTR(ret); >>>> diff --git a/include/uapi/drm/etnaviv_drm.h b/include/uapi/drm/etnaviv_drm.h >>>> index af024d90453d..474b0db286de 100644 >>>> --- a/include/uapi/drm/etnaviv_drm.h >>>> +++ b/include/uapi/drm/etnaviv_drm.h >>>> @@ -90,13 +90,14 @@ struct drm_etnaviv_param { >>>> * GEM buffers: >>>> */ >>>> >>>> -#define ETNA_BO_CACHE_MASK 0x000f0000 >>>> +#define ETNA_BO_CACHE_MASK 0x000f0000 >>>> /* cache modes */ >>>> -#define ETNA_BO_CACHED 0x00010000 >>>> -#define ETNA_BO_WC 0x00020000 >>>> -#define ETNA_BO_UNCACHED 0x00040000 >>>> +#define ETNA_BO_CACHED 0x00010000 >>>> +#define ETNA_BO_WC 0x00020000 >>>> +#define ETNA_BO_UNCACHED 0x00040000 >>>> +#define ETNA_BO_CACHED_COHERENT 0x00080000 >>>> /* map flags */ >>>> -#define ETNA_BO_FORCE_MMU 0x00100000 >>>> +#define ETNA_BO_FORCE_MMU 0x00100000 >>>> >>>> struct drm_etnaviv_gem_new { >>>> __u64 size; /* in */ -- Jingfeng