Received: by 2002:a05:6a10:2726:0:0:0:0 with SMTP id ib38csp1379744pxb; Wed, 30 Mar 2022 02:01:08 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwUtckdbZtkVjf/3dH95u72vHT73ANiFQthtPni3z0dhBh/UtWiIxPj/vamR6TpOX8jvL8j X-Received: by 2002:a17:902:7c81:b0:156:30ef:7dec with SMTP id y1-20020a1709027c8100b0015630ef7decmr5955737pll.74.1648630868014; Wed, 30 Mar 2022 02:01:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1648630868; cv=none; d=google.com; s=arc-20160816; b=Xvulj0iaeJXUwJFTd7Hst7VBBge4Zj2napqFAZ6K/0b08ug8owBYBhXBtQoylBGivP OTsVdBkcluU5AbCaDKkrPXoNPyieyJEaQyAstmPm4HznKpaVUBo5oI+S6KkPVhhCfT4d FgETcEheSOxafBid5pEcv41dnfuqt/S6KlmUnE28YPV7U2iSMXOGcd8fL10okIrZ68Ro cVJGqcT1qjK7mQu1wDgxzRtiUF3o8UqvxEtRjYco+ZHygNGBykRc8ykoAhWzpuDbyUik O2DUjwNm52uk3djsiIEGBTj+Ol2wJUtJKAtU3ekSAUoiioxitFR0J6CrFX+8UzHTiHjo akBw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=6WzK0iEK1tA2vwWOojnK+sRWtG13aOccwS9qWJMzFmw=; b=CbIUdg2MGhgimDX2fgc2o3fohpLFGKZlFRjsFHkiimqgFP7aT898sQW1I9n/cXE3KU 05TARy4izVgdh2YU6I1tKzBcSxW1qH1asxwU2UlhFX5R/+OiDarfMZm72V2kf8JmFDPu u/L1Pya/IXARsreQqbpgS/LM1WMyFZ3zCSTRlBReJ5GpYOdkKZoAtlOu4JGKAHYIAwn7 J3SpI1NPr6/nWuRbkcDKUO2GcnKdQczWORv1h9lIzG7xqVBFN32onHH5JWx9cofJLjFj LhnDaLz32Ka2PyEX5Okl4vSsigKFXcbKT2Crtn2Zk9DqPQ3sWlgs02nwnsZL/YNbISub fEBw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=eRTnso2D; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id v24-20020a63f218000000b003821cee9789si18249676pgh.32.2022.03.30.02.00.54; Wed, 30 Mar 2022 02:01:08 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=eRTnso2D; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S239682AbiC2RyR (ORCPT + 99 others); Tue, 29 Mar 2022 13:54:17 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50642 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240343AbiC2RyP (ORCPT ); Tue, 29 Mar 2022 13:54:15 -0400 Received: from mail-ej1-x630.google.com (mail-ej1-x630.google.com [IPv6:2a00:1450:4864:20::630]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C12501ED05D for ; Tue, 29 Mar 2022 10:52:31 -0700 (PDT) Received: by mail-ej1-x630.google.com with SMTP id bg10so36678488ejb.4 for ; Tue, 29 Mar 2022 10:52:31 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=6WzK0iEK1tA2vwWOojnK+sRWtG13aOccwS9qWJMzFmw=; b=eRTnso2DqyGizkbBzNBOPjTRvAPn3ud895nlnWhPZ4UjGAm0OZxfB37FkU+nLvw6Cx OjNt45Va6PSjR4jK/bub1G1Q3eTo2GeQB+9pmTpO4CEGMm7rql+cSZ6Fk7LljiP+p10C lxPJry/Sh5OS2akJE+Ov36FPSfFSIUMNK+CUkcmjeNbw5ebd8aKUtviSH7NrYxnUal9v ZfK1nt8TTht/oGsn1JXKfPS1WbWw3Q7+dKjy0p0lvMt7OfpCHQtlvvmCSVRY3NAuMarF eEYs9Y4FfbPI1RoXStyWd/49c6fAbwgFa8ui0Ba0QbKPeSvOkII27LGSKggdYimqvZpK U24g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=6WzK0iEK1tA2vwWOojnK+sRWtG13aOccwS9qWJMzFmw=; b=5CH8Tb8P39lYXxl+XQTcYcW8dThLKy9yBuNPMU7ePkJbnIpCMo0ffc4snZ2HSW6fhw 91exmKdejQiFNLn6KnpwqxGAmhIUX5NFwMfh/nXCc70CqMZSJ49QU1cD6Xc+J4eHwzDO iHgXH2bpQsiNHmEN0bXOg3Lb5cAQk7UPHI6L5xTxXqf/AKLe1ddnj+Ji7XlxTjf60Vem NDIBdkzgVLZV0nntWKtBUqAUELwYxF/9csWQ+aPVUMJ3rudIEBNlXUZygd5Z9n+CMYjA VUkBmaA3F/9ADfpYnvktn9gP/Avly00wQvv6KPJogdSzgVIipCXWwRwtvleYwfx5KlzI BQ3A== X-Gm-Message-State: AOAM531AyxVDkfplw8RETFygn1b2eNZN73pCgxjbtHe1Cux1PxjzNj4G vGWTnQRUwnbecpsJSWV6C8vKv9ErmIIGefvl4a7mww== X-Received: by 2002:a17:906:5d08:b0:6da:b4ea:937 with SMTP id g8-20020a1709065d0800b006dab4ea0937mr37059399ejt.446.1648576349854; Tue, 29 Mar 2022 10:52:29 -0700 (PDT) MIME-Version: 1.0 References: <20220328035951.1817417-1-tjmercier@google.com> <20220328035951.1817417-5-tjmercier@google.com> In-Reply-To: From: "T.J. Mercier" Date: Tue, 29 Mar 2022 10:52:18 -0700 Message-ID: Subject: Re: [RFC v4 4/8] dmabuf: heaps: export system_heap buffers with GPU cgroup charging To: "T.J. Mercier" , David Airlie , Maarten Lankhorst , Maxime Ripard , Thomas Zimmermann , Jonathan Corbet , Greg Kroah-Hartman , =?UTF-8?B?QXJ2ZSBIasO4bm5ldsOlZw==?= , Todd Kjos , Martijn Coenen , Joel Fernandes , Christian Brauner , Hridya Valsaraju , Suren Baghdasaryan , Sumit Semwal , =?UTF-8?Q?Christian_K=C3=B6nig?= , Benjamin Gaignard , Liam Mark , Laura Abbott , Brian Starkey , John Stultz , Tejun Heo , Zefan Li , Johannes Weiner , Shuah Khan , Kalesh Singh , Kenny.Ho@amd.com, =?UTF-8?Q?Michal_Koutn=C3=BD?= , Shuah Khan , dri-devel@lists.freedesktop.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-media@vger.kernel.org, linaro-mm-sig@lists.linaro.org, cgroups@vger.kernel.org, linux-kselftest@vger.kernel.org Cc: Daniel Vetter Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Spam-Status: No, score=-17.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL,USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Mar 29, 2022 at 1:42 AM Daniel Vetter wrote: > > On Mon, Mar 28, 2022 at 11:28:24AM -0700, T.J. Mercier wrote: > > On Mon, Mar 28, 2022 at 7:36 AM Daniel Vetter wrote: > > > > > > On Mon, Mar 28, 2022 at 03:59:43AM +0000, T.J. Mercier wrote: > > > > From: Hridya Valsaraju > > > > > > > > All DMA heaps now register a new GPU cgroup device upon creation, a= nd the > > > > system_heap now exports buffers associated with its GPU cgroup devi= ce for > > > > tracking purposes. > > > > > > > > Signed-off-by: Hridya Valsaraju > > > > Signed-off-by: T.J. Mercier > > > > > > > > --- > > > > v3 changes > > > > Use more common dual author commit message format per John Stultz. > > > > > > > > v2 changes > > > > Move dma-buf cgroup charge transfer from a dma_buf_op defined by ev= ery > > > > heap to a single dma-buf function for all heaps per Daniel Vetter a= nd > > > > Christian K=C3=B6nig. > > > > > > Apologies for being out of the loop quite a bit. I scrolled through t= his > > > all and I think it looks good to get going. > > > > > > The only thing I have is whether we should move the cgroup controller= s out > > > of dma-buf heaps, since that's rather android centric. E.g. > > > - a system gpucg_device which is used by all the various single page > > > allocators (dma-buf heap but also shmem helpers and really anything > > > else) > > > - same for cma, again both for dma-buf heaps and also for the gem cma > > > helpers in drm > > > > Thanks Daniel, in general that makes sense to me as an approach to > > making this more universal. However for the Android case I'm not sure > > if the part about a single system gpucg_device would be sufficient, > > because there are at least 12 different graphics related heaps that > > could potentially be accounted/limited differently. [1] So that > > raises the question of how fine grained we want this to be... I tend > > towards separating them all, but I haven't formed a strong opinion > > about this at the moment. It sounds like you are in favor of a > > smaller, more rigidly defined set of them? Either way, we need to add > > code for accounting at points where we know memory is specifically for > > graphics use and not something else right? (I.E. Whether it is a > > dma-buf heap or somewhere like drm_gem_object_init.) So IIUC the only > > question is what to use for the gpucg_device(s) at these locations. > > We don't have 12 in upstream, so this is a lot easier here :-) > > I'm not exactly sure why you have such a huge pile of them. > > For gem buffers it would be fairly similar to what you've done for dma-bu= f > heaps I think, with the various helper libraries (drivers stopped > hand-rolling their gem buffer) setting the right accounting group. And > yeah for system memory I think we'd need to have standard ones, for drive= r > specific ones it's kinda different. > > > [1] https://cs.android.com/android/platform/superproject/+/master:hardw= are/google/graphics/common/libion/ion.cpp;l=3D39-50 > > > > > > > > Otherwise this will only work on non-upstream android where gpu drive= rs > > > allocate everything from dma-buf heap. If you use something like the = x86 > > > android project with mesa drivers, then driver-internal buffers will = be > > > allocated through gem and not through dma-buf heaps. Or at least I th= ink > > > that's how it works. > > > > > > But also meh, we can fix this fairly easily later on by adding these > > > standard gpucg_dev somwehere with a bit of kerneldoc. > > > > This is what I was thinking would happen next, but IDK if anyone sees > > a more central place to do this type of use-specific accounting. > > Hm I just realized ... are the names in the groups abi? If yes then I > think we need to fix this before we merge anything. > -Daniel Do you mean the set of possible names being part of the ABI for GPU cgroups? I'm not exactly sure what you mean here. The name is a settable string inside the gpucg_device struct, and right now the docs say it must be from a string literal but this can be changed. The only one this patchset adds is "system", which comes from the name field in its dma_heap_export_info struct when it's first created (and that string is hardcoded). The heap gpucg_devices are all registered from one spot in dma-heap.c, so maybe we should append "-heap" to the gpucg_device names there so "system" is available for a standardized name. Let me make those two changes, and I will also rebase this series before resending. > > > > > > > > > Anyway has my all my ack, but don't count this as my in-depth review = :-) > > > -Daniel > > > > Thanks again for taking a look! > > > > > > > --- > > > > drivers/dma-buf/dma-heap.c | 27 +++++++++++++++++++++++++= ++ > > > > drivers/dma-buf/heaps/system_heap.c | 3 +++ > > > > include/linux/dma-heap.h | 11 +++++++++++ > > > > 3 files changed, 41 insertions(+) > > > > > > > > diff --git a/drivers/dma-buf/dma-heap.c b/drivers/dma-buf/dma-heap.= c > > > > index 8f5848aa144f..885072427775 100644 > > > > --- a/drivers/dma-buf/dma-heap.c > > > > +++ b/drivers/dma-buf/dma-heap.c > > > > @@ -7,6 +7,7 @@ > > > > */ > > > > > > > > #include > > > > +#include > > > > #include > > > > #include > > > > #include > > > > @@ -31,6 +32,7 @@ > > > > * @heap_devt heap device node > > > > * @list list head connecting to list of heaps > > > > * @heap_cdev heap char device > > > > + * @gpucg_dev gpu cgroup device for memory accounti= ng > > > > * > > > > * Represents a heap of memory from which buffers can be made. > > > > */ > > > > @@ -41,6 +43,9 @@ struct dma_heap { > > > > dev_t heap_devt; > > > > struct list_head list; > > > > struct cdev heap_cdev; > > > > +#ifdef CONFIG_CGROUP_GPU > > > > + struct gpucg_device gpucg_dev; > > > > +#endif > > > > }; > > > > > > > > static LIST_HEAD(heap_list); > > > > @@ -216,6 +221,26 @@ const char *dma_heap_get_name(struct dma_heap = *heap) > > > > return heap->name; > > > > } > > > > > > > > +#ifdef CONFIG_CGROUP_GPU > > > > +/** > > > > + * dma_heap_get_gpucg_dev() - get struct gpucg_device for the heap= . > > > > + * @heap: DMA-Heap to get the gpucg_device struct for. > > > > + * > > > > + * Returns: > > > > + * The gpucg_device struct for the heap. NULL if the GPU cgroup co= ntroller is > > > > + * not enabled. > > > > + */ > > > > +struct gpucg_device *dma_heap_get_gpucg_dev(struct dma_heap *heap) > > > > +{ > > > > + return &heap->gpucg_dev; > > > > +} > > > > +#else /* CONFIG_CGROUP_GPU */ > > > > +struct gpucg_device *dma_heap_get_gpucg_dev(struct dma_heap *heap) > > > > +{ > > > > + return NULL; > > > > +} > > > > +#endif /* CONFIG_CGROUP_GPU */ > > > > + > > > > struct dma_heap *dma_heap_add(const struct dma_heap_export_info *e= xp_info) > > > > { > > > > struct dma_heap *heap, *h, *err_ret; > > > > @@ -288,6 +313,8 @@ struct dma_heap *dma_heap_add(const struct dma_= heap_export_info *exp_info) > > > > list_add(&heap->list, &heap_list); > > > > mutex_unlock(&heap_list_lock); > > > > > > > > + gpucg_register_device(dma_heap_get_gpucg_dev(heap), exp_info-= >name); > > > > + > > > > return heap; > > > > > > > > err2: > > > > diff --git a/drivers/dma-buf/heaps/system_heap.c b/drivers/dma-buf/= heaps/system_heap.c > > > > index ab7fd896d2c4..752a05c3cfe2 100644 > > > > --- a/drivers/dma-buf/heaps/system_heap.c > > > > +++ b/drivers/dma-buf/heaps/system_heap.c > > > > @@ -395,6 +395,9 @@ static struct dma_buf *system_heap_allocate(str= uct dma_heap *heap, > > > > exp_info.ops =3D &system_heap_buf_ops; > > > > exp_info.size =3D buffer->len; > > > > exp_info.flags =3D fd_flags; > > > > +#ifdef CONFIG_CGROUP_GPU > > > > + exp_info.gpucg_dev =3D dma_heap_get_gpucg_dev(heap); > > > > +#endif > > > > exp_info.priv =3D buffer; > > > > dmabuf =3D dma_buf_export(&exp_info); > > > > if (IS_ERR(dmabuf)) { > > > > diff --git a/include/linux/dma-heap.h b/include/linux/dma-heap.h > > > > index 0c05561cad6e..e447a61d054e 100644 > > > > --- a/include/linux/dma-heap.h > > > > +++ b/include/linux/dma-heap.h > > > > @@ -10,6 +10,7 @@ > > > > #define _DMA_HEAPS_H > > > > > > > > #include > > > > +#include > > > > #include > > > > > > > > struct dma_heap; > > > > @@ -59,6 +60,16 @@ void *dma_heap_get_drvdata(struct dma_heap *heap= ); > > > > */ > > > > const char *dma_heap_get_name(struct dma_heap *heap); > > > > > > > > +/** > > > > + * dma_heap_get_gpucg_dev() - get a pointer to the struct gpucg_de= vice for the > > > > + * heap. > > > > + * @heap: DMA-Heap to retrieve gpucg_device for. > > > > + * > > > > + * Returns: > > > > + * The gpucg_device struct for the heap. > > > > + */ > > > > +struct gpucg_device *dma_heap_get_gpucg_dev(struct dma_heap *heap)= ; > > > > + > > > > /** > > > > * dma_heap_add - adds a heap to dmabuf heaps > > > > * @exp_info: information needed to register this h= eap > > > > -- > > > > 2.35.1.1021.g381101b075-goog > > > > > > > > > > -- > > > Daniel Vetter > > > Software Engineer, Intel Corporation > > > http://blog.ffwll.ch > > -- > Daniel Vetter > Software Engineer, Intel Corporation > http://blog.ffwll.ch