References: <20230411225725.2032862-1-robdclark@gmail.com> <20230411225725.2032862-7-robdclark@gmail.com> <29a8d9aa-c6ea-873f-ce0b-fb8199b13068@linux.intel.com> <8893ad56-8807-eb69-2185-b338725f0b18@linux.intel.com>
In-Reply-To: <8893ad56-8807-eb69-2185-b338725f0b18@linux.intel.com>
From: Rob Clark
Date: Fri, 14 Apr 2023 06:40:27 -0700
Subject: Re: [PATCH v3 6/7] drm: Add fdinfo memory stats
To: Tvrtko Ursulin
Cc: dri-devel@lists.freedesktop.org, linux-arm-msm@vger.kernel.org,
    freedreno@lists.freedesktop.org, Boris Brezillon, Christopher Healy,
    Emil Velikov, Rob Clark, David Airlie, Maarten Lankhorst, Maxime Ripard,
    Thomas Zimmermann, Jonathan Corbet, "open list:DOCUMENTATION", open list

On Fri, Apr 14, 2023 at 1:57 AM Tvrtko Ursulin wrote:
>
>
> On 13/04/2023 21:05, Daniel Vetter wrote:
> > On Thu, Apr 13, 2023 at 05:40:21PM +0100, Tvrtko Ursulin wrote:
> >>
> >> On 13/04/2023 14:27, Daniel Vetter wrote:
> >>> On Thu, Apr 13, 2023 at 01:58:34PM +0100, Tvrtko Ursulin wrote:
> >>>>
> >>>> On 12/04/2023 20:18, Daniel Vetter wrote:
> >>>>> On Wed, Apr 12, 2023 at 11:42:07AM -0700, Rob Clark wrote:
> >>>>>> On Wed, Apr 12, 2023 at 11:17 AM Daniel Vetter wrote:
> >>>>>>>
> >>>>>>> On Wed, Apr 12, 2023 at 10:59:54AM -0700, Rob Clark wrote:
> >>>>>>>> On Wed, Apr 12, 2023 at 7:42 AM Tvrtko Ursulin
> >>>>>>>> wrote:
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>> On 11/04/2023 23:56, Rob Clark wrote:
> >>>>>>>>>> From: Rob Clark
> >>>>>>>>>>
> >>>>>>>>>> Add support to dump GEM stats to fdinfo.
> >>>>>>>>>>
> >>>>>>>>>> v2: Fix typos, change size units to match docs, use div_u64
> >>>>>>>>>> v3: Do it in core
> >>>>>>>>>>
> >>>>>>>>>> Signed-off-by: Rob Clark
> >>>>>>>>>> Reviewed-by: Emil Velikov
> >>>>>>>>>> ---
> >>>>>>>>>>   Documentation/gpu/drm-usage-stats.rst | 21 ++++++++
> >>>>>>>>>>   drivers/gpu/drm/drm_file.c            | 76 +++++++++++++++++++++++++++
> >>>>>>>>>>   include/drm/drm_file.h                |  1 +
> >>>>>>>>>>   include/drm/drm_gem.h                 | 19 +++++++
> >>>>>>>>>>   4 files changed, 117 insertions(+)
> >>>>>>>>>>
> >>>>>>>>>> diff --git a/Documentation/gpu/drm-usage-stats.rst b/Documentation/gpu/drm-usage-stats.rst
> >>>>>>>>>> index b46327356e80..b5e7802532ed 100644
> >>>>>>>>>> --- a/Documentation/gpu/drm-usage-stats.rst
> >>>>>>>>>> +++ b/Documentation/gpu/drm-usage-stats.rst
> >>>>>>>>>> @@ -105,6 +105,27 @@ object belong to this client, in the respective memory region.
> >>>>>>>>>>  Default unit shall be bytes with optional unit specifiers of 'KiB' or 'MiB'
> >>>>>>>>>>  indicating kibi- or mebi-bytes.
> >>>>>>>>>>
> >>>>>>>>>> +- drm-shared-memory: <uint> [KiB|MiB]
> >>>>>>>>>> +
> >>>>>>>>>> +The total size of buffers that are shared with another file (ie. have more
> >>>>>>>>>> +than a single handle).
> >>>>>>>>>> +
> >>>>>>>>>> +- drm-private-memory: <uint> [KiB|MiB]
> >>>>>>>>>> +
> >>>>>>>>>> +The total size of buffers that are not shared with another file.
> >>>>>>>>>> +
> >>>>>>>>>> +- drm-resident-memory: <uint> [KiB|MiB]
> >>>>>>>>>> +
> >>>>>>>>>> +The total size of buffers that are resident in system memory.
> >>>>>>>>>
> >>>>>>>>> I think this naming maybe does not work best with the existing
> >>>>>>>>> drm-memory-<region> keys.
> >>>>>>>>
> >>>>>>>> Actually, it was very deliberate not to conflict with the existing
> >>>>>>>> drm-memory-<region> keys ;-)
> >>>>>>>>
> >>>>>>>> I would have preferred drm-memory-{active,resident,...} but it
> >>>>>>>> could be mis-parsed by existing userspace, so my hands were a bit tied.
> >>>>>>>>
> >>>>>>>>> How about introducing the concept of a memory region from the start and
> >>>>>>>>> using naming similar to what we do for engines?
> >>>>>>>>>
> >>>>>>>>> drm-memory-$CATEGORY-$REGION: ...
> >>>>>>>>>
> >>>>>>>>> Then we document a bunch of categories and their semantics, for instance:
> >>>>>>>>>
> >>>>>>>>> 'size'      - All reachable objects
> >>>>>>>>> 'shared'    - Subset of 'size' with handle_count > 1
> >>>>>>>>> 'resident'  - Objects with backing store
> >>>>>>>>> 'active'    - Objects in use, subset of resident
> >>>>>>>>> 'purgeable' - Or inactive? Subset of resident.
> >>>>>>>>>
> >>>>>>>>> We keep the same semantics as with process memory accounting (if I got
> >>>>>>>>> it right), which could be desirable for a simplified mental model.
> >>>>>>>>>
> >>>>>>>>> (AMD needs to remind me of their 'drm-memory-...' keys semantics. If we
> >>>>>>>>> captured this correctly in the first round, it should be equivalent to
> >>>>>>>>> 'resident' above. In any case we can document which category it is equal
> >>>>>>>>> to, and at most one of the two must be output.)
> >>>>>>>>>
> >>>>>>>>> Region names we at most partially standardize. Like we could say
> >>>>>>>>> 'system' is to be used where the backing store is system RAM, and
> >>>>>>>>> others are driver defined.
> >>>>>>>>>
> >>>>>>>>> Then discrete GPUs could emit N sets of key-values, one for each memory
> >>>>>>>>> region they support.
> >>>>>>>>>
> >>>>>>>>> I think this all also works for objects which can be migrated between
> >>>>>>>>> memory regions. 'Size' accounts them against all regions, while for
> >>>>>>>>> 'resident' they only appear in the region of their current placement, etc.
> >>>>>>>>
> >>>>>>>> I'm not too sure how to rectify different memory regions with this,
> >>>>>>>> since drm core doesn't really know about the driver's memory regions.
> >>>>>>>> Perhaps we can go back to this being a helper and drivers with vram
> >>>>>>>> just don't use the helper? Or??
> >>>>>>>
> >>>>>>> I think if you flip it around to drm-$CATEGORY-memory{-$REGION}: then it
> >>>>>>> all works out reasonably consistently?
> >>>>>>
> >>>>>> That is basically what we have now.  I could append -system to each to
> >>>>>> make things easier to add vram/etc (from a uabi standpoint)..
> >>>>>
> >>>>> What you have isn't really -system, but everything. So it doesn't really
> >>>>> make sense to me to mark this -system; it's only really true for
> >>>>> integrated (if they don't have stolen or something like that).
> >>>>>
> >>>>> Also my comment was more in reply to Tvrtko's suggestion.
> >>>>
> >>>> Right, so my proposal was drm-memory-$CATEGORY-$REGION, which I think
> >>>> aligns with the current drm-memory-$REGION by extending it, rather than
> >>>> creating confusion with a different order of key name components.
> >>>
> >>> Oh my comment was pretty much just bikeshed, in case someone creates a
> >>> $REGION that other drivers use for $CATEGORY. Kinda Rob's parsing point.
> >>> So $CATEGORY before the -memory.
> >>>
> >>> Otoh I don't think that'll happen, so I guess we can go with whatever more
> >>> folks like :-) I don't really care much personally.
> >>
> >> Okay, I missed the parsing problem.
> >>
> >>>> AMD currently has (among others) drm-memory-vram, which we could define
> >>>> in the spec as mapping to category X when the category component is not
> >>>> present.
> >>>>
> >>>> Some examples:
> >>>>
> >>>> drm-memory-resident-system:
> >>>> drm-memory-size-lmem0:
> >>>> drm-memory-active-vram:
> >>>>
> >>>> Etc.. I think it creates a consistent story.
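
(As a purely made-up illustration of that scheme, with region names and
numbers invented, a two-region discrete GPU would then emit something
like:

   drm-memory-size-system:     8192 KiB
   drm-memory-size-vram:      65536 KiB
   drm-memory-resident-system: 4096 KiB
   drm-memory-resident-vram:  32768 KiB
   drm-memory-active-vram:    16384 KiB

with 'size' >= 'resident' >= 'active' holding within each region.)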
> >>>>
> >>>> Other than this, the two significant opens which I think haven't been
> >>>> addressed yet are:
> >>>>
> >>>> 1)
> >>>>
> >>>> Why do we want totals (not per region) when userspace can trivially
> >>>> aggregate if they want? What is the use case?
> >>>>
> >>>> 2)
> >>>>
> >>>> The current proposal limits the value to whole objects and fixates that
> >>>> by having it in the common code. If/when some driver is able to support
> >>>> sub-BO granularity, they will need to opt out of the common printer, at
> >>>> which point it may be less churn to start with a helper rather than a
> >>>> mid-layer. Or maybe some drivers already support this, I don't know.
> >>>> Given how important VM BIND is, I wouldn't be surprised.
> >>>
> >>> I feel like for drivers using ttm we want a ttm helper which takes care of
> >>> the region printing in hopefully a standard way. And that could then also
> >>> take care of all kinds of partial binding and funny rules (like maybe
> >>> we want a standard vram region that adds up all the lmem regions on
> >>> intel, so that all dgpus have a common vram bucket that generic tools
> >>> understand?).
> >>
> >> First part yes, but for the second I would think we want to avoid any
> >> aggregation in the kernel which can be done in userspace just as well. Such
> >> a total vram bucket would be pretty useless on Intel even, since userspace
> >> needs to be region aware to make use of all resources. It could even be
> >> counterproductive, I think - "why am I getting out of memory when half of
> >> my vram is unused!?".
> >
> > This is not for intel-aware userspace. This is for fairly generic "gputop"
> > style userspace, which might simply have no clue or interest in what lmemX
> > means, but would understand vram.
> >
> > Aggregating makes sense.
>
> Lmem vs vram is now an argument not about aggregation but about
> standardizing region names.
>
> One detail also is a change in philosophy compared to engine stats, where
> engine names are not centrally prescribed and it was expected userspace
> will have to handle things generically and with some vendor specific
> knowledge.
>
> Like in my gputop patches. It doesn't need to understand what is what,
> it just finds what's there and presents it to the user.
>
> Come some accel driver with local memory, it wouldn't be vram any more.
> Or even a headless data center GPU. So I really don't think it is good
> to hardcode 'vram' in the spec, or midlayer, or helpers.
>
> And for aggregation.. again, userspace can do it just as well. If we do
> it in the kernel, then immediately we have multiple sets of keys to output
> for any driver which wants to show the region view. IMO it is just
> pointless work in the kernel, and more code in the kernel, when userspace
> can do it.
>
> Proposal A (on a discrete gpu, one category only):
>
> drm-resident-memory: x KiB
> drm-resident-memory-system: x KiB
> drm-resident-memory-vram: x KiB
>
> Two loops in the kernel, more parsing in userspace.

Why would it be more than one loop, ie.

  mem.resident += size;
  mem.category[cat].resident += size;

At the end of the day, there is limited real estate to show a million
different columns of information.  Even the gputop patches I posted
don't show everything that is currently there.  And nvtop only shows
the toplevel resident stat.  So I think the "everything" stat is going
to be what most tools use.
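
Fleshing that out a bit (a rough sketch only; the struct, iterator, and
helper names below are invented for illustration, this is not actual
drm_file.c code), a single walk over the file's objects can fill both
the totals and the per-region buckets:

   struct drm_memory_stats {
           u64 shared;
           u64 private;
           u64 resident;
   };

   struct drm_memory_stats total = {};
   struct drm_memory_stats region[NUM_REGIONS] = {};

   /* one loop: every object is tallied into the grand total and
    * into the bucket of whatever region it currently lives in */
   drm_for_each_object(obj, file) {                /* invented iterator */
           size_t size = obj->size;
           unsigned int r = driver_region_of(obj); /* invented helper */

           if (obj->handle_count > 1) {
                   total.shared += size;
                   region[r].shared += size;
           } else {
                   total.private += size;
                   region[r].private += size;
           }
           if (driver_is_resident(obj)) {          /* invented helper */
                   total.resident += size;
                   region[r].resident += size;
           }
   }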
BR,
-R

> Proposal B:
>
> drm-resident-memory-system: x KiB
> drm-resident-memory-vram: x KiB
>
> Can be one loop, one helper, less text for userspace to parse, and it can
> still trivially show the total if so desired.
>
> For instance a helper (or two) with a common struct containing region
> names and totals, where a callback into the driver tallies under each
> region as the drm helper is walking objects.
>
> >>> It does mean we walk the bo list twice, but *shrug*. People have been
> >>> complaining about procutils for decades, they're still horrible, I think
> >>> walking bo lists twice internally in the ttm case is going to be ok. If
> >>> not, it's internals, we can change them again.
> >>>
> >>> Also I'd lean a lot more towards making ttm a helper and not putting that
> >>> into core, exactly because it's pretty clear we'll need more flexibility
> >>> when it comes to accurate stats for multi-region drivers.
> >>
> >> Exactly.
> >>
> >>> But for a first "how much gpu space does this app use" across everything
> >>> I think this is a good enough starting point.
> >>
> >> Okay, so we agree this would be better as a helper and not in the core.
> >
> > Nope, if you mean with "this" = Rob's patch. I was talking about a
> > hypothetical region-aware extension for ttm-using drivers.
> >
> >> On the point of whether the keys/semantics are good enough as a starting
> >> point, I am still not convinced the kernel should aggregate, and that
> >> instead we should start from day one by appending -system (or something)
> >> to Rob's proposed keys.
> >
> > It should imo. Inflicting driver knowledge on generic userspace makes not
> > much sense; we should start with the more generally useful stuff imo.
> > That's why there's the drm fdinfo spec and all that, so it's not a
> > free-for-all.
> >
> > Also Rob's stuff is _not_ system. Check on an i915 dgpu if you want :-)
>
> I am well aware it adds up everything; that is beside the point.
>
> The drm-usage-stats.rst text needs to be more precise across all keys, at
> least:
>
> +- drm-resident-memory: <uint> [KiB|MiB]
> +
> +The total size of buffers that are resident in system memory.
>
> But as said, I don't see the point in providing aggregated values.
>
> Regards,
>
> Tvrtko
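
For what it's worth, on the "userspace can trivially aggregate" point,
the aggregation really is tiny either way. A minimal sketch in C
(assuming the per-region keys from Proposal B, and KiB-only units for
brevity):

   #include <inttypes.h>
   #include <stdint.h>
   #include <stdio.h>

   /* Sum every drm-resident-memory-<region> line of an fdinfo stream
    * into a single total, in bytes.  A real tool would also handle
    * the MiB unit suffix. */
   static uint64_t total_resident(FILE *fdinfo)
   {
           char line[256];
           uint64_t total = 0, kib;

           while (fgets(line, sizeof(line), fdinfo)) {
                   if (sscanf(line,
                              "drm-resident-memory-%*[^:]: %" SCNu64 " KiB",
                              &kib) == 1)
                           total += kib << 10;
           }
           return total;
   }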