Received: by 2002:a05:6602:18e:0:0:0:0 with SMTP id m14csp606396ioo; Sat, 21 May 2022 08:22:29 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzqDPGdpHAIOqxb0vgagCSdh8bzxBKeDGpXr41llWS2LVoeAuXeqItaT8VUVdlNCOxhA6ZI X-Received: by 2002:a17:906:6a1b:b0:6fe:c2f4:c232 with SMTP id qw27-20020a1709066a1b00b006fec2f4c232mr1685656ejc.251.1653146549478; Sat, 21 May 2022 08:22:29 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1653146549; cv=none; d=google.com; s=arc-20160816; b=lcJbOJO3HlMHkOLnXanfwcdIHhJEguqS+wUt3AVZfaWJaRn/pB3ereRhnwsWm/RDOY PMvqMoF8UhptSRdboe1zVjZlaDBFGfQiMwiKoexc8s079wtCpy3W+cvtWKu8lIb9v/XU Kk9KWikMI9sjF3lFoqhCBGAy+eBrmeMZ5KHOTb7GoMqj3kSda4wLzjy7gYnu4moK856Y BwVhMs12QDBzPAhjhnIdQE9eGY45j1Zk6IK+7D92RJdQozds61eAcP+/B8bN61gap2AA e5zV4SMwhOogV5GHCrpntG4jrE/dqX4ADkvwMZkow8pSMVgc/clB1y9NpH1Yb6rU5c8Z BGHQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:mail-followup-to:message-id:subject:cc:to:from:date :dkim-signature; bh=vi8FLHHiPPyioIPEKRDcfMPiA7dt4CZBFdLz0VkHkSk=; b=i3d1ee0a5m1x3SVdlK630sUV2+bKiZ5Ho29wCMmbljwJBhvaD3+2YYXbnHriRX9uCk 6h8kkDOrd1GGeW4A/Do4dShmWNh1MaKbtCZoZtGRnD6S4EzyC/clB4SlSwPHu2tOdaYW 8hQGORVgKYeD6Bl6x6b8Q77NLvS9eLHP07A3BBIbuUXOHgmveJA+ar848UUw9WtOyvgP lQ5+rX64gn0HJ7u9sC/yxB4ppscWaIsGkVNtxZLQsVqo/jZf0JZx/nNp13ETQTYKodTb BhD6mgQl9BUJLJCaUcuNk8TVioRwC32CLgFSl8dcTCGXEKSq7oPZ0LbbIA0oKRe0ZToQ VVsg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ffwll.ch header.s=google header.b=Mx5msfln; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id v14-20020a50f08e000000b0042ac740aedcsi4564889edl.470.2022.05.21.08.21.51; Sat, 21 May 2022 08:22:29 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@ffwll.ch header.s=google header.b=Mx5msfln; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S239502AbiESONP (ORCPT + 99 others); Thu, 19 May 2022 10:13:15 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60788 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233051AbiESONO (ORCPT ); Thu, 19 May 2022 10:13:14 -0400 Received: from mail-ej1-x629.google.com (mail-ej1-x629.google.com [IPv6:2a00:1450:4864:20::629]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5A84F68F87 for ; Thu, 19 May 2022 07:13:12 -0700 (PDT) Received: by mail-ej1-x629.google.com with SMTP id n13so8351919ejv.1 for ; Thu, 19 May 2022 07:13:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ffwll.ch; s=google; h=date:from:to:cc:subject:message-id:mail-followup-to:references :mime-version:content-disposition:in-reply-to; bh=vi8FLHHiPPyioIPEKRDcfMPiA7dt4CZBFdLz0VkHkSk=; b=Mx5msflnyEjn+csTnt3GB7T0omMZFuDuJt7I780xq0LRIqO3vUafELKRJ45PUXaNWe 74jJLzwXPU5CgbeFr1a9MeQK0ecteG6FKSVVqxfaiPerXonPNrppBNzGbGGPw2wGKJkR cyIQrFBNlKaA9Wm1gRyO0YEqUWg5RB3TROUSw= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id :mail-followup-to:references:mime-version:content-disposition :in-reply-to; bh=vi8FLHHiPPyioIPEKRDcfMPiA7dt4CZBFdLz0VkHkSk=; b=LCK0U9OtCd6GQGoeDV3vZiK9aWIQ3GnNrmANr+t6VjAaOTEFdDRvLw9CzCAB9x3tU2 DMNCjP/l/WQwsxQkHkNkFs7SnAOvPLMU/b03Mm+sAQ50FqKAWOE524WQCSHQ2vHw2/IF H5XorF66ClsR67gFEtXbyI5q5j+8olwbTvggAdwujCKm7sBLJ3SG6b7djpPPpbcL8PE8 pn89n2zTollZZsbwEGr2rqHhx/lu4z5bp+WpFdtb0b4aTZLbOiuPPW+22FGEBuU7tNPA WZc7W2aA4VSxoVKxusuS05+oee54rg+lpaHacByHGgw1n/XLAZxbTrRiScZTFTOkBiTE /awQ== X-Gm-Message-State: AOAM533hcrdSb1GVy2ShmCO++Me31lsXFJ74jzGk/NPteNP94D1XQQ+E /GvikE55IkN3e2h1Jtx+phupbQ== X-Received: by 2002:a17:907:628c:b0:6ee:70cf:d59 with SMTP id nd12-20020a170907628c00b006ee70cf0d59mr4580472ejc.402.1652969590886; Thu, 19 May 2022 07:13:10 -0700 (PDT) Received: from phenom.ffwll.local ([2a02:168:57f4:0:efd0:b9e5:5ae6:c2fa]) by smtp.gmail.com with ESMTPSA id i2-20020aa7c702000000b0042617ba6380sm2894441edq.10.2022.05.19.07.13.08 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 19 May 2022 07:13:09 -0700 (PDT) Date: Thu, 19 May 2022 16:13:07 +0200 From: Daniel Vetter To: Dmitry Osipenko Cc: Daniel Vetter , Thomas Zimmermann , David Airlie , Gerd Hoffmann , Gurchetan Singh , Chia-I Wu , Daniel Almeida , Gert Wollny , Gustavo Padovan , Daniel Stone , Tomeu Vizoso , Maarten Lankhorst , Maxime Ripard , Rob Herring , Steven Price , Alyssa Rosenzweig , Rob Clark , Emil Velikov , Robin Murphy , Dmitry Osipenko , linux-kernel@vger.kernel.org, dri-devel@lists.freedesktop.org, virtualization@lists.linux-foundation.org Subject: Re: [PATCH v4 11/15] drm/shmem-helper: Add generic memory shrinker Message-ID: Mail-Followup-To: Dmitry Osipenko , Thomas Zimmermann , David Airlie , Gerd Hoffmann , Gurchetan Singh , Chia-I Wu , Daniel Almeida , Gert Wollny , Gustavo Padovan , Daniel Stone , Tomeu Vizoso , Maarten Lankhorst , Maxime Ripard , Rob Herring , Steven Price , Alyssa Rosenzweig , Rob Clark , Emil Velikov , Robin Murphy , Dmitry Osipenko , linux-kernel@vger.kernel.org, dri-devel@lists.freedesktop.org, virtualization@lists.linux-foundation.org References: <5fdf5232-e2b2-b444-5a41-f1db7e6a04da@collabora.com> <3429a12f-9fbe-b66b-dbbd-94a1df54714e@collabora.com> <0ae6fed7-b166-d2b8-0e42-84b94b777c20@collabora.com> <31bc7a14-ff30-6961-b4fc-0aad83551df9@collabora.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <31bc7a14-ff30-6961-b4fc-0aad83551df9@collabora.com> X-Operating-System: Linux phenom 5.10.0-8-amd64 X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, May 12, 2022 at 10:04:53PM +0300, Dmitry Osipenko wrote: > On 5/12/22 20:04, Daniel Vetter wrote: > > On Thu, 12 May 2022 at 13:36, Dmitry Osipenko > > wrote: > >> > >> On 5/11/22 22:09, Daniel Vetter wrote: > >>> On Wed, May 11, 2022 at 07:06:18PM +0300, Dmitry Osipenko wrote: > >>>> On 5/11/22 16:09, Daniel Vetter wrote: > >>>>>>>>> I'd like to ask you to reduce the scope of the patchset and build the > >>>>>>>>> shrinker only for virtio-gpu. I know that I first suggested to build > >>>>>>>>> upon shmem helpers, but it seems that it's easier to do that in a later > >>>>>>>>> patchset. > >>>>>>>> The first version of the VirtIO shrinker didn't support memory eviction. > >>>>>>>> Memory eviction support requires page fault handler to be aware of the > >>>>>>>> evicted pages, what should we do about it? The page fault handling is a > >>>>>>>> part of memory management, hence to me drm-shmem is already kinda a MM. > >>>>>>> Hm I still don't get that part, why does that also not go through the > >>>>>>> shmem helpers? > >>>>>> The drm_gem_shmem_vm_ops includes the page faults handling, it's a > >>>>>> helper by itself that is used by DRM drivers. > >>>>>> > >>>>>> I could try to move all the shrinker logic to the VirtIO and re-invent > >>>>>> virtio_gem_shmem_vm_ops, but what is the point of doing this for each > >>>>>> driver if we could have it once and for all in the common drm-shmem code? > >>>>>> > >>>>>> Maybe I should try to factor out all the shrinker logic from drm-shmem > >>>>>> into a new drm-shmem-shrinker that could be shared by drivers? Will you > >>>>>> be okay with this option? > >>>>> I think we're talking past each another a bit. I'm only bringing up the > >>>>> purge vs eviction topic we discussed in the other subthread again. > >>>> > >>>> Thomas asked to move the whole shrinker code to the VirtIO driver and > >>>> I's saying that this is not a great idea to me, or am I misunderstanding > >>>> the Thomas' suggestion? Thomas? > >>> > >>> I think it was just me creating a confusion here. > >>> > >>> fwiw I do also think that shrinker in shmem helpers makes sense, just in > >>> case that was also lost in confusion. > >> > >> Okay, good that we're on the same page now. > >> > >>>>>>> I'm still confused why drivers need to know the difference > >>>>>>> between evition and purging. Or maybe I'm confused again. > >>>>>> Example: > >>>>>> > >>>>>> If userspace uses IOV addresses, then these addresses must be kept > >>>>>> reserved while buffer is evicted. > >>>>>> > >>>>>> If BO is purged, then we don't need to retain the IOV space allocated > >>>>>> for the purged BO. > >>>>> Yeah but is that actually needed by anyone? If userspace fails to allocate > >>>>> another bo because of lack of gpu address space then it's very easy to > >>>>> handle that: > >>>>> > >>>>> 1. Make a rule that "out of gpu address space" gives you a special errno > >>>>> code like ENOSPC > >>>>> > >>>>> 2. If userspace gets that it walks the list of all buffers it marked as > >>>>> purgeable and nukes them (whether they have been evicted or not). Then it > >>>>> retries the bo allocation. > >>>>> > >>>>> Alternatively you can do step 2 also directly from the bo alloc ioctl in > >>>>> step 1. Either way you clean up va space, and actually a lot more (you > >>>>> potentially nuke all buffers marked as purgeable, not just the ones that > >>>>> have been purged already) and only when va cleanup is actually needed > >>>>> > >>>>> Trying to solve this problem at eviction time otoh means: > >>>>> - we have this difference between eviction and purging > >>>>> - it's still not complete, you still need to glue step 2 above into your > >>>>> driver somehow, and once step 2 above is glued in doing additional > >>>>> cleanup in the purge function is just duplicated logic > >>>>> > >>>>> So at least in my opinion this isn't the justification we need. And we > >>>>> should definitely not just add that complication "in case, for the > >>>>> future", if we don't have a real need right now. Adding it later on is > >>>>> easy, removing it later on because it just gets in the way and confuses is > >>>>> much harder. > >>>> > >>>> The IOVA space is only one example. > >>>> > >>>> In case of the VirtIO driver, we may have two memory allocation for a > >>>> BO. One is the shmem allcation in guest and the other is in host's vram. > >>>> If we will only release the guest's memory on purge, then the vram will > >>>> remain allocated until BO is destroyed, which unnecessarily sub-optimal. > >>> > >>> Hm but why don't you just nuke the memory on the host side too when you > >>> evict? Allowing the guest memory to be swapped out while keeping the host > >>> memory allocation alive also doesn't make a lot of sense for me. Both can > >>> be recreated (I guess at least?) on swap-in. > >> > >> Shouldn't be very doable or at least worth the efforts. It's userspace > >> that manages data uploading, kernel only provides transport for the > >> virtio-gpu commands. > >> > >> Drivers are free to use the same function for both purge() and evict() > >> callbacks if they want. Getting rid of the purge() callback creates more > >> problems than solves, IMO. > > > > Hm this still sounds pretty funny and defeats the point of > > purgeable/evictable buffers a bit I think. But also I guess we'd > > pushed this bikeshed to the max, so I think if you make ->purge > > optional and just call ->evict if that's not present, and document it > > all in the kerneldoc, then I think that's good. > > This is a good enough compromise to me. > > > I just don't think that encouraging drivers to distinguish between > > evict/purge is a good idea for almost all of them. > > Intel's shrinker checks the "madvise" status of BOs and then decides > what to do based on it. Perhaps we could move the decision-making about > purging to drivers and then it will be single evict() callback, but will > drivers really ever need to be responsible for this decision-making or > this will be an unnecessary boilerplate code in the drivers? I'll think > more about this. tbh I wouldn't worry about details, you've convinced me that some differentiation between evict and purged makes sense. And yeah maybe drivers should have a helper to check that instead of explicit argument, but that's a bikeshed color choice which should be fairly easy to adjust later on still. > Thank you all for taking time to look at this patchset. I'm preparing > the new version. Cheers, Daniel -- Daniel Vetter Software Engineer, Intel Corporation http://blog.ffwll.ch