Date: Thu, 24 Sep 2020 10:49:50 +0200
From: Daniel Vetter
To: Rob Clark
Cc: dri-devel, Rob Clark, Peter Zijlstra, linux-arm-msm, open list,
    Tim Murray, Tejun Heo, Daniel Vetter
Subject: Re: [PATCH 0/3] drm: commit_work scheduling
Message-ID: <20200924084950.GY438822@phenom.ffwll.local>
Mail-Followup-To: Rob Clark, dri-devel, Rob Clark, Peter Zijlstra,
    linux-arm-msm, open list, Tim Murray, Tejun Heo
References: <20200919193727.2093945-1-robdclark@gmail.com>
    <20200921092154.GJ438822@phenom.ffwll.local>
    <20200923152545.GQ438822@phenom.ffwll.local>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Sep 23, 2020 at 07:33:17PM -0700, Rob Clark wrote:
> On Wed, Sep 23, 2020 at 8:25 AM Daniel Vetter wrote:
> >
> > On Tue, Sep 22, 2020 at 07:48:10AM -0700, Rob Clark wrote:
> > > On Mon, Sep 21, 2020 at 11:59 PM Daniel Vetter wrote:
> > > >
> > > > On Mon, Sep 21, 2020 at 5:16 PM Rob Clark wrote:
> > > > >
> > > > > On Mon, Sep 21, 2020 at 2:21 AM Daniel Vetter wrote:
> > > > > >
> > > > > > On Sat, Sep 19, 2020 at 12:37:23PM -0700, Rob Clark
> > > > > > wrote:
> > > > > > > From: Rob Clark
> > > > > > >
> > > > > > > The android userspace treats the display pipeline as a realtime problem.
> > > > > > > And arguably, if your goal is to not miss frame deadlines (ie. vblank),
> > > > > > > it is. (See https://lwn.net/Articles/809545/ for the best explanation
> > > > > > > that I found.)
> > > > > > >
> > > > > > > But this presents a problem with using workqueues for non-blocking
> > > > > > > atomic commit_work(), because the SCHED_FIFO userspace thread(s) can
> > > > > > > preempt the worker. Which is not really the outcome you want.. once
> > > > > > > the required fences are scheduled, you want to push the atomic commit
> > > > > > > down to hw ASAP.
> > > > > > >
> > > > > > > But the decision of whether commit_work should be RT or not really
> > > > > > > depends on what userspace is doing. For a pure CFS userspace display
> > > > > > > pipeline, commit_work() should remain SCHED_NORMAL.
> > > > > > >
> > > > > > > To handle this, convert non-blocking commit_work() to use per-CRTC
> > > > > > > kthread workers, instead of system_unbound_wq. Per-CRTC workers are
> > > > > > > used to avoid serializing commits when userspace is using a per-CRTC
> > > > > > > update loop.
> > > > > > >
> > > > > > > A client-cap is introduced so that userspace can opt-in to SCHED_FIFO
> > > > > > > priority commit work.
> > > > > > >
> > > > > > > A potential issue is that since 616d91b68cd ("sched: Remove
> > > > > > > sched_setscheduler*() EXPORTs") we have limited RT priority levels,
> > > > > > > meaning that commit_work() ends up running at the same priority level
> > > > > > > as vblank-work. This shouldn't be a big problem *yet*, due to limited
> > > > > > > use of vblank-work at this point.
> > > > > > > And if it could be arranged that
> > > > > > > vblank-work is scheduled before signaling out-fences and/or sending
> > > > > > > pageflip events, it could probably work ok to use a single priority
> > > > > > > level for both commit-work and vblank-work.
> > > > > >
> > > > > > The part I don't like about this is that it all feels rather hacked
> > > > > > together, and if we add more stuff (or there's some different thing in the
> > > > > > system that also needs rt scheduling) then it doesn't compose.
> > > > >
> > > > > The ideal thing would be that userspace is in control of the
> > > > > priorities.. the setclientcap approach seemed like a reasonable way to
> > > > > give the drm-master a way to opt in.
> > > > >
> > > > > I suppose instead userspace could use sched_setscheduler().. but that
> > > > > would require userspace to be root, and would require some way to find
> > > > > the tid.
> > > >
> > > > Userspace already needs that for the SCHED_FIFO for surface-flinger.
> > > > Or is the problem that CAP_SYS_NICE is only good for your own
> > > > processes?
> > >
> > > tbh, I'm not completely sure offhand what gives surfaceflinger
> > > permission to set itself SCHED_FIFO
> > >
> > > (But on CrOS there are a few more pieces to the puzzle)
> > >
> > > > Other question I have for this is whether there's any recommendations
> > > > for naming the kthreads (since I guess that name is what becomes the
> > > > uapi for userspace to control this)?
> > > >
> > > > Otherwise I think "userspace calls sched_setscheduler on the right
> > > > kthreads" sounds like a good interface, since it lets userspace decide
> > > > how it all needs to fit together and compose. Anything we hard-code in
> > > > an ioctl is kinda lost cause. And we can choose the default values to
> > > > work reasonably well when the compositor runs at normal priority
> > > > (lowest niceness or something like that for the commit work).
> > >
> > > I don't really like the naming convention approach..
> > > what is to stop
> > > some unrelated process to name its thread the same thing to get a
> > > SCHED_FIFO boost..
> > >
> > > But we can stick with my idea to expose the thread id as a read-only
> > > CRTC property, for userspace to find the things to call
> > > sched_setscheduler() on. If for whatever reason the drm master is not
> > > privileged (or is running in a sandbox, etc), a small helper that has
> > > the necessary permissions could open the drm device to find the CRTC
> > > thread-ids and call sched_setscheduler()..
> >
> > Hm thread ids don't translate too well across PID namespaces I think ...
> > So that's another can of worms. And pidfd doesn't really work as a
> > property.
>
> hmm, I was kinda hoping there was already a solution for translating
> thread-ids, but hadn't had a chance to dig through it yet

You can translate them, and it happens automatically in process context
(iirc at least). But when we set the read-only prop we don't know which
process namespace the compositor is sitting in, so that translation isn't
doing us any good. I think there's a root namespace that the kernel uses,
but tbh I'm not sure how this all works.

> > I also thought kernel threads can be distinguished from others, so
> > userspace shouldn't be able to sneak in and get elevated by accident.
>
> I guess maybe you could look at the parent? I still would like to
> think that we could come up with something a bit less shaky than
> matching thread names by regexp..

ps marks up kernel threads with [], so there is a way. But I haven't
looked at what it is exactly that tells kernel threads apart from others.

But aside from that sounds like "match right kernel thread with regex and
set its scheduler class" is how this is currently done, if I'm
understanding what Tejun and Peter said correctly. Not pretty, but also
*shrug* ...
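
For illustration only, here is a rough userspace sketch of the "match right
kernel thread with regex and set its scheduler class" approach. It is not
taken from the patch series. It assumes kernel threads can be recognised by
the PF_KTHREAD bit (0x00200000 in include/linux/sched.h) in the flags field
of /proc/<pid>/stat, and the thread-name pattern, the helper names and the
priority value are all hypothetical. Checking the flag rather than only the
name addresses the concern above that an unrelated process could rename one
of its threads to match.

```python
import os
import re

# Assumption: PF_KTHREAD as defined in the kernel's include/linux/sched.h.
PF_KTHREAD = 0x00200000


def parse_stat(stat_line):
    """Return (comm, flags) parsed from the contents of /proc/<pid>/stat."""
    # comm sits in parentheses and may itself contain spaces or parentheses,
    # so locate it via the first '(' and the last ')'.
    lparen = stat_line.index("(")
    rparen = stat_line.rindex(")")
    comm = stat_line[lparen + 1:rparen]
    rest = stat_line[rparen + 1:].split()
    # rest[0] is the state (field 3), so flags (field 9) is rest[6].
    return comm, int(rest[6])


def is_kernel_thread(flags):
    """True if the task flags carry the PF_KTHREAD bit."""
    return bool(flags & PF_KTHREAD)


def find_kernel_threads(pattern, proc="/proc"):
    """Return pids of kernel threads whose comm fully matches the regex."""
    regex = re.compile(pattern)
    pids = []
    for entry in os.listdir(proc):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc, entry, "stat")) as f:
                comm, flags = parse_stat(f.read())
        except (OSError, ValueError, IndexError):
            continue  # task exited in the meantime, or the line was unparsable
        if is_kernel_thread(flags) and regex.fullmatch(comm):
            pids.append(int(entry))
    return pids


def make_fifo(pids, priority=1):
    """Switch the given tasks to SCHED_FIFO; needs CAP_SYS_NICE (or root)."""
    for pid in pids:
        os.sched_setscheduler(pid, os.SCHED_FIFO, os.sched_param(priority))
```

A privileged helper could then call something like
make_fifo(find_kernel_threads(r"crtc_commit:\d+")), where the pattern is a
made-up example and would have to follow whatever naming the kernel side
actually settles on.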
-Daniel

> BR,
> -R
>
> > -Daniel
> >
> > >
> > > BR,
> > > -R
> > >
> > > > -Daniel
> > > >
> > > > > Is there some way we could arrange for the per-crtc kthreads to be
> > > > > owned by the drm master? That would solve the "must be root" issue.
> > > > > And since the target audience is an atomic userspace, I suppose we
> > > > > could expose the tid as a read-only property on the crtc?
> > > > >
> > > > > BR,
> > > > > -R
> > > > >
> > > > > > So question to rt/worker folks: What's the best way to let userspace set
> > > > > > the scheduling mode and priorities of things the kernel does on its
> > > > > > behalf? Surely we're not the first ones where if userspace runs with some
> > > > > > rt priority it'll starve out the kernel workers that it needs. Hardcoding
> > > > > > something behind a subsystem ioctl (which just means every time userspace
> > > > > > changes what it does, we need a new such flag or mode) can't be the right
> > > > > > thing.
> > > > > >
> > > > > > Peter, Tejun?
> > > > > >
> > > > > > Thanks, Daniel
> > > > > >
> > > > > > >
> > > > > > > Rob Clark (3):
> > > > > > >   drm/crtc: Introduce per-crtc kworker
> > > > > > >   drm/atomic: Use kthread worker for nonblocking commits
> > > > > > >   drm: Add a client-cap to set scheduling mode
> > > > > > >
> > > > > > >  drivers/gpu/drm/drm_atomic_helper.c | 13 ++++++----
> > > > > > >  drivers/gpu/drm/drm_auth.c          |  4 ++++
> > > > > > >  drivers/gpu/drm/drm_crtc.c          | 37 +++++++++++++++++++++++++++++
> > > > > > >  drivers/gpu/drm/drm_ioctl.c         | 13 ++++++++++
> > > > > > >  include/drm/drm_atomic.h            | 31 ++++++++++++++++++++++++
> > > > > > >  include/drm/drm_crtc.h              | 10 ++++++++
> > > > > > >  include/uapi/drm/drm.h              | 13 ++++++++++
> > > > > > >  7 files changed, 117 insertions(+), 4 deletions(-)
> > > > > > >
> > > > > > > --
> > > > > > > 2.26.2
> > > > > > >
> > > > > > > _______________________________________________
> > > > > > > dri-devel mailing list
> > > > > > > dri-devel@lists.freedesktop.org
> > > > > > > https://lists.freedesktop.org/mailman/listinfo/dri-devel
> > > > > >
> > > > > > --
> > > > > > Daniel Vetter
> > > > > > Software Engineer, Intel Corporation
> > > > > > http://blog.ffwll.ch
> > > > > _______________________________________________
> > > > > dri-devel mailing list
> > > > > dri-devel@lists.freedesktop.org
> > > > > https://lists.freedesktop.org/mailman/listinfo/dri-devel
> > > >
> > > > --
> > > > Daniel Vetter
> > > > Software Engineer, Intel Corporation
> > > > http://blog.ffwll.ch
> >
> > --
> > Daniel Vetter
> > Software Engineer, Intel Corporation
> > http://blog.ffwll.ch

--
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch