Received: by 2002:a05:7412:a9a2:b0:e2:908c:2ebd with SMTP id o34csp2922481rdh; Mon, 30 Oct 2023 11:27:33 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEyKEDDHSUQvNEDig3e/73p0g0cPe8A5dLUKYhjMfXl/4Mmabw5Lel7l1iUmZMv25LIzJ3Y X-Received: by 2002:a17:903:11c4:b0:1c3:3363:8aea with SMTP id q4-20020a17090311c400b001c333638aeamr9267125plh.61.1698690453465; Mon, 30 Oct 2023 11:27:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1698690453; cv=none; d=google.com; s=arc-20160816; b=iVU1ozJo4//OMpiAR9GToEXC6Q6GLTGUbzKdvAOzdzQ900AGPKZSTMQw59btaZGgqm sMd8jbFeMkPdFrBwMQ1Dfx+8kD5p59qGhRDNO/fgAHtdmCo8LnnLmGOpcSOR5wN8jNSR QwjAY6X2rSAQyFgVvUjMCDZfCTfDN1aAihSjaa9Iy5XAOb6khzMIaaRkBV/JJUmx5qbV VmBaQ2hReZWpTVahNkDBmvWWpUvndZ6vzpOo0qg5GaMZlNuOclKKZnlKo6u7iQ9ov7Yu ArO+CV5kgRLHbYsxpIZwqwTjsRToc5ehizlaKOJKnMQQ/neNjQfRMktgF27J+VTluJWk AU5g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=68ru9uL7QOJiXYVvDv7F1f/kURJOj/MVfTQISNyePXw=; fh=4aCCW9z2U+Io0mumpqQAW3CkaUKjatv2cVAh5JYCWxs=; b=vtsuLN3CHxcVrbjyIw58Yd5wPyYOcCYdPJKQybh1c79/zTt6Kiu8VXc5aLnRRMF90k EOPWhpQyFulJJHTvq7djVJDhJMZnJoG4jGymLCYYBHGNC6FBYbBSATCkDtZlK1VRDS07 C27wujInHfF3TIJmHIbQYz8AeHhBHwKGuGENUYUKbX16qZNC1Lk6PEpj4mM1R5t76cBi 9A7n9FqTxthAytqtC1pAbqfjs+vW7WbIzKMGldslbWanTcoDHsm2qrS33tBeqIDHOxlp TfqrMAGg2a4SDuihPkrjnDBm62rEwaV9PZBKuGSLwhmy9GNO5deBI8xMc5201YnSNT0x f+TA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20230601 header.b=nRdhnSaw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:4 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from howler.vger.email (howler.vger.email. [2620:137:e000::3:4]) by mx.google.com with ESMTPS id d14-20020a170902cece00b001cc54873f98si1228903plg.149.2023.10.30.11.27.33 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 30 Oct 2023 11:27:33 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:4 as permitted sender) client-ip=2620:137:e000::3:4; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20230601 header.b=nRdhnSaw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:4 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by howler.vger.email (Postfix) with ESMTP id 5AD528049054; Mon, 30 Oct 2023 11:27:30 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at howler.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230509AbjJ3S1T (ORCPT + 99 others); Mon, 30 Oct 2023 14:27:19 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:57618 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229897AbjJ3S1R (ORCPT ); Mon, 30 Oct 2023 14:27:17 -0400 Received: from mail-lf1-x12e.google.com (mail-lf1-x12e.google.com [IPv6:2a00:1450:4864:20::12e]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C6898C1; Mon, 30 Oct 2023 11:27:13 -0700 (PDT) Received: by mail-lf1-x12e.google.com with SMTP id 2adb3069b0e04-507b96095abso6915436e87.3; Mon, 30 Oct 2023 11:27:13 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1698690432; x=1699295232; darn=vger.kernel.org; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=68ru9uL7QOJiXYVvDv7F1f/kURJOj/MVfTQISNyePXw=; b=nRdhnSawsq3CpRMZ4XLDDIgcrm7lpywKrnfv/dhbq6Myr+yUTEet6YfPt/H0mhD0bl LpmAq3oZ9ffiZcaS2LkesuhgoA8ByrFqjoG6Cj6/75JRhUNTFOPYZExgdi+7pEWHVsAJ fjja6ecCD2m2cPFvjNKTCjadgsx1ANhALMKh3CnHkc6HfK0mIdMaCDcufu5jU+/DSkXu 9HFadnJp2xI2RFbFAX+cCzIYY71SLON7/fQkdzN1qE11dMiM4EGcvz+tO3NZE6NbUSdN 1aguXvPDfVPg/B0RUbXdagF/eKueungn7fCfZLJZAG3NiMxVq2bt4e1sqWrsVXnu3rkN rRYA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1698690432; x=1699295232; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=68ru9uL7QOJiXYVvDv7F1f/kURJOj/MVfTQISNyePXw=; b=gC05pC3cdee8mwr8ByVLHPoyS7yCNLTPPildih/lv0LXPOJ/1l8PwEn36Q6ZknmWJy dNMGR9BbAWW6sufl+tYpqbqsiZa2+KfuaZZblTg3gdSapHl9LnUnp/dI46qafjYrfFlt kmoRZc8xBbo1No5rHmwdbVbCA5Oyzjkb/E6q5klb0cK66Li/PETXmzN3zftPRKIMEBsU 7oqWVyAWrZOsbz+saw6taCcxR9KvgTAVLNtqpO6GJzVwbaVTHDjd0W/xnsmE9zo8m4h9 CVfG5kysY7RaTfhta5Ofm95jo1nOgB13rZ3HmbhMEOoEx8zr+YCf5Hz5w7IZSY1KuDVA QhXw== X-Gm-Message-State: AOJu0Yz0zzDL1gk883Ai6tgbMmcYwHZp66U0VldzLFWX7groOLYYyDJg vJDqCJyhAwRc1dPaM25XBauqJh6KOHz1Onfi60MMSMQx X-Received: by 2002:a05:6512:1107:b0:508:1178:efa4 with SMTP id l7-20020a056512110700b005081178efa4mr9236721lfg.55.1698690431567; Mon, 30 Oct 2023 11:27:11 -0700 (PDT) MIME-Version: 1.0 References: <20231027165859.395638-1-robdclark@gmail.com> <20231027165859.395638-7-robdclark@gmail.com> <597b5bb3-ca75-4662-9904-12c4d8e9101a@amd.com> <836a6052-ad23-4a5f-9eb5-a7b5361b568c@gmail.com> In-Reply-To: <836a6052-ad23-4a5f-9eb5-a7b5361b568c@gmail.com> From: Rob Clark Date: Mon, 30 Oct 2023 11:26:59 -0700 Message-ID: Subject: Re: [PATCH 6/7] drm/exec: Pass in initial # of objects To: =?UTF-8?Q?Christian_K=C3=B6nig?= Cc: =?UTF-8?Q?Christian_K=C3=B6nig?= , Kefeng Wang , Karol Herbst , "open list:DRM DRIVER FOR NVIDIA GEFORCE/QUADRO GPUS" , dri-devel@lists.freedesktop.org, Mario Limonciello , Srinivasan Shanmugam , David Airlie , Rob Clark , Guchun Chen , Shashank Sharma , "open list:RADEON and AMDGPU DRM DRIVERS" , Dong Chenchen , Luben Tuikov , Philip Yang , Daniel Vetter , Arunpravin Paneer Selvam , linux-arm-msm@vger.kernel.org, Maarten Lankhorst , Maxime Ripard , Suren Baghdasaryan , Jack Xiao , Jonathan Kim , freedreno@lists.freedesktop.org, Felix Kuehling , "Pan, Xinhui" , open list , Thomas Zimmermann , Alex Deucher , Andrew Morton , Lang Yu Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Spam-Status: No, score=-0.6 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE, SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on howler.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (howler.vger.email [0.0.0.0]); Mon, 30 Oct 2023 11:27:30 -0700 (PDT) On Mon, Oct 30, 2023 at 9:01=E2=80=AFAM Christian K=C3=B6nig wrote: > > Am 30.10.23 um 14:38 schrieb Rob Clark: > > On Mon, Oct 30, 2023 at 1:05=E2=80=AFAM Christian K=C3=B6nig > > wrote: > >> Am 27.10.23 um 18:58 schrieb Rob Clark: > >>> From: Rob Clark > >>> > >>> In cases where the # is known ahead of time, it is silly to do the ta= ble > >>> resize dance. > >> Ah, yes that was my initial implementation as well, but I ditched that > >> because nobody actually used it. > >> > >> One comment below. > >> > >>> Signed-off-by: Rob Clark > >>> --- > >>> drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c | 2 +- > >>> drivers/gpu/drm/amd/amdgpu/amdgpu_csa.c | 4 ++-- > >>> drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 4 ++-- > >>> drivers/gpu/drm/amd/amdgpu/amdgpu_mes.c | 4 ++-- > >>> drivers/gpu/drm/drm_exec.c | 15 ++++++++++++--- > >>> drivers/gpu/drm/nouveau/nouveau_exec.c | 2 +- > >>> drivers/gpu/drm/nouveau/nouveau_uvmm.c | 2 +- > >>> include/drm/drm_exec.h | 2 +- > >>> 8 files changed, 22 insertions(+), 13 deletions(-) > >>> > >>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c b/drivers/gpu/drm= /amd/amdgpu/amdgpu_cs.c > >>> index efdb1c48f431..d27ca8f61929 100644 > >>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c > >>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c > >>> @@ -65,7 +65,7 @@ static int amdgpu_cs_parser_init(struct amdgpu_cs_p= arser *p, > >>> } > >>> > >>> amdgpu_sync_create(&p->sync); > >>> - drm_exec_init(&p->exec, DRM_EXEC_INTERRUPTIBLE_WAIT); > >>> + drm_exec_init(&p->exec, DRM_EXEC_INTERRUPTIBLE_WAIT, 0); > >>> return 0; > >>> } > >>> > >>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_csa.c b/drivers/gpu/dr= m/amd/amdgpu/amdgpu_csa.c > >>> index 720011019741..796fa6f1420b 100644 > >>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_csa.c > >>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_csa.c > >>> @@ -70,7 +70,7 @@ int amdgpu_map_static_csa(struct amdgpu_device *ade= v, struct amdgpu_vm *vm, > >>> struct drm_exec exec; > >>> int r; > >>> > >>> - drm_exec_init(&exec, DRM_EXEC_INTERRUPTIBLE_WAIT); > >>> + drm_exec_init(&exec, DRM_EXEC_INTERRUPTIBLE_WAIT, 0); > >>> drm_exec_until_all_locked(&exec) { > >>> r =3D amdgpu_vm_lock_pd(vm, &exec, 0); > >>> if (likely(!r)) > >>> @@ -110,7 +110,7 @@ int amdgpu_unmap_static_csa(struct amdgpu_device = *adev, struct amdgpu_vm *vm, > >>> struct drm_exec exec; > >>> int r; > >>> > >>> - drm_exec_init(&exec, DRM_EXEC_INTERRUPTIBLE_WAIT); > >>> + drm_exec_init(&exec, DRM_EXEC_INTERRUPTIBLE_WAIT, 0); > >>> drm_exec_until_all_locked(&exec) { > >>> r =3D amdgpu_vm_lock_pd(vm, &exec, 0); > >>> if (likely(!r)) > >>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c b/drivers/gpu/dr= m/amd/amdgpu/amdgpu_gem.c > >>> index ca4d2d430e28..16f1715148ad 100644 > >>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c > >>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c > >>> @@ -203,7 +203,7 @@ static void amdgpu_gem_object_close(struct drm_ge= m_object *obj, > >>> struct drm_exec exec; > >>> long r; > >>> > >>> - drm_exec_init(&exec, DRM_EXEC_IGNORE_DUPLICATES); > >>> + drm_exec_init(&exec, DRM_EXEC_IGNORE_DUPLICATES, 0); > >>> drm_exec_until_all_locked(&exec) { > >>> r =3D drm_exec_prepare_obj(&exec, &bo->tbo.base, 1); > >>> drm_exec_retry_on_contention(&exec); > >>> @@ -739,7 +739,7 @@ int amdgpu_gem_va_ioctl(struct drm_device *dev, v= oid *data, > >>> } > >>> > >>> drm_exec_init(&exec, DRM_EXEC_INTERRUPTIBLE_WAIT | > >>> - DRM_EXEC_IGNORE_DUPLICATES); > >>> + DRM_EXEC_IGNORE_DUPLICATES, 0); > >>> drm_exec_until_all_locked(&exec) { > >>> if (gobj) { > >>> r =3D drm_exec_lock_obj(&exec, gobj); > >>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_mes.c b/drivers/gpu/dr= m/amd/amdgpu/amdgpu_mes.c > >>> index b6015157763a..3c351941701e 100644 > >>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_mes.c > >>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_mes.c > >>> @@ -1105,7 +1105,7 @@ int amdgpu_mes_ctx_map_meta_data(struct amdgpu_= device *adev, > >>> > >>> amdgpu_sync_create(&sync); > >>> > >>> - drm_exec_init(&exec, 0); > >>> + drm_exec_init(&exec, 0, 0); > >>> drm_exec_until_all_locked(&exec) { > >>> r =3D drm_exec_lock_obj(&exec, > >>> &ctx_data->meta_data_obj->tbo.ba= se); > >>> @@ -1176,7 +1176,7 @@ int amdgpu_mes_ctx_unmap_meta_data(struct amdgp= u_device *adev, > >>> struct drm_exec exec; > >>> long r; > >>> > >>> - drm_exec_init(&exec, 0); > >>> + drm_exec_init(&exec, 0, 0); > >>> drm_exec_until_all_locked(&exec) { > >>> r =3D drm_exec_lock_obj(&exec, > >>> &ctx_data->meta_data_obj->tbo.ba= se); > >>> diff --git a/drivers/gpu/drm/drm_exec.c b/drivers/gpu/drm/drm_exec.c > >>> index 5d2809de4517..27d11c20d148 100644 > >>> --- a/drivers/gpu/drm/drm_exec.c > >>> +++ b/drivers/gpu/drm/drm_exec.c > >>> @@ -69,16 +69,25 @@ static void drm_exec_unlock_all(struct drm_exec *= exec) > >>> * drm_exec_init - initialize a drm_exec object > >>> * @exec: the drm_exec object to initialize > >>> * @flags: controls locking behavior, see DRM_EXEC_* defines > >>> + * @nr: the initial # of objects > >>> * > >>> * Initialize the object and make sure that we can track locked ob= jects. > >>> + * > >>> + * If nr is non-zero then it is used as the initial objects table si= ze. > >>> + * In either case, the table will grow (be re-allocated) on demand. > >>> */ > >>> -void drm_exec_init(struct drm_exec *exec, uint32_t flags) > >>> +void drm_exec_init(struct drm_exec *exec, uint32_t flags, unsigned n= r) > >>> { > >>> + size_t sz =3D PAGE_SIZE; > >>> + > >>> + if (nr) > >>> + sz =3D (size_t)nr * sizeof(void *); > >>> + > >>> exec->flags =3D flags; > >>> - exec->objects =3D kmalloc(PAGE_SIZE, GFP_KERNEL); > >>> + exec->objects =3D kmalloc(sz, GFP_KERNEL); > >> Please use k*v*malloc() here since we can't predict how large that wil= l be. > > or __GFP_NOWARN? If userspace (or kasan) is cheeky and asks for ~0 > > objects, we should probably just fail? > > Oh, good point! If this value is controlled by userspace we must be much > more careful. > > Instead of __GFP_NOWARN or any other workaround we should use > kvmalloc_array() here. > > Maybe turn the code upside down, in other words something like this here: > > if (!nr) > nr =3D PAGE_SIZE / sizeof(void *); > > exec->objects =3D kvmalloc_array(nr, sizeof(void *), GFP_KERNEL); > exec->max_objects =3D exec->objects ? nr : 0; oh, good point BR, -R > > Regards, > Christian. > > > > > BR, > > -R > > > >> With that fixed the patch is Reviewed-by: Christian K=C3=B6nig > >> . > >> > >> Regards, > >> Christian. > >> > >>> /* If allocation here fails, just delay that till the first us= e */ > >>> - exec->max_objects =3D exec->objects ? PAGE_SIZE / sizeof(void *= ) : 0; > >>> + exec->max_objects =3D exec->objects ? sz / sizeof(void *) : 0; > >>> exec->num_objects =3D 0; > >>> exec->contended =3D DRM_EXEC_DUMMY; > >>> exec->prelocked =3D NULL; > >>> diff --git a/drivers/gpu/drm/nouveau/nouveau_exec.c b/drivers/gpu/drm= /nouveau/nouveau_exec.c > >>> index 19024ce21fbb..f5930cc0b3fb 100644 > >>> --- a/drivers/gpu/drm/nouveau/nouveau_exec.c > >>> +++ b/drivers/gpu/drm/nouveau/nouveau_exec.c > >>> @@ -103,7 +103,7 @@ nouveau_exec_job_submit(struct nouveau_job *job) > >>> > >>> nouveau_uvmm_lock(uvmm); > >>> drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT | > >>> - DRM_EXEC_IGNORE_DUPLICATES); > >>> + DRM_EXEC_IGNORE_DUPLICATES, 0); > >>> drm_exec_until_all_locked(exec) { > >>> struct drm_gpuva *va; > >>> > >>> diff --git a/drivers/gpu/drm/nouveau/nouveau_uvmm.c b/drivers/gpu/drm= /nouveau/nouveau_uvmm.c > >>> index aae780e4a4aa..3a9331a1c830 100644 > >>> --- a/drivers/gpu/drm/nouveau/nouveau_uvmm.c > >>> +++ b/drivers/gpu/drm/nouveau/nouveau_uvmm.c > >>> @@ -1288,7 +1288,7 @@ nouveau_uvmm_bind_job_submit(struct nouveau_job= *job) > >>> } > >>> > >>> drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT | > >>> - DRM_EXEC_IGNORE_DUPLICATES); > >>> + DRM_EXEC_IGNORE_DUPLICATES, 0); > >>> drm_exec_until_all_locked(exec) { > >>> list_for_each_op(op, &bind_job->ops) { > >>> struct drm_gpuva_op *va_op; > >>> diff --git a/include/drm/drm_exec.h b/include/drm/drm_exec.h > >>> index b5bf0b6da791..f1a66c048721 100644 > >>> --- a/include/drm/drm_exec.h > >>> +++ b/include/drm/drm_exec.h > >>> @@ -135,7 +135,7 @@ static inline bool drm_exec_is_contended(struct d= rm_exec *exec) > >>> return !!exec->contended; > >>> } > >>> > >>> -void drm_exec_init(struct drm_exec *exec, uint32_t flags); > >>> +void drm_exec_init(struct drm_exec *exec, uint32_t flags, unsigned n= r); > >>> void drm_exec_fini(struct drm_exec *exec); > >>> bool drm_exec_cleanup(struct drm_exec *exec); > >>> int drm_exec_lock_obj(struct drm_exec *exec, struct drm_gem_object= *obj); >