Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp21929988rwd; Fri, 30 Jun 2023 01:23:47 -0700 (PDT) X-Google-Smtp-Source: APBJJlGvgAd2PjMuY7g3d2AJ/a9YfLiuBug1kDbTmDSFbfJKBKMcxRqjNzOGr2VaJ9j5H1HMvynn X-Received: by 2002:a17:903:1c5:b0:1b1:a4e2:a2e6 with SMTP id e5-20020a17090301c500b001b1a4e2a2e6mr1854646plh.12.1688113426814; Fri, 30 Jun 2023 01:23:46 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1688113426; cv=none; d=google.com; s=arc-20160816; b=Mkd4VDJgkD70weIvcnYosgTiS3wV7o74dWEuJqXSLtwSgl8wPFELsOUH4mxzn22VYf Yk2ros0OCU1ah1ORH6WKRW2rFdTYcunN5Lapnh+HWn7DMoke+9zw821UOUM6pRXOPXAf LDudx4uvDcIx4MpNw4QBwnIjEzZThiXPIz6RLXqlAaeI+USsWA7BEKxYANdYaOxpolJF BGsGrmOwwODhHcIzh0jymVOEv+dt10OR8eIGevq1USdD8sIlSp3nrKi0fnb/oMjGnJtH 2oJJuTOmj7wXD6W21c1T2jD49a2s3+Yuhj34A26Q0YZgsbyHBn5Rk9dt0m0seBUBnSoP mWxA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date:dkim-signature; bh=obTtEevOojxKLVvRbcKfo3IQRsjax+AXsIM7W03QXrw=; fh=+n5yGjKW7s12OPJntKK0y6r3FErijkct4nemm8qvlp4=; b=v3bpSKCRSlwXphEed9OQkb/KVRpshewNoP4P5fCADXEbjpoLogdDBZyanGS72X1uzu X6JUwK4/Qz7UzioCIzdEP5xmi4DqG0pgPlh74+sqOxQtF6tYlCdHyypktDR6Vg0Q7mF4 XsxRalUUohqbb0XVt7FJ1aKIUv4sYYa72BHuMSaiC3C6DofuOx77w7qwyn+UGIADnbZv seO+5xVGWfDPS491cHI/nGZPISOr0k3naV3cP4lvO/y9BcplegRpWtKrgh18E7FhebIZ 3Hnp9kEh/zNLvwH/Y7doH2IzCcezE2YFFYUFwbHtv/Nr7otddKyCL196bAI/Bbw3xSEF TuOA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@collabora.com header.s=mail header.b=gvVru5Xs; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=collabora.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id 18-20020a170902ee5200b001b86e3129e2si886692plo.9.2023.06.30.01.23.31; Fri, 30 Jun 2023 01:23:46 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@collabora.com header.s=mail header.b=gvVru5Xs; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=collabora.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231888AbjF3IJb (ORCPT + 99 others); Fri, 30 Jun 2023 04:09:31 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:33694 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230233AbjF3IJ3 (ORCPT ); Fri, 30 Jun 2023 04:09:29 -0400 Received: from madras.collabora.co.uk (madras.collabora.co.uk [IPv6:2a00:1098:0:82:1000:25:2eeb:e5ab]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3DCE12D50; Fri, 30 Jun 2023 01:09:28 -0700 (PDT) Received: from localhost (unknown [IPv6:2a01:e0a:2c:6930:5cf4:84a1:2763:fe0d]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) (Authenticated sender: bbrezillon) by madras.collabora.co.uk (Postfix) with ESMTPSA id 2F9B06606FFA; Fri, 30 Jun 2023 09:09:26 +0100 (BST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=collabora.com; s=mail; t=1688112566; bh=ANYK13+kMbufo5WxKeOKpUDTsJYoR7z7qubmUSW56Qk=; h=Date:From:To:Cc:Subject:In-Reply-To:References:From; b=gvVru5Xspm5Eor9iojhhEZHcNdM8BgyHneoHzQoNyIrwnMPC1vl+y+Z1DbR+rQIbP 1Ws/c0b1ZyTNbZI7BKQW5BlDjO6JdOSwJL05qR3osL8szn0ZZzi86kNOtIZokYK8Un fE8HwBHTJ8LUHDnsRy7hTurihF7sXX5/XlLeFJKDsNEHnm3TSdv+EQEH0OHouPqKTk sSNp+BmboD68/ri9+asbzJhjKti7xah7ynXp6oW+mdJaPb4QCdnrletH3WJPYC7yvp t4o3KYbMBSY+5Ks9mIfKIWfRbgfaWlDUcP5flAlmTzdJq8C3QH5Bz3xsadVDSN1Tw7 JNcf8EwrVPW1g== Date: Fri, 30 Jun 2023 10:09:23 +0200 From: Boris Brezillon To: Danilo Krummrich Cc: airlied@gmail.com, daniel@ffwll.ch, tzimmermann@suse.de, mripard@kernel.org, corbet@lwn.net, christian.koenig@amd.com, bskeggs@redhat.com, Liam.Howlett@oracle.com, matthew.brost@intel.com, alexdeucher@gmail.com, ogabbay@kernel.org, bagasdotme@gmail.com, willy@infradead.org, jason@jlekstrand.net, dri-devel@lists.freedesktop.org, nouveau@lists.freedesktop.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, Donald Robson , Dave Airlie Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings Message-ID: <20230630100923.3fbab839@collabora.com> In-Reply-To: <20230630100252.7ff6421d@collabora.com> References: <20230629222651.3196-1-dakr@redhat.com> <20230629222651.3196-3-dakr@redhat.com> <20230630100252.7ff6421d@collabora.com> Organization: Collabora X-Mailer: Claws Mail 4.1.1 (GTK 3.24.38; x86_64-redhat-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 30 Jun 2023 10:02:52 +0200 Boris Brezillon wrote: > Hi Danilo, > > On Fri, 30 Jun 2023 00:25:18 +0200 > Danilo Krummrich wrote: > > > + * int driver_gpuva_remap(struct drm_gpuva_op *op, void *__ctx) > > + * { > > + * struct driver_context *ctx = __ctx; > > + * > > + * drm_gpuva_remap(ctx->prev_va, ctx->next_va, &op->remap); > > + * > > + * drm_gpuva_unlink(op->remap.unmap->va); > > + * kfree(op->remap.unmap->va); > > + * > > + * if (op->remap.prev) { > > + * drm_gpuva_link(ctx->prev_va); > > I ended up switching to dma_resv-based locking for the GEMs and I > wonder what the locking is supposed to look like in the async-mapping > case, where we insert/remove the VA nodes in the drm_sched::run_job() > path. > > What I have right now is something like: > > dma_resv_lock(vm->resv); > > // split done in drm_gpuva_sm_map(), each iteration > // of the loop is a call to the driver ->[re,un]map() > // hook > for_each_sub_op() { > > // Private BOs have their resv field pointing to the > // VM resv and we take the VM resv lock before calling > // drm_gpuva_sm_map() > if (vm->resv != gem->resv) > dma_resv_lock(gem->resv); > > drm_gpuva_[un]link(va); > gem_[un]pin(gem); > > if (vm->resv != gem->resv) > dma_resv_unlock(gem->resv); > } > > dma_resv_unlock(vm->resv); > > In practice, I don't expect things to deadlock, because the VM resv is > not supposed to be taken outside the VM context and the locking order > is always the same (VM lock first, and then each shared BO > taken/released independently), but I'm not super thrilled by this > nested lock, and I'm wondering if we shouldn't have a pass collecting > locks in a drm_exec context first, and then have > the operations executed. IOW, something like that: > > drm_exec_init(exec, DRM_EXEC_IGNORE_DUPLICATES) > drm_exec_until_all_locked(exec) { > // Dummy GEM is the dummy GEM object I use to make the VM > // participate in the locking without having to teach > // drm_exec how to deal with raw dma_resv objects. > ret = drm_exec_lock_obj(exec, vm->dummy_gem); > drm_exec_retry_on_contention(exec); > if (ret) > return ret; > > // Could take the form of drm_gpuva_sm_[un]map_acquire_locks() > // helpers > for_each_sub_op() { > ret = drm_exec_lock_obj(exec, gem); > if (ret) > return ret; > } > } > > // each iteration of the loop is a call to the driver > // ->[re,un]map() hook > for_each_sub_op() { > ... > gem_[un]pin_locked(gem); Just wanted to clarify that the pages have been pinned at VM_BIND job creation time, so this gem_pin_locked() call is effectively just a pin_count++, not the whole page allocation, which we don't want to happen in a dma-signaling path. > drm_gpuva_[un]link(va); > ... > } > > drm_exec_fini(exec);