From: André Almeida
To: dri-devel@lists.freedesktop.org, amd-gfx@lists.freedesktop.org, linux-kernel@vger.kernel.org
Cc: kernel-dev@igalia.com, alexander.deucher@amd.com, christian.koenig@amd.com, pierre-eric.pelloux-prayer@amd.com, Simon Ser, Rob Clark, Pekka Paalanen, Daniel Vetter, Daniel Stone, 'Marek Olšák', Dave Airlie, Michel Dänzer, Samuel Pitoiset, Timur Kristóf, Bas Nieuwenhuizen, André Almeida
Subject: [RFC PATCH v3 0/4] drm: Standardize device reset notification
Date: Tue, 20 Jun 2023 21:57:15 -0300
Message-ID: <20230621005719.836857-1-andrealmeid@igalia.com>

Hi,

This is a new version of the documentation for DRM device resets. As I
dug deeper into the subject, I started to believe that part of the
problem was the lack of a DRM API to get reset information from the
driver. With an API, we can better standardize reset queries, increase
the code shared between DRM and Mesa, and make it easier to write
end-to-end tests.

So this patchset, along with the documentation, comes with a new IOCTL
and two implementations of it, for amdgpu and i915 (although only the
former was really tested).

This IOCTL uses the "context id" to query reset information, but this
might not be generic enough to be included in a DRM API. At least for
amdgpu, this information is encapsulated by libdrm, so one can't just
call the ioctl directly from the UMD as I was planning to, but a small
refactor can be done to expose the id. Anyway, I'm sharing it as it is
to gather feedback on whether this seems to work.

The amdgpu and i915 implementations are provided as a means of testing
and as examples, not as reference code yet, as the goal is more about
the interface itself than the driver parts.

For the documentation itself, after spending some time reading the
reset path in the kernel and in Mesa, I decided to rewrite it to better
reflect how it works, from bottom to top.

You can check the userspace side of the IOCTL here:

Mesa: https://gitlab.freedesktop.org/andrealmeid/mesa/-/commit/cd687b22fb32c21b23596c607003e2a495f465
libdrm: https://gitlab.freedesktop.org/andrealmeid/libdrm/-/commit/b31e5404893ee9a85d1aa67e81c2f58c1dac3c46

For testing, I use this Vulkan app that has an infinite loop in the
shader:

https://github.com/andrealmeid/vulkan-triangle-v1

Feedback is welcome!
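
To make the per-context query idea more concrete, here is a rough sketch
of how a UMD might interpret the result. Everything in it is an
assumption for illustration only: the struct layout, the field names
(dev_reset_count, ctx_reset_count) and the classify_reset() helper are
hypothetical and are not the uAPI proposed in this series. The ioctl
call itself is left out, so the caller passes the "fresh" counters
directly and the logic can be exercised without a GPU.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical reply layout, NOT the uAPI from this series. */
struct reset_info {
	uint32_t ctx_id;          /* context the query was made for */
	uint64_t dev_reset_count; /* resets seen by the whole device */
	uint64_t ctx_reset_count; /* resets blamed on this context */
};

enum reset_status {
	RESET_NONE,     /* nothing happened since the last query */
	RESET_INNOCENT, /* device was reset, another context caused it */
	RESET_GUILTY,   /* this context caused a reset */
};

/*
 * Compare the counters saved from the previous query with fresh ones.
 * In a real UMD, "now" would be filled in by the kernel; here it is
 * passed in by the caller.
 */
static enum reset_status classify_reset(const struct reset_info *prev,
					const struct reset_info *now)
{
	assert(prev && now && prev->ctx_id == now->ctx_id);

	/* Check the per-context counter first: guilt implies a reset. */
	if (now->ctx_reset_count != prev->ctx_reset_count)
		return RESET_GUILTY;
	if (now->dev_reset_count != prev->dev_reset_count)
		return RESET_INNOCENT;
	return RESET_NONE;
}
```

A robustness query in the UMD could then map these three results to
"guilty", "innocent" or "no error" for the application, which is the
kind of common code a generic interface would allow sharing.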

Thanks,
	André

v2: https://lore.kernel.org/all/20230227204000.56787-1-andrealmeid@igalia.com/
v1: https://lore.kernel.org/all/20230123202646.356592-1-andrealmeid@igalia.com/

André Almeida (4):
  drm/doc: Document DRM device reset expectations
  drm: Create DRM_IOCTL_GET_RESET
  drm/amdgpu: Implement DRM_IOCTL_GET_RESET
  drm/i915: Implement DRM_IOCTL_GET_RESET

 Documentation/gpu/drm-uapi.rst                | 51 ++++++++++++++++
 drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c        |  4 +-
 drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c       | 35 +++++++++++
 drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.h       |  5 ++
 drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c       |  1 +
 drivers/gpu/drm/amd/amdgpu/amdgpu_job.c       | 12 +++-
 drivers/gpu/drm/amd/amdgpu/amdgpu_job.h       |  2 +
 drivers/gpu/drm/drm_debugfs.c                 |  2 +
 drivers/gpu/drm/drm_ioctl.c                   | 58 +++++++++++++++++++
 drivers/gpu/drm/i915/gem/i915_gem_context.c   | 18 ++++++
 drivers/gpu/drm/i915/gem/i915_gem_context.h   |  2 +
 .../gpu/drm/i915/gem/i915_gem_context_types.h |  2 +
 drivers/gpu/drm/i915/i915_driver.c            |  2 +
 include/drm/drm_device.h                      |  3 +
 include/drm/drm_drv.h                         |  3 +
 include/uapi/drm/drm.h                        | 21 +++++++
 include/uapi/drm/drm_mode.h                   | 15 +++++
 17 files changed, 233 insertions(+), 3 deletions(-)

-- 
2.41.0