Received: by 2002:ab2:3319:0:b0:1ef:7a0f:c32d with SMTP id i25csp141363lqc; Thu, 7 Mar 2024 12:45:53 -0800 (PST) X-Forwarded-Encrypted: i=3; AJvYcCWf2lg59QfFRgJnWAkIFs5B4v6q4lA4f+BWb7kLVTp63k1ppy8jo1PiPBhCQmjSaxmtyoyfMlFv3fe+PfL6LAi9IZIwyUTUc1p/q1D9KQ== X-Google-Smtp-Source: AGHT+IGLYqnAZ35idUIxaGSJMSi7MdySGiQ5k2/9WoQkKQFXAEN2WSNN7Z0Wx/XUvRik1gCe1Vbq X-Received: by 2002:a17:906:d8b4:b0:a45:ae7d:c8fc with SMTP id qc20-20020a170906d8b400b00a45ae7dc8fcmr5564190ejb.60.1709844353593; Thu, 07 Mar 2024 12:45:53 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1709844353; cv=pass; d=google.com; s=arc-20160816; b=y5OdMmj09MMsBmrVzDHmckEV9rAraoMnpvmF7oqDdLADsyo4MfnvGGL3ACTu4KKbKj fHnp3/najhIN7jkcpa61mmXo5pNMwGPoxavl9kqVSLi6GT7TpqDQUBhs90525GVpHYEV MeA1mdZs238Z/eO5V0YF9grL6UbCd0zL8me0oIzrpieis//cGEj7DnezNKlakWqFIcRY Znd8v/zE3r0XmB6qNgAa85rD8by8IVqS9FQRdxYDQjU2vv1OEWZd6O0nCZ8IE8vpveK1 +XGm8xoRlQ3TZoy5JerLgiddUsFiprZ1ssjvpe+SExvg5LPPLwmx3RTq10z2xVXPoEo2 2CwQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:list-unsubscribe:list-subscribe :list-id:precedence:dkim-signature; bh=w+3FhdmrXd3GVtm7VM7YbcxkD8Ujzz7d6MpBmECMceE=; fh=fHWBBERvxa+n3aT5kHCrth6NAukRAC6HadR5bFGIEx0=; b=Feqr0CM5O2q+7CQRIjyyZbGi8s09j9XCFVMRYXxL1ILLbjibpC0o0pICbU0MbyXuaw zeYmO0q301xJdNKEooV8YVW5WGWiiPkxgCBNFT26dg+C7cUe2uVM3r5BdqFo3b2KrtSg bB1DnJ8n/ufgHnplvuIjAylOg+sFuEOUZIVSP02f3YxpBwHsUTlIZ7tkzgDq3lsI6aEL TQshALf6yaGx8belEH6G7xcA2yyQJM+YybAE2Zx3pQnXEcpt1iXDrI3P3/AEJHq7/ThW UamcwdZ80cN6hzRdyzBoJtmP2ldI8tfQwTXvCjcOtueDbtqlBgysFr8V2vlKoq/CsIdZ sssQ==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@gmail.com header.s=20230601 header.b=Q6+dPddM; arc=pass (i=1 spf=pass spfdomain=gmail.com dkim=pass dkdomain=gmail.com dmarc=pass fromdomain=gmail.com); spf=pass (google.com: domain of linux-kernel+bounces-96217-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) smtp.mailfrom="linux-kernel+bounces-96217-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from am.mirrors.kernel.org (am.mirrors.kernel.org. [2604:1380:4601:e00::3]) by mx.google.com with ESMTPS id z13-20020a17090655cd00b00a4306aea0bcsi6876878ejp.146.2024.03.07.12.45.53 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 07 Mar 2024 12:45:53 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-96217-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) client-ip=2604:1380:4601:e00::3; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20230601 header.b=Q6+dPddM; arc=pass (i=1 spf=pass spfdomain=gmail.com dkim=pass dkdomain=gmail.com dmarc=pass fromdomain=gmail.com); spf=pass (google.com: domain of linux-kernel+bounces-96217-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) smtp.mailfrom="linux-kernel+bounces-96217-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by am.mirrors.kernel.org (Postfix) with ESMTPS id 2C2E11F21E41 for ; Thu, 7 Mar 2024 20:45:53 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 4779413959D; Thu, 7 Mar 2024 20:45:46 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="Q6+dPddM" Received: from mail-pl1-f175.google.com (mail-pl1-f175.google.com [209.85.214.175]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 03A551386AF for ; Thu, 7 Mar 2024 20:45:43 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.214.175 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1709844345; cv=none; b=drngBDCsLbtg/TVeqCQQQNAoROLKtl7akYlSoyPZU2qVYi28fenbiTXkTgozNgdTclXxKuq5Aq7WAMbNtwabq2Kw8zq0npsM5j8CHXUuXb6KhpICKw41ggWwU85Emr+29OGcIL3WCXjByr3UXhbC1XJNn5gCY6MLZC58Wb51uJk= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1709844345; c=relaxed/simple; bh=kgVsBO9fAWx4C4lKypuEs11y4Czdj0K9xrrVMK3VHyg=; h=MIME-Version:References:In-Reply-To:From:Date:Message-ID:Subject: To:Cc:Content-Type; b=QHBzunfnh/JLKG1b0rY+n3CjXPi31d52dReGMw0BX5ZWtjGBdyhGqdjdWJPZjuajAsEmsXpLgxOk/KmBZSA5k1vV9jtRuBwtt/ziahX5QUwdT30JsL3qPZKbO967gTtK31Bmb9HTZRb4/eswp2MQVE2/ZRnCxcz0D/WjSF0LEWM= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=Q6+dPddM; arc=none smtp.client-ip=209.85.214.175 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Received: by mail-pl1-f175.google.com with SMTP id d9443c01a7336-1dd0d46ecc3so10527035ad.2 for ; Thu, 07 Mar 2024 12:45:43 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1709844343; x=1710449143; darn=vger.kernel.org; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=w+3FhdmrXd3GVtm7VM7YbcxkD8Ujzz7d6MpBmECMceE=; b=Q6+dPddMQvTiVxca7XTVlnGPomZB8AR7UNm7M/7vf0ODqvYG3H8SKAEQavurdd4+1r iS93o0HZNFBjzPOGR7ZmNKOw32J8ylDF1yGGCzFCCi6kXBKbHOuPiQG/w5TItALMbmqH axqHSQMqyqqxedVqk0elxRvgPQZ8Yx6kEslrm+OhbId/NsbSHPhxue2XvHobsg7xBkt+ mhXarHp9JNoSijfsnvBGnK8CHslf30lPYlQEqgq9haO3bROcRR2nIwJZuqlWjBqIr41B BNvDrPwLMlwcwOG+2p1BtkyaG2MSoozF2/gm6+6jLS7xzwrfy4//85N3r+V4ZPI+c5Y/ /Jtg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1709844343; x=1710449143; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=w+3FhdmrXd3GVtm7VM7YbcxkD8Ujzz7d6MpBmECMceE=; b=K+kOeTZNkp303664d+f7R6rQY55ayyo6KvYe4hRDPGzHk2Wn+eAZXT1HgHQD1q5HRJ VP106oWalXhpMa+azd/TTLRj83JmpYVhK0FYVduHRDtSaDLKXIsqVVu9F0zLHVfn9O2E Vy8fDHcI7pStapn+xN+H7OBn5p5adLyhyNnYJeCsl91z6Qbhl8M2mcf6mkH342OelhEQ gLFXar+j+mNe2NMekXqlMIFPceegA4kQXLzeni/RpU7gCFL+XOUtJie0T59W0Gp71oHt 9kDh6ylj/pVmMkI4OGVu6lp4XwO1e46RPb9BnaZ0pbD57BFgcIBriC0iiKQ/pJjm0QiR IDIQ== X-Forwarded-Encrypted: i=1; AJvYcCUJz/SF1dQ5LuHsOrnn3z53xarRPK2cYFW9ZyZLoqyTNr8BhJe/ONAUD50xUnBkyPl+bLub+gjiC/+FM5QcHZyHKE/SjXdOnxSUgv6n X-Gm-Message-State: AOJu0Yy9UjL5zRQAF810XZegVuhKQZ0+G28yCThYJhyL0IpEPAr96ECp dOUAhUaflI0m4Juv5Y1luZ4JsdhikKCMrhZuzuU/kL6pe6/7hu9cpSYrCdr1ZrHx5rkiGE7aRJi LAXo00nwxTsGqYf3n/K1JlxzR8yk= X-Received: by 2002:a17:902:eccc:b0:1dc:b3bc:b7e1 with SMTP id a12-20020a170902eccc00b001dcb3bcb7e1mr11971789plh.1.1709844343172; Thu, 07 Mar 2024 12:45:43 -0800 (PST) Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 References: <20240307165932.3856952-1-sunil.khatri@amd.com> <20240307165932.3856952-3-sunil.khatri@amd.com> In-Reply-To: From: Alex Deucher Date: Thu, 7 Mar 2024 15:45:30 -0500 Message-ID: Subject: Re: [PATCH 2/2] drm/amdgpu: add vm fault information to devcoredump To: "Khatri, Sunil" Cc: Sunil Khatri , Alex Deucher , =?UTF-8?Q?Christian_K=C3=B6nig?= , Shashank Sharma , amd-gfx@lists.freedesktop.org, dri-devel@lists.freedesktop.org, linux-kernel@vger.kernel.org, Mukul Joshi , Arunpravin Paneer Selvam Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable On Thu, Mar 7, 2024 at 3:42=E2=80=AFPM Khatri, Sunil wro= te: > > > On 3/8/2024 12:44 AM, Alex Deucher wrote: > > On Thu, Mar 7, 2024 at 12:00=E2=80=AFPM Sunil Khatri wrote: > >> Add page fault information to the devcoredump. > >> > >> Output of devcoredump: > >> **** AMDGPU Device Coredump **** > >> version: 1 > >> kernel: 6.7.0-amd-staging-drm-next > >> module: amdgpu > >> time: 29.725011811 > >> process_name: soft_recovery_p PID: 1720 > >> > >> Ring timed out details > >> IP Type: 0 Ring Name: gfx_0.0.0 > >> > >> [gfxhub] Page fault observed > >> Faulty page starting at address 0x0000000000000000 > > Do you want a : before the address for consistency? > sure. > > > >> Protection fault status register:0x301031 > > How about a space after the : for consistency? > > > > For parsability, it may make more sense to just have a list of key valu= e pairs: > > [GPU page fault] > > hub: > > addr: > > status: > > [Ring timeout details] > > IP: > > ring: > > name: > > > > etc. > > Sure i agree but till now i was capturing information like we shared in > dmesg which is user readable. But surely one we have enough data i could > arrange all in key: value pairs like you suggest in a patch later if > that works ? Sure. Alex > > > > >> VRAM is lost due to GPU reset! > >> > >> Signed-off-by: Sunil Khatri > >> --- > >> drivers/gpu/drm/amd/amdgpu/amdgpu_reset.c | 14 +++++++++++++- > >> 1 file changed, 13 insertions(+), 1 deletion(-) > >> > >> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_reset.c b/drivers/gpu/d= rm/amd/amdgpu/amdgpu_reset.c > >> index 147100c27c2d..dd39e614d907 100644 > >> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_reset.c > >> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_reset.c > >> @@ -203,8 +203,20 @@ amdgpu_devcoredump_read(char *buffer, loff_t offs= et, size_t count, > >> coredump->ring->name); > >> } > >> > >> + if (coredump->adev) { > >> + struct amdgpu_vm_fault_info *fault_info =3D > >> + &coredump->adev->vm_manager.fault_info; > >> + > >> + drm_printf(&p, "\n[%s] Page fault observed\n", > >> + fault_info->vmhub ? "mmhub" : "gfxhub"); > >> + drm_printf(&p, "Faulty page starting at address 0x%016= llx\n", > >> + fault_info->addr); > >> + drm_printf(&p, "Protection fault status register:0x%x\= n", > >> + fault_info->status); > >> + } > >> + > >> if (coredump->reset_vram_lost) > >> - drm_printf(&p, "VRAM is lost due to GPU reset!\n"); > >> + drm_printf(&p, "\nVRAM is lost due to GPU reset!\n"); > >> if (coredump->adev->reset_info.num_regs) { > >> drm_printf(&p, "AMDGPU register dumps:\nOffset: V= alue:\n"); > >> > >> -- > >> 2.34.1 > >>