Date: Thu, 13 May 2021 16:57:37 +0200
From: Borislav Petkov
To: Alex Deucher
Cc: "Joshi, Mukul", x86-ml, "Kasiviswanathan, Harish", lkml, "amd-gfx@lists.freedesktop.org"
Subject: Re: [PATCH] drm/amdgpu: Register bad page handler for Aldebaran
References: <20210512013058.6827-1-mukul.joshi@amd.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Thu, May 13, 2021 at 10:32:45AM -0400, Alex Deucher wrote:
> Right. The sys admin can query the bad page count and decide when to
> retire the card.

Yap, although the driver should actively "tell" the sysadmin when some
critical counts of retired VRAM pages are reached because I doubt all
admins would go look at those counts on their own.

Btw, you say "admin" - am I to understand that those are some high end
GPU cards with ECC memory? If consumer grade stuff has this too, then
the driver should very much warn on such levels on its own because
normal users won't know what and where to look.

Other than that, the big picture sounds good to me.

Thx.

-- 
Regards/Gruss,
    Boris.

https://people.kernel.org/tglx/notes-about-netiquette