From: Alex Deucher
Date: Thu, 13 May 2021 11:02:02 -0400
Subject: Re: [PATCH] drm/amdgpu: Register bad page handler for Aldebaran
To: Borislav Petkov
Cc: "Joshi, Mukul", x86-ml, "Kasiviswanathan, Harish", lkml, "amd-gfx@lists.freedesktop.org"
References: <20210512013058.6827-1-mukul.joshi@amd.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Thu, May 13, 2021 at 10:57 AM Borislav Petkov wrote:
>
> On Thu, May 13, 2021 at 10:32:45AM -0400, Alex Deucher wrote:
> > Right. The sys admin can query the bad page count and decide when to
> > retire the card.
>
> Yap, although the driver should actively "tell" the sysadmin when some
> critical counts of retired VRAM pages are reached because I doubt all
> admins would go look at those counts on their own.

I think we print something in the log as well when we hit the
threshold. I need to double check the code.

>
> Btw, you say "admin" - am I to understand that those are some high end
> GPU cards with ECC memory? If consumer grade stuff has this too, then
> the driver should very much warn on such levels on its own because
> normal users won't know what and where to look.
>

Currently it's only available on workstation and datacenter boards.

> Other than that, the big picture sounds good to me.

Thanks!

Alex

>
> Thx.
>
> --
> Regards/Gruss,
> Boris.
>
> https://people.kernel.org/tglx/notes-about-netiquette