Received: by 2002:a05:6a11:4021:0:0:0:0 with SMTP id ky33csp635392pxb; Thu, 23 Sep 2021 07:38:49 -0700 (PDT) X-Google-Smtp-Source: ABdhPJy15ZqYiZX0Ug02mFMPTuw0nTxFO9meqKId+nf2PxA5ave83k7a5gsiR5C+4qCm//DwW47b X-Received: by 2002:a92:ce48:: with SMTP id a8mr3943521ilr.115.1632407928813; Thu, 23 Sep 2021 07:38:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1632407928; cv=none; d=google.com; s=arc-20160816; b=aQzyDg2rtlSSxGe0+cigevzcH2wsyKrSLR77dCQKBRTuyiJAsJEGC+gO20FuowYSBM j4Hv7S9n4BT6Fuikv7OLtdnwUIUU/1Q0Hf9/Jown58uFZF9JzRqRysWOuc6T+GTBHcxs qZCjWxM1kBMQ9pqg3YmJ8I4JdiaA3BbNG1TzxbviMcganaLZeySidDBrnUx4GlYDjoqi jv2lLusrVxbmTd0TM07dir8YjOFJyz4XFF+lGcfo6VYOZ6G0FMILZI7oNCNoBLIRKpql 40Qu4NWJyivKF7T9AAJ+G3Es0MRBeoLAGpnWWtY7XYC3WWnyDAfuslAhXVBIeWl4jATU Fqcw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=HfoPwsxKHCcbtGaxkHIOM1bQ75QFFghWL979CqRbrFw=; b=eCpSVNbCRywAk3/E9aP09MlciTww4aaLpRBfN2WhbTLXG8k/Z72N7tcc4l6NaTx6ZS +cOeO8l6xnX0LSLBe4+qhoFYPiawFFU8ch5rgGwMYvsyHAgM2WfbjqksopVLnKp5IdD2 boJiHghZylDKw5bxTyLblqTqV+OVKBxodgWkR9ZvEjoSd2Vr0b54BMyik7LMVz8zpL6B XRezRvauTEu9F2fvFDU22uZPDuxpdAHasuj+c8bjn78n4GJEO94elggUpSo0iHqDu96C Zskv9jrqJxqK0t2jl7oeVvbyczCJzuNppCugHghIQEDhya8yzP3yr6HOx8/tVAYnSIdN OFYA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@alien8.de header.s=dkim header.b=Rx5O0nY0; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=alien8.de Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id p33si6165480jac.94.2021.09.23.07.38.27; Thu, 23 Sep 2021 07:38:48 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@alien8.de header.s=dkim header.b=Rx5O0nY0; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=alien8.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S241696AbhIWOiy (ORCPT + 99 others); Thu, 23 Sep 2021 10:38:54 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49854 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S241565AbhIWOix (ORCPT ); Thu, 23 Sep 2021 10:38:53 -0400 Received: from mail.skyhub.de (mail.skyhub.de [IPv6:2a01:4f8:190:11c2::b:1457]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D2136C061574; Thu, 23 Sep 2021 07:37:21 -0700 (PDT) Received: from zn.tnic (p200300ec2f0d6800d4c5e0e0616b3501.dip0.t-ipconnect.de [IPv6:2003:ec:2f0d:6800:d4c5:e0e0:616b:3501]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.skyhub.de (SuperMail on ZX Spectrum 128k) with ESMTPSA id 309CD1EC056B; Thu, 23 Sep 2021 16:37:16 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=alien8.de; s=dkim; t=1632407836; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:in-reply-to:in-reply-to: references:references; bh=HfoPwsxKHCcbtGaxkHIOM1bQ75QFFghWL979CqRbrFw=; b=Rx5O0nY0jYGcwQF/bSZi8maFoS15Mo0LtUhi5e0YG9Q+Yq8Ck3sI3RsyTcaBFSB3oH0EmV Yld3Ju3Dx5hg8HjdqY8dj0ZSLpHYHfhhZLc+WCOKEOhwNOGpMwoQ699jFogBTuB3ZCer5w /QVZE3GiP4L7x3fZ39Cl8wHquNRAnf0= Date: Thu, 23 Sep 2021 16:37:10 +0200 From: Borislav Petkov To: Yazen Ghannam Cc: Mukul Joshi , linux-edac@vger.kernel.org, x86@kernel.org, linux-kernel@vger.kernel.org, mingo@redhat.com, mchehab@kernel.org, amd-gfx@lists.freedesktop.org Subject: Re: [PATCHv3 2/2] drm/amdgpu: Register MCE notifier for Aldebaran RAS Message-ID: References: <20210913021311.12896-2-mukul.joshi@amd.com> <20210922193620.15925-1-mukul.joshi@amd.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Sep 23, 2021 at 02:29:07PM +0000, Yazen Ghannam wrote: > > + /* > > + * If the error was generated in UMC_V2, which belongs to GPU UMCs, > > + * and error occurred in DramECC (Extended error code = 0) then only > > + * process the error, else bail out. > > + */ > > + if (!m || !((smca_get_bank_type(m->bank) == SMCA_UMC_V2) && > > + (XEC(m->status, 0x1f) == 0x0))) > > The MCA_STATUS[ErrorCodeExt] field is bits [21:16], so the mask should be > 0x3f. > > > + return NOTIFY_DONE; > > + > > + /* > > + * If it is correctable error, return. > > + */ > > + if (mce_is_correctable(m)) > > + return NOTIFY_OK; > > Shouldn't this be "NOTIFY_DONE" if "don't care" about this error? I think the logic here is to stop calling any further consumers on the notify chain because this is a GPU correctable error and they can't do anything about it anyway, right? Or am I misreading it? -- Regards/Gruss, Boris. https://people.kernel.org/tglx/notes-about-netiquette