Subject: Re: [External] Re: [PATCH] x86/MCE/AMD, EDAC/mce_amd
To: Yazen Ghannam
Cc: bp@alien8.de, mchehab@kernel.org, tony.luck@intel.com, james.morse@arm.com, rrichter@marvell.com, linux-edac@vger.kernel.org, linux-kernel@vger.kernel.org, liuxian.1@bytedance.com
References: <20200809043559.9740-1-zhoufeng.zf@bytedance.com> <20200810190958.GA3406209@yaz-nikka.amd.com>
From: zhoufeng
Message-ID: <191bc22e-c1d3-16c3-8ec0-96f3db7c8e24@bytedance.com>
Date: Wed, 12 Aug 2020 12:08:43 +0800
In-Reply-To: <20200810190958.GA3406209@yaz-nikka.amd.com>
X-Mailing-List: linux-kernel@vger.kernel.org

OK. As more AMD servers are used at ByteDance, we hope to issue a patch to
the community as soon as possible. Thank you very much.

On 2020/8/11 at 3:10 AM, Yazen Ghannam wrote:
> On Sun, Aug 09, 2020 at 12:35:59PM +0800, Feng zhou wrote:
>> From: zhoufeng
>>
>> The edac_mce_amd module calls decode_dram_ecc() on AMD Family17h and
>> later systems. This function is used in amd64_edac_mod to do
>> system-specific decoding for DRAM ECC errors. The function takes a
>> "NodeId" as a parameter.
>>
>> In AMD documentation, NodeId is used to identify a physical die in a
>> system. This can be used to identify a node in the AMD_NB code and also
>> it is used with umc_normaddr_to_sysaddr().
>>
>> However, the input used for decode_dram_ecc() is currently the NUMA node
>> of a logical CPU. so this will cause the address translation function to
>> fail or report incorrect results.
>>
>> Signed-off-by: zhoufeng
>> ---
>>  drivers/edac/mce_amd.c | 2 +-
>>  1 file changed, 1 insertion(+), 1 deletion(-)
>>
>> diff --git a/drivers/edac/mce_amd.c b/drivers/edac/mce_amd.c
>> index 325aedf46ff2..73c805113322 100644
>> --- a/drivers/edac/mce_amd.c
>> +++ b/drivers/edac/mce_amd.c
>> @@ -996,7 +996,7 @@ static void decode_smca_error(struct mce *m)
>>  	}
>>
>>  	if (bank_type == SMCA_UMC && xec == 0 && decode_dram_ecc)
>> -		decode_dram_ecc(cpu_to_node(m->extcpu), m);
>> +		decode_dram_ecc(topology_physical_package_id(m->extcpu), m);
> This will break on Naples systems, because the NodeId and the physical
> package ID will not match.
>
> I can send a patch soon that will work for Naples, Rome, and later
> systems.
>
> Thanks,
> Yazen
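
The mismatch raised in the review above can be illustrated with a small
user-space sketch. It assumes a hypothetical layout in which one physical
package holds several dies, each with its own NodeId, and it assumes a simple
linear NodeId numbering. The struct and function names are made up for
illustration; they are not kernel APIs, and the numbering is not taken from
AMD documentation.

```c
#include <assert.h>

/*
 * Illustrative only: a hypothetical topology where each physical
 * package contains several dies and every die has its own NodeId.
 * The linear numbering used here is an assumption for this sketch.
 */
struct cpu_topo {
	unsigned int package_id;	/* physical socket */
	unsigned int die_in_package;	/* die index within that socket */
};

/* Hypothetical NodeId: dies are numbered consecutively across packages. */
static unsigned int sketch_node_id(const struct cpu_topo *t,
				   unsigned int dies_per_package)
{
	assert(dies_per_package > 0);
	assert(t->die_in_package < dies_per_package);
	return t->package_id * dies_per_package + t->die_in_package;
}

/* Returns 1 when the package ID happens to equal the sketch NodeId. */
static int package_id_matches_node_id(const struct cpu_topo *t,
				      unsigned int dies_per_package)
{
	return t->package_id == sketch_node_id(t, dies_per_package);
}
```

Under these assumptions the two IDs agree only when there is one die per
package; with several dies per package, passing the package ID where a NodeId
is expected selects the wrong die for most CPUs.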