Received: by 2002:a05:6a10:87d6:0:0:0:0 with SMTP id g22csp670578pxr; Mon, 11 Apr 2022 04:19:25 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzD+ABUls4yeHeigw6wuahK0BijNo32xL9UEGcLEXQV75qBQX02m9rKDO5hEe/tVI4jEu0/ X-Received: by 2002:a05:6402:288d:b0:41d:7db9:de95 with SMTP id eg13-20020a056402288d00b0041d7db9de95mr3730697edb.206.1649675964828; Mon, 11 Apr 2022 04:19:24 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1649675964; cv=none; d=google.com; s=arc-20160816; b=BGErwJqpwre9hRP2gZIxolhTo+YgVl0Swvhbz5QMqd5QQY2+9OF0yw/ob3W1ceNiVq L8KKTPR7Beh2yPDMm5si644tZboMGA0KRkFJG6ZkRN0TTByu2jBuxazTNcNgPXF6czoo O7WxSPWycVCkOpF4/enrax+edbIleFhMwYPr/XdTg9xBMqBwEvGSfwJPHs4UO+oj4ou0 svFLDnaxDy849omrGux2DdbTmYFPLWDRvKZSd3MvOgPXTlSegCMbEjR1q0nRykYShj6B 5C5s7+WEAPFp7kRFt01whgBeW5c/KwjnxYV9hcPkwtMZoPRdWhwbKYoySlJ+QbcCcG4g FHng== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:message-id:date:subject:cc:to:from:sender :hmm_source_type:hmm_attache_num:hmm_source_ip; bh=XCapB5y9FOhObXwn+Y+DmPgQdNVqp6BWPduGuL5+fls=; b=LD/3j44Y520qC0B0l0aI3qR/Sid7ktPUWup2oxJUmXX3cMIY4rSKCZ9yhfC1KT+Sgc Ps2K/Q1Guw//3AhH1fJdKFZhAsQH3ZoBoMlhZ8z3tYHuFQ/5gGbBI+HhJkK6sDOh8UMz QXLoslR8KsWR+Kt+BummCmEDqwd9Twr/39GueO9spVUWvDVJQz7mUz1ts7Hw3b3tI0he DmYYnfWpfOGSus8zCrlzwqAzZ2ynY+W0/90NC7Y+5K3Xn+AVNr+JfLSNRnuEBJ9b+95v 6yTzXQt9nYA5Lv90A8j0wfecIa2yuasyAMF5Qbgr5mgJDRjZuIQzE+DGlUpUH9Pcykyg 5YWA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id y14-20020a056402134e00b0041d650cda3asi5336418edw.577.2022.04.11.04.18.59; Mon, 11 Apr 2022 04:19:24 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230206AbiDJDJL (ORCPT + 99 others); Sat, 9 Apr 2022 23:09:11 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43696 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229861AbiDJDJJ (ORCPT ); Sat, 9 Apr 2022 23:09:09 -0400 Received: from chinatelecom.cn (prt-mail.chinatelecom.cn [42.123.76.223]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id AD61B74DDF; Sat, 9 Apr 2022 20:06:55 -0700 (PDT) HMM_SOURCE_IP: 172.18.0.218:46286.1692545743 HMM_ATTACHE_NUM: 0000 HMM_SOURCE_TYPE: SMTP Received: from clientip-202.80.192.38 (unknown [172.18.0.218]) by chinatelecom.cn (HERMES) with SMTP id D453B2800A9; Sun, 10 Apr 2022 11:06:45 +0800 (CST) X-189-SAVE-TO-SEND: +liuxp11@chinatelecom.cn Received: from ([172.18.0.218]) by app0025 with ESMTP id fa8e5ec638234b88beadec1941b045a2 for rafael@kernel.org; Sun, 10 Apr 2022 11:06:52 CST X-Transaction-ID: fa8e5ec638234b88beadec1941b045a2 X-Real-From: liuxp11@chinatelecom.cn X-Receive-IP: 172.18.0.218 X-MEDUSA-Status: 0 Sender: liuxp11@chinatelecom.cn From: Liu Xinpeng To: rafael@kernel.org, dave.hansen@linux.intel.com, x86@kernel.org, hpa@zytor.com, keescook@chromium.org, anton@enomsg.org, ccross@android.com, robert.moore@intel.com, tony.luck@intel.com, lenb@kernel.org, james.morse@arm.com, bp@alien8.de, tglx@linutronix.de, mingo@redhat.com, ying.huang@intel.com, gong.chen@linux.intel.com Cc: linux-kernel@vger.kernel.org, linux-acpi@vger.kernel.org, Liu Xinpeng Subject: [PATCH v6] ACPI: APEI: fix missing erst record id Date: Sun, 10 Apr 2022 11:06:39 +0800 Message-Id: <1649559999-5193-1-git-send-email-liuxp11@chinatelecom.cn> X-Mailer: git-send-email 1.8.3.1 X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,SPF_HELO_PASS, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Read a record is cleared by others, but the deleted record cache entry is still created by erst_get_record_id_next. When next enumerate the records, get the cached deleted record, then erst_read return -ENOENT and try to get next record, loop back to first ID will return 0 in function __erst_record_id_cache_add_one and then set record_id as APEI_ERST_INVALID_RECORD_ID, finished this time read operation. It will result in read the records just in the cache hereafter. This patch cleared the deleted record cache, fix the issue that "./erst-inject -p" shows record counts not equal to "./erst-inject -n". A reproducer of the problem(retry many times): [root@localhost erst-inject]# ./erst-inject -c 0xaaaaa00011 [root@localhost erst-inject]# ./erst-inject -p rc: 273 rcd sig: CPER rcd id: 0xaaaaa00012 rc: 273 rcd sig: CPER rcd id: 0xaaaaa00013 rc: 273 rcd sig: CPER rcd id: 0xaaaaa00014 [root@localhost erst-inject]# ./erst-inject -i 0xaaaaa000006 [root@localhost erst-inject]# ./erst-inject -i 0xaaaaa000007 [root@localhost erst-inject]# ./erst-inject -i 0xaaaaa000008 [root@localhost erst-inject]# ./erst-inject -p rc: 273 rcd sig: CPER rcd id: 0xaaaaa00012 rc: 273 rcd sig: CPER rcd id: 0xaaaaa00013 rc: 273 rcd sig: CPER rcd id: 0xaaaaa00014 [root@localhost erst-inject]# ./erst-inject -n total error record count: 6 Changelog: v1->v2 Fix style problems v2->v3 Fix apei_read_mce called erst_get_record_id_next and modify the commit message. v3->v4 Add erst_clear_cache in another retry. v4->v5 Implement a new function for looking for a specific record type, suggested by Tony Luck. v5->v6 In function erst_clear_cache, using mutex_lock insead of mutex_lock_interruptible ensure that the cache is cleared. If erst_read return value is less than record head length, consider it as EIO; other error return to caller; creatorid is not wanted, consider it ENOENT and keep the cache for other types. Signed-off-by: Liu Xinpeng Reviewed-by: Tony Luck --- arch/x86/kernel/cpu/mce/apei.c | 8 ++-- drivers/acpi/apei/erst-dbg.c | 3 +- drivers/acpi/apei/erst.c | 77 +++++++++++++++++++++++++++++++--- include/acpi/apei.h | 2 + 4 files changed, 78 insertions(+), 12 deletions(-) diff --git a/arch/x86/kernel/cpu/mce/apei.c b/arch/x86/kernel/cpu/mce/apei.c index 0e3ae64d3b76..717192915f28 100644 --- a/arch/x86/kernel/cpu/mce/apei.c +++ b/arch/x86/kernel/cpu/mce/apei.c @@ -177,16 +177,14 @@ ssize_t apei_read_mce(struct mce *m, u64 *record_id) /* no more record */ if (*record_id == APEI_ERST_INVALID_RECORD_ID) goto out; - rc = erst_read(*record_id, &rcd.hdr, sizeof(rcd)); + rc = erst_read_record(*record_id, &rcd.hdr, sizeof(rcd), sizeof(rcd), + &CPER_CREATOR_MCE); /* someone else has cleared the record, try next one */ if (rc == -ENOENT) goto retry; else if (rc < 0) goto out; - /* try to skip other type records in storage */ - else if (rc != sizeof(rcd) || - !guid_equal(&rcd.hdr.creator_id, &CPER_CREATOR_MCE)) - goto retry; + memcpy(m, &rcd.mce, sizeof(*m)); rc = sizeof(*m); out: diff --git a/drivers/acpi/apei/erst-dbg.c b/drivers/acpi/apei/erst-dbg.c index c740f0faad39..8bc71cdc2270 100644 --- a/drivers/acpi/apei/erst-dbg.c +++ b/drivers/acpi/apei/erst-dbg.c @@ -111,7 +111,8 @@ static ssize_t erst_dbg_read(struct file *filp, char __user *ubuf, goto out; } retry: - rc = len = erst_read(id, erst_dbg_buf, erst_dbg_buf_len); + rc = len = erst_read_record(id, erst_dbg_buf, erst_dbg_buf_len, + erst_dbg_buf_len, NULL); /* The record may be cleared by others, try read next record */ if (rc == -ENOENT) goto retry_next; diff --git a/drivers/acpi/apei/erst.c b/drivers/acpi/apei/erst.c index 698d67cee052..ca78b584a65e 100644 --- a/drivers/acpi/apei/erst.c +++ b/drivers/acpi/apei/erst.c @@ -856,6 +856,74 @@ ssize_t erst_read(u64 record_id, struct cper_record_header *record, } EXPORT_SYMBOL_GPL(erst_read); +static void erst_clear_cache(u64 record_id) +{ + int i; + u64 *entries; + + mutex_lock(&erst_record_id_cache.lock); + + entries = erst_record_id_cache.entries; + for (i = 0; i < erst_record_id_cache.len; i++) { + if (entries[i] == record_id) + entries[i] = APEI_ERST_INVALID_RECORD_ID; + } + __erst_record_id_cache_compact(); + + mutex_unlock(&erst_record_id_cache.lock); +} + +ssize_t erst_read_record(u64 record_id, struct cper_record_header *record, + size_t buflen, size_t recordhead, const guid_t *creatorid) +{ + ssize_t len; + + /* + * if creatorid is NULL, read any record for erst-dbg module + */ + if (creatorid == NULL) { + len = erst_read(record_id, record, buflen); + if (len == -ENOENT) + erst_clear_cache(record_id); + + return len; + } + + len = erst_read(record_id, record, buflen); + /* + * if erst_read return value is -ENOENT skip to next record_id, + * and clear the record_id cache. + */ + if (len == -ENOENT) { + erst_clear_cache(record_id); + goto out; + } + + if (len < 0) + goto out; + + /* + * if erst_read return value is less than record head length, + * consider it as -EIO, and clear the record_id cache. + */ + if (len < recordhead) { + len = -EIO; + erst_clear_cache(record_id); + goto out; + } + + /* + * if creatorid is not wanted, consider it as not found, + * for skipping to next record_id. + */ + if (!guid_equal(&record->creator_id, creatorid)) + len = -ENOENT; + +out: + return len; +} +EXPORT_SYMBOL_GPL(erst_read_record); + int erst_clear(u64 record_id) { int rc, i; @@ -996,16 +1064,13 @@ static ssize_t erst_reader(struct pstore_record *record) goto out; } - len = erst_read(record_id, &rcd->hdr, rcd_len); + len = erst_read_record(record_id, &rcd->hdr, rcd_len, sizeof(*rcd), + &CPER_CREATOR_PSTORE); /* The record may be cleared by others, try read next record */ if (len == -ENOENT) goto skip; - else if (len < 0 || len < sizeof(*rcd)) { - rc = -EIO; + else if (len < 0) goto out; - } - if (!guid_equal(&rcd->hdr.creator_id, &CPER_CREATOR_PSTORE)) - goto skip; record->buf = kmalloc(len, GFP_KERNEL); if (record->buf == NULL) { diff --git a/include/acpi/apei.h b/include/acpi/apei.h index afaca3a075e8..dc60f7db5524 100644 --- a/include/acpi/apei.h +++ b/include/acpi/apei.h @@ -46,6 +46,8 @@ int erst_get_record_id_next(int *pos, u64 *record_id); void erst_get_record_id_end(void); ssize_t erst_read(u64 record_id, struct cper_record_header *record, size_t buflen); +ssize_t erst_read_record(u64 record_id, struct cper_record_header *record, + size_t buflen, size_t recordlen, const guid_t *creatorid); int erst_clear(u64 record_id); int arch_apei_enable_cmcff(struct acpi_hest_header *hest_hdr, void *data); -- 2.23.0