Received: by 2002:a25:683:0:0:0:0:0 with SMTP id 125csp2916256ybg; Sat, 6 Jun 2020 05:37:21 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwAq+9FYeSdhAs109w4PBWrCMyhuV3WgYUlPk92qBYjkWnFWaZGLVHj69MnYO51SsjAHrIo X-Received: by 2002:aa7:da42:: with SMTP id w2mr13477073eds.176.1591447040930; Sat, 06 Jun 2020 05:37:20 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1591447040; cv=none; d=google.com; s=arc-20160816; b=sZIyTc63rX1mD/lj5Q1epH8AvinAqWCg17OiUWhlUiT3RhS7x73meBVCcgcxC3HfsM xSMczSxtVzwnmQiuYexTTdaZggmYWFTxltMeUuItWjRKpa3nM4QR8Y2O74XlGwZajE9u KY0kU7S7wLddyP1djpp0MRMB72nu1yhP9L9vIS+OUnxvIqFBkhM81xkAEyi/X9BzweLy 7ax2XtUJi859F0tL1WbIqPWEXLTrgK9Jo2/DwFz+Nk5vDHfvkol4kKcPjwweXE6cgItz T0nCJoHrR2wtFDGpB1GaELa1n0O18pui7yOhBEbIJPU6smK5qweYkcsPcQ3ep0YOoclP 54/Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:references :in-reply-to:subject:cc:to:from; bh=8LtOLPye7Njz3rcsoOnKkp3VTeWWy+VVi8ekMj8z6AI=; b=e7huLAruP1JGxavJZcpA5WHO3RkuWOBdB+iqcCnxHKlRdb9KL3IhR/mSRmElNodo+n A6SMRsfFLgZX7vcYR2cRG0U51RBG35KFq4sSIvDRh0g+4LGrr45e0psPeFsoMioJgY2p gEnLnC3RXdQVTLjfyhg+MLZQlzmCNK0YWCH/xKjG7iC4ukhZqeISdmyCqm8zCdw6fVBj d5H8pk/XQyOC2CdSCm+g3GkWm32LcshAt33pE8aDwVAfEmNret4JmtuFtpaeiTvuQtJ6 xBI+c/iEUuDLfnTSqlTSrRVgxgcNfoE5wIbUIn+UEK/hqtRp6hDenKGZSBgHtdN7i2qR V88w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id k14si5225889ejb.286.2020.06.06.05.36.55; Sat, 06 Jun 2020 05:37:20 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728751AbgFFMfA (ORCPT + 99 others); Sat, 6 Jun 2020 08:35:00 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:9244 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725831AbgFFMe7 (ORCPT ); Sat, 6 Jun 2020 08:34:59 -0400 Received: from pps.filterd (m0098421.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id 056CVhF3045555; Sat, 6 Jun 2020 08:34:22 -0400 Received: from pps.reinject (localhost [127.0.0.1]) by mx0a-001b2d01.pphosted.com with ESMTP id 31g74rm7u1-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Sat, 06 Jun 2020 08:34:21 -0400 Received: from m0098421.ppops.net (m0098421.ppops.net [127.0.0.1]) by pps.reinject (8.16.0.36/8.16.0.36) with SMTP id 056CVmMa045784; Sat, 6 Jun 2020 08:34:21 -0400 Received: from ppma03ams.nl.ibm.com (62.31.33a9.ip4.static.sl-reverse.com [169.51.49.98]) by mx0a-001b2d01.pphosted.com with ESMTP id 31g74rm7tg-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Sat, 06 Jun 2020 08:34:21 -0400 Received: from pps.filterd (ppma03ams.nl.ibm.com [127.0.0.1]) by ppma03ams.nl.ibm.com (8.16.0.42/8.16.0.42) with SMTP id 056CEuBI032311; Sat, 6 Jun 2020 12:34:19 GMT Received: from b06cxnps3074.portsmouth.uk.ibm.com (d06relay09.portsmouth.uk.ibm.com [9.149.109.194]) by ppma03ams.nl.ibm.com with ESMTP id 31g2s7rmv2-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Sat, 06 Jun 2020 12:34:19 +0000 Received: from b06wcsmtp001.portsmouth.uk.ibm.com (b06wcsmtp001.portsmouth.uk.ibm.com [9.149.105.160]) by b06cxnps3074.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id 056CYHuV36897016 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Sat, 6 Jun 2020 12:34:17 GMT Received: from b06wcsmtp001.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 565A2A405B; Sat, 6 Jun 2020 12:34:17 +0000 (GMT) Received: from b06wcsmtp001.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 48D65A4060; Sat, 6 Jun 2020 12:34:13 +0000 (GMT) Received: from vajain21-in-ibm-com (unknown [9.85.89.98]) by b06wcsmtp001.portsmouth.uk.ibm.com (Postfix) with SMTP; Sat, 6 Jun 2020 12:34:13 +0000 (GMT) Received: by vajain21-in-ibm-com (sSMTP sendmail emulation); Sat, 06 Jun 2020 18:04:12 +0530 From: Vaibhav Jain To: Ira Weiny Cc: Santosh Sivaraj , linux-nvdimm@lists.01.org, linux-kernel@vger.kernel.org, Steven Rostedt , "Oliver O'Halloran" , "Aneesh Kumar K . V" , Dan Williams , linuxppc-dev@lists.ozlabs.org Subject: Re: [PATCH v10 6/6] powerpc/papr_scm: Implement support for PAPR_PDSM_HEALTH In-Reply-To: <20200605183655.GP1505637@iweiny-DESK2.sc.intel.com> References: <20200604234136.253703-1-vaibhav@linux.ibm.com> <20200604234136.253703-7-vaibhav@linux.ibm.com> <20200605183655.GP1505637@iweiny-DESK2.sc.intel.com> Date: Sat, 06 Jun 2020 18:04:11 +0530 Message-ID: <87wo4kfk58.fsf@linux.ibm.com> MIME-Version: 1.0 Content-Type: text/plain X-TM-AS-GCONF: 00 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.216,18.0.687 definitions=2020-06-06_07:2020-06-04,2020-06-06 signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 phishscore=0 malwarescore=0 mlxlogscore=999 spamscore=0 clxscore=1015 mlxscore=0 lowpriorityscore=0 cotscore=-2147483648 adultscore=0 priorityscore=1501 suspectscore=1 bulkscore=0 impostorscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2004280000 definitions=main-2006060098 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Ira Weiny writes: > On Fri, Jun 05, 2020 at 05:11:36AM +0530, Vaibhav Jain wrote: >> This patch implements support for PDSM request 'PAPR_PDSM_HEALTH' >> that returns a newly introduced 'struct nd_papr_pdsm_health' instance >> containing dimm health information back to user space in response to >> ND_CMD_CALL. This functionality is implemented in newly introduced >> papr_pdsm_health() that queries the nvdimm health information and >> then copies this information to the package payload whose layout is >> defined by 'struct nd_papr_pdsm_health'. >> >> Cc: "Aneesh Kumar K . V" >> Cc: Dan Williams >> Cc: Michael Ellerman >> Cc: Ira Weiny >> Signed-off-by: Vaibhav Jain >> --- >> Changelog: >> >> v9..v10: >> * Removed code in papr_pdsm_health that performed validation on pdsm >> payload version and corrosponding struct and defines used for >> validation of payload version. >> * Dropped usage of struct papr_pdsm_health in 'struct >> papr_scm_priv'. Instead papr_psdm_health() now uses >> 'papr_scm_priv.health_bitmap' to populate the pdsm payload. >> * Above change also fixes the problem where this patch was removing >> the code that was previously introduced in this patch-series. >> [ Ira ] >> * Introduced a new def ND_PDSM_ENVELOPE_HDR_SIZE that indicates the >> space allocated to 'struct nd_pdsm_cmd_pkg' fields except 'struct >> nd_cmd_pkg'. This def is useful in validating payload sizes. >> * Reworked papr_pdsm_health() to enforce a specific payload size for >> 'PAPR_PDSM_HEALTH' pdsm request. >> >> Resend: >> * Added ack from Aneesh. >> >> v8..v9: >> * s/PAPR_SCM_PDSM_HEALTH/PAPR_PDSM_HEALTH/g [ Dan , Aneesh ] >> * s/PAPR_SCM_PSDM_DIMM_*/PAPR_PDSM_DIMM_*/g >> * Renamed papr_scm_get_health() to papr_psdm_health() >> * Updated patch description to replace papr-scm dimm with nvdimm. >> >> v7..v8: >> * None >> >> Resend: >> * None >> >> v6..v7: >> * Updated flags_show() to use seq_buf_printf(). [Mpe] >> * Updated papr_scm_get_health() to use newly introduced >> __drc_pmem_query_health() bypassing the cache [Mpe]. >> >> v5..v6: >> * Added attribute '__packed' to 'struct nd_papr_pdsm_health_v1' to >> gaurd against possibility of different compilers adding different >> paddings to the struct [ Dan Williams ] >> >> * Updated 'struct nd_papr_pdsm_health_v1' to use __u8 instead of >> 'bool' and also updated drc_pmem_query_health() to take this into >> account. [ Dan Williams ] >> >> v4..v5: >> * None >> >> v3..v4: >> * Call the DSM_PAPR_SCM_HEALTH service function from >> papr_scm_service_dsm() instead of papr_scm_ndctl(). [Aneesh] >> >> v2..v3: >> * Updated struct nd_papr_scm_dimm_health_stat_v1 to use '__xx' types >> as its exported to the userspace [Aneesh] >> * Changed the constants DSM_PAPR_SCM_DIMM_XX indicating dimm health >> from enum to #defines [Aneesh] >> >> v1..v2: >> * New patch in the series >> --- >> arch/powerpc/include/uapi/asm/papr_pdsm.h | 33 +++++++++++ >> arch/powerpc/platforms/pseries/papr_scm.c | 70 +++++++++++++++++++++++ >> 2 files changed, 103 insertions(+) >> >> diff --git a/arch/powerpc/include/uapi/asm/papr_pdsm.h b/arch/powerpc/include/uapi/asm/papr_pdsm.h >> index 8b1a4f8fa316..c4c990ede5d4 100644 >> --- a/arch/powerpc/include/uapi/asm/papr_pdsm.h >> +++ b/arch/powerpc/include/uapi/asm/papr_pdsm.h >> @@ -71,12 +71,17 @@ struct nd_pdsm_cmd_pkg { >> __u8 payload[]; /* In/Out: Sub-cmd data buffer */ >> } __packed; >> >> +/* Calculate size used by the pdsm header fields minus 'struct nd_cmd_pkg' */ >> +#define ND_PDSM_ENVELOPE_HDR_SIZE \ >> + (sizeof(struct nd_pdsm_cmd_pkg) - sizeof(struct nd_cmd_pkg)) >> + [ ] > > This is kind of a weird name for this. > > Isn't this just the ND PDSM header size? What is 'envelope' mean > here? Was referring to 'nd_cmd_pkg' envelop here. But yes removing 'ENVELOPE' should make it more concise. Have already done this in v11 of this patch. > >> /* >> * Methods to be embedded in ND_CMD_CALL request. These are sent to the kernel >> * via 'nd_pdsm_cmd_pkg.hdr.nd_command' member of the ioctl struct >> */ >> enum papr_pdsm { >> PAPR_PDSM_MIN = 0x0, >> + PAPR_PDSM_HEALTH, >> PAPR_PDSM_MAX, >> }; >> >> @@ -95,4 +100,32 @@ static inline void *pdsm_cmd_to_payload(struct nd_pdsm_cmd_pkg *pcmd) >> return (void *)(pcmd->payload); >> } >> >> +/* Various nvdimm health indicators */ >> +#define PAPR_PDSM_DIMM_HEALTHY 0 >> +#define PAPR_PDSM_DIMM_UNHEALTHY 1 >> +#define PAPR_PDSM_DIMM_CRITICAL 2 >> +#define PAPR_PDSM_DIMM_FATAL 3 >> + >> +/* >> + * Struct exchanged between kernel & ndctl in for PAPR_PDSM_HEALTH >> + * Various flags indicate the health status of the dimm. >> + * >> + * dimm_unarmed : Dimm not armed. So contents wont persist. >> + * dimm_bad_shutdown : Previous shutdown did not persist contents. >> + * dimm_bad_restore : Contents from previous shutdown werent restored. >> + * dimm_scrubbed : Contents of the dimm have been scrubbed. >> + * dimm_locked : Contents of the dimm cant be modified until CEC reboot >> + * dimm_encrypted : Contents of dimm are encrypted. >> + * dimm_health : Dimm health indicator. One of PAPR_PDSM_DIMM_XXXX >> + */ >> +struct nd_papr_pdsm_health { >> + __u8 dimm_unarmed; >> + __u8 dimm_bad_shutdown; >> + __u8 dimm_bad_restore; >> + __u8 dimm_scrubbed; >> + __u8 dimm_locked; >> + __u8 dimm_encrypted; >> + __u16 dimm_health; >> +} __packed; >> + >> #endif /* _UAPI_ASM_POWERPC_PAPR_PDSM_H_ */ >> diff --git a/arch/powerpc/platforms/pseries/papr_scm.c b/arch/powerpc/platforms/pseries/papr_scm.c >> index 05eb56ecab5e..984942be24c1 100644 >> --- a/arch/powerpc/platforms/pseries/papr_scm.c >> +++ b/arch/powerpc/platforms/pseries/papr_scm.c >> @@ -421,6 +421,72 @@ static int is_cmd_valid(struct nvdimm *nvdimm, unsigned int cmd, void *buf, >> return 0; >> } >> >> +/* Fetch the DIMM health info and populate it in provided package. */ >> +static int papr_pdsm_health(struct papr_scm_priv *p, >> + struct nd_pdsm_cmd_pkg *pkg) >> +{ >> + int rc; >> + struct nd_papr_pdsm_health health = { 0 }; >> + u16 copysize = sizeof(struct nd_papr_pdsm_health); >> + u16 payload_size = pkg->hdr.nd_size_out - ND_PDSM_ENVELOPE_HDR_SIZE; >> + >> + /* Ensure correct payload size that can hold struct nd_papr_pdsm_health */ >> + if (payload_size != copysize) { >> + dev_dbg(&p->pdev->dev, >> + "Unexpected payload-size (%u). Expected (%u)", >> + pkg->hdr.nd_size_out, copysize); >> + rc = -ENOSPC; >> + goto out; >> + } >> + >> + /* Ensure dimm health mutex is taken preventing concurrent access */ >> + rc = mutex_lock_interruptible(&p->health_mutex); >> + if (rc) >> + goto out; >> + >> + /* Always fetch upto date dimm health data ignoring cached values */ >> + rc = __drc_pmem_query_health(p); >> + if (rc) { >> + mutex_unlock(&p->health_mutex); >> + goto out; >> + } >> + >> + /* update health struct with various flags derived from health bitmap */ >> + health = (struct nd_papr_pdsm_health) { >> + .dimm_unarmed = p->health_bitmap & PAPR_PMEM_UNARMED_MASK, >> + .dimm_bad_shutdown = p->health_bitmap & PAPR_PMEM_BAD_SHUTDOWN_MASK, >> + .dimm_bad_restore = p->health_bitmap & PAPR_PMEM_BAD_RESTORE_MASK, >> + .dimm_encrypted = p->health_bitmap & PAPR_PMEM_ENCRYPTED, >> + .dimm_locked = p->health_bitmap & PAPR_PMEM_SCRUBBED_AND_LOCKED, >> + .dimm_scrubbed = p->health_bitmap & PAPR_PMEM_SCRUBBED_AND_LOCKED, > > Are you sure these work? These are not assignments to a bool so I don't think > gcc will do what you want here. Yeah, somehow this slipped by and didnt show up in my tests. I checked the assembly dump and seems GCC was silently skipping initializing these fields without making any noise. Have rectified this in v11 by changing these designated initilizers to .dimm_locked = !!(p->health_bitmap & PAPR_PMEM_SCRUBBED_AND_LOCKED) Thanks for catching this. ~ Vaibhav > > Ira > >> + .dimm_health = PAPR_PDSM_DIMM_HEALTHY, >> + }; >> + >> + /* Update field dimm_health based on health_bitmap flags */ >> + if (p->health_bitmap & PAPR_PMEM_HEALTH_FATAL) >> + health.dimm_health = PAPR_PDSM_DIMM_FATAL; >> + else if (p->health_bitmap & PAPR_PMEM_HEALTH_CRITICAL) >> + health.dimm_health = PAPR_PDSM_DIMM_CRITICAL; >> + else if (p->health_bitmap & PAPR_PMEM_HEALTH_UNHEALTHY) >> + health.dimm_health = PAPR_PDSM_DIMM_UNHEALTHY; >> + >> + /* struct populated hence can release the mutex now */ >> + mutex_unlock(&p->health_mutex); >> + >> + dev_dbg(&p->pdev->dev, "Copying payload size=%u\n", copysize); >> + >> + /* Copy the health struct to the payload */ >> + memcpy(pdsm_cmd_to_payload(pkg), &health, copysize); >> + >> + /* Update fw size including size of struct nd_pdsm_cmd_pkg fields */ >> + pkg->hdr.nd_fw_size = copysize + ND_PDSM_ENVELOPE_HDR_SIZE; >> + >> +out: >> + dev_dbg(&p->pdev->dev, "completion code = %d\n", rc); >> + >> + return rc; >> +} >> + >> /* >> * For a given pdsm request call an appropriate service function. >> * Note: Use 'nd_pdsm_cmd_pkg.cmd_status to report psdm servicing errors. Hence >> @@ -435,6 +501,10 @@ static void papr_scm_service_pdsm(struct papr_scm_priv *p, >> >> /* Call pdsm service function */ >> switch (pdsm) { >> + case PAPR_PDSM_HEALTH: >> + pkg->cmd_status = papr_pdsm_health(p, pkg); >> + break; >> + >> default: >> dev_dbg(&p->pdev->dev, "PDSM[0x%x]: Unsupported PDSM request\n", >> pdsm); >> -- >> 2.26.2 >> -- Cheers ~ Vaibhav