Date: Wed, 10 Feb 2021 08:49:04 -0800
From: Ben Widawsky
To: Ariel.Sibley@microchip.com
Cc: linux-cxl@vger.kernel.org, linux-acpi@vger.kernel.org,
 linux-kernel@vger.kernel.org, linux-nvdimm@lists.01.org,
 linux-pci@vger.kernel.org, helgaas@kernel.org, cbrowy@avery-design.com,
 hch@infradead.org, dan.j.williams@intel.com, david@redhat.com,
 rientjes@google.com,
 ira.weiny@intel.com, jcm@jonmasters.org, Jonathan.Cameron@huawei.com,
 rafael.j.wysocki@intel.com, rdunlap@infradead.org,
 vishal.l.verma@intel.com, jgroves@micron.com, sean.v.kelley@intel.com
Subject: Re: [PATCH v2 5/8] cxl/mem: Add a "RAW" send command
Message-ID: <20210210164904.lfhtfvlyeypfpjhe@intel.com>
References: <20210210000259.635748-1-ben.widawsky@intel.com>
 <20210210000259.635748-6-ben.widawsky@intel.com>

On 21-02-10 15:26:27, Ariel.Sibley@microchip.com wrote:
> > diff --git a/drivers/cxl/Kconfig b/drivers/cxl/Kconfig
> > index c4ba3aa0a05d..08eaa8e52083 100644
> > --- a/drivers/cxl/Kconfig
> > +++ b/drivers/cxl/Kconfig
> > @@ -33,6 +33,24 @@ config CXL_MEM
> >
> >            If unsure say 'm'.
> >
> > +config CXL_MEM_RAW_COMMANDS
> > +        bool "RAW Command Interface for Memory Devices"
> > +        depends on CXL_MEM
> > +        help
> > +          Enable CXL RAW command interface.
> > +
> > +          The CXL driver ioctl interface may assign a kernel ioctl command
> > +          number for each specification defined opcode. At any given point in
> > +          time the number of opcodes that the specification defines and a device
> > +          may implement may exceed the kernel's set of associated ioctl function
> > +          numbers. The mismatch is either by omission, specification is too new,
> > +          or by design. When prototyping new hardware, or developing / debugging
> > +          the driver it is useful to be able to submit any possible command to
> > +          the hardware, even commands that may crash the kernel due to their
> > +          potential impact to memory currently in use by the kernel.
> > +
> > +          If developing CXL hardware or the driver say Y, otherwise say N.
>
> Blocking RAW commands by default will prevent vendors from developing user
> space tools that utilize vendor specific commands.
> Vendors of CXL.mem devices should take ownership of ensuring any vendor
> defined commands that could cause user data to be exposed or corrupted are
> disabled at the device level for shipping configurations.

Thanks for bringing this up, Ariel. If there is a recommendation on how to
codify this, I would certainly like to know because the explanation will be
long.

---

The background:

The enabling/disabling of the Kconfig option is driven by the distribution
and/or system integrator. Even if we made the default 'y', nothing stops them
from changing that. If you are using this driver in production and insist on
using RAW commands, you are free to carry around a small patch to get rid of
the WARN (it is a one-liner).

To recap why this is in place - the driver owns the sanctity of the device and
therefore a [large] part of the whole system. What we can do as driver writers
is figure out the set of commands that are "safe" and allow those. Aside from
being able to validate them, we're able to mediate them with other parallel
operations that might conflict. We gain the ability to squint extra hard at bug
reports. We provide a reason to try to use a well-defined part of the spec.
Realizing that only allowing that small set of commands in a rapidly growing
ecosystem is not a welcoming API, we decided on RAW.

Vendor commands can be one of two types:
1. Some functionality probably most vendors want.
2. Functionality that is really single vendor specific.

Hopefully we can agree that the path for case #1 is to work with the consortium
to standardize a command that does what is needed and that can eventually
become part of UAPI. The situation is unfortunate, but temporary. If you won't
be able to upgrade your kernel, patch out the WARN as above.

The second situation is interesting and does need some more thought and
discussion.

---

I see 3 realistic options for truly vendor specific commands.
1. Tough noogies. Vendors aren't special and they shouldn't do that.
2. modparam to disable the WARN for specific devices (let the sysadmin decide)
3. Try to make them part of UAPI.

The right answer to me is #1, but I also realize I live in the real world.

#2 provides too much flexibility. Vendors will just do what they please and
distros and/or integrators will be seen as hostile if they don't accommodate.

I like #3, but I have a feeling not everyone will agree. My proposal for vendor
specific commands is, if it's clear it's truly a unique command, allow adding
it as part of UAPI (moving it out of RAW). I expect like 5 of these, ever. If
we start getting multiple per vendor, we've failed. The infrastructure is
already in place to allow doing this pretty easily. I think we'd have to draw
up some guidelines (like adding test cases for the command) to allow these to
come in. Anything with command effects is going to need extra scrutiny.

In my opinion, as maintainers of the driver, we do owe the community an answer
as to our direction for this. Dan, what is your thought?
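
For anyone following along, the policy described above can be modeled roughly as follows. This is only a userspace sketch under stated assumptions, not the driver's code: every name in it (sketch_policy, sketch_check, the SKETCH_OP_* opcodes and their values) is hypothetical, and the counter plus stderr message merely stand in for the WARN on the RAW path.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical opcodes for illustration; not taken from the CXL spec. */
enum sketch_opcode {
	SKETCH_OP_IDENTIFY   = 0x0001,
	SKETCH_OP_GET_LOG    = 0x0002,
	SKETCH_OP_VENDOR_FOO = 0xC000, /* stand-in for a vendor specific command */
};

enum sketch_verdict {
	SKETCH_ALLOW,      /* opcode is on the vetted ("safe") list */
	SKETCH_ALLOW_WARN, /* RAW path: passed through, but flagged loudly */
	SKETCH_REJECT,     /* not vetted and RAW support is disabled */
};

struct sketch_policy {
	const uint16_t *vetted;    /* opcodes the driver validates and mediates */
	size_t nr_vetted;
	bool raw_enabled;          /* models CXL_MEM_RAW_COMMANDS being set */
	unsigned int raw_warnings; /* number of times the RAW path was taken */
};

/* Linear scan; the vetted list is expected to stay small. */
static bool sketch_is_vetted(const struct sketch_policy *p, uint16_t opcode)
{
	for (size_t i = 0; i < p->nr_vetted; i++)
		if (p->vetted[i] == opcode)
			return true;
	return false;
}

/* Decide what happens to a command submitted from user space. */
enum sketch_verdict sketch_check(struct sketch_policy *p, uint16_t opcode)
{
	assert(p != NULL);

	if (sketch_is_vetted(p, opcode))
		return SKETCH_ALLOW;
	if (!p->raw_enabled)
		return SKETCH_REJECT;

	/* Stand-in for the WARN: record and report every RAW submission. */
	p->raw_warnings++;
	fprintf(stderr, "sketch: raw opcode 0x%04x submitted\n",
		(unsigned int)opcode);
	return SKETCH_ALLOW_WARN;
}
```

In this model, option #3 amounts to adding an opcode such as SKETCH_OP_VENDOR_FOO to the vetted array, after which sketch_check() returns SKETCH_ALLOW for it regardless of raw_enabled.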