Received: by 2002:a05:6a10:6744:0:0:0:0 with SMTP id w4csp66421pxu; Wed, 14 Oct 2020 20:30:49 -0700 (PDT) X-Google-Smtp-Source: ABdhPJySYspcBjVrd/dpWBZCsgGPYTd1isVDajSUiPQkjlp+i4VMOj8E1OSHQBaih4JK/d6ixjUn X-Received: by 2002:a17:906:1418:: with SMTP id p24mr2376182ejc.46.1602732649166; Wed, 14 Oct 2020 20:30:49 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1602732649; cv=none; d=google.com; s=arc-20160816; b=Y+R1hCTDmPGIfbUhwY7oFDyGb/r1Rq1gFckKb1b5UPOlv6leNiU9dc8FRgNeYgP48t 8yf/RozPinCePyz9UZYW14r5GTHQbXxJfQV15Sdre5EUMB4vfgfjamSto01S98XAXSCN PaWIWVs1HqGYkK0XnVMR8qDODs/o2APSVQcBrv/Nz7Kf9sg8bYxPvlzmXXuj/k90LqiD ryk6sfnGCgDFpVo7W34YoGTGk2AocZ2ZDOMj21HYEbzrZK0I9FcgyGmalDSpx7N+wKzk cRWEivhWXICEcCnfHHmdEUnYqr1Baj8tfLiQ++OI5Nfnaw8vj5czaJur0o0q4IlAsfm4 ah0Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :ironport-sdr:ironport-sdr; bh=rJWIychPPtnLp4jiEmGa6yJnxVlAT14K87PTxQ4YuJU=; b=LQlCVoT2/KtR/AxNCsHhw8IUD4RIZuqrKnlrQ+a3SiNtS5GCMerCdeJUDNqry8X634 Zx810z3VIMg3eqdfRyyPz7FhTxeQ+bIcIKvd20ZUmJNhmnAQBVbn8wIB7HAzhTdqJ6Xm YV1Qd+AYn1PGFa7iZA/bE4Lz+xHQEuBtcVWfX+oK0ERl5u4kVqXTpCfxDo9PNg3iaU6o sluPFFRNPOWcZmQEoTRYOK8sMO2twQZvgGDjnT2Q7qo0QvsA3aq7Sl9DthlADpza2bHz 4hrbuGeeFOIR/njGUKKmMlSLhWuntJZ/Zgb3HZNldjYwq5Ncj/HHV+HLQossgDqY4POl ypKQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id y63si1070450edy.415.2020.10.14.20.30.26; Wed, 14 Oct 2020 20:30:49 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729045AbgJOBKi (ORCPT + 99 others); Wed, 14 Oct 2020 21:10:38 -0400 Received: from mga11.intel.com ([192.55.52.93]:3036 "EHLO mga11.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728458AbgJOBKY (ORCPT ); Wed, 14 Oct 2020 21:10:24 -0400 IronPort-SDR: y2r1/mlHQY6aXergE9Wxgzu4B9dfeOKzOeubXoMO3rBYYBZO0+Ne8uJ4smKSodmBLkECHN9cd4 OI2FT2wdF0aw== X-IronPort-AV: E=McAfee;i="6000,8403,9774"; a="162765397" X-IronPort-AV: E=Sophos;i="5.77,376,1596524400"; d="scan'208";a="162765397" X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga005.jf.intel.com ([10.7.209.41]) by fmsmga102.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 14 Oct 2020 18:08:25 -0700 IronPort-SDR: xRNfDck7YaZS4U7CeLu3eG8Aj2EL0jqp+Xk42HFT8CPHLPEv5vC1UNPxevHekTA57BtGk8Z5E5 PHhaSdswy4bw== X-IronPort-AV: E=Sophos;i="5.77,376,1596524400"; d="scan'208";a="531055780" Received: from iweiny-desk2.sc.intel.com (HELO localhost) ([10.3.52.147]) by orsmga005-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 14 Oct 2020 18:08:25 -0700 Date: Wed, 14 Oct 2020 18:08:24 -0700 From: Ira Weiny To: Dave Hansen Cc: Thomas Gleixner , Ingo Molnar , Borislav Petkov , Andy Lutomirski , Peter Zijlstra , Fenghua Yu , x86@kernel.org, Dave Hansen , Dan Williams , Andrew Morton , linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-nvdimm@lists.01.org, linux-fsdevel@vger.kernel.org, linux-mm@kvack.org, linux-kselftest@vger.kernel.org Subject: Re: [PATCH RFC V3 5/9] x86/pks: Add PKS kernel API Message-ID: <20201015010824.GP2046448@iweiny-DESK2.sc.intel.com> References: <20201009194258.3207172-1-ira.weiny@intel.com> <20201009194258.3207172-6-ira.weiny@intel.com> <29e9b8f1-35d6-d1d4-661d-a36fd296b593@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <29e9b8f1-35d6-d1d4-661d-a36fd296b593@intel.com> User-Agent: Mutt/1.11.1 (2018-12-01) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Oct 13, 2020 at 11:43:57AM -0700, Dave Hansen wrote: > > +static inline void pks_update_protection(int pkey, unsigned long protection) > > +{ > > + current->thread.saved_pkrs = update_pkey_val(current->thread.saved_pkrs, > > + pkey, protection); > > + preempt_disable(); > > + write_pkrs(current->thread.saved_pkrs); > > + preempt_enable(); > > +} > > Why does this need preempt count manipulation in addition to the > get/put_cpu_var() inside of write_pkrs()? This is a bug. The disable should be around the update_pkey_val(). > > > +/** > > + * PKS access control functions > > + * > > + * Change the access of the domain specified by the pkey. These are global > > + * updates. They only affects the current running thread. It is undefined and > > + * a bug for users to call this without having allocated a pkey and using it as > > + * pkey here. > > + * > > + * pks_mknoaccess() > > + * Disable all access to the domain > > + * pks_mkread() > > + * Make the domain Read only > > + * pks_mkrdwr() > > + * Make the domain Read/Write > > + * > > + * @pkey the pkey for which the access should change. > > + * > > + */ > > +void pks_mknoaccess(int pkey) > > +{ > > + pks_update_protection(pkey, PKEY_DISABLE_ACCESS); > > +} > > +EXPORT_SYMBOL_GPL(pks_mknoaccess); > > These are named like PTE manipulation functions, which is kinda weird. > > What's wrong with: pks_disable_access(pkey) ? Internal review suggested these names. I'm not dead set on them. FWIW I would rather they not get to wordy. I was trying to get some consistency with pks_mk*() as meaning PKS 'make' X. Do me 'disable' implies a state transition where 'make' implies we are 'setting' an absolute value. I think the later is a better name. And 'make' made more sense because 'set' is so overloaded IHO. > > > +void pks_mkread(int pkey) > > +{ > > + pks_update_protection(pkey, PKEY_DISABLE_WRITE); > > +} > > +EXPORT_SYMBOL_GPL(pks_mkread); > > I really don't like this name. It doesn't make readable, or even > read-only, *especially* if it was already access-disabled. Ok. But it does sense if going from access-disable to read, correct?. I could see this being better named pks_mkreadonly() so that going from RW to this would make more sense. Especially after thinking about it above 'read only' needs to be in the name. Before I change anything I'd like to get consensus on naming. How about the following? pks_mk_noaccess() pks_mk_readonly() pks_mk_readwrite() ? > > > +static const char pks_key_user0[] = "kernel"; > > + > > +/* Store names of allocated keys for debug. Key 0 is reserved for the kernel. */ > > +static const char *pks_key_users[PKS_NUM_KEYS] = { > > + pks_key_user0 > > +}; > > + > > +/* > > + * Each key is represented by a bit. Bit 0 is set for key 0 and reserved for > > + * its use. We use ulong for the bit operations but only 16 bits are used. > > + */ > > +static unsigned long pks_key_allocation_map = 1 << PKS_KERN_DEFAULT_KEY; > > + > > +/* > > + * pks_key_alloc - Allocate a PKS key > > + * > > + * @pkey_user: String stored for debugging of key exhaustion. The caller is > > + * responsible to maintain this memory until pks_key_free(). > > + */ > > +int pks_key_alloc(const char * const pkey_user) > > +{ > > + int nr; > > + > > + if (!cpu_feature_enabled(X86_FEATURE_PKS)) > > + return -EINVAL; > > I'm not sure I like -EINVAL for this. I thought we returned -ENOSPC for > this case for user pkeys. -ENOTSUP? I'm not really sure anyone will need to know the difference between the platform not supporting the key vs running out of them. But they are 2 different error conditions. > > > + while (1) { > > + nr = find_first_zero_bit(&pks_key_allocation_map, PKS_NUM_KEYS); > > + if (nr >= PKS_NUM_KEYS) { > > + pr_info("Cannot allocate supervisor key for %s.\n", > > + pkey_user); > > + return -ENOSPC; We return -ENOSPC here when running out of keys. > > + } > > + if (!test_and_set_bit_lock(nr, &pks_key_allocation_map)) > > + break; > > + } > > + > > + /* for debugging key exhaustion */ > > + pks_key_users[nr] = pkey_user; > > + > > + return nr; > > +} > > +EXPORT_SYMBOL_GPL(pks_key_alloc); > > + > > +/* > > + * pks_key_free - Free a previously allocate PKS key > > + * > > + * @pkey: Key to be free'ed > > + */ > > +void pks_key_free(int pkey) > > +{ > > + if (!cpu_feature_enabled(X86_FEATURE_PKS)) > > + return; > > + > > + if (pkey >= PKS_NUM_KEYS || pkey <= PKS_KERN_DEFAULT_KEY) > > + return; > > This seems worthy of a WARN_ON_ONCE() at least. It's essentially > corrupt data coming into a kernel API. Ok, Done, Ira