Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp936564pxu; Thu, 3 Dec 2020 17:00:35 -0800 (PST) X-Google-Smtp-Source: ABdhPJwEuRhL3yuK9iBgLDpakw97jiAbmt1M+e590820BGvMPvqAOlfWXXQbDxCbJRPrSeL5uRu4 X-Received: by 2002:a17:906:31cb:: with SMTP id f11mr5003165ejf.142.1607043635435; Thu, 03 Dec 2020 17:00:35 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1607043635; cv=none; d=google.com; s=arc-20160816; b=f6if7Mr1Zo+KOr64Ruvks3N9dJVFkYbx1zoRtOTO6qpFM9ylN6DuKB6zT8MetctvTZ 3mijps+iyHlqHwc1CuHAthAns3oHEmUf3k618B4RYzglXmoU/Kd3E8q5ti3RMhcAUlT8 vZuodr3KpKRiii50+UgMa9UCxVDVAlKbZz0DnH4KpPFPNLV0N8Znm1Ve72ETihsp3NcK Dpu+jvqxKefkHl6oQHtII3C/KF3rxvx+OI73rkUlGMSmo13zits+2cNdkGcP0fwwLeAS GrFJKj1HLGW6EK/rksU7aju0ZToYBVttnsk8yyksvbmYyyTxcrE8HAQbt9XNS0J+PIjO qKTQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:dkim-signature :date; bh=R2rxEDFhJG8erWfkaltyEsDKvOWpHq7qHV/UW6brXBI=; b=jnCtXzUUWpL2WC2mLrwoCEpNHLcloyfdy7XujU4TDYjeahILIEFoQtdr+WmnsDLt8M 4HA/ARlopNRF913Bgfk6LlEQBshm0oNG5na4jxMUo56whJOzwvI4Ta6pfUlBLjSRDuAq Sg4o6xrcJgNV6z0IQazfWyHkDy8Lf0QOhgM8UvUHOprxTgEn29uFsRaZScw1TQX1wzXH tkdyLA/nXfXS7vDc9XQ1gSJHF6HbW/N9GTVqgPdGtDpOHOC5yrDBpbO9FbDbsOXQgJ8I mozpTRYHAk9dsZEb7yCTmPOH5HgrjmfBwcNmSrqPd0ejzGONPJfT7qcGysYnkjBN4pjo zm9Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=f152A+vw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id a5si1905663edu.121.2020.12.03.17.00.12; Thu, 03 Dec 2020 17:00:35 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=f152A+vw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730016AbgLDA5j (ORCPT + 99 others); Thu, 3 Dec 2020 19:57:39 -0500 Received: from mail.kernel.org ([198.145.29.99]:51260 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727042AbgLDA5j (ORCPT ); Thu, 3 Dec 2020 19:57:39 -0500 Date: Fri, 4 Dec 2020 09:56:53 +0900 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1607043418; bh=I70sZb+GFxuM858yM3SdnXf8YH2Ws5GPPACtMQ+ZcvY=; h=From:To:Cc:Subject:In-Reply-To:References:From; b=f152A+vwZdZ6S+U8k0BRC/KONO2Omi5DN541LRhLoTfaff7X8JpTsae0oTa2x3tc7 lGk8yiQE+eddqQI1qg4WPFQwDB1LPKfIiTu1U8eh1698jew4aEyCVLyZusp42RqbwB apzmR+51J9lG2LNCvBx4Q4cIkpVuyaAo23rwnNLoYAjb4T96B8rPHFVg3zyUMBsSNk RLVSuBtEr+neSE96xNUDSlRAVkl/JPIqviw6GUE3tR/MrnbuR7xDFErkaCPHRrf/gt uEZht8f6sbJv+JtnPzQGAGoWm1SxozGTKlUu3lSfAsGg82Mu62AMpDw9cdbFiGgZGr xqBkadSzPNzFQ== From: Masami Hiramatsu To: Tom Lendacky Cc: Borislav Petkov , Masami Hiramatsu , x86@kernel.org, Thomas Gleixner , Ingo Molnar , Kees Cook , "H . Peter Anvin" , Joerg Roedel , "Gustavo A . R . Silva" , Jann Horn , Srikar Dronamraju , Ricardo Neri , linux-kernel@vger.kernel.org Subject: Re: [PATCH v2 1/3] x86/uprobes: Fix not using prefixes.nbytes for loop over prefixes.bytes Message-Id: <20201204095653.c948106a294fd6e731df5594@kernel.org> In-Reply-To: References: <160697102582.3146288.10127018634865687932.stgit@devnote2> <160697103739.3146288.7437620795200799020.stgit@devnote2> <20201203123757.GH3059@zn.tnic> <20201203124121.GI3059@zn.tnic> <20201203124820.GJ3059@zn.tnic> <1c1b265f-34e3-f5cc-0e7b-186dc26c94b7@amd.com> <20201203165420.GL3059@zn.tnic> <20201203170140.GM3059@zn.tnic> <20201203181712.GN3059@zn.tnic> X-Mailer: Sylpheed 3.7.0 (GTK+ 2.24.32; x86_64-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 3 Dec 2020 12:49:46 -0600 Tom Lendacky wrote: > On 12/3/20 12:17 PM, Borislav Petkov wrote: > > On Thu, Dec 03, 2020 at 12:10:10PM -0600, Tom Lendacky wrote: > >> Since that struct is used in multiple places, I think basing it on the array > >> size is the best way to go. The main point of the check is just to be sure > >> you don't read outside of the array. > > > > Well, what happens if someone increases the array size of: > > > > struct insn_field { > > union { > > insn_byte_t bytes[4]; > > ^^^^ > > > > ? > > I think we need to keep the parsing of the instruction separate from > accessing the prefixes after (successfully) parsing it. This fix is merely > making sure that we don't read outside the bounds of the array that > currently holds the legacy prefixes. > > > > > That's why a separate array only for legacy prefixes would be better > > in the long run. The array size check is good as a short-term fix for > > stable. > > > > I'd say. > > According to Volume 3 of the AMD APM (Figure 1-2 on page 5), there could > be as many as 5 legacy prefixes and it says that more than one prefix from > each group is undefined behavior. The instruction parsing code doesn't > seem to take into account the different prefix groups. So I agree with you > that short term the array size check works, and long term, the legacy > prefix support probably needs a closer look. Hmm, there is a difference between Intel SDM and AMD APM. Intel SDM vol.2 2.1.1 Instruction Prefixes Instruction prefixes are divided into four groups, each with a set of allowable prefix codes. For each instruction, it is only useful to include up to one prefix code from each of the four groups (Groups 1, 2, 3, 4). AMD APM vol.3 1.2.1 Summary of Legacy Prefixes Table 1-1 on page 7 shows the legacy prefixes. The legacy prefixes are organized into five groups, as shown in the left-most column of Table 1-1. An instruction encoding may include a maximum of one prefix from each of the five groups. So, Intel CPU doesn't accept LOCK-REP because those are in a same prefix group, but AMD says it is acceptable. Actually, insn.c only accepts the prefix up to 4, so if there is any instruction which has 5 prefixes, it will fail to parse. Note that anyway the same prefix can be repeated, we can see a good example in K8_NOP*. /* Opteron 64bit nops 1: nop 2: osp nop 3: osp osp nop 4: osp osp osp nop */ In this case, insn.c just store the 1 osp in the prefixes.bytes[], and just increment prefixes.nbytes for the repeated prefixes. Anyway, if there is LOCK-REP prefix combination, I have to introduce new insn_field for legacy prefix. Thank you, -- Masami Hiramatsu