Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp1509094imu; Wed, 28 Nov 2018 10:33:19 -0800 (PST) X-Google-Smtp-Source: AFSGD/UbE3mHZnokOP16G1CzVY/l5Cbo/XF2QJV6Bnb6vs35Je6x5JVQNKQHxYHNZWOo2he95O6c X-Received: by 2002:a63:63c3:: with SMTP id x186mr34226663pgb.330.1543429999230; Wed, 28 Nov 2018 10:33:19 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543429999; cv=none; d=google.com; s=arc-20160816; b=xL8hn4tB9DAgwcLqAm7jqas4PZFualR2mAg1fPEH/keRVa/mJgTCa0xN7ROELErrlp fZzzTT7MQqVx2nIzD0DefX5lTv/z0mIGwyYxf2anPxyUK0lACGvOxhxcuXJGHj6xy81R EJxyS577pljMXe1mB2V7TEjRFGhlnc3qJnOKy/Q3v8woZPZA/4Now2C57zSz947BZoOB pwLlLv+Ox/qsE0/DqqzKyXp6NPJ9fpB1FEvaKmu8q7bOxUKMy9b0qIGggt9KrfMLk6JB ATMrzkZtqMWIbUUFwwxOHM9B8/xruhKV20yWeVhwqaBdRxBd/GAKe1PAhB34IPlfiyi+ 9jrQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:to:references:message-id :content-transfer-encoding:cc:date:in-reply-to:from:subject :mime-version:dkim-signature; bh=XtrritqyCgXvd/Oac0XlhV9ZTEWanBaIrFkfpwFUgoI=; b=h+c2c3rZlphNOJrVbigVF/N4XI7xdKx9Bej0hwxQVasgLo8wQxPgd4Fg9nTwxZUYDG xasqSj8jkNzCInJsEe59p6t9B5fb71P4zXV6uZ2VenFBRRTfJB40hkXD2fyIoVebHGZv vYQF5ibZLktYRUq4EdW10NgQ2XnMd6RefZc1jpu0HLsfwLXQ5kivs/39jQaBKfiJz+2b ripsgqidy+TNBOjajzWevM7DaMK3sc95cyAf6zDCHgpZSIDq1rRc9SRqY+8jIBsqmSY7 aoMOWHo4gYb1oZMf6pra7+otCOp6W8Q/HxwhEPdKc8wicgd2ZKnBuPa3b2iU0/qRTV6Z Y/dQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=hrLY43ae; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id q1-v6si8270182plb.14.2018.11.28.10.32.54; Wed, 28 Nov 2018 10:33:19 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=hrLY43ae; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728708AbeK2Fc3 (ORCPT + 99 others); Thu, 29 Nov 2018 00:32:29 -0500 Received: from mail-pg1-f195.google.com ([209.85.215.195]:33072 "EHLO mail-pg1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725761AbeK2Fc3 (ORCPT ); Thu, 29 Nov 2018 00:32:29 -0500 Received: by mail-pg1-f195.google.com with SMTP id z11so9876523pgu.0; Wed, 28 Nov 2018 10:29:55 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=XtrritqyCgXvd/Oac0XlhV9ZTEWanBaIrFkfpwFUgoI=; b=hrLY43aeQQctAtGXs6mvsCbS1JaUdjCeCQKH9Jd4/kyn3F2By200bXeF+JzmVXfB+9 OAsPOy/AkaCVaPb8fBYZX8qyr6crHsSIBtjEAxdbzGtOe+vYaZQ9KD1cO8CLDW31d1sS AwrYr9s1NK3XswCkKQ8eyIk231QfBFzzAh/QqE88UiKkl4x/PB7YWJPCtDs48JDpZf9N 2l75Aysh89DDhew7js8hV4q/Sd4eqqxXSyeErCes86pB7hK+ttTdmhLdUNPsnadQj6ES qtONysdhpb5sjrdnC80ugXTFeZqOyZbeGQclLTgnziseJs/fZPSUsW5jABZEcsn5zFTX XzmA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=XtrritqyCgXvd/Oac0XlhV9ZTEWanBaIrFkfpwFUgoI=; b=VHfW+Cho5AFXEEGF1DEEQ+w/xlv3npiRKLYXx6vx2vJPq6JuARZa3IQpc1o17GOWDN 7LzXX5zBtwAAOViOw8swRMCxGlGOLj/r1fUR9LL9Oguwv20xmB8ARh+bLFZ6CuSEA9QT wA50XmB3iWieqxtrNt/rB3QDPr+H4QH6yCTLplMxieO3CGrklqjYwPI6KSy+fbr6MEkg KmarCjYQstkuhasBDUl+Po0UTFYu1cPgI+zGrBmB5TaRl99Nrrf16y5fRuppyMRHeIyd CioTA533gKHTc0Hkk6T6eEEQiQavq62ytR01wKaqEoRNbdGl0fyxUrQTnrMMAb+Q1kFa IPug== X-Gm-Message-State: AGRZ1gLnEDRvofWmANjtBn4/F6dA4C2XLipcZL2NRj7OlnS7QF10/wWx 0er1nJyoAv0vN87VdlziUds= X-Received: by 2002:a62:1bd7:: with SMTP id b206mr38415056pfb.213.1543429794690; Wed, 28 Nov 2018 10:29:54 -0800 (PST) Received: from [10.33.114.204] ([66.170.99.1]) by smtp.gmail.com with ESMTPSA id b5sm16446863pfc.150.2018.11.28.10.29.53 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 28 Nov 2018 10:29:53 -0800 (PST) Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (Mac OS X Mail 12.1 \(3445.101.1\)) Subject: =?utf-8?Q?Re=3A_=5BPATCH_0/2=5D_Don=E2=80=99t_leave_executable_TL?= =?utf-8?Q?B_entries_to_freed_pages?= From: Nadav Amit In-Reply-To: <20181128095734.GA23467@arm.com> Date: Wed, 28 Nov 2018 10:29:52 -0800 Cc: Rick Edgecombe , Andrew Morton , Andy Lutomirski , linux-mm , LKML , Kernel Hardening , naveen.n.rao@linux.vnet.ibm.com, anil.s.keshavamurthy@intel.com, David Miller , Masami Hiramatsu , Steven Rostedt , Ingo Molnar , ast@kernel.org, Daniel Borkmann , jeyu@kernel.org, netdev@vger.kernel.org, Ard Biesheuvel , Jann Horn , kristen@linux.intel.com, dave.hansen@intel.com, deneen.t.dock@intel.com Content-Transfer-Encoding: quoted-printable Message-Id: References: <20181128000754.18056-1-rick.p.edgecombe@intel.com> <449E6648-5599-476D-8136-EE570101F930@gmail.com> <20181128095734.GA23467@arm.com> To: Will Deacon X-Mailer: Apple Mail (2.3445.101.1) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > On Nov 28, 2018, at 1:57 AM, Will Deacon wrote: >=20 > On Tue, Nov 27, 2018 at 05:21:08PM -0800, Nadav Amit wrote: >>> On Nov 27, 2018, at 5:06 PM, Nadav Amit = wrote: >>>=20 >>>> On Nov 27, 2018, at 4:07 PM, Rick Edgecombe = wrote: >>>>=20 >>>> Sometimes when memory is freed via the module subsystem, an = executable >>>> permissioned TLB entry can remain to a freed page. If the page is = re-used to >>>> back an address that will receive data from userspace, it can = result in user >>>> data being mapped as executable in the kernel. The root of this = behavior is >>>> vfree lazily flushing the TLB, but not lazily freeing the = underlying pages.=20 >>>>=20 >>>> There are sort of three categories of this which show up across = modules, bpf, >>>> kprobes and ftrace: >>>>=20 >>>> 1. When executable memory is touched and then immediatly freed >>>>=20 >>>> This shows up in a couple error conditions in the module loader and = BPF JIT >>>> compiler. >>>=20 >>> Interesting! >>>=20 >>> Note that this may cause conflict with "x86: avoid W^X being broken = during >>> modules loading=E2=80=9D, which I recently submitted. >>=20 >> I actually have not looked on the vmalloc() code too much recent, but = it >> seems =E2=80=A6 strange: >>=20 >> void vm_unmap_aliases(void) >> { =20 >>=20 >> ... >> mutex_lock(&vmap_purge_lock); >> purge_fragmented_blocks_allcpus(); >> if (!__purge_vmap_area_lazy(start, end) && flush) >> flush_tlb_kernel_range(start, end); >> mutex_unlock(&vmap_purge_lock); >> } >>=20 >> Since __purge_vmap_area_lazy() releases the memory, it seems there is = a time >> window between the release of the region and the TLB flush, in which = the >> area can be allocated for another purpose. This can result in a >> (theoretical) correctness issue. No? >=20 > If __purge_vmap_area_lazy() returns false, then it hasn't freed the = memory, > so we only invalidate the TLB if 'flush' is true in that case. If > __purge_vmap_area_lazy() returns true instead, then it takes care of = the TLB > invalidation before the freeing. Right. Sorry for my misunderstanding. Thanks, Nadav