Received: by 2002:ac0:950c:0:0:0:0:0 with SMTP id f12csp2111883imc; Tue, 12 Mar 2019 07:18:39 -0700 (PDT) X-Google-Smtp-Source: APXvYqzOtOl8b5++2571Us46byofKU2zBRzxmop1TJObdCq/cGOx1XSy7bcvqTnTsTJMJWDeyB1o X-Received: by 2002:a63:1925:: with SMTP id z37mr35012942pgl.67.1552400319395; Tue, 12 Mar 2019 07:18:39 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1552400319; cv=none; d=google.com; s=arc-20160816; b=vjj7sfbrwwFvQ3XBa/iM77sjVI6j37dICvU983PpchAz2ZPYwBz4XbJ3kfUT6g7b5t lI80ztZRB8e9nfJ9zmwsrfvDl6Dymlr65RH6N7atKvu8awnBM8UQBL6kLMeM7xtcb0Cu m1oF4R9QLAaZ8yrIIQMCwvDEXw7XA6HNZE5zGAYNHPWghgAhSeqYaTZRgvsH0qp9U+4/ 5tXUJrwS8RrkyyhIe3oCzv+7s1+1aAF9T4BPGGjMt6lrzlt1ZO3MqKHJuzs66dO2FYTD DuL+KH0zdGhmqZnWui/FFXxx1qzHy4no8j+dfIOsUjnviKGSf8zZAXeUHXig8S2xI5j/ LDxw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=JqJ7+Ah6HvL9avHk7AQXUYUEQBRFVjWD/VGM0xcL36s=; b=QzdPRDxsAnlz20W4rHErV1GkFb0+EeoLMcscy+1cBajTOq06lC34hPQ9hU7J8vhRND GT0pvSjRRMj26ZWaSX9sjdQ25Kbp1JG44qxs1MlEVJ+oj8H2g1baYXINv1iQvTpHBDsL N8S0CmWBoTtlshMl0g94svRYZSvrrmvqOyOOykOaCTqPp8nLrdA5rTRaF4hj44DVxmHm XkBZ+u+1WwH3as8UQgLrErBkjWMBpxyY5Eq1Jg0tyAHyU5N0md/N3b4B8tQuTZu/oZkT gMJ3Dt/ENHh6SmYlbfzUdKmLajdhj7M6CNE4gWNAD+jJjzjPkeOWbLQNWqfGhhu4tBPp irVA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id h69si8334391pfc.120.2019.03.12.07.18.23; Tue, 12 Mar 2019 07:18:39 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726854AbfCLOSA (ORCPT + 99 others); Tue, 12 Mar 2019 10:18:00 -0400 Received: from mx2.suse.de ([195.135.220.15]:57756 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726738AbfCLORz (ORCPT ); Tue, 12 Mar 2019 10:17:55 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id A6EB2B644; Tue, 12 Mar 2019 14:17:52 +0000 (UTC) From: Vlastimil Babka To: Andrew Morton Cc: Linus Torvalds , Jann Horn , Michal Hocko , linux-mm@kvack.org, linux-kernel@vger.kernel.org, linux-api@vger.kernel.org, Jiri Kosina , Dominique Martinet , Andy Lutomirski , Dave Chinner , Kevin Easton , Matthew Wilcox , Cyril Hrubis , Tejun Heo , "Kirill A . Shutemov" , Daniel Gruss , Vlastimil Babka , Josh Snyder , Michal Hocko , Jiri Kosina Subject: [PATCH v2 1/2] mm/mincore: make mincore() more conservative Date: Tue, 12 Mar 2019 15:17:07 +0100 Message-Id: <20190312141708.6652-2-vbabka@suse.cz> X-Mailer: git-send-email 2.20.1 In-Reply-To: <20190312141708.6652-1-vbabka@suse.cz> References: <20190130124420.1834-1-vbabka@suse.cz> <20190312141708.6652-1-vbabka@suse.cz> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Jiri Kosina The semantics of what mincore() considers to be resident is not completely clear, but Linux has always (since 2.3.52, which is when mincore() was initially done) treated it as "page is available in page cache". That's potentially a problem, as that [in]directly exposes meta-information about pagecache / memory mapping state even about memory not strictly belonging to the process executing the syscall, opening possibilities for sidechannel attacks. Change the semantics of mincore() so that it only reveals pagecache information for non-anonymous mappings that belog to files that the calling process could (if it tried to) successfully open for writing; otherwise we'd be including shared non-exclusive mappings, which - is the sidechannel - is not the usecase for mincore(), as that's primarily used for data, not (shared) text [mhocko@suse.com: restructure can_do_mincore() conditions] Originally-by: Linus Torvalds Originally-by: Dominique Martinet Cc: Dominique Martinet Cc: Andy Lutomirski Cc: Dave Chinner Cc: Kevin Easton Cc: Matthew Wilcox Cc: Cyril Hrubis Cc: Tejun Heo Cc: Kirill A. Shutemov Cc: Daniel Gruss Signed-off-by: Jiri Kosina Signed-off-by: Vlastimil Babka Acked-by: Josh Snyder Acked-by: Michal Hocko Signed-off-by: Jiri Kosina Signed-off-by: Vlastimil Babka --- mm/mincore.c | 23 ++++++++++++++++++++++- 1 file changed, 22 insertions(+), 1 deletion(-) diff --git a/mm/mincore.c b/mm/mincore.c index 218099b5ed31..c3f058bd0faf 100644 --- a/mm/mincore.c +++ b/mm/mincore.c @@ -169,6 +169,22 @@ static int mincore_pte_range(pmd_t *pmd, unsigned long addr, unsigned long end, return 0; } +static inline bool can_do_mincore(struct vm_area_struct *vma) +{ + if (vma_is_anonymous(vma)) + return true; + if (!vma->vm_file) + return false; + /* + * Reveal pagecache information only for non-anonymous mappings that + * correspond to the files the calling process could (if tried) open + * for writing; otherwise we'd be including shared non-exclusive + * mappings, which opens a side channel. + */ + return inode_owner_or_capable(file_inode(vma->vm_file)) || + inode_permission(file_inode(vma->vm_file), MAY_WRITE) == 0; +} + /* * Do a chunk of "sys_mincore()". We've already checked * all the arguments, we hold the mmap semaphore: we should @@ -189,8 +205,13 @@ static long do_mincore(unsigned long addr, unsigned long pages, unsigned char *v vma = find_vma(current->mm, addr); if (!vma || addr < vma->vm_start) return -ENOMEM; - mincore_walk.mm = vma->vm_mm; end = min(vma->vm_end, addr + (pages << PAGE_SHIFT)); + if (!can_do_mincore(vma)) { + unsigned long pages = DIV_ROUND_UP(end - addr, PAGE_SIZE); + memset(vec, 1, pages); + return pages; + } + mincore_walk.mm = vma->vm_mm; err = walk_page_range(addr, end, &mincore_walk); if (err < 0) return err; -- 2.20.1