Received: by 2002:a17:90a:9103:0:0:0:0 with SMTP id k3csp11810512pjo; Thu, 2 Jan 2020 14:40:46 -0800 (PST) X-Google-Smtp-Source: APXvYqzuRRbDQoMPMb+HOCIpKp/aXcJQIkkt9jQR7lcQvB4sONIpo5DYtvO+G/SULbg7KSzd5VCK X-Received: by 2002:a05:6830:1199:: with SMTP id u25mr89397751otq.344.1578004846162; Thu, 02 Jan 2020 14:40:46 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1578004846; cv=none; d=google.com; s=arc-20160816; b=Ow+KC/TBzbmkDzF/uwM1H1ezxKDh118pAa3ZnUobolN+0arW2UH8u8TR3QXRPXsYl9 MXAJPjBUC0RXFI/ra14rZBBISV0rCSMKsJ9vmWqnWukiRQsSs9UURbS+MBNUGbgMYuSD P7ncb2P/5KAqXta1/aL4G8ldf/xbmNPC3Omuj+1TaEguV2hbkBdm4XhfOf/xDlhIIrt1 lHZ+GIRpzL8aq6QPbtrpiSiiTa7CzRGFRANW1kg5+I5veoSkJDJpsoxjfTIBJBH5sEhw SixohTxHx0100uEXUGBtPcETSchzY39W9oGF+iSiMykuOr1mf1t7fQ8rpIyXBQGEcrXk ausQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=HmBgBGzghSJ4YLYJYLMMMHfN3XF6BSJ87wDEfHfxVZE=; b=Quo0CBtQenT2Fklo8vf4hTCxGWN/tbyVsoB9iErLaTpo4lzblcYbwKIt3ARBTYNiHT WTO0IQiuabebccc2VPGCfWl61Pz5P5GZhoevr5iKn+nsRlOOtb5uacfRKE3nVwb/4WMv aeskb/5b35aNfMD/352Q/Gg8RaUTGYWHlbU52EzksH/biHtN1nDzjh2e/qyIrCOdKeW9 3cdaFUSUcRr7e2WjWqmDwKLRcB7VjCxOH+vGZ3U6UE2V55sj3a9A+QIcjyx9UhWvKJws wYejyTC3UmnwiXz7jxpom6bCAyXi87TrfEZvjduLUM9UucQojq5z4BgtGxL54GSw9+Gd u/eQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=ehH69hVV; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c22si36923868oth.167.2020.01.02.14.40.33; Thu, 02 Jan 2020 14:40:46 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=ehH69hVV; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1731361AbgABWit (ORCPT + 99 others); Thu, 2 Jan 2020 17:38:49 -0500 Received: from mail.kernel.org ([198.145.29.99]:53460 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1731354AbgABWir (ORCPT ); Thu, 2 Jan 2020 17:38:47 -0500 Received: from localhost (83-86-89-107.cable.dynamic.v4.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 29FBB21835; Thu, 2 Jan 2020 22:38:46 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1578004726; bh=AsgN12WOBTB9X5A+bu6+gH2DzaMQ1n3a6A3Pg8Z64Vw=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=ehH69hVVDgJyC1Xf51EkPU9zGhEXzhZH0jwVF5baUXDFLeKyTvK1zy4ot9PblvffS oOINPZtOsqnagppylvLBXKB/yX6yTgRlfyo5jgAJmLu2AiVjDq7g6baqzhqTcBEfnb Hc/EWr4vHVIYC8j5HUoulNhpqbzo/u/5W5f3IQuo= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Alexander Viro , Jann Horn , "Eric W. Biederman" , Linus Torvalds , Siddharth Chandrasekaran Subject: [PATCH 4.4 128/137] Make filldir[64]() verify the directory entry filename is valid Date: Thu, 2 Jan 2020 23:08:21 +0100 Message-Id: <20200102220604.355269800@linuxfoundation.org> X-Mailer: git-send-email 2.24.1 In-Reply-To: <20200102220546.618583146@linuxfoundation.org> References: <20200102220546.618583146@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Linus Torvalds commit 8a23eb804ca4f2be909e372cf5a9e7b30ae476cd upstream. This has been discussed several times, and now filesystem people are talking about doing it individually at the filesystem layer, so head that off at the pass and just do it in getdents{64}(). This is partially based on a patch by Jann Horn, but checks for NUL bytes as well, and somewhat simplified. There's also commentary about how it might be better if invalid names due to filesystem corruption don't cause an immediate failure, but only an error at the end of the readdir(), so that people can still see the filenames that are ok. There's also been discussion about just how much POSIX strictly speaking requires this since it's about filesystem corruption. It's really more "protect user space from bad behavior" as pointed out by Jann. But since Eric Biederman looked up the POSIX wording, here it is for context: "From readdir: The readdir() function shall return a pointer to a structure representing the directory entry at the current position in the directory stream specified by the argument dirp, and position the directory stream at the next entry. It shall return a null pointer upon reaching the end of the directory stream. The structure dirent defined in the header describes a directory entry. From definitions: 3.129 Directory Entry (or Link) An object that associates a filename with a file. Several directory entries can associate names with the same file. ... 3.169 Filename A name consisting of 1 to {NAME_MAX} bytes used to name a file. The characters composing the name may be selected from the set of all character values excluding the slash character and the null byte. The filenames dot and dot-dot have special meaning. A filename is sometimes referred to as a 'pathname component'." Note that I didn't bother adding the checks to any legacy interfaces that nobody uses. Also note that if this ends up being noticeable as a performance regression, we can fix that to do a much more optimized model that checks for both NUL and '/' at the same time one word at a time. We haven't really tended to optimize 'memchr()', and it only checks for one pattern at a time anyway, and we really _should_ check for NUL too (but see the comment about "soft errors" in the code about why it currently only checks for '/') See the CONFIG_DCACHE_WORD_ACCESS case of hash_name() for how the name lookup code looks for pathname terminating characters in parallel. Link: https://lore.kernel.org/lkml/20190118161440.220134-2-jannh@google.com/ Cc: Alexander Viro Cc: Jann Horn Cc: Eric W. Biederman Signed-off-by: Linus Torvalds Signed-off-by: Siddharth Chandrasekaran Signed-off-by: Greg Kroah-Hartman --- fs/readdir.c | 40 ++++++++++++++++++++++++++++++++++++++++ 1 file changed, 40 insertions(+) --- a/fs/readdir.c +++ b/fs/readdir.c @@ -51,6 +51,40 @@ out: EXPORT_SYMBOL(iterate_dir); /* + * POSIX says that a dirent name cannot contain NULL or a '/'. + * + * It's not 100% clear what we should really do in this case. + * The filesystem is clearly corrupted, but returning a hard + * error means that you now don't see any of the other names + * either, so that isn't a perfect alternative. + * + * And if you return an error, what error do you use? Several + * filesystems seem to have decided on EUCLEAN being the error + * code for EFSCORRUPTED, and that may be the error to use. Or + * just EIO, which is perhaps more obvious to users. + * + * In order to see the other file names in the directory, the + * caller might want to make this a "soft" error: skip the + * entry, and return the error at the end instead. + * + * Note that this should likely do a "memchr(name, 0, len)" + * check too, since that would be filesystem corruption as + * well. However, that case can't actually confuse user space, + * which has to do a strlen() on the name anyway to find the + * filename length, and the above "soft error" worry means + * that it's probably better left alone until we have that + * issue clarified. + */ +static int verify_dirent_name(const char *name, int len) +{ + if (WARN_ON_ONCE(!len)) + return -EIO; + if (WARN_ON_ONCE(memchr(name, '/', len))) + return -EIO; + return 0; +} + +/* * Traditional linux readdir() handling.. * * "count=1" is a special case, meaning that the buffer is one @@ -159,6 +193,9 @@ static int filldir(struct dir_context *c int reclen = ALIGN(offsetof(struct linux_dirent, d_name) + namlen + 2, sizeof(long)); + buf->error = verify_dirent_name(name, namlen); + if (unlikely(buf->error)) + return buf->error; buf->error = -EINVAL; /* only used if we fail.. */ if (reclen > buf->count) return -EINVAL; @@ -243,6 +280,9 @@ static int filldir64(struct dir_context int reclen = ALIGN(offsetof(struct linux_dirent64, d_name) + namlen + 1, sizeof(u64)); + buf->error = verify_dirent_name(name, namlen); + if (unlikely(buf->error)) + return buf->error; buf->error = -EINVAL; /* only used if we fail.. */ if (reclen > buf->count) return -EINVAL;