Received: by 2002:a25:ca44:0:0:0:0:0 with SMTP id a65csp2172097ybg; Thu, 30 Jul 2020 12:11:54 -0700 (PDT) X-Google-Smtp-Source: ABdhPJz+OD7Vkq6DQISy1CWxwtlyxWUExVvk0NMld2Ya85OE+jAkDPN/o+tDUvqjRAiBaKMsxsTs X-Received: by 2002:a17:906:3291:: with SMTP id 17mr579262ejw.370.1596136313664; Thu, 30 Jul 2020 12:11:53 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1596136313; cv=none; d=google.com; s=arc-20160816; b=dMhXVfA99SCaIeCj0nIcBnCnk7ohDI6cIRrPya47IFVHaLI4aSBZ49ktLrNeOmN7sA NbF7Judi6AOs3NwxvNN8z6S3QmuHFMWcI0E482bSzrjJWzDCSoxhwVcD5MEhiBubICwP nUJRIhBxqIqiEANUCRFkr4buMuADaCom3MpttcwHoyRAcOq5/Shq2MBu4zC/GdDB96ZJ e41m7wY+HE4YwDpt8u4CuQE6duy4r6b5YW2MIVTbNQj9Q5I9RNKwXaE+fgbzzIkvh0kT c/8OnBAFY7fQFkB5WMS/vWPVoezIpc58hLkdg2INdTuuuo/xU7Qb1b7GghBGtF8ujw2p 44XA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=M/PjmAninS+4waGJS0k7TVpi9mX9COho3YSX6SxTl+Y=; b=uebBfHvfxvLF3cbQRn8fxTK9DgVFx6LNb3AeZ/zz/HLlwsUHdLpgQzP8V+gU0lTdHD Dg2wh0z7QniW5c1DUqs9IVk9qAgaLUWc4WFkZyX/tNYAPZE1G5SjrW7VKmNsu7RMPAlq LiBYx5Guv+K++1fcGJ7KRHosiAT6Hs9yWIkT4gr2SaEf1IeKCsF8Av2HxE7knEJI0gSH w/xOmm2CLiR3z0qQuueV3HS0Eat75ADKcEohf+do3h6iU+0aRu6rqY2l9w9mHxwCtSrB SigVNwkQsVAlDijFQUSqZoKjuMqLQ07MPZygHywX04ovYw/8v0XbOXfqRFBwPtvIoGnu cUiA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=LNxjwrKZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id ks17si3695471ejb.204.2020.07.30.12.11.31; Thu, 30 Jul 2020 12:11:53 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=LNxjwrKZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730446AbgG3TLU (ORCPT + 99 others); Thu, 30 Jul 2020 15:11:20 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42200 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730413AbgG3TLT (ORCPT ); Thu, 30 Jul 2020 15:11:19 -0400 Received: from mail-pj1-x1043.google.com (mail-pj1-x1043.google.com [IPv6:2607:f8b0:4864:20::1043]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 61746C061574; Thu, 30 Jul 2020 12:11:19 -0700 (PDT) Received: by mail-pj1-x1043.google.com with SMTP id t15so5427440pjq.5; Thu, 30 Jul 2020 12:11:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:in-reply-to:references :mime-version:content-transfer-encoding; bh=M/PjmAninS+4waGJS0k7TVpi9mX9COho3YSX6SxTl+Y=; b=LNxjwrKZpykbDtTMF6elkiLywrLlNwej5RqnOIPN9chduUY1rXrii9ppSPSGNy52sc UaDhUnw2sk2Bc5Vf+FR8lmHPvYASUzSYyDKjLqN07fF3YoDjNslS2OORe7gEuXkorzKo s+dbpIY1gNtoYsB7e6NyqlCpeYKOBvIxwE/X21v7eXTttPPYYjiez5kI/Q4XNZdw89uX c2cTznWG3JvnNhPkXw0jc1lWRQML8E1XDWjbSO+pT5UfWlcTQOzg578qZKqRznc042JJ 4v3kt75lTT4RFTa9oh3Af2pUNhrKgVOEC/A8KohZDvUwVvUy7KPKJ8fykKvzIa0tYoBl HBkA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references:mime-version:content-transfer-encoding; bh=M/PjmAninS+4waGJS0k7TVpi9mX9COho3YSX6SxTl+Y=; b=RfX/H5tRIzf17jfKVDZecL2oFIBpZfWNIaSJTC8iZWYQS5AvRRmQ/kL1gda7oI8gqo Jbuzv7IJ7EZAyn5RZL2jDoba1l6yvS/WDOYxrdXu89vkbIa/UhiMIbBD7LjriU0dlQSO 8y0Qab7miBOF5vshVVvJ7JCY6amaF3lOITWs2CMzpk9aD+Ph7e1BiE3Ec/k8D/YcjK95 P+uANfuK2b9WtkH00S1MD6Tj5fK9b2qbVakl8KVTa5G8infA5SR1OZjTdssTaGaIuP8w cARqi0bu6wFEiiI6SHgT+CSb8ScyrqwwAJjtj26h+I/Fzjc9pZBP+dkusmVuhPlq5Di9 H2yA== X-Gm-Message-State: AOAM531yOx69ZVrIRMOf1nPZx8YfzsbU3eq/SoWPzfX9IeNYRKvddVUx 9v2LBnF5ELVod5tu2/RqwPU= X-Received: by 2002:a62:84d5:: with SMTP id k204mr420518pfd.66.1596136278844; Thu, 30 Jul 2020 12:11:18 -0700 (PDT) Received: from nickserv.localdomain (c-98-33-101-203.hsd1.ca.comcast.net. [98.33.101.203]) by smtp.gmail.com with ESMTPSA id o22sm8644976pfu.206.2020.07.30.12.11.17 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 30 Jul 2020 12:11:18 -0700 (PDT) From: Nick Terrell To: Borislav Petkov , Thomas Gleixner Cc: linux-kernel@vger.kernel.org, Chris Mason , linux-kbuild@vger.kernel.org, x86@kernel.org, gregkh@linuxfoundation.org, Petr Malat , Kees Cook , Kernel Team , Nick Terrell , Adam Borowski , Patrick Williams , rmikey@fb.com, mingo@kernel.org, Patrick Williams , Sedat Dilek , Norbert Lange , Andrew Morton , Alex Xu , Masahiro Yamada , Nick Terrell Subject: [PATCH v10 1/8] lib: prepare zstd for preboot environment Date: Thu, 30 Jul 2020 12:08:34 -0700 Message-Id: <20200730190841.2071656-2-nickrterrell@gmail.com> X-Mailer: git-send-email 2.27.0 In-Reply-To: <20200730190841.2071656-1-nickrterrell@gmail.com> References: <20200730190841.2071656-1-nickrterrell@gmail.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Nick Terrell * Remove a double definition of the CHECK_F macro when the zstd library is amalgamated. * Switch ZSTD_copy8() to __builtin_memcpy(), because in the preboot environment on x86 gcc can't inline `memcpy()` otherwise. * Limit the gcc hack in ZSTD_wildcopy() to the broken gcc version. See https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81388. These changes are necessary to get the build to work in the preboot environment, and to get reasonable performance. ZSTD_copy8() and ZSTD_wildcopy() are in the core of the zstd hot loop. So outlining these calls to memcpy(), and having an extra branch are very detrimental to performance. Reviewed-by: Kees Cook Tested-by: Sedat Dilek Signed-off-by: Nick Terrell --- lib/zstd/fse_decompress.c | 9 +-------- lib/zstd/zstd_internal.h | 14 ++++++++++++-- 2 files changed, 13 insertions(+), 10 deletions(-) diff --git a/lib/zstd/fse_decompress.c b/lib/zstd/fse_decompress.c index a84300e5a013..0b353530fb3f 100644 --- a/lib/zstd/fse_decompress.c +++ b/lib/zstd/fse_decompress.c @@ -47,6 +47,7 @@ ****************************************************************/ #include "bitstream.h" #include "fse.h" +#include "zstd_internal.h" #include #include #include /* memcpy, memset */ @@ -60,14 +61,6 @@ enum { FSE_static_assert = 1 / (int)(!!(c)) }; \ } /* use only *after* variable declarations */ -/* check and forward error code */ -#define CHECK_F(f) \ - { \ - size_t const e = f; \ - if (FSE_isError(e)) \ - return e; \ - } - /* ************************************************************** * Templates ****************************************************************/ diff --git a/lib/zstd/zstd_internal.h b/lib/zstd/zstd_internal.h index 1a79fab9e13a..dac753397f86 100644 --- a/lib/zstd/zstd_internal.h +++ b/lib/zstd/zstd_internal.h @@ -127,7 +127,14 @@ static const U32 OF_defaultNormLog = OF_DEFAULTNORMLOG; * Shared functions to include for inlining *********************************************/ ZSTD_STATIC void ZSTD_copy8(void *dst, const void *src) { - memcpy(dst, src, 8); + /* + * zstd relies heavily on gcc being able to analyze and inline this + * memcpy() call, since it is called in a tight loop. Preboot mode + * is compiled in freestanding mode, which stops gcc from analyzing + * memcpy(). Use __builtin_memcpy() to tell gcc to analyze this as a + * regular memcpy(). + */ + __builtin_memcpy(dst, src, 8); } /*! ZSTD_wildcopy() : * custom version of memcpy(), can copy up to 7 bytes too many (8 bytes if length==0) */ @@ -137,13 +144,16 @@ ZSTD_STATIC void ZSTD_wildcopy(void *dst, const void *src, ptrdiff_t length) const BYTE* ip = (const BYTE*)src; BYTE* op = (BYTE*)dst; BYTE* const oend = op + length; - /* Work around https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81388. +#if defined(GCC_VERSION) && GCC_VERSION >= 70000 && GCC_VERSION < 70200 + /* + * Work around https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81388. * Avoid the bad case where the loop only runs once by handling the * special case separately. This doesn't trigger the bug because it * doesn't involve pointer/integer overflow. */ if (length <= 8) return ZSTD_copy8(dst, src); +#endif do { ZSTD_copy8(op, ip); op += 8; -- 2.27.0