Received: by 2002:a05:6902:102b:0:0:0:0 with SMTP id x11csp1713840ybt; Thu, 25 Jun 2020 12:17:15 -0700 (PDT) X-Google-Smtp-Source: ABdhPJy97mX/6u9CVxUrQWzflRdUp2mzMghbTc/7TwH+kNa2ZGexMOXPLkgArYVJNHYbxv+D5EZ3 X-Received: by 2002:a50:d790:: with SMTP id w16mr32281928edi.231.1593112635521; Thu, 25 Jun 2020 12:17:15 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1593112635; cv=none; d=google.com; s=arc-20160816; b=qZwL53a89roa3JEKYDKm4knH0kWpO1jVmbdwrIWisX3T8pt7j2Sbc28KOKPhPw1vCx ui6Eje6F03UNY/CvIoM1RovYTkLw2Ea3jRsAnAjA1V1M/JjXo1gHWrm061ufHBDzuopY Y9erQeayETQrSVOB+EHfY1Pc5FsxfX91OXfNuIVylLfMjk6squGIUhO47E1WMksnbHca KV6197reew/0JJ/Hp0J6RQpHIOzd/S9214w3aYog0OKdeCn+aa4wUN5VaGZ6FsTL1c8c 4/IEHtkUppnUgGlVe4GBAmaG2//Yel0VwGosoDJzLNLTU02IL+dLujzeiTTtxPESIsOP x5Sg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:cc:to:from :subject:references:mime-version:message-id:in-reply-to:date :dkim-signature; bh=KSzIS4R7pLrhz4aCSES8O8YLNhHnwl2dEbCnP+qYXsc=; b=Q585G4Tdsgy6iH5WY2EbmTeo+p68mzrepGvUSmGhyqp4TxPIAXwZah/PCTHyYmDCTY f3q8HhrbpKiP3u5hkwaRIpYr2qZxCk9x763FPjstbTv2QO7LexPLbj/LaPWoyY0ZaIji mYG/dkSjnA5qedc5yQoeIG3CIE3X2vkTr7Y40yqWxiV7hJOLBpcn7jt+iACNWz3C+bxW 9WIDgkiDB9MxiI2RbdUOSKbHOwKvQlb0WGD3AkjOG5GFxih18m5Hief/ZPHKzP3ocLTM E7aA1ygokzducNsxB6YnNpBKxY/kDSNScBOE7JDyC2fEzyp7lI8/bVxZWLY7lb2yhPNw /Oxw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=knKRqtKF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id q3si7904380edn.31.2020.06.25.12.16.50; Thu, 25 Jun 2020 12:17:15 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=knKRqtKF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2403780AbgFYSsu (ORCPT + 99 others); Thu, 25 Jun 2020 14:48:50 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:58988 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2390875AbgFYSst (ORCPT ); Thu, 25 Jun 2020 14:48:49 -0400 Received: from mail-qt1-x849.google.com (mail-qt1-x849.google.com [IPv6:2607:f8b0:4864:20::849]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6DC0AC08C5DB for ; Thu, 25 Jun 2020 11:48:49 -0700 (PDT) Received: by mail-qt1-x849.google.com with SMTP id g6so4686796qtr.0 for ; Thu, 25 Jun 2020 11:48:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=date:in-reply-to:message-id:mime-version:references:subject:from:to :cc:content-transfer-encoding; bh=KSzIS4R7pLrhz4aCSES8O8YLNhHnwl2dEbCnP+qYXsc=; b=knKRqtKFatSe2XltGnadadXYJpHZawUTiMMobs3BChNf5NM/0u4nk3w1QtVHi/2TGJ MGNBG0TtVcq9viEuR8zd0jqEo96IcfEx2gHSvdbcB3ckYEBrQpTXPq6tK7H3viZFPRCh W+tC3F45N6UTYj0fflhVzOdIoHvpOoyhbYthTQDmVfNUqHx0tGPh7vhaiXJwh7WiaksG 7Kf5IMLzUrlNU6YJQr6tdlMkp4qTejHrXN7RwXjbYezMfBQiMV6OqL4xP8Ruhg7l7wC+ 7t2aLAeRG5q6VH6HMvzX+Eg6XRp5dSUfVkt84VBZgheT20HcdfWZHWgyMfgVe44smHLm eYww== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:in-reply-to:message-id:mime-version :references:subject:from:to:cc:content-transfer-encoding; bh=KSzIS4R7pLrhz4aCSES8O8YLNhHnwl2dEbCnP+qYXsc=; b=YWbQbjVmNCm7PrOMsaQQqYKETrVfCeKDLZ5KSSjHRPmWKUWGcmpEOtZazrQl+Oj+f1 pvPRxrkkrbfoUalGEr4S5gsH6lrLBadLVCZyRhii8UBNstMRK8urBs9g0ecuXi1QKg0e 3mvtRWqcVs9ie8TXJ6MNT06s2EwZ/4oU7CnZ6izKIEvdme9eoF6ocE7wEnynG+kGAgjg evMxCCFVEXS3Gk6dljQxEc0Yt0Z8uWGBUWnVFLWALAN2g3IHcA1bcBvs03Zk7aiSUkuK kP/nBAncFSYB8h7nXrN1Y/JxH8VD22m9coO9PFTrtDYnoRH2vt7Betqs4PlGn7MaJHcI pXaQ== X-Gm-Message-State: AOAM533cnm7xhpoKF1QnI5L5kRAdkqTjPSXQOZ6dCOx2sV34+MQ6Issv 0DxEHAscByhhcz5IKABGmxtJ+TNZEu8eIAlJJSg= X-Received: by 2002:ad4:44a6:: with SMTP id n6mr2687847qvt.113.1593110928539; Thu, 25 Jun 2020 11:48:48 -0700 (PDT) Date: Thu, 25 Jun 2020 11:47:52 -0700 In-Reply-To: <20200622231536.7jcshis5mdn3vr54@google.com> Message-Id: <20200625184752.73095-1-ndesaulniers@google.com> Mime-Version: 1.0 References: <20200622231536.7jcshis5mdn3vr54@google.com> X-Mailer: git-send-email 2.27.0.111.gc72c7da667-goog Subject: [PATCH v2] vmlinux.lds: add PGO and AutoFDO input sections From: Nick Desaulniers To: Arnd Bergmann Cc: "=?UTF-8?q?F=C4=81ng-ru=C3=AC=20S=C3=B2ng?=" , Nick Desaulniers , stable@vger.kernel.org, Jian Cai , Luis Lozano , Manoj Gupta , linux-arch@vger.kernel.org, linux-kernel@vger.kernel.org, clang-built-linux@googlegroups.com Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Basically, consider .text.{hot|unlikely|unknown}.* part of .text, too. When compiling with profiling information (collected via PGO instrumentations or AutoFDO sampling), Clang will separate code into .text.hot, .text.unlikely, or .text.unknown sections based on profiling information. After D79600 (clang-11), these sections will have a trailing `.` suffix, ie. .text.hot., .text.unlikely., .text.unknown.. When using -ffunction-sections together with profiling infomation, either explicitly (FGKASLR) or implicitly (LTO), code may be placed in sections following the convention: .text.hot., .text.unlikely., .text.unknown. where , , and are functions. (This produces one section per function; we generally try to merge these all back via linker script so that we don't have 50k sections). For the above cases, we need to teach our linker scripts that such sections might exist and that we'd explicitly like them grouped together, otherwise we can wind up with code outside of the _stext/_etext boundaries that might not be mapped properly for some architectures, resulting in boot failures. If the linker script is not told about possible input sections, then where the section is placed as output is a heuristic-laiden mess that's non-portable between linkers (ie. BFD and LLD), and has resulted in many hard to debug bugs. Kees Cook is working on cleaning this up by adding --orphan-handling=3Dwarn linker flag used in ARCH=3Dpowerpc to additional architectures. In the case of linker scripts, borrowing from the Zen of Python: explicit is better than implicit. Also, ld.bfd's internal linker script considers .text.hot AND .text.hot.* to be part of .text, as well as .text.unlikely and .text.unlikely.*. I didn't see support for .text.unknown.*, and didn't see Clang producing such code in our kernel builds, but I see code in LLVM that can produce such section names if profiling information is missing. That may point to a larger issue with generating or collecting profiles, but I would much rather be safe and explicit than have to debug yet another issue related to orphan section placement. Cc: stable@vger.kernel.org Link: https://sourceware.org/git/?p=3Dbinutils-gdb.git;a=3Dcommitdiff;h=3Da= dd44f8d5c5c05e08b11e033127a744d61c26aee Link: https://sourceware.org/git/?p=3Dbinutils-gdb.git;a=3Dcommitdiff;h=3D1= de778ed23ce7492c523d5850c6c6dbb34152655 Link: https://reviews.llvm.org/D79600 Link: https://bugs.chromium.org/p/chromium/issues/detail?id=3D1084760 Reported-by: Jian Cai Debugged-by: Luis Lozano Suggested-by: F=C4=81ng-ru=C3=AC S=C3=B2ng Tested-by: Luis Lozano Tested-by: Manoj Gupta Signed-off-by: Nick Desaulniers --- Changes V1 -> V2: * Add .text.unknown.*. It's not strictly necessary for us yet, but I really worry that it could become a problem for us. Either way, I'm happy to drop for a V3, but I'm suggesting we not. * Beef up commit message. * Drop references to LLD; the LLVM change had nothing to do with LLD. I've realized I have a Pavlovian-response to changes from F=C4=81ng-ru=C3= =AC that I associate with LLD. I'm seeking professional help for my ailment. Forgive me. * Add link to now public CrOS bug. include/asm-generic/vmlinux.lds.h | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/include/asm-generic/vmlinux.lds.h b/include/asm-generic/vmlinu= x.lds.h index d7c7c7f36c4a..245c1af4c057 100644 --- a/include/asm-generic/vmlinux.lds.h +++ b/include/asm-generic/vmlinux.lds.h @@ -560,7 +560,10 @@ */ #define TEXT_TEXT \ ALIGN_FUNCTION(); \ - *(.text.hot TEXT_MAIN .text.fixup .text.unlikely) \ + *(.text.hot .text.hot.*) \ + *(TEXT_MAIN .text.fixup) \ + *(.text.unlikely .text.unlikely.*) \ + *(.text.unknown .text.unknown.*) \ NOINSTR_TEXT \ *(.text..refcount) \ *(.ref.text) \ --=20 2.27.0.111.gc72c7da667-goog