Received: by 10.213.65.68 with SMTP id h4csp840540imn; Tue, 20 Mar 2018 17:21:36 -0700 (PDT) X-Google-Smtp-Source: AG47ELuxf2U2+IldtUBIi1dh0d7Kte+X3XZ4dw93E2Y/4Pwkynu6x4fLC0Mfx2TH+DigLUE5k6Ed X-Received: by 2002:a17:902:6ac1:: with SMTP id i1-v6mr15447628plt.152.1521591696659; Tue, 20 Mar 2018 17:21:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1521591696; cv=none; d=google.com; s=arc-20160816; b=K5mdL6hnf6LqBB0S5bPzzwZMZix+pWTeSST/cBeN69qq7955G6C6s4MBg6RGKH7YEf ysGcz8vRZwQCpBWn9vkUVwiaemO1x1kBeSoRt0VDlamLXhI1wfWut94gJ8jtjlagoD1e rbGjI0zybTSwG7iu8tEKmgbCwdFzMGiVJoIiuAnOd/FfM8JIJGyQ2RHNKL4Q5H7xRa9J 6NBISSIS/R3Zmv4JPRicxtSXy9kwdcJiuMdABIpzG2s+C5emjoN6w0o2j5P3UZ+j1ZJD CDgh8xgVZQ+wUFmfyfua/BvbKTSTMSMaZxMcnVfkVbXfx81xxh42swc4HfMhMmSjb4mR IC1A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:cc:to:from :arc-authentication-results; bh=oYlog8Z1dm6cazABz390ROxOKEInaMuPHS5gxsHz79U=; b=FDtQMR97r9JwMxLXLpGuFIV0esxpgB9f1FdVGD+gkQQJV1i3wMq9h/i2gmkhdiAM87 Y0jAGP1CLINArASY0RAKOsquoXDD6JRZALIIq6/09Ce69itpWvvlYuKhhVg+FqcbQGT3 1vK/29Z8XKaVSQdiuOR55jUTlMHNwgvMqBdLgi5CwWMZFmkPic5cfM947x2qRmVy2sdb WnSdjSjTNLnL1GAVbPP0yfyejeS/LtVj37ePgKQhijMQW0EQl+PbcMmdiKICse8wFKQM 6knqfeOTb3R7dwBAPYjDCULl9cfHiYPxrCcpUeF5aNAvaOqJVbXSAM8nBAPm9g4au8sz NEEw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id s9si1931958pgc.202.2018.03.20.17.21.22; Tue, 20 Mar 2018 17:21:36 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751549AbeCUASd (ORCPT + 99 others); Tue, 20 Mar 2018 20:18:33 -0400 Received: from www62.your-server.de ([213.133.104.62]:60527 "EHLO www62.your-server.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751018AbeCUASc (ORCPT ); Tue, 20 Mar 2018 20:18:32 -0400 Received: from [62.202.221.10] (helo=localhost) by www62.your-server.de with esmtpsa (TLSv1.2:DHE-RSA-AES256-GCM-SHA384:256) (Exim 4.85_2) (envelope-from ) id 1eyRSY-00057m-Gy; Wed, 21 Mar 2018 01:18:30 +0100 From: Daniel Borkmann To: torvalds@linux-foundation.org Cc: ast@kernel.org, psodagud@codeaurora.org, netdev@vger.kernel.org, Daniel Borkmann , Chenbo Feng , Richard Smith , Chandler Carruth , linux-kernel@vger.kernel.org Subject: [PATCH] kbuild: disable clang's default use of -fmerge-all-constants Date: Wed, 21 Mar 2018 01:18:24 +0100 Message-Id: <20180321001824.8395-1-daniel@iogearbox.net> X-Mailer: git-send-email 2.9.5 X-Authenticated-Sender: daniel@iogearbox.net X-Virus-Scanned: Clear (ClamAV 0.99.3/24405/Wed Mar 21 00:08:08 2018) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Prasad reported that he has seen crashes in BPF subsystem with netd on Android with arm64 in the form of (note, the taint is unrelated): [ 4134.721483] Unable to handle kernel paging request at virtual address 800000001 [ 4134.820925] Mem abort info: [ 4134.901283] Exception class = DABT (current EL), IL = 32 bits [ 4135.016736] SET = 0, FnV = 0 [ 4135.119820] EA = 0, S1PTW = 0 [ 4135.201431] Data abort info: [ 4135.301388] ISV = 0, ISS = 0x00000021 [ 4135.359599] CM = 0, WnR = 0 [ 4135.470873] user pgtable: 4k pages, 39-bit VAs, pgd = ffffffe39b946000 [ 4135.499757] [0000000800000001] *pgd=0000000000000000, *pud=0000000000000000 [ 4135.660725] Internal error: Oops: 96000021 [#1] PREEMPT SMP [ 4135.674610] Modules linked in: [ 4135.682883] CPU: 5 PID: 1260 Comm: netd Tainted: G S W 4.14.19+ #1 [ 4135.716188] task: ffffffe39f4aa380 task.stack: ffffff801d4e0000 [ 4135.731599] PC is at bpf_prog_add+0x20/0x68 [ 4135.741746] LR is at bpf_prog_inc+0x20/0x2c [ 4135.751788] pc : [] lr : [] pstate: 60400145 [ 4135.769062] sp : ffffff801d4e3ce0 [...] [ 4136.258315] Process netd (pid: 1260, stack limit = 0xffffff801d4e0000) [ 4136.273746] Call trace: [...] [ 4136.442494] 3ca0: ffffff94ab7ad584 0000000060400145 ffffffe3a01bf8f8 0000000000000006 [ 4136.460936] 3cc0: 0000008000000000 ffffff94ab844204 ffffff801d4e3cf0 ffffff94ab7ad584 [ 4136.479241] [] bpf_prog_add+0x20/0x68 [ 4136.491767] [] bpf_prog_inc+0x20/0x2c [ 4136.504536] [] bpf_obj_get_user+0x204/0x22c [ 4136.518746] [] SyS_bpf+0x5a8/0x1a88 Android's netd was basically pinning the uid cookie BPF map in BPF fs (/sys/fs/bpf/traffic_cookie_uid_map) and later on retrieving it again resulting in above panic. Issue is that the map was wrongly identified as a prog! Above kernel was compiled with clang 4.0, and it turns out that clang decided to merge the bpf_prog_iops and bpf_map_iops into a single memory location, such that the two i_ops could then not be distinguished anymore. Reason for this miscompilation is that clang has the more aggressive -fmerge-all-constants enabled by default. In fact, clang source code has a comment about it in lib/AST/ExprConstant.cpp on why it is okay to do so: Pointers with different bases cannot represent the same object. (Note that clang defaults to -fmerge-all-constants, which can lead to inconsistent results for comparisons involving the address of a constant; this generally doesn't matter in practice.) The issue never appeared with gcc however, since gcc does not enable -fmerge-all-constants by default and even *explicitly* states in it's option description that using this flag results in non-conforming behavior, quote from man gcc: Languages like C or C++ require each variable, including multiple instances of the same variable in recursive calls, to have distinct locations, so using this option results in non-conforming behavior. There are also various clang bug reports open on that matter [1], where clang developers acknowledge the non-conforming behavior, and refer to disabling it with -fno-merge-all-constants. But even if this gets fixed in clang today, there are already users out there that triggered this. Thus, fix this issue by explicitly adding -fno-merge-all-constants to the kernel's Makefile to generically disable this optimization, since potentially other places in the kernel could subtly break as well. Note, there is also a flag called -fmerge-constants (not supported by clang), which is more conservative and only applies to strings and it's enabled in gcc's -O/-O2/-O3/-Os optimization levels. In gcc's code, the two flags -fmerge-{all-,}constants share the same variable internally, so when disabling it via -fno-merge-all-constants, then we really don't merge any const data (e.g. strings), and text size increases with gcc (14,927,214 -> 14,942,646 for vmlinux.o). $ gcc -fverbose-asm -O2 foo.c -S -o foo.S -> foo.S lists -fmerge-constants under options enabled $ gcc -fverbose-asm -O2 -fno-merge-all-constants foo.c -S -o foo.S -> foo.S doesn't list -fmerge-constants under options enabled $ gcc -fverbose-asm -O2 -fno-merge-all-constants -fmerge-constants foo.c -S -o foo.S -> foo.S lists -fmerge-constants under options enabled Thus, as a workaround we need to set both -fno-merge-all-constants *and* -fmerge-constants in the Makefile in order for text size to stay as is. [1] https://bugs.llvm.org/show_bug.cgi?id=18538 Reported-by: Prasad Sodagudi Signed-off-by: Daniel Borkmann Cc: Linus Torvalds Cc: Chenbo Feng Cc: Richard Smith Cc: Chandler Carruth Cc: linux-kernel@vger.kernel.org Tested-by: Prasad Sodagudi Acked-by: Alexei Starovoitov --- [ Hi Linus, feel free to take this fix directly if you want. Alternatively, we could route it via bpf tree. Thanks a lot for your feedback! ] Makefile | 9 +++++++++ 1 file changed, 9 insertions(+) diff --git a/Makefile b/Makefile index c4322de..af07bf5 100644 --- a/Makefile +++ b/Makefile @@ -826,6 +826,15 @@ KBUILD_CFLAGS += $(call cc-disable-warning, pointer-sign) # disable invalid "can't wrap" optimizations for signed / pointers KBUILD_CFLAGS += $(call cc-option,-fno-strict-overflow) +# clang sets -fmerge-all-constants by default as optimization, but this +# is non-conforming behavior for C and in fact breaks the kernel, so we +# need to disable it here generally. +KBUILD_CFLAGS += $(call cc-option,-fno-merge-all-constants) + +# for gcc -fno-merge-all-constants disables everything, but it is fine +# to have actual conforming behavior enabled. +KBUILD_CFLAGS += $(call cc-option,-fmerge-constants) + # Make sure -fstack-check isn't enabled (like gentoo apparently did) KBUILD_CFLAGS += $(call cc-option,-fno-stack-check,) -- 2.9.5