Received: by 2002:ac0:950c:0:0:0:0:0 with SMTP id f12csp1825295imc; Tue, 12 Mar 2019 01:08:42 -0700 (PDT) X-Google-Smtp-Source: APXvYqymrtuyrLTwq9+2KUgbFi8YAKHzhMTMk3a0Xr25OS3/vye3pdb6omdur2T1GL2i8EM2fn2T X-Received: by 2002:a17:902:2be8:: with SMTP id l95mr38735160plb.270.1552378121976; Tue, 12 Mar 2019 01:08:41 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1552378121; cv=none; d=google.com; s=arc-20160816; b=COCzURdKlz9HQoBobW8TDibHyMf5h1YtGfOaIzjn77qsELJbrZ9lJnjY3GyxEdH9qQ EY6Kk575LymMXCxpr323nP9RQlNKxP98PoWyYAB2rgLIIvkQyeJOCkbYVMpiql7wwxWI pvGW+RS8zBETKDVFR9HFDa8dfxBivFlgHBLK18SD+Lk3e1qedv8rY1AZymWfcnHLcjJN 6m4KlZz+GcEEmKnt0P0iLEYvkgkP6hBFgcHg9NWhK3b2oLv4RMfum0EDzA7G+cDFjFcN 7SYB26eJQnHhfzYBfjepqT7mWguDFBI1m+XzT7MzWKcja5J0vGBlIfmaXzt0Lhl74KnJ 8MCw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=Jq08fGFB6OGyp0Iud0Dqe/0oVO4Yhrwtrq0ZA+ojEv4=; b=e2+MqGtSMiTzXp6y9IWj9Dt+9+U9rnFCuSBCIFoOnQv0dja6arXp0zivH1uH2jB4IA EC2U0HEkqcb0DxLEqYOzAuO/5hPGagV1agY/HuP5p9+kjgeqJoQKzrMCXLqjay3GJa8c au6lYlan3S4DgReCFUMKFocrhmv+QuG0boWypSV1p9pSEO1dQdGHHfC2uf3oe3dzULC2 j4haq4Bn0ijI/tAS+fUFrrarRM9C5GjY+0O2zSnsd8cq1rhT1pRUGJyuQ4qru5f+iuib v34DVjas67gNCSscZAK+VWnNUAgzJPAkPllahtFgK80HuY+YgxP2uLbeengiItnBdSLD AYPA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t3si7384009plq.430.2019.03.12.01.08.26; Tue, 12 Mar 2019 01:08:41 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727480AbfCLIHu (ORCPT + 99 others); Tue, 12 Mar 2019 04:07:50 -0400 Received: from mx2.suse.de ([195.135.220.15]:42968 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725906AbfCLIHt (ORCPT ); Tue, 12 Mar 2019 04:07:49 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id A97BEB609; Tue, 12 Mar 2019 08:07:47 +0000 (UTC) Date: Tue, 12 Mar 2019 09:07:46 +0100 From: Michal Hocko To: Pi-Hsun Shih Cc: Andrew Morton , Matthew Wilcox , Johannes Weiner , Vlastimil Babka , Minchan Kim , Omar Sandoval , Huang Ying , Tejun Heo , Wei Yang , open list Subject: Re: [PATCH] mm/swap: Avoid undefined behavior in __swapoffset Message-ID: <20190312080746.GF5721@dhcp22.suse.cz> References: <20190307094653.119425-1-pihsun@chromium.org> <20190307122313.GA30234@dhcp22.suse.cz> <20190307132353.GB30234@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 12-03-19 15:02:38, Pi-Hsun Shih wrote: > On Thu, Mar 7, 2019 at 9:23 PM Michal Hocko wrote: > > > > On Thu 07-03-19 20:47:52, Pi-Hsun Shih wrote: > > > On Thu, Mar 7, 2019 at 8:23 PM Michal Hocko wrote: > > > > > > > > On Thu 07-03-19 17:46:50, Pi-Hsun Shih wrote: > > > > > Use offsetof to calculate offset of a field to avoid UBSAN warning like: > > > > > > > > > > =================================================================== > > > > > UBSAN: Undefined behaviour in mm/swapfile.c:3010:38 > > > > > member access within null pointer of type 'union swap_header' > > > > > CPU: 6 PID: 1833 Comm: swapon Tainted: G S 4.19.23 #43 > > > > > Call trace: > > > > > dump_backtrace+0x0/0x194 > > > > > show_stack+0x20/0x2c > > > > > __dump_stack+0x20/0x28 > > > > > dump_stack+0x70/0x94 > > > > > ubsan_epilogue+0x14/0x44 > > > > > ubsan_type_mismatch_common+0xf4/0xfc > > > > > __ubsan_handle_type_mismatch_v1+0x34/0x54 > > > > > __se_sys_swapon+0x654/0x1084 > > > > > __arm64_sys_swapon+0x1c/0x24 > > > > > el0_svc_common+0xa8/0x150 > > > > > el0_svc_compat_handler+0x2c/0x38 > > > > > el0_svc_compat+0x8/0x18 > > > > > ================================================================== > > > > > > > > Could you be more specific about what exactly is undefined here and > > > > why offsetof is any better. AFAIR it uses the same construct unless a > > > > compiler defines a built in. > > > > > > > > I do not object the change itself because it is cleaner to use the > > > > existing helper but I am wondering why this is fixing ubsan. Is ubsan > > > > defining the compiler variant and consider it safe? > > > > > > > > > > The undefined behavior is from trying to accessing a member of NULL, > > > even not using it value but only use the address. > > > > Hmm, we've been using this trick for ages and I do not remember any > > compiler to complain as there is no real access. I am not sure what the > > C standard has to tell about that but I presume reasonable compilers > > will not abuse the UB here. > > > > Some more testing shows that GCC optimize the > ((size_t)&((type*)0)->member) to a constant in the result binary, and > never emit any UBSAN checks on the statement. > Clang doesn't optimize it to a constant in -O0, optimize it to a > constant in -O1 or above, and always emit the > __ubsan_handle_type_mismatch check when "-fsanitize=undefined" is > given. > So this UBSAN warning only happens when kernel is compiled by clang, not GCC. > > From what I've found, it's a UB from C standard view point > (https://software.intel.com/en-us/blogs/2015/04/20/null-pointer-dereferencing-causes-undefined-behavior), > but I agree that probably no reasonable compilers would abuse the UB > here. I really do not want to go and lawyering about the standard here but getting an address of an offset based on NULL ptr is not really dereferencing of a NULL ptr. At least this was not the case for ages and no compiler can afford to change it because there is quite a lot of userspace to rely on this construct. But as I've said using offseoff is nicer so I completely support a patch that get's read of a custom redefinition of it or open code directly. But calling it an UB is a bit of stretch. -- Michal Hocko SUSE Labs