Received: by 2002:a25:683:0:0:0:0:0 with SMTP id 125csp817339ybg; Tue, 9 Jun 2020 13:41:47 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyYGXK72ndMm/50sbKk8s9T7CGyEvXRk5BPW2AlkA651MOal6+ZOVdTIFtCkdvoHEG0KUAl X-Received: by 2002:aa7:c2c4:: with SMTP id m4mr27758426edp.299.1591735307019; Tue, 09 Jun 2020 13:41:47 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1591735307; cv=none; d=google.com; s=arc-20160816; b=z83lLrnIh2kp8QfQFRcjxSgckH88Iz0GVVrmAUQlSt5nnA4HsaXyDoaeTYX/m28RMO uPuBTmbX9C6vta3m/m4lgtmi60qyvUELUBJS8Da5QnCstv+BR0kHaDfsWvgTPa/Nk6ER x730mHHtm3mFYcRLM3NR0//EtQ/VjVKUFRZqJ+ftsc3JgXjQMjiramJTXSIF/dVEZ0uI TjS9S+yYOK2WrzjuhpF32baQ4SV+w8EOvlJ9PVO7/XiURyGmEOnyfZekLI+YaW5MNdZc U5TC2ts0WFnkmJsVWqM/2hQJmZTyOZWFSTyGdUkILFC2/yWpoeGTcO6TZpOf3bMvV4hn Ikvg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=DaWN0Z7K19+Sr+ET5NxbUJs/URCiMT1ql8VJ6ipfuvM=; b=UvTwTn/rSaHPtTIv9Lx8zYlmbNjV5Q+fqXPnpknbjlZrpllj9dqoUiwNCBzHTgeWpf sI2Dx46rF5Lc5hWFBw9l7O/B+L/EQSQmG9DOhVmIlmwnOcETeJ7HLZoKbs+fbC66ogy8 Pkxd/+Zv+HMT1ZwbDbM5ZD8mNusMXrWKtMhRS5WDpYkHgqZNhn7EX7GcuAU/T1AmI1g5 V316RHc7kecgfs6BaWAnmVOC+N9i5RRwu1TuJHdRVgLJ9qsiX6P8vZg79EBheXBmL6Nm dMJ0NxlBkeAYnWmzoy5pxdMEHsVqwfZOC2jjiyvCJBPuQ7/BpqcT8ER9s/V8r8Yqpd9i dE+g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=dYAtbV2m; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id ds2si13609959ejc.114.2020.06.09.13.41.24; Tue, 09 Jun 2020 13:41:47 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=dYAtbV2m; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732383AbgFIUVh (ORCPT + 99 others); Tue, 9 Jun 2020 16:21:37 -0400 Received: from mail.kernel.org ([198.145.29.99]:43810 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729875AbgFIUVg (ORCPT ); Tue, 9 Jun 2020 16:21:36 -0400 Received: from sol.localdomain (c-107-3-166-239.hsd1.ca.comcast.net [107.3.166.239]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id D438C206C3; Tue, 9 Jun 2020 20:21:35 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1591734096; bh=7Omx5Ufj4IP/9aozcsbfZbE20tRiHiqGloCyLqZg/hw=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=dYAtbV2mQ3dzCyPuZgI39zrAuS68KGdzy9NnCVxicEcqIKlMlUt/Gh1Li2bLJoC/X wmcpVAjG9NaP05c9+LbepXdvPfk10YLZqSzMBdxtMpEMlq7VDGLSep4fbhrpUSg5k3 yiU4rYaZ2HO5340DOwknjkojRudCqsn3kSgDpf8Y= Date: Tue, 9 Jun 2020 13:21:34 -0700 From: Eric Biggers To: Guenter Roeck Cc: Peter Zijlstra , tglx@linutronix.de, frederic@kernel.org, linux-kernel@vger.kernel.org, x86@kernel.org, cai@lca.pw, mgorman@techsingularity.net Subject: Re: [RFC][PATCH 7/7] sched: Replace rq::wake_list Message-ID: <20200609202134.GA1105@sol.localdomain> References: <20200526161057.531933155@infradead.org> <20200526161908.129371594@infradead.org> <20200604141837.GA179816@roeck-us.net> <20200605002433.GA148196@sol.localdomain> <20200605074154.GB2750@hirez.programming.kicks-ass.net> <20200605161532.GD1373@sol.localdomain> <53318971-561c-b445-0408-530b3d3ba44e@roeck-us.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <53318971-561c-b445-0408-530b3d3ba44e@roeck-us.net> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, Jun 06, 2020 at 04:13:33PM -0700, Guenter Roeck wrote: > On 6/5/20 9:15 AM, Eric Biggers wrote: > > On Fri, Jun 05, 2020 at 09:41:54AM +0200, Peter Zijlstra wrote: > >> On Thu, Jun 04, 2020 at 05:24:33PM -0700, Eric Biggers wrote: > >>> On Thu, Jun 04, 2020 at 07:18:37AM -0700, Guenter Roeck wrote: > >>>> On Tue, May 26, 2020 at 06:11:04PM +0200, Peter Zijlstra wrote: > >>>>> The recent commit: 90b5363acd47 ("sched: Clean up scheduler_ipi()") > >>>>> got smp_call_function_single_async() subtly wrong. Even though it will > >>>>> return -EBUSY when trying to re-use a csd, that condition is not > >>>>> atomic and still requires external serialization. > >>>>> > >>>>> The change in ttwu_queue_remote() got this wrong. > >>>>> > >>>>> While on first reading ttwu_queue_remote() has an atomic test-and-set > >>>>> that appears to serialize the use, the matching 'release' is not in > >>>>> the right place to actually guarantee this serialization. > >>>>> > >>>>> The actual race is vs the sched_ttwu_pending() call in the idle loop; > >>>>> that can run the wakeup-list without consuming the CSD. > >>>>> > >>>>> Instead of trying to chain the lists, merge them. > >>>>> > >>>>> Signed-off-by: Peter Zijlstra (Intel) > >>>>> --- > >>>> ... > >>>>> + /* > >>>>> + * Assert the CSD_TYPE_TTWU layout is similar enough > >>>>> + * for task_struct to be on the @call_single_queue. > >>>>> + */ > >>>>> + BUILD_BUG_ON(offsetof(struct task_struct, wake_entry_type) - offsetof(struct task_struct, wake_entry) != > >>>>> + offsetof(struct __call_single_data, flags) - offsetof(struct __call_single_data, llist)); > >>>>> + > >>>> > >>>> There is no guarantee in C that > >>>> > >>>> type1 a; > >>>> type2 b; > >>>> > >>>> in two different data structures means that offsetof(b) - offsetof(a) > >>>> is the same in both data structures unless attributes such as > >>>> __attribute__((__packed__)) are used. > >>>> > >>>> As result, this does and will cause a variety of build errors depending > >>>> on the compiler version and compile flags. > >>>> > >>>> Guenter > >>> > >>> Yep, this breaks the build for me. > >> > >> -ENOCONFIG > > > > For me, the problem seems to be randstruct. To reproduce, you can use > > (on x86_64): > > > > make defconfig > > echo CONFIG_GCC_PLUGIN_RANDSTRUCT=y >> .config > > make olddefconfig > > make kernel/smp.o > > > > I confirmed that disabling CONFIG_GCC_PLUGIN_RANDSTRUCT "fixes" the problem > in my test builds. Maybe it would make sense to mark that configuration option > for the time being as BROKEN. > Still occurring on Linus' tree. This needs to be fixed. (And not by removing support for randstruct; that's not a "fix"...) Shouldn't the kbuild test robot have caught this? - Eric