Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756641AbYKKOsQ (ORCPT ); Tue, 11 Nov 2008 09:48:16 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1756011AbYKKOr7 (ORCPT ); Tue, 11 Nov 2008 09:47:59 -0500 Received: from wa-out-1112.google.com ([209.85.146.178]:52389 "EHLO wa-out-1112.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755831AbYKKOr5 (ORCPT ); Tue, 11 Nov 2008 09:47:57 -0500 DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:cc:in-reply-to:mime-version :content-type:content-transfer-encoding:content-disposition :references; b=KJ44QRqZwfFU5ZtR4AswaygT4658PZDdV/H5WlVHZtiks/mNMJPxBQgfef409fUahG DD4hOfibvFwaawACqpIyof6Y2ss57WDIYymP5DMkFVdVM3itjsRcxZi4z6bRzdvASuc0 QICoR87no7rJxBBE2Tvl86XcGX2jgcXuzC77o= Message-ID: <19f34abd0811110647y2a00cfbfr2b219a5aa1b3ac9f@mail.gmail.com> Date: Tue, 11 Nov 2008 15:47:56 +0100 From: "Vegard Nossum" To: "Ingo Molnar" , "Rafael J. Wysocki" Subject: Re: [Bug #11989] Suspend failure on NForce4-based boards due to chanes in stop_machine Cc: "Heiko Carstens" , "Linux Kernel Mailing List" , "Kernel Testers List" , "Rusty Russell" , "Peter Zijlstra" , "Oleg Nesterov" , "Dmitry Adamushko" , "Andrew Morton" In-Reply-To: <20081111105214.GA15645@elte.hu> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <20081110120401.GA15518@osiris.boeblingen.de.ibm.com> <200811101547.21325.rjw@sisk.pl> <200811102355.42389.rjw@sisk.pl> <20081111105214.GA15645@elte.hu> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 856 Lines: 23 On Tue, Nov 11, 2008 at 11:52 AM, Ingo Molnar wrote: > [ Cc:-ed workqueue/locking/suspend-race-condition experts. ] > > Seems like the new kernel/stop_machine.c logic has a race for the test > sequence above. (Below is the bisected commit again, maybe the race is > visible via email review as well.) I try again. I think that the test for stop_machine_data in stop_cpu() should not have been moved from __stop_machine(). Because now cpu_online_map may change in-between calls to stop_cpu() (if the callback tries to online/offline CPUs), and the end result may be different. Maybe? Vegard -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/