Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932272Ab0LSCUm (ORCPT ); Sat, 18 Dec 2010 21:20:42 -0500 Received: from virgo.iok.hu ([212.40.97.103]:51627 "EHLO virgo.iok.hu" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932251Ab0LSCUk (ORCPT ); Sat, 18 Dec 2010 21:20:40 -0500 X-Greylist: delayed 1043 seconds by postgrey-1.27 at vger.kernel.org; Sat, 18 Dec 2010 21:20:40 EST Date: Sun, 19 Dec 2010 03:03:13 +0100 From: Miklos Vajna To: Peter Zijlstra Cc: shenghui , kernel-janitors@vger.kernel.org, linux-kernel@vger.kernel.org, mingo@elte.hu, Greg KH Subject: Re: [PATCH] avoid race condition in pick_next_task_fair in kernel/sched_fair.c Message-ID: <20101219020313.GJ31750@genesis.frugalware.org> References: <1277808215.1868.5.camel@laptop> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="XbHSybK3LHOYQtWI" Content-Disposition: inline In-Reply-To: <1277808215.1868.5.camel@laptop> User-Agent: Mutt/1.5.20 (2009-06-14) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1967 Lines: 65 --XbHSybK3LHOYQtWI Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Tue, Jun 29, 2010 at 12:43:35PM +0200, Peter Zijlstra wrote: > On Tue, 2010-06-29 at 15:10 +0800, shenghui wrote: > > I think some lock on the metadata can fix this issue, but we may > > change plenty of code to add support for lock. I think the easist > > way is just substacting nr_running before dequing tasks.=20 >=20 > But all that is fully serialized by the rq->lock.. so I'm really not > seeing how this can happen. Hi, Here is a panic I got today: http://frugalware.org/~vmiklos/pics/bug/2.6.37-rc6.png More details: I get this sometimes on boot or shutdown when testing systemd. I did not get it with sysvinit, so I guess it may be related to systemd's heavy cgroups usage, but I'm not sure. Sadly it isn't 100% reproducible but I usually hit it at least once a day. The config is here: http://frugalware.org/~vmiklos/logs/2.6.37-rc6.config (I just did a yes "" | make config to update it to 2.6.37-rc6.) I got something similar with 2.6.36.1 as well: http://frugalware.org/~vmiklos/pics/bug/2.6.36.1.png Ah, and this is on i686 in VMware - though given that I never had this problem with systemd, I guess it won't be an emulator bug. :) I'm not familiar with the sched code, is it possible that this is related? Thanks, Miklos --XbHSybK3LHOYQtWI Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) iEYEARECAAYFAk0NZ+EACgkQe81tAgORUJZIhwCgo4frOYAEfEX8RWWxIUrKFtZK XTEAniwYuFB7UsKLSSgtKUAaqJkRzqi0 =XIDa -----END PGP SIGNATURE----- --XbHSybK3LHOYQtWI-- -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/