Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753026AbZG2K3N (ORCPT ); Wed, 29 Jul 2009 06:29:13 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751873AbZG2K3M (ORCPT ); Wed, 29 Jul 2009 06:29:12 -0400 Received: from gw1.cosmosbay.com ([212.99.114.194]:33862 "EHLO gw1.cosmosbay.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751052AbZG2K3M (ORCPT ); Wed, 29 Jul 2009 06:29:12 -0400 Message-ID: <4A7023BC.6000109@cosmosbay.com> Date: Wed, 29 Jul 2009 12:26:04 +0200 From: Eric Dumazet User-Agent: Thunderbird 2.0.0.22 (Windows/20090605) MIME-Version: 1.0 To: Jens Rosenboom CC: Peter Zijlstra , Sonny Rao , Linux Kernel Mailing List , Ingo Molnar , Thomas Gleixner Subject: Re: futexes: Still infinite loop in get_futex_key() in 2.6.31-rc4 References: <1248681637.7279.12.camel@fnki-nb00130> <1248694266.6987.1594.camel@twins> <1248697004.7279.31.camel@fnki-nb00130> <1248697409.6987.1617.camel@twins> <1248698755.7279.47.camel@fnki-nb00130> <1248701812.6987.1637.camel@twins> <1248848568.6757.13.camel@fnki-nb00130> <1248861443.6757.20.camel@fnki-nb00130> In-Reply-To: <1248861443.6757.20.camel@fnki-nb00130> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-1.6 (gw1.cosmosbay.com [0.0.0.0]); Wed, 29 Jul 2009 12:26:04 +0200 (CEST) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1061 Lines: 26 Jens Rosenboom a ?crit : > On Wed, 2009-07-29 at 08:22 +0200, Jens Rosenboom wrote: >> On Mon, 2009-07-27 at 15:36 +0200, Peter Zijlstra wrote: >> [...] >>> Bugger.. how easy it is to reproduce? >> Okay, my colleague found the right combination of scripts, take the two >> attached, run them both a couple of times in parallel for some hours, >> and get a stuck ps. This happens both on an old 2.6.29.1 I happened to >> still have on one machine as with 2.6.31-rc4. Both of them dual-core >> Opterons as the original one. If you want further tracebacks or other >> information, let me know. > > Forget about null.pl even, just run pees.pl twice and a top to watch it, > has worked for me within less than an hour several times now. > Ah that makes sense now... maybe execve() forgets to clear clear_child_tid -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/