Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S261243AbVDSBJQ (ORCPT ); Mon, 18 Apr 2005 21:09:16 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S261244AbVDSBJP (ORCPT ); Mon, 18 Apr 2005 21:09:15 -0400 Received: from gate.crashing.org ([63.228.1.57]:13031 "EHLO gate.crashing.org") by vger.kernel.org with ESMTP id S261243AbVDSBJK (ORCPT ); Mon, 18 Apr 2005 21:09:10 -0400 Subject: Re: PATCH [PPC64]: dead processes never reaped From: Benjamin Herrenschmidt To: Linas Vepstas Cc: Paul Mackerras , Anton Blanchard , Andrew Morton , linuxppc64-dev , Linux Kernel list In-Reply-To: <20050418193833.GW15596@austin.ibm.com> References: <20050418193833.GW15596@austin.ibm.com> Content-Type: text/plain Date: Tue, 19 Apr 2005 11:07:01 +1000 Message-Id: <1113872821.5516.330.camel@gaston> Mime-Version: 1.0 X-Mailer: Evolution 2.0.4 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1864 Lines: 44 On Mon, 2005-04-18 at 14:38 -0500, Linas Vepstas wrote: > > Hi, > > The patch below appears to fix a problem where a number of dead processes > linger on the system. On a highly loaded system, dozens of processes > were found stuck in do_exit(), calling thier very last schedule(), and > then being lost forever. > > Processes that are PF_DEAD are cleaned up *after* the context switch, > in a routine called finish_task_switch(task_t *prev). The "prev" gets > the value returned by _switch() in entry.S, but this value comes from > > __switch_to (struct task_struct *prev, > struct task_struct *new) > { > old_thread = ¤t->thread; ///XXX shouldn't this be prev, not current? > last = _switch(old_thread, new_thread); > return last; > } > > The way I see it, "prev" and "current" are almost always going to be > pointing at the same thing; however, if a "need resched" happens, > or there's a pre-emept or some-such, then prev and current won't be > the same; in which case, finish_task_switch() will end up cleaning > up the old current, instead of prev. This will result in dead processes > hanging around, which will never be scheduled again, and will never > get a chance to have put_task_struct() called on them. Ok, thinking moer about this ... that will need maybe some help from Ingo so I fully understand where schedule's are allowed ... We are basically in the middle of the scheduler here, so I wonder how much of the scheduler itself can be preempted or so ... Basically, under which circumstances can prev and current be different ? Ben. - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/