Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933760AbXFFNM0 (ORCPT ); Wed, 6 Jun 2007 09:12:26 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1757958AbXFFNMT (ORCPT ); Wed, 6 Jun 2007 09:12:19 -0400 Received: from mx2.compro.net ([216.54.166.4]:28439 "EHLO mx2.compro.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755175AbXFFNMT (ORCPT ); Wed, 6 Jun 2007 09:12:19 -0400 X-IronPort-AV: E=Sophos;i="4.16,389,1175486400"; d="scan'208";a="418744" Message-ID: <4666B2A4.7090603@compro.net> Date: Wed, 06 Jun 2007 09:12:04 -0400 From: Mark Hounschell Reply-To: markh@compro.net Organization: Compro Computer Svcs. User-Agent: Thunderbird 1.5.0.10 (X11/20060911) MIME-Version: 1.0 To: linux-kernel@vger.kernel.org CC: markh@compro.net, Oleg Nesterov , Mark Hounschell , Andrew Morton , Ingo Molnar Subject: Re: floppy.c soft lockup References: <20070531192256.GA88@tv-sign.ru> <465F2D96.9060502@compro.net> <20070601110058.GA83@tv-sign.ru> <466028DB.3060509@compro.net> <20070601151605.GA108@tv-sign.ru> <4660534E.6050903@cfl.rr.com> <20070601183642.GA92@tv-sign.ru> <466078FF.2080508@cfl.rr.com> <20070602123030.GA719@tv-sign.ru> <4661D698.5040009@cfl.rr.com> <20070603081417.GA81@tv-sign.ru> <46641B16.9090701@compro.net> In-Reply-To: <46641B16.9090701@compro.net> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1803 Lines: 44 Mark Hounschell wrote: > Oleg Nesterov wrote: >> On 06/02, Mark Hounschell wrote: >>> Jun 2 16:36:11 harley kernel: ERR!! events/1 flush hang: c201dbc0 >>> c201dbc0 10012 10012 >>> Jun 2 16:36:11 harley kernel: CURR: 7974 7974 vrsx 93 26 >>> Jun 2 16:36:11 harley kernel: wq_barrier_func+0x0/0x8 >>> Jun 2 16:36:11 harley kernel: vmstat_update+0x0/0x24 >>> Jun 2 16:36:11 harley kernel: ---- >>> Jun 2 16:36:11 harley kernel: cache_reap+0x0/0xf4 >> As expected. >> >> Note that ->nivcsw/->nvcsw doesn't change. There is no "spare time" >> on CPU 1, "vrsx" monopolizes CPU. events/1->cache_reap() was preempted >> by vrsx, it had no chance to run since then. Note that jobs == 7974 >> doesn't change too. I forgot to print cwq->thread->state, but it should >> be TASK_RUNNING. It would not be possible to kill vrsx if cache_reap() >> stalled. >> >> I don't think this is a kernel problem, vrsx breaks flush_workqueue(). >> Ingo can answer authoritatively, but I think SCHED_RR/SCHED_FIFO were >> not designed to be 100% cpu-bound. >> >> That said, I think it makes sense to get rid of flush_scheduled_work() >> in floppy.c. >> > > Oleg, thanks for your time in diagnosing this. > > As far as a 100% CPU bound task being a valid thing to do, it has been > done for many years on SMP machines. Any kernel limitation on this > surely must be considered a bug? > Could someone authoritatively comment on this? Is a SCHED_RR/SCHED_FIFO 100% Cpu bound process supported in an SMP env on Linux? (vanilla or -rt) Thanks and Regards Mark - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/