Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S266363AbUAVTi2 (ORCPT ); Thu, 22 Jan 2004 14:38:28 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S266362AbUAVThr (ORCPT ); Thu, 22 Jan 2004 14:37:47 -0500 Received: from h00a0cca1a6cf.ne.client2.attbi.com ([65.96.182.167]:24705 "EHLO h00a0cca1a6cf.ne.client2.attbi.com") by vger.kernel.org with ESMTP id S266363AbUAVThj (ORCPT ); Thu, 22 Jan 2004 14:37:39 -0500 Date: Thu, 22 Jan 2004 14:37:04 -0500 From: timothy parkinson To: john stultz Cc: hauan@cmu.edu, Linux Kernel Subject: Re: 2.6.1 "clock preempt"? Message-ID: <20040122193704.GA552@h00a0cca1a6cf.ne.client2.attbi.com> Mail-Followup-To: john stultz , hauan@cmu.edu, Linux Kernel References: <1074630968.19174.49.camel@steinar.cheme.cmu.edu> <1074633977.16374.67.camel@cog.beaverton.ibm.com> <1074697593.5650.26.camel@steinar.cheme.cmu.edu> <1074709166.16374.73.camel@cog.beaverton.ibm.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1074709166.16374.73.camel@cog.beaverton.ibm.com> User-Agent: Mutt/1.4.1i Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3473 Lines: 90 hi john, i think this sounds like the same issue that i've been seeing with 2.5/2.6 kernels for a while now. smp kernel on a dual PIII machine without preempt. after running for a while the "losing ticks" shows up in dmesg, and the system loses a lot of time. load seems to make it worse, so that just might be the trigger. i'll try that one-liner as well when i get a chance - you said that if the system still loses time, that narrows it down to hardware, yes? thanks... timothy On Wed, Jan 21, 2004 at 10:19:26AM -0800, john stultz wrote: > On Wed, 2004-01-21 at 07:06, Steinar Hauan wrote: > > On Tue, 2004-01-20 at 16:26, john stultz wrote: > > > How quickly do you see this message? Does it happen right at boot time, > > > or during load? > > > > boottime: > > Jan 19 08:59:24 localhost kernel: Linux version 2.6.1 > > (root@offa) (gcc version 3.3.2 20031218 > > (Red Hat Linux 3.3.2-5)) #2 > > SMP Sat Jan 17 18:05:09 EST 2004 > > > > (yes, i have SMP enabled even for a single cpu machine; the kernel > > will be run on a multitude of machines of which most are SMP) > > > > log messages prior to time issues: > > > > Jan 20 04:02:01 localhost anacron[6057]: Updated timestamp for > > job `cron.daily' to > > 2004-01-20 > > Jan 20 04:05:38 localhost kernel: hdb: dma_timer_expiry: > > dma status == 0x61 > > Jan 20 04:05:48 localhost kernel: hdb: DMA timeout error > > Jan 20 04:05:48 localhost kernel: hdb: dma timeout error: > > status=0xd0 {Busy } > > Jan 20 04:05:48 localhost kernel: hda: DMA disabled > > Jan 20 04:05:48 localhost kernel: hdb: DMA disabled > > Jan 20 04:05:48 localhost kernel: ide0: reset: success > > > > (hda and hdb are on the same controller; the only active disk at > > this time should be hda ... hdb should be essentially idle > > except for cron.daily scripts running around then) > > > > timing error starts here; interactive work > > > > Jan 20 07:52:40 localhost kernel: Losing too many ticks! > > Hmm. It might be that IDE PIO mode on your system blocks interrupts for > too long. > > > > You might want to try the attached patch to see if we're overreacting > > [...] > > > Also, do you see the problem when preempt is disabled > > > > testing overnight failed to reproduce the problem. note that the > > original event only occurred after about 12 hrs of testing. > > Sorry, that was vague. Did you fail to reproduce the problem using the > patch or with preempt disabled? > > > i'll install lm_sensors & smarttools and run cpuburn for a while > > to ensure this is not related to any hardware issues. > > I doubt its its a hardware fault issue, but more likely some driver not > working properly with your hardware and blocking interrupts for too > long. > > thanks > -john > > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ > > - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/