Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S965411AbXA3H2B (ORCPT ); Tue, 30 Jan 2007 02:28:01 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S965413AbXA3H2A (ORCPT ); Tue, 30 Jan 2007 02:28:00 -0500 Received: from smtp.osdl.org ([65.172.181.24]:46371 "EHLO smtp.osdl.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S965411AbXA3H2A (ORCPT ); Tue, 30 Jan 2007 02:28:00 -0500 Date: Mon, 29 Jan 2007 23:27:27 -0800 From: Andrew Morton To: Jiri Slaby Cc: linux-kernel@vger.kernel.org, Ingo Molnar , Miles Lane , Anton Altaparmakov Subject: Re: 2.6.20-rc6-mm1 Message-Id: <20070129232727.96e580ac.akpm@osdl.org> In-Reply-To: <45BC7A26.1080303@gmail.com> References: <20070127234928.64d8e437.akpm@osdl.org> <45BC7A26.1080303@gmail.com> X-Mailer: Sylpheed version 2.2.7 (GTK+ 2.8.17; x86_64-unknown-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2181 Lines: 59 On Sun, 28 Jan 2007 11:25:42 +0100 Jiri Slaby wrote: > Andrew Morton napsal(a): > > Temporarily at > > > > http://userweb.kernel.org/~akpm/2.6.20-rc6-mm1/ > > I'm still seeing this during bootup: > BUG: at /home/l/latest/xxx/arch/i386/mm/highmem.c:52 kmap_atomic() > [] show_trace_log_lvl+0x1a/0x30 > [] show_trace+0x12/0x14 > [] dump_stack+0x16/0x18 > [] kmap_atomic+0x16c/0x20e > [] ntfs_end_buffer_async_read+0x18e/0x2ed > [] end_bio_bh_io_sync+0x26/0x3f > [] bio_endio+0x37/0x62 > [] __end_that_request_first+0x224/0x444 > [] end_that_request_chunk+0x8/0xa > [] scsi_end_request+0x1f/0xc7 > [] scsi_io_completion+0x7b/0x33a > [] sd_rw_intr+0x23/0x1ab > [] scsi_finish_command+0x42/0x47 > [] scsi_softirq_done+0x64/0xcf > [] blk_done_softirq+0x54/0x62 > [] __do_softirq+0x75/0xde > [] do_softirq+0x3b/0x3d > [] irq_exit+0x3b/0x3d > [] do_IRQ+0x51/0x8d > [] common_interrupt+0x23/0x28 > [] cpu_idle+0x80/0xc3 > [] rest_init+0x23/0x36 > [] start_kernel+0x3a5/0x43c > [<00000000>] 0x0 > ======================= > > I.e. KM_BIO_SRC_IRQ through softirq path. > argh. ntfs_end_buffer_async_read() doesn't know whether it will be called from hardirq or from softirq context: it depends upon the underlying driver. In this case, if the CPU running ntfs_end_buffer_async_read() is interrupted by IO completion against a different disk controller and that completion handler uses KM_BIO_SRC_IRQ (as it is allowed to do), it will trash ntfs_end_buffer_async_read()'s atomic kmap and unpleasing things will ensue. I guess a suitable fix here is to protect that kmap with local_irq_save/restore. I wonder where else we have that bug? - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/