Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756960Ab3H2UCG (ORCPT ); Thu, 29 Aug 2013 16:02:06 -0400 Received: from mx1.redhat.com ([209.132.183.28]:42755 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753771Ab3H2UCE (ORCPT ); Thu, 29 Aug 2013 16:02:04 -0400 Date: Thu, 29 Aug 2013 16:01:57 -0400 From: Vivek Goyal To: Tomoki Sekiyama Cc: "linux-kernel@vger.kernel.org" , "axboe@kernel.dk" , Seiji Aguchi , "majianpeng@gmail.com" , Tejun Heo Subject: Re: [PATCH] elevator: Fix a race in elevator switching and md device initialization Message-ID: <20130829200157.GC8697@redhat.com> References: <20130829184304.GC6171@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3734 Lines: 93 On Thu, Aug 29, 2013 at 07:29:07PM +0000, Tomoki Sekiyama wrote: > On 8/29/13 14:43 , "Vivek Goyal" wrote: > >On Thu, Aug 29, 2013 at 02:33:10PM -0400, Vivek Goyal wrote: > >> On Mon, Aug 26, 2013 at 09:45:15AM -0400, Tomoki Sekiyama wrote: > >> > The soft lockup below happes at the boot time of the system using dm > >> > multipath and automated elevator switching udev rules. > >> > > >> > [ 356.127001] BUG: soft lockup - CPU#3 stuck for 22s! [sh:483] > >> > [ 356.127001] RIP: 0010:[] [] > >>lock_timer_base.isra.35+0x1d/0x50 > >> > ... > >> > [ 356.127001] Call Trace: > >> > [ 356.127001] [] try_to_del_timer_sync+0x20/0x70 > >> > [ 356.127001] [] ? > >>kmem_cache_alloc_node_trace+0x20a/0x230 > >> > [ 356.127001] [] del_timer_sync+0x52/0x60 > >> > [ 356.127001] [] cfq_exit_queue+0x32/0xf0 > >> > [ 356.127001] [] elevator_exit+0x2f/0x50 > >> > [ 356.127001] [] elevator_change+0xf1/0x1c0 > >> > [ 356.127001] [] elv_iosched_store+0x20/0x50 > >> > [ 356.127001] [] queue_attr_store+0x59/0xb0 > >> > [ 356.127001] [] sysfs_write_file+0xc6/0x140 > >> > [ 356.127001] [] vfs_write+0xbd/0x1e0 > >> > [ 356.127001] [] SyS_write+0x49/0xa0 > >> > [ 356.127001] [] system_call_fastpath+0x16/0x1b > >> > > >> > >> Tokomi, > >> > >> As you noticed, there is a fedora bug open with similar signature. May > >> be this patch will fix that issue also. > >> > >> https://bugzilla.redhat.com/show_bug.cgi?id=902012 > >> > >> > >> > This is caused by a race between md device initialization and sysfs > >>knob > >> > to switch the scheduler. > >> > > >> > * multipathd: > >> > SyS_ioctl -> do_vfs_ioctl -> dm_ctl_ioctl -> ctl_ioctl -> table_load > >> > -> dm_setup_md_queue -> blk_init_allocated_queue -> elevator_init: > >> > > >> > q->elevator = elevator_alloc(q, e); // not yet initialized > >> > > >> > * sh -c 'echo deadline > /sys/$DEVPATH/queue/scheduler' > >> > SyS_write -> vfs_write -> sysfs_write_file -> queue_attr_store > >> > ( mutex_lock(&q->sysfs_lock) here. ) > >> > -> elv_iosched_store -> elevator_change: > >> > > >> > elevator_exit(old); // try to de-init uninitialized elevator and > >>hang up > >> > > > > >If problem in this case is that we are trying to exit() the elevator > >which has not been properly initialized, then we should not attach > >the elevator to the queue yet. > > > >In cfq_init_queue(), can we move following code towards the end of > >function. > > > > spin_lock_irq(q->queue_lock); > > q->elevator = eq; > > spin_unlock_irq(q->queue_lock); > > > >So till elevator is initialized, we will not attach it to queue and > >elevator_switch() will return as it will not find a valid elevator > >on the queue. > > > > > >elevator_change() { > > if (!q->elevator) > > return -ENXIO; > >} > > > >Thanks > >Vivek > > I think it also works, though I prefer introducing explicit locking, > as you said, so that code won't break again in some future. I agree. Providing explicit locking and making sure only one elevator can be initializing at a time on a queue and others wait till that operation is complete, will make up the code more readable and less bug suspecible. Thanks Vivek -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/