Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755242AbdLOMX0 (ORCPT ); Fri, 15 Dec 2017 07:23:26 -0500 Received: from mx2.suse.de ([195.135.220.15]:48701 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754856AbdLOMXY (ORCPT ); Fri, 15 Dec 2017 07:23:24 -0500 Subject: Re: [PATCH v5 6/7] scsi: libsas: direct call probe and destruct To: Jason Yan , martin.petersen@oracle.com, jejb@linux.vnet.ibm.com Cc: linux-scsi@vger.kernel.org, linux-kernel@vger.kernel.org, john.garry@huawei.com, zhaohongjiang@huawei.com, hare@suse.com, dan.j.williams@intel.com, jthumshirn@suse.de, Raj.Dinesh@microsemi.com, hch@lst.de, huangdaode@hisilicon.com, chenxiang66@hisilicon.com, xiyou.wangcong@gmail.com, Ewan Milne , Tomas Henzl References: <20171208094210.24887-1-yanaijie@huawei.com> <20171208094210.24887-7-yanaijie@huawei.com> From: Hannes Reinecke Message-ID: <488de1f2-b7ff-13a5-ec6b-f95d3e099fe2@suse.de> Date: Fri, 15 Dec 2017 13:23:22 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: <20171208094210.24887-7-yanaijie@huawei.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3953 Lines: 81 On 12/08/2017 10:42 AM, Jason Yan wrote: > In commit 87c8331fcf72 ("[SCSI] libsas: prevent domain rediscovery > competing with ata error handling") introduced disco mutex to prevent > rediscovery competing with ata error handling and put the whole > revalidation in the mutex. But the rphy add/remove needs to wait for the > error handling which also grabs the disco mutex. This may leads to dead > lock.So the probe and destruct event were introduce to do the rphy > add/remove asynchronously and out of the lock. > > The asynchronously processed workers makes the whole discovery process > not atomic, the other events may interrupt the process. For example, > if a loss of signal event inserted before the probe event, the > sas_deform_port() is called and the port will be deleted. > > And sas_port_delete() may run before the destruct event, but the > port-x:x is the top parent of end device or expander. This leads to > a kernel WARNING such as: > > [ 82.042979] sysfs group 'power' not found for kobject 'phy-1:0:22' > [ 82.042983] ------------[ cut here ]------------ > [ 82.042986] WARNING: CPU: 54 PID: 1714 at fs/sysfs/group.c:237 > sysfs_remove_group+0x94/0xa0 > [ 82.043059] Call trace: > [ 82.043082] [] sysfs_remove_group+0x94/0xa0 > [ 82.043085] [] dpm_sysfs_remove+0x60/0x70 > [ 82.043086] [] device_del+0x138/0x308 > [ 82.043089] [] sas_phy_delete+0x38/0x60 > [ 82.043091] [] do_sas_phy_delete+0x6c/0x80 > [ 82.043093] [] device_for_each_child+0x58/0xa0 > [ 82.043095] [] sas_remove_children+0x40/0x50 > [ 82.043100] [] sas_destruct_devices+0x64/0xa0 > [ 82.043102] [] process_one_work+0x1fc/0x4b0 > [ 82.043104] [] worker_thread+0x50/0x490 > [ 82.043105] [] kthread+0xfc/0x128 > [ 82.043107] [] ret_from_fork+0x10/0x50 > > Make probe and destruct a direct call in the disco and revalidate function, > but put them outside the lock. The whole discovery or revalidate won't > be interrupted by other events. And the DISCE_PROBE and DISCE_DESTRUCT > event are deleted as a result of the direct call. > > Introduce a new list to destruct the sas_port and put the port delete after > the destruct. This makes sure the right order of destroying the sysfs > kobject and fix the warning above. > > In sas_ex_revalidate_domain() have a loop to find all broadcasted > device, and sometimes we have a chance to find the same expander twice. > Because the sas_port will be deleted at the end of the whole revalidate > process, sas_port with the same name cannot be added before this. > Otherwise the sysfs will complain of creating duplicate filename. Since > the LLDD will send broadcast for every device change, we can only > process one expander's revalidation. > > Signed-off-by: Jason Yan > CC: John Garry > CC: Johannes Thumshirn > CC: Ewan Milne > CC: Christoph Hellwig > CC: Tomas Henzl > CC: Dan Williams > --- > drivers/scsi/libsas/sas_ata.c | 1 - > drivers/scsi/libsas/sas_discover.c | 32 ++++++++++++++++++-------------- > drivers/scsi/libsas/sas_expander.c | 8 +++----- > drivers/scsi/libsas/sas_internal.h | 1 + > drivers/scsi/libsas/sas_port.c | 3 +++ > include/scsi/libsas.h | 3 +-- > include/scsi/scsi_transport_sas.h | 1 + > 7 files changed, 27 insertions(+), 22 deletions(-) > Signed-off-by: Hannes Reinecke Cheers, Hannes -- Dr. Hannes Reinecke Teamlead Storage & Networking hare@suse.de +49 911 74053 688 SUSE LINUX GmbH, Maxfeldstr. 5, 90409 Nürnberg GF: F. Imendörffer, J. Smithard, J. Guild, D. Upmanyu, G. Norton HRB 21284 (AG Nürnberg)