Received: by 10.192.165.156 with SMTP id m28csp1182055imm; Wed, 18 Apr 2018 05:40:28 -0700 (PDT) X-Google-Smtp-Source: AIpwx495uwsTFfEL1jSP6RuwkoLPOgT0OnaKEDbALAKOVThTzcy4C8JynPKDYntVifYjFjC787d/ X-Received: by 10.99.179.6 with SMTP id i6mr1649942pgf.434.1524055228373; Wed, 18 Apr 2018 05:40:28 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1524055228; cv=none; d=google.com; s=arc-20160816; b=NrOc9G3aFLh6q8oBC2Dpcgu1MTsksOu8TAytua/UH7tyXG0o25m4WLs9RGmHJ25HKx RrczTz9HOBiG6vQ2ARlBD810azAXfPH/QLQuWMuFDj7euOnhO/GoxxJnIt2ZyumOEYQy 1f1CyI/0jisJkn0peWljKxU91IXkc7JyrpvWjHwKVpjB3KKefPwuZk+Mnc0ZoABgGp70 pTtCPyWw2C3v0Ya1BtLNeOWhMgwzurDPJQsv0fXUE+dHfTbBdEbR/b6nl4oGnW1xUXlw 5NDzWlJqqr9pgrC2jCj9etufxP2vZOSSJKyx8L1GEL/HJrH2qjCno1+F1orUJpNnYIO7 p+6w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=hkmrrSeLQuDM44/KdtvQnovwt6G5vapXOTPSB7y8QTM=; b=xhiRxkHQV0NCZFjKHrSZNR82JBQYww/S4V5CS24F/E//CuYVv4zaqAJT8odrHq+ZKT xG9RTCnX5sYxife4csAybCSdJ7gM9XwpMd9iwFQOm+vkFDkxvJCikoiwTRYqxJ9KvlS7 BxsyLesmEDsdnnVMPNW723ibYLOVZx9eJnFz64J0U4pxU4HqrtzAY34YdSi0wxzKp9wM pjPtNHsbxbc26+BLkyTURbpxXUPXRrNzwDLl74Kk14CMxZ485BH2u2XivsMQLJHM82OS nhQ2PVOIb25nIds7ZtiuSxhh/rBu7McY0uizfx1saG5BjCmXD/p054ho3MDyc0fYYBij 94pA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d16-v6si1202222plr.141.2018.04.18.05.40.13; Wed, 18 Apr 2018 05:40:28 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753596AbeDRMi6 (ORCPT + 99 others); Wed, 18 Apr 2018 08:38:58 -0400 Received: from www381.your-server.de ([78.46.137.84]:58010 "EHLO www381.your-server.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752617AbeDRMiz (ORCPT ); Wed, 18 Apr 2018 08:38:55 -0400 Received: from [78.46.172.2] (helo=sslproxy05.your-server.de) by www381.your-server.de with esmtpsa (TLSv1.2:DHE-RSA-AES256-GCM-SHA384:256) (Exim 4.85_2) (envelope-from ) id 1f8mMH-0008Ef-LM; Wed, 18 Apr 2018 14:38:45 +0200 Received: from [2003:86:2c44:e800:8200:bff:fe9b:6612] by sslproxy05.your-server.de with esmtpsa (TLSv1.2:ECDHE-RSA-AES256-GCM-SHA384:256) (Exim 4.89) (envelope-from ) id 1f8mLA-000NEf-AJ; Wed, 18 Apr 2018 14:37:36 +0200 Subject: Re: [PATCH v3 06/11] iio: inkern: add module put/get on iio dev module when requesting channels To: Jonathan Cameron , Dmitry Torokhov Cc: Eugen Hristev , Jonathan Cameron , ludovic.desroches@microchip.com, alexandre.belloni@bootlin.com, linux-arm-kernel@lists.infradead.org, devicetree@vger.kernel.org, linux-kernel@vger.kernel.org, linux-iio@vger.kernel.org, linux-input@vger.kernel.org, nicolas.ferre@microchip.com, robh@kernel.org References: <1523350677-27106-1-git-send-email-eugen.hristev@microchip.com> <1523350677-27106-7-git-send-email-eugen.hristev@microchip.com> <20180415203321.1aaaf91e@archlinux> <20180416235821.GF77055@dtor-ws> <67771f4c-d30c-4a28-2a9b-d5585186d60a@microchip.com> <20180417191906.GB67621@dtor-ws> <20180418103548.000078b7@huawei.com> From: Lars-Peter Clausen Message-ID: <79d51d4e-eea7-7295-7e32-26149d6aeadd@metafoo.de> Date: Wed, 18 Apr 2018 14:37:01 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 MIME-Version: 1.0 In-Reply-To: <20180418103548.000078b7@huawei.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit X-Authenticated-Sender: lars@metafoo.de X-Virus-Scanned: Clear (ClamAV 0.99.3/24490/Wed Apr 18 06:22:36 2018) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 04/18/2018 11:35 AM, Jonathan Cameron wrote: > On Tue, 17 Apr 2018 12:19:06 -0700 > Dmitry Torokhov wrote: > >> Hi Eugen, >> >> On Tue, Apr 17, 2018 at 10:39:24AM +0300, Eugen Hristev wrote: >>> >>> >>> On 17.04.2018 02:58, Dmitry Torokhov wrote: >>>> On Sun, Apr 15, 2018 at 08:33:21PM +0100, Jonathan Cameron wrote: >>>>> On Tue, 10 Apr 2018 11:57:52 +0300 >>>>> Eugen Hristev wrote: >>>>> >>>>>> When requesting channels for a particular consumer device, >>>>>> besides requesting the device (incrementing the reference counter), also >>>>>> do it for the driver module of the iio dev. This will avoid the situation >>>>>> where the producer IIO device can be removed and the consumer is still >>>>>> present in the kernel. >>>>>> >>>>>> Signed-off-by: Eugen Hristev >>>>>> --- >>>>>> drivers/iio/inkern.c | 8 +++++++- >>>>>> 1 file changed, 7 insertions(+), 1 deletion(-) >>>>>> >>>>>> diff --git a/drivers/iio/inkern.c b/drivers/iio/inkern.c >>>>>> index ec98790..68d9b87 100644 >>>>>> --- a/drivers/iio/inkern.c >>>>>> +++ b/drivers/iio/inkern.c >>>>>> @@ -11,6 +11,7 @@ >>>>>> #include >>>>>> #include >>>>>> #include >>>>>> +#include >>>>>> #include >>>>>> #include "iio_core.h" >>>>>> @@ -152,6 +153,7 @@ static int __of_iio_channel_get(struct iio_channel *channel, >>>>>> if (index < 0) >>>>>> goto err_put; >>>>>> channel->channel = &indio_dev->channels[index]; >>>>>> + try_module_get(channel->indio_dev->driver_module); >>>>> >>>>> And if it fails? (the module we are trying to get is going away...) >>>>> >>>>> We should try and handle it I think. Be it by just erroring out of here. >>>> >>>> Even more, this has nothing to do with modules. A device can go away for >>>> any number of reasons (we unbind it manually via sysfs, we pull the USB >>>> plug from the host in case it is USB-connected device, we unload I2C >>>> adapter for the bus device resides on, we kick underlying PCI device) >>>> and we should be able to handle this in some fashion. Handling errors >>>> from reads and ignoring garbage is one of methods. >>>> >>>> FWIW this is a NACK from me. >>>> >>>> Thanks. >>> Hello, >>> >>> This patch is actually a "best effort attempt" for the consumer driver >>> (touch driver) to get a reference to the producer of the data (the IIO >>> device), when it requests the specific channels. >>> As of this moment, there is no attempt whatsoever for the consumer to have a >>> reference on the producer driver. Thus, the producer can be removed at any >>> time, and the consumer will fail ungraciously. >> >> This is the root of the issue. The consumer should be prepared to handle >> errors from producer. >> >>> I can change the perspective from "best effort" to "mandatory" to get a >>> reference to the producer, or you wish to stop trying to get any reference >>> at all (remove this patch completely) ? >> >> You should take reference to the device itself (if it is not taken >> already), so it does not disappear completely and you can continue using >> IIO API to access it, and IIO API should be prepared to deal with "dead" >> devices, but as I pointed in my other email, trying to pin the driver >> is quite pointless as there are myriad other ways of device stopping >> working besides module unloading. >> >> In any case, I think this problem is outside of the scope of this >> patchset that adds a generic resistive touchscreen, so if you want to >> continue working on this I'd recommend moving it into a separate series. >> >> Thanks. >> > Agreed, this one has come up a number of times before. Quite a lot of > work got done by (IIRC) Lars Peter Clausen to stabilize things in various > unexpected 'going away' events. Of course there may be paths we have > added since that (it was years ago) that can cause trouble... > > Anyhow, separate issue as Dmitry says, let's deal with it separately. We do properly get a ref to the device. Not getting a ref to the module is on purpose. Cause as Dmitry said it does nothing the device can disappear at any point either way. I second the NACK.