Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp3350714ybb; Mon, 6 Apr 2020 07:18:27 -0700 (PDT) X-Google-Smtp-Source: APiQypK1UZ575YgCglSoxmsN82wkCsb5eiteI+RjIfzmHdGZDwpdC0JrnaXGl/u9A+NIm7ZpyCXg X-Received: by 2002:a4a:1ec3:: with SMTP id 186mr17095790ooq.66.1586182707469; Mon, 06 Apr 2020 07:18:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1586182707; cv=none; d=google.com; s=arc-20160816; b=A2bbJBTR56yOiR/GyP08E9eSti/qrgyxa/HISeWkJak1dyFkgz78FgxEM9iXMeAd61 oYSUJEYAcIL6AQ9DQMdpUzyyXjRHLHByig494S1aOGOvtfYdgQMYAo7bs3P/VASuxUfR cxtjPDtmSpb78vonemH7c8FS6/uyuw0LE0Ud4tFw6Jr4a9BEQcl00Ftxqb+PC+242vUw BNHzFAYVHfgwOc2EVMKhHnw0oBChobrexd2/vFPurMlmhQKtN4xWP/PyIDqWezfruhNm U8klUPS3wRGxFso3ERG2EL4OsLnr7ndlTPIAAbTeEvOECAZmlf6cKyF64nVWV/BsrfzR gu3w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dkim-signature; bh=K3OsL86kpDlil9CCOSzLighfs4HUJBXvocnPU68sU5Y=; b=yykgmUl+0QkmJYRyyY09rlRYbR/TxrxIk/MYGCF4d9UKeRuRy751BIp9ObA6fFPlEd b/v2SOOHUda6AFxM2zPV5XlPiZVC2z1uJPkdi4Gf/hzFNU+lj+823VDq/CRNw5Xj4HeO yqmXZGmL9SYkCYnblmwPsRnb/LknxufcZZvod1ilrtJnklaV/0o3KjhE+yEqbDuOSEbr OKV+2FLsE9J1JwgpAIBTtBoD624eB+LQTo7Zzw3WtS3pQNQcyszHw0hqahDdhNU8JxCT 9OxuPNkfFMISIwhzgnlUJfFhvb0xVeDd6fBNj1Ncs2xA3DxNuBHccfw4Tq08ZZ+GUSt5 8EXQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@st.com header.s=STMicroelectronics header.b=piwsEnmQ; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=st.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f29si7668922ooh.85.2020.04.06.07.18.14; Mon, 06 Apr 2020 07:18:27 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@st.com header.s=STMicroelectronics header.b=piwsEnmQ; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=st.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728618AbgDFOR7 (ORCPT + 99 others); Mon, 6 Apr 2020 10:17:59 -0400 Received: from mx08-00178001.pphosted.com ([91.207.212.93]:1486 "EHLO mx07-00178001.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1728200AbgDFOR7 (ORCPT ); Mon, 6 Apr 2020 10:17:59 -0400 Received: from pps.filterd (m0046660.ppops.net [127.0.0.1]) by mx07-00178001.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id 036E470H019322; Mon, 6 Apr 2020 16:17:44 +0200 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=st.com; h=subject : to : cc : references : from : message-id : date : mime-version : in-reply-to : content-type : content-transfer-encoding; s=STMicroelectronics; bh=K3OsL86kpDlil9CCOSzLighfs4HUJBXvocnPU68sU5Y=; b=piwsEnmQhgL4OMJeTzkH9Dn9ehQJr0jaqtHGa7VXXyyx6oK/6i7g6A6LSOFgbS2Hywxm WbRXgijeVHOb7svaqZYA+pEnNgwssnbVAUEValIrS+0FkT6uN8iaUbgi1DM5yJBiBdC+ ulT+o583FZQhe2qtIEX+OYIX3cCZaYwZIOLRNGBU82cx1zqrCO10MOgs475wfb4UYXBU a93d7RPMdpIKQ9PIvg87ZHExVXl672+EZ+CsCaK06tU65RNP4WWoHFW3YKYs22f6Wksg tcBm1pjqQ1LbU2zfC/Arg3SlCUti3ii0CRnU326qC1kL1sb8IyUFBF0hoManugKxjewa sA== Received: from beta.dmz-eu.st.com (beta.dmz-eu.st.com [164.129.1.35]) by mx07-00178001.pphosted.com with ESMTP id 306fc9tgv7-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Mon, 06 Apr 2020 16:17:44 +0200 Received: from euls16034.sgp.st.com (euls16034.sgp.st.com [10.75.44.20]) by beta.dmz-eu.st.com (STMicroelectronics) with ESMTP id 841B710002A; Mon, 6 Apr 2020 16:17:43 +0200 (CEST) Received: from Webmail-eu.st.com (sfhdag3node1.st.com [10.75.127.7]) by euls16034.sgp.st.com (STMicroelectronics) with ESMTP id 66BFF220A27; Mon, 6 Apr 2020 16:17:43 +0200 (CEST) Received: from lmecxl0889.tpe.st.com (10.75.127.46) by SFHDAG3NODE1.st.com (10.75.127.7) with Microsoft SMTP Server (TLS) id 15.0.1473.3; Mon, 6 Apr 2020 16:17:42 +0200 Subject: Re: [PATCH v2 1/2] remoteproc: Add character device interface To: =?UTF-8?Q?Cl=c3=a9ment_Leger?= CC: Rishabh Bhatnagar , linux-kernel , linux-remoteproc , Bjorn Andersson , psodagud , tsoni , sidgup References: <1585699438-14394-1-git-send-email-rishabhb@codeaurora.org> <5b1c8287-0077-87e7-9364-b1f5a104c9e3@st.com> <6261646b2e0c4d9c8a30900b2f475890@codeaurora.org> <730c75c9-15e2-19c5-d97a-190bf1e6ffaa@st.com> <634144036.14036712.1586174761552.JavaMail.zimbra@kalray.eu> From: Arnaud POULIQUEN Message-ID: <8379238a-a9e0-da4e-330a-18dffba5f841@st.com> Date: Mon, 6 Apr 2020 16:17:41 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.4.1 MIME-Version: 1.0 In-Reply-To: <634144036.14036712.1586174761552.JavaMail.zimbra@kalray.eu> Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 8bit X-Originating-IP: [10.75.127.46] X-ClientProxiedBy: SFHDAG7NODE3.st.com (10.75.127.21) To SFHDAG3NODE1.st.com (10.75.127.7) X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.138,18.0.676 definitions=2020-04-06_08:2020-04-06,2020-04-06 signatures=0 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Clément, On 4/6/20 2:06 PM, Clément Leger wrote: > Hi Arnaud, > > ----- On 6 Apr, 2020, at 11:01, Arnaud Pouliquen arnaud.pouliquen@st.com wrote: > >> On 4/3/20 9:13 PM, rishabhb@codeaurora.org wrote: >>> On 2020-04-02 10:28, Arnaud POULIQUEN wrote: >>>> Hi >>>> >>>> On 4/1/20 2:03 AM, Rishabh Bhatnagar wrote: >>>>> Add the character device interface for userspace applications. >>>>> This interface can be used in order to boot up and shutdown >>>>> remote subsystems. Currently there is only a sysfs interface >>>>> which the userspace clients can use. If a usersapce application >>>>> crashes after booting the remote processor does not get any >>>>> indication about the crash. It might still assume that the >>>>> application is running. For example modem uses remotefs service >>>>> to fetch data from disk/flash memory. If the remotefs service >>>>> crashes, modem keeps on requesting data which might lead to a >>>>> crash. Adding a character device interface makes the remote >>>>> processor tightly coupled with the user space application. >>>>> A crash of the application leads to a close on the file descriptors >>>>> therefore shutting down the remoteproc. >>>> >>>> Sorry I'm late in the discussion, I hope I've gone through the whole >>>> discussion so I don't reopen a closed point... >>>> >>>> Something here is not crystal clear to me so I'd rather share it... >>>> >>>> I suppose that you the automatic restart of the application is not possible to >>>> stop and restart the remote processor... >>> Yes correct, while we wait for the application to restart we might observe a >>> fatal crash. >>>> >>>> Why this use case can not be solved by a process monitor or a service >>>> in userland that detects the application crash and stop the remote >>>> firmware using >>>> the sysfs interface? >>>> >>> What happens in the case where the process monitor itself crashes? This is >>> actually the approach we follow in our downstream code. We have a central entity >>> in userspace that controls bootup/shutdown of some remote processors based on >>> the >>> votes from userspace clients. We have observed cases where this entity >>> itself crashes and remote processors are left hanging. >> >> Your description makes me feel like this patch is only a workaround of something >> that >> should be fixed in the userland, even if i understand that hanging is one of the >> most >> critical problem and have to be fixed. >> For instance, how to handle several applications that interact with the remote >> processor >> ( e.g. rpmsg service applications) how to stop and restart everything. Using the >> char >> device would probaly resolve only a part of the issue... >> >> I'm not aware about your environment and i'm not a userland expert. But what i >> still not >> understand why a parent process can not do the job... >> I just test a simple script on my side that treat the kill -9 of an application >> ("cat" in my case). > > This is not entirely true, if the parent process is killed with a SIGKILL, then > the process will not be able to handle anything and the remoteproc will still > be running. > > What I understood from Rishabh patch is a way to allow a single process handling > the rproc state. We have the same kind of need and currently, if the > user application crashes, then the rproc is still running (which happens). > >> >> #start the remote firmware >> cp $1 /lib/firmware/ >> echo $1> /sys/class/remoteproc/remoteproc0/firmware >> echo start >/sys/class/remoteproc/remoteproc0/state >> #your binary >> cat /dev/kmsg >> # stop the remote firmware in case of crash (and potentially some other apps) >> echo stop >/sys/class/remoteproc/remoteproc0/state >> > > This is not really "production proof" and what happens if the application is > responsible of setting the firmware which might be jitted ? > And if the script receives the SIGKILL, then we are back to the same problem. Yes this is just a basic example, not an implementation which would depend on the environment. i'm just trying here to put forward a multi-process solution...and that I'm not an userland expert :). > > I really think, this is a step forward an easier and reliable use of the remoteproc > on userland to guarantee a coherent rproc state even if host application > crashes. I can see 3 ways of handling an application crash: - just shutdown the firmware => can be done through char device - stop some other related processes and/or generate a remote proc crash dump for debug => /sysfs and/or debugfs - do nothing as you want a silence application reboot and re-attach to the running firmware => use sysfs I'm challenging the solution because splitting the API seems to me not a good solution. Now i wonder how it works for the other applications that are relying on some other kernel frameworks... Perhaps the answer is that these frameworks don't use sysfs but char device. That would means that the sysfs solution is not the more adapted solution and perhaps we should migrate to a char device. But in this case, i think that it should implement the whole API and be exclusive with the syfs legacy API (so no sysfs or sysfs in read-only). Regards, Arnaud > > Regards, > > Clément > >> Anyway, it's just my feeling, let other people give their feedback. >> >>>> I just want to be sure that there is no alternative to this, because >>>> having two ways >>>> for application to shutdown the firmware seems to me confusing... >>> Does making this interface optional/configurable helps? >>>> >>>> What about the opposite service, mean inform the application that the remote >>>> processor is crashed? >>>> Do you identify such need? or the "auto" crash recovery is sufficient? >>> Auto recovery works perfectly for us. Although there is a mechanism in >>> place using QMI(Qualcomm MSM interface) that can notify clients about remote >>> processor crash. >> >> Thanks for the information. >> >> Regards >> Arnaud >> >>>> >>>> Thanks, >>>> Arnaud >>>>> >>>>> Signed-off-by: Rishabh Bhatnagar >>>>> --- >>>>>  drivers/remoteproc/Kconfig               |   9 +++ >>>>>  drivers/remoteproc/Makefile              |   1 + >>>>>  drivers/remoteproc/remoteproc_cdev.c     | 100 +++++++++++++++++++++++++++++++ >>>>>  drivers/remoteproc/remoteproc_internal.h |  22 +++++++ >>>>>  include/linux/remoteproc.h               |   2 + >>>>>  5 files changed, 134 insertions(+) >>>>>  create mode 100644 drivers/remoteproc/remoteproc_cdev.c >>>>> >>>>> diff --git a/drivers/remoteproc/Kconfig b/drivers/remoteproc/Kconfig >>>>> index de3862c..6374b79 100644 >>>>> --- a/drivers/remoteproc/Kconfig >>>>> +++ b/drivers/remoteproc/Kconfig >>>>> @@ -14,6 +14,15 @@ config REMOTEPROC >>>>> >>>>>  if REMOTEPROC >>>>> >>>>> +config REMOTEPROC_CDEV >>>>> +    bool "Remoteproc character device interface" >>>>> +    help >>>>> +      Say y here to have a character device interface for Remoteproc >>>>> +      framework. Userspace can boot/shutdown remote processors through >>>>> +      this interface. >>>>> + >>>>> +      It's safe to say N if you don't want to use this interface. >>>>> + >>>>>  config IMX_REMOTEPROC >>>>>      tristate "IMX6/7 remoteproc support" >>>>>      depends on ARCH_MXC >>>>> diff --git a/drivers/remoteproc/Makefile b/drivers/remoteproc/Makefile >>>>> index e30a1b1..b7d4f77 100644 >>>>> --- a/drivers/remoteproc/Makefile >>>>> +++ b/drivers/remoteproc/Makefile >>>>> @@ -9,6 +9,7 @@ remoteproc-y                += remoteproc_debugfs.o >>>>>  remoteproc-y                += remoteproc_sysfs.o >>>>>  remoteproc-y                += remoteproc_virtio.o >>>>>  remoteproc-y                += remoteproc_elf_loader.o >>>>> +obj-$(CONFIG_REMOTEPROC_CDEV)        += remoteproc_cdev.o >>>>>  obj-$(CONFIG_IMX_REMOTEPROC)        += imx_rproc.o >>>>>  obj-$(CONFIG_MTK_SCP)            += mtk_scp.o mtk_scp_ipi.o >>>>>  obj-$(CONFIG_OMAP_REMOTEPROC)        += omap_remoteproc.o >>>>> diff --git a/drivers/remoteproc/remoteproc_cdev.c >>>>> b/drivers/remoteproc/remoteproc_cdev.c >>>>> new file mode 100644 >>>>> index 0000000..8182bd1 >>>>> --- /dev/null >>>>> +++ b/drivers/remoteproc/remoteproc_cdev.c >>>>> @@ -0,0 +1,100 @@ >>>>> +// SPDX-License-Identifier: GPL-2.0-only >>>>> +/* >>>>> + * Character device interface driver for Remoteproc framework. >>>>> + * >>>>> + * Copyright (c) 2020, The Linux Foundation. All rights reserved. >>>>> + */ >>>>> + >>>>> +#include >>>>> +#include >>>>> +#include >>>>> +#include >>>>> +#include >>>>> + >>>>> +#include "remoteproc_internal.h" >>>>> + >>>>> +#define NUM_RPROC_DEVICES    64 >>>>> +static dev_t rproc_cdev; >>>>> +static DEFINE_IDA(cdev_minor_ida); >>>>> + >>>>> +static int rproc_cdev_open(struct inode *inode, struct file *file) >>>>> +{ >>>>> +    struct rproc *rproc; >>>>> + >>>>> +    rproc = container_of(inode->i_cdev, struct rproc, char_dev); >>>>> + >>>>> +    if (!rproc) >>>>> +        return -EINVAL; >>>>> + >>>>> +    if (rproc->state == RPROC_RUNNING) >>>>> +        return -EBUSY; >>>>> + >>>>> +    return rproc_boot(rproc); >>>>> +} >>>>> + >>>>> +static int rproc_cdev_release(struct inode *inode, struct file *file) >>>>> +{ >>>>> +    struct rproc *rproc; >>>>> + >>>>> +    rproc = container_of(inode->i_cdev, struct rproc, char_dev); >>>>> + >>>>> +    if (!rproc || rproc->state != RPROC_RUNNING) >>>>> +        return -EINVAL; >>>>> + >>>>> +    rproc_shutdown(rproc); >>>>> + >>>>> +    return 0; >>>>> +} >>>>> + >>>>> +static const struct file_operations rproc_fops = { >>>>> +    .open = rproc_cdev_open, >>>>> +    .release = rproc_cdev_release, >>>>> +}; >>>>> + >>>>> +int rproc_char_device_add(struct rproc *rproc) >>>>> +{ >>>>> +    int ret, minor; >>>>> +    dev_t cdevt; >>>>> + >>>>> +    minor = ida_simple_get(&cdev_minor_ida, 0, NUM_RPROC_DEVICES, >>>>> +                   GFP_KERNEL); >>>>> +    if (minor < 0) { >>>>> +        dev_err(&rproc->dev, "%s: No more minor numbers left! rc:%d\n", >>>>> +            __func__, minor); >>>>> +        return -ENODEV; >>>>> +    } >>>>> + >>>>> +    cdev_init(&rproc->char_dev, &rproc_fops); >>>>> +    rproc->char_dev.owner = THIS_MODULE; >>>>> + >>>>> +    cdevt = MKDEV(MAJOR(rproc_cdev), minor); >>>>> +    ret = cdev_add(&rproc->char_dev, cdevt, 1); >>>>> +    if (ret < 0) >>>>> +        ida_simple_remove(&cdev_minor_ida, minor); >>>>> + >>>>> +    rproc->dev.devt = cdevt; >>>>> +    return ret; >>>>> +} >>>>> + >>>>> +void rproc_char_device_remove(struct rproc *rproc) >>>>> +{ >>>>> +    __unregister_chrdev(MAJOR(rproc->dev.devt), MINOR(rproc->dev.devt), 1, >>>>> +                "rproc"); >>>>> +    ida_simple_remove(&cdev_minor_ida, MINOR(rproc->dev.devt)); >>>>> +} >>>>> + >>>>> +void __init rproc_init_cdev(void) >>>>> +{ >>>>> +    int ret; >>>>> + >>>>> +    ret = alloc_chrdev_region(&rproc_cdev, 0, NUM_RPROC_DEVICES, "rproc"); >>>>> +    if (ret < 0) { >>>>> +        pr_err("Failed to alloc rproc_cdev region, err %d\n", ret); >>>>> +        return; >>>>> +    } >>>>> +} >>>>> + >>>>> +void __exit rproc_exit_cdev(void) >>>>> +{ >>>>> +    __unregister_chrdev(MAJOR(rproc_cdev), 0, NUM_RPROC_DEVICES, "rproc"); >>>>> +} >>>>> diff --git a/drivers/remoteproc/remoteproc_internal.h >>>>> b/drivers/remoteproc/remoteproc_internal.h >>>>> index 493ef92..28d61a1 100644 >>>>> --- a/drivers/remoteproc/remoteproc_internal.h >>>>> +++ b/drivers/remoteproc/remoteproc_internal.h >>>>> @@ -47,6 +47,27 @@ struct dentry *rproc_create_trace_file(const char *name, >>>>> struct rproc *rproc, >>>>>  int rproc_init_sysfs(void); >>>>>  void rproc_exit_sysfs(void); >>>>> >>>>> +#ifdef CONFIG_REMOTEPROC_CDEV >>>>> +void rproc_init_cdev(void); >>>>> +void rproc_exit_cdev(void); >>>>> +int rproc_char_device_add(struct rproc *rproc); >>>>> +void rproc_char_device_remove(struct rproc *rproc); >>>>> +#else >>>>> +static inline void rproc_init_cdev(void) >>>>> +{ >>>>> +} >>>>> +static inline void rproc_exit_cdev(void) >>>>> +{ >>>>> +} >>>>> +static inline int rproc_char_device_add(struct rproc *rproc) >>>>> +{ >>>>> +    return 0; >>>>> +} >>>>> +static inline void  rproc_char_device_remove(struct rproc *rproc) >>>>> +{ >>>>> +} >>>>> +#endif >>>>> + >>>>>  void rproc_free_vring(struct rproc_vring *rvring); >>>>>  int rproc_alloc_vring(struct rproc_vdev *rvdev, int i); >>>>> >>>>> @@ -63,6 +84,7 @@ struct resource_table *rproc_elf_find_loaded_rsc_table(struct >>>>> rproc *rproc, >>>>>  struct rproc_mem_entry * >>>>>  rproc_find_carveout_by_name(struct rproc *rproc, const char *name, ...); >>>>> >>>>> + >>>>>  static inline >>>>>  int rproc_fw_sanity_check(struct rproc *rproc, const struct firmware *fw) >>>>>  { >>>>> diff --git a/include/linux/remoteproc.h b/include/linux/remoteproc.h >>>>> index 16ad666..c4ca796 100644 >>>>> --- a/include/linux/remoteproc.h >>>>> +++ b/include/linux/remoteproc.h >>>>> @@ -37,6 +37,7 @@ >>>>> >>>>>  #include >>>>>  #include >>>>> +#include >>>>>  #include >>>>>  #include >>>>>  #include >>>>> @@ -514,6 +515,7 @@ struct rproc { >>>>>      bool auto_boot; >>>>>      struct list_head dump_segments; >>>>>      int nb_vdev; >>>>> +    struct cdev char_dev; >>>>>  }; >>>>> >>>>>  /**