Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754195AbdLFIzY (ORCPT ); Wed, 6 Dec 2017 03:55:24 -0500 Received: from mx08-00178001.pphosted.com ([91.207.212.93]:51854 "EHLO mx07-00178001.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1753584AbdLFIzU (ORCPT ); Wed, 6 Dec 2017 03:55:20 -0500 Subject: Re: [PATCH v4 3/5] remoteproc: Pass type of shutdown to subdev remove To: Bjorn Andersson CC: Andy Gross , Ohad Ben-Cohen , Arun Kumar Neelakantam , Chris Lew , , , , References: <20171130011644.9421-1-bjorn.andersson@linaro.org> <20171130011644.9421-4-bjorn.andersson@linaro.org> <6d6d01bb-f8f4-3380-c9ee-3d14c707bdf0@st.com> <20171205064611.GM28761@minitux> <516ab4fc-f1c9-574b-3ca7-a6c3c6667358@st.com> <20171205171717.GN28761@minitux> From: Arnaud Pouliquen Message-ID: <0c4e7c5a-bfbf-cc41-756c-9b1f0ebfc5ef@st.com> Date: Wed, 6 Dec 2017 09:55:12 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: <20171205171717.GN28761@minitux> Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 8bit X-Originating-IP: [10.75.127.47] X-ClientProxiedBy: SFHDAG5NODE3.st.com (10.75.127.15) To SFHDAG3NODE1.st.com (10.75.127.7) X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2017-12-06_03:,, signatures=0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 5023 Lines: 136 Hello, I saw your new version but i 'm answering to this one to continue discussion. On 12/05/2017 06:17 PM, Bjorn Andersson wrote: > On Tue 05 Dec 02:54 PST 2017, Arnaud Pouliquen wrote: > >> >> >> On 12/05/2017 07:46 AM, Bjorn Andersson wrote: >>> On Fri 01 Dec 06:50 PST 2017, Arnaud Pouliquen wrote: >>> >>>> hello Bjorn, >>>> >>>> Sorry for these late remarks/questions >>>> >>> >>> No worries, I'm happy to see you reading the patch! >>> >>>> >>>> On 11/30/2017 02:16 AM, Bjorn Andersson wrote: >>> [..] >>>>> diff --git a/drivers/remoteproc/qcom_common.c b/drivers/remoteproc/qcom_common.c >>> [..] >>>>> @@ -785,17 +785,17 @@ static int rproc_probe_subdevices(struct rproc *rproc) >>>>> >>>>> unroll_registration: >>>>> list_for_each_entry_continue_reverse(subdev, &rproc->subdevs, node) >>>>> - subdev->remove(subdev); >>>>> + subdev->remove(subdev, false); >>>> Why do you need to do a non graceful remove in this case? This could >>>> lead to side effect like memory leakage... >>>> >>> >>> Regardless of this being true or false resources should always be >>> reclaimed. >>> >>> The reason for introducing this is that the modem in the Qualcomm >>> platforms implements persistent storage and it's preferred to tell it to >>> flush the latest data to the storage server (on the Linux side) before >>> pulling the plug. But in the case of a firmware crash this mechanism >>> will not be operational and there's no point in attempting this >>> "graceful shutdown". >> I understand your usecase for Qualcomm, but in rproc_probe_subdevices >> there is not crash of the remote firmware , so remove should be graceful. >> > > Now I get your point, sorry. I agree with you, as this is triggering a > clean stop of the system this should be marked "graceful". > > Will update, thanks. > >>> >>> [..] >>>>> diff --git a/include/linux/remoteproc.h b/include/linux/remoteproc.h >>>>> index 44e630eb3d94..20a9467744ea 100644 >>>>> --- a/include/linux/remoteproc.h >>>>> +++ b/include/linux/remoteproc.h >>>>> @@ -456,7 +456,7 @@ struct rproc_subdev { >>>>> struct list_head node; >>>>> >>>>> int (*probe)(struct rproc_subdev *subdev); >>>>> - void (*remove)(struct rproc_subdev *subdev); >>>>> + void (*remove)(struct rproc_subdev *subdev, bool graceful); >>>> What about adding a new ops instead of a parameter, like a recovery >>>> callback? >>>> >>> >>> I think that for symmetry purposes it should be probe/remove in both >>> code paths. A possible alternative to the proposal would be to introduce >>> an operation "request_shutdown()" the would be called in the proposed >>> graceful code path. >>> >>> >>> However, in the Qualcomm SMD and GLINK (conceptually equivalent to >>> virtio-rpmsg) it is possible to open and close communication channels >>> and it's conceivable to see that the graceful case would close all >>> channels cleanly while the non-graceful case would just remove the rpmsg >>> devices (and leave the channel states/memory as is). >>> >>> In this case a "request_shutdown()" would complicate things, compared to >>> the boolean. >>> >> I would be more for a specific ops that inform sub-dev on a crash. This >> would allow sub-dev to perform specific action (for instance dump) and >> store crash information, then on remove, sub_dev would perform specific >> action. > > There is a separate discussion (although dormant) on how to gather core > dumps, which should cover these cases. > >> This could be something like "trigger_recovery" that is propagated to >> the sub-dev. >> > > Right, this step does make sense, but is the opposite of what I need - > i.e. a means to trigger a clean shutdown. Could you clarify this point? i do not see my proposal as the opposite. In your proposal: - rproc_trigger_recovery: graceful is set to false - rproc_shutdown: Graceful is set to true My proposal is to call an new ops (if defined) before the stop in rproc_trigger_recovery. If you set a local variable in Qualcomm subdev drivers this should do the job for your need. I tried to have a look in Qualcomm part to understand implementation. but seems that you just add the parameter for time being. I think that main point that bother me here, is the that the "graceful" mode should be the normal mode. And in your implementation this look like the exception mode. Perhaps more a feeling than anything else...but if you decide to keep argument i would propose to inverse logic using something like "rproc_crashed" instead of "graceful". > >> That would sound more flexible from my point of view. >> > > At this point I see this flexibility as unnecessary complexity, if such > need show up (beyond the core dump gathering) we should bring this up > again. I let you decide what is the best solution. My concerns is to be sure that your solution is enough generic and not too Qualcomm platform oriented. As you mentioned in OpenAMP meeting it is quite difficult to come back on an implementation , especially if it impacts the API. Regards Arnaud > > Regards, > Bjorn >