Date: Tue, 5 May 2020 16:03:27 -0600
From: Mathieu Poirier
To: Arnaud POULIQUEN
Cc: bjorn.andersson@linaro.org, ohad@wizery.com, loic.pallardy@st.com,
    s-anna@ti.com, linux-remoteproc@vger.kernel.org, corbet@lwn.net,
    linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH v3 10/14] remoteproc: Deal with synchronisation when shutting down
Message-ID: <20200505220327.GB18333@xps15>
References: <20200424200135.28825-1-mathieu.poirier@linaro.org>
    <20200424200135.28825-11-mathieu.poirier@linaro.org>
    <20200430202312.GE17031@xps15>
    <04b8f860-2b01-7e4f-cdea-08a3cf8af26c@st.com>
In-Reply-To: <04b8f860-2b01-7e4f-cdea-08a3cf8af26c@st.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, May 04, 2020 at 01:34:43PM +0200, Arnaud POULIQUEN wrote:
>
>
> On 4/30/20 10:23 PM, Mathieu Poirier wrote:
> > On Wed, Apr 29, 2020 at 10:19:49AM +0200, Arnaud POULIQUEN wrote:
> >>
> >>
> >> On 4/24/20 10:01 PM, Mathieu Poirier wrote:
> >>> The remoteproc core must not allow function rproc_shutdown() to
> >>> proceed if currently synchronising with a remote processor and
> >>> the synchronisation operations of that remote processor does not
> >>> support it. Also part of the process is to set the synchronisation
> >>> flag so that the remoteproc core can make the right decisions when
> >>> restarting the system.
> >>>
> >>> Signed-off-by: Mathieu Poirier
> >>> ---
> >>>  drivers/remoteproc/remoteproc_core.c     | 32 ++++++++++++++++++++++++
> >>>  drivers/remoteproc/remoteproc_internal.h |  7 ++++++
> >>>  2 files changed, 39 insertions(+)
> >>>
> >>> diff --git a/drivers/remoteproc/remoteproc_core.c b/drivers/remoteproc/remoteproc_core.c
> >>> index 3a84a38ba37b..48afa1f80a8f 100644
> >>> --- a/drivers/remoteproc/remoteproc_core.c
> >>> +++ b/drivers/remoteproc/remoteproc_core.c
> >>> @@ -1849,6 +1849,27 @@ int rproc_boot(struct rproc *rproc)
> >>>  }
> >>>  EXPORT_SYMBOL(rproc_boot);
> >>>
> >>> +static bool rproc_can_shutdown(struct rproc *rproc)
> >>> +{
> >>> +	/*
> >>> +	 * The remoteproc core is the lifecycle manager, no problem
> >>> +	 * calling for a shutdown.
> >>> +	 */
> >>> +	if (!rproc_needs_syncing(rproc))
> >>> +		return true;
> >>> +
> >>> +	/*
> >>> +	 * The remoteproc has been loaded by another entity (as per above
> >>> +	 * condition) and the platform code has given us the capability
> >>> +	 * of stopping it.
> >>> +	 */
> >>> +	if (rproc->sync_ops->stop)
> >>> +		return true;
> >>
> >> This means that if rproc->sync_ops->stop is null rproc_stop_subdevices will not
> >> be called? seems not symmetric with the start sequence.
> >
> > If rproc->sync_ops->stop is not provided then the remoteproc core can't stop the
> > remote processor at all after it has synchronised with it. If a usecase
> > requires some kind of soft reset then a stop() function that uses a mailbox
> > notification or some other mechanism can be provided to tell the remote
> > processor to put itself back in startup mode again.
> >
> > Is this fine with you or there is still something I don't get?
>
> My point here is more around the subdevices. But perhaps i missed something...
>
> In rproc_start rproc_start_subdevices is called, even if sync_start is null.

Here I'll take that you mean sync_ops::start()

> But in rproc_shutdown rproc_stop is not called, if sync_ops->stop is null.
> So rproc_stop_subdevices is not called in this case.

Correct. I am pretty sure some people don't want the remoteproc core to be able
to do anything other than synchronise with a remote processor, be it at boot
time or when the remote processor has crashed.

I can also see scenarios where people want to be able to start and stop
subdevices from the remoteproc core, but _not_ power cycle the remote processor.
In such cases the sync_ops::stop() should be some kind of notification telling
the remote processor to put itself back in initialisation mode and
sync_flags.after_stop should be set to true.

> Then if sync_flags.after_stop is false, it looks like that something will go wrong
> at next start.

If sync_ops::stop is NULL then the value of sync_flags.after_stop becomes
irrelevant because that state can't be reached. Let me know if you found a
condition where this isn't the case and I will correct it.

>
> >
> >> Probably not useful to test it here as condition is already handled in rproc_stop_device...
> >>
> >> Regards
> >> Arnaud
> >>> +
> >>> +	/* Any other condition should not be allowed */
> >>> +	return false;
> >>> +}
> >>> +
> >>>  /**
> >>>   * rproc_shutdown() - power off the remote processor
> >>>   * @rproc: the remote processor
> >>> @@ -1879,6 +1900,9 @@ void rproc_shutdown(struct rproc *rproc)
> >>>  		return;
> >>>  	}
> >>>
> >>> +	if (!rproc_can_shutdown(rproc))
> >>> +		goto out;
> >>> +
> >>>  	/* if the remote proc is still needed, bail out */
> >>>  	if (!atomic_dec_and_test(&rproc->power))
> >>>  		goto out;
> >>> @@ -1898,6 +1922,14 @@ void rproc_shutdown(struct rproc *rproc)
> >>>  	kfree(rproc->cached_table);
> >>>  	rproc->cached_table = NULL;
> >>>  	rproc->table_ptr = NULL;
> >>> +
> >>> +	/*
> >>> +	 * The remote processor has been switched off - tell the core what
> >>> +	 * operation to use from hereon, i.e whether an external entity will
> >>> +	 * reboot the remote processor or it is now the remoteproc core's
> >>> +	 * responsability.
> >>> +	 */
> >>> +	rproc_set_sync_flag(rproc, RPROC_SYNC_STATE_SHUTDOWN);
> >>>  out:
> >>>  	mutex_unlock(&rproc->lock);
> >>>  }
> >>> diff --git a/drivers/remoteproc/remoteproc_internal.h b/drivers/remoteproc/remoteproc_internal.h
> >>> index 61500981155c..7dcc0a26892b 100644
> >>> --- a/drivers/remoteproc/remoteproc_internal.h
> >>> +++ b/drivers/remoteproc/remoteproc_internal.h
> >>> @@ -27,6 +27,9 @@ struct rproc_debug_trace {
> >>>  /*
> >>>   * enum rproc_sync_states - remote processsor sync states
> >>>   *
> >>> + * @RPROC_SYNC_STATE_SHUTDOWN	state to use after the remoteproc core
> >>> + *				has shutdown (rproc_shutdown()) the
> >>> + *				remote processor.
> >>>   * @RPROC_SYNC_STATE_CRASHED	state to use after the remote processor
> >>>   *				has crashed but has not been recovered by
> >>>   *				the remoteproc core yet.
> >>> @@ -36,6 +39,7 @@ struct rproc_debug_trace {
> >>>   *				operation to use.
> >>>   */
> >>>  enum rproc_sync_states {
> >>> +	RPROC_SYNC_STATE_SHUTDOWN,
> >>>  	RPROC_SYNC_STATE_CRASHED,
> >>>  };
> >>>
> >>> @@ -43,6 +47,9 @@ static inline void rproc_set_sync_flag(struct rproc *rproc,
> >>>  					       enum rproc_sync_states state)
> >>>  {
> >>>  	switch (state) {
> >>> +	case RPROC_SYNC_STATE_SHUTDOWN:
> >>> +		rproc->sync_with_rproc = rproc->sync_flags.after_stop;
> >>> +		break;
> >>>  	case RPROC_SYNC_STATE_CRASHED:
> >>>  		rproc->sync_with_rproc = rproc->sync_flags.after_crash;
> >>>  		break;
> >>>
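
The shutdown decision and sync-flag update discussed in this thread can be
modelled outside the kernel. What follows is only a minimal user-space sketch
under stated assumptions, not the patch itself: the struct layouts are
simplified stand-ins, rproc_needs_syncing() is assumed to simply return
sync_with_rproc, the NULL check on sync_ops is an addition that the quoted
patch does not have, and notify_stop() is a hypothetical callback.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

struct rproc;

/* Simplified stand-in for the synchronisation operations. */
struct rproc_ops {
	int (*start)(struct rproc *rproc);
	int (*stop)(struct rproc *rproc);
};

/* Whether to keep synchronising after a stop or after a crash. */
struct rproc_sync_flags {
	bool after_stop;
	bool after_crash;
};

/* Simplified stand-in: only the fields used by the logic below. */
struct rproc {
	const struct rproc_ops *sync_ops;
	struct rproc_sync_flags sync_flags;
	bool sync_with_rproc;
};

enum rproc_sync_states {
	RPROC_SYNC_STATE_SHUTDOWN,
	RPROC_SYNC_STATE_CRASHED,
};

/* Assumption: syncing is needed exactly when sync_with_rproc is set. */
bool rproc_needs_syncing(const struct rproc *rproc)
{
	return rproc->sync_with_rproc;
}

/* Mirrors the quoted rproc_can_shutdown(), plus a NULL guard on sync_ops. */
bool rproc_can_shutdown(const struct rproc *rproc)
{
	/* The core manages the lifecycle itself: shutdown is allowed. */
	if (!rproc_needs_syncing(rproc))
		return true;

	/* Syncing, but the platform gave us a way to stop the processor. */
	if (rproc->sync_ops && rproc->sync_ops->stop)
		return true;

	/* Syncing with no stop operation: shutdown is refused. */
	return false;
}

/* Mirrors the quoted rproc_set_sync_flag(). */
void rproc_set_sync_flag(struct rproc *rproc, enum rproc_sync_states state)
{
	switch (state) {
	case RPROC_SYNC_STATE_SHUTDOWN:
		rproc->sync_with_rproc = rproc->sync_flags.after_stop;
		break;
	case RPROC_SYNC_STATE_CRASHED:
		rproc->sync_with_rproc = rproc->sync_flags.after_crash;
		break;
	}
}

/* Hypothetical stop callback, e.g. a mailbox notification in a real driver. */
int notify_stop(struct rproc *rproc)
{
	(void)rproc;
	return 0;
}
```

In this model, when sync_ops->stop is NULL while syncing, rproc_can_shutdown()
returns false, so the RPROC_SYNC_STATE_SHUTDOWN case is never applied and the
value of sync_flags.after_stop has no effect, which is the point made in the
reply above.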