Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp4177627pxa; Mon, 10 Aug 2020 02:52:47 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzLaiUnJKzLczZ6b4e4pjEyDcQleRnFCiL3DIuXlg1G9x+Rkbzpy1ZL8qRbm55crTUNFXTa X-Received: by 2002:a17:906:c7d3:: with SMTP id dc19mr20754821ejb.495.1597053167075; Mon, 10 Aug 2020 02:52:47 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1597053167; cv=none; d=google.com; s=arc-20160816; b=CMJOeMqiQPnPfR7PpAuOGQrY0ljvm7XWIsM9plYanyfknycoCx7FLJq4NVsVXmwt2T eGu0GJDR9bkEkVUUQx0Gl9huStCqhTRyzc+OkuCbJqvWDsiIZEQUyvBXhq7dI4h61BH9 dZLPALT4VnuNb0guIWDx5AeXIynsFaFaTCiqQsfx7QrpFi6ggeZSfWhIjb1cnVzl1joB E8r+35fTLoxHejcbX0ZV9YpFpc6y30FWDv3RTvysjcxz2OydWOhXh37YCs/V5qPbtUqy tNagZMOWyKJhtGkOOZM3duI9aDX3Rdd9XBDGmHO/Yq3jQZehB3ydlA3rN3t+HH/zUFyW nL/A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dkim-signature; bh=wLAOFokMTumfYpsNR+8SVNBDFRnEeROvH0KpvMDc688=; b=AY8wFN6HF1RRMvfwwOkWodKz0t0ZKDpJxt4gFAsx1Thg1D09DmPqtC2z/4jh7lU1RH rqC9iOFHges2/QW6ZxcP9XFiedYG4E746Dg3eX+6Vaw74Nrj8MnwlDF1YtzdN2L6fDHO EhbvxACCO6LhYhT5rkOQlpibyO1342VwqWHq5lWAyuU7q1rWZ1BP+17UAg5zY8/dKdNG HjFf0Xk+DUb7CP06pydn7IeKxBaFCSPwzUnE3ekgytpDoBT0CMr3s8axjq83ttCl3vrv 8WX3mCoQb04/IYqLvRmJXyEKwF1baX8jMlRuTf7c/vwnlD2V86yxLxJtSbqRQEZgnucq tp/A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=k+dFKhCh; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id b12si9745415eds.283.2020.08.10.02.52.23; Mon, 10 Aug 2020 02:52:47 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=k+dFKhCh; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726584AbgHJJvF (ORCPT + 99 others); Mon, 10 Aug 2020 05:51:05 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:40988 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725846AbgHJJvD (ORCPT ); Mon, 10 Aug 2020 05:51:03 -0400 Received: from mail-wr1-x442.google.com (mail-wr1-x442.google.com [IPv6:2a00:1450:4864:20::442]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6BC9BC061786 for ; Mon, 10 Aug 2020 02:51:03 -0700 (PDT) Received: by mail-wr1-x442.google.com with SMTP id a15so7509038wrh.10 for ; Mon, 10 Aug 2020 02:51:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=wLAOFokMTumfYpsNR+8SVNBDFRnEeROvH0KpvMDc688=; b=k+dFKhCh67gRr7m58/fPDWfkNNhbMS48AtVuEWuUaL7GgRIyIEJqcBKh4X4a758b57 KkxB+nlBkH22LvsGhtwj+9AKExh67WZoahawhyDAkFVD9weSi8e5ZbVcy/eGKq099dNT 2rPIwd2l1SjoFjVUGdYWaoP+bfcSinZkbNbkr9MR2UhlRe6MyObY5TnVIz63omf1VXEc /CiuYs7UqB+sFQ+1RAzS4v8QUgGIqojlO2zYXqpOHhQb4j0xfterod5++UDpWvEyFfu1 dkN1qnsISdn7dKkCzrj6hspVdZz0ntwatQs9yITSo29sJgxhUIS1t/mceafkgkVnAger 2xsQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=wLAOFokMTumfYpsNR+8SVNBDFRnEeROvH0KpvMDc688=; b=FPk9WH3BoKaGqHHZc9WmF1wrRF0FF5b+nAKtRpYT72tD2+gPGlQiu6RpZZe5SkybBm i529EilJlcVmJ7bTpR9N7d1wiHpHGZ4WhWqlsfS+sys7wGiSro2hFuzs3MB6CSMDEJYj IMP/6d2K0KSVKPzUjoiVVoiT8xLdLI4flBCAAKywekbqXbOscUA9Lc1003yOO8AhQ3Oy hWKh8X0te6Fi1HKRr1KrQdIJiZ609YKS3cpieRi3AnBVlu62NGJyVzBc1O6TBJxCAmO0 NngnbhBGWK7PNe5lmmBr2CE0pOs0ZULocrOIM1A2hwf3Q/Mjel931dbIiI+ovpcn7izk lcPA== X-Gm-Message-State: AOAM531wgOaHjaFWQdnJVq/LCydzOyyHqlvGZBcHXenYpRCmk+UnrPmX HEaZ1xMktlnm64ZfaQEy0XDT1A== X-Received: by 2002:a5d:5588:: with SMTP id i8mr23947186wrv.177.1597053062045; Mon, 10 Aug 2020 02:51:02 -0700 (PDT) Received: from [192.168.1.8] ([195.24.90.54]) by smtp.googlemail.com with ESMTPSA id f6sm2022636wme.32.2020.08.10.02.51.00 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 10 Aug 2020 02:51:01 -0700 (PDT) Subject: Re: [RESEND 1/3] venus: core: handle race condititon for core ops To: Mansur Alisha Shaik , linux-media@vger.kernel.org Cc: linux-kernel@vger.kernel.org, linux-arm-msm@vger.kernel.org, vgarodia@codeaurora.org References: <1596781478-12216-1-git-send-email-mansur@codeaurora.org> <1596781478-12216-2-git-send-email-mansur@codeaurora.org> From: Stanimir Varbanov Message-ID: Date: Mon, 10 Aug 2020 12:50:59 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0 MIME-Version: 1.0 In-Reply-To: <1596781478-12216-2-git-send-email-mansur@codeaurora.org> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Mansur, Thanks for the patches! On 8/7/20 9:24 AM, Mansur Alisha Shaik wrote: > For core ops we are having only write protect but there > is no read protect, because of this in multthreading > and concurrency, one CPU core is reading without wait > which is causing the NULL pointer dereferece crash. > > one such scenario is as show below, where in one > core core->ops becoming NULL and in another core > calling core->ops->session_init(). > > CPU: core-7: > Call trace: > hfi_session_init+0x180/0x1dc [venus_core] > vdec_queue_setup+0x9c/0x364 [venus_dec] > vb2_core_reqbufs+0x1e4/0x368 [videobuf2_common] > vb2_reqbufs+0x4c/0x64 [videobuf2_v4l2] > v4l2_m2m_reqbufs+0x50/0x84 [v4l2_mem2mem] > v4l2_m2m_ioctl_reqbufs+0x2c/0x38 [v4l2_mem2mem] > v4l_reqbufs+0x4c/0x5c > __video_do_ioctl+0x2b0/0x39c > > CPU: core-0: > Call trace: > venus_shutdown+0x98/0xfc [venus_core] > venus_sys_error_handler+0x64/0x148 [venus_core] > process_one_work+0x210/0x3d0 > worker_thread+0x248/0x3f4 > kthread+0x11c/0x12c > > Signed-off-by: Mansur Alisha Shaik > --- > drivers/media/platform/qcom/venus/core.c | 2 +- > drivers/media/platform/qcom/venus/hfi.c | 5 ++++- > 2 files changed, 5 insertions(+), 2 deletions(-) > > diff --git a/drivers/media/platform/qcom/venus/core.c b/drivers/media/platform/qcom/venus/core.c > index 203c653..fe99c83 100644 > --- a/drivers/media/platform/qcom/venus/core.c > +++ b/drivers/media/platform/qcom/venus/core.c > @@ -64,8 +64,8 @@ static void venus_sys_error_handler(struct work_struct *work) > pm_runtime_get_sync(core->dev); > > hfi_core_deinit(core, true); > - hfi_destroy(core); > mutex_lock(&core->lock); > + hfi_destroy(core); As my recovery fixes [1] touches this part also, could you please apply them on top of yours and re-test? Otherwise this patch looks good to me. [1] https://www.spinics.net/lists/linux-arm-msm/msg70092.html > venus_shutdown(core); > > pm_runtime_put_sync(core->dev); > diff --git a/drivers/media/platform/qcom/venus/hfi.c b/drivers/media/platform/qcom/venus/hfi.c > index a211eb9..2eeb31f 100644 > --- a/drivers/media/platform/qcom/venus/hfi.c > +++ b/drivers/media/platform/qcom/venus/hfi.c > @@ -195,7 +195,7 @@ EXPORT_SYMBOL_GPL(hfi_session_create); > int hfi_session_init(struct venus_inst *inst, u32 pixfmt) > { > struct venus_core *core = inst->core; > - const struct hfi_ops *ops = core->ops; > + const struct hfi_ops *ops; > int ret; > > if (inst->state != INST_UNINIT) > @@ -204,10 +204,13 @@ int hfi_session_init(struct venus_inst *inst, u32 pixfmt) > inst->hfi_codec = to_codec_type(pixfmt); > reinit_completion(&inst->done); > > + mutex_lock(&core->lock); > + ops = core->ops; > ret = ops->session_init(inst, inst->session_type, inst->hfi_codec); > if (ret) > return ret; > > + mutex_unlock(&core->lock); > ret = wait_session_msg(inst); > if (ret) > return ret; > -- regards, Stan