Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp192500iob; Mon, 2 May 2022 16:48:31 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyJ3K1GGorvVF54puNOJ65689SAUEzNeLQ5za019MqLT2N0Bua35R4nXOUTTeIHYwiFgUwL X-Received: by 2002:a63:5b14:0:b0:39c:d7d5:722e with SMTP id p20-20020a635b14000000b0039cd7d5722emr11554138pgb.478.1651535311681; Mon, 02 May 2022 16:48:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1651535311; cv=none; d=google.com; s=arc-20160816; b=m8IliH1VqbML5SVUPU46JCswSQrcYgDY6q0kIws+tqsoKwMvx0/tfg3lVZAPimJpLG iNutyzEfI0R43IPErl4CtdmeiM7ho18kHZzAgtka6GdAJCd/VHGwUUmG0WDunY2GqJ9t 1YSplODZ5+ITKbgTviYNYlod8lRtWY0EGhq0UobqDcB7O1xqd2nVYZAQ7G904vnvbKOg dRnI3adAeUhIRuioO3PLaQ4Ff1FBwxeCyJA+ypsYdWarKhXiZJ1kDih5EGFcgWTQeghI +LRbu/4gOLLX4NJGDd6XqfDoVOgF8ge/PKQ9WaCv3v959LiUDymbGeD8i+AR8RR3dWNU WMMA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=KCgiDhy9A6FkcH9K9s8PuU6AkgU3yYdWy88x8ddb6Zc=; b=mNpnjaPEpZbg8onrcoJsV+G+nRt90PRp+HtXbXiraliG1+HcmJRF7qf4xpMIV/wFc+ 2Nmlh5ZjSQka0PysAd7H3lvM6gDAMcEycK0jcybQMtzRobsyaSMNjgHIaqcZp+2i9gwU SLdrxIMsieVsS2olt+C5fwrx8Ei9Tu93KFidbK3fN7jI9vejebem28NFUmGkXAyM6NNR aq4EJBA2UC2Tw1mrIHioRohhdpk5N1dzdnq07ZJRzirUt9GVVFwnHs7pg8lCyhGorhax O1aMNlOTWZipXhdJO1yHxrNTDWP1qmGGzSug6uHLKSNjtw2yz44wLmC9u8NSTe/L461/ sOdw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcdkim header.b="fnUMj/NW"; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id 32-20020a630b20000000b003816043f10bsi15392528pgl.768.2022.05.02.16.48.31 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 02 May 2022 16:48:31 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcdkim header.b="fnUMj/NW"; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 06DF833A1A; Mon, 2 May 2022 16:48:26 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S241843AbiEBQNl (ORCPT + 99 others); Mon, 2 May 2022 12:13:41 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54278 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S242185AbiEBQNh (ORCPT ); Mon, 2 May 2022 12:13:37 -0400 Received: from alexa-out-sd-01.qualcomm.com (alexa-out-sd-01.qualcomm.com [199.106.114.38]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id AF84CDF41; Mon, 2 May 2022 09:10:07 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=quicinc.com; i=@quicinc.com; q=dns/txt; s=qcdkim; t=1651507807; x=1683043807; h=message-id:date:mime-version:subject:to:cc:references: from:in-reply-to:content-transfer-encoding; bh=KCgiDhy9A6FkcH9K9s8PuU6AkgU3yYdWy88x8ddb6Zc=; b=fnUMj/NWMcLHChukuYV0m10L5sFoqh+5dnNyIDrIICSwc8GOwCdiP8YX Q2vAnl2nah4pUUe9gZZ+WIJUczMcmUuCMHGr0Dp+uO/COfbF6nPxKPWUR MCNeLLCHUqdYYFyowSd89bxTBWf4/aXEhL0m9yuJXYX79n8C4YNaSrEzq s=; Received: from unknown (HELO ironmsg03-sd.qualcomm.com) ([10.53.140.143]) by alexa-out-sd-01.qualcomm.com with ESMTP; 02 May 2022 09:10:07 -0700 X-QCInternal: smtphost Received: from nasanex01c.na.qualcomm.com ([10.47.97.222]) by ironmsg03-sd.qualcomm.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 02 May 2022 09:10:06 -0700 Received: from nalasex01a.na.qualcomm.com (10.47.209.196) by nasanex01c.na.qualcomm.com (10.47.97.222) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.986.22; Mon, 2 May 2022 09:10:06 -0700 Received: from [10.38.244.235] (10.80.80.8) by nalasex01a.na.qualcomm.com (10.47.209.196) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.986.22; Mon, 2 May 2022 09:10:03 -0700 Message-ID: <1ef7f0f1-075b-3ae2-ea80-fd6274ac1507@quicinc.com> Date: Mon, 2 May 2022 09:10:01 -0700 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.6.2 Subject: Re: [Freedreno] [PATCH] drm/msm/disp/dpu1: avoid clearing hw interrupts if hw_intr is null during drm uninit Content-Language: en-US To: Dmitry Baryshkov CC: Stephen Boyd , Vinod Polimera , , , , , , , , References: <1650952931-31988-1-git-send-email-quic_vpolimer@quicinc.com> <200eddae-02b8-5479-3e81-1f3885200ac0@quicinc.com> From: Abhinav Kumar In-Reply-To: Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: 7bit X-Originating-IP: [10.80.80.8] X-ClientProxiedBy: nasanex01a.na.qualcomm.com (10.52.223.231) To nalasex01a.na.qualcomm.com (10.47.209.196) X-Spam-Status: No, score=-3.6 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,NICE_REPLY_A,RDNS_NONE,SPF_HELO_NONE, T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 5/2/2022 1:05 AM, Dmitry Baryshkov wrote: > On Mon, 2 May 2022 at 04:38, Abhinav Kumar wrote: >> >> Looks like our new CI has given all the answers we need :) which is a >> great win for the CI in my opinion. >> >> Take a look at this report : >> https://gitlab.freedesktop.org/drm/msm/-/jobs/22015361 >> >> This issue seems to be because this change >> https://github.com/torvalds/linux/commit/169466d4e59ca204683998b7f45673ebf0eb2de6 >> is missing in our tree. >> >> Without this change, what happens is that we are not hitting the return >> 0 because we check for ENODEV. >> >> >> /* >> * External bridges are mandatory for eDP interfaces: one has to >> * provide at least an eDP panel (which gets wrapped into >> panel-bridge). >> * >> * For DisplayPort interfaces external bridges are optional, so >> * silently ignore an error if one is not present (-ENODEV). >> */ >> rc = dp_parser_find_next_bridge(dp_priv->parser); >> if (!dp->is_edp && rc == -ENODEV) >> return 0; >> >> So, I think we should do both: >> >> 1) Since we are running CI on the tree, backport this change so that >> this error path doesnt hit? >> >> 2) Add this protection as well because this shows that we can indeed hit >> this path in EDEFER cases causing this crash. > > I have been waiting for v2 for the last week or so. It should include > a fixed Fixes tag and an updated description (which should note that > this happens in the error path, etc) as requested by Stephen. > Prior to the above CI report, we did not know what is the error path in which this happens and why. Till then we were just speculating. Now that we do, we can certainly add it and post a v2. >> >> Thanks >> >> Abhinav >> >> On 4/27/2022 3:53 AM, Dmitry Baryshkov wrote: >>> On 27/04/2022 00:50, Stephen Boyd wrote: >>>> Quoting Vinod Polimera (2022-04-25 23:02:11) >>>>> Avoid clearing irqs and derefernce hw_intr when hw_intr is null. >>>> >>>> Presumably this is only the case when the display driver doesn't fully >>>> probe and something probe defers? Can you clarify how this situation >>>> happens? >>>> >>>>> >>>>> BUG: Unable to handle kernel NULL pointer dereference at virtual >>>>> address 0000000000000000 >>>>> >>>>> Call trace: >>>>> dpu_core_irq_uninstall+0x50/0xb0 >>>>> dpu_irq_uninstall+0x18/0x24 >>>>> msm_drm_uninit+0xd8/0x16c >>>>> msm_drm_bind+0x580/0x5fc >>>>> try_to_bring_up_master+0x168/0x1c0 >>>>> __component_add+0xb4/0x178 >>>>> component_add+0x1c/0x28 >>>>> dp_display_probe+0x38c/0x400 >>>>> platform_probe+0xb0/0xd0 >>>>> really_probe+0xcc/0x2c8 >>>>> __driver_probe_device+0xbc/0xe8 >>>>> driver_probe_device+0x48/0xf0 >>>>> __device_attach_driver+0xa0/0xc8 >>>>> bus_for_each_drv+0x8c/0xd8 >>>>> __device_attach+0xc4/0x150 >>>>> device_initial_probe+0x1c/0x28 >>>>> >>>>> Fixes: a73033619ea ("drm/msm/dpu: squash dpu_core_irq into >>>>> dpu_hw_interrupts") >>>> >>>> The fixes tag looks odd. In dpu_core_irq_uninstall() at that commit it >>>> is dealing with 'irq_obj' which isn't a pointer. After commit >>>> f25f656608e3 ("drm/msm/dpu: merge struct dpu_irq into struct >>>> dpu_hw_intr") dpu_core_irq_uninstall() starts using 'hw_intr' which is >>>> allocated on the heap. If we backported this patch to a place that had >>>> a73033619ea without f25f656608e3 it wouldn't make any sense. >>> >>> I'd agree here. The following tag would be correct: >>> >>> Fixes: f25f656608e3 ("drm/msm/dpu: merge struct dpu_irq into struct >>> dpu_hw_intr") >>> >>> >>>> >>>>> Signed-off-by: Vinod Polimera >>>>> --- >>>>> drivers/gpu/drm/msm/disp/dpu1/dpu_hw_interrupts.c | 3 +++ >>>>> 1 file changed, 3 insertions(+) >>>>> >>>>> diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_hw_interrupts.c >>>>> b/drivers/gpu/drm/msm/disp/dpu1/dpu_hw_interrupts.c >>>>> index c515b7c..ab28577 100644 >>>>> --- a/drivers/gpu/drm/msm/disp/dpu1/dpu_hw_interrupts.c >>>>> +++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_hw_interrupts.c >>>>> @@ -599,6 +599,9 @@ void dpu_core_irq_uninstall(struct dpu_kms *dpu_kms) >>>>> { >>>>> int i; >>>>> >>>>> + if (!dpu_kms->hw_intr) >>>>> + return; >>>>> + >>>>> pm_runtime_get_sync(&dpu_kms->pdev->dev); >>>>> for (i = 0; i < dpu_kms->hw_intr->total_irqs; i++) >>> >>> > > >