Received: by 2002:a05:6a10:9afc:0:0:0:0 with SMTP id t28csp1987804pxm; Thu, 24 Feb 2022 13:41:41 -0800 (PST) X-Google-Smtp-Source: ABdhPJw0OT0bCp3WSD96VdKpqQoGVa4ko+cZGL7eMx4oMWWTdgJMskvXe3n4fWlmZVFBmf4jsUvt X-Received: by 2002:a17:907:2a53:b0:6ce:e4fe:3f92 with SMTP id fe19-20020a1709072a5300b006cee4fe3f92mr3759626ejc.389.1645738900643; Thu, 24 Feb 2022 13:41:40 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1645738900; cv=none; d=google.com; s=arc-20160816; b=AU6AAE27Dy54ic5HNZKaGZMbdYYUU6fh4FgOyi2s8NcUZIu3w8QDIqwJlVKhQw+AyK pv712RJO3dKVVCAdWnZV2dvmPCfwBBR4RsVw6OcCMTUECYHrZ6zItWPty9zB+eW6H1zV laAgJ9Y80Ytg12xSJTWpG+wS5QcvuvwAbpimXLHo+92TMX4vfHG1rzdB3kS8/Onl47if weJjS9oS2edduWW8UEbP2PMvWjb3iFeGje3NkEFqSrvhB4jGg22KklZHut/ik/rlfpuq KDBb1dV8DiXt8h1U2Oiyw3tYOGvbWAaYMuRAuGWZgJz4bnmNeObFhoENWRN5rKZ7i3cO w29Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=/a67+0gGbi/53fucMkp86XK5Syhq+aP0PxhSEw207Xg=; b=GEYO5leTwaM80394DcTX/nowS5WulDbcQvp+MogK/9Gpx6Bm/Uih0hbKRNm4ptgI+g 0KYHwd6t0yppBfktp1fz6A1aqggcSlSBWL3fOLpP+KHdY4oSDQQJyirqY04JS/5XUMFN 4MJlj11gG64RZyp3BDTVjeXTjg8s3L4LE1mKmjKpWxQKlMYNRF1jSPPxohnxKGot8A6/ S3vzoSixM8f44YRAz0/s5GD4jtLcjaDqaeP8ZCeeyJwX1fC19ouqyoZVB66aS/BDlxWC 3mbAfjrgupCYGAANkQnFCcore6a25Dy41fu61jU/L/aBOxH+jjPF0LOGka2dLk6pXFlZ dUTQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@igalia.com header.s=20170329 header.b=EPr2Ey+S; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id ee42-20020a056402292a00b00410875aa4b8si426703edb.381.2022.02.24.13.41.16; Thu, 24 Feb 2022 13:41:40 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=fail header.i=@igalia.com header.s=20170329 header.b=EPr2Ey+S; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234683AbiBXVhi (ORCPT + 99 others); Thu, 24 Feb 2022 16:37:38 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:51718 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230493AbiBXVhf (ORCPT ); Thu, 24 Feb 2022 16:37:35 -0500 Received: from fanzine2.igalia.com (fanzine2.igalia.com [213.97.179.56]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2DCE2193DA for ; Thu, 24 Feb 2022 13:36:59 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=igalia.com; s=20170329; h=In-Reply-To:Content-Type:MIME-Version:References:Message-ID: Subject:Cc:To:From:Date:Sender:Reply-To:Content-Transfer-Encoding:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=/a67+0gGbi/53fucMkp86XK5Syhq+aP0PxhSEw207Xg=; b=EPr2Ey+S1oil5n9plHZcXMFKz1 8bWTSqEhsHuEejmT6pOOdEgwET+XbNO8Dpc4SNIvMZFjDexUVxhaoRpNWwYPKxkqOi/R/fJp6/tSo VczkYthWzO8XTx0/nTZiWl67rSKqhMOzfxsZgMjhkkU/of0fabvuiidt4taNuLA9ss2Ld5C8vbZrL iwYnqa6UIbFqjDszHu7+5EOzbPDooAMlmPAKMGeZNQULjuqiI1Em4KpWFryYzI47rbtUL1Np/09gP f+gjnLCvX4m4TTsGD5ZmL2LgeRDSyRgfxQcJOQoRvB+c8r0LLGTEb9zKI9htQVorYaH/pDFmnHEbN HZ1mdjzg==; Received: from [102.222.140.247] (helo=mail.igalia.com) by fanzine2.igalia.com with esmtpsa (Cipher TLS1.3:ECDHE_SECP256R1__RSA_PSS_RSAE_SHA256__AES_256_GCM:256) (Exim) id 1nNLn0-0008Uz-9o; Thu, 24 Feb 2022 22:36:42 +0100 Date: Thu, 24 Feb 2022 20:36:02 -0100 From: Melissa Wen To: "Dhillon, Jasdeep" Cc: "amd-gfx@lists.freedesktop.org" , "dri-devel@lists.freedesktop.org" , "Wentland, Harry" , "Li, Sun peng (Leo)" , "Siqueira, Rodrigo" , "Deucher, Alexander" , "Koenig, Christian" , "Pan, Xinhui" , "airlied@linux.ie" , "daniel@ffwll.ch" , "Laktyushkin, Dmytro" , "Zhuo, Qingqing (Lillian)" , "linux-kernel@vger.kernel.org" Subject: Re: [PATCH v2] drm/amd/display: move FPU-related code from dcn20 to dml folder Message-ID: <20220224213602.dph2uu7mzhq7g32w@mail.igalia.com> References: <20220223230506.2656258-1-mwen@igalia.com> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha512; protocol="application/pgp-signature"; boundary="az5z2ftu5cpeyhmk" Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=1.2 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_SBL_CSS,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Level: * X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org --az5z2ftu5cpeyhmk Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 02/23, Dhillon, Jasdeep wrote: > [AMD Official Use Only] >=20 > Hi Melissa, >=20 > Thanks for the new update and added your changes. Hi Jasdeep, I thank you for taking a look at this. Let me know if you find something I should improve in this patch. Best Regards, Melissa >=20 > Regards, > Jasdeep > ________________________________ > From: Melissa Wen > Sent: February 23, 2022 6:05 PM > To: amd-gfx@lists.freedesktop.org ; dri-de= vel@lists.freedesktop.org ; Wentland, Harr= y ; Li, Sun peng (Leo) ; Siquei= ra, Rodrigo ; Deucher, Alexander ; Koenig, Christian ; Pan, Xinhui ; airlied@linux.ie ; daniel@ffwll.ch > Cc: Laktyushkin, Dmytro ; Dhillon, Jasdeep ; Zhuo, Qingqing (Lillian) ; = Melissa Wen ; linux-kernel@vger.kernel.org > Subject: [PATCH v2] drm/amd/display: move FPU-related code from dcn20 to = dml folder >=20 > Move parts of dcn20 code that uses FPU to dml folder. It aims to isolate > FPU operations as described by series: >=20 > drm/amd/display: Introduce FPU directory inside DC > https://nam11.safelinks.protection.outlook.com/?url=3Dhttps%3A%2F%2Fpatch= work.freedesktop.org%2Fseries%2F93042%2F&data=3D04%7C01%7CJasdeep.Dhill= on%40amd.com%7Cdf9863db496249b48b6f08d9f7210654%7C3dd8961fe4884e608e11a82d9= 94e183d%7C0%7C0%7C637812543530752899%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLj= AwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&sdata=3DMcy= 8dEkii67NyQoxaP0%2F8AjFYDhCZgqlm%2BGxAPo1OQ4%3D&reserved=3D0 >=20 > This patch moves the following functions from dcn20_resource to > dml/dcn20_fpu and calls of public functions in dcn20_resource are > wrapped by DC_FP_START/END(): >=20 > - void dcn20_populate_dml_writeback_from_context > - static bool is_dtbclk_required() > - static enum dcn_zstate_support_state() > - void dcn20_calculate_dlg_params() > - static void swizzle_to_dml_params() > - int dcn20_populate_dml_pipes_from_context() > - void dcn20_calculate_wm() > - void dcn20_cap_soc_clocks() > - void dcn20_update_bounding_box() > - void dcn20_patch_bounding_box() > - bool dcn20_validate_bandwidth_fp() >=20 > This movement also affects dcn21/30/31, as dcn20_calculate_dlg_params() > is used by them. For this reason, I included dcn20_fpu headers in > dcn20_resource headers to make dcn20_calculate_dlg_params() visible to > dcn21/30/31. >=20 > Three new functions are created to isolate well-delimited FPU > operations: >=20 > - void dcn20_fpu_set_wb_arb_params(): set cli_watermark, > pstate_watermark and time_per_pixel from wb_arb_params (struct > mcif_arb_params), since those uses FPU operations on double types: > WritebackUrgentWatermark, WritebackDRAMClockChangeWatermark, '16.0'. > - void dcn20_fpu_set_wm_ranges(): set min_fill_clk_mhz and > max_fill_clk_mhz involves FPU calcs on dram_speed_mts (double type); > - void dcn20_fpu_adjust_dppclk(): adjust operation on RequiredDPPCLK > that is a double. >=20 > -- >=20 > v2: > - besides dcn20_resource, dcn20_calculate_dlg_params() is used in > dcn[21/30/31]_resource and therefore it needs to be wrapped by > DC_FP_START/END wherever it is called, as verified by > dc_assert_fp_enabled. >=20 > Signed-off-by: Melissa Wen > --- > drivers/gpu/drm/amd/display/dc/dcn20/Makefile | 25 - > .../drm/amd/display/dc/dcn20/dcn20_resource.c | 1370 +--------------- > .../drm/amd/display/dc/dcn20/dcn20_resource.h | 30 +- > .../drm/amd/display/dc/dcn21/dcn21_resource.c | 2 + > .../drm/amd/display/dc/dcn30/dcn30_resource.c | 2 + > .../drm/amd/display/dc/dcn31/dcn31_resource.c | 2 + > .../drm/amd/display/dc/dml/dcn20/dcn20_fpu.c | 1385 +++++++++++++++++ > .../drm/amd/display/dc/dml/dcn20/dcn20_fpu.h | 42 +- > 8 files changed, 1457 insertions(+), 1401 deletions(-) >=20 > diff --git a/drivers/gpu/drm/amd/display/dc/dcn20/Makefile b/drivers/gpu/= drm/amd/display/dc/dcn20/Makefile > index 5fcaf78334ff..abaed2121feb 100644 > --- a/drivers/gpu/drm/amd/display/dc/dcn20/Makefile > +++ b/drivers/gpu/drm/amd/display/dc/dcn20/Makefile > @@ -9,31 +9,6 @@ DCN20 =3D dcn20_resource.o dcn20_init.o dcn20_hwseq.o dc= n20_dpp.o dcn20_dpp_cm.o d >=20 > DCN20 +=3D dcn20_dsc.o >=20 > -ifdef CONFIG_X86 > -CFLAGS_$(AMDDALPATH)/dc/dcn20/dcn20_resource.o :=3D -mhard-float -msse > -endif > - > -ifdef CONFIG_PPC64 > -CFLAGS_$(AMDDALPATH)/dc/dcn20/dcn20_resource.o :=3D -mhard-float -maltiv= ec > -endif > - > -ifdef CONFIG_CC_IS_GCC > -ifeq ($(call cc-ifversion, -lt, 0701, y), y) > -IS_OLD_GCC =3D 1 > -endif > -endif > - > -ifdef CONFIG_X86 > -ifdef IS_OLD_GCC > -# Stack alignment mismatch, proceed with caution. > -# GCC < 7.1 cannot compile code using `double` and -mpreferred-stack-bou= ndary=3D3 > -# (8B stack alignment). > -CFLAGS_$(AMDDALPATH)/dc/dcn20/dcn20_resource.o +=3D -mpreferred-stack-bo= undary=3D4 > -else > -CFLAGS_$(AMDDALPATH)/dc/dcn20/dcn20_resource.o +=3D -msse2 > -endif > -endif > - > AMD_DAL_DCN20 =3D $(addprefix $(AMDDALPATH)/dc/dcn20/,$(DCN20)) >=20 > AMD_DISPLAY_FILES +=3D $(AMD_DAL_DCN20) > diff --git a/drivers/gpu/drm/amd/display/dc/dcn20/dcn20_resource.c b/driv= ers/gpu/drm/amd/display/dc/dcn20/dcn20_resource.c > index dfe2e1c25a26..63c50bee0144 100644 > --- a/drivers/gpu/drm/amd/display/dc/dcn20/dcn20_resource.c > +++ b/drivers/gpu/drm/amd/display/dc/dcn20/dcn20_resource.c > @@ -63,7 +63,6 @@ > #include "dcn20_dccg.h" > #include "dcn20_vmid.h" > #include "dc_link_ddc.h" > -#include "dc_link_dp.h" > #include "dce/dce_panel_cntl.h" >=20 > #include "navi10_ip_offset.h" > @@ -93,367 +92,6 @@ >=20 > #define DC_LOGGER_INIT(logger) >=20 > -struct _vcs_dpi_ip_params_st dcn2_0_ip =3D { > - .odm_capable =3D 1, > - .gpuvm_enable =3D 0, > - .hostvm_enable =3D 0, > - .gpuvm_max_page_table_levels =3D 4, > - .hostvm_max_page_table_levels =3D 4, > - .hostvm_cached_page_table_levels =3D 0, > - .pte_group_size_bytes =3D 2048, > - .num_dsc =3D 6, > - .rob_buffer_size_kbytes =3D 168, > - .det_buffer_size_kbytes =3D 164, > - .dpte_buffer_size_in_pte_reqs_luma =3D 84, > - .pde_proc_buffer_size_64k_reqs =3D 48, > - .dpp_output_buffer_pixels =3D 2560, > - .opp_output_buffer_lines =3D 1, > - .pixel_chunk_size_kbytes =3D 8, > - .pte_chunk_size_kbytes =3D 2, > - .meta_chunk_size_kbytes =3D 2, > - .writeback_chunk_size_kbytes =3D 2, > - .line_buffer_size_bits =3D 789504, > - .is_line_buffer_bpp_fixed =3D 0, > - .line_buffer_fixed_bpp =3D 0, > - .dcc_supported =3D true, > - .max_line_buffer_lines =3D 12, > - .writeback_luma_buffer_size_kbytes =3D 12, > - .writeback_chroma_buffer_size_kbytes =3D 8, > - .writeback_chroma_line_buffer_width_pixels =3D 4, > - .writeback_max_hscl_ratio =3D 1, > - .writeback_max_vscl_ratio =3D 1, > - .writeback_min_hscl_ratio =3D 1, > - .writeback_min_vscl_ratio =3D 1, > - .writeback_max_hscl_taps =3D 12, > - .writeback_max_vscl_taps =3D 12, > - .writeback_line_buffer_luma_buffer_size =3D 0, > - .writeback_line_buffer_chroma_buffer_size =3D 14643, > - .cursor_buffer_size =3D 8, > - .cursor_chunk_size =3D 2, > - .max_num_otg =3D 6, > - .max_num_dpp =3D 6, > - .max_num_wb =3D 1, > - .max_dchub_pscl_bw_pix_per_clk =3D 4, > - .max_pscl_lb_bw_pix_per_clk =3D 2, > - .max_lb_vscl_bw_pix_per_clk =3D 4, > - .max_vscl_hscl_bw_pix_per_clk =3D 4, > - .max_hscl_ratio =3D 8, > - .max_vscl_ratio =3D 8, > - .hscl_mults =3D 4, > - .vscl_mults =3D 4, > - .max_hscl_taps =3D 8, > - .max_vscl_taps =3D 8, > - .dispclk_ramp_margin_percent =3D 1, > - .underscan_factor =3D 1.10, > - .min_vblank_lines =3D 32, // > - .dppclk_delay_subtotal =3D 77, // > - .dppclk_delay_scl_lb_only =3D 16, > - .dppclk_delay_scl =3D 50, > - .dppclk_delay_cnvc_formatter =3D 8, > - .dppclk_delay_cnvc_cursor =3D 6, > - .dispclk_delay_subtotal =3D 87, // > - .dcfclk_cstate_latency =3D 10, // SRExitTime > - .max_inter_dcn_tile_repeaters =3D 8, > - .xfc_supported =3D true, > - .xfc_fill_bw_overhead_percent =3D 10.0, > - .xfc_fill_constant_bytes =3D 0, > - .number_of_cursors =3D 1, > -}; > - > -static struct _vcs_dpi_ip_params_st dcn2_0_nv14_ip =3D { > - .odm_capable =3D 1, > - .gpuvm_enable =3D 0, > - .hostvm_enable =3D 0, > - .gpuvm_max_page_table_levels =3D 4, > - .hostvm_max_page_table_levels =3D 4, > - .hostvm_cached_page_table_levels =3D 0, > - .num_dsc =3D 5, > - .rob_buffer_size_kbytes =3D 168, > - .det_buffer_size_kbytes =3D 164, > - .dpte_buffer_size_in_pte_reqs_luma =3D 84, > - .dpte_buffer_size_in_pte_reqs_chroma =3D 42,//todo > - .dpp_output_buffer_pixels =3D 2560, > - .opp_output_buffer_lines =3D 1, > - .pixel_chunk_size_kbytes =3D 8, > - .pte_enable =3D 1, > - .max_page_table_levels =3D 4, > - .pte_chunk_size_kbytes =3D 2, > - .meta_chunk_size_kbytes =3D 2, > - .writeback_chunk_size_kbytes =3D 2, > - .line_buffer_size_bits =3D 789504, > - .is_line_buffer_bpp_fixed =3D 0, > - .line_buffer_fixed_bpp =3D 0, > - .dcc_supported =3D true, > - .max_line_buffer_lines =3D 12, > - .writeback_luma_buffer_size_kbytes =3D 12, > - .writeback_chroma_buffer_size_kbytes =3D 8, > - .writeback_chroma_line_buffer_width_pixels =3D 4, > - .writeback_max_hscl_ratio =3D 1, > - .writeback_max_vscl_ratio =3D 1, > - .writeback_min_hscl_ratio =3D 1, > - .writeback_min_vscl_ratio =3D 1, > - .writeback_max_hscl_taps =3D 12, > - .writeback_max_vscl_taps =3D 12, > - .writeback_line_buffer_luma_buffer_size =3D 0, > - .writeback_line_buffer_chroma_buffer_size =3D 14643, > - .cursor_buffer_size =3D 8, > - .cursor_chunk_size =3D 2, > - .max_num_otg =3D 5, > - .max_num_dpp =3D 5, > - .max_num_wb =3D 1, > - .max_dchub_pscl_bw_pix_per_clk =3D 4, > - .max_pscl_lb_bw_pix_per_clk =3D 2, > - .max_lb_vscl_bw_pix_per_clk =3D 4, > - .max_vscl_hscl_bw_pix_per_clk =3D 4, > - .max_hscl_ratio =3D 8, > - .max_vscl_ratio =3D 8, > - .hscl_mults =3D 4, > - .vscl_mults =3D 4, > - .max_hscl_taps =3D 8, > - .max_vscl_taps =3D 8, > - .dispclk_ramp_margin_percent =3D 1, > - .underscan_factor =3D 1.10, > - .min_vblank_lines =3D 32, // > - .dppclk_delay_subtotal =3D 77, // > - .dppclk_delay_scl_lb_only =3D 16, > - .dppclk_delay_scl =3D 50, > - .dppclk_delay_cnvc_formatter =3D 8, > - .dppclk_delay_cnvc_cursor =3D 6, > - .dispclk_delay_subtotal =3D 87, // > - .dcfclk_cstate_latency =3D 10, // SRExitTime > - .max_inter_dcn_tile_repeaters =3D 8, > - .xfc_supported =3D true, > - .xfc_fill_bw_overhead_percent =3D 10.0, > - .xfc_fill_constant_bytes =3D 0, > - .ptoi_supported =3D 0, > - .number_of_cursors =3D 1, > -}; > - > -static struct _vcs_dpi_soc_bounding_box_st dcn2_0_soc =3D { > - /* Defaults that get patched on driver load from firmware. */ > - .clock_limits =3D { > - { > - .state =3D 0, > - .dcfclk_mhz =3D 560.0, > - .fabricclk_mhz =3D 560.0, > - .dispclk_mhz =3D 513.0, > - .dppclk_mhz =3D 513.0, > - .phyclk_mhz =3D 540.0, > - .socclk_mhz =3D 560.0, > - .dscclk_mhz =3D 171.0, > - .dram_speed_mts =3D 8960.0, > - }, > - { > - .state =3D 1, > - .dcfclk_mhz =3D 694.0, > - .fabricclk_mhz =3D 694.0, > - .dispclk_mhz =3D 642.0, > - .dppclk_mhz =3D 642.0, > - .phyclk_mhz =3D 600.0, > - .socclk_mhz =3D 694.0, > - .dscclk_mhz =3D 214.0, > - .dram_speed_mts =3D 11104.0, > - }, > - { > - .state =3D 2, > - .dcfclk_mhz =3D 875.0, > - .fabricclk_mhz =3D 875.0, > - .dispclk_mhz =3D 734.0, > - .dppclk_mhz =3D 734.0, > - .phyclk_mhz =3D 810.0, > - .socclk_mhz =3D 875.0, > - .dscclk_mhz =3D 245.0, > - .dram_speed_mts =3D 14000.0, > - }, > - { > - .state =3D 3, > - .dcfclk_mhz =3D 1000.0, > - .fabricclk_mhz =3D 1000.0, > - .dispclk_mhz =3D 1100.0, > - .dppclk_mhz =3D 1100.0, > - .phyclk_mhz =3D 810.0, > - .socclk_mhz =3D 1000.0, > - .dscclk_mhz =3D 367.0, > - .dram_speed_mts =3D 16000.0, > - }, > - { > - .state =3D 4, > - .dcfclk_mhz =3D 1200.0, > - .fabricclk_mhz =3D 1200.0, > - .dispclk_mhz =3D 1284.0, > - .dppclk_mhz =3D 1284.0, > - .phyclk_mhz =3D 810.0, > - .socclk_mhz =3D 1200.0, > - .dscclk_mhz =3D 428.0, > - .dram_speed_mts =3D 16000.0, > - }, > - /*Extra state, no dispclk ramping*/ > - { > - .state =3D 5, > - .dcfclk_mhz =3D 1200.0, > - .fabricclk_mhz =3D 1200.0, > - .dispclk_mhz =3D 1284.0, > - .dppclk_mhz =3D 1284.0, > - .phyclk_mhz =3D 810.0, > - .socclk_mhz =3D 1200.0, > - .dscclk_mhz =3D 428.0, > - .dram_speed_mts =3D 16000.0, > - }, > - }, > - .num_states =3D 5, > - .sr_exit_time_us =3D 8.6, > - .sr_enter_plus_exit_time_us =3D 10.9, > - .urgent_latency_us =3D 4.0, > - .urgent_latency_pixel_data_only_us =3D 4.0, > - .urgent_latency_pixel_mixed_with_vm_data_us =3D 4.0, > - .urgent_latency_vm_data_only_us =3D 4.0, > - .urgent_out_of_order_return_per_channel_pixel_only_bytes =3D 4096, > - .urgent_out_of_order_return_per_channel_pixel_and_vm_bytes =3D 40= 96, > - .urgent_out_of_order_return_per_channel_vm_only_bytes =3D 4096, > - .pct_ideal_dram_sdp_bw_after_urgent_pixel_only =3D 40.0, > - .pct_ideal_dram_sdp_bw_after_urgent_pixel_and_vm =3D 40.0, > - .pct_ideal_dram_sdp_bw_after_urgent_vm_only =3D 40.0, > - .max_avg_sdp_bw_use_normal_percent =3D 40.0, > - .max_avg_dram_bw_use_normal_percent =3D 40.0, > - .writeback_latency_us =3D 12.0, > - .ideal_dram_bw_after_urgent_percent =3D 40.0, > - .max_request_size_bytes =3D 256, > - .dram_channel_width_bytes =3D 2, > - .fabric_datapath_to_dcn_data_return_bytes =3D 64, > - .dcn_downspread_percent =3D 0.5, > - .downspread_percent =3D 0.38, > - .dram_page_open_time_ns =3D 50.0, > - .dram_rw_turnaround_time_ns =3D 17.5, > - .dram_return_buffer_per_channel_bytes =3D 8192, > - .round_trip_ping_latency_dcfclk_cycles =3D 131, > - .urgent_out_of_order_return_per_channel_bytes =3D 256, > - .channel_interleave_bytes =3D 256, > - .num_banks =3D 8, > - .num_chans =3D 16, > - .vmm_page_size_bytes =3D 4096, > - .dram_clock_change_latency_us =3D 404.0, > - .dummy_pstate_latency_us =3D 5.0, > - .writeback_dram_clock_change_latency_us =3D 23.0, > - .return_bus_width_bytes =3D 64, > - .dispclk_dppclk_vco_speed_mhz =3D 3850, > - .xfc_bus_transport_time_us =3D 20, > - .xfc_xbuf_latency_tolerance_us =3D 4, > - .use_urgent_burst_bw =3D 0 > -}; > - > -static struct _vcs_dpi_soc_bounding_box_st dcn2_0_nv14_soc =3D { > - .clock_limits =3D { > - { > - .state =3D 0, > - .dcfclk_mhz =3D 560.0, > - .fabricclk_mhz =3D 560.0, > - .dispclk_mhz =3D 513.0, > - .dppclk_mhz =3D 513.0, > - .phyclk_mhz =3D 540.0, > - .socclk_mhz =3D 560.0, > - .dscclk_mhz =3D 171.0, > - .dram_speed_mts =3D 8960.0, > - }, > - { > - .state =3D 1, > - .dcfclk_mhz =3D 694.0, > - .fabricclk_mhz =3D 694.0, > - .dispclk_mhz =3D 642.0, > - .dppclk_mhz =3D 642.0, > - .phyclk_mhz =3D 600.0, > - .socclk_mhz =3D 694.0, > - .dscclk_mhz =3D 214.0, > - .dram_speed_mts =3D 11104.0, > - }, > - { > - .state =3D 2, > - .dcfclk_mhz =3D 875.0, > - .fabricclk_mhz =3D 875.0, > - .dispclk_mhz =3D 734.0, > - .dppclk_mhz =3D 734.0, > - .phyclk_mhz =3D 810.0, > - .socclk_mhz =3D 875.0, > - .dscclk_mhz =3D 245.0, > - .dram_speed_mts =3D 14000.0, > - }, > - { > - .state =3D 3, > - .dcfclk_mhz =3D 1000.0, > - .fabricclk_mhz =3D 1000.0, > - .dispclk_mhz =3D 1100.0, > - .dppclk_mhz =3D 1100.0, > - .phyclk_mhz =3D 810.0, > - .socclk_mhz =3D 1000.0, > - .dscclk_mhz =3D 367.0, > - .dram_speed_mts =3D 16000.0, > - }, > - { > - .state =3D 4, > - .dcfclk_mhz =3D 1200.0, > - .fabricclk_mhz =3D 1200.0, > - .dispclk_mhz =3D 1284.0, > - .dppclk_mhz =3D 1284.0, > - .phyclk_mhz =3D 810.0, > - .socclk_mhz =3D 1200.0, > - .dscclk_mhz =3D 428.0, > - .dram_speed_mts =3D 16000.0, > - }, > - /*Extra state, no dispclk ramping*/ > - { > - .state =3D 5, > - .dcfclk_mhz =3D 1200.0, > - .fabricclk_mhz =3D 1200.0, > - .dispclk_mhz =3D 1284.0, > - .dppclk_mhz =3D 1284.0, > - .phyclk_mhz =3D 810.0, > - .socclk_mhz =3D 1200.0, > - .dscclk_mhz =3D 428.0, > - .dram_speed_mts =3D 16000.0, > - }, > - }, > - .num_states =3D 5, > - .sr_exit_time_us =3D 11.6, > - .sr_enter_plus_exit_time_us =3D 13.9, > - .urgent_latency_us =3D 4.0, > - .urgent_latency_pixel_data_only_us =3D 4.0, > - .urgent_latency_pixel_mixed_with_vm_data_us =3D 4.0, > - .urgent_latency_vm_data_only_us =3D 4.0, > - .urgent_out_of_order_return_per_channel_pixel_only_bytes =3D 4096, > - .urgent_out_of_order_return_per_channel_pixel_and_vm_bytes =3D 40= 96, > - .urgent_out_of_order_return_per_channel_vm_only_bytes =3D 4096, > - .pct_ideal_dram_sdp_bw_after_urgent_pixel_only =3D 40.0, > - .pct_ideal_dram_sdp_bw_after_urgent_pixel_and_vm =3D 40.0, > - .pct_ideal_dram_sdp_bw_after_urgent_vm_only =3D 40.0, > - .max_avg_sdp_bw_use_normal_percent =3D 40.0, > - .max_avg_dram_bw_use_normal_percent =3D 40.0, > - .writeback_latency_us =3D 12.0, > - .ideal_dram_bw_after_urgent_percent =3D 40.0, > - .max_request_size_bytes =3D 256, > - .dram_channel_width_bytes =3D 2, > - .fabric_datapath_to_dcn_data_return_bytes =3D 64, > - .dcn_downspread_percent =3D 0.5, > - .downspread_percent =3D 0.38, > - .dram_page_open_time_ns =3D 50.0, > - .dram_rw_turnaround_time_ns =3D 17.5, > - .dram_return_buffer_per_channel_bytes =3D 8192, > - .round_trip_ping_latency_dcfclk_cycles =3D 131, > - .urgent_out_of_order_return_per_channel_bytes =3D 256, > - .channel_interleave_bytes =3D 256, > - .num_banks =3D 8, > - .num_chans =3D 8, > - .vmm_page_size_bytes =3D 4096, > - .dram_clock_change_latency_us =3D 404.0, > - .dummy_pstate_latency_us =3D 5.0, > - .writeback_dram_clock_change_latency_us =3D 23.0, > - .return_bus_width_bytes =3D 64, > - .dispclk_dppclk_vco_speed_mhz =3D 3850, > - .xfc_bus_transport_time_us =3D 20, > - .xfc_xbuf_latency_tolerance_us =3D 4, > - .use_urgent_burst_bw =3D 0 > -}; > - > -static struct _vcs_dpi_soc_bounding_box_st dcn2_0_nv12_soc =3D { 0 }; > - > #ifndef mmDP0_DP_DPHY_INTERNAL_CTRL > #define mmDP0_DP_DPHY_INTERNAL_CTRL 0x210f > #define mmDP0_DP_DPHY_INTERNAL_CTRL_BASE_IDX 2 > @@ -1810,69 +1448,6 @@ enum dc_status dcn20_remove_stream_from_ctx(struct= dc *dc, struct dc_state *new_ > return result; > } >=20 > - > -static void swizzle_to_dml_params( > - enum swizzle_mode_values swizzle, > - unsigned int *sw_mode) > -{ > - switch (swizzle) { > - case DC_SW_LINEAR: > - *sw_mode =3D dm_sw_linear; > - break; > - case DC_SW_4KB_S: > - *sw_mode =3D dm_sw_4kb_s; > - break; > - case DC_SW_4KB_S_X: > - *sw_mode =3D dm_sw_4kb_s_x; > - break; > - case DC_SW_4KB_D: > - *sw_mode =3D dm_sw_4kb_d; > - break; > - case DC_SW_4KB_D_X: > - *sw_mode =3D dm_sw_4kb_d_x; > - break; > - case DC_SW_64KB_S: > - *sw_mode =3D dm_sw_64kb_s; > - break; > - case DC_SW_64KB_S_X: > - *sw_mode =3D dm_sw_64kb_s_x; > - break; > - case DC_SW_64KB_S_T: > - *sw_mode =3D dm_sw_64kb_s_t; > - break; > - case DC_SW_64KB_D: > - *sw_mode =3D dm_sw_64kb_d; > - break; > - case DC_SW_64KB_D_X: > - *sw_mode =3D dm_sw_64kb_d_x; > - break; > - case DC_SW_64KB_D_T: > - *sw_mode =3D dm_sw_64kb_d_t; > - break; > - case DC_SW_64KB_R_X: > - *sw_mode =3D dm_sw_64kb_r_x; > - break; > - case DC_SW_VAR_S: > - *sw_mode =3D dm_sw_var_s; > - break; > - case DC_SW_VAR_S_X: > - *sw_mode =3D dm_sw_var_s_x; > - break; > - case DC_SW_VAR_D: > - *sw_mode =3D dm_sw_var_d; > - break; > - case DC_SW_VAR_D_X: > - *sw_mode =3D dm_sw_var_d_x; > - break; > - case DC_SW_VAR_R_X: > - *sw_mode =3D dm_sw_var_r_x; > - break; > - default: > - ASSERT(0); /* Not supported */ > - break; > - } > -} > - > bool dcn20_split_stream_for_odm( > const struct dc *dc, > struct resource_context *res_ctx, > @@ -1988,394 +1563,6 @@ void dcn20_split_stream_for_mpc( > ASSERT(primary_pipe->plane_state); > } >=20 > -int dcn20_populate_dml_pipes_from_context( > - struct dc *dc, > - struct dc_state *context, > - display_e2e_pipe_params_st *pipes, > - bool fast_validate) > -{ > - int pipe_cnt, i; > - bool synchronized_vblank =3D true; > - struct resource_context *res_ctx =3D &context->res_ctx; > - > - for (i =3D 0, pipe_cnt =3D -1; i < dc->res_pool->pipe_count; i++)= { > - if (!res_ctx->pipe_ctx[i].stream) > - continue; > - > - if (pipe_cnt < 0) { > - pipe_cnt =3D i; > - continue; > - } > - > - if (res_ctx->pipe_ctx[pipe_cnt].stream =3D=3D res_ctx->pi= pe_ctx[i].stream) > - continue; > - > - if (dc->debug.disable_timing_sync || > - (!resource_are_streams_timing_synchronizable( > - res_ctx->pipe_ctx[pipe_cnt].stream, > - res_ctx->pipe_ctx[i].stream) && > - !resource_are_vblanks_synchronizable( > - res_ctx->pipe_ctx[pipe_cnt].stream, > - res_ctx->pipe_ctx[i].stream))) { > - synchronized_vblank =3D false; > - break; > - } > - } > - > - for (i =3D 0, pipe_cnt =3D 0; i < dc->res_pool->pipe_count; i++) { > - struct dc_crtc_timing *timing =3D &res_ctx->pipe_ctx[i].s= tream->timing; > - unsigned int v_total; > - unsigned int front_porch; > - int output_bpc; > - struct audio_check aud_check =3D {0}; > - > - if (!res_ctx->pipe_ctx[i].stream) > - continue; > - > - v_total =3D timing->v_total; > - front_porch =3D timing->v_front_porch; > - > - /* todo: > - pipes[pipe_cnt].pipe.src.dynamic_metadata_enable =3D 0; > - pipes[pipe_cnt].pipe.src.dcc =3D 0; > - pipes[pipe_cnt].pipe.src.vm =3D 0;*/ > - > - pipes[pipe_cnt].clks_cfg.refclk_mhz =3D dc->res_pool->ref= _clocks.dchub_ref_clock_inKhz / 1000.0; > - > - pipes[pipe_cnt].dout.dsc_enable =3D res_ctx->pipe_ctx[i].= stream->timing.flags.DSC; > - /* todo: rotation?*/ > - pipes[pipe_cnt].dout.dsc_slices =3D res_ctx->pipe_ctx[i].= stream->timing.dsc_cfg.num_slices_h; > - if (res_ctx->pipe_ctx[i].stream->use_dynamic_meta) { > - pipes[pipe_cnt].pipe.src.dynamic_metadata_enable = =3D true; > - /* 1/2 vblank */ > - pipes[pipe_cnt].pipe.src.dynamic_metadata_lines_b= efore_active =3D > - (v_total - timing->v_addressable > - - timing->v_border_top - timing->= v_border_bottom) / 2; > - /* 36 bytes dp, 32 hdmi */ > - pipes[pipe_cnt].pipe.src.dynamic_metadata_xmit_by= tes =3D > - dc_is_dp_signal(res_ctx->pipe_ctx[i].stre= am->signal) ? 36 : 32; > - } > - pipes[pipe_cnt].pipe.src.dcc =3D false; > - pipes[pipe_cnt].pipe.src.dcc_rate =3D 1; > - pipes[pipe_cnt].pipe.dest.synchronized_vblank_all_planes = =3D synchronized_vblank; > - pipes[pipe_cnt].pipe.dest.hblank_start =3D timing->h_tota= l - timing->h_front_porch; > - pipes[pipe_cnt].pipe.dest.hblank_end =3D pipes[pipe_cnt].= pipe.dest.hblank_start > - - timing->h_addressable > - - timing->h_border_left > - - timing->h_border_right; > - pipes[pipe_cnt].pipe.dest.vblank_start =3D v_total - fron= t_porch; > - pipes[pipe_cnt].pipe.dest.vblank_end =3D pipes[pipe_cnt].= pipe.dest.vblank_start > - - timing->v_addressable > - - timing->v_border_top > - - timing->v_border_bottom; > - pipes[pipe_cnt].pipe.dest.htotal =3D timing->h_total; > - pipes[pipe_cnt].pipe.dest.vtotal =3D v_total; > - pipes[pipe_cnt].pipe.dest.hactive =3D > - timing->h_addressable + timing->h_border_left + t= iming->h_border_right; > - pipes[pipe_cnt].pipe.dest.vactive =3D > - timing->v_addressable + timing->v_border_top + ti= ming->v_border_bottom; > - pipes[pipe_cnt].pipe.dest.interlaced =3D timing->flags.IN= TERLACE; > - pipes[pipe_cnt].pipe.dest.pixel_rate_mhz =3D timing->pix_= clk_100hz/10000.0; > - if (timing->timing_3d_format =3D=3D TIMING_3D_FORMAT_HW_F= RAME_PACKING) > - pipes[pipe_cnt].pipe.dest.pixel_rate_mhz *=3D 2; > - pipes[pipe_cnt].pipe.dest.otg_inst =3D res_ctx->pipe_ctx[= i].stream_res.tg->inst; > - pipes[pipe_cnt].dout.dp_lanes =3D 4; > - pipes[pipe_cnt].dout.is_virtual =3D 0; > - pipes[pipe_cnt].pipe.dest.vtotal_min =3D res_ctx->pipe_ct= x[i].stream->adjust.v_total_min; > - pipes[pipe_cnt].pipe.dest.vtotal_max =3D res_ctx->pipe_ct= x[i].stream->adjust.v_total_max; > - switch (get_num_odm_splits(&res_ctx->pipe_ctx[i])) { > - case 1: > - pipes[pipe_cnt].pipe.dest.odm_combine =3D dm_odm_= combine_mode_2to1; > - break; > - case 3: > - pipes[pipe_cnt].pipe.dest.odm_combine =3D dm_odm_= combine_mode_4to1; > - break; > - default: > - pipes[pipe_cnt].pipe.dest.odm_combine =3D dm_odm_= combine_mode_disabled; > - } > - pipes[pipe_cnt].pipe.src.hsplit_grp =3D res_ctx->pipe_ctx= [i].pipe_idx; > - if (res_ctx->pipe_ctx[i].top_pipe && res_ctx->pipe_ctx[i]= =2Etop_pipe->plane_state > - =3D=3D res_ctx->pipe_ctx[i].plane_state) { > - struct pipe_ctx *first_pipe =3D res_ctx->pipe_ctx= [i].top_pipe; > - int split_idx =3D 0; > - > - while (first_pipe->top_pipe && first_pipe->top_pi= pe->plane_state > - =3D=3D res_ctx->pipe_ctx[i].plane= _state) { > - first_pipe =3D first_pipe->top_pipe; > - split_idx++; > - } > - /* Treat 4to1 mpc combine as an mpo of 2 2-to-1 c= ombines */ > - if (split_idx =3D=3D 0) > - pipes[pipe_cnt].pipe.src.hsplit_grp =3D f= irst_pipe->pipe_idx; > - else if (split_idx =3D=3D 1) > - pipes[pipe_cnt].pipe.src.hsplit_grp =3D r= es_ctx->pipe_ctx[i].pipe_idx; > - else if (split_idx =3D=3D 2) > - pipes[pipe_cnt].pipe.src.hsplit_grp =3D r= es_ctx->pipe_ctx[i].top_pipe->pipe_idx; > - } else if (res_ctx->pipe_ctx[i].prev_odm_pipe) { > - struct pipe_ctx *first_pipe =3D res_ctx->pipe_ctx= [i].prev_odm_pipe; > - > - while (first_pipe->prev_odm_pipe) > - first_pipe =3D first_pipe->prev_odm_pipe; > - pipes[pipe_cnt].pipe.src.hsplit_grp =3D first_pip= e->pipe_idx; > - } > - > - switch (res_ctx->pipe_ctx[i].stream->signal) { > - case SIGNAL_TYPE_DISPLAY_PORT_MST: > - case SIGNAL_TYPE_DISPLAY_PORT: > - pipes[pipe_cnt].dout.output_type =3D dm_dp; > - break; > - case SIGNAL_TYPE_EDP: > - pipes[pipe_cnt].dout.output_type =3D dm_edp; > - break; > - case SIGNAL_TYPE_HDMI_TYPE_A: > - case SIGNAL_TYPE_DVI_SINGLE_LINK: > - case SIGNAL_TYPE_DVI_DUAL_LINK: > - pipes[pipe_cnt].dout.output_type =3D dm_hdmi; > - break; > - default: > - /* In case there is no signal, set dp with 4 lane= s to allow max config */ > - pipes[pipe_cnt].dout.is_virtual =3D 1; > - pipes[pipe_cnt].dout.output_type =3D dm_dp; > - pipes[pipe_cnt].dout.dp_lanes =3D 4; > - } > - > - switch (res_ctx->pipe_ctx[i].stream->timing.display_color= _depth) { > - case COLOR_DEPTH_666: > - output_bpc =3D 6; > - break; > - case COLOR_DEPTH_888: > - output_bpc =3D 8; > - break; > - case COLOR_DEPTH_101010: > - output_bpc =3D 10; > - break; > - case COLOR_DEPTH_121212: > - output_bpc =3D 12; > - break; > - case COLOR_DEPTH_141414: > - output_bpc =3D 14; > - break; > - case COLOR_DEPTH_161616: > - output_bpc =3D 16; > - break; > - case COLOR_DEPTH_999: > - output_bpc =3D 9; > - break; > - case COLOR_DEPTH_111111: > - output_bpc =3D 11; > - break; > - default: > - output_bpc =3D 8; > - break; > - } > - > - switch (res_ctx->pipe_ctx[i].stream->timing.pixel_encodin= g) { > - case PIXEL_ENCODING_RGB: > - case PIXEL_ENCODING_YCBCR444: > - pipes[pipe_cnt].dout.output_format =3D dm_444; > - pipes[pipe_cnt].dout.output_bpp =3D output_bpc * = 3; > - break; > - case PIXEL_ENCODING_YCBCR420: > - pipes[pipe_cnt].dout.output_format =3D dm_420; > - pipes[pipe_cnt].dout.output_bpp =3D (output_bpc *= 3.0) / 2; > - break; > - case PIXEL_ENCODING_YCBCR422: > - if (res_ctx->pipe_ctx[i].stream->timing.flags.DSC= && > - !res_ctx->pipe_ctx[i].stream->timing.dsc_cfg.= ycbcr422_simple) > - pipes[pipe_cnt].dout.output_format =3D dm= _n422; > - else > - pipes[pipe_cnt].dout.output_format =3D dm= _s422; > - pipes[pipe_cnt].dout.output_bpp =3D output_bpc * = 2; > - break; > - default: > - pipes[pipe_cnt].dout.output_format =3D dm_444; > - pipes[pipe_cnt].dout.output_bpp =3D output_bpc * = 3; > - } > - > - if (res_ctx->pipe_ctx[i].stream->timing.flags.DSC) > - pipes[pipe_cnt].dout.output_bpp =3D res_ctx->pipe= _ctx[i].stream->timing.dsc_cfg.bits_per_pixel / 16.0; > - > - /* todo: default max for now, until there is logic reflec= ting this in dc*/ > - pipes[pipe_cnt].dout.dsc_input_bpc =3D 12; > - /*fill up the audio sample rate (unit in kHz)*/ > - get_audio_check(&res_ctx->pipe_ctx[i].stream->audio_info,= &aud_check); > - pipes[pipe_cnt].dout.max_audio_sample_rate =3D aud_check.= max_audiosample_rate / 1000; > - /* > - * For graphic plane, cursor number is 1, nv12 is 0 > - * bw calculations due to cursor on/off > - */ > - if (res_ctx->pipe_ctx[i].plane_state && > - res_ctx->pipe_ctx[i].plane_state->address= =2Etype =3D=3D PLN_ADDR_TYPE_VIDEO_PROGRESSIVE) > - pipes[pipe_cnt].pipe.src.num_cursors =3D 0; > - else > - pipes[pipe_cnt].pipe.src.num_cursors =3D dc->dml.= ip.number_of_cursors; > - > - pipes[pipe_cnt].pipe.src.cur0_src_width =3D 256; > - pipes[pipe_cnt].pipe.src.cur0_bpp =3D dm_cur_32bit; > - > - if (!res_ctx->pipe_ctx[i].plane_state) { > - pipes[pipe_cnt].pipe.src.is_hsplit =3D pipes[pipe= _cnt].pipe.dest.odm_combine !=3D dm_odm_combine_mode_disabled; > - pipes[pipe_cnt].pipe.src.source_scan =3D dm_horz; > - pipes[pipe_cnt].pipe.src.sw_mode =3D dm_sw_4kb_s; > - pipes[pipe_cnt].pipe.src.macro_tile_size =3D dm_6= 4k_tile; > - pipes[pipe_cnt].pipe.src.viewport_width =3D timin= g->h_addressable; > - if (pipes[pipe_cnt].pipe.src.viewport_width > 192= 0) > - pipes[pipe_cnt].pipe.src.viewport_width = =3D 1920; > - pipes[pipe_cnt].pipe.src.viewport_height =3D timi= ng->v_addressable; > - if (pipes[pipe_cnt].pipe.src.viewport_height > 10= 80) > - pipes[pipe_cnt].pipe.src.viewport_height = =3D 1080; > - pipes[pipe_cnt].pipe.src.surface_height_y =3D pip= es[pipe_cnt].pipe.src.viewport_height; > - pipes[pipe_cnt].pipe.src.surface_width_y =3D pipe= s[pipe_cnt].pipe.src.viewport_width; > - pipes[pipe_cnt].pipe.src.surface_height_c =3D pip= es[pipe_cnt].pipe.src.viewport_height; > - pipes[pipe_cnt].pipe.src.surface_width_c =3D pipe= s[pipe_cnt].pipe.src.viewport_width; > - pipes[pipe_cnt].pipe.src.data_pitch =3D ((pipes[p= ipe_cnt].pipe.src.viewport_width + 255) / 256) * 256; > - pipes[pipe_cnt].pipe.src.source_format =3D dm_444= _32; > - pipes[pipe_cnt].pipe.dest.recout_width =3D pipes[= pipe_cnt].pipe.src.viewport_width; /*vp_width/hratio*/ > - pipes[pipe_cnt].pipe.dest.recout_height =3D pipes= [pipe_cnt].pipe.src.viewport_height; /*vp_height/vratio*/ > - pipes[pipe_cnt].pipe.dest.full_recout_width =3D p= ipes[pipe_cnt].pipe.dest.recout_width; /*when is_hsplit !=3D 1*/ > - pipes[pipe_cnt].pipe.dest.full_recout_height =3D = pipes[pipe_cnt].pipe.dest.recout_height; /*when is_hsplit !=3D 1*/ > - pipes[pipe_cnt].pipe.scale_ratio_depth.lb_depth = =3D dm_lb_16; > - pipes[pipe_cnt].pipe.scale_ratio_depth.hscl_ratio= =3D 1.0; > - pipes[pipe_cnt].pipe.scale_ratio_depth.vscl_ratio= =3D 1.0; > - pipes[pipe_cnt].pipe.scale_ratio_depth.scl_enable= =3D 0; /*Lb only or Full scl*/ > - pipes[pipe_cnt].pipe.scale_taps.htaps =3D 1; > - pipes[pipe_cnt].pipe.scale_taps.vtaps =3D 1; > - pipes[pipe_cnt].pipe.dest.vtotal_min =3D v_total; > - pipes[pipe_cnt].pipe.dest.vtotal_max =3D v_total; > - > - if (pipes[pipe_cnt].pipe.dest.odm_combine =3D=3D = dm_odm_combine_mode_2to1) { > - pipes[pipe_cnt].pipe.src.viewport_width /= =3D 2; > - pipes[pipe_cnt].pipe.dest.recout_width /= =3D 2; > - } else if (pipes[pipe_cnt].pipe.dest.odm_combine = =3D=3D dm_odm_combine_mode_4to1) { > - pipes[pipe_cnt].pipe.src.viewport_width /= =3D 4; > - pipes[pipe_cnt].pipe.dest.recout_width /= =3D 4; > - } > - } else { > - struct dc_plane_state *pln =3D res_ctx->pipe_ctx[= i].plane_state; > - struct scaler_data *scl =3D &res_ctx->pipe_ctx[i]= =2Eplane_res.scl_data; > - > - pipes[pipe_cnt].pipe.src.immediate_flip =3D pln->= flip_immediate; > - pipes[pipe_cnt].pipe.src.is_hsplit =3D (res_ctx->= pipe_ctx[i].bottom_pipe && res_ctx->pipe_ctx[i].bottom_pipe->plane_state = =3D=3D pln) > - || (res_ctx->pipe_ctx[i].top_pipe= && res_ctx->pipe_ctx[i].top_pipe->plane_state =3D=3D pln) > - || pipes[pipe_cnt].pipe.dest.odm_= combine !=3D dm_odm_combine_mode_disabled; > - > - /* stereo is not split */ > - if (pln->stereo_format =3D=3D PLANE_STEREO_FORMAT= _SIDE_BY_SIDE || > - pln->stereo_format =3D=3D PLANE_STEREO_FORMAT= _TOP_AND_BOTTOM) { > - pipes[pipe_cnt].pipe.src.is_hsplit =3D fa= lse; > - pipes[pipe_cnt].pipe.src.hsplit_grp =3D r= es_ctx->pipe_ctx[i].pipe_idx; > - } > - > - pipes[pipe_cnt].pipe.src.source_scan =3D pln->rot= ation =3D=3D ROTATION_ANGLE_90 > - || pln->rotation =3D=3D ROTATION_= ANGLE_270 ? dm_vert : dm_horz; > - pipes[pipe_cnt].pipe.src.viewport_y_y =3D scl->vi= ewport.y; > - pipes[pipe_cnt].pipe.src.viewport_y_c =3D scl->vi= ewport_c.y; > - pipes[pipe_cnt].pipe.src.viewport_width =3D scl->= viewport.width; > - pipes[pipe_cnt].pipe.src.viewport_width_c =3D scl= ->viewport_c.width; > - pipes[pipe_cnt].pipe.src.viewport_height =3D scl-= >viewport.height; > - pipes[pipe_cnt].pipe.src.viewport_height_c =3D sc= l->viewport_c.height; > - pipes[pipe_cnt].pipe.src.viewport_width_max =3D p= ln->src_rect.width; > - pipes[pipe_cnt].pipe.src.viewport_height_max =3D = pln->src_rect.height; > - pipes[pipe_cnt].pipe.src.surface_width_y =3D pln-= >plane_size.surface_size.width; > - pipes[pipe_cnt].pipe.src.surface_height_y =3D pln= ->plane_size.surface_size.height; > - pipes[pipe_cnt].pipe.src.surface_width_c =3D pln-= >plane_size.chroma_size.width; > - pipes[pipe_cnt].pipe.src.surface_height_c =3D pln= ->plane_size.chroma_size.height; > - if (pln->format =3D=3D SURFACE_PIXEL_FORMAT_GRPH_= RGBE_ALPHA > - || pln->format >=3D SURFACE_PIXEL= _FORMAT_VIDEO_BEGIN) { > - pipes[pipe_cnt].pipe.src.data_pitch =3D p= ln->plane_size.surface_pitch; > - pipes[pipe_cnt].pipe.src.data_pitch_c =3D= pln->plane_size.chroma_pitch; > - pipes[pipe_cnt].pipe.src.meta_pitch =3D p= ln->dcc.meta_pitch; > - pipes[pipe_cnt].pipe.src.meta_pitch_c =3D= pln->dcc.meta_pitch_c; > - } else { > - pipes[pipe_cnt].pipe.src.data_pitch =3D p= ln->plane_size.surface_pitch; > - pipes[pipe_cnt].pipe.src.meta_pitch =3D p= ln->dcc.meta_pitch; > - } > - pipes[pipe_cnt].pipe.src.dcc =3D pln->dcc.enable; > - pipes[pipe_cnt].pipe.dest.recout_width =3D scl->r= ecout.width; > - pipes[pipe_cnt].pipe.dest.recout_height =3D scl->= recout.height; > - pipes[pipe_cnt].pipe.dest.full_recout_height =3D = scl->recout.height; > - pipes[pipe_cnt].pipe.dest.full_recout_width =3D s= cl->recout.width; > - if (pipes[pipe_cnt].pipe.dest.odm_combine =3D=3D = dm_odm_combine_mode_2to1) > - pipes[pipe_cnt].pipe.dest.full_recout_wid= th *=3D 2; > - else if (pipes[pipe_cnt].pipe.dest.odm_combine = =3D=3D dm_odm_combine_mode_4to1) > - pipes[pipe_cnt].pipe.dest.full_recout_wid= th *=3D 4; > - else { > - struct pipe_ctx *split_pipe =3D res_ctx->= pipe_ctx[i].bottom_pipe; > - > - while (split_pipe && split_pipe->plane_st= ate =3D=3D pln) { > - pipes[pipe_cnt].pipe.dest.full_re= cout_width +=3D split_pipe->plane_res.scl_data.recout.width; > - split_pipe =3D split_pipe->bottom= _pipe; > - } > - split_pipe =3D res_ctx->pipe_ctx[i].top_p= ipe; > - while (split_pipe && split_pipe->plane_st= ate =3D=3D pln) { > - pipes[pipe_cnt].pipe.dest.full_re= cout_width +=3D split_pipe->plane_res.scl_data.recout.width; > - split_pipe =3D split_pipe->top_pi= pe; > - } > - } > - > - pipes[pipe_cnt].pipe.scale_ratio_depth.lb_depth = =3D dm_lb_16; > - pipes[pipe_cnt].pipe.scale_ratio_depth.hscl_ratio= =3D (double) scl->ratios.horz.value / (1ULL<<32); > - pipes[pipe_cnt].pipe.scale_ratio_depth.hscl_ratio= _c =3D (double) scl->ratios.horz_c.value / (1ULL<<32); > - pipes[pipe_cnt].pipe.scale_ratio_depth.vscl_ratio= =3D (double) scl->ratios.vert.value / (1ULL<<32); > - pipes[pipe_cnt].pipe.scale_ratio_depth.vscl_ratio= _c =3D (double) scl->ratios.vert_c.value / (1ULL<<32); > - pipes[pipe_cnt].pipe.scale_ratio_depth.scl_enable= =3D > - scl->ratios.vert.value !=3D dc_fi= xpt_one.value > - || scl->ratios.horz.value !=3D dc= _fixpt_one.value > - || scl->ratios.vert_c.value !=3D = dc_fixpt_one.value > - || scl->ratios.horz_c.value !=3D = dc_fixpt_one.value /*Lb only or Full scl*/ > - || dc->debug.always_scale; /*supp= ort always scale*/ > - pipes[pipe_cnt].pipe.scale_taps.htaps =3D scl->ta= ps.h_taps; > - pipes[pipe_cnt].pipe.scale_taps.htaps_c =3D scl->= taps.h_taps_c; > - pipes[pipe_cnt].pipe.scale_taps.vtaps =3D scl->ta= ps.v_taps; > - pipes[pipe_cnt].pipe.scale_taps.vtaps_c =3D scl->= taps.v_taps_c; > - > - pipes[pipe_cnt].pipe.src.macro_tile_size =3D > - swizzle_mode_to_macro_tile_size(p= ln->tiling_info.gfx9.swizzle); > - swizzle_to_dml_params(pln->tiling_info.gfx9.swizz= le, > - &pipes[pipe_cnt].pipe.src.sw_mode= ); > - > - switch (pln->format) { > - case SURFACE_PIXEL_FORMAT_VIDEO_420_YCbCr: > - case SURFACE_PIXEL_FORMAT_VIDEO_420_YCrCb: > - pipes[pipe_cnt].pipe.src.source_format = =3D dm_420_8; > - break; > - case SURFACE_PIXEL_FORMAT_VIDEO_420_10bpc_YCbCr: > - case SURFACE_PIXEL_FORMAT_VIDEO_420_10bpc_YCrCb: > - pipes[pipe_cnt].pipe.src.source_format = =3D dm_420_10; > - break; > - case SURFACE_PIXEL_FORMAT_GRPH_ARGB16161616: > - case SURFACE_PIXEL_FORMAT_GRPH_ABGR16161616: > - case SURFACE_PIXEL_FORMAT_GRPH_ARGB16161616F: > - case SURFACE_PIXEL_FORMAT_GRPH_ABGR16161616F: > - pipes[pipe_cnt].pipe.src.source_format = =3D dm_444_64; > - break; > - case SURFACE_PIXEL_FORMAT_GRPH_ARGB1555: > - case SURFACE_PIXEL_FORMAT_GRPH_RGB565: > - pipes[pipe_cnt].pipe.src.source_format = =3D dm_444_16; > - break; > - case SURFACE_PIXEL_FORMAT_GRPH_PALETA_256_COLORS: > - pipes[pipe_cnt].pipe.src.source_format = =3D dm_444_8; > - break; > - case SURFACE_PIXEL_FORMAT_GRPH_RGBE_ALPHA: > - pipes[pipe_cnt].pipe.src.source_format = =3D dm_rgbe_alpha; > - break; > - default: > - pipes[pipe_cnt].pipe.src.source_format = =3D dm_444_32; > - break; > - } > - } > - > - pipe_cnt++; > - } > - > - /* populate writeback information */ > - DC_FP_START(); > - dc->res_pool->funcs->populate_dml_writeback_from_context(dc, res_= ctx, pipes); > - DC_FP_END(); > - > - return pipe_cnt; > -} > - > unsigned int dcn20_calc_max_scaled_time( > unsigned int time_per_pixel, > enum mmhubbub_wbif_mode mode, > @@ -2413,7 +1600,7 @@ void dcn20_set_mcif_arb_params( > { > enum mmhubbub_wbif_mode wbif_mode; > struct mcif_arb_params *wb_arb_params; > - int i, j, k, dwb_pipe; > + int i, j, dwb_pipe; >=20 > /* Writeback MCIF_WB arbitration parameters */ > dwb_pipe =3D 0; > @@ -2437,11 +1624,10 @@ void dcn20_set_mcif_arb_params( > } else > wbif_mode =3D PACKED_444; >=20 > - for (k =3D 0; k < sizeof(wb_arb_params->cli_water= mark)/sizeof(wb_arb_params->cli_watermark[0]); k++) { > - wb_arb_params->cli_watermark[k] =3D get_w= m_writeback_urgent(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - wb_arb_params->pstate_watermark[k] =3D ge= t_wm_writeback_dram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 1= 000; > - } > - wb_arb_params->time_per_pixel =3D 16.0 * 1000 / (= context->res_ctx.pipe_ctx[i].stream->phy_pix_clk / 1000); /* 4 bit fraction= , ms */ > + DC_FP_START(); > + dcn20_fpu_set_wb_arb_params(wb_arb_params, contex= t, pipes, pipe_cnt, i); > + DC_FP_END(); > + > wb_arb_params->slice_lines =3D 32; > wb_arb_params->arbitration_slice =3D 2; > wb_arb_params->max_scaled_time =3D dcn20_calc_ma= x_scaled_time(wb_arb_params->time_per_pixel, > @@ -2808,8 +1994,11 @@ int dcn20_validate_apply_pipe_split_flags( > } >=20 > /* Adjust dppclk when split is forced, do not bother wit= h dispclk */ > - if (split[i] !=3D 0 && v->NoOfDPP[vlevel][max_mpc_comb][p= ipe_idx] =3D=3D 1) > - v->RequiredDPPCLK[vlevel][max_mpc_comb][pipe_idx]= /=3D 2; > + if (split[i] !=3D 0 && v->NoOfDPP[vlevel][max_mpc_comb][p= ipe_idx] =3D=3D 1) { > + DC_FP_START(); > + dcn20_fpu_adjust_dppclk(v, vlevel, max_mpc_comb, = pipe_idx, false); > + DC_FP_END(); > + } > pipe_idx++; > } >=20 > @@ -2835,7 +2024,9 @@ bool dcn20_fast_validate_bw( >=20 > dcn20_merge_pipes_for_validate(dc, context); >=20 > + DC_FP_START(); > pipe_cnt =3D dc->res_pool->funcs->populate_dml_pipes(dc, context= , pipes, fast_validate); > + DC_FP_END(); >=20 > *pipe_cnt_out =3D pipe_cnt; >=20 > @@ -2892,7 +2083,9 @@ bool dcn20_fast_validate_bw( > hsplit_pipe =3D dcn20_find_secondary_pip= e(dc, &context->res_ctx, dc->res_pool, pipe); > ASSERT(hsplit_pipe); > if (!hsplit_pipe) { > - context->bw_ctx.dml.vba.RequiredD= PPCLK[vlevel][context->bw_ctx.dml.vba.maxMpcComb][pipe_idx] *=3D 2; > + DC_FP_START(); > + dcn20_fpu_adjust_dppclk(&context-= >bw_ctx.dml.vba, vlevel, context->bw_ctx.dml.vba.maxMpcComb, pipe_idx, true= ); > + DC_FP_END(); > continue; > } > if (context->bw_ctx.dml.vba.ODMCombineEn= abled[pipe_idx]) { > @@ -2934,369 +2127,6 @@ bool dcn20_fast_validate_bw( > return out; > } >=20 > -static void dcn20_calculate_wm( > - struct dc *dc, struct dc_state *context, > - display_e2e_pipe_params_st *pipes, > - int *out_pipe_cnt, > - int *pipe_split_from, > - int vlevel, > - bool fast_validate) > -{ > - int pipe_cnt, i, pipe_idx; > - > - for (i =3D 0, pipe_idx =3D 0, pipe_cnt =3D 0; i < dc->res_pool->p= ipe_count; i++) { > - if (!context->res_ctx.pipe_ctx[i].stream) > - continue; > - > - pipes[pipe_cnt].clks_cfg.refclk_mhz =3D dc->res_pool->ref= _clocks.dchub_ref_clock_inKhz / 1000.0; > - pipes[pipe_cnt].clks_cfg.dispclk_mhz =3D context->bw_ctx.= dml.vba.RequiredDISPCLK[vlevel][context->bw_ctx.dml.vba.maxMpcComb]; > - > - if (pipe_split_from[i] < 0) { > - pipes[pipe_cnt].clks_cfg.dppclk_mhz =3D > - context->bw_ctx.dml.vba.RequiredD= PPCLK[vlevel][context->bw_ctx.dml.vba.maxMpcComb][pipe_idx]; > - if (context->bw_ctx.dml.vba.BlendingAndTiming[pip= e_idx] =3D=3D pipe_idx) > - pipes[pipe_cnt].pipe.dest.odm_combine =3D > - context->bw_ctx.dml.vba.O= DMCombineEnabled[pipe_idx]; > - else > - pipes[pipe_cnt].pipe.dest.odm_combine =3D= 0; > - pipe_idx++; > - } else { > - pipes[pipe_cnt].clks_cfg.dppclk_mhz =3D > - context->bw_ctx.dml.vba.RequiredD= PPCLK[vlevel][context->bw_ctx.dml.vba.maxMpcComb][pipe_split_from[i]]; > - if (context->bw_ctx.dml.vba.BlendingAndTiming[pip= e_split_from[i]] =3D=3D pipe_split_from[i]) > - pipes[pipe_cnt].pipe.dest.odm_combine =3D > - context->bw_ctx.dml.vba.O= DMCombineEnabled[pipe_split_from[i]]; > - else > - pipes[pipe_cnt].pipe.dest.odm_combine =3D= 0; > - } > - > - if (dc->config.forced_clocks) { > - pipes[pipe_cnt].clks_cfg.dispclk_mhz =3D context-= >bw_ctx.dml.soc.clock_limits[0].dispclk_mhz; > - pipes[pipe_cnt].clks_cfg.dppclk_mhz =3D context->= bw_ctx.dml.soc.clock_limits[0].dppclk_mhz; > - } > - if (dc->debug.min_disp_clk_khz > pipes[pipe_cnt].clks_cfg= =2Edispclk_mhz * 1000) > - pipes[pipe_cnt].clks_cfg.dispclk_mhz =3D dc->debu= g.min_disp_clk_khz / 1000.0; > - if (dc->debug.min_dpp_clk_khz > pipes[pipe_cnt].clks_cfg.= dppclk_mhz * 1000) > - pipes[pipe_cnt].clks_cfg.dppclk_mhz =3D dc->debug= =2Emin_dpp_clk_khz / 1000.0; > - > - pipe_cnt++; > - } > - > - if (pipe_cnt !=3D pipe_idx) { > - if (dc->res_pool->funcs->populate_dml_pipes) > - pipe_cnt =3D dc->res_pool->funcs->populate_dml_pi= pes(dc, > - context, pipes, fast_validate); > - else > - pipe_cnt =3D dcn20_populate_dml_pipes_from_contex= t(dc, > - context, pipes, fast_validate); > - } > - > - *out_pipe_cnt =3D pipe_cnt; > - > - pipes[0].clks_cfg.voltage =3D vlevel; > - pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.clock_li= mits[vlevel].dcfclk_mhz; > - pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.clock_li= mits[vlevel].socclk_mhz; > - > - /* only pipe 0 is read for voltage and dcf/soc clocks */ > - if (vlevel < 1) { > - pipes[0].clks_cfg.voltage =3D 1; > - pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[1].dcfclk_mhz; > - pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[1].socclk_mhz; > - } > - context->bw_ctx.bw.dcn.watermarks.b.urgent_ns =3D get_wm_urgent(&= context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.b.cstate_pstate.cstate_enter_pl= us_exit_ns =3D get_wm_stutter_enter_exit(&context->bw_ctx.dml, pipes, pipe_= cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.b.cstate_pstate.cstate_exit_ns = =3D get_wm_stutter_exit(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.b.cstate_pstate.pstate_change_n= s =3D get_wm_dram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 100= 0; > - context->bw_ctx.bw.dcn.watermarks.b.pte_meta_urgent_ns =3D get_wm= _memory_trip(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.b.frac_urg_bw_nom =3D get_fract= ion_of_urgent_bandwidth(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.b.frac_urg_bw_flip =3D get_frac= tion_of_urgent_bandwidth_imm_flip(&context->bw_ctx.dml, pipes, pipe_cnt) * = 1000; > - context->bw_ctx.bw.dcn.watermarks.b.urgent_latency_ns =3D get_urg= ent_latency(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - > - if (vlevel < 2) { > - pipes[0].clks_cfg.voltage =3D 2; > - pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[2].dcfclk_mhz; > - pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[2].socclk_mhz; > - } > - context->bw_ctx.bw.dcn.watermarks.c.urgent_ns =3D get_wm_urgent(&= context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.c.cstate_pstate.cstate_enter_pl= us_exit_ns =3D get_wm_stutter_enter_exit(&context->bw_ctx.dml, pipes, pipe_= cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.c.cstate_pstate.cstate_exit_ns = =3D get_wm_stutter_exit(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.c.cstate_pstate.pstate_change_n= s =3D get_wm_dram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 100= 0; > - context->bw_ctx.bw.dcn.watermarks.c.pte_meta_urgent_ns =3D get_wm= _memory_trip(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.c.frac_urg_bw_nom =3D get_fract= ion_of_urgent_bandwidth(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.c.frac_urg_bw_flip =3D get_frac= tion_of_urgent_bandwidth_imm_flip(&context->bw_ctx.dml, pipes, pipe_cnt) * = 1000; > - > - if (vlevel < 3) { > - pipes[0].clks_cfg.voltage =3D 3; > - pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[2].dcfclk_mhz; > - pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[2].socclk_mhz; > - } > - context->bw_ctx.bw.dcn.watermarks.d.urgent_ns =3D get_wm_urgent(&= context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.d.cstate_pstate.cstate_enter_pl= us_exit_ns =3D get_wm_stutter_enter_exit(&context->bw_ctx.dml, pipes, pipe_= cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.d.cstate_pstate.cstate_exit_ns = =3D get_wm_stutter_exit(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.d.cstate_pstate.pstate_change_n= s =3D get_wm_dram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 100= 0; > - context->bw_ctx.bw.dcn.watermarks.d.pte_meta_urgent_ns =3D get_wm= _memory_trip(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.d.frac_urg_bw_nom =3D get_fract= ion_of_urgent_bandwidth(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.d.frac_urg_bw_flip =3D get_frac= tion_of_urgent_bandwidth_imm_flip(&context->bw_ctx.dml, pipes, pipe_cnt) * = 1000; > - > - pipes[0].clks_cfg.voltage =3D vlevel; > - pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.clock_li= mits[vlevel].dcfclk_mhz; > - pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.clock_li= mits[vlevel].socclk_mhz; > - context->bw_ctx.bw.dcn.watermarks.a.urgent_ns =3D get_wm_urgent(&= context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.a.cstate_pstate.cstate_enter_pl= us_exit_ns =3D get_wm_stutter_enter_exit(&context->bw_ctx.dml, pipes, pipe_= cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.a.cstate_pstate.cstate_exit_ns = =3D get_wm_stutter_exit(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.a.cstate_pstate.pstate_change_n= s =3D get_wm_dram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 100= 0; > - context->bw_ctx.bw.dcn.watermarks.a.pte_meta_urgent_ns =3D get_wm= _memory_trip(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.a.frac_urg_bw_nom =3D get_fract= ion_of_urgent_bandwidth(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > - context->bw_ctx.bw.dcn.watermarks.a.frac_urg_bw_flip =3D get_frac= tion_of_urgent_bandwidth_imm_flip(&context->bw_ctx.dml, pipes, pipe_cnt) * = 1000; > -} > - > -static bool is_dtbclk_required(struct dc *dc, struct dc_state *context) > -{ > - int i; > - for (i =3D 0; i < dc->res_pool->pipe_count; i++) { > - if (!context->res_ctx.pipe_ctx[i].stream) > - continue; > - if (is_dp_128b_132b_signal(&context->res_ctx.pipe_ctx[i])) > - return true; > - } > - return false; > -} > - > -static enum dcn_zstate_support_state decide_zstate_support(struct dc *d= c, struct dc_state *context) > -{ > - int plane_count; > - int i; > - > - plane_count =3D 0; > - for (i =3D 0; i < dc->res_pool->pipe_count; i++) { > - if (context->res_ctx.pipe_ctx[i].plane_state) > - plane_count++; > - } > - > - /* > - * Zstate is allowed in following scenarios: > - * 1. Single eDP with PSR enabled > - * 2. 0 planes (No memory requests) > - * 3. Single eDP without PSR but > 5ms stutter period > - */ > - if (plane_count =3D=3D 0) > - return DCN_ZSTATE_SUPPORT_ALLOW; > - else if (context->stream_count =3D=3D 1 && context->streams[0]->= signal =3D=3D SIGNAL_TYPE_EDP) { > - struct dc_link *link =3D context->streams[0]->sink->link; > - > - /* zstate only supported on PWRSEQ0 */ > - if (link->link_index !=3D 0) > - return DCN_ZSTATE_SUPPORT_DISALLOW; > - > - if (context->bw_ctx.dml.vba.StutterPeriod > 5000.0) > - return DCN_ZSTATE_SUPPORT_ALLOW; > - else if (link->psr_settings.psr_version =3D=3D DC_PSR_VER= SION_1 && !dc->debug.disable_psr) > - return DCN_ZSTATE_SUPPORT_ALLOW_Z10_ONLY; > - else > - return DCN_ZSTATE_SUPPORT_DISALLOW; > - } else > - return DCN_ZSTATE_SUPPORT_DISALLOW; > -} > - > -void dcn20_calculate_dlg_params( > - struct dc *dc, struct dc_state *context, > - display_e2e_pipe_params_st *pipes, > - int pipe_cnt, > - int vlevel) > -{ > - int i, pipe_idx; > - > - /* Writeback MCIF_WB arbitration parameters */ > - dc->res_pool->funcs->set_mcif_arb_params(dc, context, pipes, pipe= _cnt); > - > - context->bw_ctx.bw.dcn.clk.dispclk_khz =3D context->bw_ctx.dml.vb= a.DISPCLK * 1000; > - context->bw_ctx.bw.dcn.clk.dcfclk_khz =3D context->bw_ctx.dml.vba= =2EDCFCLK * 1000; > - context->bw_ctx.bw.dcn.clk.socclk_khz =3D context->bw_ctx.dml.vba= =2ESOCCLK * 1000; > - context->bw_ctx.bw.dcn.clk.dramclk_khz =3D context->bw_ctx.dml.vb= a.DRAMSpeed * 1000 / 16; > - > - if (dc->debug.min_dram_clk_khz > context->bw_ctx.bw.dcn.clk.dramc= lk_khz) > - context->bw_ctx.bw.dcn.clk.dramclk_khz =3D dc->debug.min_= dram_clk_khz; > - > - context->bw_ctx.bw.dcn.clk.dcfclk_deep_sleep_khz =3D context->bw_= ctx.dml.vba.DCFCLKDeepSleep * 1000; > - context->bw_ctx.bw.dcn.clk.fclk_khz =3D context->bw_ctx.dml.vba.F= abricClock * 1000; > - context->bw_ctx.bw.dcn.clk.p_state_change_support =3D > - context->bw_ctx.dml.vba.DRAMClockChangeSupport[vlevel][co= ntext->bw_ctx.dml.vba.maxMpcComb] > - !=3D dm_dram_cloc= k_change_unsupported; > - context->bw_ctx.bw.dcn.clk.dppclk_khz =3D 0; > - > - context->bw_ctx.bw.dcn.clk.zstate_support =3D decide_zstate_suppo= rt(dc, context); > - > - context->bw_ctx.bw.dcn.clk.dtbclk_en =3D is_dtbclk_required(dc, c= ontext); > - > - if (context->bw_ctx.bw.dcn.clk.dispclk_khz < dc->debug.min_disp_c= lk_khz) > - context->bw_ctx.bw.dcn.clk.dispclk_khz =3D dc->debug.min_= disp_clk_khz; > - > - for (i =3D 0, pipe_idx =3D 0; i < dc->res_pool->pipe_count; i++) { > - if (!context->res_ctx.pipe_ctx[i].stream) > - continue; > - pipes[pipe_idx].pipe.dest.vstartup_start =3D get_vstartup= (&context->bw_ctx.dml, pipes, pipe_cnt, pipe_idx); > - pipes[pipe_idx].pipe.dest.vupdate_offset =3D get_vupdate_= offset(&context->bw_ctx.dml, pipes, pipe_cnt, pipe_idx); > - pipes[pipe_idx].pipe.dest.vupdate_width =3D get_vupdate_w= idth(&context->bw_ctx.dml, pipes, pipe_cnt, pipe_idx); > - pipes[pipe_idx].pipe.dest.vready_offset =3D get_vready_of= fset(&context->bw_ctx.dml, pipes, pipe_cnt, pipe_idx); > - context->res_ctx.pipe_ctx[i].det_buffer_size_kb =3D conte= xt->bw_ctx.dml.ip.det_buffer_size_kbytes; > - context->res_ctx.pipe_ctx[i].unbounded_req =3D pipes[pipe= _idx].pipe.src.unbounded_req_mode; > - > - if (context->bw_ctx.bw.dcn.clk.dppclk_khz < pipes[pipe_id= x].clks_cfg.dppclk_mhz * 1000) > - context->bw_ctx.bw.dcn.clk.dppclk_khz =3D pipes[p= ipe_idx].clks_cfg.dppclk_mhz * 1000; > - context->res_ctx.pipe_ctx[i].plane_res.bw.dppclk_khz =3D > - pipes[pipe_idx].clks_cfg.= dppclk_mhz * 1000; > - context->res_ctx.pipe_ctx[i].pipe_dlg_param =3D pipes[pip= e_idx].pipe.dest; > - pipe_idx++; > - } > - /*save a original dppclock copy*/ > - context->bw_ctx.bw.dcn.clk.bw_dppclk_khz =3D context->bw_ctx.bw.d= cn.clk.dppclk_khz; > - context->bw_ctx.bw.dcn.clk.bw_dispclk_khz =3D context->bw_ctx.bw.= dcn.clk.dispclk_khz; > - context->bw_ctx.bw.dcn.clk.max_supported_dppclk_khz =3D context->= bw_ctx.dml.soc.clock_limits[vlevel].dppclk_mhz * 1000; > - context->bw_ctx.bw.dcn.clk.max_supported_dispclk_khz =3D context-= >bw_ctx.dml.soc.clock_limits[vlevel].dispclk_mhz * 1000; > - > - context->bw_ctx.bw.dcn.compbuf_size_kb =3D context->bw_ctx.dml.ip= =2Econfig_return_buffer_size_in_kbytes > - - context->bw_ctx.dml.ip.= det_buffer_size_kbytes * pipe_idx; > - > - for (i =3D 0, pipe_idx =3D 0; i < dc->res_pool->pipe_count; i++) { > - bool cstate_en =3D context->bw_ctx.dml.vba.PrefetchMode[v= level][context->bw_ctx.dml.vba.maxMpcComb] !=3D 2; > - > - if (!context->res_ctx.pipe_ctx[i].stream) > - continue; > - > - if (dc->ctx->dce_version =3D=3D DCN_VERSION_2_01) > - cstate_en =3D false; > - > - context->bw_ctx.dml.funcs.rq_dlg_get_dlg_reg(&context->bw= _ctx.dml, > - &context->res_ctx.pipe_ctx[i].dlg_regs, > - &context->res_ctx.pipe_ctx[i].ttu_regs, > - pipes, > - pipe_cnt, > - pipe_idx, > - cstate_en, > - context->bw_ctx.bw.dcn.clk.p_state_change= _support, > - false, false, true); > - > - context->bw_ctx.dml.funcs.rq_dlg_get_rq_reg(&context->bw_= ctx.dml, > - &context->res_ctx.pipe_ctx[i].rq_regs, > - &pipes[pipe_idx].pipe); > - pipe_idx++; > - } > -} > - > -static bool dcn20_validate_bandwidth_internal(struct dc *dc, struct dc_s= tate *context, > - bool fast_validate) > -{ > - bool out =3D false; > - > - BW_VAL_TRACE_SETUP(); > - > - int vlevel =3D 0; > - int pipe_split_from[MAX_PIPES]; > - int pipe_cnt =3D 0; > - display_e2e_pipe_params_st *pipes =3D kzalloc(dc->res_pool->pipe_= count * sizeof(display_e2e_pipe_params_st), GFP_ATOMIC); > - DC_LOGGER_INIT(dc->ctx->logger); > - > - BW_VAL_TRACE_COUNT(); > - > - out =3D dcn20_fast_validate_bw(dc, context, pipes, &pipe_cnt, pip= e_split_from, &vlevel, fast_validate); > - > - if (pipe_cnt =3D=3D 0) > - goto validate_out; > - > - if (!out) > - goto validate_fail; > - > - BW_VAL_TRACE_END_VOLTAGE_LEVEL(); > - > - if (fast_validate) { > - BW_VAL_TRACE_SKIP(fast); > - goto validate_out; > - } > - > - dcn20_calculate_wm(dc, context, pipes, &pipe_cnt, pipe_split_from= , vlevel, fast_validate); > - dcn20_calculate_dlg_params(dc, context, pipes, pipe_cnt, vlevel); > - > - BW_VAL_TRACE_END_WATERMARKS(); > - > - goto validate_out; > - > -validate_fail: > - DC_LOG_WARNING("Mode Validation Warning: %s failed validation.\n", > - dml_get_status_message(context->bw_ctx.dml.vba.Validation= Status[context->bw_ctx.dml.vba.soc.num_states])); > - > - BW_VAL_TRACE_SKIP(fail); > - out =3D false; > - > -validate_out: > - kfree(pipes); > - > - BW_VAL_TRACE_FINISH(); > - > - return out; > -} > - > -/* > - * This must be noinline to ensure anything that deals with FP registers > - * is contained within this call; previously our compiling with hard-flo= at > - * would result in fp instructions being emitted outside of the boundari= es > - * of the DC_FP_START/END macros, which makes sense as the compiler has = no > - * idea about what is wrapped and what is not > - * > - * This is largely just a workaround to avoid breakage introduced with 5= =2E6, > - * ideally all fp-using code should be moved into its own file, only that > - * should be compiled with hard-float, and all code exported from there > - * should be strictly wrapped with DC_FP_START/END > - */ > -static noinline bool dcn20_validate_bandwidth_fp(struct dc *dc, > - struct dc_state *context, bool fast_validate) > -{ > - bool voltage_supported =3D false; > - bool full_pstate_supported =3D false; > - bool dummy_pstate_supported =3D false; > - double p_state_latency_us; > - > - p_state_latency_us =3D context->bw_ctx.dml.soc.dram_clock_change_= latency_us; > - context->bw_ctx.dml.soc.disable_dram_clock_change_vactive_support= =3D > - dc->debug.disable_dram_clock_change_vactive_support; > - context->bw_ctx.dml.soc.allow_dram_clock_one_display_vactive =3D > - dc->debug.enable_dram_clock_change_one_display_vactive; > - > - /*Unsafe due to current pipe merge and split logic*/ > - ASSERT(context !=3D dc->current_state); > - > - if (fast_validate) { > - return dcn20_validate_bandwidth_internal(dc, context, tru= e); > - } > - > - // Best case, we support full UCLK switch latency > - voltage_supported =3D dcn20_validate_bandwidth_internal(dc, conte= xt, false); > - full_pstate_supported =3D context->bw_ctx.bw.dcn.clk.p_state_chan= ge_support; > - > - if (context->bw_ctx.dml.soc.dummy_pstate_latency_us =3D=3D 0 || > - (voltage_supported && full_pstate_supported)) { > - context->bw_ctx.bw.dcn.clk.p_state_change_support =3D ful= l_pstate_supported; > - goto restore_dml_state; > - } > - > - // Fallback: Try to only support G6 temperature read latency > - context->bw_ctx.dml.soc.dram_clock_change_latency_us =3D context-= >bw_ctx.dml.soc.dummy_pstate_latency_us; > - > - voltage_supported =3D dcn20_validate_bandwidth_internal(dc, conte= xt, false); > - dummy_pstate_supported =3D context->bw_ctx.bw.dcn.clk.p_state_cha= nge_support; > - > - if (voltage_supported && (dummy_pstate_supported || !(context->st= ream_count))) { > - context->bw_ctx.bw.dcn.clk.p_state_change_support =3D fal= se; > - goto restore_dml_state; > - } > - > - // ERROR: fallback is supposed to always work. > - ASSERT(false); > - > -restore_dml_state: > - context->bw_ctx.dml.soc.dram_clock_change_latency_us =3D p_state_= latency_us; > - return voltage_supported; > -} > - > bool dcn20_validate_bandwidth(struct dc *dc, struct dc_state *context, > bool fast_validate) > { > @@ -3464,170 +2294,6 @@ static void dcn20_pp_smu_destroy(struct pp_smu_fu= ncs **pp_smu) > } > } >=20 > -void dcn20_cap_soc_clocks( > - struct _vcs_dpi_soc_bounding_box_st *bb, > - struct pp_smu_nv_clock_table max_clocks) > -{ > - int i; > - > - // First pass - cap all clocks higher than the reported max > - for (i =3D 0; i < bb->num_states; i++) { > - if ((bb->clock_limits[i].dcfclk_mhz > (max_clocks.dcfCloc= kInKhz / 1000)) > - && max_clocks.dcfClockInKhz !=3D 0) > - bb->clock_limits[i].dcfclk_mhz =3D (max_clocks.dc= fClockInKhz / 1000); > - > - if ((bb->clock_limits[i].dram_speed_mts > (max_clocks.uCl= ockInKhz / 1000) * 16) > - && max_clocks.uClockInKhz= !=3D 0) > - bb->clock_limits[i].dram_speed_mts =3D (max_clock= s.uClockInKhz / 1000) * 16; > - > - if ((bb->clock_limits[i].fabricclk_mhz > (max_clocks.fabr= icClockInKhz / 1000)) > - && max_clocks.fabricClock= InKhz !=3D 0) > - bb->clock_limits[i].fabricclk_mhz =3D (max_clocks= =2EfabricClockInKhz / 1000); > - > - if ((bb->clock_limits[i].dispclk_mhz > (max_clocks.displa= yClockInKhz / 1000)) > - && max_clocks.displayCloc= kInKhz !=3D 0) > - bb->clock_limits[i].dispclk_mhz =3D (max_clocks.d= isplayClockInKhz / 1000); > - > - if ((bb->clock_limits[i].dppclk_mhz > (max_clocks.dppCloc= kInKhz / 1000)) > - && max_clocks.dppClockInK= hz !=3D 0) > - bb->clock_limits[i].dppclk_mhz =3D (max_clocks.dp= pClockInKhz / 1000); > - > - if ((bb->clock_limits[i].phyclk_mhz > (max_clocks.phyCloc= kInKhz / 1000)) > - && max_clocks.phyClockInK= hz !=3D 0) > - bb->clock_limits[i].phyclk_mhz =3D (max_clocks.ph= yClockInKhz / 1000); > - > - if ((bb->clock_limits[i].socclk_mhz > (max_clocks.socCloc= kInKhz / 1000)) > - && max_clocks.socClockInK= hz !=3D 0) > - bb->clock_limits[i].socclk_mhz =3D (max_clocks.so= cClockInKhz / 1000); > - > - if ((bb->clock_limits[i].dscclk_mhz > (max_clocks.dscCloc= kInKhz / 1000)) > - && max_clocks.dscClockInK= hz !=3D 0) > - bb->clock_limits[i].dscclk_mhz =3D (max_clocks.ds= cClockInKhz / 1000); > - } > - > - // Second pass - remove all duplicate clock states > - for (i =3D bb->num_states - 1; i > 1; i--) { > - bool duplicate =3D true; > - > - if (bb->clock_limits[i-1].dcfclk_mhz !=3D bb->clock_limit= s[i].dcfclk_mhz) > - duplicate =3D false; > - if (bb->clock_limits[i-1].dispclk_mhz !=3D bb->clock_limi= ts[i].dispclk_mhz) > - duplicate =3D false; > - if (bb->clock_limits[i-1].dppclk_mhz !=3D bb->clock_limit= s[i].dppclk_mhz) > - duplicate =3D false; > - if (bb->clock_limits[i-1].dram_speed_mts !=3D bb->clock_l= imits[i].dram_speed_mts) > - duplicate =3D false; > - if (bb->clock_limits[i-1].dscclk_mhz !=3D bb->clock_limit= s[i].dscclk_mhz) > - duplicate =3D false; > - if (bb->clock_limits[i-1].fabricclk_mhz !=3D bb->clock_li= mits[i].fabricclk_mhz) > - duplicate =3D false; > - if (bb->clock_limits[i-1].phyclk_mhz !=3D bb->clock_limit= s[i].phyclk_mhz) > - duplicate =3D false; > - if (bb->clock_limits[i-1].socclk_mhz !=3D bb->clock_limit= s[i].socclk_mhz) > - duplicate =3D false; > - > - if (duplicate) > - bb->num_states--; > - } > -} > - > -void dcn20_update_bounding_box(struct dc *dc, struct _vcs_dpi_soc_boundi= ng_box_st *bb, > - struct pp_smu_nv_clock_table *max_clocks, unsigned int *u= clk_states, unsigned int num_states) > -{ > - struct _vcs_dpi_voltage_scaling_st calculated_states[DC__VOLTAGE_= STATES]; > - int i; > - int num_calculated_states =3D 0; > - int min_dcfclk =3D 0; > - > - if (num_states =3D=3D 0) > - return; > - > - memset(calculated_states, 0, sizeof(calculated_states)); > - > - if (dc->bb_overrides.min_dcfclk_mhz > 0) > - min_dcfclk =3D dc->bb_overrides.min_dcfclk_mhz; > - else { > - if (ASICREV_IS_NAVI12_P(dc->ctx->asic_id.hw_internal_rev)) > - min_dcfclk =3D 310; > - else > - // Accounting for SOC/DCF relationship, we can go= as high as > - // 506Mhz in Vmin. > - min_dcfclk =3D 506; > - } > - > - for (i =3D 0; i < num_states; i++) { > - int min_fclk_required_by_uclk; > - calculated_states[i].state =3D i; > - calculated_states[i].dram_speed_mts =3D uclk_states[i] * = 16 / 1000; > - > - // FCLK:UCLK ratio is 1.08 > - min_fclk_required_by_uclk =3D div_u64(((unsigned long lon= g)uclk_states[i]) * 1080, > - 1000000); > - > - calculated_states[i].fabricclk_mhz =3D (min_fclk_required= _by_uclk < min_dcfclk) ? > - min_dcfclk : min_fclk_required_by_uclk; > - > - calculated_states[i].socclk_mhz =3D (calculated_states[i]= =2Efabricclk_mhz > max_clocks->socClockInKhz / 1000) ? > - max_clocks->socClockInKhz / 1000 : calcul= ated_states[i].fabricclk_mhz; > - > - calculated_states[i].dcfclk_mhz =3D (calculated_states[i]= =2Efabricclk_mhz > max_clocks->dcfClockInKhz / 1000) ? > - max_clocks->dcfClockInKhz / 1000 : calcul= ated_states[i].fabricclk_mhz; > - > - calculated_states[i].dispclk_mhz =3D max_clocks->displayC= lockInKhz / 1000; > - calculated_states[i].dppclk_mhz =3D max_clocks->displayCl= ockInKhz / 1000; > - calculated_states[i].dscclk_mhz =3D max_clocks->displayCl= ockInKhz / (1000 * 3); > - > - calculated_states[i].phyclk_mhz =3D max_clocks->phyClockI= nKhz / 1000; > - > - num_calculated_states++; > - } > - > - calculated_states[num_calculated_states - 1].socclk_mhz =3D max_c= locks->socClockInKhz / 1000; > - calculated_states[num_calculated_states - 1].fabricclk_mhz =3D ma= x_clocks->socClockInKhz / 1000; > - calculated_states[num_calculated_states - 1].dcfclk_mhz =3D max_c= locks->dcfClockInKhz / 1000; > - > - memcpy(bb->clock_limits, calculated_states, sizeof(bb->clock_limi= ts)); > - bb->num_states =3D num_calculated_states; > - > - // Duplicate the last state, DML always an extra state identical = to max state to work > - memcpy(&bb->clock_limits[num_calculated_states], &bb->clock_limit= s[num_calculated_states - 1], sizeof(struct _vcs_dpi_voltage_scaling_st)); > - bb->clock_limits[num_calculated_states].state =3D bb->num_states; > -} > - > -void dcn20_patch_bounding_box(struct dc *dc, struct _vcs_dpi_soc_boundin= g_box_st *bb) > -{ > - if ((int)(bb->sr_exit_time_us * 1000) !=3D dc->bb_overrides.sr_ex= it_time_ns > - && dc->bb_overrides.sr_exit_time_ns) { > - bb->sr_exit_time_us =3D dc->bb_overrides.sr_exit_time_ns = / 1000.0; > - } > - > - if ((int)(bb->sr_enter_plus_exit_time_us * 1000) > - !=3D dc->bb_overrides.sr_enter_plus_exit_= time_ns > - && dc->bb_overrides.sr_enter_plus_exit_time_ns) { > - bb->sr_enter_plus_exit_time_us =3D > - dc->bb_overrides.sr_enter_plus_exit_time_= ns / 1000.0; > - } > - > - if ((int)(bb->urgent_latency_us * 1000) !=3D dc->bb_overrides.urg= ent_latency_ns > - && dc->bb_overrides.urgent_latency_ns) { > - bb->urgent_latency_us =3D dc->bb_overrides.urgent_latency= _ns / 1000.0; > - } > - > - if ((int)(bb->dram_clock_change_latency_us * 1000) > - !=3D dc->bb_overrides.dram_clock_change_l= atency_ns > - && dc->bb_overrides.dram_clock_change_latency_ns)= { > - bb->dram_clock_change_latency_us =3D > - dc->bb_overrides.dram_clock_change_latenc= y_ns / 1000.0; > - } > - > - if ((int)(bb->dummy_pstate_latency_us * 1000) > - !=3D dc->bb_overrides.dummy_clock_change_= latency_ns > - && dc->bb_overrides.dummy_clock_change_latency_ns= ) { > - bb->dummy_pstate_latency_us =3D > - dc->bb_overrides.dummy_clock_change_laten= cy_ns / 1000.0; > - } > -} > - > static struct _vcs_dpi_soc_bounding_box_st *get_asic_rev_soc_bb( > uint32_t hw_internal_rev) > { > @@ -3910,9 +2576,9 @@ static bool dcn20_resource_construct( > ranges.reader_wm_sets[i].wm_inst =3D i; > ranges.reader_wm_sets[i].min_drain_clk_m= hz =3D PP_SMU_WM_SET_RANGE_CLK_UNCONSTRAINED_MIN; > ranges.reader_wm_sets[i].max_drain_clk_m= hz =3D PP_SMU_WM_SET_RANGE_CLK_UNCONSTRAINED_MAX; > - ranges.reader_wm_sets[i].min_fill_clk_mhz= =3D (i > 0) ? (loaded_bb->clock_limits[i - 1].dram_speed_mts / 16) + 1 : 0; > - ranges.reader_wm_sets[i].max_fill_clk_mhz= =3D loaded_bb->clock_limits[i].dram_speed_mts / 16; > - > + DC_FP_START(); > + dcn20_fpu_set_wm_ranges(i, &ranges, loade= d_bb); > + DC_FP_END(); > ranges.num_reader_wm_sets =3D i + 1; > } >=20 > diff --git a/drivers/gpu/drm/amd/display/dc/dcn20/dcn20_resource.h b/driv= ers/gpu/drm/amd/display/dc/dcn20/dcn20_resource.h > index 6ec8ff45f0f7..961923c56ea0 100644 > --- a/drivers/gpu/drm/amd/display/dc/dcn20/dcn20_resource.h > +++ b/drivers/gpu/drm/amd/display/dc/dcn20/dcn20_resource.h > @@ -27,6 +27,7 @@ > #define __DC_RESOURCE_DCN20_H__ >=20 > #include "core_types.h" > +#include "dml/dcn20/dcn20_fpu.h" >=20 > #define TO_DCN20_RES_POOL(pool)\ > container_of(pool, struct dcn20_resource_pool, base) > @@ -35,6 +36,12 @@ struct dc; > struct resource_pool; > struct _vcs_dpi_display_pipe_params_st; >=20 > +extern struct _vcs_dpi_ip_params_st dcn2_0_ip; > +extern struct _vcs_dpi_ip_params_st dcn2_0_nv14_ip; > +extern struct _vcs_dpi_soc_bounding_box_st dcn2_0_soc; > +extern struct _vcs_dpi_soc_bounding_box_st dcn2_0_nv14_soc; > +extern struct _vcs_dpi_soc_bounding_box_st dcn2_0_nv12_soc; > + > struct dcn20_resource_pool { > struct resource_pool base; > }; > @@ -49,11 +56,6 @@ unsigned int dcn20_calc_max_scaled_time( > unsigned int time_per_pixel, > enum mmhubbub_wbif_mode mode, > unsigned int urgent_watermark); > -int dcn20_populate_dml_pipes_from_context( > - struct dc *dc, > - struct dc_state *context, > - display_e2e_pipe_params_st *pipes, > - bool fast_validate); > struct pipe_ctx *dcn20_acquire_idle_pipe_for_layer( > struct dc_state *state, > const struct resource_pool *pool, > @@ -79,7 +81,6 @@ struct dpp *dcn20_dpp_create( > struct input_pixel_processor *dcn20_ipp_create( > struct dc_context *ctx, uint32_t inst); >=20 > - > struct output_pixel_processor *dcn20_opp_create( > struct dc_context *ctx, uint32_t inst); >=20 > @@ -96,11 +97,6 @@ struct display_stream_compressor *dcn20_dsc_create( > struct dc_context *ctx, uint32_t inst); > void dcn20_dsc_destroy(struct display_stream_compressor **dsc); >=20 > -void dcn20_cap_soc_clocks( > - struct _vcs_dpi_soc_bounding_box_st *bb, > - struct pp_smu_nv_clock_table max_clocks); > -void dcn20_update_bounding_box(struct dc *dc, struct _vcs_dpi_soc_boundi= ng_box_st *bb, > - struct pp_smu_nv_clock_table *max_clocks, unsigned int *u= clk_states, unsigned int num_states); > struct hubp *dcn20_hubp_create( > struct dc_context *ctx, > uint32_t inst); > @@ -158,11 +154,6 @@ bool dcn20_fast_validate_bw( > int *pipe_split_from, > int *vlevel_out, > bool fast_validate); > -void dcn20_calculate_dlg_params( > - struct dc *dc, struct dc_state *context, > - display_e2e_pipe_params_st *pipes, > - int pipe_cnt, > - int vlevel); >=20 > enum dc_status dcn20_build_mapped_resource(const struct dc *dc, struct d= c_state *context, struct dc_stream_state *stream); > enum dc_status dcn20_add_stream_to_ctx(struct dc *dc, struct dc_state *n= ew_ctx, struct dc_stream_state *dc_stream); > @@ -170,12 +161,5 @@ enum dc_status dcn20_add_dsc_to_stream_resource(stru= ct dc *dc, struct dc_state * > enum dc_status dcn20_remove_stream_from_ctx(struct dc *dc, struct dc_sta= te *new_ctx, struct dc_stream_state *dc_stream); > enum dc_status dcn20_patch_unknown_plane_state(struct dc_plane_state *pl= ane_state); >=20 > -void dcn20_patch_bounding_box( > - struct dc *dc, > - struct _vcs_dpi_soc_bounding_box_st *bb); > -void dcn20_cap_soc_clocks( > - struct _vcs_dpi_soc_bounding_box_st *bb, > - struct pp_smu_nv_clock_table max_clocks); > - > #endif /* __DC_RESOURCE_DCN20_H__ */ >=20 > diff --git a/drivers/gpu/drm/amd/display/dc/dcn21/dcn21_resource.c b/driv= ers/gpu/drm/amd/display/dc/dcn21/dcn21_resource.c > index e5cc6bf45743..c1cd1a8ff1d7 100644 > --- a/drivers/gpu/drm/amd/display/dc/dcn21/dcn21_resource.c > +++ b/drivers/gpu/drm/amd/display/dc/dcn21/dcn21_resource.c > @@ -1363,7 +1363,9 @@ static noinline bool dcn21_validate_bandwidth_fp(st= ruct dc *dc, > } >=20 > dcn21_calculate_wm(dc, context, pipes, &pipe_cnt, pipe_split_fro= m, vlevel, fast_validate); > + DC_FP_START(); > dcn20_calculate_dlg_params(dc, context, pipes, pipe_cnt, vlevel); > + DC_FP_END(); >=20 > BW_VAL_TRACE_END_WATERMARKS(); >=20 > diff --git a/drivers/gpu/drm/amd/display/dc/dcn30/dcn30_resource.c b/driv= ers/gpu/drm/amd/display/dc/dcn30/dcn30_resource.c > index f10f7a0ca02a..8fa08f026c89 100644 > --- a/drivers/gpu/drm/amd/display/dc/dcn30/dcn30_resource.c > +++ b/drivers/gpu/drm/amd/display/dc/dcn30/dcn30_resource.c > @@ -2261,7 +2261,9 @@ static noinline void dcn30_calculate_wm_and_dlg_fp( > pipe_idx++; > } >=20 > + DC_FP_START(); > dcn20_calculate_dlg_params(dc, context, pipes, pipe_cnt, vlevel); > + DC_FP_END(); >=20 > if (!pstate_en) > /* Restore full p-state latency */ > diff --git a/drivers/gpu/drm/amd/display/dc/dcn31/dcn31_resource.c b/driv= ers/gpu/drm/amd/display/dc/dcn31/dcn31_resource.c > index 660e96a7fe7f..4a9c80482636 100644 > --- a/drivers/gpu/drm/amd/display/dc/dcn31/dcn31_resource.c > +++ b/drivers/gpu/drm/amd/display/dc/dcn31/dcn31_resource.c > @@ -1998,7 +1998,9 @@ static void dcn31_calculate_wm_and_dlg_fp( > pipe_idx++; > } >=20 > + DC_FP_START(); > dcn20_calculate_dlg_params(dc, context, pipes, pipe_cnt, vlevel); > + DC_FP_END(); > } >=20 > void dcn31_calculate_wm_and_dlg( > diff --git a/drivers/gpu/drm/amd/display/dc/dml/dcn20/dcn20_fpu.c b/drive= rs/gpu/drm/amd/display/dc/dml/dcn20/dcn20_fpu.c > index d590dc917363..b7adc9b6a543 100644 > --- a/drivers/gpu/drm/amd/display/dc/dml/dcn20/dcn20_fpu.c > +++ b/drivers/gpu/drm/amd/display/dc/dml/dcn20/dcn20_fpu.c > @@ -25,6 +25,9 @@ > */ >=20 > #include "resource.h" > +#include "clk_mgr.h" > +#include "dc_link_dp.h" > +#include "dcn20/dcn20_resource.h" >=20 > #include "dcn20_fpu.h" >=20 > @@ -61,6 +64,370 @@ > * warning. > */ >=20 > +struct _vcs_dpi_ip_params_st dcn2_0_ip =3D { > + .odm_capable =3D 1, > + .gpuvm_enable =3D 0, > + .hostvm_enable =3D 0, > + .gpuvm_max_page_table_levels =3D 4, > + .hostvm_max_page_table_levels =3D 4, > + .hostvm_cached_page_table_levels =3D 0, > + .pte_group_size_bytes =3D 2048, > + .num_dsc =3D 6, > + .rob_buffer_size_kbytes =3D 168, > + .det_buffer_size_kbytes =3D 164, > + .dpte_buffer_size_in_pte_reqs_luma =3D 84, > + .pde_proc_buffer_size_64k_reqs =3D 48, > + .dpp_output_buffer_pixels =3D 2560, > + .opp_output_buffer_lines =3D 1, > + .pixel_chunk_size_kbytes =3D 8, > + .pte_chunk_size_kbytes =3D 2, > + .meta_chunk_size_kbytes =3D 2, > + .writeback_chunk_size_kbytes =3D 2, > + .line_buffer_size_bits =3D 789504, > + .is_line_buffer_bpp_fixed =3D 0, > + .line_buffer_fixed_bpp =3D 0, > + .dcc_supported =3D true, > + .max_line_buffer_lines =3D 12, > + .writeback_luma_buffer_size_kbytes =3D 12, > + .writeback_chroma_buffer_size_kbytes =3D 8, > + .writeback_chroma_line_buffer_width_pixels =3D 4, > + .writeback_max_hscl_ratio =3D 1, > + .writeback_max_vscl_ratio =3D 1, > + .writeback_min_hscl_ratio =3D 1, > + .writeback_min_vscl_ratio =3D 1, > + .writeback_max_hscl_taps =3D 12, > + .writeback_max_vscl_taps =3D 12, > + .writeback_line_buffer_luma_buffer_size =3D 0, > + .writeback_line_buffer_chroma_buffer_size =3D 14643, > + .cursor_buffer_size =3D 8, > + .cursor_chunk_size =3D 2, > + .max_num_otg =3D 6, > + .max_num_dpp =3D 6, > + .max_num_wb =3D 1, > + .max_dchub_pscl_bw_pix_per_clk =3D 4, > + .max_pscl_lb_bw_pix_per_clk =3D 2, > + .max_lb_vscl_bw_pix_per_clk =3D 4, > + .max_vscl_hscl_bw_pix_per_clk =3D 4, > + .max_hscl_ratio =3D 8, > + .max_vscl_ratio =3D 8, > + .hscl_mults =3D 4, > + .vscl_mults =3D 4, > + .max_hscl_taps =3D 8, > + .max_vscl_taps =3D 8, > + .dispclk_ramp_margin_percent =3D 1, > + .underscan_factor =3D 1.10, > + .min_vblank_lines =3D 32, // > + .dppclk_delay_subtotal =3D 77, // > + .dppclk_delay_scl_lb_only =3D 16, > + .dppclk_delay_scl =3D 50, > + .dppclk_delay_cnvc_formatter =3D 8, > + .dppclk_delay_cnvc_cursor =3D 6, > + .dispclk_delay_subtotal =3D 87, // > + .dcfclk_cstate_latency =3D 10, // SRExitTime > + .max_inter_dcn_tile_repeaters =3D 8, > + .xfc_supported =3D true, > + .xfc_fill_bw_overhead_percent =3D 10.0, > + .xfc_fill_constant_bytes =3D 0, > + .number_of_cursors =3D 1, > +}; > + > +struct _vcs_dpi_ip_params_st dcn2_0_nv14_ip =3D { > + .odm_capable =3D 1, > + .gpuvm_enable =3D 0, > + .hostvm_enable =3D 0, > + .gpuvm_max_page_table_levels =3D 4, > + .hostvm_max_page_table_levels =3D 4, > + .hostvm_cached_page_table_levels =3D 0, > + .num_dsc =3D 5, > + .rob_buffer_size_kbytes =3D 168, > + .det_buffer_size_kbytes =3D 164, > + .dpte_buffer_size_in_pte_reqs_luma =3D 84, > + .dpte_buffer_size_in_pte_reqs_chroma =3D 42,//todo > + .dpp_output_buffer_pixels =3D 2560, > + .opp_output_buffer_lines =3D 1, > + .pixel_chunk_size_kbytes =3D 8, > + .pte_enable =3D 1, > + .max_page_table_levels =3D 4, > + .pte_chunk_size_kbytes =3D 2, > + .meta_chunk_size_kbytes =3D 2, > + .writeback_chunk_size_kbytes =3D 2, > + .line_buffer_size_bits =3D 789504, > + .is_line_buffer_bpp_fixed =3D 0, > + .line_buffer_fixed_bpp =3D 0, > + .dcc_supported =3D true, > + .max_line_buffer_lines =3D 12, > + .writeback_luma_buffer_size_kbytes =3D 12, > + .writeback_chroma_buffer_size_kbytes =3D 8, > + .writeback_chroma_line_buffer_width_pixels =3D 4, > + .writeback_max_hscl_ratio =3D 1, > + .writeback_max_vscl_ratio =3D 1, > + .writeback_min_hscl_ratio =3D 1, > + .writeback_min_vscl_ratio =3D 1, > + .writeback_max_hscl_taps =3D 12, > + .writeback_max_vscl_taps =3D 12, > + .writeback_line_buffer_luma_buffer_size =3D 0, > + .writeback_line_buffer_chroma_buffer_size =3D 14643, > + .cursor_buffer_size =3D 8, > + .cursor_chunk_size =3D 2, > + .max_num_otg =3D 5, > + .max_num_dpp =3D 5, > + .max_num_wb =3D 1, > + .max_dchub_pscl_bw_pix_per_clk =3D 4, > + .max_pscl_lb_bw_pix_per_clk =3D 2, > + .max_lb_vscl_bw_pix_per_clk =3D 4, > + .max_vscl_hscl_bw_pix_per_clk =3D 4, > + .max_hscl_ratio =3D 8, > + .max_vscl_ratio =3D 8, > + .hscl_mults =3D 4, > + .vscl_mults =3D 4, > + .max_hscl_taps =3D 8, > + .max_vscl_taps =3D 8, > + .dispclk_ramp_margin_percent =3D 1, > + .underscan_factor =3D 1.10, > + .min_vblank_lines =3D 32, // > + .dppclk_delay_subtotal =3D 77, // > + .dppclk_delay_scl_lb_only =3D 16, > + .dppclk_delay_scl =3D 50, > + .dppclk_delay_cnvc_formatter =3D 8, > + .dppclk_delay_cnvc_cursor =3D 6, > + .dispclk_delay_subtotal =3D 87, // > + .dcfclk_cstate_latency =3D 10, // SRExitTime > + .max_inter_dcn_tile_repeaters =3D 8, > + .xfc_supported =3D true, > + .xfc_fill_bw_overhead_percent =3D 10.0, > + .xfc_fill_constant_bytes =3D 0, > + .ptoi_supported =3D 0, > + .number_of_cursors =3D 1, > +}; > + > + > +struct _vcs_dpi_soc_bounding_box_st dcn2_0_soc =3D { > + /* Defaults that get patched on driver load from firmware. */ > + .clock_limits =3D { > + { > + .state =3D 0, > + .dcfclk_mhz =3D 560.0, > + .fabricclk_mhz =3D 560.0, > + .dispclk_mhz =3D 513.0, > + .dppclk_mhz =3D 513.0, > + .phyclk_mhz =3D 540.0, > + .socclk_mhz =3D 560.0, > + .dscclk_mhz =3D 171.0, > + .dram_speed_mts =3D 8960.0, > + }, > + { > + .state =3D 1, > + .dcfclk_mhz =3D 694.0, > + .fabricclk_mhz =3D 694.0, > + .dispclk_mhz =3D 642.0, > + .dppclk_mhz =3D 642.0, > + .phyclk_mhz =3D 600.0, > + .socclk_mhz =3D 694.0, > + .dscclk_mhz =3D 214.0, > + .dram_speed_mts =3D 11104.0, > + }, > + { > + .state =3D 2, > + .dcfclk_mhz =3D 875.0, > + .fabricclk_mhz =3D 875.0, > + .dispclk_mhz =3D 734.0, > + .dppclk_mhz =3D 734.0, > + .phyclk_mhz =3D 810.0, > + .socclk_mhz =3D 875.0, > + .dscclk_mhz =3D 245.0, > + .dram_speed_mts =3D 14000.0, > + }, > + { > + .state =3D 3, > + .dcfclk_mhz =3D 1000.0, > + .fabricclk_mhz =3D 1000.0, > + .dispclk_mhz =3D 1100.0, > + .dppclk_mhz =3D 1100.0, > + .phyclk_mhz =3D 810.0, > + .socclk_mhz =3D 1000.0, > + .dscclk_mhz =3D 367.0, > + .dram_speed_mts =3D 16000.0, > + }, > + { > + .state =3D 4, > + .dcfclk_mhz =3D 1200.0, > + .fabricclk_mhz =3D 1200.0, > + .dispclk_mhz =3D 1284.0, > + .dppclk_mhz =3D 1284.0, > + .phyclk_mhz =3D 810.0, > + .socclk_mhz =3D 1200.0, > + .dscclk_mhz =3D 428.0, > + .dram_speed_mts =3D 16000.0, > + }, > + /*Extra state, no dispclk ramping*/ > + { > + .state =3D 5, > + .dcfclk_mhz =3D 1200.0, > + .fabricclk_mhz =3D 1200.0, > + .dispclk_mhz =3D 1284.0, > + .dppclk_mhz =3D 1284.0, > + .phyclk_mhz =3D 810.0, > + .socclk_mhz =3D 1200.0, > + .dscclk_mhz =3D 428.0, > + .dram_speed_mts =3D 16000.0, > + }, > + }, > + .num_states =3D 5, > + .sr_exit_time_us =3D 8.6, > + .sr_enter_plus_exit_time_us =3D 10.9, > + .urgent_latency_us =3D 4.0, > + .urgent_latency_pixel_data_only_us =3D 4.0, > + .urgent_latency_pixel_mixed_with_vm_data_us =3D 4.0, > + .urgent_latency_vm_data_only_us =3D 4.0, > + .urgent_out_of_order_return_per_channel_pixel_only_bytes =3D 4096, > + .urgent_out_of_order_return_per_channel_pixel_and_vm_bytes =3D 40= 96, > + .urgent_out_of_order_return_per_channel_vm_only_bytes =3D 4096, > + .pct_ideal_dram_sdp_bw_after_urgent_pixel_only =3D 40.0, > + .pct_ideal_dram_sdp_bw_after_urgent_pixel_and_vm =3D 40.0, > + .pct_ideal_dram_sdp_bw_after_urgent_vm_only =3D 40.0, > + .max_avg_sdp_bw_use_normal_percent =3D 40.0, > + .max_avg_dram_bw_use_normal_percent =3D 40.0, > + .writeback_latency_us =3D 12.0, > + .ideal_dram_bw_after_urgent_percent =3D 40.0, > + .max_request_size_bytes =3D 256, > + .dram_channel_width_bytes =3D 2, > + .fabric_datapath_to_dcn_data_return_bytes =3D 64, > + .dcn_downspread_percent =3D 0.5, > + .downspread_percent =3D 0.38, > + .dram_page_open_time_ns =3D 50.0, > + .dram_rw_turnaround_time_ns =3D 17.5, > + .dram_return_buffer_per_channel_bytes =3D 8192, > + .round_trip_ping_latency_dcfclk_cycles =3D 131, > + .urgent_out_of_order_return_per_channel_bytes =3D 256, > + .channel_interleave_bytes =3D 256, > + .num_banks =3D 8, > + .num_chans =3D 16, > + .vmm_page_size_bytes =3D 4096, > + .dram_clock_change_latency_us =3D 404.0, > + .dummy_pstate_latency_us =3D 5.0, > + .writeback_dram_clock_change_latency_us =3D 23.0, > + .return_bus_width_bytes =3D 64, > + .dispclk_dppclk_vco_speed_mhz =3D 3850, > + .xfc_bus_transport_time_us =3D 20, > + .xfc_xbuf_latency_tolerance_us =3D 4, > + .use_urgent_burst_bw =3D 0 > +}; > + > +struct _vcs_dpi_soc_bounding_box_st dcn2_0_nv14_soc =3D { > + .clock_limits =3D { > + { > + .state =3D 0, > + .dcfclk_mhz =3D 560.0, > + .fabricclk_mhz =3D 560.0, > + .dispclk_mhz =3D 513.0, > + .dppclk_mhz =3D 513.0, > + .phyclk_mhz =3D 540.0, > + .socclk_mhz =3D 560.0, > + .dscclk_mhz =3D 171.0, > + .dram_speed_mts =3D 8960.0, > + }, > + { > + .state =3D 1, > + .dcfclk_mhz =3D 694.0, > + .fabricclk_mhz =3D 694.0, > + .dispclk_mhz =3D 642.0, > + .dppclk_mhz =3D 642.0, > + .phyclk_mhz =3D 600.0, > + .socclk_mhz =3D 694.0, > + .dscclk_mhz =3D 214.0, > + .dram_speed_mts =3D 11104.0, > + }, > + { > + .state =3D 2, > + .dcfclk_mhz =3D 875.0, > + .fabricclk_mhz =3D 875.0, > + .dispclk_mhz =3D 734.0, > + .dppclk_mhz =3D 734.0, > + .phyclk_mhz =3D 810.0, > + .socclk_mhz =3D 875.0, > + .dscclk_mhz =3D 245.0, > + .dram_speed_mts =3D 14000.0, > + }, > + { > + .state =3D 3, > + .dcfclk_mhz =3D 1000.0, > + .fabricclk_mhz =3D 1000.0, > + .dispclk_mhz =3D 1100.0, > + .dppclk_mhz =3D 1100.0, > + .phyclk_mhz =3D 810.0, > + .socclk_mhz =3D 1000.0, > + .dscclk_mhz =3D 367.0, > + .dram_speed_mts =3D 16000.0, > + }, > + { > + .state =3D 4, > + .dcfclk_mhz =3D 1200.0, > + .fabricclk_mhz =3D 1200.0, > + .dispclk_mhz =3D 1284.0, > + .dppclk_mhz =3D 1284.0, > + .phyclk_mhz =3D 810.0, > + .socclk_mhz =3D 1200.0, > + .dscclk_mhz =3D 428.0, > + .dram_speed_mts =3D 16000.0, > + }, > + /*Extra state, no dispclk ramping*/ > + { > + .state =3D 5, > + .dcfclk_mhz =3D 1200.0, > + .fabricclk_mhz =3D 1200.0, > + .dispclk_mhz =3D 1284.0, > + .dppclk_mhz =3D 1284.0, > + .phyclk_mhz =3D 810.0, > + .socclk_mhz =3D 1200.0, > + .dscclk_mhz =3D 428.0, > + .dram_speed_mts =3D 16000.0, > + }, > + }, > + .num_states =3D 5, > + .sr_exit_time_us =3D 11.6, > + .sr_enter_plus_exit_time_us =3D 13.9, > + .urgent_latency_us =3D 4.0, > + .urgent_latency_pixel_data_only_us =3D 4.0, > + .urgent_latency_pixel_mixed_with_vm_data_us =3D 4.0, > + .urgent_latency_vm_data_only_us =3D 4.0, > + .urgent_out_of_order_return_per_channel_pixel_only_bytes =3D 4096, > + .urgent_out_of_order_return_per_channel_pixel_and_vm_bytes =3D 40= 96, > + .urgent_out_of_order_return_per_channel_vm_only_bytes =3D 4096, > + .pct_ideal_dram_sdp_bw_after_urgent_pixel_only =3D 40.0, > + .pct_ideal_dram_sdp_bw_after_urgent_pixel_and_vm =3D 40.0, > + .pct_ideal_dram_sdp_bw_after_urgent_vm_only =3D 40.0, > + .max_avg_sdp_bw_use_normal_percent =3D 40.0, > + .max_avg_dram_bw_use_normal_percent =3D 40.0, > + .writeback_latency_us =3D 12.0, > + .ideal_dram_bw_after_urgent_percent =3D 40.0, > + .max_request_size_bytes =3D 256, > + .dram_channel_width_bytes =3D 2, > + .fabric_datapath_to_dcn_data_return_bytes =3D 64, > + .dcn_downspread_percent =3D 0.5, > + .downspread_percent =3D 0.38, > + .dram_page_open_time_ns =3D 50.0, > + .dram_rw_turnaround_time_ns =3D 17.5, > + .dram_return_buffer_per_channel_bytes =3D 8192, > + .round_trip_ping_latency_dcfclk_cycles =3D 131, > + .urgent_out_of_order_return_per_channel_bytes =3D 256, > + .channel_interleave_bytes =3D 256, > + .num_banks =3D 8, > + .num_chans =3D 8, > + .vmm_page_size_bytes =3D 4096, > + .dram_clock_change_latency_us =3D 404.0, > + .dummy_pstate_latency_us =3D 5.0, > + .writeback_dram_clock_change_latency_us =3D 23.0, > + .return_bus_width_bytes =3D 64, > + .dispclk_dppclk_vco_speed_mhz =3D 3850, > + .xfc_bus_transport_time_us =3D 20, > + .xfc_xbuf_latency_tolerance_us =3D 4, > + .use_urgent_burst_bw =3D 0 > +}; > + > +struct _vcs_dpi_soc_bounding_box_st dcn2_0_nv12_soc =3D { 0 }; > + > +#define DC_LOGGER_INIT(logger) > + > void dcn20_populate_dml_writeback_from_context(struct dc *dc, > struct resource_context *= res_ctx, > display_e2e_pipe_params_s= t *pipes) > @@ -100,3 +467,1021 @@ void dcn20_populate_dml_writeback_from_context(str= uct dc *dc, > pipe_cnt++; > } > } > + > +void dcn20_fpu_set_wb_arb_params(struct mcif_arb_params *wb_arb_params, > + struct dc_state *context, > + display_e2e_pipe_params_st *pipes, > + int pipe_cnt, int i) > +{ > + int k; > + > + dc_assert_fp_enabled(); > + > + for (k =3D 0; k < sizeof(wb_arb_params->cli_watermark)/sizeof(wb_= arb_params->cli_watermark[0]); k++) { > + wb_arb_params->cli_watermark[k] =3D get_wm_writeback_urge= nt(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + wb_arb_params->pstate_watermark[k] =3D get_wm_writeback_d= ram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + } > + wb_arb_params->time_per_pixel =3D 16.0 * 1000 / (context->res_ctx= =2Epipe_ctx[i].stream->phy_pix_clk / 1000); /* 4 bit fraction, ms */ > +} > + > +static bool is_dtbclk_required(struct dc *dc, struct dc_state *context) > +{ > + int i; > + > + for (i =3D 0; i < dc->res_pool->pipe_count; i++) { > + if (!context->res_ctx.pipe_ctx[i].stream) > + continue; > + if (is_dp_128b_132b_signal(&context->res_ctx.pipe_ctx[i])) > + return true; > + } > + return false; > +} > + > +static enum dcn_zstate_support_state decide_zstate_support(struct dc *dc= ,struct dc_state *context) > +{ > + int plane_count; > + int i; > + > + plane_count =3D 0; > + for (i =3D 0; i < dc->res_pool->pipe_count; i++) { > + if (context->res_ctx.pipe_ctx[i].plane_state) > + plane_count++; > + } > + > + /* > + * Zstate is allowed in following scenarios: > + * 1. Single eDP with PSR enabled > + * 2. 0 planes (No memory requests) > + * 3. Single eDP without PSR but > 5ms stutter period > + */ > + if (plane_count =3D=3D 0) > + return DCN_ZSTATE_SUPPORT_ALLOW; > + else if (context->stream_count =3D=3D 1 && context->streams[0]->= signal =3D=3D SIGNAL_TYPE_EDP) { > + struct dc_link *link =3D context->streams[0]->sink->link; > + > + /* zstate only supported on PWRSEQ0 */ > + if (link->link_index !=3D 0) > + return DCN_ZSTATE_SUPPORT_DISALLOW; > + > + if (context->bw_ctx.dml.vba.StutterPeriod > 5000.0) > + return DCN_ZSTATE_SUPPORT_ALLOW; > + else if (link->psr_settings.psr_version =3D=3D DC_PSR_VER= SION_1 && !dc->debug.disable_psr) > + return DCN_ZSTATE_SUPPORT_ALLOW_Z10_ONLY; > + else > + return DCN_ZSTATE_SUPPORT_DISALLOW; > + } else > + return DCN_ZSTATE_SUPPORT_DISALLOW; > +} > + > +void dcn20_calculate_dlg_params(struct dc *dc, > + struct dc_state *context, > + display_e2e_pipe_params_st *pipes, > + int pipe_cnt, > + int vlevel) > +{ > + int i, pipe_idx; > + > + dc_assert_fp_enabled(); > + > + /* Writeback MCIF_WB arbitration parameters */ > + dc->res_pool->funcs->set_mcif_arb_params(dc, context, pipes, pipe= _cnt); > + > + context->bw_ctx.bw.dcn.clk.dispclk_khz =3D context->bw_ctx.dml.vb= a.DISPCLK * 1000; > + context->bw_ctx.bw.dcn.clk.dcfclk_khz =3D context->bw_ctx.dml.vba= =2EDCFCLK * 1000; > + context->bw_ctx.bw.dcn.clk.socclk_khz =3D context->bw_ctx.dml.vba= =2ESOCCLK * 1000; > + context->bw_ctx.bw.dcn.clk.dramclk_khz =3D context->bw_ctx.dml.vb= a.DRAMSpeed * 1000 / 16; > + > + if (dc->debug.min_dram_clk_khz > context->bw_ctx.bw.dcn.clk.dramc= lk_khz) > + context->bw_ctx.bw.dcn.clk.dramclk_khz =3D dc->debug.min_= dram_clk_khz; > + > + context->bw_ctx.bw.dcn.clk.dcfclk_deep_sleep_khz =3D context->bw_= ctx.dml.vba.DCFCLKDeepSleep * 1000; > + context->bw_ctx.bw.dcn.clk.fclk_khz =3D context->bw_ctx.dml.vba.F= abricClock * 1000; > + context->bw_ctx.bw.dcn.clk.p_state_change_support =3D > + context->bw_ctx.dml.vba.DRAMClockChangeSupport[vlevel][co= ntext->bw_ctx.dml.vba.maxMpcComb] > + !=3D dm_dram_cloc= k_change_unsupported; > + context->bw_ctx.bw.dcn.clk.dppclk_khz =3D 0; > + > + context->bw_ctx.bw.dcn.clk.zstate_support =3D decide_zstate_suppo= rt(dc, context); > + > + context->bw_ctx.bw.dcn.clk.dtbclk_en =3D is_dtbclk_required(dc, c= ontext); > + > + if (context->bw_ctx.bw.dcn.clk.dispclk_khz < dc->debug.min_disp_c= lk_khz) > + context->bw_ctx.bw.dcn.clk.dispclk_khz =3D dc->debug.min_= disp_clk_khz; > + > + for (i =3D 0, pipe_idx =3D 0; i < dc->res_pool->pipe_count; i++) { > + if (!context->res_ctx.pipe_ctx[i].stream) > + continue; > + pipes[pipe_idx].pipe.dest.vstartup_start =3D get_vstartup= (&context->bw_ctx.dml, pipes, pipe_cnt, pipe_idx); > + pipes[pipe_idx].pipe.dest.vupdate_offset =3D get_vupdate_= offset(&context->bw_ctx.dml, pipes, pipe_cnt, pipe_idx); > + pipes[pipe_idx].pipe.dest.vupdate_width =3D get_vupdate_w= idth(&context->bw_ctx.dml, pipes, pipe_cnt, pipe_idx); > + pipes[pipe_idx].pipe.dest.vready_offset =3D get_vready_of= fset(&context->bw_ctx.dml, pipes, pipe_cnt, pipe_idx); > + context->res_ctx.pipe_ctx[i].det_buffer_size_kb =3D conte= xt->bw_ctx.dml.ip.det_buffer_size_kbytes; > + context->res_ctx.pipe_ctx[i].unbounded_req =3D pipes[pipe= _idx].pipe.src.unbounded_req_mode; > + > + if (context->bw_ctx.bw.dcn.clk.dppclk_khz < pipes[pipe_id= x].clks_cfg.dppclk_mhz * 1000) > + context->bw_ctx.bw.dcn.clk.dppclk_khz =3D pipes[p= ipe_idx].clks_cfg.dppclk_mhz * 1000; > + context->res_ctx.pipe_ctx[i].plane_res.bw.dppclk_khz =3D > + pipes[pipe_idx].clks_cfg.= dppclk_mhz * 1000; > + context->res_ctx.pipe_ctx[i].pipe_dlg_param =3D pipes[pip= e_idx].pipe.dest; > + pipe_idx++; > + } > + /*save a original dppclock copy*/ > + context->bw_ctx.bw.dcn.clk.bw_dppclk_khz =3D context->bw_ctx.bw.d= cn.clk.dppclk_khz; > + context->bw_ctx.bw.dcn.clk.bw_dispclk_khz =3D context->bw_ctx.bw.= dcn.clk.dispclk_khz; > + context->bw_ctx.bw.dcn.clk.max_supported_dppclk_khz =3D context->= bw_ctx.dml.soc.clock_limits[vlevel].dppclk_mhz * 1000; > + context->bw_ctx.bw.dcn.clk.max_supported_dispclk_khz =3D context-= >bw_ctx.dml.soc.clock_limits[vlevel].dispclk_mhz * 1000; > + > + context->bw_ctx.bw.dcn.compbuf_size_kb =3D context->bw_ctx.dml.ip= =2Econfig_return_buffer_size_in_kbytes > + - context->bw_ctx.dml.ip.= det_buffer_size_kbytes * pipe_idx; > + > + for (i =3D 0, pipe_idx =3D 0; i < dc->res_pool->pipe_count; i++) { > + bool cstate_en =3D context->bw_ctx.dml.vba.PrefetchMode[v= level][context->bw_ctx.dml.vba.maxMpcComb] !=3D 2; > + > + if (!context->res_ctx.pipe_ctx[i].stream) > + continue; > + > + if (dc->ctx->dce_version =3D=3D DCN_VERSION_2_01) > + cstate_en =3D false; > + > + context->bw_ctx.dml.funcs.rq_dlg_get_dlg_reg(&context->bw= _ctx.dml, > + &context->res_ctx.pipe_ctx[i].dlg_regs, > + &context->res_ctx.pipe_ctx[i].ttu_regs, > + pipes, > + pipe_cnt, > + pipe_idx, > + cstate_en, > + context->bw_ctx.bw.dcn.clk.p_state_change= _support, > + false, false, true); > + > + context->bw_ctx.dml.funcs.rq_dlg_get_rq_reg(&context->bw_= ctx.dml, > + &context->res_ctx.pipe_ctx[i].rq_regs, > + &pipes[pipe_idx].pipe); > + pipe_idx++; > + } > +} > + > +static void swizzle_to_dml_params(enum swizzle_mode_values swizzle, > + unsigned int *sw_mode) > +{ > + switch (swizzle) { > + case DC_SW_LINEAR: > + *sw_mode =3D dm_sw_linear; > + break; > + case DC_SW_4KB_S: > + *sw_mode =3D dm_sw_4kb_s; > + break; > + case DC_SW_4KB_S_X: > + *sw_mode =3D dm_sw_4kb_s_x; > + break; > + case DC_SW_4KB_D: > + *sw_mode =3D dm_sw_4kb_d; > + break; > + case DC_SW_4KB_D_X: > + *sw_mode =3D dm_sw_4kb_d_x; > + break; > + case DC_SW_64KB_S: > + *sw_mode =3D dm_sw_64kb_s; > + break; > + case DC_SW_64KB_S_X: > + *sw_mode =3D dm_sw_64kb_s_x; > + break; > + case DC_SW_64KB_S_T: > + *sw_mode =3D dm_sw_64kb_s_t; > + break; > + case DC_SW_64KB_D: > + *sw_mode =3D dm_sw_64kb_d; > + break; > + case DC_SW_64KB_D_X: > + *sw_mode =3D dm_sw_64kb_d_x; > + break; > + case DC_SW_64KB_D_T: > + *sw_mode =3D dm_sw_64kb_d_t; > + break; > + case DC_SW_64KB_R_X: > + *sw_mode =3D dm_sw_64kb_r_x; > + break; > + case DC_SW_VAR_S: > + *sw_mode =3D dm_sw_var_s; > + break; > + case DC_SW_VAR_S_X: > + *sw_mode =3D dm_sw_var_s_x; > + break; > + case DC_SW_VAR_D: > + *sw_mode =3D dm_sw_var_d; > + break; > + case DC_SW_VAR_D_X: > + *sw_mode =3D dm_sw_var_d_x; > + break; > + case DC_SW_VAR_R_X: > + *sw_mode =3D dm_sw_var_r_x; > + break; > + default: > + ASSERT(0); /* Not supported */ > + break; > + } > +} > + > +int dcn20_populate_dml_pipes_from_context(struct dc *dc, > + struct dc_state *context, > + display_e2e_pipe_params_st *pip= es, > + bool fast_validate) > +{ > + int pipe_cnt, i; > + bool synchronized_vblank =3D true; > + struct resource_context *res_ctx =3D &context->res_ctx; > + > + dc_assert_fp_enabled(); > + > + for (i =3D 0, pipe_cnt =3D -1; i < dc->res_pool->pipe_count; i++)= { > + if (!res_ctx->pipe_ctx[i].stream) > + continue; > + > + if (pipe_cnt < 0) { > + pipe_cnt =3D i; > + continue; > + } > + > + if (res_ctx->pipe_ctx[pipe_cnt].stream =3D=3D res_ctx->pi= pe_ctx[i].stream) > + continue; > + > + if (dc->debug.disable_timing_sync || > + (!resource_are_streams_timing_synchronizable( > + res_ctx->pipe_ctx[pipe_cnt].stream, > + res_ctx->pipe_ctx[i].stream) && > + !resource_are_vblanks_synchronizable( > + res_ctx->pipe_ctx[pipe_cnt].stream, > + res_ctx->pipe_ctx[i].stream))) { > + synchronized_vblank =3D false; > + break; > + } > + } > + > + for (i =3D 0, pipe_cnt =3D 0; i < dc->res_pool->pipe_count; i++) { > + struct dc_crtc_timing *timing =3D &res_ctx->pipe_ctx[i].s= tream->timing; > + unsigned int v_total; > + unsigned int front_porch; > + int output_bpc; > + struct audio_check aud_check =3D {0}; > + > + if (!res_ctx->pipe_ctx[i].stream) > + continue; > + > + v_total =3D timing->v_total; > + front_porch =3D timing->v_front_porch; > + > + /* todo: > + pipes[pipe_cnt].pipe.src.dynamic_metadata_enable =3D 0; > + pipes[pipe_cnt].pipe.src.dcc =3D 0; > + pipes[pipe_cnt].pipe.src.vm =3D 0;*/ > + > + pipes[pipe_cnt].clks_cfg.refclk_mhz =3D dc->res_pool->ref= _clocks.dchub_ref_clock_inKhz / 1000.0; > + > + pipes[pipe_cnt].dout.dsc_enable =3D res_ctx->pipe_ctx[i].= stream->timing.flags.DSC; > + /* todo: rotation?*/ > + pipes[pipe_cnt].dout.dsc_slices =3D res_ctx->pipe_ctx[i].= stream->timing.dsc_cfg.num_slices_h; > + if (res_ctx->pipe_ctx[i].stream->use_dynamic_meta) { > + pipes[pipe_cnt].pipe.src.dynamic_metadata_enable = =3D true; > + /* 1/2 vblank */ > + pipes[pipe_cnt].pipe.src.dynamic_metadata_lines_b= efore_active =3D > + (v_total - timing->v_addressable > + - timing->v_border_top - timing->= v_border_bottom) / 2; > + /* 36 bytes dp, 32 hdmi */ > + pipes[pipe_cnt].pipe.src.dynamic_metadata_xmit_by= tes =3D > + dc_is_dp_signal(res_ctx->pipe_ctx[i].stre= am->signal) ? 36 : 32; > + } > + pipes[pipe_cnt].pipe.src.dcc =3D false; > + pipes[pipe_cnt].pipe.src.dcc_rate =3D 1; > + pipes[pipe_cnt].pipe.dest.synchronized_vblank_all_planes = =3D synchronized_vblank; > + pipes[pipe_cnt].pipe.dest.hblank_start =3D timing->h_tota= l - timing->h_front_porch; > + pipes[pipe_cnt].pipe.dest.hblank_end =3D pipes[pipe_cnt].= pipe.dest.hblank_start > + - timing->h_addressable > + - timing->h_border_left > + - timing->h_border_right; > + pipes[pipe_cnt].pipe.dest.vblank_start =3D v_total - fron= t_porch; > + pipes[pipe_cnt].pipe.dest.vblank_end =3D pipes[pipe_cnt].= pipe.dest.vblank_start > + - timing->v_addressable > + - timing->v_border_top > + - timing->v_border_bottom; > + pipes[pipe_cnt].pipe.dest.htotal =3D timing->h_total; > + pipes[pipe_cnt].pipe.dest.vtotal =3D v_total; > + pipes[pipe_cnt].pipe.dest.hactive =3D > + timing->h_addressable + timing->h_border_left + t= iming->h_border_right; > + pipes[pipe_cnt].pipe.dest.vactive =3D > + timing->v_addressable + timing->v_border_top + ti= ming->v_border_bottom; > + pipes[pipe_cnt].pipe.dest.interlaced =3D timing->flags.IN= TERLACE; > + pipes[pipe_cnt].pipe.dest.pixel_rate_mhz =3D timing->pix_= clk_100hz/10000.0; > + if (timing->timing_3d_format =3D=3D TIMING_3D_FORMAT_HW_F= RAME_PACKING) > + pipes[pipe_cnt].pipe.dest.pixel_rate_mhz *=3D 2; > + pipes[pipe_cnt].pipe.dest.otg_inst =3D res_ctx->pipe_ctx[= i].stream_res.tg->inst; > + pipes[pipe_cnt].dout.dp_lanes =3D 4; > + pipes[pipe_cnt].dout.is_virtual =3D 0; > + pipes[pipe_cnt].pipe.dest.vtotal_min =3D res_ctx->pipe_ct= x[i].stream->adjust.v_total_min; > + pipes[pipe_cnt].pipe.dest.vtotal_max =3D res_ctx->pipe_ct= x[i].stream->adjust.v_total_max; > + switch (get_num_odm_splits(&res_ctx->pipe_ctx[i])) { > + case 1: > + pipes[pipe_cnt].pipe.dest.odm_combine =3D dm_odm_= combine_mode_2to1; > + break; > + case 3: > + pipes[pipe_cnt].pipe.dest.odm_combine =3D dm_odm_= combine_mode_4to1; > + break; > + default: > + pipes[pipe_cnt].pipe.dest.odm_combine =3D dm_odm_= combine_mode_disabled; > + } > + pipes[pipe_cnt].pipe.src.hsplit_grp =3D res_ctx->pipe_ctx= [i].pipe_idx; > + if (res_ctx->pipe_ctx[i].top_pipe && res_ctx->pipe_ctx[i]= =2Etop_pipe->plane_state > + =3D=3D res_ctx->pipe_ctx[i].plane_state) { > + struct pipe_ctx *first_pipe =3D res_ctx->pipe_ctx= [i].top_pipe; > + int split_idx =3D 0; > + > + while (first_pipe->top_pipe && first_pipe->top_pi= pe->plane_state > + =3D=3D res_ctx->pipe_ctx[i].plane= _state) { > + first_pipe =3D first_pipe->top_pipe; > + split_idx++; > + } > + /* Treat 4to1 mpc combine as an mpo of 2 2-to-1 c= ombines */ > + if (split_idx =3D=3D 0) > + pipes[pipe_cnt].pipe.src.hsplit_grp =3D f= irst_pipe->pipe_idx; > + else if (split_idx =3D=3D 1) > + pipes[pipe_cnt].pipe.src.hsplit_grp =3D r= es_ctx->pipe_ctx[i].pipe_idx; > + else if (split_idx =3D=3D 2) > + pipes[pipe_cnt].pipe.src.hsplit_grp =3D r= es_ctx->pipe_ctx[i].top_pipe->pipe_idx; > + } else if (res_ctx->pipe_ctx[i].prev_odm_pipe) { > + struct pipe_ctx *first_pipe =3D res_ctx->pipe_ctx= [i].prev_odm_pipe; > + > + while (first_pipe->prev_odm_pipe) > + first_pipe =3D first_pipe->prev_odm_pipe; > + pipes[pipe_cnt].pipe.src.hsplit_grp =3D first_pip= e->pipe_idx; > + } > + > + switch (res_ctx->pipe_ctx[i].stream->signal) { > + case SIGNAL_TYPE_DISPLAY_PORT_MST: > + case SIGNAL_TYPE_DISPLAY_PORT: > + pipes[pipe_cnt].dout.output_type =3D dm_dp; > + break; > + case SIGNAL_TYPE_EDP: > + pipes[pipe_cnt].dout.output_type =3D dm_edp; > + break; > + case SIGNAL_TYPE_HDMI_TYPE_A: > + case SIGNAL_TYPE_DVI_SINGLE_LINK: > + case SIGNAL_TYPE_DVI_DUAL_LINK: > + pipes[pipe_cnt].dout.output_type =3D dm_hdmi; > + break; > + default: > + /* In case there is no signal, set dp with 4 lane= s to allow max config */ > + pipes[pipe_cnt].dout.is_virtual =3D 1; > + pipes[pipe_cnt].dout.output_type =3D dm_dp; > + pipes[pipe_cnt].dout.dp_lanes =3D 4; > + } > + > + switch (res_ctx->pipe_ctx[i].stream->timing.display_color= _depth) { > + case COLOR_DEPTH_666: > + output_bpc =3D 6; > + break; > + case COLOR_DEPTH_888: > + output_bpc =3D 8; > + break; > + case COLOR_DEPTH_101010: > + output_bpc =3D 10; > + break; > + case COLOR_DEPTH_121212: > + output_bpc =3D 12; > + break; > + case COLOR_DEPTH_141414: > + output_bpc =3D 14; > + break; > + case COLOR_DEPTH_161616: > + output_bpc =3D 16; > + break; > + case COLOR_DEPTH_999: > + output_bpc =3D 9; > + break; > + case COLOR_DEPTH_111111: > + output_bpc =3D 11; > + break; > + default: > + output_bpc =3D 8; > + break; > + } > + > + switch (res_ctx->pipe_ctx[i].stream->timing.pixel_encodin= g) { > + case PIXEL_ENCODING_RGB: > + case PIXEL_ENCODING_YCBCR444: > + pipes[pipe_cnt].dout.output_format =3D dm_444; > + pipes[pipe_cnt].dout.output_bpp =3D output_bpc * = 3; > + break; > + case PIXEL_ENCODING_YCBCR420: > + pipes[pipe_cnt].dout.output_format =3D dm_420; > + pipes[pipe_cnt].dout.output_bpp =3D (output_bpc *= 3.0) / 2; > + break; > + case PIXEL_ENCODING_YCBCR422: > + if (res_ctx->pipe_ctx[i].stream->timing.flags.DSC= && > + !res_ctx->pipe_ctx[i].stream->timing.dsc_cfg.= ycbcr422_simple) > + pipes[pipe_cnt].dout.output_format =3D dm= _n422; > + else > + pipes[pipe_cnt].dout.output_format =3D dm= _s422; > + pipes[pipe_cnt].dout.output_bpp =3D output_bpc * = 2; > + break; > + default: > + pipes[pipe_cnt].dout.output_format =3D dm_444; > + pipes[pipe_cnt].dout.output_bpp =3D output_bpc * = 3; > + } > + > + if (res_ctx->pipe_ctx[i].stream->timing.flags.DSC) > + pipes[pipe_cnt].dout.output_bpp =3D res_ctx->pipe= _ctx[i].stream->timing.dsc_cfg.bits_per_pixel / 16.0; > + > + /* todo: default max for now, until there is logic reflec= ting this in dc*/ > + pipes[pipe_cnt].dout.dsc_input_bpc =3D 12; > + /*fill up the audio sample rate (unit in kHz)*/ > + get_audio_check(&res_ctx->pipe_ctx[i].stream->audio_info,= &aud_check); > + pipes[pipe_cnt].dout.max_audio_sample_rate =3D aud_check.= max_audiosample_rate / 1000; > + /* > + * For graphic plane, cursor number is 1, nv12 is 0 > + * bw calculations due to cursor on/off > + */ > + if (res_ctx->pipe_ctx[i].plane_state && > + res_ctx->pipe_ctx[i].plane_state->address= =2Etype =3D=3D PLN_ADDR_TYPE_VIDEO_PROGRESSIVE) > + pipes[pipe_cnt].pipe.src.num_cursors =3D 0; > + else > + pipes[pipe_cnt].pipe.src.num_cursors =3D dc->dml.= ip.number_of_cursors; > + > + pipes[pipe_cnt].pipe.src.cur0_src_width =3D 256; > + pipes[pipe_cnt].pipe.src.cur0_bpp =3D dm_cur_32bit; > + > + if (!res_ctx->pipe_ctx[i].plane_state) { > + pipes[pipe_cnt].pipe.src.is_hsplit =3D pipes[pipe= _cnt].pipe.dest.odm_combine !=3D dm_odm_combine_mode_disabled; > + pipes[pipe_cnt].pipe.src.source_scan =3D dm_horz; > + pipes[pipe_cnt].pipe.src.sw_mode =3D dm_sw_4kb_s; > + pipes[pipe_cnt].pipe.src.macro_tile_size =3D dm_6= 4k_tile; > + pipes[pipe_cnt].pipe.src.viewport_width =3D timin= g->h_addressable; > + if (pipes[pipe_cnt].pipe.src.viewport_width > 192= 0) > + pipes[pipe_cnt].pipe.src.viewport_width = =3D 1920; > + pipes[pipe_cnt].pipe.src.viewport_height =3D timi= ng->v_addressable; > + if (pipes[pipe_cnt].pipe.src.viewport_height > 10= 80) > + pipes[pipe_cnt].pipe.src.viewport_height = =3D 1080; > + pipes[pipe_cnt].pipe.src.surface_height_y =3D pip= es[pipe_cnt].pipe.src.viewport_height; > + pipes[pipe_cnt].pipe.src.surface_width_y =3D pipe= s[pipe_cnt].pipe.src.viewport_width; > + pipes[pipe_cnt].pipe.src.surface_height_c =3D pip= es[pipe_cnt].pipe.src.viewport_height; > + pipes[pipe_cnt].pipe.src.surface_width_c =3D pipe= s[pipe_cnt].pipe.src.viewport_width; > + pipes[pipe_cnt].pipe.src.data_pitch =3D ((pipes[p= ipe_cnt].pipe.src.viewport_width + 255) / 256) * 256; > + pipes[pipe_cnt].pipe.src.source_format =3D dm_444= _32; > + pipes[pipe_cnt].pipe.dest.recout_width =3D pipes[= pipe_cnt].pipe.src.viewport_width; /*vp_width/hratio*/ > + pipes[pipe_cnt].pipe.dest.recout_height =3D pipes= [pipe_cnt].pipe.src.viewport_height; /*vp_height/vratio*/ > + pipes[pipe_cnt].pipe.dest.full_recout_width =3D p= ipes[pipe_cnt].pipe.dest.recout_width; /*when is_hsplit !=3D 1*/ > + pipes[pipe_cnt].pipe.dest.full_recout_height =3D = pipes[pipe_cnt].pipe.dest.recout_height; /*when is_hsplit !=3D 1*/ > + pipes[pipe_cnt].pipe.scale_ratio_depth.lb_depth = =3D dm_lb_16; > + pipes[pipe_cnt].pipe.scale_ratio_depth.hscl_ratio= =3D 1.0; > + pipes[pipe_cnt].pipe.scale_ratio_depth.vscl_ratio= =3D 1.0; > + pipes[pipe_cnt].pipe.scale_ratio_depth.scl_enable= =3D 0; /*Lb only or Full scl*/ > + pipes[pipe_cnt].pipe.scale_taps.htaps =3D 1; > + pipes[pipe_cnt].pipe.scale_taps.vtaps =3D 1; > + pipes[pipe_cnt].pipe.dest.vtotal_min =3D v_total; > + pipes[pipe_cnt].pipe.dest.vtotal_max =3D v_total; > + > + if (pipes[pipe_cnt].pipe.dest.odm_combine =3D=3D = dm_odm_combine_mode_2to1) { > + pipes[pipe_cnt].pipe.src.viewport_width /= =3D 2; > + pipes[pipe_cnt].pipe.dest.recout_width /= =3D 2; > + } else if (pipes[pipe_cnt].pipe.dest.odm_combine = =3D=3D dm_odm_combine_mode_4to1) { > + pipes[pipe_cnt].pipe.src.viewport_width /= =3D 4; > + pipes[pipe_cnt].pipe.dest.recout_width /= =3D 4; > + } > + } else { > + struct dc_plane_state *pln =3D res_ctx->pipe_ctx[= i].plane_state; > + struct scaler_data *scl =3D &res_ctx->pipe_ctx[i]= =2Eplane_res.scl_data; > + > + pipes[pipe_cnt].pipe.src.immediate_flip =3D pln->= flip_immediate; > + pipes[pipe_cnt].pipe.src.is_hsplit =3D (res_ctx->= pipe_ctx[i].bottom_pipe && res_ctx->pipe_ctx[i].bottom_pipe->plane_state = =3D=3D pln) > + || (res_ctx->pipe_ctx[i].top_pipe= && res_ctx->pipe_ctx[i].top_pipe->plane_state =3D=3D pln) > + || pipes[pipe_cnt].pipe.dest.odm_= combine !=3D dm_odm_combine_mode_disabled; > + > + /* stereo is not split */ > + if (pln->stereo_format =3D=3D PLANE_STEREO_FORMAT= _SIDE_BY_SIDE || > + pln->stereo_format =3D=3D PLANE_STEREO_FORMAT= _TOP_AND_BOTTOM) { > + pipes[pipe_cnt].pipe.src.is_hsplit =3D fa= lse; > + pipes[pipe_cnt].pipe.src.hsplit_grp =3D r= es_ctx->pipe_ctx[i].pipe_idx; > + } > + > + pipes[pipe_cnt].pipe.src.source_scan =3D pln->rot= ation =3D=3D ROTATION_ANGLE_90 > + || pln->rotation =3D=3D ROTATION_= ANGLE_270 ? dm_vert : dm_horz; > + pipes[pipe_cnt].pipe.src.viewport_y_y =3D scl->vi= ewport.y; > + pipes[pipe_cnt].pipe.src.viewport_y_c =3D scl->vi= ewport_c.y; > + pipes[pipe_cnt].pipe.src.viewport_width =3D scl->= viewport.width; > + pipes[pipe_cnt].pipe.src.viewport_width_c =3D scl= ->viewport_c.width; > + pipes[pipe_cnt].pipe.src.viewport_height =3D scl-= >viewport.height; > + pipes[pipe_cnt].pipe.src.viewport_height_c =3D sc= l->viewport_c.height; > + pipes[pipe_cnt].pipe.src.viewport_width_max =3D p= ln->src_rect.width; > + pipes[pipe_cnt].pipe.src.viewport_height_max =3D = pln->src_rect.height; > + pipes[pipe_cnt].pipe.src.surface_width_y =3D pln-= >plane_size.surface_size.width; > + pipes[pipe_cnt].pipe.src.surface_height_y =3D pln= ->plane_size.surface_size.height; > + pipes[pipe_cnt].pipe.src.surface_width_c =3D pln-= >plane_size.chroma_size.width; > + pipes[pipe_cnt].pipe.src.surface_height_c =3D pln= ->plane_size.chroma_size.height; > + if (pln->format =3D=3D SURFACE_PIXEL_FORMAT_GRPH_= RGBE_ALPHA > + || pln->format >=3D SURFACE_PIXEL= _FORMAT_VIDEO_BEGIN) { > + pipes[pipe_cnt].pipe.src.data_pitch =3D p= ln->plane_size.surface_pitch; > + pipes[pipe_cnt].pipe.src.data_pitch_c =3D= pln->plane_size.chroma_pitch; > + pipes[pipe_cnt].pipe.src.meta_pitch =3D p= ln->dcc.meta_pitch; > + pipes[pipe_cnt].pipe.src.meta_pitch_c =3D= pln->dcc.meta_pitch_c; > + } else { > + pipes[pipe_cnt].pipe.src.data_pitch =3D p= ln->plane_size.surface_pitch; > + pipes[pipe_cnt].pipe.src.meta_pitch =3D p= ln->dcc.meta_pitch; > + } > + pipes[pipe_cnt].pipe.src.dcc =3D pln->dcc.enable; > + pipes[pipe_cnt].pipe.dest.recout_width =3D scl->r= ecout.width; > + pipes[pipe_cnt].pipe.dest.recout_height =3D scl->= recout.height; > + pipes[pipe_cnt].pipe.dest.full_recout_height =3D = scl->recout.height; > + pipes[pipe_cnt].pipe.dest.full_recout_width =3D s= cl->recout.width; > + if (pipes[pipe_cnt].pipe.dest.odm_combine =3D=3D = dm_odm_combine_mode_2to1) > + pipes[pipe_cnt].pipe.dest.full_recout_wid= th *=3D 2; > + else if (pipes[pipe_cnt].pipe.dest.odm_combine = =3D=3D dm_odm_combine_mode_4to1) > + pipes[pipe_cnt].pipe.dest.full_recout_wid= th *=3D 4; > + else { > + struct pipe_ctx *split_pipe =3D res_ctx->= pipe_ctx[i].bottom_pipe; > + > + while (split_pipe && split_pipe->plane_st= ate =3D=3D pln) { > + pipes[pipe_cnt].pipe.dest.full_re= cout_width +=3D split_pipe->plane_res.scl_data.recout.width; > + split_pipe =3D split_pipe->bottom= _pipe; > + } > + split_pipe =3D res_ctx->pipe_ctx[i].top_p= ipe; > + while (split_pipe && split_pipe->plane_st= ate =3D=3D pln) { > + pipes[pipe_cnt].pipe.dest.full_re= cout_width +=3D split_pipe->plane_res.scl_data.recout.width; > + split_pipe =3D split_pipe->top_pi= pe; > + } > + } > + > + pipes[pipe_cnt].pipe.scale_ratio_depth.lb_depth = =3D dm_lb_16; > + pipes[pipe_cnt].pipe.scale_ratio_depth.hscl_ratio= =3D (double) scl->ratios.horz.value / (1ULL<<32); > + pipes[pipe_cnt].pipe.scale_ratio_depth.hscl_ratio= _c =3D (double) scl->ratios.horz_c.value / (1ULL<<32); > + pipes[pipe_cnt].pipe.scale_ratio_depth.vscl_ratio= =3D (double) scl->ratios.vert.value / (1ULL<<32); > + pipes[pipe_cnt].pipe.scale_ratio_depth.vscl_ratio= _c =3D (double) scl->ratios.vert_c.value / (1ULL<<32); > + pipes[pipe_cnt].pipe.scale_ratio_depth.scl_enable= =3D > + scl->ratios.vert.value !=3D dc_fi= xpt_one.value > + || scl->ratios.horz.value !=3D dc= _fixpt_one.value > + || scl->ratios.vert_c.value !=3D = dc_fixpt_one.value > + || scl->ratios.horz_c.value !=3D = dc_fixpt_one.value /*Lb only or Full scl*/ > + || dc->debug.always_scale; /*supp= ort always scale*/ > + pipes[pipe_cnt].pipe.scale_taps.htaps =3D scl->ta= ps.h_taps; > + pipes[pipe_cnt].pipe.scale_taps.htaps_c =3D scl->= taps.h_taps_c; > + pipes[pipe_cnt].pipe.scale_taps.vtaps =3D scl->ta= ps.v_taps; > + pipes[pipe_cnt].pipe.scale_taps.vtaps_c =3D scl->= taps.v_taps_c; > + > + pipes[pipe_cnt].pipe.src.macro_tile_size =3D > + swizzle_mode_to_macro_tile_size(p= ln->tiling_info.gfx9.swizzle); > + swizzle_to_dml_params(pln->tiling_info.gfx9.swizz= le, > + &pipes[pipe_cnt].pipe.src.sw_mode= ); > + > + switch (pln->format) { > + case SURFACE_PIXEL_FORMAT_VIDEO_420_YCbCr: > + case SURFACE_PIXEL_FORMAT_VIDEO_420_YCrCb: > + pipes[pipe_cnt].pipe.src.source_format = =3D dm_420_8; > + break; > + case SURFACE_PIXEL_FORMAT_VIDEO_420_10bpc_YCbCr: > + case SURFACE_PIXEL_FORMAT_VIDEO_420_10bpc_YCrCb: > + pipes[pipe_cnt].pipe.src.source_format = =3D dm_420_10; > + break; > + case SURFACE_PIXEL_FORMAT_GRPH_ARGB16161616: > + case SURFACE_PIXEL_FORMAT_GRPH_ABGR16161616: > + case SURFACE_PIXEL_FORMAT_GRPH_ARGB16161616F: > + case SURFACE_PIXEL_FORMAT_GRPH_ABGR16161616F: > + pipes[pipe_cnt].pipe.src.source_format = =3D dm_444_64; > + break; > + case SURFACE_PIXEL_FORMAT_GRPH_ARGB1555: > + case SURFACE_PIXEL_FORMAT_GRPH_RGB565: > + pipes[pipe_cnt].pipe.src.source_format = =3D dm_444_16; > + break; > + case SURFACE_PIXEL_FORMAT_GRPH_PALETA_256_COLORS: > + pipes[pipe_cnt].pipe.src.source_format = =3D dm_444_8; > + break; > + case SURFACE_PIXEL_FORMAT_GRPH_RGBE_ALPHA: > + pipes[pipe_cnt].pipe.src.source_format = =3D dm_rgbe_alpha; > + break; > + default: > + pipes[pipe_cnt].pipe.src.source_format = =3D dm_444_32; > + break; > + } > + } > + > + pipe_cnt++; > + } > + > + /* populate writeback information */ > + dc->res_pool->funcs->populate_dml_writeback_from_context(dc, res_= ctx, pipes); > + > + return pipe_cnt; > +} > + > +void dcn20_calculate_wm(struct dc *dc, struct dc_state *context, > + display_e2e_pipe_params_st *pipes, > + int *out_pipe_cnt, > + int *pipe_split_from, > + int vlevel, > + bool fast_validate) > +{ > + int pipe_cnt, i, pipe_idx; > + > + dc_assert_fp_enabled(); > + > + for (i =3D 0, pipe_idx =3D 0, pipe_cnt =3D 0; i < dc->res_pool->p= ipe_count; i++) { > + if (!context->res_ctx.pipe_ctx[i].stream) > + continue; > + > + pipes[pipe_cnt].clks_cfg.refclk_mhz =3D dc->res_pool->ref= _clocks.dchub_ref_clock_inKhz / 1000.0; > + pipes[pipe_cnt].clks_cfg.dispclk_mhz =3D context->bw_ctx.= dml.vba.RequiredDISPCLK[vlevel][context->bw_ctx.dml.vba.maxMpcComb]; > + > + if (pipe_split_from[i] < 0) { > + pipes[pipe_cnt].clks_cfg.dppclk_mhz =3D > + context->bw_ctx.dml.vba.RequiredD= PPCLK[vlevel][context->bw_ctx.dml.vba.maxMpcComb][pipe_idx]; > + if (context->bw_ctx.dml.vba.BlendingAndTiming[pip= e_idx] =3D=3D pipe_idx) > + pipes[pipe_cnt].pipe.dest.odm_combine =3D > + context->bw_ctx.dml.vba.O= DMCombineEnabled[pipe_idx]; > + else > + pipes[pipe_cnt].pipe.dest.odm_combine =3D= 0; > + pipe_idx++; > + } else { > + pipes[pipe_cnt].clks_cfg.dppclk_mhz =3D > + context->bw_ctx.dml.vba.RequiredD= PPCLK[vlevel][context->bw_ctx.dml.vba.maxMpcComb][pipe_split_from[i]]; > + if (context->bw_ctx.dml.vba.BlendingAndTiming[pip= e_split_from[i]] =3D=3D pipe_split_from[i]) > + pipes[pipe_cnt].pipe.dest.odm_combine =3D > + context->bw_ctx.dml.vba.O= DMCombineEnabled[pipe_split_from[i]]; > + else > + pipes[pipe_cnt].pipe.dest.odm_combine =3D= 0; > + } > + > + if (dc->config.forced_clocks) { > + pipes[pipe_cnt].clks_cfg.dispclk_mhz =3D context-= >bw_ctx.dml.soc.clock_limits[0].dispclk_mhz; > + pipes[pipe_cnt].clks_cfg.dppclk_mhz =3D context->= bw_ctx.dml.soc.clock_limits[0].dppclk_mhz; > + } > + if (dc->debug.min_disp_clk_khz > pipes[pipe_cnt].clks_cfg= =2Edispclk_mhz * 1000) > + pipes[pipe_cnt].clks_cfg.dispclk_mhz =3D dc->debu= g.min_disp_clk_khz / 1000.0; > + if (dc->debug.min_dpp_clk_khz > pipes[pipe_cnt].clks_cfg.= dppclk_mhz * 1000) > + pipes[pipe_cnt].clks_cfg.dppclk_mhz =3D dc->debug= =2Emin_dpp_clk_khz / 1000.0; > + > + pipe_cnt++; > + } > + > + if (pipe_cnt !=3D pipe_idx) { > + if (dc->res_pool->funcs->populate_dml_pipes) > + pipe_cnt =3D dc->res_pool->funcs->populate_dml_pi= pes(dc, > + context, pipes, fast_validate); > + else > + pipe_cnt =3D dcn20_populate_dml_pipes_from_contex= t(dc, > + context, pipes, fast_validate); > + } > + > + *out_pipe_cnt =3D pipe_cnt; > + > + pipes[0].clks_cfg.voltage =3D vlevel; > + pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.clock_li= mits[vlevel].dcfclk_mhz; > + pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.clock_li= mits[vlevel].socclk_mhz; > + > + /* only pipe 0 is read for voltage and dcf/soc clocks */ > + if (vlevel < 1) { > + pipes[0].clks_cfg.voltage =3D 1; > + pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[1].dcfclk_mhz; > + pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[1].socclk_mhz; > + } > + context->bw_ctx.bw.dcn.watermarks.b.urgent_ns =3D get_wm_urgent(&= context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.b.cstate_pstate.cstate_enter_pl= us_exit_ns =3D get_wm_stutter_enter_exit(&context->bw_ctx.dml, pipes, pipe_= cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.b.cstate_pstate.cstate_exit_ns = =3D get_wm_stutter_exit(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.b.cstate_pstate.pstate_change_n= s =3D get_wm_dram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 100= 0; > + context->bw_ctx.bw.dcn.watermarks.b.pte_meta_urgent_ns =3D get_wm= _memory_trip(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.b.frac_urg_bw_nom =3D get_fract= ion_of_urgent_bandwidth(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.b.frac_urg_bw_flip =3D get_frac= tion_of_urgent_bandwidth_imm_flip(&context->bw_ctx.dml, pipes, pipe_cnt) * = 1000; > + context->bw_ctx.bw.dcn.watermarks.b.urgent_latency_ns =3D get_urg= ent_latency(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + > + if (vlevel < 2) { > + pipes[0].clks_cfg.voltage =3D 2; > + pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[2].dcfclk_mhz; > + pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[2].socclk_mhz; > + } > + context->bw_ctx.bw.dcn.watermarks.c.urgent_ns =3D get_wm_urgent(&= context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.c.cstate_pstate.cstate_enter_pl= us_exit_ns =3D get_wm_stutter_enter_exit(&context->bw_ctx.dml, pipes, pipe_= cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.c.cstate_pstate.cstate_exit_ns = =3D get_wm_stutter_exit(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.c.cstate_pstate.pstate_change_n= s =3D get_wm_dram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 100= 0; > + context->bw_ctx.bw.dcn.watermarks.c.pte_meta_urgent_ns =3D get_wm= _memory_trip(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.c.frac_urg_bw_nom =3D get_fract= ion_of_urgent_bandwidth(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.c.frac_urg_bw_flip =3D get_frac= tion_of_urgent_bandwidth_imm_flip(&context->bw_ctx.dml, pipes, pipe_cnt) * = 1000; > + > + if (vlevel < 3) { > + pipes[0].clks_cfg.voltage =3D 3; > + pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[2].dcfclk_mhz; > + pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.= clock_limits[2].socclk_mhz; > + } > + context->bw_ctx.bw.dcn.watermarks.d.urgent_ns =3D get_wm_urgent(&= context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.d.cstate_pstate.cstate_enter_pl= us_exit_ns =3D get_wm_stutter_enter_exit(&context->bw_ctx.dml, pipes, pipe_= cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.d.cstate_pstate.cstate_exit_ns = =3D get_wm_stutter_exit(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.d.cstate_pstate.pstate_change_n= s =3D get_wm_dram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 100= 0; > + context->bw_ctx.bw.dcn.watermarks.d.pte_meta_urgent_ns =3D get_wm= _memory_trip(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.d.frac_urg_bw_nom =3D get_fract= ion_of_urgent_bandwidth(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.d.frac_urg_bw_flip =3D get_frac= tion_of_urgent_bandwidth_imm_flip(&context->bw_ctx.dml, pipes, pipe_cnt) * = 1000; > + > + pipes[0].clks_cfg.voltage =3D vlevel; > + pipes[0].clks_cfg.dcfclk_mhz =3D context->bw_ctx.dml.soc.clock_li= mits[vlevel].dcfclk_mhz; > + pipes[0].clks_cfg.socclk_mhz =3D context->bw_ctx.dml.soc.clock_li= mits[vlevel].socclk_mhz; > + context->bw_ctx.bw.dcn.watermarks.a.urgent_ns =3D get_wm_urgent(&= context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.a.cstate_pstate.cstate_enter_pl= us_exit_ns =3D get_wm_stutter_enter_exit(&context->bw_ctx.dml, pipes, pipe_= cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.a.cstate_pstate.cstate_exit_ns = =3D get_wm_stutter_exit(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.a.cstate_pstate.pstate_change_n= s =3D get_wm_dram_clock_change(&context->bw_ctx.dml, pipes, pipe_cnt) * 100= 0; > + context->bw_ctx.bw.dcn.watermarks.a.pte_meta_urgent_ns =3D get_wm= _memory_trip(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.a.frac_urg_bw_nom =3D get_fract= ion_of_urgent_bandwidth(&context->bw_ctx.dml, pipes, pipe_cnt) * 1000; > + context->bw_ctx.bw.dcn.watermarks.a.frac_urg_bw_flip =3D get_frac= tion_of_urgent_bandwidth_imm_flip(&context->bw_ctx.dml, pipes, pipe_cnt) * = 1000; > +} > + > +void dcn20_update_bounding_box(struct dc *dc, > + struct _vcs_dpi_soc_bounding_box_st *bb, > + struct pp_smu_nv_clock_table *max_clocks, > + unsigned int *uclk_states, > + unsigned int num_states) > +{ > + struct _vcs_dpi_voltage_scaling_st calculated_states[DC__VOLTAGE_= STATES]; > + int i; > + int num_calculated_states =3D 0; > + int min_dcfclk =3D 0; > + > + dc_assert_fp_enabled(); > + > + if (num_states =3D=3D 0) > + return; > + > + memset(calculated_states, 0, sizeof(calculated_states)); > + > + if (dc->bb_overrides.min_dcfclk_mhz > 0) > + min_dcfclk =3D dc->bb_overrides.min_dcfclk_mhz; > + else { > + if (ASICREV_IS_NAVI12_P(dc->ctx->asic_id.hw_internal_rev)) > + min_dcfclk =3D 310; > + else > + // Accounting for SOC/DCF relationship, we can go= as high as > + // 506Mhz in Vmin. > + min_dcfclk =3D 506; > + } > + > + for (i =3D 0; i < num_states; i++) { > + int min_fclk_required_by_uclk; > + calculated_states[i].state =3D i; > + calculated_states[i].dram_speed_mts =3D uclk_states[i] * = 16 / 1000; > + > + // FCLK:UCLK ratio is 1.08 > + min_fclk_required_by_uclk =3D div_u64(((unsigned long lon= g)uclk_states[i]) * 1080, > + 1000000); > + > + calculated_states[i].fabricclk_mhz =3D (min_fclk_required= _by_uclk < min_dcfclk) ? > + min_dcfclk : min_fclk_required_by_uclk; > + > + calculated_states[i].socclk_mhz =3D (calculated_states[i]= =2Efabricclk_mhz > max_clocks->socClockInKhz / 1000) ? > + max_clocks->socClockInKhz / 1000 : calcul= ated_states[i].fabricclk_mhz; > + > + calculated_states[i].dcfclk_mhz =3D (calculated_states[i]= =2Efabricclk_mhz > max_clocks->dcfClockInKhz / 1000) ? > + max_clocks->dcfClockInKhz / 1000 : calcul= ated_states[i].fabricclk_mhz; > + > + calculated_states[i].dispclk_mhz =3D max_clocks->displayC= lockInKhz / 1000; > + calculated_states[i].dppclk_mhz =3D max_clocks->displayCl= ockInKhz / 1000; > + calculated_states[i].dscclk_mhz =3D max_clocks->displayCl= ockInKhz / (1000 * 3); > + > + calculated_states[i].phyclk_mhz =3D max_clocks->phyClockI= nKhz / 1000; > + > + num_calculated_states++; > + } > + > + calculated_states[num_calculated_states - 1].socclk_mhz =3D max_c= locks->socClockInKhz / 1000; > + calculated_states[num_calculated_states - 1].fabricclk_mhz =3D ma= x_clocks->socClockInKhz / 1000; > + calculated_states[num_calculated_states - 1].dcfclk_mhz =3D max_c= locks->dcfClockInKhz / 1000; > + > + memcpy(bb->clock_limits, calculated_states, sizeof(bb->clock_limi= ts)); > + bb->num_states =3D num_calculated_states; > + > + // Duplicate the last state, DML always an extra state identical = to max state to work > + memcpy(&bb->clock_limits[num_calculated_states], &bb->clock_limit= s[num_calculated_states - 1], sizeof(struct _vcs_dpi_voltage_scaling_st)); > + bb->clock_limits[num_calculated_states].state =3D bb->num_states; > +} > + > +void dcn20_cap_soc_clocks(struct _vcs_dpi_soc_bounding_box_st *bb, > + struct pp_smu_nv_clock_table max_clocks) > +{ > + int i; > + > + dc_assert_fp_enabled(); > + > + // First pass - cap all clocks higher than the reported max > + for (i =3D 0; i < bb->num_states; i++) { > + if ((bb->clock_limits[i].dcfclk_mhz > (max_clocks.dcfCloc= kInKhz / 1000)) > + && max_clocks.dcfClockInKhz !=3D 0) > + bb->clock_limits[i].dcfclk_mhz =3D (max_clocks.dc= fClockInKhz / 1000); > + > + if ((bb->clock_limits[i].dram_speed_mts > (max_clocks.uCl= ockInKhz / 1000) * 16) > + && max_clocks.uClockInKhz= !=3D 0) > + bb->clock_limits[i].dram_speed_mts =3D (max_clock= s.uClockInKhz / 1000) * 16; > + > + if ((bb->clock_limits[i].fabricclk_mhz > (max_clocks.fabr= icClockInKhz / 1000)) > + && max_clocks.fabricClock= InKhz !=3D 0) > + bb->clock_limits[i].fabricclk_mhz =3D (max_clocks= =2EfabricClockInKhz / 1000); > + > + if ((bb->clock_limits[i].dispclk_mhz > (max_clocks.displa= yClockInKhz / 1000)) > + && max_clocks.displayCloc= kInKhz !=3D 0) > + bb->clock_limits[i].dispclk_mhz =3D (max_clocks.d= isplayClockInKhz / 1000); > + > + if ((bb->clock_limits[i].dppclk_mhz > (max_clocks.dppCloc= kInKhz / 1000)) > + && max_clocks.dppClockInK= hz !=3D 0) > + bb->clock_limits[i].dppclk_mhz =3D (max_clocks.dp= pClockInKhz / 1000); > + > + if ((bb->clock_limits[i].phyclk_mhz > (max_clocks.phyCloc= kInKhz / 1000)) > + && max_clocks.phyClockInK= hz !=3D 0) > + bb->clock_limits[i].phyclk_mhz =3D (max_clocks.ph= yClockInKhz / 1000); > + > + if ((bb->clock_limits[i].socclk_mhz > (max_clocks.socCloc= kInKhz / 1000)) > + && max_clocks.socClockInK= hz !=3D 0) > + bb->clock_limits[i].socclk_mhz =3D (max_clocks.so= cClockInKhz / 1000); > + > + if ((bb->clock_limits[i].dscclk_mhz > (max_clocks.dscCloc= kInKhz / 1000)) > + && max_clocks.dscClockInK= hz !=3D 0) > + bb->clock_limits[i].dscclk_mhz =3D (max_clocks.ds= cClockInKhz / 1000); > + } > + > + // Second pass - remove all duplicate clock states > + for (i =3D bb->num_states - 1; i > 1; i--) { > + bool duplicate =3D true; > + > + if (bb->clock_limits[i-1].dcfclk_mhz !=3D bb->clock_limit= s[i].dcfclk_mhz) > + duplicate =3D false; > + if (bb->clock_limits[i-1].dispclk_mhz !=3D bb->clock_limi= ts[i].dispclk_mhz) > + duplicate =3D false; > + if (bb->clock_limits[i-1].dppclk_mhz !=3D bb->clock_limit= s[i].dppclk_mhz) > + duplicate =3D false; > + if (bb->clock_limits[i-1].dram_speed_mts !=3D bb->clock_l= imits[i].dram_speed_mts) > + duplicate =3D false; > + if (bb->clock_limits[i-1].dscclk_mhz !=3D bb->clock_limit= s[i].dscclk_mhz) > + duplicate =3D false; > + if (bb->clock_limits[i-1].fabricclk_mhz !=3D bb->clock_li= mits[i].fabricclk_mhz) > + duplicate =3D false; > + if (bb->clock_limits[i-1].phyclk_mhz !=3D bb->clock_limit= s[i].phyclk_mhz) > + duplicate =3D false; > + if (bb->clock_limits[i-1].socclk_mhz !=3D bb->clock_limit= s[i].socclk_mhz) > + duplicate =3D false; > + > + if (duplicate) > + bb->num_states--; > + } > +} > + > +void dcn20_patch_bounding_box(struct dc *dc, struct _vcs_dpi_soc_boundin= g_box_st *bb) > +{ > + dc_assert_fp_enabled(); > + > + if ((int)(bb->sr_exit_time_us * 1000) !=3D dc->bb_overrides.sr_ex= it_time_ns > + && dc->bb_overrides.sr_exit_time_ns) { > + bb->sr_exit_time_us =3D dc->bb_overrides.sr_exit_time_ns = / 1000.0; > + } > + > + if ((int)(bb->sr_enter_plus_exit_time_us * 1000) > + !=3D dc->bb_overrides.sr_enter_plus_exit_= time_ns > + && dc->bb_overrides.sr_enter_plus_exit_time_ns) { > + bb->sr_enter_plus_exit_time_us =3D > + dc->bb_overrides.sr_enter_plus_exit_time_= ns / 1000.0; > + } > + > + if ((int)(bb->urgent_latency_us * 1000) !=3D dc->bb_overrides.urg= ent_latency_ns > + && dc->bb_overrides.urgent_latency_ns) { > + bb->urgent_latency_us =3D dc->bb_overrides.urgent_latency= _ns / 1000.0; > + } > + > + if ((int)(bb->dram_clock_change_latency_us * 1000) > + !=3D dc->bb_overrides.dram_clock_change_l= atency_ns > + && dc->bb_overrides.dram_clock_change_latency_ns)= { > + bb->dram_clock_change_latency_us =3D > + dc->bb_overrides.dram_clock_change_latenc= y_ns / 1000.0; > + } > + > + if ((int)(bb->dummy_pstate_latency_us * 1000) > + !=3D dc->bb_overrides.dummy_clock_change_= latency_ns > + && dc->bb_overrides.dummy_clock_change_latency_ns= ) { > + bb->dummy_pstate_latency_us =3D > + dc->bb_overrides.dummy_clock_change_laten= cy_ns / 1000.0; > + } > +} > + > +static bool dcn20_validate_bandwidth_internal(struct dc *dc, struct dc_s= tate *context, > + bool fast_validate) > +{ > + bool out =3D false; > + > + BW_VAL_TRACE_SETUP(); > + > + int vlevel =3D 0; > + int pipe_split_from[MAX_PIPES]; > + int pipe_cnt =3D 0; > + display_e2e_pipe_params_st *pipes =3D kzalloc(dc->res_pool->pipe_= count * sizeof(display_e2e_pipe_params_st), GFP_ATOMIC); > + DC_LOGGER_INIT(dc->ctx->logger); > + > + BW_VAL_TRACE_COUNT(); > + > + out =3D dcn20_fast_validate_bw(dc, context, pipes, &pipe_cnt, pip= e_split_from, &vlevel, fast_validate); > + > + if (pipe_cnt =3D=3D 0) > + goto validate_out; > + > + if (!out) > + goto validate_fail; > + > + BW_VAL_TRACE_END_VOLTAGE_LEVEL(); > + > + if (fast_validate) { > + BW_VAL_TRACE_SKIP(fast); > + goto validate_out; > + } > + > + dcn20_calculate_wm(dc, context, pipes, &pipe_cnt, pipe_split_from= , vlevel, fast_validate); > + dcn20_calculate_dlg_params(dc, context, pipes, pipe_cnt, vlevel); > + > + BW_VAL_TRACE_END_WATERMARKS(); > + > + goto validate_out; > + > +validate_fail: > + DC_LOG_WARNING("Mode Validation Warning: %s failed validation.\n", > + dml_get_status_message(context->bw_ctx.dml.vba.Validation= Status[context->bw_ctx.dml.vba.soc.num_states])); > + > + BW_VAL_TRACE_SKIP(fail); > + out =3D false; > + > +validate_out: > + kfree(pipes); > + > + BW_VAL_TRACE_FINISH(); > + > + return out; > +} > + > +bool dcn20_validate_bandwidth_fp(struct dc *dc, > + struct dc_state *context, > + bool fast_validate) > +{ > + bool voltage_supported =3D false; > + bool full_pstate_supported =3D false; > + bool dummy_pstate_supported =3D false; > + double p_state_latency_us; > + > + dc_assert_fp_enabled(); > + > + p_state_latency_us =3D context->bw_ctx.dml.soc.dram_clock_change_= latency_us; > + context->bw_ctx.dml.soc.disable_dram_clock_change_vactive_support= =3D > + dc->debug.disable_dram_clock_change_vactive_support; > + context->bw_ctx.dml.soc.allow_dram_clock_one_display_vactive =3D > + dc->debug.enable_dram_clock_change_one_display_vactive; > + > + /*Unsafe due to current pipe merge and split logic*/ > + ASSERT(context !=3D dc->current_state); > + > + if (fast_validate) { > + return dcn20_validate_bandwidth_internal(dc, context, tru= e); > + } > + > + // Best case, we support full UCLK switch latency > + voltage_supported =3D dcn20_validate_bandwidth_internal(dc, conte= xt, false); > + full_pstate_supported =3D context->bw_ctx.bw.dcn.clk.p_state_chan= ge_support; > + > + if (context->bw_ctx.dml.soc.dummy_pstate_latency_us =3D=3D 0 || > + (voltage_supported && full_pstate_supported)) { > + context->bw_ctx.bw.dcn.clk.p_state_change_support =3D ful= l_pstate_supported; > + goto restore_dml_state; > + } > + > + // Fallback: Try to only support G6 temperature read latency > + context->bw_ctx.dml.soc.dram_clock_change_latency_us =3D context-= >bw_ctx.dml.soc.dummy_pstate_latency_us; > + > + voltage_supported =3D dcn20_validate_bandwidth_internal(dc, conte= xt, false); > + dummy_pstate_supported =3D context->bw_ctx.bw.dcn.clk.p_state_cha= nge_support; > + > + if (voltage_supported && (dummy_pstate_supported || !(context->st= ream_count))) { > + context->bw_ctx.bw.dcn.clk.p_state_change_support =3D fal= se; > + goto restore_dml_state; > + } > + > + // ERROR: fallback is supposed to always work. > + ASSERT(false); > + > +restore_dml_state: > + context->bw_ctx.dml.soc.dram_clock_change_latency_us =3D p_state_= latency_us; > + return voltage_supported; > +} > + > +void dcn20_fpu_set_wm_ranges(int i, > + struct pp_smu_wm_range_sets *ranges, > + struct _vcs_dpi_soc_bounding_box_st *loaded_= bb) > +{ > + dc_assert_fp_enabled(); > + > + ranges->reader_wm_sets[i].min_fill_clk_mhz =3D (i > 0) ? (loaded_= bb->clock_limits[i - 1].dram_speed_mts / 16) + 1 : 0; > + ranges->reader_wm_sets[i].max_fill_clk_mhz =3D loaded_bb->clock_l= imits[i].dram_speed_mts / 16; > +} > + > +void dcn20_fpu_adjust_dppclk(struct vba_vars_st *v, > + int vlevel, > + int max_mpc_comb, > + int pipe_idx, > + bool is_validating_bw) > +{ > + dc_assert_fp_enabled(); > + > + if (is_validating_bw) > + v->RequiredDPPCLK[vlevel][max_mpc_comb][pipe_idx] *=3D 2; > + else > + v->RequiredDPPCLK[vlevel][max_mpc_comb][pipe_idx] /=3D 2; > +} > diff --git a/drivers/gpu/drm/amd/display/dc/dml/dcn20/dcn20_fpu.h b/drive= rs/gpu/drm/amd/display/dc/dml/dcn20/dcn20_fpu.h > index 36f26126d574..6b1f4126bc88 100644 > --- a/drivers/gpu/drm/amd/display/dc/dml/dcn20/dcn20_fpu.h > +++ b/drivers/gpu/drm/amd/display/dc/dml/dcn20/dcn20_fpu.h > @@ -23,6 +23,7 @@ > * Authors: AMD > * > */ > +#include "core_types.h" >=20 > #ifndef __DCN20_FPU_H__ > #define __DCN20_FPU_H__ > @@ -30,5 +31,44 @@ > void dcn20_populate_dml_writeback_from_context(struct dc *dc, > struct resource_context *= res_ctx, > display_e2e_pipe_params_s= t *pipes); > - > +void dcn20_fpu_set_wb_arb_params(struct mcif_arb_params *wb_arb_params, > + struct dc_state *context, > + display_e2e_pipe_params_st *pipes, > + int pipe_cnt, int i); > +void dcn20_calculate_dlg_params(struct dc *dc, > + struct dc_state *context, > + display_e2e_pipe_params_st *pipes, > + int pipe_cnt, > + int vlevel); > +int dcn20_populate_dml_pipes_from_context(struct dc *dc, > + struct dc_state *context, > + display_e2e_pipe_params_st *pip= es, > + bool fast_validate); > +void dcn20_calculate_wm(struct dc *dc, > + struct dc_state *context, > + display_e2e_pipe_params_st *pipes, > + int *out_pipe_cnt, > + int *pipe_split_from, > + int vlevel, > + bool fast_validate); > +void dcn20_cap_soc_clocks(struct _vcs_dpi_soc_bounding_box_st *bb, > + struct pp_smu_nv_clock_table max_clocks); > +void dcn20_update_bounding_box(struct dc *dc, > + struct _vcs_dpi_soc_bounding_box_st *bb, > + struct pp_smu_nv_clock_table *max_clocks, > + unsigned int *uclk_states, > + unsigned int num_states); > +void dcn20_patch_bounding_box(struct dc *dc, > + struct _vcs_dpi_soc_bounding_box_st *bb); > +bool dcn20_validate_bandwidth_fp(struct dc *dc, > + struct dc_state *context, > + bool fast_validate); > +void dcn20_fpu_set_wm_ranges(int i, > + struct pp_smu_wm_range_sets *ranges, > + struct _vcs_dpi_soc_bounding_box_st *loaded_= bb); > +void dcn20_fpu_adjust_dppclk(struct vba_vars_st *v, > + int vlevel, > + int max_mpc_comb, > + int pipe_idx, > + bool is_validating_bw); > #endif /* __DCN20_FPU_H__ */ > -- > 2.34.1 >=20 --az5z2ftu5cpeyhmk Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQIzBAABCgAdFiEEd8WOo/JViG+Tu+XIwqF3j0dLehwFAmIX+jgACgkQwqF3j0dL ehy/7Q/+Jjxk0405OsAepgfKMNOdaaC8czwHTYfcuMEM2+Hvq5p6ZY8iBcBszYTQ mQfpRwPIB9r2bdnC5GjkMt161b+bn+W44DMAQ4dd3s9GwZcNEPMGjt4FXgVfkBx5 d1YJ4GzKMv2avKl7lqSKhG++ZhdFLt/LRde+0MMizSypTWYhZ0MDdPmzkrnszo8z 4be/bLEKZbRVI1wUShShplBlEHTUvQ2PYso3xEIQ3NIqKonIT35HYmWDj83IsxFE iOl6tFlT1RavWTi/heGW5GdQmMxzKH0ZTY2C6dwZRRl5ivVE2/uj3EccbxY7PWoQ sTRPguEh/sQzacCLkQ6t6T1Tw7AtZ8lHdzszWtMevu/SF9JLzCWRxV6BIQU8YOox VIwLsCdofFebzmm5hF7sG3s9N+P4HFxJKHj2fUlcyHG/0xU789Rb1Jm4b1MeOHCW 7+rLz9YJQS09z2KiG1IW4WxnmUfbeY3992nHtWEYCVI/zVMDOJzn36xaIKdmgasc lN7Xy6X4/B7HmPWVXKRVjBc5EFgLm8gP36A7b3n6GOt96/Ac+VmIlliOLVwjHDbW kAMd9jZNDN8p2YOlVPnPjiusBMIaKdB5mDReC8THUWB6Gpq/woAJeTnCJebCZzCD Koo0iiNGqTkp+P/p/MK0C4OaSzDWSHlh925EnElM/w2A+64yfyw= =2V5o -----END PGP SIGNATURE----- --az5z2ftu5cpeyhmk--