Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp1919606imu; Thu, 24 Jan 2019 04:21:34 -0800 (PST) X-Google-Smtp-Source: ALg8bN7zXvrcjC28cjWBjJYAamyiS4a/x4z24AgnuTX2b+yGfs0YDp0U+mSV046yKss5tbLM9y6C X-Received: by 2002:a17:902:9a02:: with SMTP id v2mr6482517plp.180.1548332494908; Thu, 24 Jan 2019 04:21:34 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1548332494; cv=none; d=google.com; s=arc-20160816; b=m/bONTVdtHDnrbevDvJxYx1hkjKfXVwXaq1TM1BNaWoL6Pm9FxBbd13BM9lLmGrb2N 5Sib/w3z9LP7VEbUfEOMR+CcRg/+1Q08TCF4TxkF3yv+Ea7FM9aZBIpYrPogmoAFlCuk lnmfUtJ0M5Mch+CPHx01MGtZXG7LhaQTEa//86Ds0XAU16V2vltufRvMneh60owCKViz KuNPzpas49u1xQiat66jYygWM2I6aLnGU+3nTDAdE0u8jnEKMwupl3NtR1e9vaXaw3Rl CER5P04D4hTLOWA2rx4dZbXOzEt68WosEahUzR96n0jOz2mN08s8AQjVPtI84KOBVtMf llwg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:to:references:message-id :content-transfer-encoding:cc:date:in-reply-to:from:subject :mime-version; bh=R7XE3wruuZVYisN0ifiRIVVlzId03n+qDA1LyaLu6eA=; b=W7l22Yvq1YjXP1vd+01YdpbE+MvPaXR3IHUtuKsH7+17LIzDasLG/mGzBfRxLNga1G iFbdQ7uLh7yo6mshFxehMq2x8eirirp+dNM3ju+/dGmVmPpSxeQq3Kw2QrultLZR4rai /DSc8TOWu69xce1VGjGq9ZXutRYIyUwLVtahGHJmhSQ3DDWlqGsQhsC0ZEjj9KTDpuoO 03d3/u3S5IdSZADBxAmpF2W1S5lCNlIDnXlpnLUOBLLUp50S611q0deBCSrN/JaFKyh2 DsCbolE/jQJd6Eh5287pgLNi/SojeFz/UGXgLrjUAay/BeqsiLlrK8BQdHk/UpARv0nj H3eQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r12si21621193pgf.22.2019.01.24.04.21.19; Thu, 24 Jan 2019 04:21:34 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727797AbfAXMTV convert rfc822-to-8bit (ORCPT + 99 others); Thu, 24 Jan 2019 07:19:21 -0500 Received: from kozue.soulik.info ([108.61.200.231]:56494 "EHLO kozue.soulik.info" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727455AbfAXMTV (ORCPT ); Thu, 24 Jan 2019 07:19:21 -0500 Received: from [192.168.0.49] (unknown [192.168.0.49]) by kozue.soulik.info (Postfix) with ESMTPSA id 597C5100D3B; Thu, 24 Jan 2019 21:20:20 +0900 (JST) Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (1.0) Subject: Re: [linux-sunxi] [PATCH v2 1/2] media: v4l: Add definitions for the HEVC slice format and controls From: Ayaka X-Mailer: iPad Mail (16A404) In-Reply-To: <7c9080caa52b7201636999dcb3c5cdbe3d80d82b.camel@bootlin.com> Date: Thu, 24 Jan 2019 20:19:12 +0800 Cc: Randy Li , =?utf-8?Q?Jernej_=C5=A0krabec?= , linux-media@vger.kernel.org, linux-kernel@vger.kernel.org, devel@driverdev.osuosl.org, linux-arm-kernel@lists.infradead.org, Mauro Carvalho Chehab , Maxime Ripard , Hans Verkuil , Ezequiel Garcia , Tomasz Figa , Alexandre Courbot , Thomas Petazzoni , linux-rockchip@lists.infradead.org Content-Transfer-Encoding: 8BIT Message-Id: <5FA6762E-A369-4754-9BA6-01CF7016AA7D@soulik.info> References: <20181123130209.11696-1-paul.kocialkowski@bootlin.com> <20181123130209.11696-2-paul.kocialkowski@bootlin.com> <5515174.7lFZcYkk85@jernej-laptop> <776e63c9-d4a5-342a-e0f7-200ef144ffc4@rock-chips.com> <64c793e08d61181b78125b3956ec38623fa5d261.camel@bootlin.com> <82FA0C3F-BC54-4D89-AECB-90D81B89B1CE@soulik.info> <7c9080caa52b7201636999dcb3c5cdbe3d80d82b.camel@bootlin.com> To: Paul Kocialkowski Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Sent from my iPad > On Jan 24, 2019, at 6:36 PM, Paul Kocialkowski wrote: > > Hi, > >> On Tue, 2019-01-08 at 18:00 +0800, Ayaka wrote: >> >> Sent from my iPad >> >>> On Jan 8, 2019, at 4:38 PM, Paul Kocialkowski wrote: >>> >>> Hi, >>> >>>> On Tue, 2019-01-08 at 09:16 +0800, Ayaka wrote: >>>> >>>> Sent from my iPad >>>> >>>>> On Jan 7, 2019, at 5:57 PM, Paul Kocialkowski wrote: >>>>> >>>>> Hi, >>>>> >>>>>>> On Mon, 2019-01-07 at 11:49 +0800, Randy Li wrote: >>>>>>> On 12/12/18 8:51 PM, Paul Kocialkowski wrote: >>>>>>> Hi, >>>>>>> >>>>>>> On Wed, 2018-12-05 at 21:59 +0100, Jernej Škrabec wrote: >>>>>>> >>>>>>>>> + >>>>>>>>> +#define V4L2_HEVC_DPB_ENTRY_RPS_ST_CURR_BEFORE 0x01 >>>>>>>>> +#define V4L2_HEVC_DPB_ENTRY_RPS_ST_CURR_AFTER 0x02 >>>>>>>>> +#define V4L2_HEVC_DPB_ENTRY_RPS_LT_CURR 0x03 >>>>>>>>> + >>>>>>>>> +#define V4L2_HEVC_DPB_ENTRIES_NUM_MAX 16 >>>>>>>>> + >>>>>>>>> +struct v4l2_hevc_dpb_entry { >>>>>>>>> + __u32 buffer_tag; >>>>>>>>> + __u8 rps; >>>>>>>>> + __u8 field_pic; >>>>>>>>> + __u16 pic_order_cnt[2]; >>>>>>>>> +}; >>>>>> >>>>>> Please add a property for reference index, if that rps is not used for >>>>>> this, some device would request that(not the rockchip one). And >>>>>> Rockchip's VDPU1 and VDPU2 for AVC would request a similar property. >>>>> >>>>> What exactly is that reference index? Is it a bitstream element or >>>>> something deduced from the bitstream? >>>>> >>>> picture order count(POC) for HEVC and frame_num in AVC. I think it is >>>> the number used in list0(P slice and B slice) and list1(B slice). >>> >>> The picture order count is already the last field of the DPB entry >>> structure. There is one for each field picture. >> As we are not sure whether there is a field coded slice or CTU, I >> would hold this part and else about the field. > > I'm not sure what you meant here, sorry. As we talked in IRC, I am not sure the field coded picture is supported in HEVC. And I don’t why there would be two pic order cnt, a picture can only be used a short term or a long term reference at one picture decoding > >>>>>> Adding another buffer_tag for referring the memory of the motion vectors >>>>>> for each frames. Or a better method is add a meta data to echo picture >>>>>> buffer, since the picture output is just the same as the original, >>>>>> display won't care whether the motion vectors are written the button of >>>>>> picture or somewhere else. >>>>> >>>>> The motion vectors are passed as part of the raw bitstream data, in the >>>>> slices. Is there a case where the motion vectors are coded differently? >>>> No, it is an additional cache for decoder, even FFmpeg having such >>>> data, I think allwinner must output it into somewhere. >>> >>> Ah yes I see what you mean! This is handled internally by our driver >>> and not exposed to userspace. I don't think it would be a good idea to >>> expose this cache or request that userspace allocates it like a video >>> buffer. >>> >> No, usually the driver should allocate, as the user space have no >> idea on size of each devices. >> But for advantage user, application can fix a broken picture with a >> proper data or analysis a object motion from that. >> So I would suggest attaching this information to a picture buffer as >> a meta data. > > Right, the driver will allocate chunks of memory for the decoding > metadata used by the hardware decoder. > > Well, I don't think V4L2 has any mechanism to expose this data for now > and since it's very specific to the hardware implementation, I guess > the interest in having that is generally pretty low. > > That's maybe something that could be added later if someone wants to > work on it, but I think we are better off keeping this metadata hidden > by the driver for now. I am writing a V4l2 driver for rockchip based on the previous vendor driver I sent to mail list. I think I would offer a better way to describe the meta after that. But it need both work in derives and userspace, it would cost some times. > >>>>>>>>> + >>>>>>>>> +struct v4l2_hevc_pred_weight_table { >>>>>>>>> + __u8 luma_log2_weight_denom; >>>>>>>>> + __s8 delta_chroma_log2_weight_denom; >>>>>>>>> + >>>>>>>>> + __s8 delta_luma_weight_l0[V4L2_HEVC_DPB_ENTRIES_NUM_MAX]; >>>>>>>>> + __s8 luma_offset_l0[V4L2_HEVC_DPB_ENTRIES_NUM_MAX]; >>>>>>>>> + __s8 delta_chroma_weight_l0[V4L2_HEVC_DPB_ENTRIES_NUM_MAX][2]; >>>>>>>>> + __s8 chroma_offset_l0[V4L2_HEVC_DPB_ENTRIES_NUM_MAX][2]; >>>>>>>>> + >>>>>>>>> + __s8 delta_luma_weight_l1[V4L2_HEVC_DPB_ENTRIES_NUM_MAX]; >>>>>>>>> + __s8 luma_offset_l1[V4L2_HEVC_DPB_ENTRIES_NUM_MAX]; >>>>>>>>> + __s8 delta_chroma_weight_l1[V4L2_HEVC_DPB_ENTRIES_NUM_MAX][2]; >>>>>>>>> + __s8 chroma_offset_l1[V4L2_HEVC_DPB_ENTRIES_NUM_MAX][2]; >>>>>>>>> +}; >>>>>>>>> + >>>>>> Those properties I think are not necessary are applying for the >>>>>> Rockchip's device, may not work for the others. >>>>> >>>>> Yes, it's possible that some of the elements are not necessary for some >>>>> decoders. What we want is to cover all the elements that might be >>>>> required for a decoder. >>>> I wonder whether allwinner need that, those sao flag usually ignored >>>> by decoder in design. But more is better than less, it is hard to >>>> extend a v4l2 structure in the future, maybe a new HEVC profile >>>> would bring a new property, it is still too early for HEVC. >>> >>> Yes this is used by our decoder. The idea is to have all the basic >>> bitstream elements in the structures (even if some decoders don't use >>> them all) and add others for extension as separate controls later. >>> >>>>>>>>> +struct v4l2_ctrl_hevc_slice_params { >>>>>>>>> + __u32 bit_size; >>>>>>>>> + __u32 data_bit_offset; >>>>>>>>> + >>>>>>>>> + /* ISO/IEC 23008-2, ITU-T Rec. H.265: NAL unit header */ >>>>>>>>> + __u8 nal_unit_type; >>>>>>>>> + __u8 nuh_temporal_id_plus1; >>>>>>>>> + >>>>>>>>> + /* ISO/IEC 23008-2, ITU-T Rec. H.265: General slice segment header */ >>>>>>>>> + __u8 slice_type; >>>>>>>>> + __u8 colour_plane_id; >>>>>> ---------------------------------------------------------------------------- >>>>>>>>> + __u16 slice_pic_order_cnt; >>>>>>>>> + __u8 slice_sao_luma_flag; >>>>>>>>> + __u8 slice_sao_chroma_flag; >>>>>>>>> + __u8 slice_temporal_mvp_enabled_flag; >>>>>>>>> + __u8 num_ref_idx_l0_active_minus1; >>>>>>>>> + __u8 num_ref_idx_l1_active_minus1; >>>>>> Rockchip's decoder doesn't use this part. >>>>>>>>> + __u8 mvd_l1_zero_flag; >>>>>>>>> + __u8 cabac_init_flag; >>>>>>>>> + __u8 collocated_from_l0_flag; >>>>>>>>> + __u8 collocated_ref_idx; >>>>>>>>> + __u8 five_minus_max_num_merge_cand; >>>>>>>>> + __u8 use_integer_mv_flag; >>>>>>>>> + __s8 slice_qp_delta; >>>>>>>>> + __s8 slice_cb_qp_offset; >>>>>>>>> + __s8 slice_cr_qp_offset; >>>>>>>>> + __s8 slice_act_y_qp_offset; >>>>>>>>> + __s8 slice_act_cb_qp_offset; >>>>>>>>> + __s8 slice_act_cr_qp_offset; >>>>>>>>> + __u8 slice_deblocking_filter_disabled_flag; >>>>>>>>> + __s8 slice_beta_offset_div2; >>>>>>>>> + __s8 slice_tc_offset_div2; >>>>>>>>> + __u8 slice_loop_filter_across_slices_enabled_flag; >>>>>>>>> + >>>>>>>>> + /* ISO/IEC 23008-2, ITU-T Rec. H.265: Picture timing SEI message */ >>>>>>>>> + __u8 pic_struct; >>>>>> I think the decoder doesn't care about this, it is used for display. >>>>> >>>>> The purpose of this field is to indicate whether the current picture is >>>>> a progressive frame or an interlaced field picture, which is useful for >>>>> decoding. >>>>> >>>>> At least our decoder has a register field to indicate frame/top >>>>> field/bottom field, so we certainly need to keep the info around. >>>>> Looking at the spec and the ffmpeg implementation, it looks like this >>>>> flag of the bitstream is the usual way to report field coding. >>>> It depends whether the decoder cares about scan type or more, I >>>> wonder prefer general_interlaced_source_flag for just scan type, it >>>> would be better than reading another SEL. >>> >>> Well we still need a way to indicate if the current data is top or >>> bottom field for interlaced. I don't think that knowing that the whole >>> video is interlaced would be precise enough. >>> >>> Cheers, >>> >>> Paul >>> >>>>>>>>> + >>>>>>>>> + /* ISO/IEC 23008-2, ITU-T Rec. H.265: General slice segment header */ >>>>>>>>> + struct v4l2_hevc_dpb_entry dpb[V4L2_HEVC_DPB_ENTRIES_NUM_MAX]; >>>>>>>>> + __u8 num_active_dpb_entries; >>>>>>>>> + __u8 ref_idx_l0[V4L2_HEVC_DPB_ENTRIES_NUM_MAX]; >>>>>>>>> + __u8 ref_idx_l1[V4L2_HEVC_DPB_ENTRIES_NUM_MAX]; >>>>>>>>> + >>>>>>>>> + __u8 num_rps_poc_st_curr_before; >>>>>>>>> + __u8 num_rps_poc_st_curr_after; >>>>>>>>> + __u8 num_rps_poc_lt_curr; >>>>>>>>> + >>>>>>>>> + /* ISO/IEC 23008-2, ITU-T Rec. H.265: Weighted prediction parameter */ >>>>>>>>> + struct v4l2_hevc_pred_weight_table pred_weight_table; >>>>>>>>> +}; >>>>>>>>> + >>>>>>>>> #endif >>>>> -- >>>>> Paul Kocialkowski, Bootlin (formerly Free Electrons) >>>>> Embedded Linux and kernel engineering >>>>> https://bootlin.com >>>>> >>> -- >>> Paul Kocialkowski, Bootlin (formerly Free Electrons) >>> Embedded Linux and kernel engineering >>> https://bootlin.com >>> > -- > Paul Kocialkowski, Bootlin > Embedded Linux and kernel engineering > https://bootlin.com >