Received: by 2002:ac0:b08d:0:0:0:0:0 with SMTP id l13csp1663963imc; Fri, 22 Feb 2019 09:00:41 -0800 (PST) X-Google-Smtp-Source: AHgI3IZwdeZT9yR/vwhXPEjgE3eCzT0cjyC2MKplwjMI6g3IqRRUwrk7rGQTpGPwMLIlsf91IgRO X-Received: by 2002:a63:e03:: with SMTP id d3mr1435128pgl.245.1550854841656; Fri, 22 Feb 2019 09:00:41 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1550854841; cv=none; d=google.com; s=arc-20160816; b=fdlmhXHkXHNDjgmu4DH+wI3T5zheShrQ6z3ukK+lSt5faeUNz3xM5XvnYsmRt0kyt0 wQnhiq9bBMR9Rv7VZSjmaGTBSDEvw3n6Id4BK5uFI/G8d7HwmURTidbhPy1LbITuxmhn H0oW1/CumRYM/6RJSa3eP+4LrMaRB56KV6/tv0WcVdYpd2l/LtHZsmpgm9FMusAnna1l 6KgdqGjq69uh+8HtNoOaBBD2aoAA8RiastOkPl2MNyn/w3VWjvPUWnF9oWmd0yDE6LhH Eg0JdoQmcAoeCdzaQ6cR/1OU7N/5hS+w8swtcPWMkKgmHFZ1IlnlwqLjefkwba1wrebY Ag4w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:organization:references:in-reply-to:date:cc:to:from :subject:message-id; bh=1YmXJcMm8+KyYDQRfJIWWNTJws6qKiTKikC4XzdPoTQ=; b=j5Y3ZX/EWqPoLZHn46Jdwg60uajgwuxKE+uxuRPIoQ1KnPbLfPyWVmAukhQ/vZ2b11 kB9U30mD79NTpHNVs/4/F2+FWI6oXAhuvyTGnmOn6HHNwtiYXC52/3vn5pef8+Qkdv/8 dXdwgET9zelkpDA9GEw71EWncDlknB8LdXh5e4fQopZD4RWErUyQEwytfVSYsk6G9HFB JR5mu4mkzrg56fiVO9VnTNnFY+pkUk8B3k0+3SeS+CFQ/X1SqU/LxCw8C3ChU9JP/P2x XyyUJEDYsrZkj8h29ovkvSBiPPFw5zpuDv7CMC5EVgg5bjS6+7MZImBiihIRLIl0aUlC UWPw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=collabora.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g1si1674617plp.406.2019.02.22.09.00.26; Fri, 22 Feb 2019 09:00:41 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=collabora.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727031AbfBVQ7w (ORCPT + 99 others); Fri, 22 Feb 2019 11:59:52 -0500 Received: from bhuna.collabora.co.uk ([46.235.227.227]:57642 "EHLO bhuna.collabora.co.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725892AbfBVQ7v (ORCPT ); Fri, 22 Feb 2019 11:59:51 -0500 Received: from [127.0.0.1] (localhost [127.0.0.1]) (Authenticated sender: ezequiel) with ESMTPSA id 85898282123 Message-ID: <5d95629590e6ca2abbec898e8e9ee478f7a3a6cc.camel@collabora.com> Subject: Re: [PATCH v4 1/2] media: uapi: Add H264 low-level decoder API compound controls. From: Ezequiel Garcia To: Tomasz Figa , Maxime Ripard Cc: Hans Verkuil , Alexandre Courbot , Sakari Ailus , Laurent Pinchart , Pawel Osciak , Paul Kocialkowski , Chen-Yu Tsai , Linux Kernel Mailing List , "list@263.net:IOMMU DRIVERS , Joerg " "Roedel ," , Linux Media Mailing List , Nicolas Dufresne , jenskuske@gmail.com, Jernej Skrabec , Jonas Karlman , linux-sunxi@googlegroups.com, Thomas Petazzoni , Guenter Roeck Date: Fri, 22 Feb 2019 13:59:33 -0300 In-Reply-To: References: <9817c9875638ed2484d61e6e128e2551cf3bda4c.1550672228.git-series.maxime.ripard@bootlin.com> Organization: Collabora Content-Type: text/plain; charset="UTF-8" User-Agent: Evolution 3.30.5-1 MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 2019-02-22 at 16:46 +0900, Tomasz Figa wrote: > Hi Maxime, > > On Wed, Feb 20, 2019 at 11:17 PM Maxime Ripard > wrote: > > From: Pawel Osciak > > > > Stateless video codecs will require both the H264 metadata and slices in > > order to be able to decode frames. > > > > This introduces the definitions for a new pixel format for H264 slices that > > have been parsed, as well as the structures used to pass the metadata from > > the userspace to the kernel. > > > > Co-Developped-by: Maxime Ripard > > Signed-off-by: Pawel Osciak > > Signed-off-by: Guenter Roeck > > Signed-off-by: Maxime Ripard > > Thanks for the patch. Some comments inline. > > [snip] > > +``V4L2_CID_MPEG_VIDEO_H264_SLICE_PARAMS (struct)`` > > + Specifies the slice parameters (as extracted from the bitstream) > > + for the associated H264 slice data. This includes the necessary > > + parameters for configuring a stateless hardware decoding pipeline > > + for H264. The bitstream parameters are defined according to > > + :ref:`h264`. Unless there's a specific comment, refer to the > > + specification for the documentation of these fields, section 7.4.3 > > + "Slice Header Semantics". > > Note that this is expected to be an array, with entries for all the > slices included in the bitstream buffer. > > > + > > + .. note:: > > + > > + This compound control is not yet part of the public kernel API and > > + it is expected to change. > > + > > +.. c:type:: v4l2_ctrl_h264_slice_param > > + > > +.. cssclass:: longtable > > + > > +.. flat-table:: struct v4l2_ctrl_h264_slice_param > > + :header-rows: 0 > > + :stub-columns: 0 > > + :widths: 1 1 2 > > + > > + * - __u32 > > + - ``size`` > > + - > > + * - __u32 > > + - ``header_bit_size`` > > + - > > + * - __u16 > > + - ``first_mb_in_slice`` > > + - > > + * - __u8 > > + - ``slice_type`` > > + - > > + * - __u8 > > + - ``pic_parameter_set_id`` > > + - > > + * - __u8 > > + - ``colour_plane_id`` > > + - > > + * - __u8 > > + - ``redundant_pic_cnt`` > > + - > > + * - __u16 > > + - ``frame_num`` > > + - > > + * - __u16 > > + - ``idr_pic_id`` > > + - > > + * - __u16 > > + - ``pic_order_cnt_lsb`` > > + - > > + * - __s32 > > + - ``delta_pic_order_cnt_bottom`` > > + - > > + * - __s32 > > + - ``delta_pic_order_cnt0`` > > + - > > + * - __s32 > > + - ``delta_pic_order_cnt1`` > > + - > > + * - struct :c:type:`v4l2_h264_pred_weight_table` > > + - ``pred_weight_table`` > > + - > > + * - __u32 > > + - ``dec_ref_pic_marking_bit_size`` > > + - > > + * - __u32 > > + - ``pic_order_cnt_bit_size`` > > + - > > + * - __u8 > > + - ``cabac_init_idc`` > > + - > > + * - __s8 > > + - ``slice_qp_delta`` > > + - > > + * - __s8 > > + - ``slice_qs_delta`` > > + - > > + * - __u8 > > + - ``disable_deblocking_filter_idc`` > > + - > > + * - __s8 > > + - ``slice_alpha_c0_offset_div2`` > > + - > > + * - __s8 > > + - ``slice_beta_offset_div2`` > > + - > > + * - __u8 > > + - ``num_ref_idx_l0_active_minus1`` > > + - > > + * - __u8 > > + - ``num_ref_idx_l1_active_minus1`` > > + - > > + * - __u32 > > + - ``slice_group_change_cycle`` > > + - > > + * - __u8 > > + - ``ref_pic_list0[32]`` > > + - > > + * - __u8 > > + - ``ref_pic_list1[32]`` > > + - > > Should we explicitly document that these are the lists after applying > the per-slice modifications, as opposed to the original order from > v4l2_ctrl_h264_decode_param? > > [snip] > > + * .. _V4L2-PIX-FMT-H264-SLICE: > > + > > + - ``V4L2_PIX_FMT_H264_SLICE`` > > + - 'S264' > > + - H264 parsed slice data, as extracted from the H264 bitstream. > > + This format is adapted for stateless video decoders that > > + implement an H264 pipeline (using the :ref:`codec` and > > + :ref:`media-request-api`). Metadata associated with the frame > > + to decode are required to be passed through the > > + ``V4L2_CID_MPEG_VIDEO_H264_SPS``, > > + ``V4L2_CID_MPEG_VIDEO_H264_PPS``, > > + ``V4L2_CID_MPEG_VIDEO_H264_SLICE_PARAMS`` and > > + ``V4L2_CID_MPEG_VIDEO_H264_DECODE_PARAMS`` controls and > > + scaling matrices can optionally be specified through the > > + ``V4L2_CID_MPEG_VIDEO_H264_SCALING_MATRIX`` control. See the > > + :ref:`associated Codec Control IDs `. > > + Exactly one output and one capture buffer must be provided for > > + use with this pixel format. The output buffer must contain the > > + appropriate number of macroblocks to decode a full > > + corresponding frame to the matching capture buffer. > > What does it mean that a control can be optionally specified? A > control always has a value, so how do we decide that it was specified > or not? Should we have another control (or flag) that selects whether > to use the control? How is it better than just having the control > initialized with the default scaling matrix and always using it? > > [snip] > > > diff --git a/include/uapi/linux/videodev2.h b/include/uapi/linux/videodev2.h > > index 9a920f071ff9..6443ae53597f 100644 > > --- a/include/uapi/linux/videodev2.h > > +++ b/include/uapi/linux/videodev2.h > > @@ -653,6 +653,7 @@ struct v4l2_pix_format { > > #define V4L2_PIX_FMT_H264 v4l2_fourcc('H', '2', '6', '4') /* H264 with start codes */ > > #define V4L2_PIX_FMT_H264_NO_SC v4l2_fourcc('A', 'V', 'C', '1') /* H264 without start codes */ > > #define V4L2_PIX_FMT_H264_MVC v4l2_fourcc('M', '2', '6', '4') /* H264 MVC */ > > +#define V4L2_PIX_FMT_H264_SLICE v4l2_fourcc('S', '2', '6', '4') /* H264 parsed slices */ > > Are we okay with adding here already, without going through staging first? > > Also regarding the pixel formats. I still think we should have two pixel formats: V4L2_PIX_FMT_H264_SLICE_RAW and V4L2_PIX_FMT_H264_SLICE_ANNEX_B, to properly represent "raw" NALUs and "annex B" formatted NALUs. Thanks, Eze