Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp2602421yba; Mon, 8 Apr 2019 00:08:22 -0700 (PDT) X-Google-Smtp-Source: APXvYqxpYc12c2tDvsqZVHQGo6xwgsfS5PLQcD3iMI5X97ePFAhDZDP1A1+QQHOAinAGXySFvM0a X-Received: by 2002:a63:b64:: with SMTP id a36mr27087002pgl.58.1554707302247; Mon, 08 Apr 2019 00:08:22 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1554707302; cv=none; d=google.com; s=arc-20160816; b=yFvJOYlUFwO1+HLe7LVvjNbDp52X3G431sbajE3IWvdWAt1+q8Uqx+COs1dKyLSeAU aDGCNQQFPi0fcYHtFcQX2RXBsIZeaH77mWT5MzOMmoG+dyx4HwQlasb2D1ohaPdznRCO 3wzunfNQ7QfuO71HYcRc8jvYDxB+7UvakoqrA+E/Ih48z+Oo5nkX9rnS/WNoBmG1ZoTn RkS5lBoWKvayMTWhNG4y7tPgzUzXvXIrIaZFDl2Gw6c9GAZgzrEFCAdV5runt5ex1sdw 0tB89pleGDNtmpfkKpl4z0BJzV9ZG4vOj0ee6fiEBzskpC010R7wU2iGUyBs90I1UgJV c0Hg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=q2jFnUgUyUJFMS4RUgPLLoRMs2uP+VcNyEG1eG9Z6kc=; b=nFW6cQikhflUKYHOYLMNbhy/AbWBYvWJVhgD8fmmAEKO1+QA4iUWyBKXEgNa4ov2nE UQX8WPcEiVSgWoycs/gK57iusoZTlwHNuEnQxHoMokCPCQwOfD99vTskIDK7sKPoCbUJ XKttPtSIeQTWHHReLDEi6fMU/JtkzbIKBeGx9JzRhR1YD6IIHl+sXcj4lVP/h2ggz7ND QugRAprxOq7S3b/pk6YaDQMO9eSI6aHfitihYKs6/iypA2VcsZGm56cdjlxSCeBjZbZb 7cJA+IxrATUEjXF4eL5Yqvd+ybkNRDLTyN2qECVBcr5aj84LSfP7StMCIqxrPAOwmliL KWsA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@chromium.org header.s=google header.b=ZwhDwTFT; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=chromium.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m18si20969583pgv.209.2019.04.08.00.08.06; Mon, 08 Apr 2019 00:08:22 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@chromium.org header.s=google header.b=ZwhDwTFT; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=chromium.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726228AbfDHHHY (ORCPT + 99 others); Mon, 8 Apr 2019 03:07:24 -0400 Received: from mail-oi1-f196.google.com ([209.85.167.196]:40314 "EHLO mail-oi1-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725854AbfDHHHY (ORCPT ); Mon, 8 Apr 2019 03:07:24 -0400 Received: by mail-oi1-f196.google.com with SMTP id 3so9648538oir.7 for ; Mon, 08 Apr 2019 00:07:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=chromium.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=q2jFnUgUyUJFMS4RUgPLLoRMs2uP+VcNyEG1eG9Z6kc=; b=ZwhDwTFTmhbL/tKhhlbKGla1RgrwmdTUMNhKKGGCy0K+/RiwuyLQfx+5Iv5MqupI9o +fAsEoCTKSlEO0FcKG9/gfV3EFJj6ehaONwwqjs/yZLniCOldC/7ynKDiJXA3EiaFle9 RloOREKfk0vheMJppRbOg0Zq3WeqDDW8M/JR8= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=q2jFnUgUyUJFMS4RUgPLLoRMs2uP+VcNyEG1eG9Z6kc=; b=SjSqyjM75s+d7EsPYdzYJg+9vZ93p2Ag/q4kjhqrxxyDMMJ35iPECHLJgb8kIfOFVm M00F/djY/6T4gH3s14paK7h/cWYf7bPbhsj91kHoB5JQfxAAF5R9HB1zcN/dlhVju8yb l0wm5FDSrPW+o4HiuDPa9W/uGbLcJYPpDIOS855O2rTkBn5S7KB0baA088lQAj9fHdgj IPViBsHeK5yo9W6h1jesh79AZbpXs+BafesmJU6VymefVCCmlwTNATG9v6DS4zOkNz94 7FPkMQdIeTzQss+qNuuwT2Jci2A79ElFWNNye5elQ9WQ8AzWd34g+ltrGsm/cDmIXmit RhwA== X-Gm-Message-State: APjAAAXbwEqMOfvEZnY3HhR7GTr3rj5bU1Fiw9o5oyx8WbwNVFh1KdcL YqhRDjvWbEZpyoQaNYe6pI2HOn/D/IM= X-Received: by 2002:aca:b604:: with SMTP id g4mr14309944oif.155.1554707242835; Mon, 08 Apr 2019 00:07:22 -0700 (PDT) Received: from mail-ot1-f54.google.com (mail-ot1-f54.google.com. [209.85.210.54]) by smtp.gmail.com with ESMTPSA id w2sm11468110otk.73.2019.04.08.00.07.22 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 08 Apr 2019 00:07:22 -0700 (PDT) Received: by mail-ot1-f54.google.com with SMTP id d24so11006586otl.11 for ; Mon, 08 Apr 2019 00:07:22 -0700 (PDT) X-Received: by 2002:a9d:32f5:: with SMTP id u108mr18425644otb.153.1554706801612; Mon, 08 Apr 2019 00:00:01 -0700 (PDT) MIME-Version: 1.0 References: <20190124100419.26492-1-tfiga@chromium.org> <20190124100419.26492-3-tfiga@chromium.org> <7dc32e83-dec3-37d6-9bfe-e162c495dcf3@xs4all.nl> In-Reply-To: <7dc32e83-dec3-37d6-9bfe-e162c495dcf3@xs4all.nl> From: Tomasz Figa Date: Mon, 8 Apr 2019 15:59:50 +0900 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH v3 2/2] media: docs-rst: Document memory-to-memory video encoder interface To: Hans Verkuil Cc: Linux Media Mailing List , Linux Kernel Mailing List , Mauro Carvalho Chehab , Pawel Osciak , Alexandre Courbot , Kamil Debski , Andrzej Hajda , Kyungmin Park , Jeongtae Park , Philipp Zabel , =?UTF-8?B?VGlmZmFueSBMaW4gKOael+aFp+ePiik=?= , =?UTF-8?B?QW5kcmV3LUNUIENoZW4gKOmZs+aZuui/qik=?= , Stanimir Varbanov , Todor Tomov , Nicolas Dufresne , Paul Kocialkowski , Laurent Pinchart , dave.stevenson@raspberrypi.org, Ezequiel Garcia , Maxime Jourdan Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Mar 21, 2019 at 7:11 PM Hans Verkuil wrote: > > Hi Tomasz, > > A few more comments: > > On 1/24/19 11:04 AM, Tomasz Figa wrote: > > Due to complexity of the video encoding process, the V4L2 drivers of > > stateful encoder hardware require specific sequences of V4L2 API calls > > to be followed. These include capability enumeration, initialization, > > encoding, encode parameters change, drain and reset. > > > > Specifics of the above have been discussed during Media Workshops at > > LinuxCon Europe 2012 in Barcelona and then later Embedded Linux > > Conference Europe 2014 in D=C3=BCsseldorf. The de facto Codec API that > > originated at those events was later implemented by the drivers we alre= ady > > have merged in mainline, such as s5p-mfc or coda. > > > > The only thing missing was the real specification included as a part of > > Linux Media documentation. Fix it now and document the encoder part of > > the Codec API. > > > > Signed-off-by: Tomasz Figa > > --- > > Documentation/media/uapi/v4l/dev-encoder.rst | 586 ++++++++++++++++++ > > Documentation/media/uapi/v4l/dev-mem2mem.rst | 1 + > > Documentation/media/uapi/v4l/pixfmt-v4l2.rst | 5 + > > Documentation/media/uapi/v4l/v4l2.rst | 2 + > > .../media/uapi/v4l/vidioc-encoder-cmd.rst | 38 +- > > 5 files changed, 617 insertions(+), 15 deletions(-) > > create mode 100644 Documentation/media/uapi/v4l/dev-encoder.rst > > > > diff --git a/Documentation/media/uapi/v4l/dev-encoder.rst b/Documentati= on/media/uapi/v4l/dev-encoder.rst > > new file mode 100644 > > index 000000000000..fb8b05a132ee > > --- /dev/null > > +++ b/Documentation/media/uapi/v4l/dev-encoder.rst > > @@ -0,0 +1,586 @@ > > +.. -*- coding: utf-8; mode: rst -*- > > + > > +.. _encoder: > > + > > +************************************************* > > +Memory-to-memory Stateful Video Encoder Interface > > +************************************************* > > + > > +A stateful video encoder takes raw video frames in display order and e= ncodes > > +them into a bitstream. It generates complete chunks of the bitstream, = including > > +all metadata, headers, etc. The resulting bitstream does not require a= ny > > +further post-processing by the client. > > + > > +Performing software stream processing, header generation etc. in the d= river > > +in order to support this interface is strongly discouraged. In case su= ch > > +operations are needed, use of the Stateless Video Encoder Interface (i= n > > +development) is strongly advised. > > + > > +Conventions and notation used in this document > > +=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > > + > > +1. The general V4L2 API rules apply if not specified in this document > > + otherwise. > > + > > +2. The meaning of words "must", "may", "should", etc. is as per `RFC > > + 2119 `_. > > + > > +3. All steps not marked "optional" are required. > > + > > +4. :c:func:`VIDIOC_G_EXT_CTRLS` and :c:func:`VIDIOC_S_EXT_CTRLS` may b= e used > > + interchangeably with :c:func:`VIDIOC_G_CTRL` and :c:func:`VIDIOC_S_= CTRL`, > > + unless specified otherwise. > > + > > +5. Single-planar API (see :ref:`planar-apis`) and applicable structure= s may be > > + used interchangeably with multi-planar API, unless specified otherw= ise, > > + depending on decoder capabilities and following the general V4L2 gu= idelines. > > decoder -> encoder > Ack. > > + > > +6. i =3D [a..b]: sequence of integers from a to b, inclusive, i.e. i = =3D > > + [0..2]: i =3D 0, 1, 2. > > + > > +7. Given an ``OUTPUT`` buffer A, then A=E2=80=99 represents a buffer o= n the ``CAPTURE`` > > + queue containing data that resulted from processing buffer A. > > + > > +Glossary > > +=3D=3D=3D=3D=3D=3D=3D=3D > > + > > +Refer to :ref:`decoder-glossary`. > > + > > +State machine > > +=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > > + > > +.. kernel-render:: DOT > > + :alt: DOT digraph of encoder state machine > > + :caption: Encoder state machine > > + > > + digraph encoder_state_machine { > > + node [shape =3D doublecircle, label=3D"Encoding"] Encoding; > > + > > + node [shape =3D circle, label=3D"Initialization"] Initializatio= n; > > + node [shape =3D circle, label=3D"Stopped"] Stopped; > > + node [shape =3D circle, label=3D"Drain"] Drain; > > + node [shape =3D circle, label=3D"Reset"] Reset; > > + > > + node [shape =3D point]; qi > > + qi -> Initialization [ label =3D "open()" ]; > > + > > + Initialization -> Encoding [ label =3D "Both queues streaming" = ]; > > + > > + Encoding -> Drain [ label =3D "V4L2_DEC_CMD_STOP" ]; > > + Encoding -> Reset [ label =3D "VIDIOC_STREAMOFF(CAPTURE)" ]; > > + Encoding -> Stopped [ label =3D "VIDIOC_STREAMOFF(OUTPUT)" ]; > > + Encoding -> Encoding; > > + > > + Drain -> Stopped [ label =3D "All CAPTURE\nbuffers dequeued\nor= \nVIDIOC_STREAMOFF(CAPTURE)" ]; > > + Drain -> Reset [ label =3D "VIDIOC_STREAMOFF(CAPTURE)" ]; > > + > > + Reset -> Encoding [ label =3D "VIDIOC_STREAMON(CAPTURE)" ]; > > + Reset -> Initialization [ label =3D "VIDIOC_REQBUFS(OUTPUT, 0)"= ]; > > + > > + Stopped -> Encoding [ label =3D "V4L2_DEC_CMD_START\nor\nVIDIOC= _STREAMON(OUTPUT)" ]; > > + Stopped -> Reset [ label =3D "VIDIOC_STREAMOFF(CAPTURE)" ]; > > + } > > + > > +Querying capabilities > > +=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > > + > > +1. To enumerate the set of coded formats supported by the encoder, the > > + client may call :c:func:`VIDIOC_ENUM_FMT` on ``CAPTURE``. > > + > > + * The full set of supported formats will be returned, regardless of= the > > + format set on ``OUTPUT``. > > + > > +2. To enumerate the set of supported raw formats, the client may call > > + :c:func:`VIDIOC_ENUM_FMT` on ``OUTPUT``. > > + > > + * Only the formats supported for the format currently active on ``C= APTURE`` > > + will be returned. > > + > > + * In order to enumerate raw formats supported by a given coded form= at, > > + the client must first set that coded format on ``CAPTURE`` and th= en > > + enumerate the formats on ``OUTPUT``. > > + > > +3. The client may use :c:func:`VIDIOC_ENUM_FRAMESIZES` to detect suppo= rted > > + resolutions for a given format, passing desired pixel format in > > + :c:type:`v4l2_frmsizeenum` ``pixel_format``. > > + > > + * Values returned by :c:func:`VIDIOC_ENUM_FRAMESIZES` for a coded p= ixel > > + format will include all possible coded resolutions supported by t= he > > + encoder for given coded pixel format. > > + > > + * Values returned by :c:func:`VIDIOC_ENUM_FRAMESIZES` for a raw pix= el format > > + will include all possible frame buffer resolutions supported by t= he > > + encoder for given raw pixel format and coded format currently set= on > > + ``CAPTURE``. > > + > > +4. Supported profiles and levels for the coded format currently set on > > + ``CAPTURE``, if applicable, may be queried using their respective c= ontrols > > + via :c:func:`VIDIOC_QUERYCTRL`. > > + > > +5. Any additional encoder capabilities may be discovered by querying > > + their respective controls. > > + > > +Initialization > > +=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > > + > > +1. Set the coded format on the ``CAPTURE`` queue via :c:func:`VIDIOC_S= _FMT` > > + > > + * **Required fields:** > > + > > + ``type`` > > + a ``V4L2_BUF_TYPE_*`` enum appropriate for ``CAPTURE`` > > + > > + ``pixelformat`` > > + the coded format to be produced > > + > > + ``sizeimage`` > > + desired size of ``CAPTURE`` buffers; the encoder may adjust i= t to > > + match hardware requirements > > + > > + ``width``, ``height`` > > + ignored (always zero) > > + > > + other fields > > + follow standard semantics > > + > > + * **Return fields:** > > + > > + ``sizeimage`` > > + adjusted size of ``CAPTURE`` buffers > > + > > + .. important:: > > + > > + Changing the ``CAPTURE`` format may change the currently set ``O= UTPUT`` > > + format. The encoder will derive a new ``OUTPUT`` format from the > > + ``CAPTURE`` format being set, including resolution, colorimetry > > + parameters, etc. If the client needs a specific ``OUTPUT`` forma= t, it > > + must adjust it afterwards. > > + > > +2. **Optional.** Enumerate supported ``OUTPUT`` formats (raw formats f= or > > + source) for the selected coded format via :c:func:`VIDIOC_ENUM_FMT`= . > > + > > + * **Required fields:** > > + > > + ``type`` > > + a ``V4L2_BUF_TYPE_*`` enum appropriate for ``OUTPUT`` > > + > > + other fields > > + follow standard semantics > > + > > + * **Return fields:** > > + > > + ``pixelformat`` > > + raw format supported for the coded format currently selected = on > > + the ``CAPTURE`` queue. > > + > > + other fields > > + follow standard semantics > > + > > +3. Set the raw source format on the ``OUTPUT`` queue via > > + :c:func:`VIDIOC_S_FMT`. > > + > > + * **Required fields:** > > + > > + ``type`` > > + a ``V4L2_BUF_TYPE_*`` enum appropriate for ``OUTPUT`` > > + > > + ``pixelformat`` > > + raw format of the source > > + > > + ``width``, ``height`` > > + source resolution > > + > > + other fields > > + follow standard semantics > > + > > + * **Return fields:** > > + > > + ``width``, ``height`` > > + may be adjusted by encoder to match alignment requirements, a= s > > + required by the currently selected formats > > + > > + other fields > > + follow standard semantics > > + > > + * Setting the source resolution will reset the selection rectangles= to their > > + default values, based on the new resolution, as described in the = step 5 > > + below. > > + > > +4. **Optional.** Set the visible resolution for the stream metadata vi= a > > + :c:func:`VIDIOC_S_SELECTION` on the ``OUTPUT`` queue. > > + > > + * **Required fields:** > > + > > + ``type`` > > + a ``V4L2_BUF_TYPE_*`` enum appropriate for ``OUTPUT`` > > + > > + ``target`` > > + set to ``V4L2_SEL_TGT_CROP`` > > + > > + ``r.left``, ``r.top``, ``r.width``, ``r.height`` > > + visible rectangle; this must fit within the `V4L2_SEL_TGT_CRO= P_BOUNDS` > > + rectangle and may be subject to adjustment to match codec and > > + hardware constraints > > + > > + * **Return fields:** > > + > > + ``r.left``, ``r.top``, ``r.width``, ``r.height`` > > + visible rectangle adjusted by the encoder > > + > > + * The following selection targets are supported on ``OUTPUT``: > > + > > + ``V4L2_SEL_TGT_CROP_BOUNDS`` > > + equal to the full source frame, matching the active ``OUTPUT`= ` > > + format > > + > > + ``V4L2_SEL_TGT_CROP_DEFAULT`` > > + equal to ``V4L2_SEL_TGT_CROP_BOUNDS`` > > + > > + ``V4L2_SEL_TGT_CROP`` > > + rectangle within the source buffer to be encoded into the > > + ``CAPTURE`` stream; defaults to ``V4L2_SEL_TGT_CROP_DEFAULT`` > > + > > + .. note:: > > + > > + A common use case for this selection target is encoding a = source > > + video with a resolution that is not a multiple of a macrob= lock, > > + e.g. the common 1920x1080 resolution may require the sour= ce > > + buffers to be aligned to 1920x1088 for codecs with 16x16 m= acroblock > > + size. To avoid encoding the padding, the client needs to e= xplicitly > > + configure this selection target to 1920x1080. > > + > > + ``V4L2_SEL_TGT_COMPOSE_BOUNDS`` > > + maximum rectangle within the coded resolution, which the crop= ped > > + source frame can be composed into; if the hardware does not s= upport > > + composition or scaling, then this is always equal to the rect= angle of > > + width and height matching ``V4L2_SEL_TGT_CROP`` and located a= t (0, 0) > > + > > + ``V4L2_SEL_TGT_COMPOSE_DEFAULT`` > > + equal to a rectangle of width and height matching > > + ``V4L2_SEL_TGT_CROP`` and located at (0, 0) > > + > > + ``V4L2_SEL_TGT_COMPOSE`` > > + rectangle within the coded frame, which the cropped source fr= ame > > + is to be composed into; defaults to > > + ``V4L2_SEL_TGT_COMPOSE_DEFAULT``; read-only on hardware witho= ut > > + additional compose/scaling capabilities; resulting stream wil= l > > + have this rectangle encoded as the visible rectangle in its > > + metadata > > I would only support the COMPOSE targets if the hardware can actually do > scaling and/or composing. That is conform standard V4L2 behavior where > cropping/composing is only implemented if the hardware can actually do > this. > Please see my other reply to your earlier similar comment in this thread. > > + > > + .. warning:: > > + > > + The encoder may adjust the crop/compose rectangles to the neares= t > > + supported ones to meet codec and hardware requirements. The clie= nt needs > > + to check the adjusted rectangle returned by :c:func:`VIDIOC_S_SE= LECTION`. > > + > > +5. Allocate buffers for both ``OUTPUT`` and ``CAPTURE`` via > > + :c:func:`VIDIOC_REQBUFS`. This may be performed in any order. > > + > > + * **Required fields:** > > + > > + ``count`` > > + requested number of buffers to allocate; greater than zero > > + > > + ``type`` > > + a ``V4L2_BUF_TYPE_*`` enum appropriate for ``OUTPUT`` or > > + ``CAPTURE`` > > + > > + other fields > > + follow standard semantics > > + > > + * **Return fields:** > > + > > + ``count`` > > + actual number of buffers allocated > > + > > + .. warning:: > > + > > + The actual number of allocated buffers may differ from the ``cou= nt`` > > + given. The client must check the updated value of ``count`` afte= r the > > + call returns. > > + > > + .. note:: > > + > > + To allocate more than the minimum number of OUTPUT buffers (for = pipeline > > + depth), the client may query the ``V4L2_CID_MIN_BUFFERS_FOR_OUTP= UT`` > > + control to get the minimum number of buffers required, and pass = the > > + obtained value plus the number of additional buffers needed in t= he > > + ``count`` field to :c:func:`VIDIOC_REQBUFS`. > > + > > + Alternatively, :c:func:`VIDIOC_CREATE_BUFS` can be used to have mor= e > > + control over buffer allocation. > > + > > + * **Required fields:** > > + > > + ``count`` > > + requested number of buffers to allocate; greater than zero > > + > > + ``type`` > > + a ``V4L2_BUF_TYPE_*`` enum appropriate for ``OUTPUT`` > > + > > + other fields > > + follow standard semantics > > + > > + * **Return fields:** > > + > > + ``count`` > > + adjusted to the number of allocated buffers > > + > > +6. Begin streaming on both ``OUTPUT`` and ``CAPTURE`` queues via > > + :c:func:`VIDIOC_STREAMON`. This may be performed in any order. The = actual > > + encoding process starts when both queues start streaming. > > + > > +.. note:: > > + > > + If the client stops the ``CAPTURE`` queue during the encode process= and then > > + restarts it again, the encoder will begin generating a stream indep= endent > > + from the stream generated before the stop. The exact constraints de= pend > > + on the coded format, but may include the following implications: > > + > > + * encoded frames produced after the restart must not reference any > > + frames produced before the stop, e.g. no long term references for > > + H.264, > > + > > + * any headers that must be included in a standalone stream must be > > + produced again, e.g. SPS and PPS for H.264. > > + > > +Encoding > > +=3D=3D=3D=3D=3D=3D=3D=3D > > + > > +This state is reached after the `Initialization` sequence finishes > > +successfully. In this state, the client queues and dequeues buffers t= o both > > +queues via :c:func:`VIDIOC_QBUF` and :c:func:`VIDIOC_DQBUF`, following= the > > +standard semantics. > > + > > +The contents of encoded ``CAPTURE`` buffers depend on the active coded= pixel > > +format and may be affected by codec-specific extended controls, as sta= ted > > +in the documentation of each format. > > + > > +Both queues operate independently, following standard behavior of V4L2= buffer > > +queues and memory-to-memory devices. In addition, the order of encoded= frames > > +dequeued from the ``CAPTURE`` queue may differ from the order of queui= ng raw > > +frames to the ``OUTPUT`` queue, due to properties of the selected code= d format, > > +e.g. frame reordering. > > + > > +The client must not assume any direct relationship between ``CAPTURE``= and > > +``OUTPUT`` buffers and any specific timing of buffers becoming > > +available to dequeue. Specifically: > > + > > +* a buffer queued to ``OUTPUT`` may result in more than 1 buffer produ= ced on > > + ``CAPTURE`` (if returning an encoded frame allowed the encoder to re= turn a > > + frame that preceded it in display, but succeeded it in the decode or= der), > > + > > +* a buffer queued to ``OUTPUT`` may result in a buffer being produced = on > > + ``CAPTURE`` later into encode process, and/or after processing furth= er > > + ``OUTPUT`` buffers, or be returned out of order, e.g. if display > > + reordering is used, > > + > > +* buffers may become available on the ``CAPTURE`` queue without additi= onal > > + buffers queued to ``OUTPUT`` (e.g. during drain or ``EOS``), because= of the > > + ``OUTPUT`` buffers queued in the past whose decoding results are onl= y > > + available at later time, due to specifics of the decoding process, > > + > > +* buffers queued to ``OUTPUT`` may not become available to dequeue ins= tantly > > + after being encoded into a corresponding ``CATPURE`` buffer, e.g. if= the > > + encoder needs to use the frame as a reference for encoding further f= rames. > > + > > +.. note:: > > + > > + To allow matching encoded ``CAPTURE`` buffers with ``OUTPUT`` buffe= rs they > > + originated from, the client can set the ``timestamp`` field of the > > + :c:type:`v4l2_buffer` struct when queuing an ``OUTPUT`` buffer. The > > + ``CAPTURE`` buffer(s), which resulted from encoding that ``OUTPUT``= buffer > > + will have their ``timestamp`` field set to the same value when dequ= eued. > > + > > + In addition to the straightforward case of one ``OUTPUT`` buffer pr= oducing > > + one ``CAPTURE`` buffer, the following cases are defined: > > + > > + * one ``OUTPUT`` buffer generates multiple ``CAPTURE`` buffers: the= same > > + ``OUTPUT`` timestamp will be copied to multiple ``CAPTURE`` buffe= rs, > > + > > + * the encoding order differs from the presentation order (i.e. the > > + ``CAPTURE`` buffers are out-of-order compared to the ``OUTPUT`` b= uffers): > > + ``CAPTURE`` timestamps will not retain the order of ``OUTPUT`` ti= mestamps > > + and thus monotonicity of the timestamps cannot be guaranteed. > > + > > +.. note:: > > + > > + To let the client distinguish between frame types (keyframes, inter= mediate > > + frames; the exact list of types depends on the coded format), the > > + ``CAPTURE`` buffers will have corresponding flag bits set in their > > + :c:type:`v4l2_buffer` struct when dequeued. See the documentation o= f > > + :c:type:`v4l2_buffer` and each coded pixel format for exact list of= flags > > + and their meanings. > > I don't think we can require this since a capture buffer may contain mult= iple > encoded frames. > I thought we required that only one encoded frame was in one CAPTURE buffer. Real time use cases rely heavily on this frame type information, so I can't imagine not requiring this. > It would actually make more sense to return it in the output buffer, but = I don't > know if a hardware encoder can actually provide that information. > I believe all the already existing drivers provide the information about the encoded frame type, but I don't think they provide the information about what source frame it came from. > Another use of these flags for an output buffer is to force a keyframe if= for > example a scene change was detected. > > My feeling is that we should drop this note. Forcing a keyframe by settin= g that > flag for the output buffer might actually be a useful thing to do for a s= tateful > encoder. > However, to force keyframe, one sets it in the OUTPUT buffer. Then, to actually get the right CAPTURE buffer, one has to look for one with this flag set. > > + > > +Encoding parameter changes > > +=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D > > + > > +The client is allowed to use :c:func:`VIDIOC_S_CTRL` to change encoder > > +parameters at any time. The availability of parameters is encoder-spec= ific > > +and the client must query the encoder to find the set of available con= trols. > > + > > +The ability to change each parameter during encoding is encoder-specif= ic, as > > +per the standard semantics of the V4L2 control interface. The client m= ay > > +attempt to set a control during encoding and if the operation fails wi= th the > > +-EBUSY error code, the ``CAPTURE`` queue needs to be stopped for the > > +configuration change to be allowed. To do this, it may follow the `Dra= in` > > +sequence to avoid losing the already queued/encoded frames. > > + > > +The timing of parameter updates is encoder-specific, as per the standa= rd > > +semantics of the V4L2 control interface. If the client needs to apply = the > > +parameters exactly at specific frame, using the Request API > > +(:ref:`media-request-api`) should be considered, if supported by the e= ncoder. > > + > > +Drain > > +=3D=3D=3D=3D=3D > > + > > +To ensure that all the queued ``OUTPUT`` buffers have been processed a= nd the > > +related ``CAPTURE`` buffers are given to the client, the client must f= ollow the > > +drain sequence described below. After the drain sequence ends, the cli= ent has > > +received all encoded frames for all ``OUTPUT`` buffers queued before t= he > > +sequence was started. > > + > > +1. Begin the drain sequence by issuing :c:func:`VIDIOC_ENCODER_CMD`. > > + > > + * **Required fields:** > > + > > + ``cmd`` > > + set to ``V4L2_ENC_CMD_STOP`` > > + > > + ``flags`` > > + set to 0 > > + > > + ``pts`` > > + set to 0 > > + > > + .. warning:: > > + > > + The sequence can be only initiated if both ``OUTPUT`` and ``CAPT= URE`` > > + queues are streaming. For compatibility reasons, the call to > > + :c:func:`VIDIOC_ENCODER_CMD` will not fail even if any of the qu= eues is > > + not streaming, but at the same time it will not initiate the `Dr= ain` > > + sequence and so the steps described below would not be applicabl= e. > > + > > +2. Any ``OUTPUT`` buffers queued by the client before the > > + :c:func:`VIDIOC_ENCODER_CMD` was issued will be processed and encod= ed as > > + normal. The client must continue to handle both queues independentl= y, > > + similarly to normal encode operation. This includes: > > + > > + * queuing and dequeuing ``CAPTURE`` buffers, until a buffer marked = with the > > + ``V4L2_BUF_FLAG_LAST`` flag is dequeued, > > + > > + .. warning:: > > + > > + The last buffer may be empty (with :c:type:`v4l2_buffer` > > + ``bytesused`` =3D 0) and in that case it must be ignored by th= e client, > > + as it does not contain an encoded frame. > > + > > + .. note:: > > + > > + Any attempt to dequeue more buffers beyond the buffer marked w= ith > > + ``V4L2_BUF_FLAG_LAST`` will result in a -EPIPE error from > > + :c:func:`VIDIOC_DQBUF`. > > + > > + * dequeuing processed ``OUTPUT`` buffers, until all the buffers que= ued > > + before the ``V4L2_ENC_CMD_STOP`` command are dequeued, > > + > > + * dequeuing the ``V4L2_EVENT_EOS`` event, if the client subscribes = to it. > > + > > + .. note:: > > + > > + For backwards compatibility, the encoder will signal a ``V4L2_EV= ENT_EOS`` > > + event when the last frame has been decoded and all frames are re= ady to be > > decoded -> encoded > Ack. Best regards, Tomasz