Received: by 2002:a25:31c3:0:0:0:0:0 with SMTP id x186csp909651ybx; Thu, 31 Oct 2019 02:51:56 -0700 (PDT) X-Google-Smtp-Source: APXvYqzgVZWPlea0/cIk44nu78yHNNg9vz9MGsf1EjlwyDB4P4lfxlA0XL9HSkkxqnfTHilJYJfR X-Received: by 2002:a17:906:2d68:: with SMTP id f8mr2867575eji.71.1572515516146; Thu, 31 Oct 2019 02:51:56 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1572515516; cv=none; d=google.com; s=arc-20160816; b=CeAICkKFfbWr3BdShdQOG2PlBF2G7AXbPwE8NXwROeQgTIMkoTs9cwop7fVZ4FriT8 j2tWEk918DbQ3ej2C7vITiCNnkCyO4n9QG6G7Srdc5FVuxEGwNQvxi7DFCoLuDujdLOU C8lxkRgY/IT7JQyDB0iyyZB1ikNwMeg6xn8ctNChssc7F9uhb0aI8gzKSuamuvP6bLeu 8kai3ImYOOerlVfEKqItkYuf92TU/AGwBd0C3Br8KFys1lnvasHZ2LPG3A91I+5EE2Q1 W9ELP+nHQqVCAjYyYs8lYXnckoEJjo7o3OzgApwZxVi3fMyWTH1VsNeW20VzYR0vNMUz qPuw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date; bh=+C/mrP7oCaOMjnu+hdybduL2l1d77L5HqsaLZQ74KvE=; b=yG+KQph13LS7/8nW7+efpz2JrmIwgHIK67an61+gkQDKHNGqRfMwm4exZUO5guzpkg yeMkF6IpDG1AqQbnhbGyXMmq+RWgFAbc3XkDOrA9I524gkb23Z1M+FRB+5MthJsONfYL keKOYQHD+KgboXVYXglEuE/+AjzMfLGss1ov+y7PdTzaNN2xGQBHOf/yNo9AE/jTOCIJ s8Z7+7GlwQQRBiM1X5PPn2LcwKj5uJWsos133i+5akYRwE/4biXCBDVngvuAxkny6RQb uKB82SeRQMVMTyXn/Oy5zW5eyxmDY4BO2IusRsnfXoQ6lgtkMC0dp0h0BQxLOKUgkuFz NI3w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=collabora.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c49si3776161edc.361.2019.10.31.02.51.32; Thu, 31 Oct 2019 02:51:56 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=collabora.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726971AbfJaJub (ORCPT + 99 others); Thu, 31 Oct 2019 05:50:31 -0400 Received: from bhuna.collabora.co.uk ([46.235.227.227]:45634 "EHLO bhuna.collabora.co.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726897AbfJaJub (ORCPT ); Thu, 31 Oct 2019 05:50:31 -0400 Received: from localhost (unknown [IPv6:2a01:e0a:2c:6930:5cf4:84a1:2763:fe0d]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) (Authenticated sender: bbrezillon) by bhuna.collabora.co.uk (Postfix) with ESMTPSA id 8FAF828B418; Thu, 31 Oct 2019 09:50:28 +0000 (GMT) Date: Thu, 31 Oct 2019 10:50:25 +0100 From: Boris Brezillon To: Jonas Karlman Cc: Mauro Carvalho Chehab , Ezequiel Garcia , Hans Verkuil , Tomasz Figa , "linux-media@vger.kernel.org" , "linux-kernel@vger.kernel.org" Subject: Re: [PATCH v2 05/10] media: hantro: Reduce H264 extra space for motion vectors Message-ID: <20191031105025.5a32113c@collabora.com> In-Reply-To: References: <20191029012430.24566-1-jonas@kwiboo.se> Organization: Collabora X-Mailer: Claws Mail 3.17.4 (GTK+ 2.24.32; x86_64-redhat-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 29 Oct 2019 01:24:50 +0000 Jonas Karlman wrote: > A decoded 8-bit 4:2:0 frame need memory for up to 448 bytes per > macroblock with additional 32 bytes on multi-core variants. > > Memory layout is as follow: > > +---------------------------+ > | Y-plane 256 bytes x MBs | > +---------------------------+ > | UV-plane 128 bytes x MBs | > +---------------------------+ > | MV buffer 64 bytes x MBs | > +---------------------------+ > | MC sync 32 bytes | > +---------------------------+ As for patch 4, can we point this diagram as a comment in the code too? > > Reduce the extra space allocated now that motion vector buffer offset no > longer is based on the extra space. > > Only allocate extra space for 64 bytes x MBs of motion vector buffer > and 32 bytes for multi-core sync. > > Fixes: a9471e25629b ("media: hantro: Add core bits to support H264 decoding") > Signed-off-by: Jonas Karlman Reviewed-by: Boris Brezillon > --- > Changes in v2: > - updated commit message > --- > drivers/staging/media/hantro/hantro_v4l2.c | 6 +++--- > 1 file changed, 3 insertions(+), 3 deletions(-) > > diff --git a/drivers/staging/media/hantro/hantro_v4l2.c b/drivers/staging/media/hantro/hantro_v4l2.c > index 3dae52abb96c..b3a4368b37de 100644 > --- a/drivers/staging/media/hantro/hantro_v4l2.c > +++ b/drivers/staging/media/hantro/hantro_v4l2.c > @@ -242,12 +242,12 @@ static int vidioc_try_fmt(struct file *file, void *priv, struct v4l2_format *f, > /* > * The H264 decoder needs extra space on the output buffers > * to store motion vectors. This is needed for reference > - * frames. > + * frames. 32 extra bytes is used for multi-core sync. > */ > if (ctx->vpu_src_fmt->fourcc == V4L2_PIX_FMT_H264_SLICE) > pix_mp->plane_fmt[0].sizeimage += > - 128 * DIV_ROUND_UP(pix_mp->width, 16) * > - DIV_ROUND_UP(pix_mp->height, 16); > + 64 * MB_WIDTH(pix_mp->width) * > + MB_WIDTH(pix_mp->height) + 32; > } else if (!pix_mp->plane_fmt[0].sizeimage) { > /* > * For coded formats the application can specify