Received: by 2002:a05:6358:489b:b0:bb:da1:e618 with SMTP id x27csp4238979rwn; Sun, 11 Sep 2022 07:53:20 -0700 (PDT) X-Google-Smtp-Source: AA6agR6xNpgWEuzPDRyDEaYG8rOTF+JAnX7GpJ+z0WLJvlpqKiaGf79tZoHVDLnPMVnBe/HWli0F X-Received: by 2002:a17:906:8a5b:b0:776:8578:e241 with SMTP id gx27-20020a1709068a5b00b007768578e241mr11449830ejc.365.1662908000329; Sun, 11 Sep 2022 07:53:20 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1662908000; cv=none; d=google.com; s=arc-20160816; b=kAMNbqQQ6GJHRKE+YI/ZbHOB5e80bgLw7GocVIalIkHEH92iI0oqOLI+hjFddEuKKO J5XmKvIBAYr+35WaGMCEBUcJcXxm3CF64NyLUDFPgHha8JlZeb4Mg2R9M1Gq0gItq9bF yU5LCOysgqpaETU8J4kisprOhGliHRFNoRpQU7jE+z371sZ36bIwJUSidnvgpE9taOCD VACkWKH2IdfKwDt6G8K4aLngeIkx8p+d3o0MCr/ZJpahcOIPM5GuCRmla/YOPcz6ScEC lqzCtXPNjShJjrtHEEIhrpLotJ7BIzCBbJlqU1IDCZEmbQsRouHJNO/1xBZ+K4KeZru5 gSqg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=+QbBBQ6vXh/jN8AiRMD0EN8OQm6rohb3gtaPePWgvvs=; b=z1ObMYKlmSbGRH2zhCsZGchzVQdzNiYPdxpogA7DbHbaZUv3KiVnWXUwrobWycDEax 4Y0Zy/gq47+LBrMKwZJMTGJ1e5nb6ZlhbR4R6LPSsucH76gNJauFZncaMOHQYFWVF7Ar 4CAbETxUzmpqbcNrGv1eMKXuH1WcsNwr5RnWlDsQJTYLLZgtd5lK519jG9sg16B7DQHx upVzlKZEzkL2SFp2U24rr1YFqaQOy585W4tE2+04EWgKpvT75j/NP4QxVg2YAEbrkWhk 69sSCpx3IaoWAFAI6Pcxyrv0Z8pTlgTOI2X1uWxtequIgIPqDvmtaJi3EKYIDgsFGOrh UKkA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=cY4w5okY; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id gk6-20020a17090790c600b0077404c1e776si3948766ejb.935.2022.09.11.07.52.53; Sun, 11 Sep 2022 07:53:20 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=cY4w5okY; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229446AbiIKOom (ORCPT + 99 others); Sun, 11 Sep 2022 10:44:42 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:45034 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229510AbiIKOoi (ORCPT ); Sun, 11 Sep 2022 10:44:38 -0400 Received: from mail-oa1-x2e.google.com (mail-oa1-x2e.google.com [IPv6:2001:4860:4864:20::2e]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C3980DF09 for ; Sun, 11 Sep 2022 07:44:36 -0700 (PDT) Received: by mail-oa1-x2e.google.com with SMTP id 586e51a60fabf-127dca21a7dso16902518fac.12 for ; Sun, 11 Sep 2022 07:44:36 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :from:to:cc:subject:date; bh=+QbBBQ6vXh/jN8AiRMD0EN8OQm6rohb3gtaPePWgvvs=; b=cY4w5okYPvGlxF8rGm7JyVdKOoARQdMMV6L/qGxKWG8ToKq5UhWWVrtuJMvQJdcPia IVCGnQndFQntsAJ3qRmfH78IHe47YKRuukKe71yNUwiYo+SYbwiwEI5lWskjjNEEremj msHqz3MS1Kp+i6dBJ5O0j7dU04G7cBOmqyJW4qJ8C/sC4vD9MTb1KkbyBzqEKFPWr3p8 m2sUsqW/to+JMo7ZK9z2v04jdDPcfNBV5NrJp/gjmTDzAPs7KUKW5ZOJvm0/Z9q4v7+a isFAJf3c9Zn0jAuSHIb7bB+e9DEzcYcErKiX9PhF61RkB2dKyRdWNgfvIXsgPFK6myr8 MiLg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :x-gm-message-state:from:to:cc:subject:date; bh=+QbBBQ6vXh/jN8AiRMD0EN8OQm6rohb3gtaPePWgvvs=; b=rGOQqPM0fk+LUsresBYrRjts+gFXdohQQj6TtaAHjlbpNBl8y+/JpZsSE5EGcNtcLl t2mKsHclhJGd68EOnAiBtQtqNPKFn2/KJZjvHiVE0VrlgVxlhdt2Rt0DBtdGs8c5rU5r uxB5ZcVwamKi/OiGT3VOwneFs6Dz2PT/YLInx8Ia5ACRAxMCk8e/a0/ZKo8mIMrbA0Qe 2rSIoRveKTTSG6H48JW+/diHnRUklSARtcE41JU+HDv/clg7YWmtAU7NFwgkG7A1XqZi sACC/nEAmlAfTPzzVXeBj+whZwzvoi7qiSrg7zNVHJ7E2dPPOzb+QsJX2URrz2Kwdc6/ vVzQ== X-Gm-Message-State: ACgBeo06AhtiW5urVBJPmXcg1o44S/XpmBdfjtbQ2IES8fhWw3A+TcGH 0gVW94Pk0ReGbeua08QwJL5ltcmojWc= X-Received: by 2002:a05:6870:c18a:b0:101:fe5b:bd4e with SMTP id h10-20020a056870c18a00b00101fe5bbd4emr9348351oad.275.1662907475589; Sun, 11 Sep 2022 07:44:35 -0700 (PDT) Received: from ?IPV6:2804:431:c7f5:f684:ee06:25a5:3122:5cd3? ([2804:431:c7f5:f684:ee06:25a5:3122:5cd3]) by smtp.gmail.com with ESMTPSA id c8-20020a056870b28800b001275f056133sm3927439oao.51.2022.09.11.07.44.33 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Sun, 11 Sep 2022 07:44:35 -0700 (PDT) Message-ID: <17957593-33fb-a63d-181e-daee2e4689fc@gmail.com> Date: Sun, 11 Sep 2022 11:44:32 -0300 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.13.0 Subject: Re: [PATCH] drm/vkms: fix 32bit compilation error by replacing macros Content-Language: en-US To: Melissa Wen Cc: rodrigosiqueiramelo@gmail.com, melissa.srw@gmail.com, airlied@linux.ie, daniel@ffwll.ch, Sudip Mukherjee , dri-devel@lists.freedesktop.org, linux-kernel@vger.kernel.org References: <20220909114133.267689-1-mwen@igalia.com> <29f87796-b288-7cdc-86dd-050cf7f0b5dd@gmail.com> <20220910191035.ukxhlhbc3mscbqis@mail.igalia.com> From: Igor Matheus Andrade Torrente In-Reply-To: <20220910191035.ukxhlhbc3mscbqis@mail.igalia.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-6.2 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM,NICE_REPLY_A, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 9/10/22 16:10, Melissa Wen wrote: > On 09/09, Igor Matheus Andrade Torrente wrote: >> Hi Mellisa, >> >> Thanks for the patch fixing my mistakes. >> >> On 9/9/22 08:41, Melissa Wen wrote: >>> Replace vkms_formats macros for fixed-point operations with functions >>> from drm/drm_fixed.h to do the same job and fix 32-bit compilation >>> errors. >>> >>> Fixes: a19c2ac9858 ("drm: vkms: Add support to the RGB565 format") >>> Tested-by: Sudip Mukherjee >>> Reported-by: Sudip Mukherjee >>> Reported-by: kernel test robot >>> Signed-off-by: Melissa Wen >>> --- >>> drivers/gpu/drm/vkms/vkms_formats.c | 53 +++++++++++------------------ >>> 1 file changed, 19 insertions(+), 34 deletions(-) >>> >>> diff --git a/drivers/gpu/drm/vkms/vkms_formats.c b/drivers/gpu/drm/vkms/vkms_formats.c >>> index 300abb4d1dfe..ddcd3cfeeaac 100644 >>> --- a/drivers/gpu/drm/vkms/vkms_formats.c >>> +++ b/drivers/gpu/drm/vkms/vkms_formats.c >>> @@ -1,27 +1,12 @@ >>> // SPDX-License-Identifier: GPL-2.0+ >>> -#include >>> +#include >>> #include >>> +#include >>> +#include >>> #include "vkms_formats.h" >>> -/* The following macros help doing fixed point arithmetic. */ >>> -/* >>> - * With Fixed-Point scale 15 we have 17 and 15 bits of integer and fractional >>> - * parts respectively. >>> - * | 0000 0000 0000 0000 0.000 0000 0000 0000 | >>> - * 31 0 >>> - */ >>> -#define SHIFT 15 >>> - >>> -#define INT_TO_FIXED(a) ((a) << SHIFT) >>> -#define FIXED_MUL(a, b) ((s32)(((s64)(a) * (b)) >> SHIFT)) >>> -#define FIXED_DIV(a, b) ((s32)(((s64)(a) << SHIFT) / (b))) >>> -/* This macro converts a fixed point number to int, and round half up it */ >>> -#define FIXED_TO_INT_ROUND(a) (((a) + (1 << (SHIFT - 1))) >> SHIFT) >>> -#define INT_TO_FIXED_DIV(a, b) (FIXED_DIV(INT_TO_FIXED(a), INT_TO_FIXED(b))) >>> -#define INT_TO_FIXED_DIV(a, b) (FIXED_DIV(INT_TO_FIXED(a), INT_TO_FIXED(b))) >>> - >>> static size_t pixel_offset(const struct vkms_frame_info *frame_info, int x, int y) >>> { >>> return frame_info->offset + (y * frame_info->pitch) >>> @@ -137,19 +122,19 @@ static void RGB565_to_argb_u16(struct line_buffer *stage_buffer, >>> int x_limit = min_t(size_t, drm_rect_width(&frame_info->dst), >>> stage_buffer->n_pixels); >>> - s32 fp_rb_ratio = INT_TO_FIXED_DIV(65535, 31); >>> - s32 fp_g_ratio = INT_TO_FIXED_DIV(65535, 63); >>> + s32 fp_rb_ratio = drm_fixp_div(drm_int2fixp(65535), 31); >>> + s32 fp_g_ratio = drm_fixp_div(drm_int2fixp(65535), 63); >> >> I think you need to add `drm_int2fixp` to 31 and 63. >> >>> for (size_t x = 0; x < x_limit; x++, src_pixels++) { >>> u16 rgb_565 = le16_to_cpu(*src_pixels); >>> - s32 fp_r = INT_TO_FIXED((rgb_565 >> 11) & 0x1f); >>> - s32 fp_g = INT_TO_FIXED((rgb_565 >> 5) & 0x3f); >>> - s32 fp_b = INT_TO_FIXED(rgb_565 & 0x1f); >>> + s32 fp_r = drm_int2fixp((rgb_565 >> 11) & 0x1f); >>> + s32 fp_g = drm_int2fixp((rgb_565 >> 5) & 0x3f); >>> + s32 fp_b = drm_int2fixp(rgb_565 & 0x1f); >> >> And we are cast implicitly from 64 bits int to 32 bits which is >> implementation-defined AFAIK. So, probably we should be using `s64` for all >> of these variables. >> >> I tested the patch. And I'm seeing some differences in the intermediate >> results. From my testing, these changes solve those differences. > > Hi Igor, > > Thanks for checking the calc results and all inputs provided. I just > sent a second version, can you take a look? I replicated your > suggestions for RGB565_to_argb_u16() in argb_u16_to_RGB565() and > double-checked for i386 and arm. Let me know what yexiuou think. I tested it here and everything seem to be working :). > >> >> Another thing that may have an impact on the final output is the lack of >> rounding in drm_fixed.h. This can potentially produce the wrong result. > > Yeah, I see... I can include a comment about the rounding issue for > further improvements, or do you plan to work on it? A comment would be good. Maybe add to the VKMS TODO list? I'm busy with other stuffs, and can't work on this any time soon. But It's pretty simple thing to implement. > > Thanks, > > Melissa >> >> Thanks, >> --- >> Igor Torrente >> >>> out_pixels[x].a = (u16)0xffff; >>> - out_pixels[x].r = FIXED_TO_INT_ROUND(FIXED_MUL(fp_r, fp_rb_ratio)); >>> - out_pixels[x].g = FIXED_TO_INT_ROUND(FIXED_MUL(fp_g, fp_g_ratio)); >>> - out_pixels[x].b = FIXED_TO_INT_ROUND(FIXED_MUL(fp_b, fp_rb_ratio)); >>> + out_pixels[x].r = drm_fixp2int(drm_fixp_mul(fp_r, fp_rb_ratio)); >>> + out_pixels[x].g = drm_fixp2int(drm_fixp_mul(fp_g, fp_g_ratio)); >>> + out_pixels[x].b = drm_fixp2int(drm_fixp_mul(fp_b, fp_rb_ratio)); >>> } >>> } >>> @@ -248,17 +233,17 @@ static void argb_u16_to_RGB565(struct vkms_frame_info *frame_info, >>> int x_limit = min_t(size_t, drm_rect_width(&frame_info->dst), >>> src_buffer->n_pixels); >>> - s32 fp_rb_ratio = INT_TO_FIXED_DIV(65535, 31); >>> - s32 fp_g_ratio = INT_TO_FIXED_DIV(65535, 63); >>> + s32 fp_rb_ratio = drm_fixp_div(drm_int2fixp(65535), 31); >>> + s32 fp_g_ratio = drm_fixp_div(drm_int2fixp(65535), 63); >>> for (size_t x = 0; x < x_limit; x++, dst_pixels++) { >>> - s32 fp_r = INT_TO_FIXED(in_pixels[x].r); >>> - s32 fp_g = INT_TO_FIXED(in_pixels[x].g); >>> - s32 fp_b = INT_TO_FIXED(in_pixels[x].b); >>> + s32 fp_r = drm_int2fixp(in_pixels[x].r); >>> + s32 fp_g = drm_int2fixp(in_pixels[x].g); >>> + s32 fp_b = drm_int2fixp(in_pixels[x].b); >>> - u16 r = FIXED_TO_INT_ROUND(FIXED_DIV(fp_r, fp_rb_ratio)); >>> - u16 g = FIXED_TO_INT_ROUND(FIXED_DIV(fp_g, fp_g_ratio)); >>> - u16 b = FIXED_TO_INT_ROUND(FIXED_DIV(fp_b, fp_rb_ratio)); >>> + u16 r = drm_fixp2int(drm_fixp_div(fp_r, fp_rb_ratio)); >>> + u16 g = drm_fixp2int(drm_fixp_div(fp_g, fp_g_ratio)); >>> + u16 b = drm_fixp2int(drm_fixp_div(fp_b, fp_rb_ratio)); >>> *dst_pixels = cpu_to_le16(r << 11 | g << 5 | b); >>> } >>