Received: by 2002:a05:6358:11c7:b0:104:8066:f915 with SMTP id i7csp7092386rwl; Wed, 22 Mar 2023 22:04:32 -0700 (PDT) X-Google-Smtp-Source: AK7set++M/luvyeBvdCHWGTRh5rywcTcwkTbtCsaQfo0LkK44lvLuOo4bQXSLhCnRFfIFKA7LW2C X-Received: by 2002:a17:906:8a62:b0:886:221b:44e5 with SMTP id hy2-20020a1709068a6200b00886221b44e5mr9620399ejc.62.1679547872393; Wed, 22 Mar 2023 22:04:32 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1679547872; cv=none; d=google.com; s=arc-20160816; b=meOCJEs2IfkCP1yn388LawAL9QgYqVpc6+KkZP53CjCEONOhdBFyLbamkTU+LwH+MP 6YrVA5q+xbvvyoKRn3RmkSgycR77Z4WdhE0BTkhChPgi+LNFlk9TPcfaJkOTimFz++g0 GUbwWVX8FWHzkPzpgxskAgY3bfs6SGS18wAoGGcjQrIQtEnZ5v2Lw9yXcB3WzcZmk88k NFDUrq1yfo1nrzFYzN0MB1htewCBk/8cwn4JqV5v67gxNsMq0Orc1xCrl1FpcEgqLn6C WLqC9LUF661BCrgQ9xy0CTeN7adfEZUm4oBqMuX9rZF86YDMuS0GvfW/xSTBjFLLc2Lw Mjvg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=ldk1oEdnREHz6mvQo1fX9bVoiaivC6z1A1MBonCqWKQ=; b=ABWU1CK+cUxXLkeax76FMzY8vt2VFfi20Fisskc10A6a6wXCotlrGPtGTMg8Gyiv1K tJ0SIKE4HlLy+ql7RyYe12v6sNq4S7qMWpawR1szsq9eh5DrYVs/XrynqOjZjy4yR14c GyXMFQEv8luXZBBhPKCDX7zQK037WHam1URrBv5tDl02MWl3mpMRBC8CRHn4iPj0kPUM EB/x+vHHBN5nNN4GjjRBg06retCwS1Fr8qKrXl/EOrR+O2Io5NNRnuBtAJDl6yDzI+z9 QUeFqkYBLRGbU7AaMvoLjvyN/gyGRTghyVGpN2GFulgHBvSnRhOCuxb2uQ72O0AsfAva KoZg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=YLtDnTQe; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id ga42-20020a1709070c2a00b0093c121f879fsi1507061ejc.45.2023.03.22.22.04.07; Wed, 22 Mar 2023 22:04:32 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=YLtDnTQe; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229729AbjCWE6f (ORCPT + 99 others); Thu, 23 Mar 2023 00:58:35 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34054 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229951AbjCWE6M (ORCPT ); Thu, 23 Mar 2023 00:58:12 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [IPv6:2604:1380:4601:e00::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2004B2139 for ; Wed, 22 Mar 2023 21:58:08 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id CE688B81DD0 for ; Thu, 23 Mar 2023 04:58:06 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 32B92C433EF; Thu, 23 Mar 2023 04:58:05 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linuxfoundation.org; s=korg; t=1679547485; bh=bW3yke4cHvXoTJKbE4z1vm267YxnsHyDh3dtgS9IfMI=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=YLtDnTQewUVOPtVQzCyHqJ6ASgxMB//VBjnxh53ePlw9DbcgQG3JV2MTy38noEK3C RUiunu26HYYNk/kwb9HL72FcG9Zt7b2klx0vY+APqOXgoJ0qNTCfXzu1Pyzil9+BSg WAiPFjERIkOo8p14uyeRZr7bPAfcHMjYCgve5Hd4= Date: Thu, 23 Mar 2023 05:58:02 +0100 From: Greg KH To: Julia Lawall Cc: Alex Elder , Menna Mahmoud , outreachy@lists.linux.dev, johan@kernel.org, elder@kernel.org, linux-kernel@vger.kernel.org, linux-staging@lists.linux.dev Subject: Re: [PATCH v2] staging: greybus: use inline function for macros Message-ID: References: <20230321183456.10385-1-eng.mennamahmoud.mm@gmail.com> <2e869677-2693-6419-ea25-f0cc2efcf3dd@ieee.org> <5efa6e6d-8573-31de-639a-d15b2e9deca0@ieee.org> <48674d8f-9753-780c-f37c-f83ea2855ae6@ieee.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-2.5 required=5.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED,SPF_HELO_NONE, SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Mar 22, 2023 at 11:00:41AM +0100, Julia Lawall wrote: > Greg raised the question of whether the inline function is really as > efficient as a macro. > > I tried the following definitions: > > #define to_gbphy_dev(d) container_of(d, struct gbphy_device, dev) > > static inline struct gbphy_device *extra_to_gbphy_dev(const struct device *_dev) > { > return container_of(_dev, struct gbphy_device, dev); > } > > And the following uses: > > ssize_t macro_protocol_id_show(struct device *dev, > struct device_attribute *attr, char *buf) > { > struct gbphy_device *gbphy_dev = to_gbphy_dev(dev); > > return sprintf(buf, "%c macro 0x%02x\n", *buf, gbphy_dev->cport_desc->protocol_id); > } > ssize_t extra_protocol_id_show(struct device *dev, > struct device_attribute *attr, char *buf) > { > struct gbphy_device *gbphy_dev = extra_to_gbphy_dev(dev); > > return sprintf(buf, "extra 0x%02x %c\n", gbphy_dev->cport_desc->protocol_id, *buf); > } > > They are a little bit different to avoid too much compiler optimization. > > After doing make drivers/staging/greybus/gbphy.s, I get similar looking > code in both cases: > > Macro version: > > .type macro_protocol_id_show, @function > macro_protocol_id_show: > endbr64 > 1: call __fentry__ > .section __mcount_loc, "a",@progbits > .quad 1b > .previous > pushq %rbp # > movq %rdx, %rbp # tmp96, buf > pushq %rbx # > # drivers/staging/greybus/gbphy.c:40: { > movq %rdi, %rbx # tmp95, dev > # drivers/staging/greybus/gbphy.c:43: return sprintf(buf, "%c macro 0x%02x\n", *buf, gbphy_dev->cport_desc->protocol_id); > call __sanitizer_cov_trace_pc # > # drivers/staging/greybus/gbphy.c:43: return sprintf(buf, "%c macro 0x%02x\n", *buf, gbphy_dev->cport_desc->protocol_id); > movq -32(%rbx), %rax # MEM[(struct gbphy_device *)dev_7(D) + -40B].cport_desc, MEM[(struct gbphy_device *)dev_7(D) + -40B].cport_desc > # drivers/staging/greybus/gbphy.c:43: return sprintf(buf, "%c macro 0x%02x\n", *buf, gbphy_dev->cport_desc->protocol_id); > movzbl 0(%rbp), %edx # *buf_9(D), *buf_9(D) > movq %rbp, %rdi # buf, > movq $.LC18, %rsi #, > movzbl 3(%rax), %ecx # _1->protocol_id, _1->protocol_id > call sprintf # > # drivers/staging/greybus/gbphy.c:44: } > movl $13, %eax #, > popq %rbx # > popq %rbp # > jmp __x86_return_thunk > .size macro_protocol_id_show, .-macro_protocol_id_show > > Function version: > > .type extra_protocol_id_show, @function > extra_protocol_id_show: > endbr64 > 1: call __fentry__ > .section __mcount_loc, "a",@progbits > .quad 1b > .previous > pushq %rbp # > movq %rdx, %rbp # tmp96, buf > pushq %rbx # > # drivers/staging/greybus/gbphy.c:47: { > movq %rdi, %rbx # tmp95, dev > # drivers/staging/greybus/gbphy.c:50: return sprintf(buf, "extra 0x%02x %c\n", gbphy_dev->cport_desc->protocol_id, *buf); > call __sanitizer_cov_trace_pc # > # drivers/staging/greybus/gbphy.c:50: return sprintf(buf, "extra 0x%02x %c\n", gbphy_dev->cport_desc->protocol_id, *buf); > movq -32(%rbx), %rax # MEM[(struct gbphy_device *)dev_8(D) + -40B].cport_desc, MEM[(struct gbphy_device *)dev_8(D) + -40B].cport_desc > # drivers/staging/greybus/gbphy.c:50: return sprintf(buf, "extra 0x%02x %c\n", gbphy_dev->cport_desc->protocol_id, *buf); > movzbl 0(%rbp), %ecx # *buf_9(D), *buf_9(D) > movq %rbp, %rdi # buf, > movq $.LC19, %rsi #, > movzbl 3(%rax), %edx # _3->protocol_id, _3->protocol_id > call sprintf # > # drivers/staging/greybus/gbphy.c:51: } > movl $13, %eax #, > popq %rbx # > popq %rbp # > jmp __x86_return_thunk > .size extra_protocol_id_show, .-extra_protocol_id_show > > Both seem to access the memory directly. Maybe the example is too simple, > and the compiler is more likely to optimize aggressively? Nice, that shows that it is the same both ways as the compiler version you are using is smart enough Which compiler and version is this? Does it work the same for all of the supported versions we have to support (i.e. really old gcc?) For the most part, sysfs files are not on any sort of "fast path" so a function call is fine, but as I mentioned before, sometimes we are forced to move calls to container_of() to container_of_const() and that can not be an inline function, but must remain a macro :( thanks, greg k-h