Received: by 2002:a19:f614:0:0:0:0:0 with SMTP id x20csp53606lfe; Fri, 15 Apr 2022 19:07:54 -0700 (PDT) X-Google-Smtp-Source: ABdhPJydSp7RCIqJHVdqO4ItrDvdPeouuEiWK+lWnlU+R9cfWOAsrJFUUAcups+/Buziu7aNlaZQ X-Received: by 2002:a63:5710:0:b0:399:365e:5dde with SMTP id l16-20020a635710000000b00399365e5ddemr1421424pgb.192.1650074874085; Fri, 15 Apr 2022 19:07:54 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1650074874; cv=none; d=google.com; s=arc-20160816; b=mGJEabMhKIFqQquXDFminEXTpQGvqArtMsGNdlzuD0hn7hgiISHvRHhCCsfrN44Xp0 hs9oBKn6VNnOHExNzPRTasIfGSAco4vTduJxjE6nipZHk4lnPgB3iLol3smNXmeVr4Yn EScxCWgCN7UXUdCuV5zko5vnpJgT6OSLv0AR0BThWV01xUR2zx3hLpjHH3jLjR856ZQY U2ZsaALq5JLO/u+xj+21s1fu74V9ro2ZEuVxXnBa1P7rCAPpOWiHkhlis/W8DvKwzFte GW21Xl7FEChm8tXRB9GO+FwMy7toBqtaikCi0aTT6Az2lLEc4yV5DmnYp+drXEwBR+7j IaDA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=pW2avzf49LzqFPMRKUtc+7DmzIcKaIS1b7B3kGCgB7w=; b=FSa+aOQlsf3UH+OH83hemjkdusu3bq2vtHIN0skYagTY9TxSfZ2Dx/ckaTHOrRrJ/R i0ow/wMe8GHcaSkaWeQcKL1HuDrtL4bmqhY0wdqcAEqfVjaFNb4/qwWE1LnOLNDLwokI qm8rIbJQvEX596hmTbACWsHmNtzW3fwVs0dcCNKoSDJujCYjOJ+4JLJfSuSJGQk/jy/t GVI6K+umzOgmU4LvsZ71JRkJX3zbYBc7IhZMd3IOQ7oGVoI4th0gusoez8Ql4wNWZXfD Wq+1cw9yhxzsZVako7PxtRNeOhjbPOOhVABJg4Ul5lRhdYfwXwPdsj0PJuepEpMOP34s ZmSA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=l++9rF2+; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id z15-20020a170903018f00b001561bfa1339si3016870plg.547.2022.04.15.19.07.53 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 15 Apr 2022 19:07:54 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=l++9rF2+; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 0DEA911BD81; Fri, 15 Apr 2022 18:25:50 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1351603AbiDOIpr (ORCPT + 99 others); Fri, 15 Apr 2022 04:45:47 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53604 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1351612AbiDOIp0 (ORCPT ); Fri, 15 Apr 2022 04:45:26 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [IPv6:2604:1380:4601:e00::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 91B39BA31B; Fri, 15 Apr 2022 01:42:44 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 1B9EFB82C42; Fri, 15 Apr 2022 08:42:43 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 2186EC385A8; Fri, 15 Apr 2022 08:42:40 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linuxfoundation.org; s=korg; t=1650012161; bh=cu5bN2T1sput01IUK44V65OIVMTltnqIip044jyH98U=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=l++9rF2+wdQSpuGnK7HUIuDBk8geTdWFjbMMtmn6B9BNWav6dGtUmI1+Dd/HD9XUE h7MYVu34TY2NTDX3fO4CW9D6l2AHZ30xM7CvpyQVwbscZnrEonz3ACU5R9xK0CmSfl BaUpjXjK6j40X6TOzAYIFW3fT3oLEfKY3WxVnnTs= Date: Fri, 15 Apr 2022 10:42:38 +0200 From: Greg KH To: Jiri Slaby Cc: linux-serial@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH 1/3] tty: serial: introduce uart_port_tx{,_limit}() helpers Message-ID: References: <20220411105405.9519-1-jslaby@suse.cz> <20220411105405.9519-2-jslaby@suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Apr 15, 2022 at 09:47:26AM +0200, Jiri Slaby wrote: > On 14. 04. 22, 18:32, Greg KH wrote: > > On Mon, Apr 11, 2022 at 12:54:03PM +0200, Jiri Slaby wrote: > > > Many serial drivers do the same thing: > > > * send x_char if set > > > * keep sending from the xmit circular buffer until either > > > - the loop reaches the end of the xmit buffer > > > - TX is stopped > > > - HW fifo is full > > > * check for pending characters and: > > > - wake up tty writers to fill for more data into xmit buffer > > > - stop TX if there is nothing in the xmit buffer > > > > > > The only differences are: > > > * how to write the character to the HW fifo > > > * the check of the end condition: > > > - is the HW fifo full? > > > - is limit of the written characters reached? > > > > > > So unify the above into two helpers: > > > * uart_port_tx_limit() -- the generic one, it performs the above taking > > > into account the written characters limit > > > * uart_port_tx() -- calls the above with ~0 as the limit. So it only > > > checks the HW fullness. > > > > > > We need three more hooks in struct uart_ops for all this to work: > > > * tx_ready() -- returns true if HW can accept more data. > > > * put_char() -- write a character to the device. > > > * tx_done() -- when the write loop is done, perform arbitrary action > > > before potential invocation of ops->stop_tx() happens. > > > > > > NOTE1: Maybe the three hooks in uart_ops above are overkill. We can > > > instead pass pointers to the three functions directly to the new helpers > > > as they are not used elsewhere. Similar to uart_console_write() and its > > > putchar(). > > > > > > NOTE2: These two new helper functions call the hooks per every character > > > processed. I was unable to measure any difference, provided most time is > > > spent by readb (or alike) in the hooks themselves. First, LTO might > > > help to eliminate these explicit calls (we might need NOTE1 to be > > > implemented for this to be true). Second, if this turns out to be a > > > problem, we can introduce a macro to build the helper in the driver's > > > code instead of serial_core. That is, similar to wait_event(). > > > > > > Signed-off-by: Jiri Slaby > > > --- > > > Documentation/driver-api/serial/driver.rst | 28 ++++++++++++ > > > drivers/tty/serial/serial_core.c | 53 ++++++++++++++++++++++ > > > include/linux/serial_core.h | 9 ++++ > > > 3 files changed, 90 insertions(+) > > > > > > diff --git a/Documentation/driver-api/serial/driver.rst b/Documentation/driver-api/serial/driver.rst > > > index 06ec04ba086f..7dc3791addeb 100644 > > > --- a/Documentation/driver-api/serial/driver.rst > > > +++ b/Documentation/driver-api/serial/driver.rst > > > @@ -80,6 +80,34 @@ hardware. > > > This call must not sleep > > > + tx_ready(port) > > > + The driver returns true if the HW can accept more data to be sent. > > > + > > > + Locking: port->lock taken. > > > + > > > + Interrupts: locally disabled. > > > + > > > + This call must not sleep. > > > + > > > + put_char(port, ch) > > > + The driver is asked to write ch to the device. > > > + > > > + Locking: port->lock taken. > > > + > > > + Interrupts: locally disabled. > > > + > > > + This call must not sleep. > > > + > > > + tx_done(port) > > > + When the write loop is done, the driver can perform arbitrary action > > > + here before potential invocation of ops->stop_tx() happens. > > > + > > > + Locking: port->lock taken. > > > + > > > + Interrupts: locally disabled. > > > + > > > + This call must not sleep. > > > + > > > set_mctrl(port, mctrl) > > > This function sets the modem control lines for port described > > > by 'port' to the state described by mctrl. The relevant bits > > > diff --git a/drivers/tty/serial/serial_core.c b/drivers/tty/serial/serial_core.c > > > index 6a8963caf954..1be14e90066c 100644 > > > --- a/drivers/tty/serial/serial_core.c > > > +++ b/drivers/tty/serial/serial_core.c > > > @@ -107,6 +107,59 @@ void uart_write_wakeup(struct uart_port *port) > > > } > > > EXPORT_SYMBOL(uart_write_wakeup); > > > +static bool uart_port_tx_always_ready(struct uart_port *port) > > > +{ > > > + return true; > > > +} > > > + > > > +/** > > > + * uart_port_tx_limit -- transmit helper for uart_port > > > + * @port: from which port to transmit > > > + * @count: limit count > > > + * > > > + * uart_port_tx_limit() transmits characters from the xmit buffer to the > > > + * hardware using @uart_port::ops::put_char(). It does so until @count > > > + * characters are sent and while @uart_port::ops::tx_ready() still returns > > > + * non-zero (if non-NULL). > > > + * > > > + * Return: number of characters in the xmit buffer when done. > > > + */ > > > +unsigned int uart_port_tx_limit(struct uart_port *port, unsigned int count) > > > +{ > > > + struct circ_buf *xmit = &port->state->xmit; > > > + bool (*tx_ready)(struct uart_port *) = port->ops->tx_ready ? : > > > + uart_port_tx_always_ready; > > > + unsigned int pending; > > > + > > > + for (; count && tx_ready(port); count--, port->icount.tx++) { > > > + if (port->x_char) { > > > + port->ops->put_char(port, port->x_char); > > > + port->x_char = 0; > > > + continue; > > > + } > > > + > > > + if (uart_circ_empty(xmit) || uart_tx_stopped(port)) > > > + break; > > > + > > > + port->ops->put_char(port, xmit->buf[xmit->tail]); > > > > That's a lot of redirection and function pointer mess per each character > > sent now. With the spectre overhead here (and only getting worse), this > > feels like a step backwards. > > > > I doubt throughput matters here given cpu speeds now, _but_ the cpu load > > should go up. > > > > Although on smaller cpus with slower Mhz and faster line rates, this > > feels like a lot of extra work happening for no real good reason. > > I know… Did you miss NOTE2 in the commit log? Any idea on that? I did see it, but I LTO can not handle function pointer redirection. I was wondering if you ran any benchmarks to see if this is noticeable. I am all for making the drivers smaller, but not at the increased overhead of every character being sent :( thanks, greg k-h