Received: by 2002:ab2:7903:0:b0:1fb:b500:807b with SMTP id a3csp1138623lqj; Mon, 3 Jun 2024 11:13:20 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCVoN4y68TU5H1VI1QuZ5AJInPhLqV7MA9CCSOK/LoxLQZtN8eXrKZXP3KBK/wEONuIXKO8cso/rSYi4z2lsxNUyoMVYyvhd8XJCAdlxSA== X-Google-Smtp-Source: AGHT+IEZG83dw3b0ZHvtxRl7josVGrbCQzB5hm9qwWhWG6CYz9ykL4aItCieIxGAzwaxukYHQQhj X-Received: by 2002:a17:902:ce85:b0:1f6:70d6:d49a with SMTP id d9443c01a7336-1f670d6decdmr56462545ad.11.1717438399968; Mon, 03 Jun 2024 11:13:19 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1717438399; cv=pass; d=google.com; s=arc-20160816; b=fleKmjsGdoLmU2CtP4uHvhfY4TuI6jvrNHxNOwMH0MeaCGUOd8J9xNYTNmf+Ly0c4j yG8QX+CHAQsVUsU51KSVZ0j/W3E9kwjyrBOBbPZZY/oosv+JVwViot/uEuY+jzi/yYgU /nQz/9DkunB8aSHaf2KEw9HpRmFOqXjlVjTBTSmkFaUOTjl6ak7G67N2H9UR1vDpNHPi 3ktl4UlPMQ0IJeUoHS+poX/9KhZ9hkULuljTTLFWvl1VrMQsuQfGjQF1a7O7tCRT8+Pt z1RsByrBKxJnpydmKVd3V9EDtFtwA8rerwUcL+xonk7ZL6l3Cp2+ImyOXI68PH+ixDn0 Nyjg== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=in-reply-to:content-disposition:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:references:mail-followup-to :message-id:subject:cc:to:from:date:dkim-signature; bh=pBBXkcJxxPJLd6H7n8JPjoD/597hQTKlLQ3epFR5740=; fh=/OQ4phdel1oSqvc4BGVNZcquq7ZtnNtyaqQV1F9srfs=; b=A1u+NaICvXAUT/E3plIptAx758xJmBRO4DGoutW31mAF3wFpp2C7STzhxZrbZA59Kd HRin4Ch53wmcExbHfXbpydN+xkKo4aXdzlR6xRsY+2GXHmOfWgYLZSvNWPfXVSnjNqLG oYEaLBueBmwYwdHRQclaDGvRZoYRSiq+FsfZ/hsWKfPwMO/r7R96nQB1rxd6eFy1hLHK MrGBV6rkY3Pznp/ykuwVdsTd9Rqqsan2qUZqeLedfrQ9t4Pm9n2pzjRSlw8kXJipavbl cisY3tjl+zVqhNxLFA88GG54hVAt+4qteAlYzBue3K8jP2Kwd74XGUrB9J1pxODUJRo9 7jHw==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@fastly.com header.s=google header.b="nt/Iv8ab"; arc=pass (i=1 spf=pass spfdomain=fastly.com dkim=pass dkdomain=fastly.com dmarc=pass fromdomain=fastly.com); spf=pass (google.com: domain of linux-kernel+bounces-199553-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) smtp.mailfrom="linux-kernel+bounces-199553-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=fastly.com Return-Path: Received: from sv.mirrors.kernel.org (sv.mirrors.kernel.org. [139.178.88.99]) by mx.google.com with ESMTPS id d9443c01a7336-1f632425496si66755525ad.640.2024.06.03.11.13.19 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 03 Jun 2024 11:13:19 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-199553-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) client-ip=139.178.88.99; Authentication-Results: mx.google.com; dkim=pass header.i=@fastly.com header.s=google header.b="nt/Iv8ab"; arc=pass (i=1 spf=pass spfdomain=fastly.com dkim=pass dkdomain=fastly.com dmarc=pass fromdomain=fastly.com); spf=pass (google.com: domain of linux-kernel+bounces-199553-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) smtp.mailfrom="linux-kernel+bounces-199553-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=fastly.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sv.mirrors.kernel.org (Postfix) with ESMTPS id BEB57285F5B for ; Mon, 3 Jun 2024 18:11:40 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 17C5813848A; Mon, 3 Jun 2024 18:11:30 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=fastly.com header.i=@fastly.com header.b="nt/Iv8ab" Received: from mail-pf1-f172.google.com (mail-pf1-f172.google.com [209.85.210.172]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 2AF1B137C2D for ; Mon, 3 Jun 2024 18:11:25 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.210.172 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1717438288; cv=none; b=VhExsQobsofPyM0P2kBglfemsID7SlaxOQfrBBrJT2CLJD5t2C5S7fLR8uSeynZIUeBpJkkWhZhntghM2kt1ZliBDHtlxULIP3F89F8BA5RhE2ySKJmidk9ff7WG+RRmm3CACTsv0RNZie0hxh37VMPQgu7ovJVxwaAaIKLb1Zg= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1717438288; c=relaxed/simple; bh=jY7dhI3/9yxnM9yEiaYkEdE5FHTB+pX1BMMxN2viUCU=; h=Date:From:To:Cc:Subject:Message-ID:References:MIME-Version: Content-Type:Content-Disposition:In-Reply-To; b=MUaZvhIuQrVi34R3tfOzac0B0xJixJgwMoMhH2Wwd9y+ZLTrn/aLnBytLA2sllkdTv7lITjwDLwjfc//6DPlx/MyiWk58BiHMZLiG9bUF/XsTahDfGdo+TBDljwDPo9JbZ3PQP9i5w0q0k4/2IwEagh30Yb8qfx9R14KK77ZdvM= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=reject dis=none) header.from=fastly.com; spf=pass smtp.mailfrom=fastly.com; dkim=pass (1024-bit key) header.d=fastly.com header.i=@fastly.com header.b=nt/Iv8ab; arc=none smtp.client-ip=209.85.210.172 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=reject dis=none) header.from=fastly.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=fastly.com Received: by mail-pf1-f172.google.com with SMTP id d2e1a72fcca58-702447766fdso3657294b3a.1 for ; Mon, 03 Jun 2024 11:11:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fastly.com; s=google; t=1717438285; x=1718043085; darn=vger.kernel.org; h=in-reply-to:content-disposition:mime-version:references :mail-followup-to:message-id:subject:cc:to:from:date:from:to:cc :subject:date:message-id:reply-to; bh=pBBXkcJxxPJLd6H7n8JPjoD/597hQTKlLQ3epFR5740=; b=nt/Iv8abmFF4tkxO7HvqzGsoLCwWze5XbuOFxwuVR3vkgs/okG7zYP5xKSYQfcFZtF idKTyqZS34GI48mRzWWD5tkDzUOQ0M6klS+j6gKpoAffs4JkI7+KoeYZ+Ppx3Alzb1S5 QJBWfy/FKt3fips3q1ezi2bR+j9dmNHCtc0OY= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1717438285; x=1718043085; h=in-reply-to:content-disposition:mime-version:references :mail-followup-to:message-id:subject:cc:to:from:date :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=pBBXkcJxxPJLd6H7n8JPjoD/597hQTKlLQ3epFR5740=; b=VeyxNeUJz01z68y32STvD1uHAWMbHI9N42EksiC6QPeBZ0HcwIA2nBudDW03EcfQuh 7BTe81n8PYtvrDKgXzxhDXLTKxglNjkkiFEJjPbhzrVBUj5Qh9YwwYPE5ZqIc6rPBM79 14HxhYRctw+bDjFmrcwnzFFlS/wrkAX+Fj3dyk+kgCgTQdgtX5+VUKstchIdUOdJNrpJ HaTq210Wq7t//GeuwkPELUaOKm0xOBzdUxWFtQ63unYpG/Evzyx219J0X95TuKF2FDTT 2QfecRaz74zWCQlOkCMlkkg16vklPJ3AlP28B7Q7wqSup991b0fMSYlCSEOJro9pKXy1 D5MA== X-Gm-Message-State: AOJu0YxQR+AuzsOhgJPmqitbz9XytSknsYwA0xG9lBvXQFz+0hFxSX9+ LEtGo8SNUyH/pWgQjUrTP5jCFCHuViJjDqGArKr2MtlDR//QEW8Cv+L52OlVRBA= X-Received: by 2002:a05:6a21:2786:b0:1af:d4b1:889e with SMTP id adf61e73a8af0-1b26f30d7d0mr10331630637.53.1717438285299; Mon, 03 Jun 2024 11:11:25 -0700 (PDT) Received: from LQ3V64L9R2 (c-24-6-151-244.hsd1.ca.comcast.net. [24.6.151.244]) by smtp.gmail.com with ESMTPSA id d2e1a72fcca58-70242b03ffasm5976779b3a.141.2024.06.03.11.11.23 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 03 Jun 2024 11:11:24 -0700 (PDT) Date: Mon, 3 Jun 2024 11:11:21 -0700 From: Joe Damato To: Tariq Toukan Cc: linux-kernel@vger.kernel.org, netdev@vger.kernel.org, nalramli@fastly.com, Saeed Mahameed , Leon Romanovsky , "David S. Miller" , Eric Dumazet , Jakub Kicinski , Paolo Abeni , Richard Cochran , "open list:MELLANOX MLX5 core VPI driver" , Tariq Toukan Subject: Re: [RFC net-next v3 2/2] net/mlx5e: Add per queue netdev-genl stats Message-ID: Mail-Followup-To: Joe Damato , Tariq Toukan , linux-kernel@vger.kernel.org, netdev@vger.kernel.org, nalramli@fastly.com, Saeed Mahameed , Leon Romanovsky , "David S. Miller" , Eric Dumazet , Jakub Kicinski , Paolo Abeni , Richard Cochran , "open list:MELLANOX MLX5 core VPI driver" , Tariq Toukan References: <20240529031628.324117-1-jdamato@fastly.com> <20240529031628.324117-3-jdamato@fastly.com> <5b3a0f6a-5a03-45d7-ab10-1f1ba25504d3@gmail.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: On Mon, Jun 03, 2024 at 02:11:14PM +0300, Tariq Toukan wrote: > > > On 02/06/2024 22:22, Joe Damato wrote: > > On Sun, Jun 02, 2024 at 12:14:21PM +0300, Tariq Toukan wrote: > > > > > > > > > On 29/05/2024 6:16, Joe Damato wrote: > > > > Add functions to support the netdev-genl per queue stats API. > > > > > > > > ./cli.py --spec netlink/specs/netdev.yaml \ > > > > --dump qstats-get --json '{"scope": "queue"}' > > > > > > > > ...snip > > > > > > > > {'ifindex': 7, > > > > 'queue-id': 62, > > > > 'queue-type': 'rx', > > > > 'rx-alloc-fail': 0, > > > > 'rx-bytes': 105965251, > > > > 'rx-packets': 179790}, > > > > {'ifindex': 7, > > > > 'queue-id': 0, > > > > 'queue-type': 'tx', > > > > 'tx-bytes': 9402665, > > > > 'tx-packets': 17551}, > > > > > > > > ...snip > > > > > > > > Also tested with the script tools/testing/selftests/drivers/net/stats.py > > > > in several scenarios to ensure stats tallying was correct: > > > > > > > > - on boot (default queue counts) > > > > - adjusting queue count up or down (ethtool -L eth0 combined ...) > > > > - adding mqprio TCs > > > > > > Please test also with interface down. > > > > OK. I'll test with the interface down. > > > > Is there some publicly available Mellanox script I can run to test > > all the different cases? That would make this much easier. Maybe > > this is something to include in mlnx-tools on github? > > > > You're testing some new functionality. We don't have something for it. > > > > The mlnx-tools scripts that includes some python scripts for setting > > up QoS doesn't seem to work on my system, and outputs vague error > > messages. I have no idea if I'm missing some kernel option, if the > > device doesn't support it, or if I need some other dependency > > installed. > > > > Can you share the command you use, and the output? Sure: jdamato@test:~/mlnx-tools/python$ python --version Python 3.12.3 jdamato@test:~/mlnx-tools/python$ ./mlnx_qos -i vlan401 Priority trust state is not supported on your system Buffers commands are not supported on your system Rate limit is not supported on your system! ETS features are not supported on your system jdamato@test:~/mlnx-tools/python$ echo $? 1 This is mlnx-tools at SHA 641718b13f71 ("Merge pull request #87 from ahlabenadam/no_remove_folder") You can feel free to follow up with me about this off-list if you like. > > I have been testing these patches on a: > > > > Mellanox Technologies MT28800 Family [ConnectX-5 Ex] > > firmware-version: 16.29.2002 (MT_0000000013) > > > > > > > > > > Signed-off-by: Joe Damato > > > > --- > > > > .../net/ethernet/mellanox/mlx5/core/en_main.c | 132 ++++++++++++++++++ > > > > 1 file changed, 132 insertions(+) > > > > > > > > diff --git a/drivers/net/ethernet/mellanox/mlx5/core/en_main.c b/drivers/net/ethernet/mellanox/mlx5/core/en_main.c > > > > index ce15805ad55a..515c16a88a6c 100644 > > > > --- a/drivers/net/ethernet/mellanox/mlx5/core/en_main.c > > > > +++ b/drivers/net/ethernet/mellanox/mlx5/core/en_main.c > > > > @@ -39,6 +39,7 @@ > > > > #include > > > > #include > > > > #include > > > > +#include > > > > #include > > > > #include > > > > #include > > > > @@ -5293,6 +5294,136 @@ static bool mlx5e_tunnel_any_tx_proto_supported(struct mlx5_core_dev *mdev) > > > > return (mlx5_vxlan_allowed(mdev->vxlan) || mlx5_geneve_tx_allowed(mdev)); > > > > } > > > > +static void mlx5e_get_queue_stats_rx(struct net_device *dev, int i, > > > > + struct netdev_queue_stats_rx *stats) > > > > +{ > > > > + struct mlx5e_priv *priv = netdev_priv(dev); > > > > + struct mlx5e_channel_stats *channel_stats; > > > > + struct mlx5e_rq_stats *xskrq_stats; > > > > + struct mlx5e_rq_stats *rq_stats; > > > > + > > > > + if (mlx5e_is_uplink_rep(priv)) > > > > + return; > > > > + > > > > + channel_stats = priv->channel_stats[i]; > > > > + xskrq_stats = &channel_stats->xskrq; > > > > + rq_stats = &channel_stats->rq; > > > > + > > > > + stats->packets = rq_stats->packets + xskrq_stats->packets; > > > > + stats->bytes = rq_stats->bytes + xskrq_stats->bytes; > > > > + stats->alloc_fail = rq_stats->buff_alloc_err + > > > > + xskrq_stats->buff_alloc_err; > > > > +} > > > > + > > > > +static void mlx5e_get_queue_stats_tx(struct net_device *dev, int i, > > > > + struct netdev_queue_stats_tx *stats) > > > > +{ > > > > + struct mlx5e_priv *priv = netdev_priv(dev); > > > > + struct mlx5e_channel_stats *channel_stats; > > > > + struct mlx5e_sq_stats *sq_stats; > > > > + int ch_ix, tc_ix; > > > > + > > > > + mutex_lock(&priv->state_lock); > > > > + txq_ix_to_chtc_ix(&priv->channels.params, i, &ch_ix, &tc_ix); > > > > + mutex_unlock(&priv->state_lock); > > > > + > > > > + channel_stats = priv->channel_stats[ch_ix]; > > > > + sq_stats = &channel_stats->sq[tc_ix]; > > > > + > > > > + stats->packets = sq_stats->packets; > > > > + stats->bytes = sq_stats->bytes; > > > > +} > > > > + > > > > +static void mlx5e_get_base_stats(struct net_device *dev, > > > > + struct netdev_queue_stats_rx *rx, > > > > + struct netdev_queue_stats_tx *tx) > > > > +{ > > > > + struct mlx5e_priv *priv = netdev_priv(dev); > > > > + int i, j; > > > > + > > > > + if (!mlx5e_is_uplink_rep(priv)) { > > > > + rx->packets = 0; > > > > + rx->bytes = 0; > > > > + rx->alloc_fail = 0; > > > > + > > > > + /* compute stats for deactivated RX queues > > > > + * > > > > + * if priv->channels.num == 0 the device is down, so compute > > > > + * stats for every queue. > > > > + * > > > > + * otherwise, compute only the queues which have been deactivated. > > > > + */ > > > > + mutex_lock(&priv->state_lock); > > > > + if (priv->channels.num == 0) > > > > + i = 0; > > > > > > This is not consistent with the above implementation of > > > mlx5e_get_queue_stats_rx(), which always returns the stats even if the > > > channel is down. > > > This way, you'll double count the down channels. > > > > > > I think you should always start from priv->channels.params.num_channels. > > > > OK, I'll do that. > > > > > > + else > > > > + i = priv->channels.params.num_channels; > > > > + mutex_unlock(&priv->state_lock); > > > > > > I understand that you're following the guidelines by taking the lock here, I > > > just don't think this improves anything... If channels can be modified in > > > between calls to mlx5e_get_base_stats / mlx5e_get_queue_stats_rx, then > > > wrapping the priv->channels access with a lock can help protect each single > > > deref, but not necessarily in giving a consistent "screenshot" of the stats. > > > > > > The rtnl_lock should take care of that, as the driver holds it when changing > > > the number of channels and updating the real_numrx/tx_queues. > > > > > > This said, I would carefully say you can drop the mutex once following the > > > requested changes above. > > I still don't really like this design, so I gave some more thought on > this... Thanks, again, for your careful thoughts and review on this. I do appreciate it and this functionality will be extremely useful for me (and I suspect many others). > I think we should come up with a new mapping array under priv, that maps i > (from real_num_tx_queues) to the matching sq_stats struct. > This array would be maintained in the channels open/close functions, > similarly to priv->txq2sq. > > Then, we would not calculate the mapping per call, but just get the proper > pointer from the array. This eases the handling of htb and ptp queues, which > were missed in your txq_ix_to_chtc_ix(). > > This handles mapped SQs. OK, the above makes sense. I'll give that a try. > Now, regarding unmapped ones, they must be handled in the "base" function > call. > We'd still need to access channels->params, to: > 1. read params.num_channels to iterate until priv->stats_nch, and > 2. read mlx5e_get_dcb_num_tc(params) to iterate until priv->max_opened_tc. > > I think we can live with this without holding the mutex, given that this > runs under the rtnl lock. > We can add ASSERT_RTNL() to verify the assumption. OK. I'll try the above and propose an rfc v4. > > > > > OK, that makes sense to me. > > > > So then I assume I can drop the mutex in mlx5e_get_queue_stats_tx > > above, as well, for the same reasons? > > > > Does this mean then that you are in favor of the implementation for > > tx stats provided in this RFC and that I've implemented option 1 as > > you described in the previous thread correctly? > > > > Yes, but I wasn't happy enough with the design. > Thanks for your contribution. Thanks for your review and patience.