Received: by 2002:a05:6a10:5bc5:0:0:0:0 with SMTP id os5csp1428670pxb; Tue, 26 Oct 2021 08:51:28 -0700 (PDT) X-Google-Smtp-Source: ABdhPJz/zMJ9TR90bGSo1Bo0M9IAtF1SKmhHopZfUZowQkTCb94IrBxTFn50Og8AnUB++XUYT+6Y X-Received: by 2002:a05:6a00:984:b0:47b:f5b0:7226 with SMTP id u4-20020a056a00098400b0047bf5b07226mr12131974pfg.62.1635263371619; Tue, 26 Oct 2021 08:49:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1635263371; cv=none; d=google.com; s=arc-20160816; b=rDdPAmE6t/w5QUIbm2Df6kKPVn6r/L4GHx/MD6a03I6QRIof4rRlqTnzTU1hNB8zaS J0bGZkrXpwFhDCfimPCdBc1/jYRDWd1KsIdjDlNfQnDoDNA/4thKXidomlHYAM43NK9C MBW2sk3949WNya41beBSyZUUFIxE64rDYkGW/GglM4rriy5dwh7H+k6K1VG9aZ4nHNH+ ZMtpD1jtO7zMQJs3UWJvDzcFSPM2d1gHiQPF6qrE/uGRLhp76nYtqU08w1B8Ddtjy5DF EgSHTeQyO4d/pGHAUF9PELuqUXtRcYiB4oCF2HIF4Vdj0G/hzBqLbiM9Gx+EmtHgyb3n ymQg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:date:cc:to:from:subject :message-id:dkim-signature; bh=SMCiF30faeAukouRfxZ/PlOO14x1f0ahVwRXDVzq6Fw=; b=JxofK+Gb/9d2zLPiufjnel4Se5VyHrkJgy4vpI3lTRbw+8VYHH9+FWoYsFlSNQiMho zdlUnqZ9/01Ms8KePmo3ZF9Ib8tm+qSTQJzA2g8rfwDqt8jGkdGFGSeJIPT0AZszXgoO 9gSljenPZufkTLGRXtOUp9UsN5A4dyk4mA2vItWcANRMHZub88PXOBM8ZiJtpIIjcmxG 9keoMgt20EpXJbLpFWIhvjNpe3D5WOuF4LwwewvW1cuONLteh+ro/DqwT005w3DuMkjm djm8htBgr3uehql4QDa/sJ5MvoNVopVsfm8DASngq3bm11IqVOxL0ZCybB9fuXUVrsMo AE4A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=aQjzhwyK; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id p12si20490104plf.55.2021.10.26.08.49.01; Tue, 26 Oct 2021 08:49:31 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=aQjzhwyK; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235524AbhJZLnS (ORCPT + 99 others); Tue, 26 Oct 2021 07:43:18 -0400 Received: from mail.kernel.org ([198.145.29.99]:53166 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234803AbhJZLnR (ORCPT ); Tue, 26 Oct 2021 07:43:17 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 0DB4360C4B; Tue, 26 Oct 2021 11:40:52 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1635248453; bh=jsir7YjseS8J2Bzgug1/HO3u7BEFwv5cWCMG1pexsXo=; h=Subject:From:To:Cc:Date:In-Reply-To:References:From; b=aQjzhwyK9XAKMRtANSpUEYxz/KCWd7UDFKbnRh+auMdF9/ZX8Gikp/p7F07saulsa /Yc8JmBaAWmMpHBz59+2xq8sR4bmUTA68/+ggtBqfti1Ekfoa++MI2NS8e2jQCR1gg IhTFuoZv+/4AoH8eEkz/6Nv34MJfCrpCX/GHP0ImE4iGy2++DhJAcynIbFot6PmYsm Tt4VoS+m+iIMfU3YsbD5KBQ52qjhs9znvkABWy9aCD9WyyipCA+dkQffxTTn8nUMzj K+aYybAsH2AC4rrkEciTZ/2Q+vupOBz/iTNS06X2M3XQtPa2G1vpCbygsxrkf9lNXq jGexLjJa3Eeww== Message-ID: <604199ed389d9286e3fdab6b5acdf65c421df45d.camel@kernel.org> Subject: Re: [RFC PATCH] ceph: add remote object copy counter to fs client From: Jeff Layton To: Xiubo Li , Patrick Donnelly Cc: =?ISO-8859-1?Q?Lu=EDs?= Henriques , Ilya Dryomov , Ceph Development , linux-kernel@vger.kernel.org Date: Tue, 26 Oct 2021 07:40:51 -0400 In-Reply-To: <785d1435-4a2c-95aa-0573-2de54b4e7b6b@redhat.com> References: <20211020143708.14728-1-lhenriques@suse.de> <34e379f9dec1cbdf09fffd8207f6ef7f4e1a6841.camel@kernel.org> <99209198dd9d8634245f153a90e4091851635a16.camel@kernel.org> <785d1435-4a2c-95aa-0573-2de54b4e7b6b@redhat.com> Content-Type: text/plain; charset="ISO-8859-15" User-Agent: Evolution 3.40.4 (3.40.4-2.fc34) MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 2021-10-26 at 11:05 +0800, Xiubo Li wrote: > On 10/22/21 1:30 AM, Patrick Donnelly wrote: > > On Thu, Oct 21, 2021 at 12:35 PM Jeff Layton wrote: > > > On Thu, 2021-10-21 at 12:18 -0400, Patrick Donnelly wrote: > > > > On Thu, Oct 21, 2021 at 11:44 AM Jeff Layton wrote: > > > > > On Thu, 2021-10-21 at 09:52 -0400, Patrick Donnelly wrote: > > > > > > On Wed, Oct 20, 2021 at 12:27 PM Jeff Layton wrote: > > > > > > > On Wed, 2021-10-20 at 15:37 +0100, Lu?s Henriques wrote: > > > > > > > > This counter will keep track of the number of remote object copies done on > > > > > > > > copy_file_range syscalls. This counter will be filesystem per-client, and > > > > > > > > can be accessed from the client debugfs directory. > > > > > > > > > > > > > > > > Cc: Patrick Donnelly > > > > > > > > Signed-off-by: Lu?s Henriques > > > > > > > > --- > > > > > > > > This is an RFC to reply to Patrick's request in [0]. Note that I'm not > > > > > > > > 100% sure about the usefulness of this patch, or if this is the best way > > > > > > > > to provide the functionality Patrick requested. Anyway, this is just to > > > > > > > > get some feedback, hence the RFC. > > > > > > > > > > > > > > > > Cheers, > > > > > > > > -- > > > > > > > > Lu?s > > > > > > > > > > > > > > > > [0] https://github.com/ceph/ceph/pull/42720 > > > > > > > > > > > > > > > I think this would be better integrated into the stats infrastructure. > > > > > > > > > > > > > > Maybe you could add a new set of "copy" stats to struct > > > > > > > ceph_client_metric that tracks the total copy operations done, their > > > > > > > size and latency (similar to read and write ops)? > > > > > > I think it's a good idea to integrate this into "stats" but I think a > > > > > > local debugfs file for some counters is still useful. The "stats" > > > > > > module is immature at this time and I'd rather not build any qa tests > > > > > > (yet) that rely on it. > > > > > > > > > > > > Can we generalize this patch-set to a file named "op_counters" or > > > > > > similar and additionally add other OSD ops performed by the kclient? > > > > > > > > > > > > > > > > Tracking this sort of thing is the main purpose of the stats code. I'm > > > > > really not keen on adding a whole separate set of files for reporting > > > > > this. > > > > Maybe I'm confused. Is there some "file" which is already used for > > > > this type of debugging information? Or do you mean the code for > > > > sending stats to the MDS to support cephfs-top? > > > > > > > > > What's the specific problem with relying on the data in debugfs > > > > > "metrics" file? > > > > Maybe no problem? I wasn't aware of a "metrics" file. > > > > > > > Yes. For instance: > > > > > > # cat /sys/kernel/debug/ceph/*/metrics > > > item total > > > ------------------------------------------ > > > opened files / total inodes 0 / 4 > > > pinned i_caps / total inodes 5 / 4 > > > opened inodes / total inodes 0 / 4 > > > > > > item total avg_lat(us) min_lat(us) max_lat(us) stdev(us) > > > ----------------------------------------------------------------------------------- > > > read 0 0 0 0 0 > > > write 5 914013 824797 1092343 103476 > > > metadata 79 12856 1572 114572 13262 > > > > > > item total avg_sz(bytes) min_sz(bytes) max_sz(bytes) total_sz(bytes) > > > ---------------------------------------------------------------------------------------- > > > read 0 0 0 0 0 > > > write 5 4194304 4194304 4194304 20971520 > > > > > > item total miss hit > > > ------------------------------------------------- > > > d_lease 11 0 29 > > > caps 5 68 10702 > > > > > > > > > I'm proposing that Luis add new lines for "copy" to go along with the > > > "read" and "write" ones. The "total" counter should give you a count of > > > the number of operations. > > Okay that makes more sense! > > > > Side note: I am a bit horrified by how computer-unfriendly that > > table-formatted data is. > > Any suggestion to improve this ? > > How about just make the "metric" file writable like a switch ? And as > default it will show the data as above and if tools want the > computer-friendly format, just write none-zero to it, then show raw data > just like: > > # cat /sys/kernel/debug/ceph/*/metrics > opened_files:0 > pinned_i_caps:5 > opened_inodes:0 > total_inodes:4 > > read_latency:0,0,0,0,0 > write_latency:5,914013,824797,1092343,103476 > metadata_latency:79,12856,1572,114572,13262 > > read_size:0,0,0,0,0 > write_size:5,4194304,4194304,4194304,20971520 > > d_lease:11,0,29 > caps:5,68,10702 > > I'd rather not multiplex the output of this file based on some input. That would also be rather hard to do -- write() and read() are two different syscalls, so you'd need to track a bool (or something) across them somehow. Currently, I doubt there are many scripts in the field that scrape this info and debugfs is specifically excluded from ABI concerns. If we want to make it more machine-readable (which sounds like a good thing), then I suggest we just change the output to something like what you have above and not worry about preserving the "legacy" output. -- Jeff Layton