Received: by 2002:a05:6a10:5bc5:0:0:0:0 with SMTP id os5csp187647pxb; Mon, 25 Oct 2021 06:25:20 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyV6JR5ef83CFZSBDuaeWvDk1SX9QeMM8fRh43j86VU0B4hKfuw1hRvkknq0NuEbMa50YBf X-Received: by 2002:a17:907:1dcf:: with SMTP id og15mr21897789ejc.553.1635168318571; Mon, 25 Oct 2021 06:25:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1635168318; cv=none; d=google.com; s=arc-20160816; b=BMqHkLzJ4CZSdRzY5nzalnmzn1Vewc/naWNPKa+y9ZAjf/WdyR5q1Fh2z86mxu6W01 7SW1T95LRKbVrjv3Kc6hjCiyh7Q1otw8GpHidrxuar1HMg/u/U+lJHGAzX5HDgOYuyp4 Sprss3am/srTHHF3uQ4PBQ2eg9+yLhYMVN5vBwDBpd7SVgUAgxVPEfEdkGPXMPCEZ03j fauS3oYi+tH3op4s8nhIwM1fQnT91NMXpBKjI5Z587WNMWKHlHvQkmKg3rJDjdSo1kda mX9IeZbzsDgJxnkeFiI0yHeHD5KcZbTmuhizgBL34HD7tDTXGSi/GjAqwJdRxtuIPd0I CfJg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature:dkim-signature; bh=Mj3yO0Hrr2SEuiBX5CH+xLN/Uvz4h9rWJ4Oor/vFZPs=; b=LFgroayj7fLvE9jA3/RLMqThBp7LKewBAwNOLizoxvHosu1Oxb7t8H+NoVvTk3NwIf 3hwq0bwBHYy+yxpWRePVKM8Z/TWlNIvI+4eWgGEDNKac6u8AodjrVdmRkbzL39LYApDD f9EDrNCwHoKBE6TfLziaXivNPGdzOhlMv1H6kd1V22yBjUYiA2Hg9SfTxbeONcP+e3jW TB7gE7+oA8nPocrJCTGmPB+CuZ96S7xvkESwjBP6vPbAl7EwOZsiSux1T7hzrqxfkxRX FVaMHRW3Efz0S9RpXSoYex/M1bhbcxUQub7IKMRQQgYZHHBTZXWhch6SObfL46K1ZFTv Disw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.de header.s=susede2_rsa header.b=N8GXnGzY; dkim=neutral (no key) header.i=@suse.de; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=suse.de Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id x2si13437343ejy.385.2021.10.25.06.24.53; Mon, 25 Oct 2021 06:25:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.de header.s=susede2_rsa header.b=N8GXnGzY; dkim=neutral (no key) header.i=@suse.de; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=suse.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231448AbhJYKy1 (ORCPT + 99 others); Mon, 25 Oct 2021 06:54:27 -0400 Received: from smtp-out1.suse.de ([195.135.220.28]:38652 "EHLO smtp-out1.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229890AbhJYKy0 (ORCPT ); Mon, 25 Oct 2021 06:54:26 -0400 Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by smtp-out1.suse.de (Postfix) with ESMTPS id A256B218ED; Mon, 25 Oct 2021 10:52:03 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_rsa; t=1635159123; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=Mj3yO0Hrr2SEuiBX5CH+xLN/Uvz4h9rWJ4Oor/vFZPs=; b=N8GXnGzYV/KEyF2/sq39dPRhYVsMaIzFNa710ueUstVvm3s8qDKXfUE+9mXWj+qRRJkFFz 0yxshksi79VZ7RzvB2WFlDokHuMNBICqshtlyzEpIkJqmfkVCD3qUAP06bfD35WIyHCnoU 557RHKZuV72BqECbEtQq86Flc/wZOk4= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_ed25519; t=1635159123; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=Mj3yO0Hrr2SEuiBX5CH+xLN/Uvz4h9rWJ4Oor/vFZPs=; b=oUtxpZT4fxPMDQQpG/pfohUbFVQiuUbOKWx3+1wW2uxRDCLLYuklefN/oXVbozVYqHCpMi 29PwsVeK8o5ja7Bw== Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by imap2.suse-dmz.suse.de (Postfix) with ESMTPS id 35B2713BA4; Mon, 25 Oct 2021 10:52:03 +0000 (UTC) Received: from dovecot-director2.suse.de ([192.168.254.65]) by imap2.suse-dmz.suse.de with ESMTPSA id EWWOB1OMdmEMBwAAMHmgww (envelope-from ); Mon, 25 Oct 2021 10:52:03 +0000 Received: from localhost (brahms [local]) by brahms (OpenSMTPD) with ESMTPA id 8dcf2670; Mon, 25 Oct 2021 10:52:02 +0000 (UTC) Date: Mon, 25 Oct 2021 11:52:02 +0100 From: =?iso-8859-1?Q?Lu=EDs?= Henriques To: Jeff Layton Cc: Patrick Donnelly , Ilya Dryomov , Ceph Development , linux-kernel@vger.kernel.org Subject: Re: [RFC PATCH] ceph: add remote object copy counter to fs client Message-ID: References: <20211020143708.14728-1-lhenriques@suse.de> <34e379f9dec1cbdf09fffd8207f6ef7f4e1a6841.camel@kernel.org> <99209198dd9d8634245f153a90e4091851635a16.camel@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Oct 25, 2021 at 06:20:40AM -0400, Jeff Layton wrote: > On Mon, 2021-10-25 at 11:12 +0100, Lu?s Henriques wrote: > > On Thu, Oct 21, 2021 at 12:35:18PM -0400, Jeff Layton wrote: > > > On Thu, 2021-10-21 at 12:18 -0400, Patrick Donnelly wrote: > > > > On Thu, Oct 21, 2021 at 11:44 AM Jeff Layton wrote: > > > > > > > > > > On Thu, 2021-10-21 at 09:52 -0400, Patrick Donnelly wrote: > > > > > > On Wed, Oct 20, 2021 at 12:27 PM Jeff Layton wrote: > > > > > > > > > > > > > > On Wed, 2021-10-20 at 15:37 +0100, Lu?s Henriques wrote: > > > > > > > > This counter will keep track of the number of remote object copies done on > > > > > > > > copy_file_range syscalls. This counter will be filesystem per-client, and > > > > > > > > can be accessed from the client debugfs directory. > > > > > > > > > > > > > > > > Cc: Patrick Donnelly > > > > > > > > Signed-off-by: Lu?s Henriques > > > > > > > > --- > > > > > > > > This is an RFC to reply to Patrick's request in [0]. Note that I'm not > > > > > > > > 100% sure about the usefulness of this patch, or if this is the best way > > > > > > > > to provide the functionality Patrick requested. Anyway, this is just to > > > > > > > > get some feedback, hence the RFC. > > > > > > > > > > > > > > > > Cheers, > > > > > > > > -- > > > > > > > > Lu?s > > > > > > > > > > > > > > > > [0] https://github.com/ceph/ceph/pull/42720 > > > > > > > > > > > > > > > > > > > > > > I think this would be better integrated into the stats infrastructure. > > > > > > > > > > > > > > Maybe you could add a new set of "copy" stats to struct > > > > > > > ceph_client_metric that tracks the total copy operations done, their > > > > > > > size and latency (similar to read and write ops)? > > > > > > > > > > > > I think it's a good idea to integrate this into "stats" but I think a > > > > > > local debugfs file for some counters is still useful. The "stats" > > > > > > module is immature at this time and I'd rather not build any qa tests > > > > > > (yet) that rely on it. > > > > > > > > > > > > Can we generalize this patch-set to a file named "op_counters" or > > > > > > similar and additionally add other OSD ops performed by the kclient? > > > > > > > > > > > > > > > > > > > > > Tracking this sort of thing is the main purpose of the stats code. I'm > > > > > really not keen on adding a whole separate set of files for reporting > > > > > this. > > > > > > > > Maybe I'm confused. Is there some "file" which is already used for > > > > this type of debugging information? Or do you mean the code for > > > > sending stats to the MDS to support cephfs-top? > > > > > > > > > What's the specific problem with relying on the data in debugfs > > > > > "metrics" file? > > > > > > > > Maybe no problem? I wasn't aware of a "metrics" file. > > > > > > > > > > Yes. For instance: > > > > > > # cat /sys/kernel/debug/ceph/*/metrics > > > item total > > > ------------------------------------------ > > > opened files / total inodes 0 / 4 > > > pinned i_caps / total inodes 5 / 4 > > > opened inodes / total inodes 0 / 4 > > > > > > item total avg_lat(us) min_lat(us) max_lat(us) stdev(us) > > > ----------------------------------------------------------------------------------- > > > read 0 0 0 0 0 > > > write 5 914013 824797 1092343 103476 > > > metadata 79 12856 1572 114572 13262 > > > > > > item total avg_sz(bytes) min_sz(bytes) max_sz(bytes) total_sz(bytes) > > > ---------------------------------------------------------------------------------------- > > > read 0 0 0 0 0 > > > write 5 4194304 4194304 4194304 20971520 > > > > > > item total miss hit > > > ------------------------------------------------- > > > d_lease 11 0 29 > > > caps 5 68 10702 > > > > > > > > > I'm proposing that Luis add new lines for "copy" to go along with the > > > "read" and "write" ones. The "total" counter should give you a count of > > > the number of operations. > > > > The problem with this is that it will require quite some work on the > > MDS-side because, AFAIU, the MDS will need to handle different versions of > > the CEPH_MSG_CLIENT_METRICS message (with and without the new copy-from > > metrics). > > > > Will this extra metric ever be useful on the MDS side? From what I > > understood Patrick's initial request was to have a way to find out, on the > > client, if remote copies are really happening. (*sigh* for not having > > tracepoints.) > > > > Anyway, I can look into adding this to the metrics infrastructure, but > > it'll likely take me some more time to get to it and to figure out (once > > again) how the messages versioning work. > > > > I think it is useful info to report to the MDS, but it's not required to > send these to the MDS to solve the current problem. My suggestion would > be to add what's needed to track these stats in the kclient and report > them via debugfs, but don't send the info to the MDS just yet. > > Later, we could extend the protocol with COPY stats, and add the > necessary infrastructure to the MDS to deal with it. Once that's in > place, we can then extend the kclient to start sending this info along > when it reports the stats. Awesome, that sounds good to me. I'll look into re-writing this patch following your suggestion. Thanks! Cheers, -- Lu?s