From: Tejun Heo
To: axboe@kernel.dk
Cc: michaelcallahan@fb.com, newella@fb.com, linux-block@vger.kernel.org, linux-kernel@vger.kernel.org, kernel-team@fb.com, Tejun Heo
Subject: [PATCH 5/6] block: Track DISCARD statistics and output them in stat and diskstat
Date: Tue, 5 Jun 2018 11:01:19 -0700
Message-Id: <20180605180120.2726113-6-tj@kernel.org>
X-Mailer: git-send-email 2.9.5
In-Reply-To: <20180605180120.2726113-1-tj@kernel.org>
References: <20180605180120.2726113-1-tj@kernel.org>
Sender: linux-kernel-owner@vger.kernel.org
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org

From: Michael Callahan

Add tracking of REQ_OP_DISCARD I/Os to the partition statistics and
append them to the various stat files in /sys as well as
/proc/diskstats. These are tracked with the same four stats as reads
and writes:

Number of discard I/Os completed
Number of discard I/Os merged
Number of discard sectors completed
Milliseconds spent on discard requests

This is done by adding a new STAT_DISCARD entry to enum stat_group in
blk_types.h and then using it to index the stat fields for discard
requests.

tj: Refreshed on top of v4.17 and other previous updates.

Signed-off-by: Michael Callahan
Signed-off-by: Tejun Heo
Cc: Andy Newell
---
 Documentation/ABI/testing/procfs-diskstats | 10 ++++++++++
 Documentation/block/stat.txt               | 28 ++++++++++++++++------------
 Documentation/iostats.txt                  | 15 +++++++++++++++
 block/genhd.c                              | 13 ++++++++++---
 block/partition-generic.c                  |  9 +++++++--
 include/linux/blk_types.h                  |  8 ++++++++
 include/linux/genhd.h                      |  3 ++-
 7 files changed, 68 insertions(+), 18 deletions(-)

diff --git a/Documentation/ABI/testing/procfs-diskstats b/Documentation/ABI/testing/procfs-diskstats
index f91a973..abac31d 100644
--- a/Documentation/ABI/testing/procfs-diskstats
+++ b/Documentation/ABI/testing/procfs-diskstats
@@ -5,6 +5,7 @@ Contact:	Jerome Marchand
 		The /proc/diskstats file displays the I/O statistics
 		of block devices. Each line contains the following 14
 		fields:
+
 		 1 - major number
 		 2 - minor mumber
 		 3 - device name
@@ -19,4 +20,13 @@ Contact:	Jerome Marchand
 		12 - I/Os currently in progress
 		13 - time spent doing I/Os (ms)
 		14 - weighted time spent doing I/Os (ms)
+
+		Kernel 4.18+ appends four more fields for discard
+		tracking putting the total at 18:
+
+		15 - discards completed successfully
+		16 - discards merged
+		17 - sectors discarded
+		18 - time spent discarding
+
 		For more details refer to Documentation/iostats.txt
diff --git a/Documentation/block/stat.txt b/Documentation/block/stat.txt
index 0dbc946..0aace9c 100644
--- a/Documentation/block/stat.txt
+++ b/Documentation/block/stat.txt
@@ -31,28 +31,32 @@ write ticks     milliseconds  total wait time for write requests
 in_flight       requests      number of I/Os currently in flight
 io_ticks        milliseconds  total time this block device has been active
 time_in_queue   milliseconds  total wait time for all requests
+discard I/Os    requests      number of discard I/Os processed
+discard merges  requests      number of discard I/Os merged with in-queue I/O
+discard sectors sectors       number of sectors discarded
+discard ticks   milliseconds  total wait time for discard requests
 
-read I/Os, write I/Os
-=====================
+read I/Os, write I/Os, discard I/Os
+===================================
 
 These values increment when an I/O request completes.
 
-read merges, write merges
-=========================
+read merges, write merges, discard merges
+=========================================
 
 These values increment when an I/O request is merged with an
 already-queued I/O request.
 
-read sectors, write sectors
-===========================
+read sectors, write sectors, discard sectors
+============================================
 
-These values count the number of sectors read from or written to this
-block device. The "sectors" in question are the standard UNIX 512-byte
-sectors, not any device- or filesystem-specific block size. The
-counters are incremented when the I/O completes.
+These values count the number of sectors read from, written to, or
+discarded from this block device. The "sectors" in question are the
+standard UNIX 512-byte sectors, not any device- or filesystem-specific
+block size. The counters are incremented when the I/O completes.
 
-read ticks, write ticks
-=======================
+read ticks, write ticks, discard ticks
+======================================
 
 These values count the number of milliseconds that I/O requests have
 waited on this block device. If there are multiple I/O requests waiting,
diff --git a/Documentation/iostats.txt b/Documentation/iostats.txt
index 04d394a..49df45f 100644
--- a/Documentation/iostats.txt
+++ b/Documentation/iostats.txt
@@ -31,6 +31,9 @@ and so should not differ.
 	   3    0   hda 446216 784926 9550688 4382310 424847 312726 5922052 19310380 0 3376340 23705160
 	   3    1   hda1 35486 38030 38030 38030
 
+ 4.18+ diskstats:
+	   3    0   hda 446216 784926 9550688 4382310 424847 312726 5922052 19310380 0 3376340 23705160 0 0 0 0
+
 On 2.4 you might execute ``grep 'hda ' /proc/partitions``. On 2.6+, you have
 a choice of ``cat /sys/block/hda/stat`` or ``grep 'hda ' /proc/diskstats``.
 
@@ -101,6 +104,18 @@ Field 11 -- weighted # of milliseconds spent doing I/Os
     last update of this field. This can provide an easy measure of both
     I/O completion time and the backlog that may be accumulating.
 
+Field 12 -- # of discards completed
+    This is the total number of discards completed successfully.
+
+Field 13 -- # of discards merged
+    See the description of field 2
+
+Field 14 -- # of sectors discarded
+    This is the total number of sectors discarded successfully.
+
+Field 15 -- # of milliseconds spent discarding
+    This is the total number of milliseconds spent by all discards (as
+    measured from __make_request() to end_that_request_last()).
 
 To avoid introducing performance bottlenecks, no locks are held while
 modifying these counters. This implies that minor inaccuracies may be
diff --git a/block/genhd.c b/block/genhd.c
index 0711a80..8cc719a3 100644
--- a/block/genhd.c
+++ b/block/genhd.c
@@ -1333,8 +1333,11 @@ static int diskstats_show(struct seq_file *seqf, void *v)
 		part_round_stats(gp->queue, cpu, hd);
 		part_stat_unlock();
 		part_in_flight(gp->queue, hd, inflight);
-		seq_printf(seqf, "%4d %7d %s %lu %lu %lu "
-			   "%u %lu %lu %lu %u %u %u %u\n",
+		seq_printf(seqf, "%4d %7d %s "
+			   "%lu %lu %lu %u "
+			   "%lu %lu %lu %u "
+			   "%u %u %u "
+			   "%lu %lu %lu %u\n",
 			   MAJOR(part_devt(hd)), MINOR(part_devt(hd)),
 			   disk_name(gp, hd->partno, buf),
 			   part_stat_read(hd, ios[STAT_READ]),
@@ -1347,7 +1350,11 @@ static int diskstats_show(struct seq_file *seqf, void *v)
 			   jiffies_to_msecs(part_stat_read(hd, ticks[STAT_WRITE])),
 			   inflight[0],
 			   jiffies_to_msecs(part_stat_read(hd, io_ticks)),
-			   jiffies_to_msecs(part_stat_read(hd, time_in_queue))
+			   jiffies_to_msecs(part_stat_read(hd, time_in_queue)),
+			   part_stat_read(hd, ios[STAT_DISCARD]),
+			   part_stat_read(hd, merges[STAT_DISCARD]),
+			   part_stat_read(hd, sectors[STAT_DISCARD]),
+			   jiffies_to_msecs(part_stat_read(hd, ticks[STAT_DISCARD]))
 			);
 	}
 	disk_part_iter_exit(&piter);
diff --git a/block/partition-generic.c b/block/partition-generic.c
index 0ddb067..5a8975a 100644
--- a/block/partition-generic.c
+++ b/block/partition-generic.c
@@ -130,7 +130,8 @@ ssize_t part_stat_show(struct device *dev,
 	return sprintf(buf,
 		"%8lu %8lu %8llu %8u "
 		"%8lu %8lu %8llu %8u "
-		"%8u %8u %8u"
+		"%8u %8u %8u "
+		"%8lu %8lu %8llu %8u"
 		"\n",
 		part_stat_read(p, ios[STAT_READ]),
 		part_stat_read(p, merges[STAT_READ]),
@@ -142,7 +143,11 @@ ssize_t part_stat_show(struct device *dev,
 		jiffies_to_msecs(part_stat_read(p, ticks[STAT_WRITE])),
 		inflight[0],
 		jiffies_to_msecs(part_stat_read(p, io_ticks)),
-		jiffies_to_msecs(part_stat_read(p, time_in_queue)));
+		jiffies_to_msecs(part_stat_read(p, time_in_queue)),
+		part_stat_read(p, ios[STAT_DISCARD]),
+		part_stat_read(p, merges[STAT_DISCARD]),
+		(unsigned long long)part_stat_read(p, sectors[STAT_DISCARD]),
+		jiffies_to_msecs(part_stat_read(p, ticks[STAT_DISCARD])));
 }
 
 ssize_t part_inflight_show(struct device *dev, struct device_attribute *attr,
diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
index 698890a..6d72eb2 100644
--- a/include/linux/blk_types.h
+++ b/include/linux/blk_types.h
@@ -361,6 +361,7 @@ enum req_flag_bits {
 enum stat_group {
 	STAT_READ,
 	STAT_WRITE,
+	STAT_DISCARD,
 
 	NR_STAT_GROUPS
 };
@@ -402,8 +403,15 @@ static inline bool op_is_sync(unsigned int op)
 		(op & (REQ_SYNC | REQ_FUA | REQ_PREFLUSH));
 }
 
+static inline bool op_is_discard(unsigned int op)
+{
+	return (op & REQ_OP_MASK) == REQ_OP_DISCARD;
+}
+
 static inline int op_stat_group(unsigned int op)
 {
+	if (op_is_discard(op))
+		return STAT_DISCARD;
 	return op_is_write(op);
 }
 
diff --git a/include/linux/genhd.h b/include/linux/genhd.h
index a754454..5786442 100644
--- a/include/linux/genhd.h
+++ b/include/linux/genhd.h
@@ -356,7 +356,8 @@ static inline void free_part_stats(struct hd_struct *part)
 
 #define part_stat_read_accum(part, field)				\
 	(part_stat_read(part, field[STAT_READ]) +			\
-	 part_stat_read(part, field[STAT_WRITE]))
+	 part_stat_read(part, field[STAT_WRITE]) +			\
+	 part_stat_read(part, field[STAT_DISCARD]))
 
 #define part_stat_add(cpu, part, field, addnd)	do {			\
 	__part_stat_add((cpu), (part), field, addnd);			\
-- 
2.9.5
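
Not part of the patch: a minimal userspace sketch of how a consumer of /proc/diskstats might cope with the four appended fields. It assumes the layout documented above (14 fields before this change, 18 after). The names struct diskstats_line and parse_diskstats_line() are hypothetical and exist only for illustration; they do not correspond to any existing tool or kernel interface.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical userspace view of one /proc/diskstats line. */
struct diskstats_line {
	unsigned int major, minor;
	char name[32];
	unsigned long rd_ios, rd_merges, rd_sectors;
	unsigned int rd_ticks;
	unsigned long wr_ios, wr_merges, wr_sectors;
	unsigned int wr_ticks;
	unsigned int in_flight, io_ticks, time_in_queue;
	unsigned long dc_ios, dc_merges, dc_sectors;	/* fields 15-17 */
	unsigned int dc_ticks;				/* field 18 */
	int has_discard;	/* 1 only if all four discard fields parsed */
};

/*
 * Parse one line.  Returns 0 on success and -1 if fewer than the 14
 * pre-existing fields are present.  The discard fields are optional,
 * so output from kernels without this patch still parses.
 */
static int parse_diskstats_line(const char *line, struct diskstats_line *st)
{
	int n;

	assert(line && st);
	memset(st, 0, sizeof(*st));

	/* Same grouping as the seq_printf() format in diskstats_show(). */
	n = sscanf(line,
		   "%u %u %31s "
		   "%lu %lu %lu %u "
		   "%lu %lu %lu %u "
		   "%u %u %u "
		   "%lu %lu %lu %u",
		   &st->major, &st->minor, st->name,
		   &st->rd_ios, &st->rd_merges, &st->rd_sectors, &st->rd_ticks,
		   &st->wr_ios, &st->wr_merges, &st->wr_sectors, &st->wr_ticks,
		   &st->in_flight, &st->io_ticks, &st->time_in_queue,
		   &st->dc_ios, &st->dc_merges, &st->dc_sectors, &st->dc_ticks);
	if (n < 14)
		return -1;

	st->has_discard = (n == 18);
	if (!st->has_discard) {
		/* Drop a partially parsed discard group. */
		st->dc_ios = st->dc_merges = st->dc_sectors = 0;
		st->dc_ticks = 0;
	}
	return 0;
}
```

A caller would read /proc/diskstats line by line with fgets(), pass each line to parse_diskstats_line(), and check has_discard before reporting the discard counters. Treating the extra fields as optional keeps one binary usable on kernels with and without this change.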