Date: Wed, 2 Sep 2020 17:11:44 +0200
From: Christoph Hellwig
To: Mike Snitzer
Cc: Christoph Hellwig, Jens Axboe, linux-raid@vger.kernel.org,
	Hans de Goede, Minchan Kim, Richard Weinberger,
	linux-kernel@vger.kernel.org, linux-block@vger.kernel.org,
	Song Liu, dm-devel@redhat.com, linux-mtd@lists.infradead.org,
	cgroups@vger.kernel.org, drbd-dev@tron.linbit.com,
	linux-fsdevel@vger.kernel.org, linux-mm@kvack.org,
	martin.petersen@oracle.com
Subject: Re: [PATCH 06/14] block: lift setting the readahead size into the block layer
Message-ID: <20200902151144.GA1738@lst.de>
References: <20200726150333.305527-1-hch@lst.de>
	<20200726150333.305527-7-hch@lst.de>
	<20200826220737.GA25613@redhat.com>
In-Reply-To: <20200826220737.GA25613@redhat.com>
On Wed, Aug 26, 2020 at 06:07:38PM -0400, Mike Snitzer wrote:
> On Sun, Jul 26 2020 at 11:03am -0400,
> Christoph Hellwig wrote:
>
> > Drivers shouldn't really mess with the readahead size, as that is a VM
> > concept.  Instead set it based on the optimal I/O size by lifting the
> > algorithm from the md driver when registering the disk.  Also set
> > bdi->io_pages there as well by applying the same scheme based on
> > max_sectors.
> >
> > Signed-off-by: Christoph Hellwig
> > ---
> >  block/blk-settings.c         |  5 ++---
> >  block/blk-sysfs.c            |  1 -
> >  block/genhd.c                | 13 +++++++++++--
> >  drivers/block/aoe/aoeblk.c   |  2 --
> >  drivers/block/drbd/drbd_nl.c | 12 +-----------
> >  drivers/md/bcache/super.c    |  4 ----
> >  drivers/md/dm-table.c        |  3 ---
> >  drivers/md/raid0.c           | 16 ----------------
> >  drivers/md/raid10.c          | 24 +-----------------------
> >  drivers/md/raid5.c           | 13 +------------
> >  10 files changed, 16 insertions(+), 77 deletions(-)
>
> In general these changes need a solid audit relative to stacking
> drivers.  That is, the limits stacking methods (blk_stack_limits)
> vs the lower-level allocation methods (__device_add_disk).
>
> You optimized for the low-level __device_add_disk establishing the
> bdi's ra_pages and io_pages.  That is at the beginning of disk
> allocation, well before any build-up of a stacking driver's
> queue_io_opt() -- which was previously done in disk_stack_limits or
> driver-specific methods (e.g. dm_table_set_restrictions) that are
> called _after_ all the limits stacking occurs.
>
> By inverting the setting of the bdi's ra_pages and io_pages to happen
> so early in __device_add_disk, it will break properly setting these
> values for at least DM, AFAICT.

ra_pages never got inherited by stacking drivers; you can check that by
modifying it on an underlying device and then creating a trivial dm or
md device on top of it.  And I think that is a good thing - in general
we shouldn't mess with the readahead size from drivers if we can avoid
it.  I've kept the legacy aoe and md parity raid cases, of which the
first looks pretty weird and the md one at least remotely sensible.

->io_pages is still inherited in disk_stack_limits, just as before, so
there is no change there either.
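
For reference, the scheme the commit message describes amounts to
roughly the following in __device_add_disk() - a sketch of the idea
rather than the literal diff:

	struct backing_dev_info *bdi = disk->queue->backing_dev_info;

	/*
	 * For read-ahead of large files to be effective we need to read
	 * ahead at least twice the optimal I/O size - the same heuristic
	 * the md driver used.
	 */
	bdi->ra_pages = max(queue_io_opt(disk->queue) * 2 / PAGE_SIZE,
			    VM_READAHEAD_PAGES);
	/* and the same scheme for io_pages, based on max_sectors */
	bdi->io_pages = queue_max_sectors(disk->queue) >> (PAGE_SHIFT - 9);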
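
The md logic being lifted here is the old per-personality read-ahead
sizing; for the striped case it looked roughly like this (simplified
and quoted from memory):

	/* read ahead at least twice a whole stripe for striped arrays */
	int stripe = mddev->raid_disks *
		(mddev->chunk_sectors << 9) / PAGE_SIZE;

	if (mddev->queue->backing_dev_info->ra_pages < 2 * stripe)
		mddev->queue->backing_dev_info->ra_pages = 2 * stripe;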
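
And the ->io_pages inheritance mentioned above is the existing
assignment in disk_stack_limits(), which this series keeps working the
same way - again from memory, where t is the request_queue of the
stacking (upper) disk:

	t->backing_dev_info->io_pages =
		t->limits.max_sectors >> (PAGE_SHIFT - 9);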