Date: Thu, 17 Mar 2016 10:47:29 -0700
Subject: Re: [PATCH 2/2] block: create ioctl to discard-or-zeroout a range of blocks
From: Linus Torvalds
To: Gregory Farnum
Cc: Eric Sandeen, "Theodore Ts'o", Andreas Dilger, "Darrick J. Wong",
    Dave Chinner, Ric Wheeler, Andy Lutomirski, One Thousand Gnomes,
    Martin Petersen, Christoph Hellwig, Jens Axboe, Andrew Morton,
    Linux API, Linux Kernel Mailing List, shane.seymour@hpe.com,
    Bruce Fields, linux-fsdevel, Jeff Layton
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Mar 16, 2016 at 10:18 PM, Gregory Farnum wrote:
>
> So we've not asked for NO_HIDE_STALE on the mailing lists, but I think
> it was one of the problems Sage had using xfs in his BlueStore
> implementation and was a big part of why it moved to pure userspace.
> FileStore might use NO_HIDE_STALE in some places but it would be
> pretty limited. When it came up at Linux FAST we were discussing how
> it and similar things had been problems for us in the past and it
> would've been nice if they were upstream.

Hmm.

So to me it really sounds like somebody should cook up a patch, but we
shouldn't put it in the upstream kernel until we get numbers and
actual "yes, we'd use this" from outside of google.

I say "outside of google", because inside of google not only do we not
get numbers, but google can maintain their own patch.

But maybe Ted could at least post the patch google uses, and somebody
in the Ceph community might want to at least try it out...

> What *is* a big deal for
> FileStore (and would be easy to take advantage of) is the thematically
> similar O_NOMTIME flag, which is also about reducing metadata updates
> and got blocked on similar stupid-user grounds (although not security
> ones): http://thread.gmane.org/gmane.linux.kernel.api/10727.

Hmm. I don't hate that patch, because the NOATIME thing really does
wonders on many loads. NOMTIME makes sense. It's not like you can't do
this with utimes() anyway.

That said, I do wonder if people wouldn't just prefer to expand on and
improve on the lazytime. Is there some reason you guys didn't use
that?

> As noted though, we've basically given up and are moving to a
> pure-userspace solution as quickly as we can.

That argues against worrying about this all in the kernel unless there
are other users.

             Linus
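
For readers unfamiliar with the utimes() remark above, here is a minimal
sketch of the idea in Python (an illustration under stated assumptions,
not code from this thread): record a file's timestamps, write to it, then
put them back. The helper name write_preserving_mtime is hypothetical;
os.utime is Python's standard wrapper over the utimes()-family calls.
Note that the restore is itself a metadata update and bumps ctime, so
this only hides the mtime change after the fact. It does not reduce
metadata updates, which is what the O_NOMTIME flag mentioned in the
quoted text is about.

```python
import os


def write_preserving_mtime(path, data):
    """Append data to path, then restore the previous atime and mtime.

    Hypothetical helper for illustration only. Returns the preserved
    mtime in nanoseconds.
    """
    before = os.stat(path)

    with open(path, "ab") as f:
        f.write(data)
        f.flush()
        os.fsync(f.fileno())  # the write itself has updated mtime by now

    # ns= takes an (atime_ns, mtime_ns) tuple; ctime cannot be set
    # this way and is updated by the call.
    os.utime(path, ns=(before.st_atime_ns, before.st_mtime_ns))
    return before.st_mtime_ns
```

After a call such as write_preserving_mtime("data.bin", b"more"), a
stat() of the file should report the same mtime as before the write,
while the size and ctime reflect the change.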