Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751149AbeACC2t (ORCPT + 1 other); Tue, 2 Jan 2018 21:28:49 -0500 Received: from LGEAMRELO11.lge.com ([156.147.23.51]:57524 "EHLO lgeamrelo11.lge.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750927AbeACC2r (ORCPT ); Tue, 2 Jan 2018 21:28:47 -0500 X-Original-SENDERIP: 156.147.1.126 X-Original-MAILFROM: byungchul.park@lge.com X-Original-SENDERIP: 10.177.222.184 X-Original-MAILFROM: byungchul.park@lge.com Subject: Re: About the try to remove cross-release feature entirely by Ingo To: Matthew Wilcox , Theodore Ts'o , Byungchul Park , Thomas Gleixner , Peter Zijlstra , Ingo Molnar , david@fromorbit.com, Linus Torvalds , Amir Goldstein , linux-kernel@vger.kernel.org, linux-mm@kvack.org, linux-block@vger.kernel.org, linux-fsdevel@vger.kernel.org, oleg@redhat.com, kernel-team@lge.com, daniel@ffwll.ch References: <20171229014736.GA10341@X58A-UD3R> <20171229035146.GA11757@thunk.org> <20171229072851.GA12235@X58A-UD3R> <20171230061624.GA27959@bombadil.infradead.org> <20171230154041.GB3366@thunk.org> <20171230204417.GF27959@bombadil.infradead.org> <20171230224028.GC3366@thunk.org> <20171230230057.GB12995@thunk.org> <20180101101855.GA23567@bombadil.infradead.org> From: Byungchul Park Message-ID: <06e73e69-9c78-07bc-3d06-a51accb2645b@lge.com> Date: Wed, 3 Jan 2018 11:28:44 +0900 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.2 MIME-Version: 1.0 In-Reply-To: <20180101101855.GA23567@bombadil.infradead.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Return-Path: On 1/1/2018 7:18 PM, Matthew Wilcox wrote: > On Sat, Dec 30, 2017 at 06:00:57PM -0500, Theodore Ts'o wrote: >> On Sat, Dec 30, 2017 at 05:40:28PM -0500, Theodore Ts'o wrote: >>> On Sat, Dec 30, 2017 at 12:44:17PM -0800, Matthew Wilcox wrote: >>>> >>>> I'm not sure I agree with this part. What if we add a new TCP lock class >>>> for connections which are used for filesystems/network block devices/...? >>>> Yes, it'll be up to each user to set the lockdep classification correctly, >>>> but that's a relatively small number of places to add annotations, >>>> and I don't see why it wouldn't work. >>> >>> I was exagerrating a bit for effect, I admit. (but only a bit). > > I feel like there's been rather too much of that recently. Can we stick > to facts as far as possible, please? > >>> It can probably be for all TCP connections that are used by kernel >>> code (as opposed to userspace-only TCP connections). But it would >>> probably have to be each and every device-mapper instance, each and >>> every block device, each and every mounted file system, each and every >>> bdi object, etc. >> >> Clarification: all TCP connections that are used by kernel code would >> need to be in their own separate lock class. All TCP connections used >> only by userspace could be in their own shared lock class. You can't >> use a one lock class for all kernel-used TCP connections, because of >> the Network Block Device mounted on a local file system which is then >> exported via NFS and squirted out yet another TCP connection problem. > > So the false positive you're concerned about is write-comes-in-over-NFS > (with socket lock held), NFS sends a write request to local filesystem, > local filesystem sends write to block device, block device sends a > packet to a socket which takes that socket lock. > > I don't think we need to be as drastic as giving each socket its own lock > class to solve this. All NFS sockets can be in lock class A; all NBD > sockets can be in lock class B; all user sockets can be in lock class > C; etc. > >> Also, what to do with TCP connections which are created in userspace >> (with some authentication exchanges happening in userspace), and then >> passed into kernel space for use in kernel space, is an interesting >> question. > > Yes! I'd love to have a lockdep expert weigh in here. I believe it's > legitimate to change a lock's class after it's been used, essentially > destroying it and reinitialising it. If not, it should be because it's > a reasonable design for an object to need different lock classes for > different phases of its existance. I also think it should be done ultimately. And I think it's very much hard since it requires to change the dependency graph of lockdep but anyway possible. It's up to lockdep maintainer's will though.. -- Thanks, Byungchul