Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753334AbYJ0WNl (ORCPT ); Mon, 27 Oct 2008 18:13:41 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751952AbYJ0WNc (ORCPT ); Mon, 27 Oct 2008 18:13:32 -0400 Received: from mail.crca.org.au ([67.207.131.56]:39537 "EHLO crca.org.au" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751873AbYJ0WNb (ORCPT ); Mon, 27 Oct 2008 18:13:31 -0400 X-Bogosity: Ham, spamicity=0.000000 Subject: Re: [linux-pm] Freezer: Don't count threads waiting for frozen filesystems. From: Nigel Cunningham To: Miklos Szeredi Cc: rjw@sisk.pl, linux-pm@lists.linux-foundation.org, linux-kernel@vger.kernel.org In-Reply-To: References: <1224886068.6478.21.camel@nigel-laptop> <1225106427.26724.5.camel@nigel-laptop> <200810271237.40049.rjw@sisk.pl> <1225107607.26724.9.camel@nigel-laptop> <1225141168.26724.23.camel@nigel-laptop> Content-Type: text/plain Organization: Christian Reformed Churches of Australia Date: Tue, 28 Oct 2008 09:13:27 +1100 Message-Id: <1225145607.26724.54.camel@nigel-laptop> Mime-Version: 1.0 X-Mailer: Evolution 2.22.3.1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3981 Lines: 91 Hi. On Mon, 2008-10-27 at 22:09 +0100, Miklos Szeredi wrote: > On Tue, 28 Oct 2008, Nigel Cunningham wrote: > > Hi. > > > > On Mon, 2008-10-27 at 13:38 +0100, Miklos Szeredi wrote: > > > On Mon, 27 Oct 2008, Nigel Cunningham wrote: > > > > Hi. > > > > > > > > On Mon, 2008-10-27 at 12:37 +0100, Rafael J. Wysocki wrote: > > > > > Well, I guess it's better if you post the entire thing so that we can see > > > > > what the role of the $subject patch is in it, even if this patch finally gets > > > > > merged separately. > > > > > > > > Ah.. that makes me see how vfs_check_frozen was getting triggered... > > > > (fs/namei.c, below). > > > > > > Nigel, thanks for the patch, it makes thinks a lot clearer. > > > > > > >From a quick look through the patch it seems to solve a bunch of cases > > > where new fuse requests during the freezing could cause problems. But > > > it doesn't do anything with requests that are already under way when > > > the freezing starts, which would still result in all the same > > > problems. > > > > > > Take this scenario: > > > > > > 1) process A does rename("/mnt/fuse/a", "/mnt/fuse/b") > > > 2) request goes to process B serving the fuse filesystem > > > 3) filesystems are frozen, no new fuse requests > > > 4) processes are frozen, let's say B first, then A > > > 5) freezing A will fail, since it's still waiting for the request to finish > > > > I'll have a look at the code again. I deliberately didn't stop existing > > requests, but perhaps that's the wrong behaviour. > > > > In the above scenario, process B won't see process A's new request until > > post-resume if the filesystem is already frozen, so steps 4 & 5 don't > > happen. > > No, I mean the filesystem is only frozen at 3 not before, so B is > already processing the request by the time the filesystem gets frozen. > > > Process B will also always be frozen before process A if process > > A is userspace (most likely in the above scenario) or was mounted after > > process B. (I've had this patch distributed as is for almost a year, > > with no problems at all, so I believe I'm right here). > > Both A and B are userspace processes. The order of freezing userspace > processes is basically random, there's no way to make sure that > processes doing work on behalf of a fuse filesystem (B) are frozen > after processes accessing the fuse filesystem (A). Sorry. You're right - I was confusing things in what I said, but I do have a (unconfused) solution: The answer is to freeze the fuse filesystems first, stopping new requests (freezing the processes making them) before they are passed on to userspace and allowing existing requests to complete or freeze. Once that is done, the userspace fuse processes will be idle, at least as far as fuse activity is concerned. After fuse activity is stopped, userspace can be frozen without worrying about what processes are fuse and what aren't. This is what my patch implements so far. To deal with requests that are already in progress, I'd suggest three possibilities, in the order I think they should be preferred. 1) Use wait_event_freezeable[_timeout] in fuse code. Probably preferable to #2, but I thought of it later :) 2) Use freezer_do_not_count as part of FUSE_MIGHT_FREEZE (resetting before exiting the callers, of course). If the request doesn't complete during the freezing period, it must be because the userspace thread was already frozen. If the request does complete, we're counted again during the normal freezing of userspace that follows the freezing of filesystems. 3) Adding a means to check whether processes being frozen are fuse requests. The code could then wait for fc->num_waiting - (say) fc->num_frozen == 0. Regards, Nigel -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/