Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754597AbbEOJyj (ORCPT ); Fri, 15 May 2015 05:54:39 -0400 Received: from mail.phunq.net ([184.71.0.62]:46985 "EHLO starbase.phunq.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754096AbbEOJyh (ORCPT ); Fri, 15 May 2015 05:54:37 -0400 Message-ID: <5555C268.4000304@phunq.net> Date: Fri, 15 May 2015 02:54:48 -0700 From: Daniel Phillips User-Agent: Mozilla/5.0 (X11; Linux i686; rv:31.0) Gecko/20100101 Thunderbird/31.6.0 MIME-Version: 1.0 To: Mel Gorman , Rik van Riel CC: linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org, tux3@tux3.org, OGAWA Hirofumi , Andrea Arcangeli , Peter Zijlstra Subject: Re: [FYI] tux3: Core changes References: <8f886f13-6550-4322-95be-93244ae61045@phunq.net> <55545C2F.8040207@phunq.net> <55549C2F.6000103@redhat.com> <5555388F.5010909@phunq.net> <555562AE.9020204@redhat.com> <20150515080902.GU2462@suse.de> In-Reply-To: <20150515080902.GU2462@suse.de> Content-Type: text/plain; charset=iso-8859-15 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1936 Lines: 49 On 05/15/2015 01:09 AM, Mel Gorman wrote: > On Thu, May 14, 2015 at 11:06:22PM -0400, Rik van Riel wrote: >> On 05/14/2015 08:06 PM, Daniel Phillips wrote: >>>> The issue is that things like ptrace, AIO, infiniband >>>> RDMA, and other direct memory access subsystems can take >>>> a reference to page A, which Tux3 clones into a new page B >>>> when the process writes it. >>>> >>>> However, while the process now points at page B, ptrace, >>>> AIO, infiniband, etc will still be pointing at page A. >>>> >>>> This causes the process and the other subsystem to each >>>> look at a different page, instead of at shared state, >>>> causing ptrace to do nothing, AIO and RDMA data to be >>>> invisible (or corrupted), etc... >>> >>> Is this a bit like page migration? >> >> Yes. Page migration will fail if there is an "extra" >> reference to the page that is not accounted for by >> the migration code. > > When I said it's not like page migration, I was referring to the fact > that a COW on a pinned page for RDMA is a different problem to page > migration. The COW of a pinned page can lead to lost writes or > corruption depending on the ordering of events. I see the lost writes case, but not the corruption case, Do you mean corruption by changing a page already in writeout? If so, don't all filesystems have that problem? If RDMA to a mmapped file races with write(2) to the same file, maybe it is reasonable and expected to lose some data. > Page migration fails > when there are unexpected problems to avoid this class of issue which is > fine for page migration but may be a critical failure in a filesystem > depending on exactly why the copy is required. Regards, Daniel -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/