Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752927Ab3EQD0G (ORCPT ); Thu, 16 May 2013 23:26:06 -0400 Received: from cn.fujitsu.com ([222.73.24.84]:28042 "EHLO song.cn.fujitsu.com" rhost-flags-OK-FAIL-OK-OK) by vger.kernel.org with ESMTP id S1752063Ab3EQD0F (ORCPT ); Thu, 16 May 2013 23:26:05 -0400 X-IronPort-AV: E=Sophos;i="4.87,689,1363104000"; d="scan'208";a="7291138" Message-ID: <5195A3F4.70803@cn.fujitsu.com> Date: Fri, 17 May 2013 11:28:52 +0800 From: Tang Chen User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:12.0) Gecko/20120430 Thunderbird/12.0.1 MIME-Version: 1.0 To: Benjamin LaHaise CC: Mel Gorman , Minchan Kim , Lin Feng , akpm@linux-foundation.org, viro@zeniv.linux.org.uk, khlebnikov@openvz.org, walken@google.com, kamezawa.hiroyu@jp.fujitsu.com, riel@redhat.com, rientjes@google.com, isimatu.yasuaki@jp.fujitsu.com, wency@cn.fujitsu.com, laijs@cn.fujitsu.com, jiang.liu@huawei.com, zab@redhat.com, jmoyer@redhat.com, linux-mm@kvack.org, linux-aio@kvack.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Marek Szyprowski Subject: Re: [WiP]: aio support for migrating pages (Re: [PATCH V2 1/2] mm: hotplug: implement non-movable version of get_user_pages() called get_user_pages_non_movable()) References: <1360056113-14294-1-git-send-email-linfeng@cn.fujitsu.com> <1360056113-14294-2-git-send-email-linfeng@cn.fujitsu.com> <20130205120137.GG21389@suse.de> <20130206004234.GD11197@blaptop> <20130206095617.GN21389@suse.de> <5190AE4F.4000103@cn.fujitsu.com> <20130513091902.GP11497@suse.de> <5191B5B3.7080406@cn.fujitsu.com> <20130515132453.GB11497@suse.de> <5194748A.5070700@cn.fujitsu.com> <20130517002349.GI1008@kvack.org> In-Reply-To: <20130517002349.GI1008@kvack.org> X-MIMETrack: Itemize by SMTP Server on mailserver/fnst(Release 8.5.3|September 15, 2011) at 2013/05/17 11:24:44, Serialize by Router on mailserver/fnst(Release 8.5.3|September 15, 2011) at 2013/05/17 11:24:48, Serialize complete at 2013/05/17 11:24:48 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=ISO-8859-1; format=flowed Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2560 Lines: 63 Hi Benjamin, Thank you very much for your idea. :) I have no objection to your idea, but seeing from your patch, this only works for aio subsystem because you changed the way to allocate the aio ring pages, with a file mapping. So far as I know, not only aio, but also other subsystems, such CMA, will also have problem like this. The page cannot be migrated because it is pinned in memory. So I think we should work out a common way to solve how to migrate pinned pages. I'm working in the way Mel has said, migrate_unpin() and migrate_pin() callbacks. But as you saw, I met some problems, like I don't where to put these two callbacks. And discussed with you guys, I want to try this: 1. Add a new member to struct page, used to remember the pin holders of this page, including the pin and unpin callbacks and the necessary data. This is more like a callback chain. (I'm worry about this step, I'm not sure if it is good enough. After all, we need a good place to put the callbacks.) And then, like Mel said, 2. Implement the callbacks in the subsystems, and register them to the new member in struct page. 3. Call these callbacks before and after migration. I think I'll send a RFC patch next week when I finished the outline. I'm just thinking of finding a common way to solve this problem that all the other subsystems will benefit. Thanks. :) On 05/17/2013 08:23 AM, Benjamin LaHaise wrote: > On Thu, May 16, 2013 at 01:54:18PM +0800, Tang Chen wrote: > ... >> OK, I'll try to figure out a proper place to put the callbacks. >> But I think we need to add something new to struct page. I'm just >> not sure if it is OK. Maybe we can discuss more about it when I send >> a RFC patch. > ... > > I ended up working on this a bit today, and managed to cobble together > something that somewhat works -- please see the patch below. It still is > not completely tested, and it has a rather nasty bug owing to the fact > that the file descriptors returned by anon_inode_getfile() all share the > same inode (read: more than one instance of aio does not work), but it > shows the basic idea. Also, bad things probably happen if someone does > an mremap() on the aio ring buffer. I'll polish this off sometime next > week after the long weekend if noone beats me to it. > > -ben -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/