Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751251AbZIOI2E (ORCPT ); Tue, 15 Sep 2009 04:28:04 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1750785AbZIOI15 (ORCPT ); Tue, 15 Sep 2009 04:27:57 -0400 Received: from sj-iport-2.cisco.com ([171.71.176.71]:8833 "EHLO sj-iport-2.cisco.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750768AbZIOI1z (ORCPT ); Tue, 15 Sep 2009 04:27:55 -0400 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: ApoEALPsrkqrR7PD/2dsb2JhbADDa4hMAY8xBYQX X-IronPort-AV: E=Sophos;i="4.44,388,1249257600"; d="scan'208";a="204663260" From: Roland Dreier To: KOSAKI Motohiro Cc: Jason Gunthorpe , Brice Goglin , linux-rdma@vger.kernel.org, linux-kernel@vger.kernel.org, general@lists.openfabrics.org, akpm@linux-foundation.org, torvalds@linux-foundation.org Cc: jsquyres@cisco.com Subject: Re: [ofa-general] Re: [GIT PULL] please pull ummunotify References: <20090911064019.GZ4973@obsidianresearch.com> <20090915155231.DB86.A69D9226@jp.fujitsu.com> X-Message-Flag: Warning: May contain useful information Date: Tue, 15 Sep 2009 01:27:57 -0700 In-Reply-To: <20090915155231.DB86.A69D9226@jp.fujitsu.com> (KOSAKI Motohiro's message of "Tue, 15 Sep 2009 16:03:04 +0900 (JST)") Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.91 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-OriginalArrivalTime: 15 Sep 2009 08:27:57.0980 (UTC) FILETIME=[6E1BADC0:01CA35DE] Authentication-Results: sj-dkim-3; header.From=rdreier@cisco.com; dkim=pass ( sig from cisco.com/sjdkim3002 verified; ); Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2473 Lines: 57 > - I guess you have your MPI implementaion w/ ummunotify, right? Yes, Jeff Squyres (cc'ed) has an Open MPI prototype (mercurial tree at http://bitbucket.org/jsquyres/ummunot/). > - I guess you have test sevaral pattern, right? > if so, can we see your test result? Open MPI has a pretty extensive automated test fabric -- I don't have a link handy but I believe all the tests that pass with unmodified Open MPI currently still pass with ummunotify. Maybe Jeff has a link. > - I think you can explain your MPI advantage/disadvantage against > current OpenMPI (or mpich et al). The advantage is as Jeff explained in his blog post (http://blogs.cisco.com/ciscotalk/performance/comments/better_linux_memory_tracking/), namely the performance improvement of memory registration caching without the reliability problems caused by previous approaches to caching such as trying to hook malloc etc (which are fragile because the great diversity of MPI-using codes find ways to mess up all previous userspace-only approaches). > - I guess your patch dramatically improve MPI implementaion, but > it's not free. it request some limitation to MPI application, right? Not that I know of, beyond already existing limitations. > - I imagine multi thread and fork. Is there another linmitaion? There are no new limitations on multi-threaded codes or on use of fork that I know of. Of course, buggy code that does something like passing a buffer to MPI in one thread and then freeing that buffer from another thread before MPI is done with it is still buggy; but ummunotify actually increases the ability of the MPI implementation to detect such bugs and give useful diagnostic information. > - In past discuttion, you said ummunotify user should not use > multi threading. you also think user should not fork? I don't recall where I said ummunotify users should not be multithreaded. I don't know of any problem with that. Also code using ummunotify can fork -- ummunotify simply does not fix issues with copy-on-write for buffers that are in use, just as it does not fix multithreaded code that has a race between using a buffer and freeing the same buffer. Hope this clarifies things. - Roland -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/