Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753956AbYK0TMU (ORCPT ); Thu, 27 Nov 2008 14:12:20 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752442AbYK0TMI (ORCPT ); Thu, 27 Nov 2008 14:12:08 -0500 Received: from smtp-out.google.com ([216.239.45.13]:39263 "EHLO smtp-out.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752473AbYK0TMH (ORCPT ); Thu, 27 Nov 2008 14:12:07 -0500 DomainKey-Signature: a=rsa-sha1; s=beta; d=google.com; c=nofws; q=dns; h=message-id:date:from:user-agent:mime-version:to:cc:subject: references:in-reply-to:content-type: content-transfer-encoding:x-gmailtapped-by:x-gmailtapped; b=tvqnJOzJzommNM/QaJssiJ9ZPV/yXft94QhbSYcUKjhnI+Tqf2l2qUCRWgPpxVk6k RxUjKNrwDyWaA0WqzEarQ== Message-ID: <492EF0C2.4090108@google.com> Date: Thu, 27 Nov 2008 11:10:58 -0800 From: Mike Waychison User-Agent: Thunderbird 2.0.0.18 (Windows/20081105) MIME-Version: 1.0 To: Peter Zijlstra CC: Nick Piggin , Ying Han , Ingo Molnar , linux-mm@kvack.org, linux-kernel@vger.kernel.org, akpm , David Rientjes , Rohit Seth , Hugh Dickins , "H. Peter Anvin" , edwintorok@gmail.com Subject: Re: [RFC v1][PATCH]page_fault retry with NOPAGE_RETRY References: <604427e00811212247k1fe6b63u9efe8cfe37bddfb5@mail.gmail.com> <20081123091843.GK30453@elte.hu> <604427e00811251042t1eebded6k9916212b7c0c2ea0@mail.gmail.com> <20081126123246.GB23649@wotan.suse.de> <492DAA24.8040100@google.com> <20081127085554.GD28285@wotan.suse.de> <492E6849.6090205@google.com> <1227780007.4454.1344.camel@twins> In-Reply-To: <1227780007.4454.1344.camel@twins> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-GMailtapped-By: 172.25.146.18 X-GMailtapped: mikew Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2010 Lines: 42 Peter Zijlstra wrote: > On Thu, 2008-11-27 at 01:28 -0800, Mike Waychison wrote: > >> Correct. I don't recall the numbers from the pathelogical cases we were >> seeing, but iirc, it was on the order of 10s of seconds, likely >> exascerbated by slower than usual disks. I've been digging through my >> inbox to find numbers without much success -- we've been using a variant >> of this patch since 2.6.11. > >> We generally try to avoid such things, but sometimes it a) can't be >> easily avoided (third party libraries for instance) and b) when it hits >> us, it affects the overall health of the machine/cluster (the monitoring >> daemons get blocked, which isn't very healthy). > > If its only monitoring, there might be another solution. If you can keep > the required data in a separate (approximate) copy so that you don't > need mmap_sem at all to show them. > > If your mmap_sem is so contended your latencies are unacceptable, adding > more users to it - even statistics gathering, just isn't going to cure > the situation. > > Furthermore, /proc code usually isn't written with performance in mind, > so its usually simple and robust code. Adding it to a 'hot'-path like > you're doing doesn't seem advisable. > > Also, releasing and re-acquiring mmap_sem can significantly add to the > cacheline bouncing that thing already has. > This is much less of a worry. We expect to be able to look at these things on the order of 1HZ, so cacheline bouncing becomes negligible. Latency to lock acquire however hurts and is silly considering it's just another reader. Our monitoring software here is acting as a litmus test and the real pain is felt by other threads in the same process who are also blocked trying to acquire the read lock. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/