Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1764051AbXJPEjX (ORCPT ); Tue, 16 Oct 2007 00:39:23 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1754443AbXJPEjM (ORCPT ); Tue, 16 Oct 2007 00:39:12 -0400 Received: from ebiederm.dsl.xmission.com ([166.70.28.69]:46078 "EHLO ebiederm.dsl.xmission.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754313AbXJPEjI (ORCPT ); Tue, 16 Oct 2007 00:39:08 -0400 From: ebiederm@xmission.com (Eric W. Biederman) To: Nick Piggin Cc: Rob Landley , Theodore Tso , James Bottomley , Matthew Wilcox , linux-kernel@vger.kernel.org, linux-scsi@vger.kernel.org, Jens Axboe , Suparna Bhattacharya , Nick Piggin Subject: Re: OOM killer gripe (was Re: What still uses the block layer?) References: <200710112011.22000.rob@landley.net> <200710152337.45252.nickpiggin@yahoo.com.au> <200710161659.28826.nickpiggin@yahoo.com.au> Date: Mon, 15 Oct 2007 22:38:33 -0600 In-Reply-To: <200710161659.28826.nickpiggin@yahoo.com.au> (Nick Piggin's message of "Tue, 16 Oct 2007 16:59:28 +1000") Message-ID: User-Agent: Gnus/5.110006 (No Gnus v0.6) Emacs/21.4 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3845 Lines: 101 Nick Piggin writes: > On Tuesday 16 October 2007 13:55, Eric W. Biederman wrote: >> Nick Piggin writes: > >> > How much swap do you have configured? You really shouldn't configure >> > so much unless you do want the kernel to actually use it all, right? >> >> No. >> >> There are three basic swapping scenarios. >> - Pushing unused data out of ram >> - Swapping >> - Thrashing >> >> To effectively swap you need SWAP > RAM because after a little while of >> swapping all of your pages in RAM should be assigned a location in the >> page cache. > > I don't follow your logic. We don't need SWAP > RAM in order to swap > effectively, IMO. The steady state of a system that is heavily and usably swapping but not thrashing is that all of the pages in RAM are in the swap cache, at least that used to be the case. >> I have not heard of many people swapping and not thrashing lately. >> I think part of the problem is that we do random access to the swap >> partition which makes us seek limited. And since the number of >> seeks per unit time has been increasing at a linear or slower rate >> that if we are doing random disk I/O then the amount we can use > > I don't know if there is a causal relationship there. I mean, I > think it's been a long time since thrashing was ever a viable mode > of operation, right? Right. But swapping heavily has been a viable mode of operation and that the vast gap in disk random IO performance seems to have hurt significantly. It be very clear is used to able to run a problem at little below full speed with the disk pegged with swap traffic, and I did this regularly when I started out with linux. > Maybe desktops just have less need for swapping now, so nobody sees > it much until something goes _really_ bad. When I'm using my 256MB > machine, unused stuff goes to swap. There is a bit of truth in the fact that there is less need for swapping now. At the same time however swapping simply does not work well right now, and I'm not at all certain why. >> the disk for is very limited. I wonder if we could figure out >> how to push and pull 1M or bigger chunks into and out of swap? > > Pulling in 1MB pages can really easily end up compounding the > thrashing problem unless you're very sure a significant amount > of it will be used. It's a hard call. The I/O time for 1MB of contiguous disk data is about the I/O time of 512 bytes of contiguous disk data. >> I don't know if swap has actually worked since we vmscan stopped >> going over the virtual addresses. > > I do, and it does ;) Really? Not just the pushing of unused stuff into swap. >> > Because if we're not really conservative about OOM killing, then the >> > user who actually really did want to use all the swap they configured >> > gets angry when we kill their jobs without using it all. >> >> I totally agree. The fact that the OOM killer started is a sign that >> the system was completely overwhelmed and nothing better could happen. >> >> In this case my gut feel says limiting the total number of processes >> would have been much more effective then anything at all to do with >> swap. make -j reminds me of the classic fork bomb. > > Yep. > > >> > Would an oom-kill-someone-now sysrq be of help, I wonder? >> >> Well we have SAQ which should kill everything on your current VT >> which should include X and all of it's children. > > Which is exactly what you don't want to do if you've just forkbombed > yourself. I missed the fact that we now have a manual oom kill... You probably have a point there. Eric - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/