Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S934231Ab0BQJLn (ORCPT ); Wed, 17 Feb 2010 04:11:43 -0500 Received: from smtp-out.google.com ([216.239.33.17]:41969 "EHLO smtp-out.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S934191Ab0BQJLl (ORCPT ); Wed, 17 Feb 2010 04:11:41 -0500 DomainKey-Signature: a=rsa-sha1; s=beta; d=google.com; c=nofws; q=dns; h=date:from:x-x-sender:to:cc:subject:in-reply-to:message-id: references:user-agent:mime-version:content-type:x-system-of-record; b=mFxv0MuYj0N7/a1YdG2EPOFuWwKoSiiVvXzhZxKJGh9xRkJsJtH4N8l7kNSwHX9bB qxpE6SQpS/PsraVL35b+g== Date: Wed, 17 Feb 2010 01:11:30 -0800 (PST) From: David Rientjes X-X-Sender: rientjes@chino.kir.corp.google.com To: KAMEZAWA Hiroyuki cc: Andrew Morton , Rik van Riel , Nick Piggin , Andrea Arcangeli , Balbir Singh , Lubos Lunak , KOSAKI Motohiro , linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: [patch -mm 4/9 v2] oom: remove compulsory panic_on_oom mode In-Reply-To: <20100217122106.31e12398.kamezawa.hiroyu@jp.fujitsu.com> Message-ID: References: <20100216092311.86bceb0c.kamezawa.hiroyu@jp.fujitsu.com> <20100217084239.265c65ea.kamezawa.hiroyu@jp.fujitsu.com> <20100217090124.398769d5.kamezawa.hiroyu@jp.fujitsu.com> <20100217094137.a0d26fbb.kamezawa.hiroyu@jp.fujitsu.com> <20100217111319.d342f10e.kamezawa.hiroyu@jp.fujitsu.com> <20100217113430.9528438d.kamezawa.hiroyu@jp.fujitsu.com> <20100217122106.31e12398.kamezawa.hiroyu@jp.fujitsu.com> User-Agent: Alpine 2.00 (DEB 1167 2008-08-23) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-System-Of-Record: true Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2726 Lines: 60 On Wed, 17 Feb 2010, KAMEZAWA Hiroyuki wrote: > > The hugetlb case seems to be the only misuse of VM_FAULT_OOM where it > > doesn't mean we simply don't have the memory to handle the page fault, > > i.e. your earlier "memory is exhausted" definition. That was handled well > > before calling out_of_memory() by simply killing current since we know it > > is faulting hugetlb pages and its resource is limited. > > > > We could pass the vma to pagefault_out_of_memory() and simply kill current > > if its killable and is_vm_hugetlb_page(vma). > > > > No. hugepage is not only case. > You may not read but we annoyed i915's driver bug recently and it was clearly > misuse of VM_FAULT_OOM. Then, we got many reports of OOM killer in these months. > (thanks to Kosaki about this.) > That's been fixed, right? > quick glance around core codes... > - HUGEPAGE at el. should return some VM_FAULT_NO_RESOUECE rather than VM_FAULT_OOM. We can detect this with is_vm_hugetlb_page() if we pass the vma into pagefault_out_of_memory() without adding another VM_FAULT flag. > - filemap.c's VM_FAULT_OOM shoudn't call page_fault_oom_kill because it has already > called oom_killer if it can. See below. > - about relayfs, is VM_FAULT_OOM should be BUG_ON()... That looks appropriate at first glance. > - filemap_xip.c return VM_FAULT_OOM....but it doesn't seem to be OOM.. > just like VM_FAULT_NO_VALID_PAGE_FOUND. (But I'm not familiar with this area.) > - fs/buffer.c 's VM_FAULT_OOM is returned oom-killer is called. > - shmem.c's VM_FAULT_OOM is retuned oom-killer is called. > The filemap, shmem, and block_prepare_write() cases will call the oom killer but, depending on the gfp mask, they will retry their allocations after the oom killer is called so we should never return VM_FAULT_OOM because they return -ENOMEM. They fail from either small objsize slab allocations or with orders less than PAGE_ALLOC_COSTLY_ORDER which by default continues to retry even if direct reclaim fails. If we're returning with VM_FAULT_OOM from these handlers, it should only be because of GFP_NOFS | __GFP_NORETRY or current has been oom killed and still can't find memory (so we don't care if the oom killer is called again since it won't kill anything else). So like I said, I don't really see a need where VM_FAULT_NO_RESOURCE would be helpful in any case other than hugetlb which we can already detect by passing the vma into the pagefault oom handler. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/