Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753362AbbKBIzW (ORCPT ); Mon, 2 Nov 2015 03:55:22 -0500 Received: from mga03.intel.com ([134.134.136.65]:32324 "EHLO mga03.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751633AbbKBIzT (ORCPT ); Mon, 2 Nov 2015 03:55:19 -0500 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.20,233,1444719600"; d="scan'208";a="809465093" From: "Huang\, Ying" To: Michal Hocko Cc: Mel Gorman , lkp@01.org, LKML , Andrew Morton , Rik van Riel , Vitaly Wool , David Rientjes , Christoph Lameter , Johannes Weiner , Vlastimil Babka , Stephen Rothwell Subject: Re: [lkp] [mm, page_alloc] 43993977ba: +88% OOM possibility References: <87oafjpnb1.fsf@yhuang-dev.intel.com> <20151029142440.GE23598@dhcp22.suse.cz> <87si4szrzf.fsf@yhuang-dev.intel.com> <20151030103830.GJ18429@dhcp22.suse.cz> <87poztfgsa.fsf@yhuang-dev.intel.com> <20151102074542.GC630@dhcp22.suse.cz> Date: Mon, 02 Nov 2015 16:55:15 +0800 In-Reply-To: <20151102074542.GC630@dhcp22.suse.cz> (Michal Hocko's message of "Mon, 2 Nov 2015 08:45:43 +0100") Message-ID: <87twp4232k.fsf@yhuang-dev.intel.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.5 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=ascii Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3726 Lines: 78 Michal Hocko writes: > On Mon 02-11-15 07:20:37, Huang, Ying wrote: >> Michal Hocko writes: >> >> > On Fri 30-10-15 16:21:40, Huang, Ying wrote: >> >> Michal Hocko writes: >> >> >> >> > On Wed 28-10-15 13:36:02, kernel test robot wrote: >> >> >> FYI, we noticed the below changes on >> >> >> >> >> >> https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git master >> >> >> commit 43993977baecd838d66ccabc7f682342fc6ff635 ("mm, page_alloc: >> >> >> distinguish between being unable to sleep, unwilling to sleep and >> >> >> avoiding waking kswapd") >> >> >> >> >> >> We found the OOM possibility increased 88% in a virtual machine with 1G memory. >> >> > >> >> > Could you provide dmesg output from this test? >> >> >> >> Sure, Attached. >> > >> > I can only see a single allocation failure warning: >> > kworker/u4:1: page allocation failure: order:0, mode:0x2204000 >> > >> > This is obviously a non sleeping allocation with ___GFP_KSWAPD_RECLAIM >> > set. ___GFP_HIGH (aka access to memory reserves) is not required so a >> > failure of such an allocation is something to be expected. >> > >> > [ 2294.616369] Workqueue: btrfs-submit btrfs_submit_helper >> > [ 2294.616369] 0000000000000000 ffff88000d38f5e0 ffffffff8173f84c 0000000000000000 >> > [ 2294.616369] ffff88000d38f678 ffffffff811abaee 00000000ffffffff 000000010038f618 >> > [ 2294.616369] ffff8800584e4148 00000000ffffffff ffff8800584e2f00 0000000000000001 >> > [ 2294.616369] Call Trace: >> > [ 2294.616369] [] dump_stack+0x4b/0x63 >> > [ 2294.616369] [] warn_alloc_failed+0x125/0x13d >> > [ 2294.616369] [] __alloc_pages_nodemask+0x7c9/0x915 >> > [ 2294.616369] [] kmem_getpages+0x91/0x155 >> > [ 2294.616369] [] fallback_alloc+0x1cc/0x24c >> > [ 2294.616369] [] ____cache_alloc_node+0x151/0x160 >> > [ 2294.616369] [] __kmalloc+0xb0/0x134 >> > [ 2294.616369] [] ? sched_clock+0x9/0xb >> > [ 2294.616369] [] ? virtqueue_add+0x78/0x37f >> > [ 2294.616369] [] virtqueue_add+0x78/0x37f >> > [ 2294.616369] [] ? __lock_acquire+0x751/0xf55 >> > [ 2294.616369] [] virtqueue_add_sgs+0x76/0x85 >> > >> > The patch you are referring shouldn't make any change in this path >> > because alloc_indirect which I expect is the allocation failing here >> > does: >> > gfp &= ~(__GFP_HIGHMEM | __GFP_HIGH) >> > >> > and that came in via b92b1b89a33c ("virtio: force vring descriptors to >> > be allocated from lowmem"). >> > >> > Are there more failed allocations during the test? The subject would >> > suggest so. >> >> We done 24 tests for the commit and 24 tests for its parent. There is >> no OOM in any test for the parent commit, but there are OOM in 21 tests >> for this commit. This is what I want to say in the subject. Sorry for >> confusing. > > It would be interesting to see all the page allocation failure warnings > (if they are different). Maybe other callers have relied on GFP_ATOMIC > and access to memory reserves. The above path is not this case though. I take a look at all dmesgs, and found the backtrace for page allocation failure is same for all. Is it possible that this commit cause more memory were allocated or kept in memory so that more OOM were triggered? Best Regards, Huang, Ying -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/