Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756410Ab3HaTow (ORCPT ); Sat, 31 Aug 2013 15:44:52 -0400 Received: from rydia.net ([69.46.88.68]:34073 "EHLO mail.rydia.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753727Ab3HaTou (ORCPT ); Sat, 31 Aug 2013 15:44:50 -0400 X-Greylist: delayed 408 seconds by postgrey-1.27 at vger.kernel.org; Sat, 31 Aug 2013 15:44:50 EDT Date: Sat, 31 Aug 2013 12:38:01 -0700 (PDT) From: dormando X-X-Sender: dormando@dtop To: Eric Wong cc: netdev@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: ipv4: warnings on sk_wmem_queued In-Reply-To: <20130830195638.GA27393@dcvr.yhbt.net> Message-ID: References: <20130830195638.GA27393@dcvr.yhbt.net> User-Agent: Alpine 2.02 (DEB 1266 2009-07-14) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2322 Lines: 47 > I noticed these warnings on stock 3.10.9 running stress tests on > cmogstored.git (git://bogomips.org/cmogstored.git) doing standard > HTTP server stuff between lo and tmpfs: > [...] > I was going to reboot into 3.10.10 before I looked at dmesg. These > warnings happened after ~8 hours of stress tests, and those stress tests > are still running. I had a kernel panic this morning on a production machine, also running 3.10.9. I only got a small part of the end of the trace, but it matches: > Aug 30 06:03:54 localhost kernel: [] ip_queue_xmit+0x153/0x3c0 > Aug 30 06:03:54 localhost kernel: [] tcp_transmit_skb+0x3c5/0x820 > Aug 30 06:03:54 localhost kernel: [] tcp_write_xmit+0x191/0xaa0 > Aug 30 06:03:54 localhost kernel: [] ? __kmalloc_reserve.isra.49+0x3c/0xa0 > Aug 30 06:03:54 localhost kernel: [] __tcp_push_pending_frames+0x32/0xa0 > Aug 30 06:03:54 localhost kernel: [] tcp_send_fin+0x6f/0x190 > Aug 30 06:03:54 localhost kernel: [] tcp_close+0x378/0x410 > Aug 30 06:03:54 localhost kernel: [] inet_release+0x5a/0xa0 > Aug 30 06:03:54 localhost kernel: [] sock_release+0x28/0x90 > Aug 30 06:03:54 localhost kernel: [] sock_close+0x12/0x20 > Aug 30 06:03:54 localhost kernel: [] __fput+0xaf/0x240 > Aug 30 06:03:54 localhost kernel: [] ____fput+0xe/0x10 > Aug 30 06:03:54 localhost kernel: [] task_work_run+0xa7/0xe0 > Aug 30 06:03:54 localhost kernel: [] do_notify_resume+0x9c/0xb0 > Aug 30 06:03:54 localhost kernel: [] int_signal+0x12/0x17 ... from there to here... Then: RIP [ kmem_cache_alloc+0x5a/0x130 RSP ---[ end trace 6ab931f3db28b31e ]--- Kernel panic - not syncing: Fatal exception in interrupt Machine was running for a few days before panic'ing. I don't see anything in 3.10.10 that would've affected this. Thanks! (also: hi Eric!) -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/