Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1423563AbWKFGHH (ORCPT ); Mon, 6 Nov 2006 01:07:07 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1423564AbWKFGHH (ORCPT ); Mon, 6 Nov 2006 01:07:07 -0500 Received: from nz-out-0102.google.com ([64.233.162.202]:5540 "EHLO nz-out-0102.google.com") by vger.kernel.org with ESMTP id S1423563AbWKFGHD (ORCPT ); Mon, 6 Nov 2006 01:07:03 -0500 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:mime-version:content-type:content-transfer-encoding:content-disposition; b=CZU3nJ74cfnHWlkHGe2bM+071avJ1rC09qWkBe7J2FRaQcQ6F8C9qHT7Hxxvq5bmJT5hwJNELU/Bz6AJQirGEi2DRrLvchM9srEqldBYcwoCpWhzgTHrWz4/X4q+SvC0PFb3QFYbRmLxtVXqH8Ig5T97U9wmfgHDaowu7sTwjls= Message-ID: Date: Mon, 6 Nov 2006 14:07:02 +0800 From: "Zhao Xiaoming" To: linux-kernel@vger.kernel.org Subject: ZONE_NORMAL memory exhausted by 4000 TCP sockets MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2545 Lines: 47 Dears, I'm running a linux box with kernel version 2.6.16. The hardware has 2 Woodcrest Xeon CPUs (2 cores each) and 4G RAM. The NIC cards is Intel 82571 on PCI-e bus. The box is acting as ethernet bridge between 2 Gigabit Ethernets. By configuring ebtables and iptables, an application is running as TCP proxy which will intercept all TCP connections requests from the network and setup another TCP connection to the acture server. The TCP proxy then relays all traffics in both directions. The problem is the memory. Since the box must support thousands of concurrent connections, I know the memory size of ZONE_NORMAL would be a bottleneck as TCP packets would need many buffers. After setting upper limit of net.ipv4.tcp_rmem and net.ipv4.tcp_wmem to 32K bytes, our test began. My test scenario employs 2000 concurrent downloading connections to a IIS server's port 80. The throughput is about 500~600 Mbps which is limited by the capability of the client application. Because all traffics are from server to client and the capability of client machine is bottleneck, I believe the receiver side of the sockets connected with server and the sender side of the sockets connected with client should be filled with packets in correspondent windows. Thus, roughly there should be about 32K * 2000+ 32K*2000 = 128M bytes memory occupied by TCP/IP stack for packet buffering. Data from slabtop confermed it. it's about 140M bytes memory cost after I start the traffic. That reasonablly matched with my estimation. However, /proc/meminfo had a different story. The 'LowFree' dropped from about 710M to 80M. In other words, there's addtional 500M memory in ZONE_NORMAL allocated by someone other than the slab. Why? I also made another test that the upper limit of tcp_rmem and tcp_wmem being set to 64K. After 2000 connections transfering a lot of data for several seconds, the linux box showed some error messages such as error allocating memory pages, etc. and became unstable. My questions are: 1. To calculate memory request of TCP sockets, is there any other large amount of memory requested besides send and receive buffer? 2. Is there any logics that emploied by TCP/IP stack that will dynamically allocating memory pages directly instead of from slab? Thanks! Xiaoming. - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/