Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1764508AbYBTPBe (ORCPT ); Wed, 20 Feb 2008 10:01:34 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752764AbYBTPBS (ORCPT ); Wed, 20 Feb 2008 10:01:18 -0500 Received: from host64.cybernetics.com ([70.169.137.4]:2739 "EHLO mail.cybernetics.com" rhost-flags-OK-FAIL-OK-OK) by vger.kernel.org with ESMTP id S1752569AbYBTPBR (ORCPT ); Wed, 20 Feb 2008 10:01:17 -0500 Message-ID: <47BC40BE.6080106@cybernetics.com> Date: Wed, 20 Feb 2008 10:01:18 -0500 From: Tony Battersby User-Agent: Thunderbird 2.0.0.9 (X11/20071031) MIME-Version: 1.0 To: Michael Chan Cc: David Miller , herbert@gondor.apana.org.au, netdev , gregkh@suse.de, linux-kernel@vger.kernel.org Subject: Re: TG3 network data corruption regression 2.6.24/2.6.23.4 References: <47BA0984.2070306@cybernetics.com> <1203381120.13495.78.camel@dell> <20080218.163554.74130592.davem@davemloft.net> <1203383046.13495.87.camel@dell> <47BB00EC.3010607@cybernetics.com> <1203448265.13495.95.camel@dell> <47BB54C2.6090501@cybernetics.com> <1203465163.13495.102.camel@dell> In-Reply-To: <1203465163.13495.102.camel@dell> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1718 Lines: 39 Michael Chan wrote: > On Tue, 2008-02-19 at 17:14 -0500, Tony Battersby wrote: > > >> Update: when I revert Herbert's patch in addition to applying your >> patch, the iSCSI performance goes back up to 115 MB/s again in both >> directions. So it looks like turning off SG for TX didn't itself cause >> the performance drop, but rather that the performance drop is just >> another manifestation of whatever bug is causing the data corruption. >> >> I do not regularly use wireshark or look at network packet dumps, so I >> am not really sure what to look for. Given the above information, do >> you still believe that there is value in examining the packet dump? >> >> > > Can you confirm whether you're getting TCP checksum errors on the other > side that is receiving packets from the 5701? You can just check > statistics using netstat -s. I suspect that after we turn off SG, > checksum is no longer offloaded and we are getting lots of TCP checksum > errors instead that are slowing the performance. > > > Confirmed. With a 100 MB read/write test, netstat -s shows 75 bad segments received, and performance in the one direction is about 5 MB/s. When I switch to the SysKonnect NIC, netstat -s shows 0 bad segments received, and performance is 115 MB/s. So that solves that mystery - there is still data corruption, but the software-computed TCP checksum causes the bad packets to be retransmitted rather than being passed on to the application. Tony -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/