Date: Wed, 28 Oct 2009 09:32:26 -0700
From: "Matt Carlson"
To: "Rik van Riel"
Cc: "netdev@vger.kernel.org", "Linux kernel Mailing List", "Matthew Carlson", "Michael Chan", "KVM list"
Subject: Re: TG3, kvm, ipv6 & tso data corruption bug?
Message-ID: <20091028163226.GA8556@xw6200.broadcom.net>
In-Reply-To: <4AE8595F.1080404@redhat.com>
References: <4AE8595F.1080404@redhat.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Oct 28, 2009 at 07:46:55AM -0700, Rik van Riel wrote:
> I have been tracking down what I thought was a KVM related network
> issue for a while, however it appears it could be a hardware issue.
>
> The symptom is that data in network packets gets corrupted, before
> the checksum is calculated. This means the remote host can get
> corrupted data, with no way to calculate it (except application
> level checksums). Luckily ssh has such checksums, so my rsync over
> ssh backup script discovered this issue.
>
> On a very regular basis, I got this message from ssh:
>
> Corrupted MAC on input.
>
> I have played around a bit and narrowed it down to the following:
>
> ipv4 => no problem
> ipv6 w/o tso => no problem
> ipv6 with tso => occasional data corruption
>
> Disabling tso with ethtool -K eth0 tso off makes the problem stop.
>
> I am running Fedora 12's 2.6.31.1-56.fc12.x86_64 kernel, with the
> following hardware:
>
> 05:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5761
> Gigabit Ethernet PCIe (rev 10)
>
> I do not know enough about the network layer to know whether this is
> fixable in software or whether TSO offloading for ipv6 should just
> be disabled on this model.

This problem sounds familiar. There are chip bugs in this area, but as
far as I know, they should have been worked around. Let me see if this
is indeed the same bug resurfacing.