Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752238AbdLTFUs (ORCPT ); Wed, 20 Dec 2017 00:20:48 -0500 Received: from server.atrad.com.au ([150.101.241.2]:45716 "EHLO server.atrad.com.au" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751738AbdLTFUk (ORCPT ); Wed, 20 Dec 2017 00:20:40 -0500 Date: Wed, 20 Dec 2017 15:50:11 +1030 From: Jonathan Woithe To: Michal Kubecek Cc: Holger =?iso-8859-1?Q?Hoffst=E4tte?= , netdev@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: r8169 regression: UDP packets dropped intermittantly Message-ID: <20171220052011.GN8641@marvin.atrad.com.au> References: <20171218054951.GJ17747@marvin.atrad.com.au> <20171218223224.GA13172@marvin.atrad.com.au> <20171219054532.GA13685@marvin.atrad.com.au> <20171219122523.lhavmoxo3ippftyn@unicorn.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20171219122523.lhavmoxo3ippftyn@unicorn.suse.cz> User-Agent: Mutt/1.6.1 (2016-04-27) X-MIMEDefang-action: accept Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1867 Lines: 41 On Tue, Dec 19, 2017 at 01:25:23PM +0100, Michal Kubecek wrote: > On Tue, Dec 19, 2017 at 04:15:32PM +1030, Jonathan Woithe wrote: > > This clearly indicates that not every card using the r8169 driver is > > vulnerable to the problem. It also explains why Holger was unable to > > reproduce the result on his system: the PCIe cards do not appear to suffer > > from the problem. Most likely the PCI RTL-8169 chip is affected, but newer > > PCIe variations do not. However, obviously more testing will be required > > with a wider variety of cards if this inference is to hold up. > > The r8169 driver supports many slightly different variants of the chip. > To identify your variant more precisely, look for a line like > > r8169 0000:02:00.0 eth0: RTL8168evl/8111evl at 0xffffc90003135000, d4:3d:7e:2a:30:08, XID 0c900800 IRQ 38 > > in kernel log. The PCIe card (the one which works correctly with the current driver) shows this: r8169 0000:02:00.0 eth0: RTL8168e/8111e at 0xf862e000, 80:1f:02:45:25:a4, XID 0c200000 IRQ 30 r8169 0000:02:00.0 eth0: jumbo features [frames: 9200 bytes, tx checksumming: ko] The PCI card (Netgear GA311) which is affected by the problem shows this: r8169 0000:05:01.0 eth1: RTL8110s at 0xf8706800, e0:91:f5:1b:5f:c6, XID 04000000 IRQ 22 r8169 0000:05:01.0 eth1: jumbo features [frames: 7152 bytes, tx checksumming: ok] The system which has shown the regressed behaviour is running a 32-bit kernel; for various reasons we can't move to a 64-bit kernel at present. However, I was able to boot this system using Slackware 14.2 install discs, and therefore test using both 32-bit and 64-bit 4.4.14 kernels. In both cases the fault was observed within 30 minutes of starting the tests when the GA311 card was in use. The fault is therefore not specific to 32-bit environments. Regards jonathan