Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751755AbbBUTF2 (ORCPT ); Sat, 21 Feb 2015 14:05:28 -0500 Received: from louise.pinerecords.com ([213.168.185.253]:52233 "EHLO louise.pinerecords.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751349AbbBUTF0 (ORCPT ); Sat, 21 Feb 2015 14:05:26 -0500 Date: Sat, 21 Feb 2015 20:05:08 +0100 From: Tomas Szepe To: Eric Dumazet Cc: Florian Westphal , Francois Romieu , Hayes Wang , Eric Dumazet , Tom Herbert , "David S. Miller" , Marco Berizzi , linux-kernel@vger.kernel.org, netdev Subject: Re: 1e918876 breaks r8169 (linux-3.18+) Message-ID: <20150221190507.GA25687@louise.pinerecords.com> References: <20150203100816.GA5807@louise.pinerecords.com> <20150203104214.GG24751@breakpoint.cc> <20150210154536.GB16264@breakpoint.cc> <20150221101512.GB17223@louise.pinerecords.com> <20150221103104.GA26574@breakpoint.cc> <1424535746.5565.42.camel@edumazet-glaptop2.roam.corp.google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1424535746.5565.42.camel@edumazet-glaptop2.roam.corp.google.com> User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1525 Lines: 48 > > David, please consider reverting > > > > 1e918876853aa85435e0f17fd8b4a92dcfff53d6 > > (r8169: add support for Byte Queue Limits) > > > > and > > > > 0bec3b700d106a8b0a34227b2976d1a582f1aab7 > > (r8169: add support for xmit_more) > > > > I cannot reproduce any hangs (tried for 2days with 40 parallel > > netperfs using both 100mbit and 1gbit receiver). > > > > And I don't see anything wrong with the change either. > > Seems like some revisions of the HW are just dodgy? > > > > I hate giving up, but I have no means to diagnose this any further. > > Even reporter says it doesn't affect all of his r8169 nics. > > > > So I think the change is correct per se, but might be revealing some > > HW/firmware bug. > > Hold on. > > I believe there is one race in the way you access skb->xmit_more _after_ > > txd->opts1 = cpu_to_le32(status); > > After this point, TX might have completed and TX completion already have > freed skb > > Could Tomas try following fix ? > > diff --git a/drivers/net/ethernet/realtek/r8169.c b/drivers/net/ethernet/realtek/r8169.c > index ad0020af2193..f2764366a36c 100644 > ... Sure, just did. Unfortunately, 3.19.0 + 0bec3b70 + this patch results in a driver that retains the problem. Sorry, -- Tomas Szepe -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/