Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756913Ab2KHUfx (ORCPT ); Thu, 8 Nov 2012 15:35:53 -0500 Received: from mga14.intel.com ([143.182.124.37]:31674 "EHLO mga14.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1756624Ab2KHUfw convert rfc822-to-8bit (ORCPT ); Thu, 8 Nov 2012 15:35:52 -0500 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.80,739,1344236400"; d="scan'208";a="215202499" From: "Dave, Tushar N" To: Joe Jin , "e1000-devel@lists.sf.net" CC: "netdev@vger.kernel.org" , "linux-kernel@vger.kernel.org" , Mary Mcgrath Subject: RE: 82571EB: Detected Hardware Unit Hang Thread-Topic: 82571EB: Detected Hardware Unit Hang Thread-Index: AQHNvXnv9zWmKMRSTUSeVhWmoXs8rZfgY6Yg Date: Thu, 8 Nov 2012 20:35:48 +0000 Message-ID: <061C8A8601E8EE4CA8D8FD6990CEA89133487884@ORSMSX102.amr.corp.intel.com> References: <509B5038.8090304@oracle.com> In-Reply-To: <509B5038.8090304@oracle.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.22.254.139] Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 8BIT MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2478 Lines: 64 >-----Original Message----- >From: netdev-owner@vger.kernel.org [mailto:netdev-owner@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Wednesday, November 07, 2012 10:25 PM >To: e1000-devel@lists.sf.net >Cc: netdev@vger.kernel.org; linux-kernel@vger.kernel.org; Mary Mcgrath >Subject: 82571EB: Detected Hardware Unit Hang > >Hi list, > >IHAC reported "82571EB Detected Hardware Unit Hang" on HP ProLiant DL360 >G6, and have to reboot the server to recover: > >e1000e 0000:06:00.1: eth3: Detected Hardware Unit Hang: > TDH <1a> > TDT <1a> > next_to_use <1a> > next_to_clean <18> >buffer_info[next_to_clean]: > time_stamp <10047a74e> > next_to_watch <18> > jiffies <10047a88c> > next_to_watch.status <1> >MAC Status <80383> >PHY Status <792d> >PHY 1000BASE-T Status <3800> >PHY Extended Status <3000> >PCI Status <10> > >With newer kernel 2.0.0.1 the issue still reproducible. > >Device info: >06:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet >Controller (Copper) (rev 06) >06:00.1 0200: 8086:10bc (rev 06) > >I compared lspci output before and after the issue, different as below: > 06:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet >Controller (Copper) (rev 06) > Subsystem: Hewlett-Packard Company NC364T PCI Express Quad Port >Gigabit Server Adapter > Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ >Stepping- SERR- FastB2B- DisINTx- >- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- >SERR- + Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- >+SERR- I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when > doing scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, > just copy a big file (>500M) from another server will hit it at once. All devices in path from root complex to 82571, should have *same* max payload size otherwise it can cause hang. Can you double check this? -Tushar -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/