Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754470AbYK0KIS (ORCPT ); Thu, 27 Nov 2008 05:08:18 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752294AbYK0KIE (ORCPT ); Thu, 27 Nov 2008 05:08:04 -0500 Received: from ns.hanzlici.cz ([212.158.159.56]:59742 "EHLO hanzlici.cz" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752606AbYK0KIA (ORCPT ); Thu, 27 Nov 2008 05:08:00 -0500 Message-ID: <492E7142.3030002@hanzlici.cz> Date: Thu, 27 Nov 2008 11:06:58 +0100 From: Frantisek Hanzlik User-Agent: Mozilla/5.0 (X11; U; Linux i686; rv:1.9.1b2pre) Gecko/20081125 Lightning/1.0pre SeaMonkey/2.0a2pre MIME-Version: 1.0 To: Willy Tarreau CC: Matt Carlson , Roger Heflin , Peter Zijlstra , LKML , netdev Subject: Re: WARNING: at net/sched/sch_generic.c:219 dev_watchdog+0xfe/0x17e() with tg3 network References: <20081120184310.GB27712@xw6200.broadcom.net> <20081120212637.GB23844@1wt.eu> <20081120215318.GB27907@xw6200.broadcom.net> <20081124132744.GB24851@1wt.eu> <20081124215247.GA29696@1wt.eu> <20081125015223.GA9151@xw6200.broadcom.net> <20081125053128.GA32426@1wt.eu> <20081125175413.GA9808@xw6200.broadcom.net> <20081126211220.GA22374@1wt.eu> <20081126225421.GA8906@xw6200.broadcom.net> <20081127051659.GA26057@1wt.eu> In-Reply-To: <20081127051659.GA26057@1wt.eu> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 11499 Lines: 183 Willy Tarreau wrote: > On Wed, Nov 26, 2008 at 02:54:21PM -0800, Matt Carlson wrote: > (...) >>> I've run a new test on a switch I have here at home (another el-cheapo, >>> non-manageable 100 Mbps, netgear this time). Unfortunately I cannot >>> reproduce the problem at all. I have disabled FC on my laptop, it did >>> not have any effect. >> Disabling FC should have a positive effect, not a negative one. It >> might be the case that the switch does not advertise nor support FC. If >> that is true, you might not be able to repro the problem no matter what >> you did (if your problem is what I think it is). Can you check your >> link messages and see if it really is negotiated to off? (I see the >> message above, but I don't think that is with the current switch.) > > yes the switch does advertuse FC : > > willy@wtap:~$ dmesg|grep eth0 > eth0: Tigon3 [partno(BMC5705mA3) rev 3003 PHY(5705)] (PCI:33MHz:32-bit) 10/100/1000Base-T Ethernet 00:0d:9d:91:ef:24 > eth0: RXcsums[1] LinkChgREG[0] MIirq[0] ASF[0] WireSpeed[0] TSOcap[1] > eth0: dma_rwctrl[763f0000] dma_mask[64-bit] > tg3: eth0: Link is up at 100 Mbps, full duplex. > tg3: eth0: Flow control is on for TX and on for RX. > >>> I have disabled auto-neg and manually forced the >>> speed to 100/Full on my laptop, and could not reproduce the problem >>> either (though the speed was much lower due to the switch obviously >>> negociating 100/Half when not seeing my NWay frames). >> Yes. If you force the link, both sides must be forced. The switch >> rightly assumes HD when bringing the link up. > > I know ;-) but not seeing the problem, I started to suspect that the other > switch was a little bit ill and tried to reproduce some problems I might > incidently have been encountering on it. > > Regards, > Willy > I got same issue two days ago on PCI-X Fiber 1000BASE-SX D-Link Adapter DGE-550SX/dl2k driver (I report it yesterday). This card work fine in old DEC Alphaserver 800/Fedora Core 5/kernel 2.6.17. We use it for our internet connections, which is limited to approx. 50 Mb/sec by ISP, then I think there should be no bottleneck in LAN card. We just tried install new i386 based router (Core2Duo E8500/4GB RAM/ 4x Realtek 8111C and this D-Link from old Alphaserver, on Fedora 10, kernel 2.6.27.5). Router is connected to several gigabit backbones over other ois gigabit cards, then I'm not sure when any crapped internal machines could overload this DGE-550SX. Card in new router stop working after several secs/minutes - packed transmitting freeze (what is interesting, in one case after it sent exactly 8192 packets, and other cases are some multiples of 8 too) and stop transmit packets. Only workaround (but for a while) is rmmod + modprobe its dl2k driver. /var/log/messages contains following messages: ... Nov 25 19:04:52 ns kernel: Sundance Technology IPG Triple-Speed Ethernet 0000:09:00.0: PCI INT A -> GSI 20 (level, low) -> IRQ 20 Nov 25 19:04:52 ns kernel: 0000:09:00.0: D-Link NIC ... Nov 25 19:05:12 ns kernel: ------------[ cut here ]------------ Nov 25 19:05:12 ns kernel: WARNING: at net/sched/sch_generic.c:219 dev_watchdog+0xda/0x12d() Nov 25 19:05:12 ns kernel: NETDEV WATCHDOG: eth1 (Sundance Technology IPG Triple-Speed Ethernet): transmit timed out Nov 25 19:05:12 ns kernel: Modules linked in: hwmon_vid hwmon nf_nat_ftp nf_conntrack_ftp xt_comment iptable_nat nf_nat cpufreq_ondemand acpi_cpufreq dm_multipath uinput snd_hda_intel snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device dl2k snd_pcm_oss snd_mixer_oss snd_pcm ipg snd_timer snd_page_alloc snd_hwdep snd i2c_i801 firewire_ohci i2c_core soundcore pcspkr r8169 mii firewire_core crc_itu_t raid456 async_xor async_memcpy async_tx xor raid1 [last unloaded: microcode] Nov 25 19:05:12 ns kernel: Pid: 0, comm: swapper Not tainted 2.6.27.5-117.fc10.i686 #1 Nov 25 19:05:12 ns kernel: [] warn_slowpath+0x4b/0x6c Nov 25 19:05:12 ns kernel: [] ? mempool_resize+0x15c/0x183 Nov 25 19:05:12 ns kernel: [] ? __slab_free+0x63/0x26e Nov 25 19:05:12 ns kernel: [] ? kmem_cache_free+0x71/0xa7 Nov 25 19:05:12 ns kernel: [] ? __slab_free+0x63/0x26e Nov 25 19:05:12 ns kernel: [] ? blk_remove_plug+0x66/0x92 Nov 25 19:05:12 ns kernel: [] ? elv_queue_empty+0x20/0x22 Nov 25 19:05:12 ns kernel: [] ? blk_run_queue+0x28/0x2c Nov 25 19:05:12 ns kernel: [] ? scsi_run_queue+0x250/0x27c Nov 25 19:05:12 ns kernel: [] ? kobject_put+0x37/0x3c Nov 25 19:05:12 ns kernel: [] ? strlcpy+0x17/0x49 Nov 25 19:05:12 ns kernel: [] dev_watchdog+0xda/0x12d Nov 25 19:05:12 ns kernel: [] ? scsi_device_unbusy+0x6b/0x70 Nov 25 19:05:12 ns kernel: [] run_timer_softirq+0x14b/0x1bb Nov 25 19:05:12 ns kernel: [] ? dev_watchdog+0x0/0x12d Nov 25 19:05:12 ns kernel: [] ? dev_watchdog+0x0/0x12d Nov 25 19:05:12 ns kernel: [] __do_softirq+0x84/0x109 Nov 25 19:05:12 ns kernel: [] ? __do_softirq+0x0/0x109 Nov 25 19:05:12 ns kernel: [] do_softirq+0x77/0xdb Nov 25 19:05:12 ns kernel: [] ? handle_fasteoi_irq+0x0/0xc0 Nov 25 19:05:12 ns kernel: [] irq_exit+0x44/0x83 Nov 25 19:05:12 ns kernel: [] do_IRQ+0xe7/0xfe Nov 25 19:05:12 ns kernel: [] common_interrupt+0x28/0x30 Nov 25 19:05:12 ns kernel: [] ? acpi_idle_enter_simple+0x162/0x19d Nov 25 19:05:12 ns kernel: [] cpuidle_idle_call+0x67/0x97 Nov 25 19:05:12 ns kernel: [] cpu_idle+0x101/0x134 Nov 25 19:05:12 ns kernel: [] start_secondary+0x197/0x19f Nov 25 19:05:12 ns kernel: ======================= Nov 25 19:05:12 ns kernel: ---[ end trace fd7fa9607e312047 ]--- ===== Nov 25 19:29:33 ns kernel: ------------[ cut here ]------------ Nov 25 19:29:33 ns kernel: WARNING: at net/sched/sch_generic.c:219 dev_watchdog+0xda/0x12d() Nov 25 19:29:33 ns kernel: NETDEV WATCHDOG: eth1 (Sundance Technology IPG Triple-Speed Ethernet): transmit timed out Nov 25 19:29:33 ns kernel: Modules linked in: hwmon_vid hwmon nf_nat_ftp nf_conntrack_ftp xt_comment iptable_nat nf_nat cpufreq_ondemand acpi_cpufreq dm_multipath uinput dl2k i2c_i801 pcspkr r8169 ipg mii i2c_core firewire_ohci firewire_core crc_itu_t raid456 async_xor async_memcpy async_tx xor raid1 [last unloaded: microcode] Nov 25 19:29:33 ns kernel: Pid: 9941, comm: mail Not tainted 2.6.27.5-117.fc10.i686 #1 Nov 25 19:29:33 ns kernel: [] warn_slowpath+0x4b/0x6c Nov 25 19:29:33 ns kernel: [] ? mempool_resize+0x15c/0x183 Nov 25 19:29:33 ns kernel: [] ? bio_free+0x40/0x44 Nov 25 19:29:33 ns kernel: [] ? bio_fs_destructor+0xe/0x11 Nov 25 19:29:33 ns kernel: [] ? bio_put+0x26/0x28 Nov 25 19:29:33 ns kernel: [] ? virt_to_head_page+0x22/0x2e Nov 25 19:29:33 ns kernel: [] ? queue_flag_clear+0x18/0x54 Nov 25 19:29:33 ns kernel: [] ? __freed_request+0x6b/0x72 Nov 25 19:29:33 ns kernel: [] ? __enqueue_entity+0xe3/0xeb Nov 25 19:29:33 ns kernel: [] ? enqueue_entity+0x203/0x20b Nov 25 19:29:33 ns kernel: [] ? strlcpy+0x17/0x49 Nov 25 19:29:33 ns kernel: [] dev_watchdog+0xda/0x12d Nov 25 19:29:33 ns kernel: [] ? do_IRQ+0xe7/0xfe Nov 25 19:29:33 ns kernel: [] run_timer_softirq+0x14b/0x1bb Nov 25 19:29:33 ns kernel: [] ? dev_watchdog+0x0/0x12d Nov 25 19:29:33 ns kernel: [] ? dev_watchdog+0x0/0x12d Nov 25 19:29:33 ns kernel: [] __do_softirq+0x84/0x109 Nov 25 19:29:33 ns kernel: [] ? __do_softirq+0x0/0x109 Nov 25 19:29:33 ns kernel: [] do_softirq+0x77/0xdb Nov 25 19:29:33 ns kernel: [] irq_exit+0x44/0x83 Nov 25 19:29:33 ns kernel: [] smp_apic_timer_interrupt+0x6e/0x7c Nov 25 19:29:33 ns kernel: [] apic_timer_interrupt+0x2d/0x34 Nov 25 19:29:33 ns kernel: [] ? speedstep_target+0x23/0x7e Nov 25 19:29:33 ns kernel: [] ? native_flush_tlb_single+0x6/0x8 Nov 25 19:29:33 ns kernel: [] kunmap_atomic+0x67/0xa7 Nov 25 19:29:33 ns kernel: [] follow_page+0x1c5/0x23b Nov 25 19:29:33 ns kernel: [] get_user_pages+0x289/0x2fe Nov 25 19:29:33 ns kernel: [] get_arg_page+0x2d/0x80 Nov 25 19:29:33 ns kernel: [] ? strnlen_user+0x2f/0x4d Nov 25 19:29:33 ns kernel: [] copy_strings+0xc3/0x160 Nov 25 19:29:33 ns kernel: [] do_execve+0x14e/0x215 Nov 25 19:29:33 ns kernel: [] sys_execve+0x29/0x50 Nov 25 19:29:33 ns kernel: [] syscall_call+0x7/0xb Nov 25 19:29:33 ns kernel: [] ? init_intel_cacheinfo+0x0/0x421 Nov 25 19:29:33 ns kernel: ======================= Nov 25 19:29:33 ns kernel: ---[ end trace 45c41aa8555c94fa ]--- ===== Nov 25 19:41:00 ns kernel: ------------[ cut here ]------------ Nov 25 19:41:00 ns kernel: WARNING: at net/sched/sch_generic.c:219 dev_watchdog+0xda/0x12d() Nov 25 19:41:00 ns kernel: NETDEV WATCHDOG: eth1 (Sundance Technology IPG Triple-Speed Ethernet): transmit timed out Nov 25 19:41:00 ns kernel: Modules linked in: hwmon_vid hwmon nf_nat_ftp nf_conntrack_ftp xt_comment iptable_nat nf_nat cpufreq_ondemand acpi_cpufreq dm_multipath uinput dl2k ipg firewire_ohci firewire_core crc_itu_t pcspkr r8169 mii i2c_i801 i2c_core raid456 async_xor async_memcpy async_tx xor raid1 [last unloaded: microcode] Nov 25 19:41:00 ns kernel: Pid: 0, comm: swapper Not tainted 2.6.27.5-117.fc10.i686 #1 Nov 25 19:41:00 ns kernel: [] warn_slowpath+0x4b/0x6c Nov 25 19:41:00 ns kernel: [] ? __enqueue_entity+0xa3/0xeb Nov 25 19:41:00 ns kernel: [] ? enqueue_entity+0x203/0x20b Nov 25 19:41:00 ns kernel: [] ? enqueue_task_fair+0x3b/0x3f Nov 25 19:41:00 ns kernel: [] ? resched_task+0x3a/0x6e Nov 25 19:41:00 ns kernel: [] ? _spin_unlock_irqrestore+0x22/0x38 Nov 25 19:41:00 ns kernel: [] ? try_to_wake_up+0x221/0x22b Nov 25 19:41:00 ns kernel: [] ? _spin_lock_irqsave+0x29/0x30 Nov 25 19:41:00 ns kernel: [] ? strlcpy+0x17/0x49 Nov 25 19:41:00 ns kernel: [] dev_watchdog+0xda/0x12d Nov 25 19:41:00 ns kernel: [] ? __queue_work+0x26/0x2b Nov 25 19:41:00 ns kernel: [] run_timer_softirq+0x14b/0x1bb Nov 25 19:41:00 ns kernel: [] ? dev_watchdog+0x0/0x12d Nov 25 19:41:00 ns kernel: [] ? dev_watchdog+0x0/0x12d Nov 25 19:41:00 ns kernel: [] __do_softirq+0x84/0x109 Nov 25 19:41:00 ns kernel: [] ? __do_softirq+0x0/0x109 Nov 25 19:41:00 ns kernel: [] do_softirq+0x77/0xdb Nov 25 19:41:00 ns kernel: [] irq_exit+0x44/0x83 Nov 25 19:41:00 ns kernel: [] smp_apic_timer_interrupt+0x6e/0x7c Nov 25 19:41:00 ns kernel: [] apic_timer_interrupt+0x2d/0x34 Nov 25 19:41:00 ns kernel: [] ? acpi_idle_enter_bm+0x277/0x2c6 Nov 25 19:41:00 ns kernel: [] cpuidle_idle_call+0x67/0x97 Nov 25 19:41:00 ns kernel: [] cpu_idle+0x101/0x134 Nov 25 19:41:00 ns kernel: [] rest_init+0x4e/0x50 Nov 25 19:41:00 ns kernel: ======================= Nov 25 19:41:00 ns kernel: ---[ end trace 9ce93cc9159b5214 ]--- It's there any thing for help with it? Thx, Franta Hanzlik -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/