2015-08-21 06:51:24

by Gerhard Wiesinger

[permalink] [raw]
Subject: Kernel 4.1.5 not stable - crashes

Hello,

I'm having big problems with Fedora FC22 kernel 4.1.5 (happened with all
tried kernels 4.1.x from FC22) which is not stable at all. At the
nightly backup jobs (database dumps, rsync via ssh, etc.) maschine
crashes reproduceable at every night with the stack trace below. Message
repeats on different CPUs in around 1~10s with same message.

Kernel 4.0.8 from Fedora FC22 works well with long uptimes, also
previous kernel versions are highly stable. Kernel 4.1.4/4.1.5 had a lot
of RAID fixes so I tried it again but it didn't help. So something
critical must be different from 4.0.8 to 4.1.2 and later.

I'm running 2 RAID5 volumes with each LVM and cryptsetup above. After
the crash RAID does a resync.

Machine:
- Mainboard: ASUS - M3N-H HDMI with latest BIOS
- CPU: AMD Phenom II X4 940 Black Edition, 4x 3.00GHz, boxed (HDZ940XCGIBOX)
- NIC: HP Broadcom Netxtreme Gigabit PCIe Netzwerkkarte 482914-001 (BCM5761)

If you need further information please let me know.

Any ideas?

Thank you.

Ciao,
Gerhard

[63525.726812] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s!
[ping:18283]
[63525.734015] Modules linked in: tun ebtable_filter ebtables bridge stp
llc cfg80211 rfkill ipt_MASQUERADE nf_nat_masquerade_ipv4 ip6t_REJECT
nf_reject_ipv6 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_i
pv4 nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat xt_CHECKSUM xt_conntrack
nf_conntrack iptable_mangle iptable_security ip6table_filter ip6_tables
iptable_raw hwmon_vid snd_hda_codec_hdmi lnbp21 stb6100 stb0899 snd_hd
a_codec_realtek snd_hda_codec_generic kvm_amd kvm snd_hda_intel
snd_hda_controller snd_hda_codec snd_hda_core edac_core edac_mce_amd
mantis snd_hwdep mantis_core snd_seq k10temp snd_seq_device dvb_core
snd_pcm s
nd_timer snd soundcore shpchp i2c_nforce2 asus_atk0110 acpi_cpufreq nfsd
auth_rpcgss nfs_acl lockd grace sunrpc binfmt_misc dm_crypt raid1
ata_generic raid456 async_raid6_recov async_memcpy async_pq async_xor xo
r async_tx pata_acpi raid6_pq nouveau i2c_algo_bit drm_kms_helper ttm
mxm_wmi drm tg3 serio_raw ptp pps_core firewire_ohci forcedeth
firewire_core crc_itu_t pata_amd video wmi uas usb_storage
[63525.825481] CPU: 1 PID: 18283 Comm: ping Tainted: G D W L
4.1.5-200.fc22.x86_64 #1
[63525.833809] Hardware name: System manufacturer System Product
Name/M3N-H/HDMI, BIOS ASUS M3N-H/HDMI ACPI BIOS Revision 2603 06/11/2010
[63525.845863] task: ffff88019de5c520 ti: ffff880117f50000 task.ti:
ffff880117f50000
[63525.853325] RIP: 0010:[<ffffffff81121cc2>] [<ffffffff81121cc2>]
smp_call_function_many+0x222/0x280
[63525.862366] RSP: 0018:ffff880117f53c58 EFLAGS: 00000202
[63525.867663] RAX: 0000000000000003 RBX: 0000000000000293 RCX:
0000000000000000
[63525.874781] RDX: ffff88023fc1b8c8 RSI: 0000000000000008 RDI:
ffff880237406bb0
[63525.881897] RBP: ffff880117f53c98 R08: 0000000000000000 R09:
000000000000000d
[63525.889015] R10: ffffffff813ad019 R11: ffffffff813acfa4 R12:
ffff880117f53c28
[63525.896131] R13: ffff880117f53bc8 R14: ffffffff813acfa4 R15:
00000000000082d2
[63525.903249] FS: 00007f4227e48700(0000) GS:ffff88023fc40000(0000)
knlGS:0000000000000000
[63525.911319] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[63525.917051] CR2: 00007fb542e34000 CR3: 0000000014833000 CR4:
00000000000006e0
[63525.924166] Stack:
[63525.926176] 0000000000000001 0100000000000001 0000000000000002
0000000000000000
[63525.933620] ffffffff81069d90 0000000000000000 ffff880117f53db0
0000000000000001
[63525.941067] ffff880117f53cc8 ffffffff81121d81 ffffc90001130000
0000000000000000
[63525.948513] Call Trace:
[63525.950956] [<ffffffff81069d90>] ? unmap_pte_range+0xe0/0xe0
[63525.956688] [<ffffffff81121d81>] on_each_cpu+0x31/0x60
[63525.961901] [<ffffffff8106bcd1>] change_page_attr_set_clr+0x421/0x530
[63525.968412] [<ffffffff8106c8bf>] set_memory_ro+0x2f/0x40
[63525.973797] [<ffffffff81191e99>] bpf_prog_select_runtime+0x29/0x40
[63525.980047] [<ffffffff81699130>] bpf_prepare_filter+0x160/0x180
[63525.986038] [<ffffffff81699462>] sk_attach_filter+0xe2/0x190
[63525.991772] [<ffffffff810dee91>] ? pick_next_task_fair+0x7e1/0x980
[63525.998022] [<ffffffff8166b005>] sock_setsockopt+0x3f5/0x9a0
[63526.003755] [<ffffffff81665966>] SyS_setsockopt+0xd6/0xf0
[63526.009225] [<ffffffff810250d7>] ? syscall_trace_leave+0xc7/0x140
[63526.015391] [<ffffffff817a1e6e>] system_call_fastpath+0x12/0x71
[63526.021382] Code: 05 78 a2 c0 00 89 c1 0f 8d 73 fe ff ff 48 98 49 8b
16 48 03 14 c5 a0 77 d2 81 8b 42 18 a8 01 74 c8 0f 1f 84 00 00 00 00 00
f3 90 <8b> 42 18 a8 01 75 f7 eb b5 0f b6 4d c8 4c 89 ea 4c 89 e6 44
89