2015-02-27 14:54:03

by Justin Piszcz

[permalink] [raw]
Subject: 3.19 kernel: BUG: unable to handle kernel NULL pointer dereference

Hello,

With kernel 3.15, I do not recall having any issues, with 3.19, I am getting
a kernel crash when I copy files over NFS from machine A to B.
Is this a known issue?

I suspect it has to do something with this:
Feb 27 09:31:20 remote-host [ 15.745342] dmar: DRHD: handling fault status
reg 2
Feb 27 09:31:20 remote-host [ 15.745361] dmar: DMAR:[DMA Read] Request
device [04:00.0] fault addr 0 #012[ 15.745361] DMAR:[fault reason 06] PTE
Read access is not set

Logs from netconsole during kernel crash:
Feb 27 09:31:19 atom kernel: [ 2296.410022] ixgbe 0000:01:00.0 eth4: NIC
Link is Up 10 Gbps, Flow Control: RX/TX
Feb 27 09:31:19 remote-host [ 15.138220] ixgbe 0000:86:00.0 eth2: NIC Link
is Up 10 Gbps, Flow Control: RX/TX
Feb 27 09:31:19 remote-host [ 15.138370] IPv6: ADDRCONF(NETDEV_CHANGE):
eth2: link becomes ready
Feb 27 09:31:20 remote-host [ 15.565505] systemd-logind[3863]: Failed to
start user service: Unknown unit: [email protected]
Feb 27 09:31:20 remote-host [ 15.569093] systemd-logind[3863]: New session
c2 of user user.
Feb 27 09:31:20 remote-host [ 15.745342] dmar: DRHD: handling fault status
reg 2
Feb 27 09:31:20 remote-host [ 15.745361] dmar: DMAR:[DMA Read] Request
device [04:00.0] fault addr 0 #012[ 15.745361] DMAR:[fault reason 06] PTE
Read access is not set
Feb 27 09:31:23 remote-host [ 18.719490] xfsettingsd[4040]: segfault at 20
ip 00007f23665916b7 sp 00007fff80d9d4b0 error 4
Feb 27 09:31:23 remote-host in
libxfce4kbd-private-2.so.0.0.0[7f236658c000+9000]
Feb 27 09:31:23 remote-host
Feb 27 09:31:24 remote-host [ 20.241049] systemd-logind[3863]: New session
c3 of user user.
Feb 27 09:31:57 remote-host [ 52.361903] systemd-logind[3863]: New session
c4 of user user.
Feb 27 09:32:10 remote-host [ 65.621540] NFSD: Using
/var/lib/nfs/v4recovery as the NFSv4 state recovery directory
Feb 27 09:32:10 remote-host [ 65.621827] NFSD: starting 90-second grace
period (net ffffffff81b82000)
Feb 27 09:36:01 atom kernel: [ 2578.355223] device eth0 left promiscuous
mode
Feb 27 09:43:29 remote-host [ 745.070043] systemd-logind[3863]: New session
c5 of user user.
Feb 27 09:44:18 remote-host [ 794.622189] systemd-logind[3863]: Removed
session c5.
Feb 27 09:44:21 remote-host [ 797.309290] systemd-logind[3863]: New session
c6 of user user.
Feb 27 09:45:21 remote-host [ 857.360193] systemd-logind[3863]: New session
c7 of user user.
Feb 27 09:45:32 remote-host [ 868.472759] systemd-logind[3863]: Removed
session c7.
Feb 27 09:46:09 atom rsyslogd: -- MARK --
Feb 27 09:49:17 remote-host [ 1093.754960] BUG: unable to handle kernel
Feb 27 09:49:17 NULL pointer dereference
Feb 27 09:49:17 remote-host at 0000000000000010
Feb 27 09:49:17 remote-host [ 1093.758337] IP:
Feb 27 09:49:17 remote-host [<ffffffff814b7160>] intel_unmap_sg+0x0/0x10
Feb 27 09:49:17 remote-host [ 1093.760661] PGD 0
Feb 27 09:49:17 remote-host
Feb 27 09:49:17 remote-host [ 1093.761481] Oops: 0000 [#1]
Feb 27 09:49:17 SMP
Feb 27 09:49:17 remote-host
Feb 27 09:49:17 remote-host [ 1093.762837] CPU: 6 PID: 0 Comm: swapper/6 Not
tainted 3.19.0 #2
Feb 27 09:49:17 remote-host [ 1093.765189] Hardware name: Supermicro
X8DTH-i/6/iF/6F/X8DTH, BIOS 2.1b 05/04/12
Feb 27 09:49:17 remote-host [ 1093.768377] task: ffff880624600010 ti:
ffff880624630000 task.ti: ffff880624630000
Feb 27 09:49:17 remote-host [ 1093.771354] RIP: 0010:[<ffffffff814b7160>]
Feb 27 09:49:17 remote-host [<ffffffff814b7160>] intel_unmap_sg+0x0/0x10
Feb 27 09:49:17 remote-host [ 1093.774615] RSP: 0018:ffff880c3fc03ea0
EFLAGS: 00010046
Feb 27 09:49:17 remote-host [ 1093.776717] RAX: ffff880623d2b098 RBX:
ffff880c231706d8 RCX: 0000000000000001
Feb 27 09:49:17 remote-host [ 1093.779553] RDX: 0000000000000005 RSI:
0000000000000000 RDI: ffff880623d2b098
Feb 27 09:49:17 remote-host [ 1093.782390] RBP: 0000000000000008 R08:
0000000000000000 R09: ffffffff814b7160
Feb 27 09:49:17 remote-host [ 1093.785227] R10: 0000000000000000 R11:
0000000000000045 R12: ffff880c23174c30
Feb 27 09:49:17 remote-host [ 1093.788063] R13: 000000006ef5de4d R14:
00000000000000e0 R15: 00000000000006e2
Feb 27 09:49:17 remote-host [ 1093.790900] FS: 0000000000000000(0000)
GS:ffff880c3fc00000(0000) knlGS:0000000000000000
Feb 27 09:49:17 remote-host [ 1093.794122] CS: 0010 DS: 0000 ES: 0000 CR0:
000000008005003b
Feb 27 09:49:17 remote-host [ 1093.796399] CR2: 0000000000000010 CR3:
0000000001ad2000 CR4: 00000000000007e0
Feb 27 09:49:17 remote-host [ 1093.799235] Stack:
Feb 27 09:49:17 remote-host [ 1093.800009] ffffffff815e5005
Feb 27 09:49:17 remote-host ffffffff81b2f888
Feb 27 09:49:17 remote-host ffff880c00000040
Feb 27 09:49:17 remote-host ffff880c3fc03eb8
Feb 27 09:49:17 remote-host
Feb 27 09:49:17 remote-host [ 1093.803103] ffff880c3fc03eb8
Feb 27 09:49:17 remote-host ffff880c3fc03ec8
Feb 27 09:49:17 remote-host ffff880624372d40
Feb 27 09:49:17 remote-host ffff880624633e18
Feb 27 09:49:17 remote-host
Feb 27 09:49:17 remote-host [ 1093.806196] 0000000000000000
Feb 27 09:49:17 remote-host 0000000000000000
Feb 27 09:49:17 remote-host 0000000000000040
Feb 27 09:49:17 remote-host ffff880c23e52e00
Feb 27 09:49:17 remote-host
Feb 27 09:49:17 remote-host [ 1093.809290] Call Trace:
Feb 27 09:49:17 remote-host [ 1093.810240] <IRQ>
Feb 27 09:49:17 remote-host
Feb 27 09:49:17 remote-host [ 1093.811048]
Feb 27 09:49:17 remote-host [<ffffffff815e5005>] ?
twl_interrupt+0x425/0x720
Feb 27 09:49:17 remote-host [ 1093.813409] [<ffffffff8110baf5>] ?
handle_irq_event_percpu+0x55/0x100
Feb 27 09:49:17 remote-host [ 1093.816002] [<ffffffff8110bbcc>] ?
handle_irq_event+0x2c/0x50
Feb 27 09:49:17 remote-host [ 1093.818316] [<ffffffff8110e7e6>] ?
handle_edge_irq+0x96/0x140
Feb 27 09:49:17 remote-host [ 1093.820631] [<ffffffff8103f195>] ?
handle_irq+0x15/0x30
Feb 27 09:49:17 remote-host [ 1093.822734] [<ffffffff8103efc1>] ?
do_IRQ+0x41/0xd0
Feb 27 09:49:17 remote-host [ 1093.824700] [<ffffffff8186e02a>] ?
common_interrupt+0x6a/0x6a
Feb 27 09:49:17 remote-host [ 1093.827012] <EOI>
Feb 27 09:49:17 remote-host
Feb 27 09:49:17 remote-host [ 1093.827820]
Feb 27 09:49:17 remote-host [<ffffffff81719405>] ?
cpuidle_enter_state+0x45/0xc0
Feb 27 09:49:17 remote-host [ 1093.830317] [<ffffffff817193fa>] ?
cpuidle_enter_state+0x3a/0xc0
Feb 27 09:49:17 remote-host [ 1093.832738] [<ffffffff81104817>] ?
cpu_startup_entry+0x277/0x2f0
Feb 27 09:49:17 remote-host [ 1093.835158] [<ffffffff8186cdd5>] ?
_raw_spin_unlock_irqrestore+0x5/0x10
Feb 27 09:49:17 remote-host [ 1093.837820] Code:
Feb 27 09:49:17 e7
Feb 27 09:49:17 5b
Feb 27 09:49:17 5d
Feb 27 09:49:17 41
Feb 27 09:49:17 5d
Feb 27 09:49:17 41
Feb 27 09:49:17 5e
Feb 27 09:49:17 41
Feb 27 09:49:17 5f
Feb 27 09:49:17 e9
Feb 27 09:49:17 db
Feb 27 09:49:17 e5
Feb 27 09:49:17 ff
Feb 27 09:49:17 ff
Feb 27 09:49:17 0f
Feb 27 09:49:17 1f
Feb 27 09:49:17 00
Feb 27 09:49:17 0f
Feb 27 09:49:17 0b
Feb 27 09:49:17 66
Feb 27 09:49:17 0f
Feb 27 09:49:17 1f
Feb 27 09:49:17 44
Feb 27 09:49:17 00
Feb 27 09:49:17 00
Feb 27 09:49:17 e8
Feb 27 09:49:17 1b
Feb 27 09:49:17 e6
Feb 27 09:49:17 ff
Feb 27 09:49:17 ff
Feb 27 09:49:17 e9
Feb 27 09:49:17 c7
Feb 27 09:49:17 fe
Feb 27 09:49:17 ff
Feb 27 09:49:17 ff
Feb 27 09:49:17 66
Feb 27 09:49:17 0f
Feb 27 09:49:17 1f
Feb 27 09:49:17 44
Feb 27 09:49:17 00
Feb 27 09:49:17 00
Feb 27 09:49:17 8b
Feb 27 09:49:17 76
Feb 27 09:49:17 10
Feb 27 09:49:17 e9
Feb 27 09:49:17 07
Feb 27 09:49:17 fe
Feb 27 09:49:17 ff
Feb 27 09:49:17 ff
Feb 27 09:49:17 0f
Feb 27 09:49:17 1f
Feb 27 09:49:17 80
Feb 27 09:49:17 00
Feb 27 09:49:17 00
Feb 27 09:49:17 00
Feb 27 09:49:17 00
Feb 27 09:49:17 e9
Feb 27 09:49:17 fb
Feb 27 09:49:17 fd
Feb 27 09:49:17 ff
Feb 27 09:49:17 ff
Feb 27 09:49:17 remote-host
Feb 27 09:49:17 remote-host [ 1093.847921] RIP
Feb 27 09:49:17 remote-host [<ffffffff814b7160>] intel_unmap_sg+0x0/0x10
Feb 27 09:49:17 remote-host [ 1093.850238] RSP <ffff880c3fc03ea0>
Feb 27 09:49:17 remote-host [ 1093.851607] CR2: 0000000000000010
Feb 27 09:49:18 remote-host [ 1095.111283] ---[ end trace 501d5952594f8825
]---
Feb 27 09:49:18 remote-host [ 1095.111320] Kernel panic - not syncing: Fatal
exception in interrupt
Feb 27 09:49:18 remote-host [ 1095.114369] Kernel Offset: 0x0 from
0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffff9fffffff)
Feb 27 09:49:18 remote-host [ 1095.118435] drm_kms_helper: panic occurred,
switching back to text console
Feb 27 09:49:18 remote-host [ 1095.121166] ---[ end Kernel panic - not
syncing: Fatal exception in interrupt
Feb 27 09:49:18 remote-host [ 1095.121176] ------------[ cut here
]------------
Feb 27 09:49:18 remote-host [ 1095.121183] WARNING: CPU: 6 PID: 0 at
arch/x86/kernel/smp.c:124 update_process_times+0x49/0x60()
Feb 27 09:49:18 remote-host [ 1095.121187] CPU: 6 PID: 0 Comm: swapper/6
Tainted: G D 3.19.0 #2
Feb 27 09:49:18 remote-host [ 1095.121190] Hardware name: Supermicro
X8DTH-i/6/iF/6F/X8DTH, BIOS 2.1b 05/04/12
Feb 27 09:49:18 remote-host [ 1095.121193] 0000000000000000
Feb 27 09:49:18 remote-host ffffffff81a26a3c
Feb 27 09:49:18 remote-host ffffffff81867dad
Feb 27 09:49:18 remote-host 0000000000000000
Feb 27 09:49:18 remote-host
Feb 27 09:49:18 remote-host [ 1095.121206] ffffffff810d3782
Feb 27 09:49:18 remote-host ffff880624600010
Feb 27 09:49:18 remote-host 0000000000000000
Feb 27 09:49:18 remote-host ffff880c3fc03b38
Feb 27 09:49:18 remote-host
Feb 27 09:49:18 remote-host [ 1095.121218] 0000000000000000
Feb 27 09:49:18 remote-host ffff880c3fc0d238
Feb 27 09:49:18 remote-host ffffffff81115fd9
Feb 27 09:49:18 remote-host ffff880c3fc0d200
Feb 27 09:49:18 remote-host
Feb 27 09:49:18 remote-host [ 1095.121229] Call Trace:
Feb 27 09:49:18 remote-host [ 1095.121232] <IRQ>
Feb 27 09:49:18 remote-host [<ffffffff81867dad>] ? dump_stack+0x40/0x50
Feb 27 09:49:18 remote-host [ 1095.121244] [<ffffffff810d3782>] ?
warn_slowpath_common+0x72/0xb0
Feb 27 09:49:18 remote-host [ 1095.121248] [<ffffffff81115fd9>] ?
update_process_times+0x49/0x60
Feb 27 09:49:18 remote-host [ 1095.121254] [<ffffffff81123923>] ?
tick_sched_timer+0x33/0x70
Feb 27 09:49:18 remote-host [ 1095.121258] [<ffffffff811164a0>] ?
__run_hrtimer.isra.34+0x40/0x100
Feb 27 09:49:18 remote-host [ 1095.121263] [<ffffffff81116d1f>] ?
hrtimer_interrupt+0xef/0x240
Feb 27 09:49:18 remote-host [ 1095.121269] [<ffffffff810673f4>] ?
smp_apic_timer_interrupt+0x34/0x50
Feb 27 09:49:18 remote-host [ 1095.121273] [<ffffffff8186e2ea>] ?
apic_timer_interrupt+0x6a/0x70
Feb 27 09:49:18 remote-host [ 1095.121277] [<ffffffff81866fb0>] ?
panic+0x191/0x1c9
Feb 27 09:49:18 remote-host [ 1095.121281] [<ffffffff81866fad>] ?
panic+0x18e/0x1c9
Feb 27 09:49:18 remote-host [ 1095.121285] [<ffffffff81040685>] ?
oops_end+0x85/0xa0
Feb 27 09:49:18 remote-host [ 1095.121290] [<ffffffff8106fcb4>] ?
no_context+0x134/0x380
Feb 27 09:49:18 remote-host [ 1095.121294] [<ffffffff810703de>] ?
__do_page_fault+0x9e/0x4c0
Feb 27 09:49:18 remote-host [ 1095.121299] [<ffffffff814b2be1>] ?
dma_pte_clear_level+0x121/0x1b0
Feb 27 09:49:18 remote-host [ 1095.121304] [<ffffffff8186ea72>] ?
page_fault+0x22/0x30
Feb 27 09:49:18 remote-host [ 1095.121308] [<ffffffff814b7160>] ?
intel_unmap+0x1f0/0x1f0
Feb 27 09:49:18 remote-host [ 1095.121312] [<ffffffff814b7160>] ?
intel_unmap+0x1f0/0x1f0
Feb 27 09:49:18 remote-host [ 1095.121316] [<ffffffff815e5005>] ?
twl_interrupt+0x425/0x720
Feb 27 09:49:18 remote-host [ 1095.121320] [<ffffffff8110baf5>] ?
handle_irq_event_percpu+0x55/0x100
Feb 27 09:49:18 remote-host [ 1095.121324] [<ffffffff8110bbcc>] ?
handle_irq_event+0x2c/0x50
Feb 27 09:49:18 remote-host [ 1095.121328] [<ffffffff8110e7e6>] ?
handle_edge_irq+0x96/0x140
Feb 27 09:49:18 remote-host [ 1095.121332] [<ffffffff8103f195>] ?
handle_irq+0x15/0x30
Feb 27 09:49:18 remote-host [ 1095.121335] [<ffffffff8103efc1>] ?
do_IRQ+0x41/0xd0
Feb 27 09:49:18 remote-host [ 1095.121339] [<ffffffff8186e02a>] ?
common_interrupt+0x6a/0x6a
Feb 27 09:49:18 remote-host [ 1095.121342] <EOI>
Feb 27 09:49:18 remote-host [<ffffffff81719405>] ?
cpuidle_enter_state+0x45/0xc0
Feb 27 09:49:18 remote-host [ 1095.121349] [<ffffffff817193fa>] ?
cpuidle_enter_state+0x3a/0xc0
Feb 27 09:49:18 remote-host [ 1095.121353] [<ffffffff81104817>] ?
cpu_startup_entry+0x277/0x2f0
Feb 27 09:49:18 remote-host [ 1095.121357] [<ffffffff8186cdd5>] ?
_raw_spin_unlock_irqrestore+0x5/0x10
Feb 27 09:49:18 remote-host [ 1095.121361] ---[ end trace 501d5952594f8826
]---

Justin.


2015-02-27 15:53:44

by Justin Piszcz

[permalink] [raw]
Subject: RE: 3.19 kernel: BUG: unable to handle kernel NULL pointer dereference



> -----Original Message-----
> From: Justin Piszcz [mailto:[email protected]]
> Sent: Friday, February 27, 2015 9:54 AM
> To: [email protected]
> Subject: 3.19 kernel: BUG: unable to handle kernel NULL pointer
dereference
>
> Hello,
>
> With kernel 3.15, I do not recall having any issues, with 3.19, I am
getting
> a kernel crash when I copy files over NFS from machine A to B.
> Is this a known issue?
>
> I suspect it has to do something with this:
> Feb 27 09:31:20 remote-host [ 15.745342] dmar: DRHD: handling fault
status
> reg 2
> Feb 27 09:31:20 remote-host [ 15.745361] dmar: DMAR:[DMA Read] Request
> device [04:00.0] fault addr 0 #012[ 15.745361] DMAR:[fault reason 06]
PTE
> Read access is not set
>

[ .. ]

Here is another crash with debugging enabled:

Feb 27 10:44:42 remote-host
Feb 27 10:44:42 remote-host [ 2256.999876] RIP
Feb 27 10:44:42 remote-host [<ffffffff814e4d01>] intel_unmap_sg+0x1/0x10
Feb 27 10:44:42 remote-host [ 2257.002194] RSP <ffff880c3fc03e38>
Feb 27 10:44:42 remote-host [ 2257.003562] CR2: 0000000000000010
Feb 27 10:44:43 remote-host [ 2258.263414] ---[ end trace 8c48fd88f48d9b7a
]---
Feb 27 10:44:43 remote-host [ 2258.263421] Kernel panic - not syncing: Fatal
exception in interrupt
Feb 27 10:44:43 remote-host [ 2258.266476] Kernel Offset: 0x0 from
0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffff9fffffff)
Feb 27 10:44:43 remote-host [ 2258.270538] drm_kms_helper: panic occurred,
switching back to text console
Feb 27 10:44:43 remote-host [ 2258.273269] ---[ end Kernel panic - not
syncing: Fatal exception in interrupt
Feb 27 10:44:43 remote-host [ 2258.273278] ------------[ cut here
]------------
Feb 27 10:44:43 remote-host [ 2258.273284] WARNING: CPU: 6 PID: 0 at
arch/x86/kernel/smp.c:124 native_smp_send_reschedule+0x61/0x70()
Feb 27 10:44:43 remote-host [ 2258.273289] CPU: 6 PID: 0 Comm: swapper/6
Tainted: G D 3.19.0 #3
Feb 27 10:44:43 remote-host [ 2258.273292] Hardware name: Supermicro
X8DTH-i/6/iF/6F/X8DTH, BIOS 2.1b 05/04/12
Feb 27 10:44:43 remote-host [ 2258.273295] ffffffff81c08a23
Feb 27 10:44:43 remote-host ffff880c3fc03820
Feb 27 10:44:43 remote-host ffffffff818b4dc6
Feb 27 10:44:43 remote-host ffffffff81e45698
Feb 27 10:44:43 remote-host
Feb 27 10:44:43 remote-host [ 2258.273306] 0000000000000000
Feb 27 10:44:43 remote-host ffff880c3fc03860
Feb 27 10:44:43 remote-host ffffffff810d8f8b
Feb 27 10:44:43 remote-host 0000000000000001
Feb 27 10:44:43 remote-host
Feb 27 10:44:43 remote-host [ 2258.273318] 0000000000000000
Feb 27 10:44:43 remote-host ffff880627c11a40
Feb 27 10:44:43 remote-host 0000000000000006
Feb 27 10:44:43 remote-host ffff880624632410
Feb 27 10:44:43 remote-host
Feb 27 10:44:43 remote-host [ 2258.273330] Call Trace:
Feb 27 10:44:43 remote-host [ 2258.273333] <IRQ>
Feb 27 10:44:43 remote-host [<ffffffff818b4dc6>] dump_stack+0x45/0x57
Feb 27 10:44:43 remote-host [ 2258.273346] [<ffffffff810d8f8b>]
warn_slowpath_common+0x7b/0xc0
Feb 27 10:44:43 remote-host [ 2258.273350] [<ffffffff810d90a5>]
warn_slowpath_null+0x15/0x20
Feb 27 10:44:43 remote-host [ 2258.273355] [<ffffffff810690d1>]
native_smp_send_reschedule+0x61/0x70
Feb 27 10:44:43 remote-host [ 2258.273363] [<ffffffff81107141>]
trigger_load_balance+0x141/0x1f0
Feb 27 10:44:43 remote-host [ 2258.273367] [<ffffffff810fa320>]
scheduler_tick+0x90/0xd0
Feb 27 10:44:43 remote-host [ 2258.273373] [<ffffffff81123bbc>]
update_process_times+0x4c/0x60
Feb 27 10:44:43 remote-host [ 2258.273380] [<ffffffff81131d90>]
tick_sched_handle.isra.18+0x20/0x50
Feb 27 10:44:43 remote-host [ 2258.273384] [<ffffffff81131dff>]
tick_sched_timer+0x3f/0x80
Feb 27 10:44:43 remote-host [ 2258.273389] [<ffffffff811240cf>]
__run_hrtimer.isra.32+0x4f/0x130
Feb 27 10:44:43 remote-host [ 2258.273394] [<ffffffff81124a2f>]
hrtimer_interrupt+0xef/0x240
Feb 27 10:44:43 remote-host [ 2258.273399] [<ffffffff8106ae17>]
local_apic_timer_interrupt+0x37/0x60
Feb 27 10:44:43 remote-host [ 2258.273403] [<ffffffff8106b3ec>]
smp_apic_timer_interrupt+0x3c/0x50
Feb 27 10:44:43 remote-host [ 2258.273407] [<ffffffff818bcbea>]
apic_timer_interrupt+0x6a/0x70
Feb 27 10:44:43 remote-host [ 2258.273412] [<ffffffff818b3ede>] ?
panic+0x18f/0x1cd
Feb 27 10:44:43 remote-host [ 2258.273416] [<ffffffff818b3eda>] ?
panic+0x18b/0x1cd
Feb 27 10:44:43 remote-host [ 2258.273421] [<ffffffff81042b83>]
oops_end+0x83/0xa0
Feb 27 10:44:43 remote-host [ 2258.273426] [<ffffffff8107440b>]
no_context+0x13b/0x390
Feb 27 10:44:43 remote-host [ 2258.273431] [<ffffffff8107477d>]
__bad_area_nosemaphore+0x11d/0x240
Feb 27 10:44:43 remote-host [ 2258.273434] [<ffffffff810748ae>]
bad_area_nosemaphore+0xe/0x10
Feb 27 10:44:43 remote-host [ 2258.273438] [<ffffffff81074b7e>]
__do_page_fault+0x9e/0x4f0
Feb 27 10:44:43 remote-host [ 2258.273443] [<ffffffff814e062b>] ?
dma_pte_clear_level+0x11b/0x1a0
Feb 27 10:44:43 remote-host [ 2258.273447] [<ffffffff8107500c>]
do_page_fault+0xc/0x10
Feb 27 10:44:43 remote-host [ 2258.273451] [<ffffffff818bd372>]
page_fault+0x22/0x30
Feb 27 10:44:43 remote-host [ 2258.273455] [<ffffffff814e4d00>] ?
intel_unmap+0x1f0/0x1f0
Feb 27 10:44:43 remote-host [ 2258.273460] [<ffffffff814e4d01>] ?
intel_unmap_sg+0x1/0x10
Feb 27 10:44:43 remote-host [ 2258.273468] [<ffffffff81409901>] ?
blk_complete_request+0x11/0x20
Feb 27 10:44:43 remote-host [ 2258.273472] [<ffffffff81611100>] ?
scsi_dma_unmap+0x50/0x70
Feb 27 10:44:43 remote-host [ 2258.273476] [<ffffffff8161bc45>]
twl_interrupt+0x415/0x700
Feb 27 10:44:43 remote-host [ 2258.273480] [<ffffffff81118ac0>]
handle_irq_event_percpu+0x60/0x110
Feb 27 10:44:43 remote-host [ 2258.273484] [<ffffffff81118ba5>]
handle_irq_event+0x35/0x60
Feb 27 10:44:43 remote-host [ 2258.273489] [<ffffffff8111ba16>]
handle_edge_irq+0xa6/0x150
Feb 27 10:44:43 remote-host [ 2258.273492] [<ffffffff8104154d>]
handle_irq+0x1d/0x40
Feb 27 10:44:43 remote-host [ 2258.273496] [<ffffffff8104135a>]
do_IRQ+0x4a/0xe0
Feb 27 10:44:43 remote-host [ 2258.273500] [<ffffffff818bc92a>]
common_interrupt+0x6a/0x6a
Feb 27 10:44:43 remote-host [ 2258.273503] <EOI>
Feb 27 10:44:43 remote-host [<ffffffff81759c48>] ?
cpuidle_enter_state+0x48/0xc0
Feb 27 10:44:43 remote-host [ 2258.273511] [<ffffffff81759c3d>] ?
cpuidle_enter_state+0x3d/0xc0
Feb 27 10:44:43 remote-host [ 2258.273516] [<ffffffff81759d92>]
cpuidle_enter+0x12/0x20
Feb 27 10:44:43 remote-host [ 2258.273520] [<ffffffff8110cce2>]
cpu_startup_entry+0x272/0x2f0
Feb 27 10:44:43 remote-host [ 2258.273524] [<ffffffff81069b5a>]
start_secondary+0x13a/0x150
Feb 27 10:44:43 remote-host [ 2258.273528] ---[ end trace 8c48fd88f48d9b7b
]---
Feb 27 10:46:09 atom rsyslogd: -- MARK --

Justin.

2015-02-28 22:19:12

by Justin Piszcz

[permalink] [raw]
Subject: RE: 3.19 kernel: BUG: unable to handle kernel NULL pointer dereference



> -----Original Message-----
> From: Justin Piszcz [mailto:[email protected]]
> Sent: Friday, February 27, 2015 10:54 AM
> To: [email protected]
> Subject: RE: 3.19 kernel: BUG: unable to handle kernel NULL pointer
> dereference
>
>
>
> > -----Original Message-----
> > From: Justin Piszcz [mailto:[email protected]]
> > Sent: Friday, February 27, 2015 9:54 AM
> > To: [email protected]
> > Subject: 3.19 kernel: BUG: unable to handle kernel NULL pointer
> dereference
> >
> > Hello,
> >
> > With kernel 3.15, I do not recall having any issues, with 3.19, I am
> getting
> > a kernel crash when I copy files over NFS from machine A to B.
> > Is this a known issue?
> >
> > I suspect it has to do something with this:
> > Feb 27 09:31:20 remote-host [ 15.745342] dmar: DRHD: handling fault
> status
> > reg 2
> > Feb 27 09:31:20 remote-host [ 15.745361] dmar: DMAR:[DMA Read]
> Request
> > device [04:00.0] fault addr 0 #012[ 15.745361] DMAR:[fault reason 06]
> PTE
> > Read access is not set
> >
>
> [ .. ]
>
> Here is another crash with debugging enabled:
>

[ .. ]

https://bbs.archlinux.org/viewtopic.php?id=176398
https://forums.opensuse.org/showthread.php/497436-DMAR-errors-with-VT-d-enab
led

I disabled Virtualization in the kernel and this works around the problem.

Justin.

2015-02-28 22:57:09

by Justin Piszcz

[permalink] [raw]
Subject: RE: 3.19 kernel: BUG: unable to handle kernel NULL pointer dereference



> -----Original Message-----
> From: Justin Piszcz [mailto:[email protected]]
> Sent: Saturday, February 28, 2015 5:19 PM
> To: [email protected]
> Subject: RE: 3.19 kernel: BUG: unable to handle kernel NULL pointer
> dereference
>
>
>
> > -----Original Message-----
> > From: Justin Piszcz [mailto:[email protected]]
> > Sent: Friday, February 27, 2015 10:54 AM
> > To: [email protected]
> > Subject: RE: 3.19 kernel: BUG: unable to handle kernel NULL pointer
> > dereference
> >
> >
> >
> > > -----Original Message-----
> > > From: Justin Piszcz [mailto:[email protected]]
> > > Sent: Friday, February 27, 2015 9:54 AM
> > > To: [email protected]
> > > Subject: 3.19 kernel: BUG: unable to handle kernel NULL pointer
> > dereference
> > >
> > > Hello,
> > >
> > > With kernel 3.15, I do not recall having any issues, with 3.19, I am
> > getting
> > > a kernel crash when I copy files over NFS from machine A to B.
> > > Is this a known issue?
> > >
> > > I suspect it has to do something with this:
> > > Feb 27 09:31:20 remote-host [ 15.745342] dmar: DRHD: handling fault
> > status
> > > reg 2
> > > Feb 27 09:31:20 remote-host [ 15.745361] dmar: DMAR:[DMA Read]
> > Request
> > > device [04:00.0] fault addr 0 #012[ 15.745361] DMAR:[fault reason
06]
> > PTE
> > > Read access is not set
> > >
> >
> > [ .. ]
> >
> > Here is another crash with debugging enabled:
> >
>
> [ .. ]
>

[ . ]

Sorry for the spam-- still occurs, will remove the NVIDIA card, looks like
it is still an issue for users--
https://bugs.freedesktop.org/show_bug.cgi?id=66696

Justin.