Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1759424AbdLRT4Y (ORCPT ); Mon, 18 Dec 2017 14:56:24 -0500 Received: from out3-smtp.messagingengine.com ([66.111.4.27]:46851 "EHLO out3-smtp.messagingengine.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1758334AbdLRT4U (ORCPT ); Mon, 18 Dec 2017 14:56:20 -0500 X-ME-Sender: Subject: Re: PROBLEM: NULL pointer dereference in kernel 4.14.6 To: vcaputo@pengaru.com, linux-kernel@vger.kernel.org, tj@kernel.org Cc: cgroups@vger.kernel.org References: <1513512885.3653140.1207725096.395A9CCC@webmail.messagingengine.com> <08995310-d853-ee77-ed1f-26cc336a4a30@incorrekt.com> <20171217232448.yfaxxew2ijaay7iu@shells.gnugeneration.com> From: Bronek Kozicki Message-ID: <95d82ae1-fe4c-4eee-8e94-fa0df3e25532@incorrekt.com> Date: Mon, 18 Dec 2017 19:56:17 +0000 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: <20171217232448.yfaxxew2ijaay7iu@shells.gnugeneration.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-GB Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1310 Lines: 22 On 17/12/2017 23:24, vcaputo@pengaru.com wrote: > On Sun, Dec 17, 2017 at 05:49:44PM +0000, Bronek Kozicki wrote: >> I just upgraded to 4.14.7 and tried to reproduce this error, this time under strace. As you can see this happens when systemctl tries to read a specific entry under /sys/fs . In case this matters, the entry is for a small virtual machine running under qemu/kvm and managed by libvirt. >> >> open("/sys/fs/cgroup/unified/machine.slice", O_RDONLY|O_NONBLOCK|O_DIRECTORY|O_CLOEXEC) = 5 >> fstat(5, {st_mode=S_IFDIR|0755, st_size=0, ...}) = 0 >> getdents(5, /* 12 entries */, 32768) = 464 >> openat(AT_FDCWD, "/sys/fs/cgroup/unified/machine.slice/machine-qemu\\x2d1\\x2dkartuzy\\x2dspice.scope/cgroup.procs", O_RDONLY|O_CLOEXEC) = 8 >> fstat(8, {st_mode=S_IFREG|0644, st_size=0, ...}) = 0 >> read(8, ) = ? >> +++ killed by SIGKILL +++ >> [1] 12078 killed strace -- systemctl status >> >> > > This recently came through lkml, may be related: > https://marc.info/?l=linux-kernel&m=151320108922415&w=2 thank you, it certainly seems related. Is there some debugging option I could enable, or patch I could apply, which would make the point of data corruption easier to find? I'm ok taking untested patches, if that helps finding the location of the bug. B.