Subject: Re: kernel panics with 4.14.X versions
From: Pavlos Parissis
To: Jan Kara, Guillaume Morin
Cc: stable@vger.kernel.org, decui@microsoft.com, jack@suse.com,
    linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
    mszeredi@redhat.com
Date: Tue, 17 Apr 2018 13:18:46 +0200
Message-ID: <134eb955-fae7-9fd0-946e-787986509d7b@gmail.com>
In-Reply-To: <9b11cfba-4bdc-8a3e-cd33-2f7e8d513bdf@gmail.com>
References: <20180416132550.d25jtdntdvpy55l3@bender.morinfr.org>
    <20180416144041.t2mt7ugzwqr56ka3@quack2.suse.cz>
    <9b11cfba-4bdc-8a3e-cd33-2f7e8d513bdf@gmail.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On 17/04/2018 01:31 AM, Pavlos Parissis wrote:
> On 16/04/2018 04:40 PM, Jan Kara wrote:
>> On Mon 16-04-18 15:25:50, Guillaume Morin wrote:
>>> Fwiw, there have been already reports of similar soft lockups in
>>> fsnotify() on 4.14: https://lkml.org/lkml/2018/3/2/1038
>>>
>>> We have also noticed similar softlockups with 4.14.22 here.
>>
>> Yeah.
>>
>>> On 16 Apr 13:54, Pavlos Parissis wrote:
>>>>
>>>> Hi all,
>>>>
>
> [..snip..]
>
>>>> [373782.361064] watchdog: BUG: soft lockup - CPU#24 stuck for 22s! [kube-apiserver:24261]
>>>> [373782.378225] Modules linked in: binfmt_misc sctp_diag sctp dccp_diag dccp tcp_diag udp_diag
>>>> inet_diag unix_diag cfg80211 rfkill dell_rbu 8021q garp mrp xfs libcrc32c loop x86_pkg_temp_thermal
>>>> intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel
>>>> pcbc aesni_intel vfat fat crypto_simd glue_helper cryptd intel_cstate intel_rapl_perf iTCO_wdt ses
>>>> iTCO_vendor_support mxm_wmi ipmi_si dcdbas enclosure mei_me pcspkr ipmi_devintf lpc_ich sg mei
>>>> ipmi_msghandler mfd_core shpchp wmi acpi_power_meter netconsole nfsd auth_rpcgss nfs_acl lockd grace
>>>> sunrpc ip_tables ext4 mbcache jbd2 i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt
>>>> fb_sys_fops sd_mod ttm crc32c_intel ahci libahci mlx5_core drm mlxfw mpt3sas ptp libata raid_class
>>>> pps_core scsi_transport_sas
>>>> [373782.516807] dm_mirror dm_region_hash dm_log dm_mod dax
>>>> [373782.531739] CPU: 24 PID: 24261 Comm: kube-apiserver Not tainted 4.14.32-1.el7.x86_64 #1
>>>> [373782.549848] Hardware name: Dell Inc. PowerEdge R630/02C2CP, BIOS 2.4.3 01/17/2017
>>>> [373782.567486] task: ffff882f66d28000 task.stack: ffffc9002120c000
>>>> [373782.583441] RIP: 0010:fsnotify+0x197/0x510
>>>> [373782.597319] RSP: 0018:ffffc9002120fdb8 EFLAGS: 00000286 ORIG_RAX: ffffffffffffff10
>>>> [373782.615308] RAX: 0000000000000000 RBX: ffff882f9ec65c20 RCX: 0000000000000002
>>>> [373782.632950] RDX: 0000000000028700 RSI: 0000000000000002 RDI: ffffffff8269a4e0
>>>> [373782.650616] RBP: ffffc9002120fe98 R08: 0000000000000000 R09: 0000000000000000
>>>> [373782.668287] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
>>>> [373782.685918] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
>>>> [373782.703302] FS: 000000c42009f090(0000) GS:ffff882fbf900000(0000) knlGS:0000000000000000
>>>> [373782.721887] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>>>> [373782.737741] CR2: 00007f82b6539244 CR3: 0000002f3de2a005 CR4: 00000000003606e0
>>>> [373782.755247] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
>>>> [373782.772722] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
>>>> [373782.790043] Call Trace:
>>>> [373782.802041] vfs_write+0x151/0x1b0
>>>> [373782.815081] ? syscall_trace_enter+0x1cd/0x2b0
>>>> [373782.829175] SyS_write+0x55/0xc0
>>>> [373782.841870] do_syscall_64+0x79/0x1b0
>>>> [373782.855073] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
>>
>> Can you please run RIP through ./scripts/faddr2line to see where exactly
>> are we looping? I expect the loop iterating over marks to notify but better
>> be sure.
>>
>
> I am very newbie on this and I tried with:
> ../repo/Linux/linux/scripts/faddr2line ./vmlinuz-4.14.32-1.el7.x86_64 0010:fsnotify+0x197/0x510
> readelf: Error: Not an ELF file - it has the wrong magic bytes at the start
> size: ./vmlinuz-4.14.32-1.el7.x86_64: Warning: Ignoring section flag IMAGE_SCN_MEM_NOT_PAGED in section .bss
> nm: ./vmlinuz-4.14.32-1.el7.x86_64: Warning: Ignoring section flag IMAGE_SCN_MEM_NOT_PAGED in section .bss
> nm: ./vmlinuz-4.14.32-1.el7.x86_64: no symbols
> size: ./vmlinuz-4.14.32-1.el7.x86_64: Warning: Ignoring section flag IMAGE_SCN_MEM_NOT_PAGED in section .bss
> nm: ./vmlinuz-4.14.32-1.el7.x86_64: Warning: Ignoring section flag IMAGE_SCN_MEM_NOT_PAGED in section .bss
> nm: ./vmlinuz-4.14.32-1.el7.x86_64: no symbols
> no match for 0010:fsnotify+0x197/0x510
>
> Obviously, I am doing something very wrong.
>

I produced an uncompressed image (the error above was caused by giving a
compressed image to faddr2line) by compiling 4.14.32 with the config we have
in production, and now faddr2line reports:

../repo/Linux/linux/scripts/faddr2line ./vmlinux 0010:fsnotify+0x197/0x510
no match for 0010:fsnotify+0x197/0x510

../repo/Linux/linux/scripts/faddr2line ./vmlinux fsnotify+0x197/0x510
skipping fsnotify address at 0xffffffff8129baf7 due to size mismatch (0x510 != 0x520)
no match for fsnotify+0x197/0x510

What am I doing wrong?

Cheers,
Pavlos
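
A minimal sketch, in Python, of pulling the faddr2line argument out of the RIP line in the trace above. It assumes the RIP line always has the form "RIP: <selector>:<func>+<offset>/<size>" seen in this log. The helper names parse_rip and faddr2line_arg are hypothetical and are not part of scripts/faddr2line. The leading "0010:" is the code segment selector (the same value shown on the "CS: 0010" line), not part of the symbol, which would explain the first "no match" result. Reading "(0x510 != 0x520)" as the size from the oops versus the size of fsnotify in the rebuilt vmlinux is also an assumption.

```python
import re

# RIP line format as it appears in the trace above, e.g.
#   "RIP: 0010:fsnotify+0x197/0x510"
# The optional "0010:" is the code segment selector, not part of the symbol.
RIP_RE = re.compile(
    r"RIP:\s+(?:[0-9a-fA-F]{4}:)?"
    r"(?P<func>[A-Za-z_][\w.]*)"
    r"\+(?P<offset>0x[0-9a-fA-F]+)/(?P<size>0x[0-9a-fA-F]+)"
)


def parse_rip(line):
    """Return (func, offset, size) from an oops RIP line, or None if absent."""
    m = RIP_RE.search(line)
    if m is None:
        return None
    return m.group("func"), int(m.group("offset"), 16), int(m.group("size"), 16)


def faddr2line_arg(func, offset, size):
    """Format func+offset/size without the segment selector prefix."""
    return f"{func}+{offset:#x}/{size:#x}"


if __name__ == "__main__":
    line = "[373782.583441] RIP: 0010:fsnotify+0x197/0x510"
    func, offset, size = parse_rip(line)
    print(faddr2line_arg(func, offset, size))
    # 0x520 is the size reported for the rebuilt vmlinux in the output above;
    # a mismatch means the offset may not point at the same instruction.
    print(size == 0x520)
```

If the sizes differ, the vmlinux was likely not built from the same source, config and compiler as the running kernel, so a vmlinux matching the running 4.14.32-1.el7.x86_64 build (for example from its debuginfo package, if one exists) would be the safer input.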