Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751316AbdH1Ly5 (ORCPT ); Mon, 28 Aug 2017 07:54:57 -0400 Received: from mail-by2nam03on0041.outbound.protection.outlook.com ([104.47.42.41]:12699 "EHLO NAM03-BY2-obe.outbound.protection.outlook.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1750866AbdH1Ly4 (ORCPT ); Mon, 28 Aug 2017 07:54:56 -0400 Authentication-Results: spf=none (sender IP is ) smtp.mailfrom=Vadim.Lomovtsev@cavium.com; Date: Mon, 28 Aug 2017 04:54:47 -0700 From: Vadim Lomovtsev To: catalin.marinas@arm.com, will.deacon@arm.com, peterz@infradead.org, mingo@redhat.com, msalter@redhat.com, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org Cc: Vadim.Lomovtsev@caviumnetworks.com, Robert.Richter@cavium.com Subject: [arm64 BUG] INFO: rcu_preempt detected stalls on CPUs/tasks Message-ID: <20170828115447.GA20818@localhost.localdomain> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.6.1 (2016-04-27) X-Originating-IP: [50.233.148.156] X-ClientProxiedBy: BN6PR1301CA0031.namprd13.prod.outlook.com (10.174.84.172) To MWHPR07MB3005.namprd07.prod.outlook.com (10.172.96.11) X-MS-PublicTrafficType: Email X-MS-Office365-Filtering-Correlation-Id: fae0ec70-3dcc-4510-2826-08d4ee0b9885 X-Microsoft-Antispam: UriScan:;BCL:0;PCL:0;RULEID:(300000500095)(300135000095)(300000501095)(300135300095)(22001)(300000502095)(300135100095)(2017030254152)(300000503095)(300135400095)(201703131423075)(201703031133081)(201702281549075)(300000504095)(300135200095)(300000505095)(300135600095)(300000506095)(300135500095);SRVR:MWHPR07MB3005; X-Microsoft-Exchange-Diagnostics: 1;MWHPR07MB3005;3:Fjy/FxN3aiUF7WUoxxLMGU/tO3JmpRNAS1ibzdFdX2Wutbdw9eYml6KHH0HaG/HnHfxd+GAldiOyI6Te9kMs9t4M5salfn1wMHWWRHQuxqqoP8sHr5wO4IHQ0/+e3fR9tv4K0bdgo14ha377L/HWYLIfQLGFzRb1M+BoPHtk0CCLs4f49vFIUW5ZWI+w23S2ji10z+nWACudVwdW+5rAcubUsTobNv8CiIWXBRcXLJtomd+kcRW68aRUCDTx7u3a;25:rCDhAquEhnGNXaLtqBeqWbWa4lvJBJUrCOGv7DjqxYqN5gFTImGF6wsJxR+vT5523aU0EeuIv6nLjBah1vG48/medETgKnY7Ryb/dsbihwFkLlFlXoYFYRAokon2Llt6lky9jPMeNdCvDyYjcmLkXbwpp7gYZXzQDqOALSSIN9SitzAgyCBROftCdPbpLV4FaBbqbgIDn0MMzev5+V24cvfJQYy3lmz4l3VvCCHKtlTYu98z/XpkA3VrBqsLmykYZAjgM3zcqq6au130WTKL083mwYCEyhUTjroeCpxYbKfNlswzIQP8xvmi08xjp0I4Y8pJKvX0avHiq6UEXHtGKA==;31:R80nzx0wHlHAnztHTLrV5aQLV6rZZOCISgj7EbhMp7PV3bQQGyFiEcBHGlrc9NkcCDy9pYc/kyEyfpDIHZBaYWFbwW7KB2xNDJMv4ilHZvMAaVoGO0bFPzPieunf8MWSAdvnZDSqZzSrajgAZAWHbLqAqLadMHCyQgLYCSRmzjNAt1pBC3n36NT6Zb1lQdPPBU1X5Te9yfFqXXjZKFN0etDxULGaAc4q++s6qflsNTg= X-MS-TrafficTypeDiagnostic: MWHPR07MB3005: X-Microsoft-Exchange-Diagnostics: 1;MWHPR07MB3005;20:6PtbHv+sOa5Nlx0E4WfZByakz0AZSRlyeUce5WchbDYPInlENQlsP1lSx60nnpFcTxIOzAXPckLzE5rVOFc52OD76piQzVFeXMytIIejPdJt9cBZuvIAgccsKEdaaqpaDTqSZEZaP8UGABIEgh1sSIkSpW4aTArFKdruTnlOth4cZTzAV7QZ0tqNwHE6VIZfmd9PKwjXK0MLbTbi9/ptaZx0R5juXKo9CnxP60inSevj6pJAz2M/ZofrZqXkoof3oXKc5lviqJhcWt4RwypVx1D5PhjdnZk63/Ujoyo65rpFQJAxP6k1WD3u8zDxwUjdyJAAFIsPclG9lWMHKfKvt97UqePN4hnpgzT5/eFnJOajnxjCkBxuH77D9iErlLJYirXNoXzVxh/26xMuXUYTVdywCrG7ErxSEkSNBFsHlebp0k+cQIM8iNtsdaC+X1vTKn43TprMAsDJ9PlsFI3n7g2Ds2uskvUyzY3KEfS6xt69mNrNprpXXiTUy89A9EV3If1OgUe6Q9A+yPHw/V11I8otlwt+e6QfELAAsRStc/BCnUFmnypZL6bXRADFby83QfKJ4EoJw+B3ShqCoi5D4+LVPp4GxU48bqFIbuw0bwo=;4:9s+3PvZ6QzwUhYm0/8ZbyYgdNcYjkuFt3qFNQySRC5f7u5El2nbN9wBB99R7xEH9zM3YjWBKDxDlwO0778XUwOJkqMz60gXQegD74XWAUEy0Hw1VPRW4GencLL6Oz02+7/l6EIRg5s38DURjqgNuU8EVqG1I/uRpK1gXELtgaohRED8Edz8q+ykfi6b1peYFyW5U2vl8Ex02C6kLPJNVRg3lyVYoiQQujr1yXR+BzATfGhO2v1mZ6fBBictvy/4B X-Exchange-Antispam-Report-Test: UriScan:; X-Microsoft-Antispam-PRVS: X-Exchange-Antispam-Report-CFA-Test: BCL:0;PCL:0;RULEID:(100000700101)(100105000095)(100000701101)(100105300095)(100000702101)(100105100095)(6040450)(601004)(2401047)(8121501046)(5005006)(100000703101)(100105400095)(10201501046)(93006095)(3002001)(6041248)(201703131423075)(201702281528075)(201703061421075)(201703061406153)(20161123564025)(20161123562025)(20161123555025)(20161123560025)(20161123558100)(6072148)(201708071742011)(100000704101)(100105200095)(100000705101)(100105500095);SRVR:MWHPR07MB3005;BCL:0;PCL:0;RULEID:(100000800101)(100110000095)(100000801101)(100110300095)(100000802101)(100110100095)(100000803101)(100110400095)(100000804101)(100110200095)(100000805101)(100110500095);SRVR:MWHPR07MB3005; X-Forefront-PRVS: 0413C9F1ED X-Forefront-Antispam-Report: SFV:NSPM;SFS:(10009020)(4630300001)(7370300001)(6009001)(6069001)(39830400002)(189002)(53754006)(199003)(42882006)(7736002)(61506002)(4743002)(50986999)(9686003)(54356999)(68736007)(6306002)(25786009)(110136004)(107886003)(81156014)(53936002)(47776003)(305945005)(2906002)(66066001)(33656002)(189998001)(4326008)(81166006)(4001350100001)(72206003)(97736004)(5660300001)(6506006)(50466002)(478600001)(6666003)(55016002)(8676002)(42186005)(83506001)(6116002)(106356001)(7350300001)(105586002)(1076002)(101416001)(23726003)(3846002)(966005)(18370500001);DIR:OUT;SFP:1101;SCL:1;SRVR:MWHPR07MB3005;H:localhost.localdomain;FPR:;SPF:None;PTR:InfoNoRecords;MX:1;A:1;LANG:en; X-Microsoft-Exchange-Diagnostics: =?us-ascii?Q?1;MWHPR07MB3005;23:tNWAiuVWmOWm0uvcda/gwzxNGKibLW6R4Sj3aMaEp?= =?us-ascii?Q?8NkGpHLvSHJ9M0MB64+gfoEFNMNYv095P/6WSx2Jx1KT4Wm/Ywcu8c2Eb788?= =?us-ascii?Q?dcso2rrdNGnUd61etbYUZ0HHpuoZ+rvtct/zjeWK6Kw6ht/xmgyJHpZ2yQ4W?= =?us-ascii?Q?ZWD89MjuJGp3U65I7010I0cNgYQF8+CKcBwoDsgg4TJx4EunhFjiktEHIjO3?= =?us-ascii?Q?cCEpuo/zmjCEiIcYpkwWq0/qweDPIRNthW1oD2bREGDNk68ndt0fFKBTiXSA?= =?us-ascii?Q?GzgM4NKVTrmYkSvdOIcbKICVZPHn/onhuKE3LopQgdiIwk+llB62CL8WiNXo?= =?us-ascii?Q?cmNdIrOh6FoRuWgisDaY3b4FWG2z8CRltO8QnU8VjbMfaVJEj1e7yhO6t+N+?= =?us-ascii?Q?BMdSQ1F1J7PqcY5WrVijQjaCHJYzhWnCeDW7tPKmzqAhaJd26fLXH/mVLomo?= =?us-ascii?Q?U30BpblCQftcsAWlDpTk2PAUZjS3fETTOaS/11kSTTumL1Ii6pF3SwITTu1C?= =?us-ascii?Q?5pQgAPZU0JPbkjpEhbQF8KrPq9PrNh1mb9/W2Uy++WY3ZqjAvey7wQeYHtjn?= =?us-ascii?Q?feXZp4V6DPyR/Aiq28EoYD99/x9tqJxuw2dsqEuXH3NZQEN/Zo7hMmlsXBQl?= =?us-ascii?Q?7bXebmSYVBsX/AQdn0zx6V7El7GwR33DEIOQXayXLGfc1r9XJNXoSm2xyL3g?= =?us-ascii?Q?EpSTdXKkNZWVBepEJZjQW3Zye+QbIS1KE24KEmQ/yalG2HwpKIZY1BEeJWK4?= =?us-ascii?Q?qSJC3ODN74Esa0aryuRiPLBM1WO8/nrk+obR72dclpiuM9GBgW4PbsfEZaf3?= =?us-ascii?Q?vDgmSZvbqevx5hlE1XQ9/NPjkR/ymzWWWLYkmjarPN9GgkGSfhmQlfpG4z1r?= =?us-ascii?Q?b8tuAYjZmC8ZTvzM+qJqKut6QF2vGBgMh/xYhoaxmKMuOc1q/3niAWvEMOlN?= =?us-ascii?Q?dGsyCXeIUJ4JAsi0ZKNzYuxsZq1wmT7nbBYaFHV3WVP0X6C6QckP8Rj8kWD6?= =?us-ascii?Q?ChZHEsSBXh/uUUK12e3giLSuhEjLUoavC0moBp9rLSRQDnM9mYC5jknGFx/d?= =?us-ascii?Q?aSp98BOHEGZ7RXAYQPEJO4c6caDtrmRTJFJxyY99bzpedYrVMboD3c0eKcab?= =?us-ascii?Q?4Zb1BUnSOdOEsMW6hYoYWlUIGcuQl3PLqijuxzyNPbfScs+CY8g68hT1bj23?= =?us-ascii?Q?ccEMZ2nDpQaAZ688/eDvgt+jh+nC+8YFpjHtJzP7vgNwUp7AKg1keiDe5ESx?= =?us-ascii?Q?eKpvX7FtOl/5/VXRlA=3D?= X-Microsoft-Exchange-Diagnostics: 1;MWHPR07MB3005;6:SkSX2/PfDQ7zKv/X6Mwx4apt2L2nZNMPSAWTFTv9gUA+8KfjxC5lqn94/SVM01VVdNr9fzH28zywPXc50XNEoWgJ/sy/5x8Xe5X62f8fiVuG1cOf5q0Ft+QvhAnfHi0Cew6djr8I8Y2oZYwJtfmeXr0ll8kooh9mAhf/KgKoeZGkNKdOQuOefLRbxZy65xtloFg9wZl88Q8OVXvKoTo9PNX8Q5KowE6WXtDg2ItDdKKU4qG8kA+XWp+UbV0AR1XlJDVcwXAnDaSOVkNHREjuDDDGABh104AVpjoNc1o2PW/yF7hQ5zItWWumG2jG7OJFGErY87QBuUBzRqZsehrPdw==;5:UAmqvaU+8vcxS9KpnOjzQJMhL3oMQp75a1061gKTxiGYu1K6oI5+ReeEp9T6Sz6oOB9/QJ12oy3w66/JERaQnqNYoELdAO5067WuXHIcMeITTdYhJj/c+sjDYohocBZI3VH7zMaD+LVD5WOTfvq1Yw==;24:+1rRuo74n85uRex0MAxYTjybOhzq4p5Vw2q4IJAqz/v4TiV0F6YvnwgHFttg4qQZo/xpowbKg+Jlt8yJmjAP6FdIk/HfrTJZDrlUulAtT/4=;7:gsg8Z1j3v3btgYLzlW9aiYlAGvb9Jsf/T0swJ4mMjVcQoTqkDILiWfcMrXlzwQ2Z8XQ8QMYnjcFbQrPWubctGmKKUQqBY3wNwoFZ/o5ueEl8zZyJXN/wmfB0JJ+LLnGU0ORq74+kWVKIemW80ih7GlAD6O62oGM2ZC0PynSp99W/xXiTx+SjObKkUMtTDggnbIIwZJK3s9QdEWzVIsbVY0lrfDygBgFMcKkJEHlj/Mo= SpamDiagnosticOutput: 1:99 SpamDiagnosticMetadata: NSPM X-OriginatorOrg: caviumnetworks.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 28 Aug 2017 11:54:52.0434 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-Transport-CrossTenantHeadersStamped: MWHPR07MB3005 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3461 Lines: 86 Hi all, Following issue has been observed while running stress-ng test at arm64-based board. Please find short summary below. Similar issue was reported a while ago: https://lkml.org/lkml/2017/1/20/101 but in my case I have totally "empty" setup having nothing special running except test itself and OS default stuff (using Centos 7.3) so before test run cpu is mostly idle. Problem description: ==================== Observed kernel message "INFO: rcu_preempt detected stalls on CPUs/tasks" while running strss-ng test. Setup: ====== kernel: 4.13.0-rc7 Arch: arm64 Board: Cavium ThunderX (48 CPU) Steps to reproduce: =================== 1. clone & build stress-ng src git clone git://kernel.ubuntu.com/cking/stress-ng.git cd stress-ng make 2. run stress test ./stress-ng --kill 4 -t 300 -v Expected results: ================= - no any warning/error messages in dmesg; - no kernel failures/crashes; Actual results: =============== Following messages observed at default console: [ 354.876479] INFO: rcu_preempt detected stalls on CPUs/tasks: [ 354.882219] 4-...: (1 GPs behind) idle=7da/140000000000000/0 softirq=1914/1914 fqs=2626 [ 354.890490] 5-...: (1 GPs behind) idle=016/140000000000000/0 softirq=2474/2474 fqs=2627 [ 354.898761] 12-...: (1 GPs behind) idle=262/140000000000000/0 softirq=3020/3022 fqs=2627 [ 354.907119] 28-...: (1 GPs behind) idle=432/140000000000000/0 softirq=2800/2802 fqs=2627 [ 354.915474] (detected by 25, t=5261 jiffies, g=4059, c=4058, q=4696) [ 354.921992] Task dump for CPU 4: [ 354.925252] kworker/dying R running task 0 31 2 0x00000002 [ 354.932383] Call trace: [ 354.934858] [] __switch_to+0x70/0x7c [ 354.940053] [] return_address+0x44/0x64 [ 354.945511] [] preempt_count_add+0xfc/0x118 [ 354.951323] [] _raw_write_lock_irq+0x1c/0x48 [ 354.957222] [] do_exit+0x2d0/0x958 [ 354.962244] [] kthread+0xe4/0x12c [ 354.967174] [] ret_from_fork+0x10/0x20 [ 354.972542] Task dump for CPU 5: [ 354.975802] kworker/dying R running task 0 36 2 0x00000002 [ 354.982933] Call trace: [ 354.985403] [] __switch_to+0x70/0x7c [ 354.990597] [] 0xfffffe1f00000000 [ 354.995526] Task dump for CPU 12: [ 354.998874] kworker/dying R running task 0 71 2 0x00000002 [ 355.006004] Call trace: [ 355.008473] [] __switch_to+0x70/0x7c [ 355.013666] [] return_address+0x44/0x64 [ 355.019124] [] preempt_count_add+0xfc/0x118 [ 355.024933] [] _raw_write_lock_irq+0x1c/0x48 [ 355.030830] [] do_exit+0x2d0/0x958 [ 355.035849] [] kthread+0xe4/0x12c [ 355.040778] [] ret_from_fork+0x10/0x20 [ 355.046147] Task dump for CPU 28: [ 355.049495] kworker/dying R running task 0 151 2 0x00000002 [ 355.056625] Call trace: [ 355.059094] [] __switch_to+0x70/0x7c [ 355.064288] [] return_address+0x44/0x64 [ 355.069745] [] preempt_count_add+0xfc/0x118 [ 355.075554] [] _raw_write_lock_irq+0x1c/0x48 [ 355.081451] [] do_exit+0x2d0/0x958 [ 355.086470] [] kthread+0xe4/0x12c [ 355.091399] [] ret_from_fork+0x10/0x20 WBR, Vadim