Received: by 2002:a05:6a10:f3d0:0:0:0:0 with SMTP id a16csp557359pxv; Fri, 9 Jul 2021 04:14:16 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwwA427Glk7jM0zIGMCV5yOJeWp4+WF3MU1DfQXKHmE7IeGiLRe/J6NTdb+h4oN+wMhZp1f X-Received: by 2002:a17:906:58c9:: with SMTP id e9mr2623902ejs.144.1625829256132; Fri, 09 Jul 2021 04:14:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1625829256; cv=none; d=google.com; s=arc-20160816; b=kjyARsHMwloFiQXUSG/AU6qYjpH/YmQ0A4FBhNL+o4TDxmEHsERf8Q/3LiQiU0JHEM fc5R111VRImKt0HSymO++ocPfN7SatwU+Bfno0QJA27MOrjTt7ftksb9+o4zJ2D1Qrts iYw/cCOoasp5abmIkeAsnvY4e9D1LL9sZqJx6I1KTefgIiLa9yB7J0AumlqCnX5fmX6K w7HqyXOS7cDeMjDxZZku8OCyk7hfvLzovBkhyCQFvCkM3gcCk+NXH93KDsg6y4/28d7z qb2GFEXpanJOBkpbEmguU9UTKAjjhyP4lqjONbnFOjOKy28p8n+y4P7yJ6pYTIUs3x/m GdJw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:mime-version :dkim-signature; bh=BXx/rJU2Hzky+L1BmqciSMIFJorhvgMmK6l2+q1YeGo=; b=nqxmdtrb/MFXCkhj5kuwqJN2eTS+nM982qO3jHq9/vwodq9JmTfPl6FIL2QMGDOUkB wLbGo8IZHFCbRK2clcqpMwQ8PUFr4zfa7RG83VdofPCzhidw+55BtOsa8pXz3UgN0gJk 9de3n5/OxD3RK1F1QrzYYPF8YEtp+iFe+O33GNqk/QlRc0gqyimFKovlMPvfq3t2ErBa O1wclEcBjF8MQqavYALeI+mEXDL9Yjt7Irs50goDhpouvUPi2Q8jtWWMnb++eqKAxqQ9 tcEVJGTIvFeXDKKyg9jq1aaltiN/Hw/hO2LQu/yVFgWCGvdXldL6shf0U0TDs8Cek3aE LzCw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=K+SXz2DY; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id ce5si6577602ejb.237.2021.07.09.04.13.52; Fri, 09 Jul 2021 04:14:16 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=K+SXz2DY; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230372AbhGILPm (ORCPT + 99 others); Fri, 9 Jul 2021 07:15:42 -0400 Received: from us-smtp-delivery-124.mimecast.com ([170.10.133.124]:42525 "EHLO us-smtp-delivery-124.mimecast.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229861AbhGILPl (ORCPT ); Fri, 9 Jul 2021 07:15:41 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1625829178; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type; bh=BXx/rJU2Hzky+L1BmqciSMIFJorhvgMmK6l2+q1YeGo=; b=K+SXz2DYcddgVr/mFclerqTmLCaeIOaEqln+INDgY0e4BabbWBEeq6bE3Ljezoc/7UaB7i L8CmfSZFZqXuPGf4cTdKGNi8vBResVDYs3/XezY47dOHU9MwicAUphznQFUMz9Kc6WC071 ccVh3g2o6SjdnEYLkkvVwCyqZqAvGwI= Received: from mail-oi1-f197.google.com (mail-oi1-f197.google.com [209.85.167.197]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-592-bbi7CzqZNsyLvaCPYEHvHg-1; Fri, 09 Jul 2021 07:12:56 -0400 X-MC-Unique: bbi7CzqZNsyLvaCPYEHvHg-1 Received: by mail-oi1-f197.google.com with SMTP id w2-20020a0568081402b029023e9cab7ab8so6417934oiv.13 for ; Fri, 09 Jul 2021 04:12:56 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to:cc; bh=BXx/rJU2Hzky+L1BmqciSMIFJorhvgMmK6l2+q1YeGo=; b=XjfSbjoOkBylyRBAYpZZ41sslf2Myr66wxUDoqeewJjxuFlTv46Ac66QBy4LjRqiYG ENtkh02bYiImy6aajWS6gBbQwgHz7hnV47GBVmX/e3iPDIAHCl8oDGltkrgWd6nvfDPp gBjmLR7CiXtYdej/JUO+BfFJPAnhv/Vm6ApMpiV+x8B7waXWOpSjW2LXBElhUEpelfvd QKirt2nsMUg/5uIJkxV7e9J4D88uR7RYfA4ILoECc2IVZ4BPbWyqjiM2SW7KtlQC5yXM BZ3F9ZD1dJrqF1T12mRqaHcjBdL98iMlh29TYCwKEDL8MioLFptOuGwwZRI/4Pcwlmof UqSA== X-Gm-Message-State: AOAM532AjKkcQ0G2lU9Qw5llN6yo7WukZsSfW3cPfzJgILV5ss+MIsVj Uw5TlT84nqrndjft3u4zgmZ5C540j1OOQqRx5eJwFJADayEk/PZpiPOqEm9IidW0ofwLilOlvkr R8F7KBKOWGL6s5zy5+vcUZkvDSbSQuWxOHMXLyIOz X-Received: by 2002:a9d:6b02:: with SMTP id g2mr24448181otp.234.1625829174957; Fri, 09 Jul 2021 04:12:54 -0700 (PDT) X-Received: by 2002:a9d:6b02:: with SMTP id g2mr24448167otp.234.1625829174728; Fri, 09 Jul 2021 04:12:54 -0700 (PDT) MIME-Version: 1.0 From: Bruno Goncalves Date: Fri, 9 Jul 2021 13:12:43 +0200 Message-ID: Subject: WARNING: CPU: 5 PID: 0 at kernel/sched/fair.c:3306 To: CKI Project , linux-kernel@vger.kernel.org Cc: nathan@kernel.org, Xiong Zhou , Juri Lelli , Memory Management Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hello, Since this commit (Commit: 9269d27e519a - Merge tag 'timers-nohz-2021-06-28') we started to see the following call trace. [ 1765.915152] ------------[ cut here ]------------ [ 1765.970347] cfs_rq->avg.load_avg || cfs_rq->avg.util_avg || cfs_rq->avg.runnable_avg [ 1765.970352] WARNING: CPU: 5 PID: 0 at kernel/sched/fair.c:3306 update_blocked_averages+0x8e4/0x940 [ 1766.170307] Modules linked in: dm_log_writes dm_flakey rfkill mlx4_ib ipmi_ssif ib_uverbs ib_core mlx4_en intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp acpi_ipmi sunrpc rapl ipmi_si iTCO_wdt intel_cstate intel_pmc_bxt iTCO_vendor_support gpio_ich intel_uncore pcspkr ipmi_devintf lpc_ich ipmi_msghandler mlx4_core fuse zram ip_tables x_tables xfs i915 crct10dif_pclmul crc32_pclmul crc32c_intel i2c_algo_bit drm_kms_helper ghash_clmulni_intel cec drm video [ 1766.685909] CPU: 5 PID: 0 Comm: swapper/5 Tainted: G W 5.13.0 #1 [ 1766.773390] Hardware name: HP ProLiant m710 Server Cartridge/, BIOS H03 04/26/2019 [ 1766.863991] RIP: 0010:update_blocked_averages+0x8e4/0x940 [ 1766.928557] Code: c7 c7 47 b5 5f 9b c6 05 3d ff 03 02 01 e8 29 16 b3 00 e9 bc fe ff ff 48 c7 c7 b8 c6 5f 9b c6 05 63 0f 04 02 01 e8 ce d3 b2 00 <0f> 0b 8b 83 78 01 00 00 e9 2d fb ff ff 48 c7 c7 70 c1 5f 9b c6 05 [ 1767.153523] RSP: 0018:ffffa2b5c01b0ee8 EFLAGS: 00010092 [ 1767.216002] RAX: 0000000000000048 RBX: ffff91eb87ad9400 RCX: 0000000000000027 [ 1767.301394] RDX: ffff91f25f358e18 RSI: 0000000000000001 RDI: ffff91f25f358e10 [ 1767.386785] RBP: ffffa2b5c01b0f60 R08: 0000000000000000 R09: ffffa2b5c01b0d28 [ 1767.472178] R10: ffffa2b5c01b0d20 R11: ffffffff9bf76208 R12: ffff91f25f36c900 [ 1767.557568] R13: ffff91eb87ad9580 R14: 0000000000000001 R15: 00000198729d28c5 [ 1767.642962] FS: 0000000000000000(0000) GS:ffff91f25f340000(0000) knlGS:0000000000000000 [ 1767.739811] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 1767.808538] CR2: 00007f9192568590 CR3: 00000001079e6005 CR4: 00000000001706e0 [ 1767.893931] Call Trace: [ 1767.923077] [ 1767.947017] run_rebalance_domains+0x44/0x60 [ 1767.998039] __do_softirq+0xde/0x480 [ 1768.040734] __irq_exit_rcu+0xe4/0x110 [ 1768.085507] irq_exit_rcu+0xa/0x20 [ 1768.126111] sysvec_apic_timer_interrupt+0x72/0x90 [ 1768.183385] [ 1768.208365] asm_sysvec_apic_timer_interrupt+0x12/0x20 [ 1768.269805] RIP: 0010:cpuidle_enter_state+0x104/0x470 [ 1768.330209] Code: 48 0f a3 05 1e 63 85 01 0f 82 63 02 00 00 31 ff e8 21 3d 7c ff 45 84 ff 0f 85 dd 01 00 00 e8 63 8f 8b ff fb 66 0f 1f 44 00 00 <45> 85 f6 0f 88 fb 00 00 00 49 63 d6 4c 2b 2c 24 48 8d 04 52 48 8d [ 1768.555179] RSP: 0018:ffffa2b5c00bbeb0 EFLAGS: 00000246 [ 1768.617657] RAX: 0000000080000001 RBX: 0000000000000001 RCX: 000000000000001f [ 1768.703048] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffffff9a9fb21d [ 1768.788440] RBP: ffffc2b5bfd40260 R08: 0000000000000000 R09: 0000000000000000 [ 1768.873832] R10: 000000000000002a R11: 000000000000000d R12: ffffffff9c1043a0 [ 1768.959226] R13: 0000019b28beb466 R14: 0000000000000001 R15: 0000000000000000 [ 1769.044620] ? cpuidle_enter_state+0xfd/0x470 [ 1769.096688] cpuidle_enter+0x29/0x40 [ 1769.139377] do_idle+0x1e9/0x290 [ 1769.177898] cpu_startup_entry+0x19/0x20 [ 1769.224757] secondary_startup_64_no_verify+0xc2/0xcb [ 1769.285164] irq event stamp: 2874708 [ 1769.327849] hardirqs last enabled at (2874707): [] tick_nohz_idle_enter+0x65/0x90 [ 1769.437205] hardirqs last disabled at (2874708): [] do_idle+0xad/0x290 [ 1769.534062] softirqs last enabled at (2874690): [] __irq_exit_rcu+0xe4/0x110 [ 1769.638207] softirqs last disabled at (2874685): [] __irq_exit_rcu+0xe4/0x110 [ 1769.742350] ---[ end trace fffd33f79ba8504e ]--- We hit this issue running different tests and on different arches. [1] https://arr-cki-prod-datawarehouse-public.s3.amazonaws.com/datawarehouse-public/2021/06/28/328536464/build_x86_64_redhat%3A1383561589/tests/xfstests_xfs/10210046_x86_64_2_dmesg.log [2] https://arr-cki-prod-datawarehouse-public.s3.amazonaws.com/datawarehouse-public/2021/06/28/328536464/build_aarch64_redhat%3A1383561593/tests/xfstests_xfs/10210051_aarch64_2_dmesg.log [3] https://arr-cki-prod-datawarehouse-public.s3.amazonaws.com/datawarehouse-public/2021/06/28/328536464/build_x86_64_redhat%3A1383561589/tests/LTP/10210045_x86_64_1_dmesg.log Thank you, Bruno Goncalves