From: Sasha Levin
To: stable@vger.kernel.org, linux-kernel@vger.kernel.org
Cc: Michal Hocko, Tejun Heo, Sasha Levin
Subject: [PATCH AUTOSEL 4.19 086/146] cgroup, netclassid: add a preemption point to write_classid
Date: Wed, 31 Oct 2018 19:04:41 -0400
Message-Id: <20181031230541.28822-86-sashal@kernel.org>
X-Mailer: git-send-email 2.17.1
In-Reply-To: <20181031230541.28822-1-sashal@kernel.org>
References: <20181031230541.28822-1-sashal@kernel.org>
Sender: linux-kernel-owner@vger.kernel.org
Precedence: bulk
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

From: Michal Hocko

[ Upstream commit a90e90b7d55e789c71d85b946ffb5c1ab2f137ca ]

We have seen a customer complaining about soft lockups on a !PREEMPT
kernel config with a 4.4-based kernel:

[1072141.435366] NMI watchdog: BUG: soft lockup - CPU#21 stuck for 22s! [systemd:1]
[1072141.444090] Modules linked in: mpt3sas raid_class binfmt_misc af_packet 8021q garp mrp stp llc xfs libcrc32c bonding iscsi_ibft iscsi_boot_sysfs msr ext4 crc16 jbd2 mbcache cdc_ether usbnet mii joydev hid_generic usbhid intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel ipmi_ssif mgag200 i2c_algo_bit ttm ipmi_devintf drbg ixgbe drm_kms_helper vxlan ansi_cprng ip6_udp_tunnel drm aesni_intel udp_tunnel aes_x86_64 iTCO_wdt syscopyarea ptp xhci_pci lrw iTCO_vendor_support pps_core gf128mul ehci_pci glue_helper sysfillrect mdio pcspkr sb_edac ablk_helper cryptd ehci_hcd sysimgblt xhci_hcd fb_sys_fops edac_core mei_me lpc_ich ses usbcore enclosure dca mfd_core ipmi_si mei i2c_i801 scsi_transport_sas usb_common ipmi_msghandler shpchp fjes wmi processor button acpi_pad btrfs xor raid6_pq sd_mod crc32c_intel megaraid_sas sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua scsi_mod md_mod autofs4
[1072141.444146] Supported: Yes
[1072141.444149] CPU: 21 PID: 1 Comm: systemd Not tainted 4.4.121-92.80-default #1
[1072141.444150] Hardware name: LENOVO Lenovo System x3650 M5 -[5462P4U]- -[5462P4U]-/01GR451, BIOS -[TCE136H-2.70]- 06/13/2018
[1072141.444151] task: ffff880191bd0040 ti: ffff880191bd4000 task.ti: ffff880191bd4000
[1072141.444153] RIP: 0010:[] [] update_classid_sock+0x29/0x40
[1072141.444157] RSP: 0018:ffff880191bd7d58 EFLAGS: 00000286
[1072141.444158] RAX: ffff883b177cb7c0 RBX: 0000000000000000 RCX: 0000000000000000
[1072141.444159] RDX: 00000000000009c7 RSI: ffff880191bd7d5c RDI: ffff8822e29bb200
[1072141.444160] RBP: ffff883a72230980 R08: 0000000000000101 R09: 0000000000000000
[1072141.444161] R10: 0000000000000008 R11: f000000000000000 R12: ffffffff815229d0
[1072141.444162] R13: 0000000000000000 R14: ffff881fd0a47ac0 R15: ffff880191bd7f28
[1072141.444163] FS: 00007f3e2f1eb8c0(0000) GS:ffff882000340000(0000) knlGS:0000000000000000
[1072141.444164] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[1072141.444165] CR2: 00007f3e2f200000 CR3: 0000001ffea4e000 CR4: 00000000001606f0
[1072141.444166] Stack:
[1072141.444166] ffffffa800000246 00000000000009c7 ffffffff8121d583 ffff8818312a05c0
[1072141.444168] ffff8818312a1100 ffff880197c3b280 ffff881861422858 ffffffffffffffea
[1072141.444170] ffffffff81522b1c ffffffff81d0ca20 ffff8817fa17b950 ffff883fdd8121e0
[1072141.444171] Call Trace:
[1072141.444179] [] iterate_fd+0x53/0x80
[1072141.444182] [] write_classid+0x4c/0x80
[1072141.444187] [] cgroup_file_write+0x9b/0x100
[1072141.444193] [] kernfs_fop_write+0x11b/0x150
[1072141.444198] [] __vfs_write+0x26/0x100
[1072141.444201] [] vfs_write+0x9d/0x190
[1072141.444203] [] SyS_write+0x42/0xa0
[1072141.444207] [] entry_SYSCALL_64_fastpath+0x1e/0xca
[1072141.445490] DWARF2 unwinder stuck at entry_SYSCALL_64_fastpath+0x1e/0xca

If a cgroup has many tasks with many open file descriptors, then we
end up in a large loop without any rescheduling point throughout the
operation. Add a cond_resched() once per task.

Signed-off-by: Michal Hocko
Signed-off-by: Tejun Heo
Signed-off-by: Sasha Levin
---
 net/core/netclassid_cgroup.c | 1 +
 1 file changed, 1 insertion(+)

diff --git a/net/core/netclassid_cgroup.c b/net/core/netclassid_cgroup.c
index 5e4f04004a49..7bf833598615 100644
--- a/net/core/netclassid_cgroup.c
+++ b/net/core/netclassid_cgroup.c
@@ -106,6 +106,7 @@ static int write_classid(struct cgroup_subsys_state *css, struct cftype *cft,
 		iterate_fd(p->files, 0, update_classid_sock,
 			   (void *)(unsigned long)cs->classid);
 		task_unlock(p);
+		cond_resched();
 	}
 	css_task_iter_end(&it);
 
-- 
2.17.1
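
For readers unfamiliar with the pattern, here is a minimal user-space sketch of the loop shape the patch targets: an outer loop over tasks, an inner walk over each task's descriptors, and one voluntary yield point per task. It is an analogy only, not kernel code. The names demo_task, demo_fd_fn, demo_for_each_task_fd and demo_count_fd are hypothetical, and sched_yield() is used as a rough stand-in for cond_resched(). The two differ: cond_resched() reschedules only when a reschedule is pending, while sched_yield() always gives up the CPU.

```c
#include <assert.h>
#include <sched.h>    /* sched_yield(): POSIX, user-space stand-in for cond_resched() */
#include <stddef.h>

/* Hypothetical stand-in for a task and the descriptors it holds open. */
struct demo_task {
	const int *fds;   /* descriptors owned by this task */
	size_t nfds;      /* number of entries in fds */
};

/* Hypothetical per-descriptor callback, similar in role to update_classid_sock(). */
typedef void (*demo_fd_fn)(int fd, void *arg);

/*
 * Visit every descriptor of every task and yield the CPU once per task,
 * so that a large task list cannot monopolize the CPU for the whole walk.
 * Returns the number of yield points reached (one per task).
 */
size_t demo_for_each_task_fd(const struct demo_task *tasks, size_t ntasks,
			     demo_fd_fn fn, void *arg)
{
	size_t yields = 0;

	assert(tasks != NULL || ntasks == 0);
	assert(fn != NULL);

	for (size_t i = 0; i < ntasks; i++) {
		/* Inner walk: analogous to iterate_fd() over p->files. */
		for (size_t j = 0; j < tasks[i].nfds; j++)
			fn(tasks[i].fds[j], arg);

		/* Yield point placed after the per-task work, as in the patch. */
		sched_yield();
		yields++;
	}
	return yields;
}

/* Example callback: count how many descriptors were visited. */
void demo_count_fd(int fd, void *arg)
{
	(void)fd;
	(*(size_t *)arg)++;
}
```

Yielding once per task rather than once per descriptor keeps the overhead proportional to the number of tasks while still bounding how long the walk runs without a scheduling point, provided no single task holds an extreme number of descriptors.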