Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp3484156imu; Sun, 11 Nov 2018 16:08:50 -0800 (PST) X-Google-Smtp-Source: AJdET5eUC4Q0TB5vtp6SSaOCBvIAwxrSlueO3A+SFJurZHjspwjLOKQ6/xlL6SFz8/tYe07Knpk1 X-Received: by 2002:a63:9306:: with SMTP id b6mr14974471pge.36.1541981330420; Sun, 11 Nov 2018 16:08:50 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1541981330; cv=none; d=google.com; s=arc-20160816; b=PeDPAqTzSux0K2r3hLfWBBrj6sybE8+8kftz+XeqGqQ0qunjEuSPE+9psWJqY8rqqh T2fE9Suc0Ab1b1UWDgedQByreuIRD3cPcfwY1PC0Y5NwqDV9mTQOgtaAABdfrMv3fvGa c3UwSvbjdaSR2dH7xgVqvgvmIAM0u0xGwa4m6IGZ/Jeibt0XKt0DzfmRQTLsYLEBvwQl KF3/oIF/RUGLH8VSyzUYuEzcjNWrGpehpbdAAPsTjjXDHRSzNKpSRx9b4BmCfEhBvM4S vZFnF+b/4MPf/3BXf2iNb3vTR+KX8zaBHGnFYNKoarNT8rQ9h7K73mV0YMiBSjIlls4c H2hw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=z9eDy2Pn9gZzDogXOJhNwS8HePdvOxS3ediwoh5+DWU=; b=UUFN9ccvu6guihqvZXjg6+n2mr1JNSOkJ80gAhJF/F5G/cpLfe6C+tB/I+CCcl2z0L uY1T2K/7KTv7tTRNkmz09QuJpEwO6JMFSe4IyA9vREee7Eghx2hsyVe5YQ/RuhcI5BfE DjtOa775kuB/Ck9LX7oVoTvGu4dD65xVJ+1Pd26jLXQlVvyzN2Cv5VHfLdwKj2zYkxHa rzSVM0I1ueKyWJloqw6G/i8E+Agjb9IJqOweeLoWHn/2/b6Z4t8gpvKC10MsZJpFkmqb PVN0Q5fgWc3fgpBwqWqEs0XCGYQbVF8Pl2SzXi88ay8BLpmTVGtWAqCLU5yitOkYz/y1 p3XQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=MhiWJr34; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 22si14827001pgr.356.2018.11.11.16.08.35; Sun, 11 Nov 2018 16:08:50 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=MhiWJr34; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732075AbeKLJ6k (ORCPT + 99 others); Mon, 12 Nov 2018 04:58:40 -0500 Received: from mail.kernel.org ([198.145.29.99]:60776 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1731375AbeKLIRc (ORCPT ); Mon, 12 Nov 2018 03:17:32 -0500 Received: from localhost (unknown [206.108.79.134]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 06D4B22353; Sun, 11 Nov 2018 22:27:37 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1541975257; bh=Bg9CvmJmM8wPC5BNn0d2BXZy6/2TlFry+XafXGHchs8=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=MhiWJr34p7fa+dvMDgwzwbWP4QTvNEYut/EuL7p1sytwQy+SeFxze49UVtqPSEiE/ nDc4IlL4ReD9xhNWhmmCla/+yk2N5GPo7D43yhybDI2R3EWVfQqYKXd7ew8cMDd0XW xfo4CDQ9VP4kGdK4GEAAUnf4KlfEpPRyf7tvmWLk= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Michal Hocko , Tejun Heo , Sasha Levin Subject: [PATCH 4.19 131/361] cgroup, netclassid: add a preemption point to write_classid Date: Sun, 11 Nov 2018 14:17:58 -0800 Message-Id: <20181111221638.657612508@linuxfoundation.org> X-Mailer: git-send-email 2.19.1 In-Reply-To: <20181111221619.915519183@linuxfoundation.org> References: <20181111221619.915519183@linuxfoundation.org> User-Agent: quilt/0.65 X-stable: review MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 4.19-stable review patch. If anyone has any objections, please let me know. ------------------ From: Michal Hocko [ Upstream commit a90e90b7d55e789c71d85b946ffb5c1ab2f137ca ] We have seen a customer complaining about soft lockups on !PREEMPT kernel config with 4.4 based kernel [1072141.435366] NMI watchdog: BUG: soft lockup - CPU#21 stuck for 22s! [systemd:1] [1072141.444090] Modules linked in: mpt3sas raid_class binfmt_misc af_packet 8021q garp mrp stp llc xfs libcrc32c bonding iscsi_ibft iscsi_boot_sysfs msr ext4 crc16 jbd2 mbcache cdc_ether usbnet mii joydev hid_generic usbhid intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel ipmi_ssif mgag200 i2c_algo_bit ttm ipmi_devintf drbg ixgbe drm_kms_helper vxlan ansi_cprng ip6_udp_tunnel drm aesni_intel udp_tunnel aes_x86_64 iTCO_wdt syscopyarea ptp xhci_pci lrw iTCO_vendor_support pps_core gf128mul ehci_pci glue_helper sysfillrect mdio pcspkr sb_edac ablk_helper cryptd ehci_hcd sysimgblt xhci_hcd fb_sys_fops edac_core mei_me lpc_ich ses usbcore enclosure dca mfd_core ipmi_si mei i2c_i801 scsi_transport_sas usb_common ipmi_msghandler shpchp fjes wmi processor button acpi_pad btrfs xor raid6_pq sd_mod crc32c_intel megaraid_sas sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua scsi_mod md_mod autofs4 [1072141.444146] Supported: Yes [1072141.444149] CPU: 21 PID: 1 Comm: systemd Not tainted 4.4.121-92.80-default #1 [1072141.444150] Hardware name: LENOVO Lenovo System x3650 M5 -[5462P4U]- -[5462P4U]-/01GR451, BIOS -[TCE136H-2.70]- 06/13/2018 [1072141.444151] task: ffff880191bd0040 ti: ffff880191bd4000 task.ti: ffff880191bd4000 [1072141.444153] RIP: 0010:[] [] update_classid_sock+0x29/0x40 [1072141.444157] RSP: 0018:ffff880191bd7d58 EFLAGS: 00000286 [1072141.444158] RAX: ffff883b177cb7c0 RBX: 0000000000000000 RCX: 0000000000000000 [1072141.444159] RDX: 00000000000009c7 RSI: ffff880191bd7d5c RDI: ffff8822e29bb200 [1072141.444160] RBP: ffff883a72230980 R08: 0000000000000101 R09: 0000000000000000 [1072141.444161] R10: 0000000000000008 R11: f000000000000000 R12: ffffffff815229d0 [1072141.444162] R13: 0000000000000000 R14: ffff881fd0a47ac0 R15: ffff880191bd7f28 [1072141.444163] FS: 00007f3e2f1eb8c0(0000) GS:ffff882000340000(0000) knlGS:0000000000000000 [1072141.444164] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [1072141.444165] CR2: 00007f3e2f200000 CR3: 0000001ffea4e000 CR4: 00000000001606f0 [1072141.444166] Stack: [1072141.444166] ffffffa800000246 00000000000009c7 ffffffff8121d583 ffff8818312a05c0 [1072141.444168] ffff8818312a1100 ffff880197c3b280 ffff881861422858 ffffffffffffffea [1072141.444170] ffffffff81522b1c ffffffff81d0ca20 ffff8817fa17b950 ffff883fdd8121e0 [1072141.444171] Call Trace: [1072141.444179] [] iterate_fd+0x53/0x80 [1072141.444182] [] write_classid+0x4c/0x80 [1072141.444187] [] cgroup_file_write+0x9b/0x100 [1072141.444193] [] kernfs_fop_write+0x11b/0x150 [1072141.444198] [] __vfs_write+0x26/0x100 [1072141.444201] [] vfs_write+0x9d/0x190 [1072141.444203] [] SyS_write+0x42/0xa0 [1072141.444207] [] entry_SYSCALL_64_fastpath+0x1e/0xca [1072141.445490] DWARF2 unwinder stuck at entry_SYSCALL_64_fastpath+0x1e/0xca If a cgroup has many tasks with many open file descriptors then we would end up in a large loop without any rescheduling point throught the operation. Add cond_resched once per task. Signed-off-by: Michal Hocko Signed-off-by: Tejun Heo Signed-off-by: Sasha Levin Signed-off-by: Greg Kroah-Hartman --- net/core/netclassid_cgroup.c | 1 + 1 file changed, 1 insertion(+) --- a/net/core/netclassid_cgroup.c +++ b/net/core/netclassid_cgroup.c @@ -106,6 +106,7 @@ static int write_classid(struct cgroup_s iterate_fd(p->files, 0, update_classid_sock, (void *)(unsigned long)cs->classid); task_unlock(p); + cond_resched(); } css_task_iter_end(&it);