Subject: Re: WARNING bisected (was Re: [PATCH v7 08/10] mm: rework non-root kmem_cache lifecycle management)
To: Roman Gushchin
Cc: "akpm@linux-foundation.org", "hannes@cmpxchg.org", Kernel Team, "linux-kernel@vger.kernel.org", "linux-mm@kvack.org", "longman@redhat.com", "shakeelb@google.com", "vdavydov.dev@gmail.com", Heiko Carstens, Janosch Frank, "kvm@vger.kernel.org"
References: <20190611231813.3148843-9-guro@fb.com> <20191121111739.3054-1-borntraeger@de.ibm.com> <20191121165807.GA201621@localhost.localdomain>
From: Christian Borntraeger
Date: Thu, 21 Nov 2019 17:59:54 +0100
In-Reply-To: <20191121165807.GA201621@localhost.localdomain>
X-Mailing-List: linux-kernel@vger.kernel.org

On 21.11.19 17:58, Roman Gushchin wrote:
> On Thu, Nov 21, 2019 at 12:17:39PM +0100, Christian Borntraeger wrote:
>> Folks,
>>
>> I do get errors like the following when running a new testcase in our KVM CI.
>> The test basically unloads kvm, reloads with hpage=1 (enable huge page
>> support for guests on s390), start a guest with libvirt and hugepages, shut the
>> guest down and unload the kvm module.
>>
>> WARNING: CPU: 8 PID: 208 at lib/percpu-refcount.c:108 percpu_ref_exit+0x50/0x58
>> Modules linked in: kvm(-) xt_CHECKSUM xt_MASQUERADE bonding xt_tcpudp ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_conntrack ip6table_na>
>> CPU: 8 PID: 208 Comm: kworker/8:1 Not tainted 5.2.0+ #66
>> Hardware name: IBM 2964 NC9 712 (LPAR)
>> Workqueue: events sysfs_slab_remove_workfn
>> Krnl PSW : 0704e00180000000 0000001529746850 (percpu_ref_exit+0x50/0x58)
>>            R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 RI:0 EA:3
>> Krnl GPRS: 00000000ffff8808 0000001529746740 000003f4e30e8e18 0036008100000000
>>            0000001f00000000 0035008100000000 0000001fb3573ab8 0000000000000000
>>            0000001fbdb6de00 0000000000000000 0000001529f01328 0000001fb3573b00
>>            0000001fbb27e000 0000001fbdb69300 000003e009263d00 000003e009263cd0
>> Krnl Code: 0000001529746842: f0a0000407fe        srp     4(11,%r0),2046,0
>>            0000001529746848: 47000700            bc      0,1792
>>           #000000152974684c: a7f40001            brc     15,152974684e
>>           >0000001529746850: a7f4fff2            brc     15,1529746834
>>            0000001529746854: 0707                bcr     0,%r7
>>            0000001529746856: 0707                bcr     0,%r7
>>            0000001529746858: eb8ff0580024        stmg    %r8,%r15,88(%r15)
>>            000000152974685e: a738ffff            lhi     %r3,-1
>> Call Trace:
>> ([<000003e009263d00>] 0x3e009263d00)
>>  [<00000015293252ea>] slab_kmem_cache_release+0x3a/0x70
>>  [<0000001529b04882>] kobject_put+0xaa/0xe8
>>  [<000000152918cf28>] process_one_work+0x1e8/0x428
>>  [<000000152918d1b0>] worker_thread+0x48/0x460
>>  [<00000015291942c6>] kthread+0x126/0x160
>>  [<0000001529b22344>] ret_from_fork+0x28/0x30
>>  [<0000001529b2234c>] kernel_thread_starter+0x0/0x10
>> Last Breaking-Event-Address:
>>  [<000000152974684c>] percpu_ref_exit+0x4c/0x58
>> ---[ end trace b035e7da5788eb09 ]---
>>
>> I have bisected this to
>> # first bad commit: [f0a3a24b532d9a7e56a33c5112b2a212ed6ec580] mm: memcg/slab: rework non-root kmem_cache lifecycle management
>>
>> unmounting /sys/fs/cgroup/memory/ before the test makes the problem go away so
>> it really seems to be related to the new percpu-refs from this patch.
>>
>> Any ideas?
>
> Hello, Christian!
>
> It seems to be a race between releasing of the root kmem_cache (caused by rmmod)
> and a memcg kmem_cache. Does delaying rmmod for say, a minute, "resolve" the
> issue?

Yes, rmmod has to be called directly after the guest shutdown to see the issue.
See my 2nd mail.
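
For readers following the thread, the test sequence described in the quoted report might be sketched roughly as below. This is only an illustration under assumptions, not the actual CI testcase: the guest name `testguest`, the helper names and the `delay_before_unload` parameter are hypothetical, and the guest is assumed to be already defined in libvirt with hugepage backing. Only `hpage=1` and the order of the steps come from the report. By default the function just lists the commands and runs nothing.

```python
import subprocess
import time


def repro_steps(guest):
    """Command sequence from the report, as argv lists (guest name is hypothetical)."""
    return [
        ["modprobe", "-r", "kvm"],       # unload kvm
        ["modprobe", "kvm", "hpage=1"],  # reload with huge page support for guests
        ["virsh", "start", guest],       # start the hugepage-backed guest via libvirt
        ["virsh", "shutdown", guest],    # shut the guest down
        ["modprobe", "-r", "kvm"],       # unload the kvm module again
    ]


def guest_is_off(guest):
    """True once libvirt reports the domain state as 'shut off'."""
    out = subprocess.run(["virsh", "domstate", guest],
                         capture_output=True, text=True, check=True)
    return out.stdout.strip() == "shut off"


def run_repro(guest="testguest", delay_before_unload=0.0, dry_run=True):
    """Run the steps, or only list them when dry_run is set; returns the command lines."""
    steps = repro_steps(guest)
    lines = []
    for i, argv in enumerate(steps):
        is_final_unload = i == len(steps) - 1
        if is_final_unload and not dry_run:
            # 'virsh shutdown' returns before the guest is actually down.
            while not guest_is_off(guest):
                time.sleep(1)
            # 0 unloads directly after shutdown; e.g. 60 tries the suggested delay.
            time.sleep(delay_before_unload)
        lines.append(" ".join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)
    return lines


if __name__ == "__main__":
    for line in run_repro():
        print(line)
```

Calling `run_repro(dry_run=False, delay_before_unload=60)` as root would correspond to the "delay rmmod for a minute" experiment asked about above, while the default delay of 0 matches unloading directly after the guest shutdown.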