Message-ID: <1582051267.7365.96.camel@lca.pw>
Subject: Re: [PATCH v12 1/9] hugetlb_cgroup: Add hugetlb_cgroup reservation counter
From: Qian Cai
To: Mina Almasry
Cc: Andrew Morton, Mike Kravetz, shuah, David Rientjes, Shakeel Butt, Greg Thelen, open list, linux-mm@kvack.org, linux-kselftest@vger.kernel.org, cgroups@vger.kernel.org
Date: Tue, 18 Feb 2020 13:41:07 -0500
References: <20200211213128.73302-1-almasrymina@google.com> <20200211151906.637d1703e4756066583b89da@linux-foundation.org> <1582035660.7365.90.camel@lca.pw>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, 2020-02-18 at 10:35 -0800, Mina Almasry wrote:
> On Tue, Feb 18, 2020 at 6:21 AM Qian Cai wrote:
> >
> > On Tue, 2020-02-11 at 15:19 -0800, Andrew Morton wrote:
> > > On Tue, 11 Feb 2020 13:31:20 -0800 Mina Almasry wrote:
> > >
> > > > These counters will track hugetlb reservations rather than hugetlb
> > > > memory faulted in. This patch only adds the counter, following patches
> > > > add the charging and uncharging of the counter.
> > >
> > > We're still pretty thin on review here, but as it's v12 and Mike
> > > appears to be signed up to look at this work, I'll add them to -next to
> > > help move things forward.
> > >
> > >
> > Reverted the whole series on the top of next-20200217 fixed a crash below (I
> > don't see anything in next-20200218 would make any differences).
> >
> > [ 7933.691114][T35046] LTP: starting hugemmap06
> > [ 7933.806377][T14355] ------------[ cut here ]------------
> > [ 7933.806541][T14355] kernel BUG at mm/hugetlb.c:490!
> > VM_BUG_ON(t - f <= 1);
> > [ 7933.806562][T14355] Oops: Exception in kernel mode, sig: 5 [#1]
> > [ 7933.806573][T14355] LE PAGE_SIZE=64K MMU=Radix SMP NR_CPUS=256
> > DEBUG_PAGEALLOC NUMA PowerNV
> > [ 7933.806594][T14355] Modules linked in: kvm_hv kvm brd ext4 crc16 mbcache jbd2
> > loop ip_tables x_tables xfs sd_mod bnx2x ahci mdio libahci tg3 libata libphy
> > firmware_class dm_mirror dm_region_hash dm_log dm_mod [last unloaded:
> > binfmt_misc]
> > [ 7933.806651][T14355] CPU: 54 PID: 14355 Comm: hugemmap06 Tainted:
> > G O 5.6.0-rc2-next-20200217 #1
> > [ 7933.806674][T14355] NIP: c00000000040d22c LR: c00000000040d210 CTR:
> > 0000000000000000
> > [ 7933.806696][T14355] REGS: c0000014b71ef660 TRAP: 0700 Tainted:
> > G O (5.6.0-rc2-next-20200217)
> > [ 7933.806727][T14355] MSR: 900000000282b033
> > CR: 22022228 XER: 00000000
> > [ 7933.806772][T14355] CFAR: c00000000040cbec IRQMASK: 0
> > [ 7933.806772][T14355] GPR00: c00000000040d210 c0000014b71ef8f0 c000000001657000
> > 0000000000000001
> > [ 7933.806772][T14355] GPR04: 0000000000000012 0000000000000013 0000000000000000
> > 0000000000000000
> > [ 7933.806772][T14355] GPR08: 0000000000000002 0000000000000002 0000000000000001
> > 0000000000000036
> > [ 7933.806772][T14355] GPR12: 0000000022022222 c000001ffffd3d00 00007fffad670000
> > 00007fffa4bc0000
> > [ 7933.806772][T14355] GPR16: 0000000000000000 c000000001567178 c0000014b71efa50
> > 0000000000000000
> > [ 7933.806772][T14355] GPR20: 0000000000000000 0000000000000013 0000000000000012
> > 0000000000000001
> > [ 7933.806772][T14355] GPR24: c0000019f74cd270 5deadbeef0000100 5deadbeef0000122
> > c0000019f74cd2c0
> > [ 7933.806772][T14355] GPR28: 0000000000000001 c0000019f74cd268 c0000014b71ef918
> > 0000000000000001
> > [ 7933.806961][T14355] NIP [c00000000040d22c] region_add+0x11c/0x3a0
> > [ 7933.806980][T14355] LR [c00000000040d210] region_add+0x100/0x3a0
> > [ 7933.807008][T14355] Call Trace:
> > [ 7933.807024][T14355] [c0000014b71ef8f0] [c00000000040d210]
> > region_add+0x100/0x3a0 (unreliable)
> > [ 7933.807056][T14355] [c0000014b71ef9b0] [c00000000040e0c8]
> > __vma_reservation_common+0x148/0x210
> > __vma_reservation_common at mm/hugetlb.c:2150
> > [ 7933.807087][T14355] [c0000014b71efa20] [c0000000004132a0]
> > alloc_huge_page+0x350/0x830
> > alloc_huge_page at mm/hugetlb.c:2359
> > [ 7933.807100][T14355] [c0000014b71efad0] [c0000000004168f8]
> > hugetlb_no_page+0x158/0xcb0
> > [ 7933.807113][T14355] [c0000014b71efc20] [c000000000417bc8]
> > hugetlb_fault+0x678/0xb30
> > [ 7933.807136][T14355] [c0000014b71efcd0] [c0000000003b1de4]
> > handle_mm_fault+0x444/0x450
> > [ 7933.807158][T14355] [c0000014b71efd20] [c000000000070b1c]
> > __do_page_fault+0x2bc/0xfd0
> > [ 7933.807181][T14355] [c0000014b71efe20] [c00000000000aa88]
> > handle_page_fault+0x10/0x30
> > [ 7933.807201][T14355] Instruction dump:
> > [ 7933.807209][T14355] 38c00000 7ea5ab78 7ec4b378 7fa3eb78 4bfff80d e9210020
> > e91d0050 e95d0068
> > [ 7933.807232][T14355] 7d3c4850 7d294214 7faa4800 409c0238 <0b170000> 7f03c378
> > 4858c005 60000000
> > [ 7933.807267][T14355] ---[ end trace 7560275de5f409f8 ]---
> > [ 7933.905258][T14355]
> > [ 7934.905339][T14355] Kernel panic - not syncing: Fatal exception
>
> Hi Qian,
>
> Yes this VM_BUG_ON was added by a patch in the series ("hugetlb:
> disable region_add file_region coalescing") so it's definitely related
> to the series. I'm taking a look at why this VM_BUG_ON fires. Can you
> confirm you reproduce this by running hugemmap06 from the ltp on a
> powerpc machine? Can I maybe have your config?

Yes, reproduced on both powerpc and x86. Configs are in,

https://github.com/cailca/linux-mm