Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752000AbdIVTw1 (ORCPT ); Fri, 22 Sep 2017 15:52:27 -0400 Received: from out0-205.mail.aliyun.com ([140.205.0.205]:52448 "EHLO out0-205.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751845AbdIVTwZ (ORCPT ); Fri, 22 Sep 2017 15:52:25 -0400 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R151e4;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e02c03293;MF=yang.s@alibaba-inc.com;NM=1;PH=DS;RN=9;SR=0;TI=SMTPD_---.8yOHODC_1506109930; From: "Yang Shi" To: cl@linux.com, penberg@kernel.org, rientjes@google.com, iamjoonsoo.kim@lge.com, akpm@linux-foundation.org, mhocko@kernel.org Cc: "Yang Shi" , , Subject: [PATCH 0/2 v6] oom: capture unreclaimable slab info in oom message when kernel panic Date: Sat, 23 Sep 2017 03:52:05 +0800 Message-Id: <1506109927-17012-1-git-send-email-yang.s@alibaba-inc.com> X-Mailer: git-send-email 1.8.3.1 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1718 Lines: 41 Recently we ran into a oom issue, kernel panic due to no killable process. The dmesg shows huge unreclaimable slabs used almost 100% memory, but kdump doesn't capture vmcore due to some reason. So, it may sound better to capture unreclaimable slab info in oom message when kernel panic to aid trouble shooting and cover the corner case. Since kernel already panic, so capturing more information sounds worthy and doesn't bother normal oom killer. With the patchset, tools/vm/slabinfo has a new option, "-U", to show unreclaimable slab only. And, oom will print all non zero (num_objs * size != 0) unreclaimable slabs in oom killer message. For details, please see the commit log for each commit. Changelog v5 -> v6: * Fixed a checkpatch.pl warning for patch #2, zero error and warning for both patches Changelog v4 -> v5: * Solved the comments from David * Build test SLABINFO = n Changelog v3 -> v4: * Solved the comments from David * Added David’s Acked-by in patch 1 Changelog v2 -> v3: * Show used size and total size of each kmem cache per David’s comment Changelog v1 -> v2: * Removed the original patch 1 (“mm: slab: output reclaimable flag in /proc/slabinfo”) since Christoph suggested it might break the compatibility and /proc/slabinfo is legacy * Added Christoph’s Acked-by * Removed acquiring slab_mutex per Tetsuo’s comment Yang Shi (2): tools: slabinfo: add "-U" option to show unreclaimable slabs only mm: oom: show unreclaimable slab info when kernel panic mm/oom_kill.c | 3 +++ mm/slab.h | 8 ++++++++ mm/slab_common.c | 29 +++++++++++++++++++++++++++++ tools/vm/slabinfo.c | 11 ++++++++++- 4 files changed, 50 insertions(+), 1 deletion(-)