Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754713AbcDTIKl (ORCPT ); Wed, 20 Apr 2016 04:10:41 -0400 Received: from LGEAMRELO13.lge.com ([156.147.23.53]:55290 "EHLO lgeamrelo13.lge.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754614AbcDTIKb (ORCPT ); Wed, 20 Apr 2016 04:10:31 -0400 X-Original-SENDERIP: 156.147.1.127 X-Original-MAILFROM: iamjoonsoo.kim@lge.com X-Original-SENDERIP: 10.177.222.138 X-Original-MAILFROM: iamjoonsoo.kim@lge.com Date: Wed, 20 Apr 2016 17:13:45 +0900 From: Joonsoo Kim To: Valdis.Kletnieks@vt.edu Cc: Andrew Morton , linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: linux-next crash during very early boot Message-ID: <20160420081345.GC7071@js1304-P5Q-DELUXE> References: <3689.1460593786@turing-police.cc.vt.edu> <20160414013546.GA9198@js1304-P5Q-DELUXE> <58269.1460729433@turing-police.cc.vt.edu> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <58269.1460729433@turing-police.cc.vt.edu> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2080 Lines: 53 On Fri, Apr 15, 2016 at 10:10:33AM -0400, Valdis.Kletnieks@vt.edu wrote: > On Thu, 14 Apr 2016 10:35:47 +0900, Joonsoo Kim said: > > On Wed, Apr 13, 2016 at 08:29:46PM -0400, Valdis Kletnieks wrote: > > > I'm seeing my laptop crash/wedge up/something during very early > > > boot - before it can write anything to the console. Nothing in pstore, > > > need to hold down the power button for 6 seconds and reboot. > > > > > > git bisect points at: > > > > > > commit 7a6bacb133752beacb76775797fd550417e9d3a2 > > > Author: Joonsoo Kim > > > Date: Thu Apr 7 13:59:39 2016 +1000 > > > > > > mm/slab: factor out kmem_cache_node initialization code > > > > > > It can be reused on other place, so factor out it. Following patch wil > l > > > use it. > > > > > > > > > Not sure what the problem is - the logic *looks* ok at first read. The > > > patch *does* remove a spin_lock_irq() - but I find it difficult to > > > believe that with it gone, my laptop is able to hit the race condition > > > the spinlock protects against *every single boot*. > > > > > > The only other thing I see is that n->free_limit used to be assigned > > > every time, and now it's only assigned at initial creation. > > > > Hello, > > > > My fault. It should be assgined every time. Please test below patch. > > I will send it with proper SOB after you confirm the problem disappear. > > Thanks for report and analysis! > > Following up - I verified that it was your patch series and not a bad bisect > by starting with a clean next-20160413 and reverting that series - and the > resulting kernel boots fine. > > Will take a closer look at your fix patch and figure out what's still changed > afterwards - there's obviously some small semantic change that actually > matters, but we're not spotting it yet... Hello, Do you try to test the patch in following link on top of my fix for "mm/slab: factor out kmem_cache_node initialization code"? https://lkml.org/lkml/2016/4/10/703 I mentioned it in another thread but you didn't reply it so I'm curious. Thanks.