Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932355AbcCHPga (ORCPT ); Tue, 8 Mar 2016 10:36:30 -0500 Received: from mail-oi0-f52.google.com ([209.85.218.52]:34003 "EHLO mail-oi0-f52.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754258AbcCHPgT (ORCPT ); Tue, 8 Mar 2016 10:36:19 -0500 MIME-Version: 1.0 In-Reply-To: <56DEAD3D.5090706@huawei.com> References: <56D79284.3030009@redhat.com> <56D832BD.5080305@huawei.com> <20160304020232.GA12036@js1304-P5Q-DELUXE> <20160304043232.GC12036@js1304-P5Q-DELUXE> <56D92595.60709@huawei.com> <20160304063807.GA13317@js1304-P5Q-DELUXE> <56D93ABE.9070406@huawei.com> <20160307043442.GB24602@js1304-P5Q-DELUXE> <56DD7B20.1020508@suse.cz> <20160308074816.GA31471@js1304-P5Q-DELUXE> <56DEAD3D.5090706@huawei.com> Date: Wed, 9 Mar 2016 00:36:18 +0900 Message-ID: Subject: Re: Suspicious error for CMA stress test From: Joonsoo Kim To: Xishi Qiu Cc: Joonsoo Kim , Vlastimil Babka , Hanjun Guo , Laura Abbott , "linux-arm-kernel@lists.infradead.org" , "linux-kernel@vger.kernel.org" , Andrew Morton , Sasha Levin , Laura Abbott , Catalin Marinas , Will Deacon , Arnd Bergmann , "thunder.leizhen@huawei.com" , dingtinahong , chenjie6@huawei.com, "linux-mm@kvack.org" Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2209 Lines: 57 2016-03-08 19:45 GMT+09:00 Xishi Qiu : > On 2016/3/8 15:48, Joonsoo Kim wrote: > >> On Mon, Mar 07, 2016 at 01:59:12PM +0100, Vlastimil Babka wrote: >>> On 03/07/2016 05:34 AM, Joonsoo Kim wrote: >>>> On Fri, Mar 04, 2016 at 03:35:26PM +0800, Hanjun Guo wrote: >>>>>> Sad to hear that. >>>>>> >>>>>> Could you tell me your system's MAX_ORDER and pageblock_order? >>>>>> >>>>> >>>>> MAX_ORDER is 11, pageblock_order is 9, thanks for your help! >>> >>> I thought that CMA regions/operations (and isolation IIRC?) were >>> supposed to be MAX_ORDER aligned exactly to prevent needing these >>> extra checks for buddy merging. So what's wrong? >> >> CMA isolates MAX_ORDER aligned blocks, but, during the process, >> partialy isolated block exists. If MAX_ORDER is 11 and >> pageblock_order is 9, two pageblocks make up MAX_ORDER >> aligned block and I can think following scenario because pageblock >> (un)isolation would be done one by one. >> >> (each character means one pageblock. 'C', 'I' means MIGRATE_CMA, >> MIGRATE_ISOLATE, respectively. >> > > Hi Joonsoo, > >> CC -> IC -> II (Isolation) > >> II -> CI -> CC (Un-isolation) >> >> If some pages are freed at this intermediate state such as IC or CI, >> that page could be merged to the other page that is resident on >> different type of pageblock and it will cause wrong freepage count. >> > > Isolation will appear when do cma alloc, so there are two following threads. > > C(free)C(used) -> start_isolate_page_range -> I(free)C(used) -> I(free)I(someone free it) -> undo_isolate_page_range -> C(free)C(free) > so free cma is 2M -> 0M -> 0M -> 4M, the increased 2M was freed by someone. Your example is correct one but think about following one. C(free)C(used) -> start_isolate_page_range -> I(free)C(used) -> I(free)**C**(someone free it) -> undo_isolate_page_range -> C(free)C(free) it would be 2M -> 0M -> 2M -> 6M. When we do I(free)C(someone free it), CMA freepage is added because it is on CMA pageblock. But, bad merging happens and 4M buddy is made and it is in isolate buddy list. Later, when we do undo_isolation, this 4M buddy is moved to CMA buddy list and 4M is added to CMA freepage counter so total is 6M. Thanks.