From: Anshuman Khandual
Subject: Re: [PATCH 0/5] mm, memory_hotplug: improve memory offlining failures debugging
To: Michal Hocko, Andrew Morton
Cc: Oscar Salvador, Baoquan He, linux-mm@kvack.org, LKML
Date: Fri, 16 Nov 2018 17:25:35 +0530
Message-ID: <8a91e93d-386d-f0bc-d639-a696bb37a34e@arm.com>
In-Reply-To: <20181116083020.20260-1-mhocko@kernel.org>

On 11/16/2018 02:00 PM, Michal Hocko wrote:
> Hi,
> this has been
> posted as an RFC [1]. I screwed up during rebasing, so
> there were a few compilation issues in the previous version. I have also
> integrated review feedback from Andrew and Anshuman.
>
> I have been promising to improve memory offlining failures debugging
> for quite some time. As things stand now we get only very limited
> information in the kernel log when the offlining fails. It is usually
> only
> [ 1984.506184] rac1 kernel: memory offlining [mem 0x82600000000-0x8267fffffff] failed
> without any further details. We do not know what exactly fails and for
> what reason. Whenever I was forced to debug such a failure I've always
> had to do a debugging patch to tell me more. We can enable some
> tracepoints but it would be much better to get a better picture without
> using them.
>
> This patch series does 2 things. The first one is to make dump_page
> more usable by printing more information about the mapping (patch 1).
> Then it reduces the log level from emerg to warning so that this
> function is usable from less critical contexts (patch 2). Then I have
> added more detailed information about the offlining failure (patch 4)
> and finally added dump_page to the isolation and offlining migration paths.
> Patch 3 is a trivial cleanup.
>
> Does this look good to you?
>
> [1] http://lkml.kernel.org/r/20181107101830.17405-1-mhocko@kernel.org
>

Agreed. It has always been difficult to debug memory hotplug problems
without a debug patch, particularly when trying to understand the
unmovable pages and their isolation failures in the range to be removed.
This series is definitely going to help improve the situation.