Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp8095261imu; Thu, 15 Nov 2018 06:34:58 -0800 (PST) X-Google-Smtp-Source: AJdET5cB14K8iO6v68TOAh+dYrkCzCUqvVQWuWeDM3r+sSoh2KDxyth/YhXfz6U6QlMbUDWVUmz4 X-Received: by 2002:a63:a51b:: with SMTP id n27mr6095387pgf.17.1542292498851; Thu, 15 Nov 2018 06:34:58 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1542292498; cv=none; d=google.com; s=arc-20160816; b=gfPAxIWc/47aSibkqiTsUsl15bM+QBy3K7tqxAx5Gg1V93OX2Foq0o6EQnjU0T9IKd 1YcaRd+raoQ4L9bZCLTfaG4F7ks/0CFTicLJoI8Cqa5JW9ZFV+8PRCAqTd+051t6CtZK Q1+qlKY4V2ZUPCpKjsSilhNAgMDuORpplKmTzqLloav0dogLo8hSpkx5aG93RzuX1qjJ Z1T/kO8FY0B+Mvsv+ctvfZlsdvXY9RUvj0cmFCMuiXCIZMI4x/dKhNqFydvqI7qcwV6j venErEvagcCphKGmheieQ8ZRt0mDfiOFK60cZs+M8gq6KhMsj8dw2quA+zVCGO/Pzf6b qq2g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=u99NTzVXw1/Pln7rwETqLeI+JKypEIXJ6i5HmnLtPTA=; b=M921AkvbgJfgbEoURjjtJNELupJOG4WmuIBk0GZyWXazHPFFurySev6tb07CK0fvM4 QQHH+uUfOQ8ScV9TRQRLhmqgqxPPjLWWgHpC8ef06l2wOe+QYbqWCLTI7p1N/6MfukNW Lqq7Ogtw01RzVH4mzGe7uNVkLB6hrUnbl2YkyQBxmh4debrWUCGqvV/Up5cUMVnjgYRi BaoVrj1yHnrO6T62JvMffo5PSly8dhIuhAZuRptqYul7l19T1mOl+08Ic89HzL+MJVEv cYh4B5eVTzPTU3TBL4AE4Py4jyUOD51T0ZhUpzpKUwtQYOZioMzAhq25sU3OuyGxWPTc vkcw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g8-v6si19136999pli.79.2018.11.15.06.34.41; Thu, 15 Nov 2018 06:34:58 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2388326AbeKPAkL (ORCPT + 99 others); Thu, 15 Nov 2018 19:40:11 -0500 Received: from mx2.suse.de ([195.135.220.15]:44518 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S2387548AbeKPAkL (ORCPT ); Thu, 15 Nov 2018 19:40:11 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id CE207B015; Thu, 15 Nov 2018 14:32:05 +0000 (UTC) Date: Thu, 15 Nov 2018 15:32:04 +0100 From: Michal Hocko To: Baoquan He Cc: David Hildenbrand , linux-mm@kvack.org, pifang@redhat.com, linux-kernel@vger.kernel.org, akpm@linux-foundation.org, aarcange@redhat.com Subject: Re: Memory hotplug softlock issue Message-ID: <20181115143204.GV23831@dhcp22.suse.cz> References: <20181114090134.GG23419@dhcp22.suse.cz> <20181114145250.GE2653@MiWiFi-R3L-srv> <20181114150029.GY23419@dhcp22.suse.cz> <20181115051034.GK2653@MiWiFi-R3L-srv> <20181115073052.GA23831@dhcp22.suse.cz> <20181115075349.GL2653@MiWiFi-R3L-srv> <20181115083055.GD23831@dhcp22.suse.cz> <20181115131211.GP2653@MiWiFi-R3L-srv> <20181115131927.GT23831@dhcp22.suse.cz> <20181115133840.GR2653@MiWiFi-R3L-srv> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20181115133840.GR2653@MiWiFi-R3L-srv> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu 15-11-18 21:38:40, Baoquan He wrote: > On 11/15/18 at 02:19pm, Michal Hocko wrote: > > On Thu 15-11-18 21:12:11, Baoquan He wrote: > > > On 11/15/18 at 09:30am, Michal Hocko wrote: > > [...] > > > > It would be also good to find out whether this is fs specific. E.g. does > > > > it make any difference if you use a different one for your stress > > > > testing? > > > > > > Created a ramdisk and put stress bin there, then run stress -m 200, now > > > seems it's stuck in libc-2.28.so migrating. And it's still xfs. So now xfs > > > is a big suspect. At bottom I paste numactl printing, you can see that it's > > > the last 4G. > > > > > > Seems it's trying to migrate libc-2.28.so, but stress program keeps trying to > > > access and activate it. > > > > Is this still with faultaround disabled? I have seen exactly same > > pattern in the bug I am working on. It was ext4 though. > > After a long time struggling, the last 2nd block where libc-2.28.so is > located is reclaimed, now it comes to the last memory block, still > stress program itself. swap migration entry has been made and trying to > unmap, now it's looping there. > > [ +0.004445] migrating pfn 190ff2bb0 failed > [ +0.000013] page:ffffea643fcaec00 count:203 mapcount:201 mapping:ffff888dfb268f48 index:0x0 > [ +0.012809] shmem_aops > [ +0.000011] name:"stress" > [ +0.002550] flags: 0x1dfffffc008004e(referenced|uptodate|dirty|workingset|swapbacked) > [ +0.010715] raw: 01dfffffc008004e ffffea643fcaec48 ffffea643fc714c8 ffff888dfb268f48 > [ +0.007828] raw: 0000000000000000 0000000000000000 000000cb000000c8 ffff888e72e92000 > [ +0.007810] page->mem_cgroup:ffff888e72e92000 [...] > [ +0.004455] migrating pfn 190ff2bb0 failed > [ +0.000018] page:ffffea643fcaec00 count:203 mapcount:201 mapping:ffff888dfb268f48 index:0x0 > [ +0.014392] shmem_aops > [ +0.000010] name:"stress" > [ +0.002565] flags: 0x1dfffffc008004e(referenced|uptodate|dirty|workingset|swapbacked) > [ +0.010675] raw: 01dfffffc008004e ffffea643fcaec48 ffffea643fc714c8 ffff888dfb268f48 > [ +0.007819] raw: 0000000000000000 0000000000000000 000000cb000000c8 ffff888e72e92000 > [ +0.007808] page->mem_cgroup:ffff888e72e92000 OK, so this is tmpfs backed code of your stree test. This just tells us that this is not fs specific. Reference count is 2 more than the map count which is the expected state. So the reference count must have been elevated at the time when the migration was attempted. Shmem supports fault around so this might be still possible (assuming it is enabled). If not we really need to dig deeper. I will think of a debugging patch. -- Michal Hocko SUSE Labs