Date: Tue, 14 Jul 2020 12:08:46 +0200
From: Oscar Salvador
To: Qian Cai
Cc: nao.horiguchi@gmail.com, linux-mm@kvack.org, mhocko@kernel.org,
    akpm@linux-foundation.org, mike.kravetz@oracle.com, tony.luck@intel.com,
    david@redhat.com, aneesh.kumar@linux.vnet.ibm.com, zeil@yandex-team.ru,
    naoya.horiguchi@nec.com, linux-kernel@vger.kernel.org
Subject: Re: [PATCH v3 00/15] HWPOISON: soft offline rework
Message-ID: <20200714100839.GA1939@linux>
References: <20200624150137.7052-1-nao.horiguchi@gmail.com> <20200630050803.GA2747@lca.pw>
In-Reply-To: <20200630050803.GA2747@lca.pw>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, Jun 30, 2020 at 01:08:03AM -0400, Qian Cai wrote:
> Even after applying the compiling fix,
>
> https://lore.kernel.org/linux-mm/20200628065409.GA546944@u2004/
>
> madvise(MADV_SOFT_OFFLINE) will fail with EIO with hugetlb where it
> would succeed without this series. Steps:
>
> # git clone https://github.com/cailca/linux-mm
> # cd linux-mm; make
> # ./random 1 (Need at least two NUMA memory nodes)
> start: migrate_huge_offline
> - use NUMA nodes 0,4.
> - mmap and free 8388608 bytes hugepages on node 0
> - mmap and free 8388608 bytes hugepages on node 4
> madvise: Input/output error

Sorry for the late reply; I had to load the code back into my head.

I just finished v4 of this patchset, and the problem seems to be gone:

 # ./random 1
 - start: migrate_huge_offline
 - use NUMA nodes 0,1.
 - mmap and free 8388608 bytes hugepages on node 0
 - mmap and free 8388608 bytes hugepages on node 1
 - pass: mmap_offline_node_huge
 - start: hotplug_memory
 offline: Device or resource busy
 offline: Device or resource busy
 offline: Device or resource busy
 offline: Device or resource busy
 offline: Device or resource busy
 offline: Device or resource busy
 offline: Device or resource busy
 offline: Device or resource busy
 offline: Invalid argument
 offline: Device or resource busy
 offline: Invalid argument
 offline: Device or resource busy
 offline: Device or resource busy
 offline: Device or resource busy
 offline: Device or resource busy
 - pass: hotplug_memory

The test seems to succeed, with no crash on the kernel side.

I will run some more tests to make sure it is solid enough, and then I
will post v4.

Thanks

-- 
Oscar Salvador
SUSE L3