Received: by 2002:ac0:a582:0:0:0:0:0 with SMTP id m2-v6csp1443952imm; Tue, 2 Oct 2018 08:17:30 -0700 (PDT) X-Google-Smtp-Source: ACcGV63U9iyjIHirziwySt5OFQgk4nZkmhRRYd59FQ+FRI/LjSzqoUdrBnasCGtI3jNxIOdRYCKO X-Received: by 2002:a63:6645:: with SMTP id a66-v6mr14616803pgc.5.1538493450543; Tue, 02 Oct 2018 08:17:30 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1538493450; cv=none; d=google.com; s=arc-20160816; b=WAYdxQSjmNwM8ew18kdyxL4CNKUMUBotzNdQ9jvTm6QCXwOEjxsJdqu5se2h3Nef3m OwNf+z74hc6AsV89HGNbUk73eQAxpyT2OFBmSz7Yim6Up2Abk47NITcvDdfoFCp93NGu s+mL0bWKxvZTSZSppkLnU3QjY+Quf62AXQc8aICHxenxzQnLtmkCPIANs9pg28KnVDZT 7STaCSMkaH+ddWY2pa3TAzkKxLuogJdVjuHzVblZnEOYWvLQqybaz1W3Xjw4JKm8Bbij NP8k8FYTD0tyLAWFUE9nLfFbgaxTHW6oSDSi9EdFQ5tEYO2v4aadSFb1mir7N9z7PiqS bmfw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :organization:autocrypt:openpgp:from:references:cc:to:subject; bh=Xlitf7Qn3WeiX8OJeRxq8jGaCTBXJZRYfIt82hzVlwE=; b=R9sHxjZUPJRHYoHVejODP9PIacHrCW1yIDYXV1HSBfqrzr1oBAGKTPZB4mMCcR9rNG Uy9LR2AbwHpUYtC4MHHKUOU98iLvjCnzuF0YFFw2lZcKQ/kt+RkW0wCxvsPv2DSkcsEB /zHMdaOBtPpsW8oWtdwYnlrDmVVDyfCfFJLyq4Jk1/CZE7IphRTKHV5bANa9fqm4RMGO 64Af2jyNjTwc/t5LvV5JHOLNQC72IPb23d8BDJC265GSwaEHiPfZ04Saeyboj8Ws5g4N KVCzDrTfSYU2crEr1CloNqXQiAEx/vrnAC+jPTdciFFbmy8jdHMPGlAt+QPF8IBTQxTy AnIg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f9-v6si17932622plm.126.2018.10.02.08.17.15; Tue, 02 Oct 2018 08:17:30 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729100AbeJBWAT (ORCPT + 99 others); Tue, 2 Oct 2018 18:00:19 -0400 Received: from mx0a-001b2d01.pphosted.com ([148.163.156.1]:38672 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726822AbeJBWAT (ORCPT ); Tue, 2 Oct 2018 18:00:19 -0400 Received: from pps.filterd (m0098396.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.22/8.16.0.22) with SMTP id w92FFtJw110363 for ; Tue, 2 Oct 2018 11:16:25 -0400 Received: from e16.ny.us.ibm.com (e16.ny.us.ibm.com [129.33.205.206]) by mx0a-001b2d01.pphosted.com with ESMTP id 2mv9ue46ew-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Tue, 02 Oct 2018 11:16:19 -0400 Received: from localhost by e16.ny.us.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Tue, 2 Oct 2018 11:14:58 -0400 Received: from b01cxnp23034.gho.pok.ibm.com (9.57.198.29) by e16.ny.us.ibm.com (146.89.104.203) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Tue, 2 Oct 2018 11:14:53 -0400 Received: from b01ledav006.gho.pok.ibm.com (b01ledav006.gho.pok.ibm.com [9.57.199.111]) by b01cxnp23034.gho.pok.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id w92FErdm8716490 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=FAIL); Tue, 2 Oct 2018 15:14:53 GMT Received: from b01ledav006.gho.pok.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id A706FAC05E; Tue, 2 Oct 2018 11:14:10 -0400 (EDT) Received: from b01ledav006.gho.pok.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id ECDFEAC05B; Tue, 2 Oct 2018 11:14:07 -0400 (EDT) Received: from oc5000245537.ibm.com (unknown [9.53.179.212]) by b01ledav006.gho.pok.ibm.com (Postfix) with ESMTP; Tue, 2 Oct 2018 11:14:07 -0400 (EDT) Subject: Re: [PATCH] migration/mm: Add WARN_ON to try_offline_node To: Michal Hocko Cc: Tyrel Datwyler , Thomas Falcon , Kees Cook , Mathieu Malaterre , Pavel Tatashin , Nicholas Piggin , linux-kernel@vger.kernel.org, linux-mm@kvack.org, Mauricio Faria de Oliveira , Juliet Kim , Thiago Jung Bauermann , Nathan Fontenot , Andrew Morton , YASUAKI ISHIMATSU , linuxppc-dev@lists.ozlabs.org, Dan Williams , Oscar Salvador References: <20181001185616.11427.35521.stgit@ltcalpine2-lp9.aus.stglabs.ibm.com> <20181001202724.GL18290@dhcp22.suse.cz> <20181002145922.GZ18290@dhcp22.suse.cz> From: Michael Bringmann Openpgp: preference=signencrypt Autocrypt: addr=mwb@linux.vnet.ibm.com; prefer-encrypt=mutual; keydata= xsBNBFcY7GcBCADzw3en+yzo9ASFGCfldVkIg95SAMPK0myXp2XJYET3zT45uBsX/uj9/2nA lBmXXeOSXnPfJ9V3vtiwcfATnWIsVt3tL6n1kqikzH9nXNxZT7MU/7gqzWZngMAWh/GJ9qyg DTOZdjsvdUNUWxtiLvBo7y+reA4HjlQhwhYxxvCpXBeRoF0qDWfQ8DkneemqINzDZPwSQ7zY t4F5iyN1I9GC5RNK8Y6jiKmm6bDkrrbtXPOtzXKs0J0FqWEIab/u3BDrRP3STDVPdXqViHua AjEzthQbGZm0VCxI4a7XjMi99g614/qDcXZCs00GLZ/VYIE8hB9C5Q+l66S60PLjRrxnABEB AAHNLU1pY2hhZWwgVy4gQnJpbmdtYW5uIDxtd2JAbGludXgudm5ldC5pYm0uY29tPsLAeAQT AQIAIgUCVxjsZwIbAwYLCQgHAwIGFQgCCQoLBBYCAwECHgECF4AACgkQSEdag3dpuTI0NAf8 CKYTDKQLgOSjVrU2L5rM4lXaJRmQV6oidD3vIhKSnWRvPq9C29ifRG6ri20prTHAlc0vycgm 41HHg0y2vsGgNXGTWC2ObemoZBI7mySXe/7Tq5mD/semGzOp0YWZ7teqrkiSR8Bw0p+LdE7K QmT7tpjjvuhrtQ3RRojUYcuy1nWUsc4D+2cxsnZslsx84FUKxPbLagDgZmgBhUw/sUi40s6S AkdViVCVS0WANddLIpG0cfdsV0kCae/XdjK3mRK6drFKv1z+QFjvOhc8QIkkxFD0da9w3tJj oqnqHFV5gLcHO6/wizPx/NV90y6RngeBORkQiRFWxTXS4Oj9GVI/Us7ATQRXGOxnAQgAmJ5Y ikTWrMWPfiveUacETyEhWVl7u8UhZcx3yy2te8O0ay7t9fYcZgIEfQPPVVus89acIXlG3wYL DDPvb21OprLxi+ZJ2a0S5we+LcSWN1jByxJlbWBq+/LcMtGAOhNLpysY1gD0Y4UW/eKS+TFZ 562qKC3k1dBvnV9JXCgeS1taYFxRdVAn+2DwK3nuyG/DDq/XgJ5BtmyC3MMx8CiW3POj+O+l 6SedIeAfZlZ7/xhijx82g93h07VavUQRwMZgZFsqmuxBxVGiav2HB+dNvs3PFB087Pvc9OHe qhajPWOP/gNLMmvBvknn1NToM9a8/E8rzcIZXoYs4RggRRYh6wARAQABwsBfBBgBAgAJBQJX GOxnAhsMAAoJEEhHWoN3abky+RUH/jE08/r5QzaNKYeVhu0uVbgXu5fsxqr2cAxhf+KuwT3T efhEP2alarxzUZdEh4MsG6c+X2NYLbD3cryiXxVx/7kSAJEFQJfA5P06g8NLR25Qpq9BLsN7 ++dxQ+CLKzSEb1X24hYAJZpOhS8ev3ii+M/XIo+olDBKuTaTgB6elrg3CaxUsVgLBJ+jbRkW yQe2S5f/Ja1ThDpSSLLWLiLK/z7+gaqwhnwjQ8Z8Y9D2itJQcj4itHilwImsqwLG7SxzC0NX IQ5KaAFYdRcOgwR8VhhkOIVd70ObSZU+E4pTET1WDz4o65xZ89yfose1No0+r5ht/xWOOrh8 53/hcWvxHVs= Organization: IBM Linux Technology Center Date: Tue, 2 Oct 2018 10:14:49 -0500 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20181002145922.GZ18290@dhcp22.suse.cz> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit X-TM-AS-GCONF: 00 x-cbid: 18100215-0072-0000-0000-000003AE1A9A X-IBM-SpamModules-Scores: X-IBM-SpamModules-Versions: BY=3.00009809; HX=3.00000242; KW=3.00000007; PH=3.00000004; SC=3.00000267; SDB=6.01096834; UDB=6.00567186; IPR=6.00876876; MB=3.00023590; MTD=3.00000008; XFM=3.00000015; UTC=2018-10-02 15:14:56 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 18100215-0073-0000-0000-0000499E6A50 Message-Id: X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:,, definitions=2018-10-02_06:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1807170000 definitions=main-1810020148 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 10/02/2018 09:59 AM, Michal Hocko wrote: > On Tue 02-10-18 09:51:40, Michael Bringmann wrote: > [...] >> When the device-tree affinity attributes have changed for memory, >> the 'nid' affinity calculated points to a different node for the >> memory block than the one used to install it, previously on the >> source system. The newly calculated 'nid' affinity may not yet >> be initialized on the target system. The current memory tracking >> mechanisms do not record the node to which a memory block was >> associated when it was added. Nathan is looking at adding this >> feature to the new implementation of LMBs, but it is not there >> yet, and won't be present in earlier kernels without backporting a >> significant number of changes. > > Then the patch you have proposed here just papers over a real issue, no? > IIUC then you simply do not remove the memory if you lose the race. The problem occurs when removing memory after an affinity change references a node that was previously unreferenced. Other code in 'kernel/mm/memory_hotplug.c' deals with initializing an empty node when adding memory to a system. The 'removing memory' case is specific to systems that perform LPM and allow device-tree changes. The powerpc kernel does not have the option of accepting some PRRN requests and accepting others. It must perform them all. The kernel/mm code that removes memory blocks does not (before this patch) recognize that the affinity of a memory block could have changed to a previously unused node. If every path to try_offline_node made such a check, then this patch would be unnecessary. However, putting a patch at a single location to check for a relatively rare occurrence, would seem to be a more efficient implementation. Michael -- Michael W. Bringmann Linux Technology Center IBM Corporation Tie-Line 363-5196 External: (512) 286-5196 Cell: (512) 466-0650 mwb@linux.vnet.ibm.com