Received: by 2002:a25:824b:0:0:0:0:0 with SMTP id d11csp6566635ybn; Sun, 29 Sep 2019 23:30:15 -0700 (PDT) X-Google-Smtp-Source: APXvYqwh4dlRtmVRkLrELuLUbFuBew+xM69SXm6Dp7QKQ4yBkLNiWujFHL+GmR8yPSLyHnhEqy1D X-Received: by 2002:a17:906:e28f:: with SMTP id gg15mr17690214ejb.182.1569825014940; Sun, 29 Sep 2019 23:30:14 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1569825014; cv=none; d=google.com; s=arc-20160816; b=caNsAR0dOKDLWA1y0JM/KVr29BAahtQobVeot8JM4KNe9XOP6NmSiLwzMw/W0oXW4w usaET480XMDh+Dd7vAahNfoXEkUp8Wx1v2CFs//4uC9m8LUzX9XevFEmCAunL1tYzlyo 14HXFBTcZlu5vDWMTMTOJyZJc8yZCuunK6cCPYu1ghrKZCi35yJY5Iz6seyUABiCfKxh E/jUejto4U4m9M3dOJ6nzsBaoLic5Cfe7qDo/eNK+n5G5/WnXTh9ZhO2krHHkceMxpEn UQzI/7KdZo0/CQZPwbXY0JYraVHSCDRixihCBsrKAJ6mEn5uThHNkkoVCrC/mkeqBI2t mZBw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date:autocrypt :openpgp:from:references:cc:to:subject; bh=iu0Xm9aLagUYrZTp3haUgQB1/w0H8OdJEUFghfFxJeA=; b=EADoKC8a033NZfQIA5YJDTxvS3K+WMvsOtewadN172a4vOuSgwRo5jOAivXPLBRMpT yB7TsWeFu8AHgReqqc5AszfwXc2q2zZBEkR+x6qXxPYpP0+UqHSY4xs0bHN7d0k8ELDP /XBkSJvTiaZoa/WZf99cd4OnfjPxccKVOL/wkDUdB0hwEv86dPywEkmUXKHbj3CbvemD a34nJy+29KEtwV7+KLy12t2dOX98WmE0cbSblMSqJmH/cJKaE1TxgGuftx5ogdWEeIL/ 2ZcKW+8qHGrebKL4nfXr5+lbcvdX7VwGoHyFbpTbonPTDUrX3qXaTRx7GZKC6JjdR9Xa aOaA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x62si6569029ede.352.2019.09.29.23.29.50; Sun, 29 Sep 2019 23:30:14 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729713AbfI3G1r (ORCPT + 99 others); Mon, 30 Sep 2019 02:27:47 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:26294 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1729648AbfI3G1r (ORCPT ); Mon, 30 Sep 2019 02:27:47 -0400 Received: from pps.filterd (m0098413.ppops.net [127.0.0.1]) by mx0b-001b2d01.pphosted.com (8.16.0.27/8.16.0.27) with SMTP id x8U6RQAl059913 for ; Mon, 30 Sep 2019 02:27:45 -0400 Received: from e06smtp04.uk.ibm.com (e06smtp04.uk.ibm.com [195.75.94.100]) by mx0b-001b2d01.pphosted.com with ESMTP id 2vbce602am-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Mon, 30 Sep 2019 02:27:45 -0400 Received: from localhost by e06smtp04.uk.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Mon, 30 Sep 2019 07:27:43 +0100 Received: from b06avi18626390.portsmouth.uk.ibm.com (9.149.26.192) by e06smtp04.uk.ibm.com (192.168.101.134) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Mon, 30 Sep 2019 07:27:40 +0100 Received: from d06av25.portsmouth.uk.ibm.com (d06av25.portsmouth.uk.ibm.com [9.149.105.61]) by b06avi18626390.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id x8U6RBld37618038 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Mon, 30 Sep 2019 06:27:11 GMT Received: from d06av25.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id DEDAF11C054; Mon, 30 Sep 2019 06:27:39 +0000 (GMT) Received: from d06av25.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 964D411C04C; Mon, 30 Sep 2019 06:27:39 +0000 (GMT) Received: from oc7455500831.ibm.com (unknown [9.152.224.146]) by d06av25.portsmouth.uk.ibm.com (Postfix) with ESMTP; Mon, 30 Sep 2019 06:27:39 +0000 (GMT) Subject: Re: [PATCH] mm/page_alloc: fix a crash in free_pages_prepare() To: David Hildenbrand , Alexander Duyck , Andrew Morton Cc: Qian Cai , Heiko Carstens , gor@linux.ibm.com, linux-s390@vger.kernel.org, linux-mm , LKML References: <1569613623-16820-1-git-send-email-cai@lca.pw> <20190927140222.6f7d0a41b9e734053ee911b9@linux-foundation.org> <1569619686.5576.242.camel@lca.pw> <20190927145945.846a3f3405d3af066827d3f5@linux-foundation.org> <5ee9164f-71c5-4082-a80d-8fbc5dc50750@redhat.com> From: Christian Borntraeger Openpgp: preference=signencrypt Autocrypt: addr=borntraeger@de.ibm.com; prefer-encrypt=mutual; keydata= mQINBE6cPPgBEAC2VpALY0UJjGmgAmavkL/iAdqul2/F9ONz42K6NrwmT+SI9CylKHIX+fdf J34pLNJDmDVEdeb+brtpwC9JEZOLVE0nb+SR83CsAINJYKG3V1b3Kfs0hydseYKsBYqJTN2j CmUXDYq9J7uOyQQ7TNVoQejmpp5ifR4EzwIFfmYDekxRVZDJygD0wL/EzUr8Je3/j548NLyL 4Uhv6CIPf3TY3/aLVKXdxz/ntbLgMcfZsDoHgDk3lY3r1iwbWwEM2+eYRdSZaR4VD+JRD7p8 0FBadNwWnBce1fmQp3EklodGi5y7TNZ/CKdJ+jRPAAnw7SINhSd7PhJMruDAJaUlbYaIm23A +82g+IGe4z9tRGQ9TAflezVMhT5J3ccu6cpIjjvwDlbxucSmtVi5VtPAMTLmfjYp7VY2Tgr+ T92v7+V96jAfE3Zy2nq52e8RDdUo/F6faxcumdl+aLhhKLXgrozpoe2nL0Nyc2uqFjkjwXXI OBQiaqGeWtxeKJP+O8MIpjyGuHUGzvjNx5S/592TQO3phpT5IFWfMgbu4OreZ9yekDhf7Cvn /fkYsiLDz9W6Clihd/xlpm79+jlhm4E3xBPiQOPCZowmHjx57mXVAypOP2Eu+i2nyQrkapaY IdisDQfWPdNeHNOiPnPS3+GhVlPcqSJAIWnuO7Ofw1ZVOyg/jwARAQABtDRDaHJpc3RpYW4g Qm9ybnRyYWVnZXIgKElCTSkgPGJvcm50cmFlZ2VyQGRlLmlibS5jb20+iQI4BBMBAgAiBQJO nDz4AhsDBgsJCAcDAgYVCAIJCgsEFgIDAQIeAQIXgAAKCRARe7yAtaYcfOYVD/9sqc6ZdYKD bmDIvc2/1LL0g7OgiA8pHJlYN2WHvIhUoZUIqy8Sw2EFny/nlpPVWfG290JizNS2LZ0mCeGZ 80yt0EpQNR8tLVzLSSr0GgoY0lwsKhAnx3p3AOrA8WXsPL6prLAu3yJI5D0ym4MJ6KlYVIjU ppi4NLWz7ncA2nDwiIqk8PBGxsjdc/W767zOOv7117rwhaGHgrJ2tLxoGWj0uoH3ZVhITP1z gqHXYaehPEELDV36WrSKidTarfThCWW0T3y4bH/mjvqi4ji9emp1/pOWs5/fmd4HpKW+44tD Yt4rSJRSa8lsXnZaEPaeY3nkbWPcy3vX6qafIey5d8dc8Uyaan39WslnJFNEx8cCqJrC77kI vcnl65HaW3y48DezrMDH34t3FsNrSVv5fRQ0mbEed8hbn4jguFAjPt4az1xawSp0YvhzwATJ YmZWRMa3LPx/fAxoolq9cNa0UB3D3jmikWktm+Jnp6aPeQ2Db3C0cDyxcOQY/GASYHY3KNra z8iwS7vULyq1lVhOXg1EeSm+lXQ1Ciz3ub3AhzE4c0ASqRrIHloVHBmh4favY4DEFN19Xw1p 76vBu6QjlsJGjvROW3GRKpLGogQTLslbjCdIYyp3AJq2KkoKxqdeQYm0LZXjtAwtRDbDo71C FxS7i/qfvWJv8ie7bE9A6Wsjn7kCDQROnDz4ARAAmPI1e8xB0k23TsEg8O1sBCTXkV8HSEq7 JlWz7SWyM8oFkJqYAB7E1GTXV5UZcr9iurCMKGSTrSu3ermLja4+k0w71pLxws859V+3z1jr nhB3dGzVZEUhCr3EuN0t8eHSLSMyrlPL5qJ11JelnuhToT6535cLOzeTlECc51bp5Xf6/XSx SMQaIU1nDM31R13o98oRPQnvSqOeljc25aflKnVkSfqWSrZmb4b0bcWUFFUKVPfQ5Z6JEcJg Hp7qPXHW7+tJTgmI1iM/BIkDwQ8qe3Wz8R6rfupde+T70NiId1M9w5rdo0JJsjKAPePKOSDo RX1kseJsTZH88wyJ30WuqEqH9zBxif0WtPQUTjz/YgFbmZ8OkB1i+lrBCVHPdcmvathknAxS bXL7j37VmYNyVoXez11zPYm+7LA2rvzP9WxR8bPhJvHLhKGk2kZESiNFzP/E4r4Wo24GT4eh YrDo7GBHN82V4O9JxWZtjpxBBl8bH9PvGWBmOXky7/bP6h96jFu9ZYzVgIkBP3UYW+Pb1a+b w4A83/5ImPwtBrN324bNUxPPqUWNW0ftiR5b81ms/rOcDC/k/VoN1B+IHkXrcBf742VOLID4 YP+CB9GXrwuF5KyQ5zEPCAjlOqZoq1fX/xGSsumfM7d6/OR8lvUPmqHfAzW3s9n4lZOW5Jfx bbkAEQEAAYkCHwQYAQIACQUCTpw8+AIbDAAKCRARe7yAtaYcfPzbD/9WNGVf60oXezNzSVCL hfS36l/zy4iy9H9rUZFmmmlBufWOATjiGAXnn0rr/Jh6Zy9NHuvpe3tyNYZLjB9pHT6mRZX7 Z1vDxeLgMjTv983TQ2hUSlhRSc6e6kGDJyG1WnGQaqymUllCmeC/p9q5m3IRxQrd0skfdN1V AMttRwvipmnMduy5SdNayY2YbhWLQ2wS3XHJ39a7D7SQz+gUQfXgE3pf3FlwbwZhRtVR3z5u aKjxqjybS3Ojimx4NkWjidwOaUVZTqEecBV+QCzi2oDr9+XtEs0m5YGI4v+Y/kHocNBP0myd pF3OoXvcWdTb5atk+OKcc8t4TviKy1WCNujC+yBSq3OM8gbmk6NwCwqhHQzXCibMlVF9hq5a FiJb8p4QKSVyLhM8EM3HtiFqFJSV7F+h+2W0kDyzBGyE0D8z3T+L3MOj3JJJkfCwbEbTpk4f n8zMboekuNruDw1OADRMPlhoWb+g6exBWx/YN4AY9LbE2KuaScONqph5/HvJDsUldcRN3a5V RGIN40QWFVlZvkKIEkzlzqpAyGaRLhXJPv/6tpoQaCQQoSAc5Z9kM/wEd9e2zMeojcWjUXgg oWj8A/wY4UXExGBu+UCzzP/6sQRpBiPFgmqPTytrDo/gsUGqjOudLiHQcMU+uunULYQxVghC syiRa+UVlsKmx1hsEg== Date: Mon, 30 Sep 2019 08:27:39 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: <5ee9164f-71c5-4082-a80d-8fbc5dc50750@redhat.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit X-TM-AS-GCONF: 00 x-cbid: 19093006-0016-0000-0000-000002B1F2F7 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 19093006-0017-0000-0000-00003312CC0A Message-Id: <364ff782-b9d0-5228-0392-ecb61b786dce@de.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:,, definitions=2019-09-30_03:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=7 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1908290000 definitions=main-1909300068 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 28.09.19 11:06, David Hildenbrand wrote: > On 28.09.19 00:17, Alexander Duyck wrote: >> On Fri, Sep 27, 2019 at 2:59 PM Andrew Morton wrote: >>> >>> On Fri, 27 Sep 2019 17:28:06 -0400 Qian Cai wrote: >>> >>>>> >>>>> So I think you've moved the arch_free_page() to be after the final >>>>> thing which can access page contents, yes? If so, we should have a >>>>> comment in free_pages_prepare() to attmept to prevent this problem from >>>>> reoccurring as the code evolves? >>>> >>>> Right, something like this above arch_free_page() there? >>>> >>>> /* >>>> * It needs to be just above kernel_map_pages(), as s390 could mark those >>>> * pages unused and then trigger a fault when accessing. >>>> */ >>> >>> I did this. >>> >>> --- a/mm/page_alloc.c~mm-page_alloc-fix-a-crash-in-free_pages_prepare-fix >>> +++ a/mm/page_alloc.c >>> @@ -1179,7 +1179,13 @@ static __always_inline bool free_pages_p >>> kernel_init_free_pages(page, 1 << order); >>> >>> kernel_poison_pages(page, 1 << order, 0); >>> + /* >>> + * arch_free_page() can make the page's contents inaccessible. s390 >>> + * does this. So nothing which can access the page's contents should >>> + * happen after this. >>> + */ >>> arch_free_page(page, order); >>> + >>> if (debug_pagealloc_enabled()) >>> kernel_map_pages(page, 1 << order, 0); >>> >> >> So the question I would have is what is the state of the page after it >> has been marked unused and then pulled back in? I'm assuming it will >> be all 0s. > > I think this comment relates to the s390x implementation, so I'll try to > explain that. After arch_free_page() the page might have been zapped in > the hypervisor, but that might happen deferred. The guest ends up > triggering the ESSA instruction in arch_free_page(). That instruction > sets some extended-page-table-related ("pgste") bits in the hypervisor > tables for the guest ("gmap") and fills a buffer with these entries. The > page is marked _PGSTE_GPS_USAGE_UNUSED. Yes. And that also means that architecturally it can be 0 or it can contain the old content depending on whether the host has paged that page out or not and how many pages have been marked unused. I am also sure that the implementation of z/VM and KVM do differ in that regard. For example KVM does not make use of the logical zero state but z/VM does. In essence you can consider this like a ballooner that takes away the page lazily. For a writeup of the details see https://www.kernel.org/doc/ols/2006/ols2006v2-pages-321-336.pdf (This also contains additional states that were never merged upstream)