Received: by 2002:a05:6a10:16a7:0:0:0:0 with SMTP id gp39csp246250pxb; Wed, 11 Nov 2020 02:36:42 -0800 (PST) X-Google-Smtp-Source: ABdhPJz/ALydvF0KJTOcClZOcfMlwP/kBkwfwh4cwvTUCZJ12HQrQvpaKrRnNnr6yWngJ5Xe/7hW X-Received: by 2002:a05:6402:1c8e:: with SMTP id cy14mr26922952edb.39.1605091002647; Wed, 11 Nov 2020 02:36:42 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1605091002; cv=none; d=google.com; s=arc-20160816; b=nqT+rB2vBJvpwFTm+vZFTxshyKbX1ScfHfHaYofEaTy6GaGmR0288tFTDE3VIabDcn nYGnKSVgjE3aadPBmairZC5SkfITy+3IEaun77fQBLq/fqutTQEzl9Th6L0xQAiI4gOK pqoupz8UHOCvr95PYTVqUdkYblmCZHBzWO1dAhiFs3rwlhLQOl97QzUMiX6q0hGCZBUY CraOjN+T0+BxcGhds92TviYzXYtw7LrguiUuMT5OzJpnEscAhb72VegXG28P+KEXDLhF 1P8e2Tbf6OFcGpsxS3tIZmJL6JX8aKu3eowxZLDMn31FvvpYphUjFcxL0HMECWOJ5CX2 9/yg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:organization :from:references:cc:to:subject:dkim-signature; bh=nQanxT8X3ELig0XMX5/McNaRarJw9LhbTOfvsbkiJyA=; b=uAA/0gLg/zcfZeoCZnMtGrzeV2Ga/FEfMMJCLF4s4rtf944Y/Edn7Xk2aVy1u9prMO BA95f6NV4Rne7g/2FtBeqHddM7hV45ImInK1HUPSP9MO+hiv0MXnQQysMD9ZJ359ZL8R jd6IEZo/6YDS7KKl6J1HzIca2fEZucEviwFkoGVC74jNQJfJTt9mgmRmUqZbBsYUQMv6 svshVipPHh2/bxIPs4Tuq0ipByiGPd110bJ+Wa9qPqJnQtMueClRhUK7g+N3tpaVO9jm STa1WlPswtG5H7no2IuZQFxa2jcGNjOCkFcuYipFrVWj46rbfXC5XZwx/HZ2uMiYkiaP Watw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=ejkvmBsy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id cw3si1187616edb.149.2020.11.11.02.36.18; Wed, 11 Nov 2020 02:36:42 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=ejkvmBsy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727253AbgKKKci (ORCPT + 99 others); Wed, 11 Nov 2020 05:32:38 -0500 Received: from us-smtp-delivery-124.mimecast.com ([63.128.21.124]:29113 "EHLO us-smtp-delivery-124.mimecast.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726229AbgKKKch (ORCPT ); Wed, 11 Nov 2020 05:32:37 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1605090756; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=nQanxT8X3ELig0XMX5/McNaRarJw9LhbTOfvsbkiJyA=; b=ejkvmBsyGlqVnbMqCGlI/MAup1rOADJ+yNAHh/e+qaAfEFy6kf/2+ZSn8c4ADcXuaZdNmC gtRD+W/FcmUkUrXIvMPOBH0p8/ofP+eHe6zT5VqPcPZBWuam97Ig9VmBfDWXGBAfBaTomj NzFfUoDILCDqh3/2DnANcBp5bloUhrU= Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-84-sLOqzFc1OsW-ZY8MMNDcjw-1; Wed, 11 Nov 2020 05:32:32 -0500 X-MC-Unique: sLOqzFc1OsW-ZY8MMNDcjw-1 Received: from smtp.corp.redhat.com (int-mx05.intmail.prod.int.phx2.redhat.com [10.5.11.15]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx01.redhat.com (Postfix) with ESMTPS id 5E5B3186DD4D; Wed, 11 Nov 2020 10:32:30 +0000 (UTC) Received: from [10.36.114.151] (ovpn-114-151.ams2.redhat.com [10.36.114.151]) by smtp.corp.redhat.com (Postfix) with ESMTP id EF30E7513C; Wed, 11 Nov 2020 10:32:26 +0000 (UTC) Subject: Re: [PATCH v1] mm/page_alloc: clear pages in alloc_contig_pages() with init_on_alloc=1 or __GFP_ZERO To: Michal Hocko Cc: Vlastimil Babka , linux-kernel@vger.kernel.org, linux-mm@kvack.org, Andrew Morton , Alexander Potapenko , Mike Kravetz , Mike Rapoport , Oscar Salvador , Kees Cook , Michael Ellerman References: <20201110193240.25401-1-david@redhat.com> <20201111084738.GT12240@dhcp22.suse.cz> <4ebc711e-7fbc-62aa-b88f-3d6ffa9379ff@redhat.com> <5e104380-c0b1-4911-b484-b6e1e1c46f7d@redhat.com> <20201111102207.GV12240@dhcp22.suse.cz> From: David Hildenbrand Organization: Red Hat GmbH Message-ID: Date: Wed, 11 Nov 2020 11:32:25 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.6.0 MIME-Version: 1.0 In-Reply-To: <20201111102207.GV12240@dhcp22.suse.cz> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.79 on 10.5.11.15 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 11.11.20 11:22, Michal Hocko wrote: > On Wed 11-11-20 11:05:21, David Hildenbrand wrote: >> On 11.11.20 10:58, Vlastimil Babka wrote: >>> On 11/11/20 10:06 AM, David Hildenbrand wrote: >>>> On 11.11.20 09:47, Michal Hocko wrote: >>>>> On Tue 10-11-20 20:32:40, David Hildenbrand wrote: >>>>>> commit 6471384af2a6 ("mm: security: introduce init_on_alloc=1 and >>>>>> init_on_free=1 boot options") resulted with init_on_alloc=1 in all pages >>>>>> leaving the buddy via alloc_pages() and friends to be >>>>>> initialized/cleared/zeroed on allocation. >>>>>> >>>>>> However, the same logic is currently not applied to >>>>>> alloc_contig_pages(): allocated pages leaving the buddy aren't cleared >>>>>> with init_on_alloc=1 and init_on_free=0. Let's also properly clear >>>>>> pages on that allocation path and add support for __GFP_ZERO. >>>>> >>>>> AFAIR we do not have any user for __GFP_ZERO right? Not that this is >>>> >>>> Sorry, I had extended information under "---" but accidentally >>>> regenerated the patch before sending it out. >>>> >>>> __GFP_ZERO is not used yet. It's intended to be used in >>>> https://lkml.kernel.org/r/20201029162718.29910-1-david@redhat.com >>>> and I can move that change into a separate patch if desired. > > OK, it would make sense to add it with its user. > >>>>> harmful but it is better to call that explicitly because a missing >>>>> implementation would be a real problem and as such a bug fix. >>>>> >>>>> I am also not sure handling init_on_free at the higher level is good. >>>>> As we have discussed recently the primary point of this feature is to >>>>> add clearing at very few well defined entry points rather than spill it over >>>>> many places. In this case the entry point for the allocator is >>>>> __isolate_free_page which removes pages from the page allocator. I >>>>> haven't checked how much this is used elsewhere but I would expect >>>>> init_on_alloc to be handled there. >>>> >>>> Well, this is the entry point to our range allocator, which lives in >>>> page_alloc.c - used by actual high-level allocators (CMA, gigantic >>>> pages, etc). It's just a matter of taste where we want to have that >>>> handling exactly inside our allocator. > > Yes I completely agree here. I just believe it should the lowest we can > achieve. > >>> I agree alloc_contig_range() is fine as an entry point. >> >> Thanks, let's see if Michal insists of having this somewhere inside >> isolate_freepages_range() instead. > > It's not that I would be insisting. I am just pointing out that changes > like this one go against the idea of init_on_alloc because it is adding > more special casing and long term more places to be really careful about > when one has to be really careful to not undermine the security aspect > of the feature. I haven't really checked why compaction is not the > problem but I suspect it is the fact that it unconditionally copy the > full page content to the isolated page so there is no way to sneak > any data leak there. That is fine. We should however make that clear by Exactly. > using a special cased function which skips this particular > initialization and make sure everybody else will just do the right thing > without much thinking. I totally agree, but I think we don't have many places where free pages actually leave the buddy besides alloc_pages() and friends (compaction is something special). I agree having a single place to handle that would be preferred. I'll have a look if that can be reworked without doing too much harm / affecting other hot paths. -- Thanks, David / dhildenb