Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp6857302ybi; Wed, 31 Jul 2019 23:09:52 -0700 (PDT) X-Google-Smtp-Source: APXvYqy1KakDWdfldsZntPfgCWjpOYUwWVH5DkHvsU/ik2VV30DC2OeVAlFw6Uy98T+lomU6Djw1 X-Received: by 2002:a17:90a:ba93:: with SMTP id t19mr6557609pjr.139.1564639792266; Wed, 31 Jul 2019 23:09:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1564639792; cv=none; d=google.com; s=arc-20160816; b=dXHETkqLluIg3BDbOOtp5BARj4tW1PIElg1oWOLyCD7eVm+n1C5Qp7is8tmWGxApQe JO49Xj2j0OZjAXuYOqp18VruSswE7Xwg9Hlafp5/lB295IXN+Ztawvr34g+ClmQyU6PW Nf18eank0/h4AxwwMlMbaKH9iG3DM6Vu+Q6JGGEXG81LF6pNRbg4tFSPMZM6Ei+oGDpY 9zFZIZdD2XMdJhGlTdUs5a65F5clcm3iew6VBCobcV3IbycjXy7dm+ujAxq7DTaUduGa PQSmN2bE9W9MnancDK+ixcBzj8hjqv7bTgI3JBR7q9wcs5z7NSMNesYymHFY4oRIH+F8 Rqhw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=MU1o63eXjvVOfyHWLw4w2H3Yz2yztz/ZbyQX++989rU=; b=QV7tyEpKDIldoAX+LPaZb8fD9y4fSs1qfAaKB5UA+oiu3DKWgPouCitVf6J8+EO/ps ESmwfrpztjpTmE94iMYk2GSEOUjmjljV/LzkKvLLk7DaDn+o2wGZCKTVEf/dqKHoi+nf 1DNve/PQ4iA/zLqIN5lrvYNS5wXQhu9V/jSLS2BhwcfJWhM1aBq+QCWiNbJp58zTMtIW beQRePfrOSk5vOK6IgHIz0x2MoUNe/rGA+ITSopLrMutbdTPK1e07AzRwG6s2Kgf0j6J YOxUYBT7DqFIwiNCcRgfLl2xW9nh7Z28iw5TZh2LHjKddjhEoziU+Ef5J2dw6Y7LSRTI MESw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m12si33779743plt.413.2019.07.31.23.09.36; Wed, 31 Jul 2019 23:09:52 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728902AbfHAGIy (ORCPT + 99 others); Thu, 1 Aug 2019 02:08:54 -0400 Received: from foss.arm.com ([217.140.110.172]:58624 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725790AbfHAGIy (ORCPT ); Thu, 1 Aug 2019 02:08:54 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 8F57B337; Wed, 31 Jul 2019 23:08:53 -0700 (PDT) Received: from [10.163.1.81] (unknown [10.163.1.81]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id CF2273F694; Wed, 31 Jul 2019 23:10:46 -0700 (PDT) Subject: Re: [PATCH v9 10/21] mm: Add generic p?d_leaf() macros To: Steven Price , Mark Rutland Cc: x86@kernel.org, Arnd Bergmann , Ard Biesheuvel , Peter Zijlstra , Catalin Marinas , Dave Hansen , linux-kernel@vger.kernel.org, linux-mm@kvack.org, =?UTF-8?B?SsOpcsO0bWUgR2xpc3Nl?= , Ingo Molnar , Borislav Petkov , Andy Lutomirski , "H. Peter Anvin" , James Morse , Thomas Gleixner , Will Deacon , Andrew Morton , linux-arm-kernel@lists.infradead.org, "Liang, Kan" References: <20190722154210.42799-1-steven.price@arm.com> <20190722154210.42799-11-steven.price@arm.com> <20190723094113.GA8085@lakrids.cambridge.arm.com> <674bd809-f853-adb0-b1ab-aa4404093083@arm.com> From: Anshuman Khandual Message-ID: <0979d4b4-7a97-2dc3-67cf-3aa6569bfdcd@arm.com> Date: Thu, 1 Aug 2019 11:39:18 +0530 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <674bd809-f853-adb0-b1ab-aa4404093083@arm.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 07/29/2019 05:08 PM, Steven Price wrote: > On 28/07/2019 12:44, Anshuman Khandual wrote: >> >> >> On 07/23/2019 03:11 PM, Mark Rutland wrote: >>> On Mon, Jul 22, 2019 at 04:41:59PM +0100, Steven Price wrote: >>>> Exposing the pud/pgd levels of the page tables to walk_page_range() means >>>> we may come across the exotic large mappings that come with large areas >>>> of contiguous memory (such as the kernel's linear map). >>>> >>>> For architectures that don't provide all p?d_leaf() macros, provide >>>> generic do nothing default that are suitable where there cannot be leaf >>>> pages that that level. >>>> >>>> Signed-off-by: Steven Price >>> >>> Not a big deal, but it would probably make sense for this to be patch 1 >>> in the series, given it defines the semantic of p?d_leaf(), and they're >>> not used until we provide all the architectural implemetnations anyway. >> >> Agreed. >> >>> >>> It might also be worth pointing out the reasons for this naming, e.g. >>> p?d_large() aren't currently generic, and this name minimizes potential >>> confusion between p?d_{large,huge}(). >> >> Agreed. But these fallback also need to first check non-availability of large >> pages. I am not sure whether CONFIG_HUGETLB_PAGE config being clear indicates >> that conclusively or not. Being a page table leaf entry has a broader meaning >> than a large page but that is really not the case today. All leaf entries here >> are large page entries from MMU perspective. This dependency can definitely be >> removed when there are other types of leaf entries but for now IMHO it feels >> bit problematic not to directly associate leaf entries with large pages in >> config restriction while doing exactly the same. > > The intention here is that the page walkers are able to walk any type of > page table entry which the kernel may use. CONFIG_HUGETLB_PAGE only > controls whether "huge TLB pages" are used by user space processes. It's > quite possible that option to not be selected but the linear mapping to > have been mapped using "large pages" (i.e. leaf entries further up the > tree than normal). I understand that kernel page table might use large pages where as user space never enabled HugeTLB. The point to make here was CONFIG_HUGETLB approximately indicates the presence of large pages though the absence of same does not conclusively indicate that large pages are really absent on the MMU. Perhaps it will requires something new like MMU_[LARGE|HUGE]_PAGES. > > One of the goals was to avoid tying the new functions to a configuration > option but instead match the hardware architecture. Of course this isn't > possible in the most general case (e.g. an architecture may have > multiple hardware page table formats). But to the extent that other > functions like p?d_none() work the desire is that p?d_leaf() should also > work. It is fair enough to assume that a platform can decide wisely and provide accurate definition for p?d_leaf() functions. Anyways its okay not to make this more complex by tying with a new config option which does not exist.