Received: by 2002:a25:683:0:0:0:0:0 with SMTP id 125csp465142ybg; Fri, 12 Jun 2020 06:20:31 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwezk2SKmyxQ3I0sGMJKKUn8gbZLDo7OdcR30Xf9kzQ8Oubtm/NSiiCsZaKV04adh6xx5uU X-Received: by 2002:a17:906:ecfa:: with SMTP id qt26mr13435857ejb.493.1591968031215; Fri, 12 Jun 2020 06:20:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1591968031; cv=none; d=google.com; s=arc-20160816; b=nB5I5/4DP/QrxFilYMCEajoy3DjU2DTkyf/8wNWvh4s070uhM1Zj0hotfmYK0Moc7i 61x1TtgImBTI6trB9J3Qjl0tw0VvvyA30dgHoj0h9jYjh2RI7pXN+MgA9sqQ5XhlRD7O c5el9KWUQQXF12IHeUw2EAxSmg/Rxy7Y+GcvWREqh1Mc1Z6Yq3z6GIrgFVSnk2iM35r8 ML8QBkj9GoASV1YWQrrbINELqDjzzgaukCcRuSxlUq2M1yoP4kYrBoxSk/ZigDPbMPN/ NEO7cnHqBwCqmK6CSkoZ3y/mVoylN3HVxBL0SsVgM0iE5aovEE8ag8BwgihCs7BWwPYx Vdrg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:references:cc:to:from:subject; bh=AEA0QNHCPJDPE0TgWBd6+OtGLaRZTbRA9nlpsi7Glkw=; b=X7MPNVVrv1XbuYDQfG7XIumreZKaq2tQXszIxZhKPfQsNmcxELhpyBgIcsWc35yKdz 3yqcu+y61KruJ0i4EUN97u3NoR8wtgQwBJYUWfFIaAVWYzv0AhEPSxO2a3BKr1lnOn2d Q0EbglvEGPqB+YT2JbFG9/x0WeSjE5i8GQz/8oaq5osl1hbNEB5WdwyjbRI/nXQamC64 9iwzRxK4nLpq2lE6RDzBvqzG3/zDDn+XH7kJffz5jQC3rOtQqZQ0QbtoU4zdQ1I7LUSH 5NhkyIGXDsVWsaUEIaxJjwpE4sZ9WkiJJhY4Xzdyo3ENzjsrzn+KsWocQckifg99MY3q n5kg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id pg21si3818839ejb.349.2020.06.12.06.20.07; Fri, 12 Jun 2020 06:20:31 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726323AbgFLNRr (ORCPT + 99 others); Fri, 12 Jun 2020 09:17:47 -0400 Received: from relay6-d.mail.gandi.net ([217.70.183.198]:48219 "EHLO relay6-d.mail.gandi.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726053AbgFLNRr (ORCPT ); Fri, 12 Jun 2020 09:17:47 -0400 X-Originating-IP: 90.112.45.105 Received: from [192.168.1.11] (lfbn-gre-1-325-105.w90-112.abo.wanadoo.fr [90.112.45.105]) (Authenticated sender: alex@ghiti.fr) by relay6-d.mail.gandi.net (Postfix) with ESMTPSA id 7929EC0011; Fri, 12 Jun 2020 13:17:43 +0000 (UTC) Subject: Re: [PATCH 0/2] PUD/PGDIR entries for linear mapping From: Alex Ghiti To: Atish Patra Cc: Anup Patel , "linux-kernel@vger.kernel.org List" , Atish Patra , Palmer Dabbelt , Paul Walmsley , linux-riscv References: <20200603153608.30056-1-alex@ghiti.fr> <7ad7057e-fdab-14ef-9bdb-c77ccefd208a@ghiti.fr> <36739fc4-21ea-14f4-f2a6-52614b602dea@ghiti.fr> Message-ID: <8867b8d5-4a15-fbc1-67e1-7fc48be6eae3@ghiti.fr> Date: Fri, 12 Jun 2020 09:17:42 -0400 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.9.0 MIME-Version: 1.0 In-Reply-To: <36739fc4-21ea-14f4-f2a6-52614b602dea@ghiti.fr> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: fr Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Le 6/12/20 à 8:59 AM, Alex Ghiti a écrit : > Hi Atish, > > Le 6/11/20 à 1:29 PM, Atish Patra a écrit : >> On Wed, Jun 10, 2020 at 11:51 PM Alex Ghiti  wrote: >>> Hi Atish, >>> >>> Le 6/10/20 à 2:32 PM, Atish Patra a écrit : >>>> On Wed, Jun 3, 2020 at 8:36 AM Alexandre Ghiti  wrote: >>>>> This small patchset intends to use PUD/PGDIR entries for linear >>>>> mapping >>>>> in order to better utilize TLB. >>>>> >>>>> At the moment, only PMD entries can be used since on common platforms >>>>> (qemu/unleashed), the kernel is loaded at DRAM + 2MB which >>>>> dealigns virtual >>>>> and physical addresses and then prevents the use of PUD/PGDIR >>>>> entries. >>>>> So the kernel must be able to get those 2MB for PAGE_OFFSET to map >>>>> the >>>>> beginning of the DRAM: this is achieved in patch 1. >>>>> >>>> I don't have in depth knowledge of how mm code works so this question >>>> may be a completely >>>> stupid one :). Just for my understanding, >>>> As per my understanding, kernel will map those 2MB of memory but >>>> never use it. >>>> How does the kernel ensure that it doesn't allocate any memory from >>>> those 2MB >>>> memory if it is not marked as reserved? >>> Yes, a 1GB hugepage will cover those 2MB: I rely on the previous boot >>> stage to mark this region >>> as reserved if there is something there (like opensbi). Otherwise, the >>> kernel will indeed try to >>> allocate memory from there :) >>> >> In that case, this patch mandates that the firmware region has to be >> mark "reserved" >> the device tree so that the Linux kernel doesn't try to allocate >> memory from there. >> OpenSBI is already doing it from v0.7. Thus, any user using latest >> OpenSBI can leverage >> this patch for a better TLB utilization. > > > Note that *currently* OpenSBI v0.7 still adds the "no-map" property > which prevents such optimization. > >> However, legacy previous boot stages(BBL) do not reserve this area via >> DT which may >> result in an unexpected crash. I am not sure how many developers still >> use BBL though. >> >> Few general suggestions to tackle this problem: >> 1. This mandatory requirement should be added to the booting document >> so that any other >> SBI implementation is also aware of it. >> 2. You may have to move the patch1 to a separate config so that any >> users of legacy boot stages >> can disable this feature. > > > IMHO, the region occupied by runtime services should be marked as > reserved in the device-tree. So it seems redundant to add this as a > requirement, I would rather consider its absence as a bug. > > Even if I understand that this might break some system, I don't like > the idea of a new config to support old "buggy" bootloaders: when will > we be able to remove it ? We'll never know when people will stop using > those bootloaders, so it will stay here forever...Where can I find the > boot document you are talking about ? Can we simply state here that > this kernel version will not be compatible with those bootloaders > (we'll draw an exhaustive list here) ? Ok, I have just found Documentation/riscv/boot-image-header.rst: could we imagine doing something like incrementing the version and use that as a hint in the kernel not to map the 2MB offset ? That's still legacy, but at least it does not require to recompile a kernel as the check would be done at runtime. > > Alex > > >>> Alex >>> >>> >>>>> But furthermore, at the moment, the firmware (opensbi) explicitly >>>>> asks the >>>>> kernel not to map the region it occupies, which is on those common >>>>> platforms at the very beginning of the DRAM and then it also dealigns >>>>> virtual and physical addresses. I proposed a patch here: >>>>> >>>>> https://github.com/riscv/opensbi/pull/167 >>>>> >>>>> that removes this 'constraint' but *not* all the time as it offers >>>>> some >>>>> kind of protection in case PMP is not available. So sometimes, we may >>>>> have a part of the memory below the kernel that is removed creating a >>>>> misalignment between virtual and physical addresses. So for >>>>> performance >>>>> reasons, we must at least make sure that PMD entries can be used: >>>>> that >>>>> is guaranteed by patch 1 too. >>>>> >>>>> Finally the second patch simply improves best_map_size so that >>>>> whenever >>>>> possible, PUD/PGDIR entries are used. >>>>> >>>>> Below is the kernel page table without this patch on a 6G platform: >>>>> >>>>> ---[ Linear mapping ]--- >>>>> 0xffffc00000000000-0xffffc00176e00000 0x0000000080200000 5998M >>>>> PMD     D A . . . W R V >>>>> >>>>> And with this patchset + opensbi patch: >>>>> >>>>> ---[ Linear mapping ]--- >>>>> 0xffffc00000000000-0xffffc00140000000 0x0000000080000000         >>>>> 5G PUD     D A . . . W R V >>>>> 0xffffc00140000000-0xffffc00177000000 0x00000001c0000000 880M >>>>> PMD     D A . . . W R V >>>>> >>>>> Alexandre Ghiti (2): >>>>>     riscv: Get memory below load_pa while ensuring linear mapping >>>>> is PMD >>>>>       aligned >>>>>     riscv: Use PUD/PGDIR entries for linear mapping when possible >>>>> >>>>>    arch/riscv/include/asm/page.h |  8 ++++ >>>>>    arch/riscv/mm/init.c          | 69 >>>>> +++++++++++++++++++++++++++++------ >>>>>    2 files changed, 65 insertions(+), 12 deletions(-) >>>>> >>>>> -- >>>>> 2.20.1 >>>>> >>>>> >> >