Received: by 2002:a05:7412:b795:b0:e2:908c:2ebd with SMTP id iv21csp482794rdb; Thu, 2 Nov 2023 09:04:20 -0700 (PDT) X-Google-Smtp-Source: AGHT+IFcoTV4S8ZmehKfWiyWat6ectixJS/+oARCx3X7D22KVZeXXoLcARfEdSATD7PDRvGA2jtz X-Received: by 2002:a17:902:e80f:b0:1cc:5a1c:3da0 with SMTP id u15-20020a170902e80f00b001cc5a1c3da0mr12293893plg.63.1698941060092; Thu, 02 Nov 2023 09:04:20 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1698941060; cv=none; d=google.com; s=arc-20160816; b=XLZHaNylvBxoVxsaFCw1Oj0NTCVIUcg9K956vP9RkAk1UIF39JBgqHfUCfzvDHKhiO GVgLXwYGLTb4SaTtI82dFfUqGSFRvv4GwyQSQrWidj7evPuOcHGgBIfaSCc0kx6Thd0Z +ZeeQ+65bjvaBweScTbOncZQkeHu+1W7L4tz5bU0B5emVQcUqC0+8zBIVtpubiNAOcCn BqKzMAUb41KE/CzymrkS4VXiVfuKpITlILZ107UAc8DHHStETJLLLBCZe3ntu1GTkyUK gm4mMh3e43O4fdrCtNOYDGPgr5xbpeqFoTu9WyPWDgFnRaEcu46JbPGgJe8QYvo7rKwq nrsA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:autocrypt :from:references:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=cKI1bO012pJuVEiLnCGm3LvLoCf6xh2uqrQS8U+D6Hc=; fh=WIWQNI4KQR1CK7EF2pl8sVTiZ7VXjibUi9AV3wKPpeI=; b=I4nqn6KjTpdgIhtHVUr25rWzEC8Eap3iyQwbGRq5yyOdrQhzJzR/gtXVccTZj+IljK 507wGO4KRpwC22zs37v9GvLteXqopHg1bqMMms+zA7LlfP6ASZBXj7SH8yeLzeIe9qB9 q1Sr7/BKH+TXxZjLP9Wfl8F2Nx83YP1JqtWpynz2Wiza1ck8PUzEKbyK8J6+oL+FdwPL ZoUvyjhYMsRSytNTrlib0hv6rVLEJkk678GohNG7TzqRfJUY+qoNglNW4QJ/16j06++/ MMSaYvwojybPZJnwKLRYYysSpHkCkQCMj69dyPwnKz4QZzzMD0by6fQ4kMXgjZwFgz0g I90g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b="eh9ZDmz/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.33 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from lipwig.vger.email (lipwig.vger.email. [23.128.96.33]) by mx.google.com with ESMTPS id i3-20020a170902c94300b001c60d334996si69316pla.622.2023.11.02.09.04.16 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 02 Nov 2023 09:04:20 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.33 as permitted sender) client-ip=23.128.96.33; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b="eh9ZDmz/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.33 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by lipwig.vger.email (Postfix) with ESMTP id 0FCB78183ED4; Thu, 2 Nov 2023 09:04:12 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at lipwig.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1377076AbjKBQDo (ORCPT + 99 others); Thu, 2 Nov 2023 12:03:44 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50224 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235160AbjKBQDl (ORCPT ); Thu, 2 Nov 2023 12:03:41 -0400 Received: from mgamail.intel.com (mgamail.intel.com [134.134.136.24]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E38A0DE for ; Thu, 2 Nov 2023 09:03:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1698941015; x=1730477015; h=message-id:date:mime-version:subject:to:references:from: in-reply-to:content-transfer-encoding; bh=oHS7cOg81nqV+G8kRWYjFbHqFT61vd7Hu+Ov25Z7XEQ=; b=eh9ZDmz/nUOCwyv43ZHGlBdDnpJrT5YhlbMvPF/aEztwYogjZTft5gU8 aJ/dVnwjvMAc95z/sS+Q7MJmqsSWRI5RWAZhOyCegnkYEfPEk886x+Ol0 joLvcAZFTFEZBs9MCMVMXgAZBQUPpPQ43pPCbnO2FIPynzlffFsj3IySi wJDsQyrhSfG2UetXXm3vWxQr8Fe/R9UVau6KpJgZgsx/vHoLSDosXveg/ gsRzDiBxh8UahODFxleHcyolNB1MSowlj49Y+5yBBeTKkKLL5jQ+ia/xu syL/5L0V6gL1fZxtHO4N47hLmimA1pepVdc1cq3ISCTBcpgHPVbN78Afg g==; X-IronPort-AV: E=McAfee;i="6600,9927,10882"; a="391611539" X-IronPort-AV: E=Sophos;i="6.03,272,1694761200"; d="scan'208";a="391611539" Received: from fmsmga006.fm.intel.com ([10.253.24.20]) by orsmga102.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 02 Nov 2023 09:02:45 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10882"; a="1008508334" X-IronPort-AV: E=Sophos;i="6.03,272,1694761200"; d="scan'208";a="1008508334" Received: from kookjinl-mobl.amr.corp.intel.com (HELO [10.212.164.123]) ([10.212.164.123]) by fmsmga006-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 02 Nov 2023 09:02:44 -0700 Message-ID: Date: Thu, 2 Nov 2023 09:02:44 -0700 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH] x86/mm/ident_map: Use gbpages only where full GB page should be mapped. Content-Language: en-US To: Steve Wahl , rja_direct@groups.int.hpe.com, Dave Hansen , Andy Lutomirski , Peter Zijlstra , Thomas Gleixner , Ingo Molnar , Borislav Petkov , x86@kernel.org, "H. Peter Anvin" , linux-kernel@vger.kernel.org References: <20231031195049.2075561-1-steve.wahl@hpe.com> From: Dave Hansen Autocrypt: addr=dave.hansen@intel.com; keydata= xsFNBE6HMP0BEADIMA3XYkQfF3dwHlj58Yjsc4E5y5G67cfbt8dvaUq2fx1lR0K9h1bOI6fC oAiUXvGAOxPDsB/P6UEOISPpLl5IuYsSwAeZGkdQ5g6m1xq7AlDJQZddhr/1DC/nMVa/2BoY 2UnKuZuSBu7lgOE193+7Uks3416N2hTkyKUSNkduyoZ9F5twiBhxPJwPtn/wnch6n5RsoXsb ygOEDxLEsSk/7eyFycjE+btUtAWZtx+HseyaGfqkZK0Z9bT1lsaHecmB203xShwCPT49Blxz VOab8668QpaEOdLGhtvrVYVK7x4skyT3nGWcgDCl5/Vp3TWA4K+IofwvXzX2ON/Mj7aQwf5W iC+3nWC7q0uxKwwsddJ0Nu+dpA/UORQWa1NiAftEoSpk5+nUUi0WE+5DRm0H+TXKBWMGNCFn c6+EKg5zQaa8KqymHcOrSXNPmzJuXvDQ8uj2J8XuzCZfK4uy1+YdIr0yyEMI7mdh4KX50LO1 pmowEqDh7dLShTOif/7UtQYrzYq9cPnjU2ZW4qd5Qz2joSGTG9eCXLz5PRe5SqHxv6ljk8mb ApNuY7bOXO/A7T2j5RwXIlcmssqIjBcxsRRoIbpCwWWGjkYjzYCjgsNFL6rt4OL11OUF37wL QcTl7fbCGv53KfKPdYD5hcbguLKi/aCccJK18ZwNjFhqr4MliQARAQABzUVEYXZpZCBDaHJp c3RvcGhlciBIYW5zZW4gKEludGVsIFdvcmsgQWRkcmVzcykgPGRhdmUuaGFuc2VuQGludGVs LmNvbT7CwXgEEwECACIFAlQ+9J0CGwMGCwkIBwMCBhUIAgkKCwQWAgMBAh4BAheAAAoJEGg1 lTBwyZKwLZUP/0dnbhDc229u2u6WtK1s1cSd9WsflGXGagkR6liJ4um3XCfYWDHvIdkHYC1t MNcVHFBwmQkawxsYvgO8kXT3SaFZe4ISfB4K4CL2qp4JO+nJdlFUbZI7cz/Td9z8nHjMcWYF IQuTsWOLs/LBMTs+ANumibtw6UkiGVD3dfHJAOPNApjVr+M0P/lVmTeP8w0uVcd2syiaU5jB aht9CYATn+ytFGWZnBEEQFnqcibIaOrmoBLu2b3fKJEd8Jp7NHDSIdrvrMjYynmc6sZKUqH2 I1qOevaa8jUg7wlLJAWGfIqnu85kkqrVOkbNbk4TPub7VOqA6qG5GCNEIv6ZY7HLYd/vAkVY E8Plzq/NwLAuOWxvGrOl7OPuwVeR4hBDfcrNb990MFPpjGgACzAZyjdmYoMu8j3/MAEW4P0z F5+EYJAOZ+z212y1pchNNauehORXgjrNKsZwxwKpPY9qb84E3O9KYpwfATsqOoQ6tTgr+1BR CCwP712H+E9U5HJ0iibN/CDZFVPL1bRerHziuwuQuvE0qWg0+0SChFe9oq0KAwEkVs6ZDMB2 P16MieEEQ6StQRlvy2YBv80L1TMl3T90Bo1UUn6ARXEpcbFE0/aORH/jEXcRteb+vuik5UGY 5TsyLYdPur3TXm7XDBdmmyQVJjnJKYK9AQxj95KlXLVO38lczsFNBFRjzmoBEACyAxbvUEhd GDGNg0JhDdezyTdN8C9BFsdxyTLnSH31NRiyp1QtuxvcqGZjb2trDVuCbIzRrgMZLVgo3upr MIOx1CXEgmn23Zhh0EpdVHM8IKx9Z7V0r+rrpRWFE8/wQZngKYVi49PGoZj50ZEifEJ5qn/H Nsp2+Y+bTUjDdgWMATg9DiFMyv8fvoqgNsNyrrZTnSgoLzdxr89FGHZCoSoAK8gfgFHuO54B lI8QOfPDG9WDPJ66HCodjTlBEr/Cwq6GruxS5i2Y33YVqxvFvDa1tUtl+iJ2SWKS9kCai2DR 3BwVONJEYSDQaven/EHMlY1q8Vln3lGPsS11vSUK3QcNJjmrgYxH5KsVsf6PNRj9mp8Z1kIG qjRx08+nnyStWC0gZH6NrYyS9rpqH3j+hA2WcI7De51L4Rv9pFwzp161mvtc6eC/GxaiUGuH BNAVP0PY0fqvIC68p3rLIAW3f97uv4ce2RSQ7LbsPsimOeCo/5vgS6YQsj83E+AipPr09Caj 0hloj+hFoqiticNpmsxdWKoOsV0PftcQvBCCYuhKbZV9s5hjt9qn8CE86A5g5KqDf83Fxqm/ vXKgHNFHE5zgXGZnrmaf6resQzbvJHO0Fb0CcIohzrpPaL3YepcLDoCCgElGMGQjdCcSQ+Ci FCRl0Bvyj1YZUql+ZkptgGjikQARAQABwsFfBBgBAgAJBQJUY85qAhsMAAoJEGg1lTBwyZKw l4IQAIKHs/9po4spZDFyfDjunimEhVHqlUt7ggR1Hsl/tkvTSze8pI1P6dGp2XW6AnH1iayn yRcoyT0ZJ+Zmm4xAH1zqKjWplzqdb/dO28qk0bPso8+1oPO8oDhLm1+tY+cOvufXkBTm+whm +AyNTjaCRt6aSMnA/QHVGSJ8grrTJCoACVNhnXg/R0g90g8iV8Q+IBZyDkG0tBThaDdw1B2l asInUTeb9EiVfL/Zjdg5VWiF9LL7iS+9hTeVdR09vThQ/DhVbCNxVk+DtyBHsjOKifrVsYep WpRGBIAu3bK8eXtyvrw1igWTNs2wazJ71+0z2jMzbclKAyRHKU9JdN6Hkkgr2nPb561yjcB8 sIq1pFXKyO+nKy6SZYxOvHxCcjk2fkw6UmPU6/j/nQlj2lfOAgNVKuDLothIxzi8pndB8Jju KktE5HJqUUMXePkAYIxEQ0mMc8Po7tuXdejgPMwgP7x65xtfEqI0RuzbUioFltsp1jUaRwQZ MTsCeQDdjpgHsj+P2ZDeEKCbma4m6Ez/YWs4+zDm1X8uZDkZcfQlD9NldbKDJEXLIjYWo1PH hYepSffIWPyvBMBTW2W5FRjJ4vLRrJSUoEfJuPQ3vW9Y73foyo/qFoURHO48AinGPZ7PC7TF vUaNOTjKedrqHkaOcqB185ahG2had0xnFsDPlx5y In-Reply-To: <20231031195049.2075561-1-steve.wahl@hpe.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-1.3 required=5.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE, URIBL_BLOCKED autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lipwig.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (lipwig.vger.email [0.0.0.0]); Thu, 02 Nov 2023 09:04:12 -0700 (PDT) On 10/31/23 12:50, Steve Wahl wrote: > Instead of using gbpages for all memory regions, use them only when > map creation requests include the full GB page of space; descend to > using smaller 2M pages when only portions of a GB page are requested. ... The logic here is sound: we obviously don't want systems rebooting because speculation went wonky. > diff --git a/arch/x86/mm/ident_map.c b/arch/x86/mm/ident_map.c > index 968d7005f4a7..b63a1ffcfe9f 100644 > --- a/arch/x86/mm/ident_map.c > +++ b/arch/x86/mm/ident_map.c > @@ -31,18 +31,26 @@ static int ident_pud_init(struct x86_mapping_info *info, pud_t *pud_page, > if (next > end) > next = end; > > - if (info->direct_gbpages) { > + /* > + * if gbpages allowed, this entry not yet present, and > + * the full gbpage range is requested (both ends are > + * correctly aligned), create a gbpage. > + */ > + if (info->direct_gbpages > + && !pud_present(*pud) > + && !(addr & ~PUD_MASK) > + && !(next & ~PUD_MASK)) { This is a _bit_ too subtle for my taste. Let's say someone asks for mapping of 2MB@5G, then later asks for 1G@5G. The first call in here will do the 2MB mapping with small (pud) entries. The second call will see the new pud_present() check and *also* skip large pud creation. That's a regression. It might not matter _much_, but it's a place where the old code creates large PUDs and the new code creates small ones. It's the kind of behavior change that at least needs to be noted in the changelog. Off the top of my head, I can't think of why we'd get overlapping requests in here, though. Did you think through that case? Is it common? > pud_t pudval; > > - if (pud_present(*pud)) > - continue; > - > - addr &= PUD_MASK; > pudval = __pud((addr - info->offset) | info->page_flag); > set_pud(pud, pudval); > continue; > } > > + /* if this is already a gbpage, this portion is already mapped */ > + if (pud_large(*pud)) > + continue; I'd probably move this check up to above the large PUD creation code. It would make it more obvious that any PUD that's encountered down below is a small PUD. > if (pud_present(*pud)) { > pmd = pmd_offset(pud, 0); > ident_pmd_init(info, pmd, addr, next); That would end up looking something like this: bool do_gbpages = true; ... // Is the whole range already mapped? if (pud_large(*pud)) continue; /* PUD is either empty or small */ // GB pages allowed? do_gbpages &= info->direct_gbpages; // Addresses aligned for GB pages? do_gbpages &= ~(addr & ~PUD_MASK); do_gbpages &= ~(next & ~PUD_MASK); // Check for existing mapping using a small PUD do_gbpages &= !pud_present(*pud); if (do_gbpages) { set_pud(pud, __pud((addr - info->offset) | info->page_flag)); continue }