Received: by 2002:ac0:a5a6:0:0:0:0:0 with SMTP id m35-v6csp5734405imm; Wed, 12 Sep 2018 10:15:30 -0700 (PDT) X-Google-Smtp-Source: ANB0VdaVKxVv0kkrS5bGZNVBmOSJ9UtKshs4c54Xt70VmxWUXUqh99vw0q+rvUvtSzoCV6rvxPqq X-Received: by 2002:a62:5302:: with SMTP id h2-v6mr3509581pfb.183.1536772530485; Wed, 12 Sep 2018 10:15:30 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1536772530; cv=none; d=google.com; s=arc-20160816; b=clAk/W02MNgzTulHWLYlnXsEguaxUkkK4czIB7srUVY8nHNypa9dGXsvaqLDYqh7vv T4WVJJWjVsGgBOYmq2CqYJnZdWhQ3yjDSR3/9UFk0Lx4HOEviWuCGApRiiJHX34Irs6O B0bnR49ye63DEdibcZ8PFDw0Y8HIYoNYKADb4InkYE0Jqv0Cg+5GhT6qDui+3Vb1NL0n bHmiHCqSfqfE6IcHNgCFrdeNvw+vCqQePtSOcOnFQoxrrOL8T1EHOoV3yfQiYSl+z3Hd HN6RJsKO3n26QwE3BpiyprS1T3K5RPUfvLvXFQ+iStFsODWAn2RoJ+RM2JOKhjKCqHpI z3NQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=t7MAEJw6n52wHR6/1zW+FYqL3SMYpY8pyuGo/zZQB1c=; b=xcN/k/G1ykKlEcjwDVzKVrbBA+Hw2f9XzWllBUTHLAejkx+C1EzoRrhX9CL3pli1h+ aF2NBnTEJIQwm4SRQypk7d0wT7Zg+B8E803D74Iray9jR8vBRANGLRnK1OJ06D4PQ0vG lIy64X6U6MsuA9tqZVo/BwH4bRZr3noobvYBxKPcwz7usRaLIaB4HhBFdYJVzCF828+W hKbhPTIcqOI9JFehZGZjr3wYrzpZ/IhV8NGoW2nZwvvpWGhCFrGFOZS25TKqnJebAlzR rtCpHwE7JVjeKEA3eP3uCB92l1/RT0T0yZ26ZAvKxK54vO9Bqh/s9ey0tEwm70BTgw5e NzNQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e12-v6si1449739pfn.322.2018.09.12.10.15.15; Wed, 12 Sep 2018 10:15:30 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727739AbeILWUP (ORCPT + 99 others); Wed, 12 Sep 2018 18:20:15 -0400 Received: from mga02.intel.com ([134.134.136.20]:12102 "EHLO mga02.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726428AbeILWUO (ORCPT ); Wed, 12 Sep 2018 18:20:14 -0400 X-Amp-Result: UNSCANNABLE X-Amp-File-Uploaded: False Received: from orsmga002.jf.intel.com ([10.7.209.21]) by orsmga101.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 12 Sep 2018 10:14:45 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.53,365,1531810800"; d="scan'208";a="91077724" Received: from sjchrist-coffee.jf.intel.com (HELO linux.intel.com) ([10.54.74.20]) by orsmga002.jf.intel.com with ESMTP; 12 Sep 2018 10:14:34 -0700 Date: Wed, 12 Sep 2018 10:14:34 -0700 From: Sean Christopherson To: Will Deacon Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org, cpandya@codeaurora.org, toshi.kani@hpe.com, tglx@linutronix.de, mhocko@suse.com, akpm@linux-foundation.org Subject: Re: [PATCH 4/5] lib/ioremap: Ensure phys_addr actually corresponds to a physical address Message-ID: <20180912171434.GA31712@linux.intel.com> References: <1536747974-25875-1-git-send-email-will.deacon@arm.com> <1536747974-25875-5-git-send-email-will.deacon@arm.com> <20180912150939.GA30274@linux.intel.com> <20180912163914.GA16071@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180912163914.GA16071@arm.com> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Sep 12, 2018 at 05:39:14PM +0100, Will Deacon wrote: > Hi Sean, > > Thanks for looking at the patch. > > On Wed, Sep 12, 2018 at 08:09:39AM -0700, Sean Christopherson wrote: > > On Wed, Sep 12, 2018 at 11:26:13AM +0100, Will Deacon wrote: > > > The current ioremap() code uses a phys_addr variable at each level of > > > page table, which is confusingly offset by subtracting the base virtual > > > address being mapped so that adding the current virtual address back on > > > when iterating through the page table entries gives back the corresponding > > > physical address. > > > > > > This is fairly confusing and results in all users of phys_addr having to > > > add the current virtual address back on. Instead, this patch just updates > > > phys_addr when iterating over the page table entries, ensuring that it's > > > always up-to-date and doesn't require explicit offsetting. > > > > > > Cc: Chintan Pandya > > > Cc: Toshi Kani > > > Cc: Thomas Gleixner > > > Cc: Michal Hocko > > > Cc: Andrew Morton > > > Signed-off-by: Will Deacon > > > --- > > > lib/ioremap.c | 28 ++++++++++++---------------- > > > 1 file changed, 12 insertions(+), 16 deletions(-) > > > > > > diff --git a/lib/ioremap.c b/lib/ioremap.c > > > index 6c72764af19c..fc834a59c90c 100644 > > > --- a/lib/ioremap.c > > > +++ b/lib/ioremap.c > > > @@ -101,19 +101,18 @@ static inline int ioremap_pmd_range(pud_t *pud, unsigned long addr, > > > pmd_t *pmd; > > > unsigned long next; > > > > > > - phys_addr -= addr; > > > pmd = pmd_alloc(&init_mm, pud, addr); > > > if (!pmd) > > > return -ENOMEM; > > > do { > > > next = pmd_addr_end(addr, end); > > > > > > - if (ioremap_try_huge_pmd(pmd, addr, next, phys_addr + addr, prot)) > > > + if (ioremap_try_huge_pmd(pmd, addr, next, phys_addr, prot)) > > > continue; > > > > > > - if (ioremap_pte_range(pmd, addr, next, phys_addr + addr, prot)) > > > + if (ioremap_pte_range(pmd, addr, next, phys_addr, prot)) > > > return -ENOMEM; > > > - } while (pmd++, addr = next, addr != end); > > > + } while (pmd++, addr = next, phys_addr += PMD_SIZE, addr != end); > > > > I think bumping phys_addr by PXX_SIZE is wrong if phys_addr and addr > > start unaligned with respect to PXX_SIZE. The addresses must be > > PAGE_ALIGNED, which lets ioremap_pte_range() do a simple calculation, > > but that doesn't hold true for the upper levels, i.e. phys_addr needs > > to be adjusted using an algorithm similar to pxx_addr_end(). > > > > Using a 2mb page as an example (lower 32 bits only): > > > > pxx_size = 0x00020000 > > pxx_mask = 0xfffe0000 > > addr = 0x1000 > > end = 0x00040000 > > phys_addr = 0x1000 > > > > Loop 1: > > addr = 0x1000 > > phys = 0x1000 > > > > Loop 2: > > addr = 0x20000 > > phys = 0x21000 > > Yes, I think you're completely right, however I also don't think this > can happen with the current code (and I've failed to trigger it in my > testing). The virtual addresses allocated for VM_IOREMAP allocations > are aligned to the order of the allocation, which means that the virtual > address at the start of the mapping is aligned such that when we hit the > end of a pXd, we know we've mapped the previous PXD_SIZE bytes. > > Having said that, this is clearly a change from the current code and I > haven't audited architectures other than arm64 (where IOREMAP_MAX_ORDER > corresponds to the maximum size of our huge mappings), so it would be > much better not to introduce this funny behaviour in a patch that aims > to reduce confusion in the first place! > > Fixing this using the pxx_addr_end() macros is a bit strange, since we > don't have a physical end variable (nor do we need one), so perhaps > something like changing the while condition to be: > > do { > ... > } while (pmd++, phys_addr += (next - addr), addr = next, addr != end); > > would do the trick. What do you reckon? LGTM. I like that there isn't a separate calculation for phys_addr's offset.