Received: by 2002:ac0:a5a6:0:0:0:0:0 with SMTP id m35-v6csp5594208imm; Wed, 12 Sep 2018 08:15:06 -0700 (PDT) X-Google-Smtp-Source: ANB0Vdb6BFsWL5ahHWU1mRiEEGtGuKJAkZc7fRpCrz2dILV1hqEByf3uMPDBA0xWvidCdkqKqSNH X-Received: by 2002:a63:c702:: with SMTP id n2-v6mr2936870pgg.108.1536765306802; Wed, 12 Sep 2018 08:15:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1536765306; cv=none; d=google.com; s=arc-20160816; b=r2/UI/YZF/sgROARGW9pXLISkhFkJsp6loXZDk12Jb/4XY2eOoFI0rnVpQz1loHXW3 5EHKE51FwGW5EBENMNUug+hX44JgbcbO5vjSxIXv8WTyzo8yoSVCWRN3r9M05ubinr1V 0Ktos2i8XKkkF/eu9J6iVx0ZLH4CPE1MhsQ+tPTWdALJP/xPLe5JdnuHTgzLIcremHUF 2xBNFlaLVBg/lW4eAUxIAPIrNmitBuCpc56FIe/sAY+3q4YRYK3SvDtAAbuJE3P4S0mp fb/dtIKNxj/6oL5P4N1Y+SA1KBnD5qVxjlGEAjrus9qe0CsVgaGUGLGOaisNiKpKXWIu OKlw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=MldDKBW/M9SlPRdoA0LmhBMwD66ryIBnKH8HGEB1Kxk=; b=HBnW2JN42QNmb4G1AUKQx8FqX8riK+cgmOhxsw+5EeyGTGpuqrldG3l+TNFb1ZMTM6 jiacJb3NTDcPPS7TcRMmAjdcs381To6uu5jSoxlOUfJeLWO5w0WXwMk0BJ2kQcvsvU4t hL8Yjah0+Lnf43MLz6yCAY47Hsqn7rb2Ad4UsrnC+HdeHJ1314IWETpOEVrVoxMAQcJI +1xiMZvlvQtqVW695ApvxjgjetSqs5dDB4WKdJQwvIOBcGDY9j+X46zLdLDvYEb/DU6q JLYwmSrxFrRTC0jOYbc89cMYgS5sCBjPf4I5U6rgbvUGL1inKOmaqpB3x09rhwHL7D+I Nn/Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id q132-v6si1294600pfc.159.2018.09.12.08.14.40; Wed, 12 Sep 2018 08:15:06 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727640AbeILUSh (ORCPT + 99 others); Wed, 12 Sep 2018 16:18:37 -0400 Received: from mga12.intel.com ([192.55.52.136]:33077 "EHLO mga12.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726640AbeILUSh (ORCPT ); Wed, 12 Sep 2018 16:18:37 -0400 X-Amp-Result: UNKNOWN X-Amp-Original-Verdict: FILE UNKNOWN X-Amp-File-Uploaded: False Received: from fmsmga004.fm.intel.com ([10.253.24.48]) by fmsmga106.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 12 Sep 2018 08:13:40 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.53,365,1531810800"; d="scan'208";a="88146928" Received: from sjchrist-coffee.jf.intel.com (HELO linux.intel.com) ([10.54.74.20]) by fmsmga004.fm.intel.com with ESMTP; 12 Sep 2018 08:09:39 -0700 Date: Wed, 12 Sep 2018 08:09:39 -0700 From: Sean Christopherson To: Will Deacon Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org, cpandya@codeaurora.org, toshi.kani@hpe.com, tglx@linutronix.de, mhocko@suse.com, akpm@linux-foundation.org Subject: Re: [PATCH 4/5] lib/ioremap: Ensure phys_addr actually corresponds to a physical address Message-ID: <20180912150939.GA30274@linux.intel.com> References: <1536747974-25875-1-git-send-email-will.deacon@arm.com> <1536747974-25875-5-git-send-email-will.deacon@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1536747974-25875-5-git-send-email-will.deacon@arm.com> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Sep 12, 2018 at 11:26:13AM +0100, Will Deacon wrote: > The current ioremap() code uses a phys_addr variable at each level of > page table, which is confusingly offset by subtracting the base virtual > address being mapped so that adding the current virtual address back on > when iterating through the page table entries gives back the corresponding > physical address. > > This is fairly confusing and results in all users of phys_addr having to > add the current virtual address back on. Instead, this patch just updates > phys_addr when iterating over the page table entries, ensuring that it's > always up-to-date and doesn't require explicit offsetting. > > Cc: Chintan Pandya > Cc: Toshi Kani > Cc: Thomas Gleixner > Cc: Michal Hocko > Cc: Andrew Morton > Signed-off-by: Will Deacon > --- > lib/ioremap.c | 28 ++++++++++++---------------- > 1 file changed, 12 insertions(+), 16 deletions(-) > > diff --git a/lib/ioremap.c b/lib/ioremap.c > index 6c72764af19c..fc834a59c90c 100644 > --- a/lib/ioremap.c > +++ b/lib/ioremap.c > @@ -101,19 +101,18 @@ static inline int ioremap_pmd_range(pud_t *pud, unsigned long addr, > pmd_t *pmd; > unsigned long next; > > - phys_addr -= addr; > pmd = pmd_alloc(&init_mm, pud, addr); > if (!pmd) > return -ENOMEM; > do { > next = pmd_addr_end(addr, end); > > - if (ioremap_try_huge_pmd(pmd, addr, next, phys_addr + addr, prot)) > + if (ioremap_try_huge_pmd(pmd, addr, next, phys_addr, prot)) > continue; > > - if (ioremap_pte_range(pmd, addr, next, phys_addr + addr, prot)) > + if (ioremap_pte_range(pmd, addr, next, phys_addr, prot)) > return -ENOMEM; > - } while (pmd++, addr = next, addr != end); > + } while (pmd++, addr = next, phys_addr += PMD_SIZE, addr != end); I think bumping phys_addr by PXX_SIZE is wrong if phys_addr and addr start unaligned with respect to PXX_SIZE. The addresses must be PAGE_ALIGNED, which lets ioremap_pte_range() do a simple calculation, but that doesn't hold true for the upper levels, i.e. phys_addr needs to be adjusted using an algorithm similar to pxx_addr_end(). Using a 2mb page as an example (lower 32 bits only): pxx_size = 0x00020000 pxx_mask = 0xfffe0000 addr = 0x1000 end = 0x00040000 phys_addr = 0x1000 Loop 1: addr = 0x1000 phys = 0x1000 Loop 2: addr = 0x20000 phys = 0x21000 > return 0; > } > > @@ -142,19 +141,18 @@ static inline int ioremap_pud_range(p4d_t *p4d, unsigned long addr, > pud_t *pud; > unsigned long next; > > - phys_addr -= addr; > pud = pud_alloc(&init_mm, p4d, addr); > if (!pud) > return -ENOMEM; > do { > next = pud_addr_end(addr, end); > > - if (ioremap_try_huge_pud(pud, addr, next, phys_addr + addr, prot)) > + if (ioremap_try_huge_pud(pud, addr, next, phys_addr, prot)) > continue; > > - if (ioremap_pmd_range(pud, addr, next, phys_addr + addr, prot)) > + if (ioremap_pmd_range(pud, addr, next, phys_addr, prot)) > return -ENOMEM; > - } while (pud++, addr = next, addr != end); > + } while (pud++, addr = next, phys_addr += PUD_SIZE, addr != end); > return 0; > } > > @@ -164,7 +162,6 @@ static inline int ioremap_p4d_range(pgd_t *pgd, unsigned long addr, > p4d_t *p4d; > unsigned long next; > > - phys_addr -= addr; > p4d = p4d_alloc(&init_mm, pgd, addr); > if (!p4d) > return -ENOMEM; > @@ -173,14 +170,14 @@ static inline int ioremap_p4d_range(pgd_t *pgd, unsigned long addr, > > if (ioremap_p4d_enabled() && > ((next - addr) == P4D_SIZE) && > - IS_ALIGNED(phys_addr + addr, P4D_SIZE)) { > - if (p4d_set_huge(p4d, phys_addr + addr, prot)) > + IS_ALIGNED(phys_addr, P4D_SIZE)) { > + if (p4d_set_huge(p4d, phys_addr, prot)) > continue; > } > > - if (ioremap_pud_range(p4d, addr, next, phys_addr + addr, prot)) > + if (ioremap_pud_range(p4d, addr, next, phys_addr, prot)) > return -ENOMEM; > - } while (p4d++, addr = next, addr != end); > + } while (p4d++, addr = next, phys_addr += P4D_SIZE, addr != end); > return 0; > } > > @@ -196,14 +193,13 @@ int ioremap_page_range(unsigned long addr, > BUG_ON(addr >= end); > > start = addr; > - phys_addr -= addr; > pgd = pgd_offset_k(addr); > do { > next = pgd_addr_end(addr, end); > - err = ioremap_p4d_range(pgd, addr, next, phys_addr+addr, prot); > + err = ioremap_p4d_range(pgd, addr, next, phys_addr, prot); > if (err) > break; > - } while (pgd++, addr = next, addr != end); > + } while (pgd++, addr = next, phys_addr += PGDIR_SIZE, addr != end); > > flush_cache_vmap(start, end); > > -- > 2.1.4 >