Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp3267386ybz; Sun, 3 May 2020 22:35:49 -0700 (PDT) X-Google-Smtp-Source: APiQypJwa9s4PSSO+vg3Dh4WsZyrmycv6Se9lGp1PEYhCtayT1i5uhG7gxzm9Y/QcLalWJRuTfHS X-Received: by 2002:a17:906:1cd5:: with SMTP id i21mr13405552ejh.251.1588570549532; Sun, 03 May 2020 22:35:49 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1588570549; cv=none; d=google.com; s=arc-20160816; b=dVE6px8k1DrtNayt9whhWSEBXT2l4seB8/9KoVjd1U4Q1ZfAwTslBvVffYoc6l3Dmj lhMzZLwR5qx0zaD5mkpj4+du7ENeCNGr2u0bMSDZIplU5nf0jSiMhLT66H6eojUzhe5I 6inbXUFqex/lokPhNjYYHlScYD4Q/72WDBi3plrgI1HrIEmWR0tXwcUU9cEEeZLdmA5S Dc5Mbd1reV0TJcmkeFaag4L+DoYLjHP9DFExhrWUO9vWYyAGVPQn2y5b4arWEqnitL2s wuobLfKp+/fq5+K2lzVlQbPM5xnP6uLwx0Tc53Irj9dUu7AtfjLxJtbPNVvZtKeodm+f Vxaw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=OUb9VXiV5aR1MQjTJBcENt5+Tr4dl0bHfe8n4Dc2h6I=; b=QWY83zRtoJklvydaGKbzhWaliPU86+nAUCNOAZTH5NxXuAmI6h/oog0mwxHgfXz78e CRsTqhsuFXmirxdQvubXYSsTfSTyX0nUKO+gZY2czrYJ7Y7+fYl4rwVzhG4mgh73UCeT pwWjRgCtwG9+k7r5qv/b345DfQCmMu7l62qKtR6hzUHu2nB5sWfxHPZvfNcU3IPCqt5S cmFpw700/JdPi98/mk/UqE7d1bGfLx0Xw25o2iyUKkj14/i7/h2awQ+Uce/N6+X3O5s3 QXpd1hgODr6DL/+W5fcHqZBqY9iyVzAwXAFpJzs+Se7/mXkAXfYv3e1VKo6Dnvo4HPY/ j5OQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id o16si5981986edr.91.2020.05.03.22.35.26; Sun, 03 May 2020 22:35:49 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727783AbgEDFeH (ORCPT + 99 others); Mon, 4 May 2020 01:34:07 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53508 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725859AbgEDFeH (ORCPT ); Mon, 4 May 2020 01:34:07 -0400 Received: from ZenIV.linux.org.uk (zeniv.linux.org.uk [IPv6:2002:c35c:fd02::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 0A94EC061A0E; Sun, 3 May 2020 22:34:07 -0700 (PDT) Received: from viro by ZenIV.linux.org.uk with local (Exim 4.92.3 #3 (Red Hat Linux)) id 1jVTjp-000ZHp-A5; Mon, 04 May 2020 05:33:57 +0000 Date: Mon, 4 May 2020 06:33:57 +0100 From: Al Viro To: Ira Weiny Cc: linux-kernel@vger.kernel.org, Andrew Morton , Christian Koenig , Huang Rui , Thomas Bogendoerfer , "James E.J. Bottomley" , Helge Deller , Benjamin Herrenschmidt , Paul Mackerras , "David S. Miller" , Thomas Gleixner , Ingo Molnar , Borislav Petkov , x86@kernel.org, "H. Peter Anvin" , Dave Hansen , Andy Lutomirski , Peter Zijlstra , Chris Zankel , Max Filippov , Dan Williams , linux-snps-arc@lists.infradead.org, linux-arm-kernel@lists.infradead.org, linux-csky@vger.kernel.org, linux-mips@vger.kernel.org, linux-parisc@vger.kernel.org, linuxppc-dev@lists.ozlabs.org, sparclinux@vger.kernel.org, linux-xtensa@linux-xtensa.org, dri-devel@lists.freedesktop.org Subject: Re: [PATCH V2 00/11] Subject: Remove duplicated kmap code Message-ID: <20200504053357.GV23230@ZenIV.linux.org.uk> References: <20200504010912.982044-1-ira.weiny@intel.com> <20200504013509.GU23230@ZenIV.linux.org.uk> <20200504050447.GA979899@iweiny-DESK2.sc.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200504050447.GA979899@iweiny-DESK2.sc.intel.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, May 03, 2020 at 10:04:47PM -0700, Ira Weiny wrote: > Grepping for 'asm/highmem.h' and investigations don't reveal any issues... But > you do have me worried. That said 0-day has been crunching on multiple > versions of this series without issues such as this (save the mips issue > above). > > I have to say it would be nice if the relation between linux/highmem.h and > asm/highmem.h was more straightforward. IIRC, the headache was due to asm/pgtable.h on several architectures and asm/cacheflush.h on parisc. || IOW, there's one in linux/highmem.h (conditional on !CONFIG_HIGHMEM, || !ARCH_HAS_KMAP) and several per-architecture variants, usually declared in || their asm/highmem.h. In three of those (microblaze, parisc and powerpc) these || are inlines (parisc one identical to linux/highmem.h, lives in asm/cacheflush.h, || powerpc and microblaze ones calling kmap_atomic_prot() which is defined in || arch/$ARCH/mm/highmem.c). || || parisc case is weird - essentially, they want to call || flush_kernel_dcache_page_addr() at the places where kunmap/kunmap_atomic || is done. And they do so despite not selecting HIGHMEM, with definitions || in usual place. They do have ARCH_HAS_KMAP defined, which prevents || getting buggered in linux/highmem.h. ARCH_HAS_KMAP is parisc-unique, || BTW, and checked only in linux/highmem.h. || || All genuine arch-specific variants are defined in (or call functions || defined in) translation units that are only included CONFIG_HIGHMEM builds. || || It would be tempting to consolidate those, e.g. by adding __kmap_atomic() || and __kmap_atomic_prot() without that boilerplate, with universal kmap_atomic() || and kmap_atomic_prot() in linux/highmem.h. Potential problem with that would || be something that pulls ash/highmem.h (or asm/cacheflush.h in case of parisc) || directly and uses kmap_atomic/kmap_atomic_prot. There's not a lot places || pulling asm/highmem.h, and many of them are not even in includes: || || arch/arm/include/asm/efi.h:13:#include || arch/arm/mm/dma-mapping.c:31:#include || arch/arm/mm/flush.c:14:#include || arch/arm/mm/mmu.c:27:#include || arch/mips/include/asm/pgtable-32.h:22:#include || arch/mips/mm/cache.c:19:#include || arch/mips/mm/fault.c:28:#include /* For VMALLOC_END */ || arch/nds32/include/asm/pgtable.h:60:#include || arch/x86/kernel/setup_percpu.c:20:#include || include/linux/highmem.h:35:#include || || Users of asm/cacheflush.h are rather more numerous; however, anything || outside of parisc-specific code has to pull linux/highmem.h, or it won't see || the definitions of kmap_atomic/kmap_atomic_prot at all. arch/parisc itself || has no callers of those. || || Outside of arch/* there is a plenty of callers. However, the following is || true: all instances of kmap_atomic or kmap_atomic_prot outside of arch/* || are either inside the linux/highmem.h or are preceded by include of || linux/highmem.h on any build that sees them (there is a common include || chain that is conditional upon CONFIG_BLOCK, but it's only needed in || drivers that are BLOCK-dependent). It was not fun to verify, to put || it mildly... || || So for parisc we have no problem - leaving __kmap_atomic()/__kmap_atomic_prot() || in asm/cachefile.h and adding universal wrappers in linux/highmem.h will be || fine. For other architectures the things might be trickier. || || * arc: all users in arch/arc/ are within arch/arc/mm/{cache,highmem}.c; || both pull linux/highmem.h. We are fine. || || * arm: much, much worse. We have several files that pull linux/highmem.h: || arch/arm/mm/cache-feroceon-l2.c, arch/arm/mm/cache-xsc3l2.c, || arch/arm/mm/copypage-*.c, arch/arm/mm/dma-mapping.c, arch/arm/mm/flush.c, || arch/arm/mm/highmem.c, arch/arm/probes/uprobes/core.c, || arch/arm/include/asm/kvm_mmu.h (kmap_atomic_pfn()). || Those are fine, but we also have this: || arch/arm/include/asm/pgtable.h:200:#define __pte_map(pmd) (pte_t *)kmap_atomic(pmd_page(*(pmd))) || arch/arm/include/asm/pgtable.h:208:#define pte_offset_map(pmd,addr) (__pte_map(pmd) + pte_index(addr)) || and sure as hell, asm/pgtable.h does *NOT* pull linux/highmem.h. || || Fortunately, the users of pte_offset_map() (__pte_map() has no other users) || are few, both in arch/arm and outside of arch. All arm ones are pulling || linux/highmem (arch/arm/mm/{pgd,fault*}.c). Outside of arch we have several || that pull highmem.h (by way of rmap.h or pagemap.h, usually): || fs/userfaultfd.c, mm/gup.c, mm/hmm.c, mm/huge_memory.c, || mm/khugepaged.c, mm/memory-failure.c, mm/memory.c, mm/migrate.c, || mm/mremap.c, mm/page_vma_mapped.c, mm/swap_state.c, mm/swapfile.c, || mm/vmalloc.c || and then there are these in linux/mm.h: || || #define pte_offset_map_lock(mm, pmd, address, ptlp) \ || ({ \ || spinlock_t *__ptl = pte_lockptr(mm, pmd); \ || pte_t *__pte = pte_offset_map(pmd, address); \ || *(ptlp) = __ptl; \ || spin_lock(__ptl); \ || __pte; \ || }) || #define pte_alloc_map(mm, pmd, address) \ || (pte_alloc(mm, pmd) ? NULL : pte_offset_map(pmd, address)) || #define pte_alloc_map_lock(mm, pmd, address, ptlp) \ || (pte_alloc(mm, pmd) ? \ || NULL : pte_offset_map_lock(mm, pmd, address, ptlp)) || || These have two users in arch/arm (arch/arm/mm/pgd.c and || arch/arm/lib/uaccess_with_memcpy.c, both pulling highmem.h). Outside of || arch there are several new files (plus a lot of what we'd already seen || in mm/*.c, unsurprisingly): || fs/proc/task_mmu.c, mm/ksm.c, mm/madvise.c, mm/memcontrol.c, || mm/mempolicy.c, mm/mincore.c, mm/mprotect.c, mm/pagewalk.c, || mm/shmem.c, mm/userfaultfd.c, || all pulling linux/highmem.h, as pretty much all core VM does. So we are || still fine. || || * csky: users in arch/csky/abiv2/cacheflush.c, arch/csky/mm/dma-mapping.c, || arch/csky/mm/highmem.c, all pulling linux/highmem.h || || * microblaze: users in arch/microblaze/mm/highmem.c (pulls linux/highmem.h) and, || arch/microblaze/include/asm/pgtable.h, this: || #define pte_offset_map(dir, addr) \ || ((pte_t *) kmap_atomic(pmd_page(*(dir))) + pte_index(addr)) || One pte_offset_map user in arch/microblaze: || arch/microblaze/kernel/signal.c:207: ptep = pte_offset_map(pmdp, address); || Messy, but doesn't require any changes (we have asm/pgalloc.h included || there, and that pull linux/highmem.h). || Outside of arch we'd already sorted it out when looking at arm. || || * mips: users in arch/mips/kernel/crash_dump.c, arch/mips/kernel/uprobes.c, || arch/mips/mm/c-r4k.c, arch/mips/mm/dma-noncoherent.c, arch/mips/mm/highmem.c, || and arch/mips/mm/init.c (all pulling linux/highmem.h) plus this || arch/mips/mm/cache.c, which relies upon asm/highmem.h. This can be switched || to linux/highmem.h. On !CONFIG_HIGHMEM builds the call of kmap_atomic() in || there is eliminated, since it's conditional upon PageHighMem(). IOW, even || though we get a call of (inexistent) out-of-line version, it's not going to || survive into object file. With linux/highmem.h use it will be an equally || eliminated call of inlined version. || XXX: arch/mips/mm/cache.c || || * nds32: users in arch/nds32/kernel/dma.c, arch/nds32/mm/cacheflush.c and || arch/nds32/mm/highmem.c, all pulling linux/highmem.h || || * powerpc: users in arch/powerpc/kvm/book3s_pr.c, || arch/powerpc/kvm/e500_mmu_host.c, arch/powerpc/mm/dma-noncoherent.c, || arch/powerpc/mm/highmem.c and arch/powerpc/mm/mem.c, all pulling || linux/highmem.h, a user in arch/powerpc/mm/hugetlbpage.c pulling it || via asm/tlb.h -> linux/pagemap.h -> linux/highmem.h and || macros for pte_offset_map in arch/powerpc/include/asm/*/32/pgtable.h. || Users of that within arch/powerpc are either 64bit-only or || pull linux/highmem.h (arch/powerpc/mm/pgtable_32.c and || arch/powerpc/xmon/xmon.c). Users outside of arch - same as for arm. || || * sparc: users in arch/sparc/kernel/uprobes.c and arch/sparc/mm/highmem.c || (both pulling linux/highmem.h directly) + arch/sparc/mm/init_64.c pulling || it via linux/pagemap.h. Strangely, arch/sparc/mm/io-unit.c and || arch/sparc/mm/iommu.c both include linux/highmem.h with odd comment || that seems to indicate that once upon a time pte_offset_map() used to || requite kmap_atomic() there... Right, it used to - until 2002. || These includes are pointless, then... || || * x86: users in arch/x86/kernel/crash_dump_32.c, arch/x86/kvm/svm.c, || arch/x86/lib/usercopy_64.c, arch/x86/mm/highmem_32.c and arch/x86/mm/iomap_32.c, || all pulling linux/highmem.h, users in paging_tmpl.h (included from || arch/x86/kvm/mmu/mmu.c, which has pulled linux/highmem.h prior to that) || and definition of pte_offset_map() (in asm/pgtable_32.h) || Users of pte_offset_map() and friends in arch/x86 are in || arch/x86/kernel/vm86_32.c and arch/x86/mm/dump_pagetables.c (both || pulling linux/highmem.h), in arch/x86/mm/mem_encrypt_identity.c || (64bit-only, pte_offset_map() doesn't use kmap_atomic() there) and || arch/x86/kernel/tboot.c (pulls linux/highmem.h via asm/pgalloc.h || and linux/pagemap.h) || || * xtensa: users in arch/xtensa/kernel/pci-dma.c, arch/xtensa/mm/highmem.c, || arch/xtensa/mm/cache.c and arch/xtensa/platforms/iss/simdisk.c (all pull || linux/highmem.h).