Date: Wed, 5 Dec 2018 14:41:30 +0000
From: Will Deacon
To: Nicolas Boichat
Cc: Vlastimil Babka, Robin Murphy, Christoph Lameter, Michal Hocko,
 Matthias Brugger, hch@infradead.org, Matthew Wilcox, Joerg Roedel,
 Pekka Enberg, David Rientjes, Joonsoo Kim, Andrew Morton, Mel Gorman,
 Levin Alexander, Huaisheng Ye, Mike Rapoport, linux-arm Mailing List,
 iommu@lists.linux-foundation.org, lkml, linux-mm@kvack.org, Yong Wu,
 Tomasz Figa, yingjoe.chen@mediatek.com, Hsin-Yi Wang, Daniel Kurtz
Subject: Re: [PATCH v2 0/3] iommu/io-pgtable-arm-v7s: Use DMA32 zone for page tables
Message-ID:
 <20181205144130.GA16121@arm.com>
References: <20181111090341.120786-1-drinkcat@chromium.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Dec 05, 2018 at 10:04:00AM +0800, Nicolas Boichat wrote:
> On Tue, Dec 4, 2018 at 10:35 PM Vlastimil Babka wrote:
> >
> > On 12/4/18 10:37 AM, Nicolas Boichat wrote:
> > > On Sun, Nov 11, 2018 at 5:04 PM Nicolas Boichat wrote:
> > >>
> > >> This is a follow-up to the discussion in [1], to make sure that the page
> > >> tables allocated by iommu/io-pgtable-arm-v7s are contained within 32-bit
> > >> physical address space.
> > >>
> > >> [1] https://lists.linuxfoundation.org/pipermail/iommu/2018-November/030876.html
> > >
> > > Hi everyone,
> > >
> > > Let's try to summarize here.
> > >
> > > First, we confirmed that this is a regression, and IOMMU errors happen
> > > on 4.19 and linux-next/master on MT8173 (elm, Acer Chromebook R13).
> > > The issue most likely starts from ad67f5a6545f ("arm64: replace
> > > ZONE_DMA with ZONE_DMA32"), i.e. 4.15, and presumably breaks a number
> > > of Mediatek platforms (and maybe others?).
> > >
> > > We have a few options here:
> > > 1. This series [2], that adds support for GFP_DMA32 slab caches,
> > > _without_ adding kmalloc caches (since there are no users of
> > > kmalloc(..., GFP_DMA32)). I think I've addressed all the comments on
> > > the 3 patches, and AFAICT this solution works fine.
> > > 2. genalloc. That works, but unless we preallocate 4MB for L2 tables
> > > (which is wasteful as we usually only need a handful of L2 tables),
> > > we'll need changes in the core (use GFP_ATOMIC) to allow allocating on
> > > demand, and as it stands we'd have no way to shrink the allocation.
> > > 3. page_frag [3]. That works fine, and the code is quite simple.
> > > One drawback is that fragments in partially freed pages cannot be reused
> > > (from limited experiments, I see that IOMMU L2 tables are rarely
> > > freed, so it's unlikely a whole page would get freed). But given the
> > > low number of L2 tables, maybe we can live with that.
> > >
> > > I think 2 is out. Any preference between 1 and 3? I think 1 makes
> > > better use of the memory, so that'd be my preference. But I'm probably
> > > missing something.
> >
> > I would prefer 1 as well. IIRC you already confirmed that alignment
> > requirements are not broken for custom kmem caches even in presence of
> > SLUB debug options (and I would say it's a bug to be fixed if they
> > weren't).
>
> > I just asked (and didn't get a reply I think) about your
> > ability to handle the GFP_ATOMIC allocation failures. They should be
> > rare when only single page allocations are needed for the kmem cache.
> > But in case they are not an option, then preallocating would be needed,
> > thus probably option 2.
>
> Oh, sorry, I missed your question.
>
> I don't have a full answer, but:
> - The allocations themselves are rare (I count a few 10s of L2 tables
> at most on my system, I assume we rarely have >100), and yes, we only
> need a single page, so the failures should be exceptional.
> - My change is probably not making anything worse: I assume that even
> with the current approach using GFP_DMA slab caches on older kernels,
> failures could potentially happen. I don't think we've seen those. If
> we are really concerned about this, maybe we'd need to modify
> mtk_iommu_map to not hold a spinlock (if that's possible), so we don't
> need to use GFP_ATOMIC. I suggest we just keep an eye on such issues,
> and address them if they show up (we can even revisit genalloc at that
> stage).

I think the spinlock is the least of our worries: the map/unmap routines can
be called in irq context and may need to allocate second-level tables.

Will
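
For scale, the 4MB preallocation figure quoted under option 2 can be
reproduced with a little arithmetic. The sketch below is illustrative only
and is not taken from the patches: it assumes the usual ARMv7 short-descriptor
geometry (4096 first-level entries, each able to point at a second-level
table of 256 four-byte descriptors), which the thread itself does not spell
out. The constant and function names are made up for the example.

```python
# Assumed ARMv7 short-descriptor geometry (an assumption, not stated in the thread).
L1_ENTRIES = 4096   # one first-level entry per 1 MiB of a 32-bit address space
L2_ENTRIES = 256    # one second-level entry per 4 KiB small page
PTE_SIZE = 4        # bytes per descriptor

# Size of a single second-level (L2) table in bytes.
L2_TABLE_SIZE = L2_ENTRIES * PTE_SIZE


def preallocated_bytes():
    """Memory needed if every possible L2 table is allocated up front."""
    return L1_ENTRIES * L2_TABLE_SIZE


def on_demand_bytes(tables_in_use):
    """Memory needed if only the L2 tables actually in use are allocated."""
    return tables_in_use * L2_TABLE_SIZE


if __name__ == "__main__":
    print("L2 table size:", L2_TABLE_SIZE, "bytes")
    print("preallocate all L2 tables:", preallocated_bytes() // (1024 * 1024), "MiB")
    # 100 tables is the rough upper bound mentioned in the quoted reply.
    print("100 tables on demand:", on_demand_bytes(100) // 1024, "KiB")
```

Under those assumptions, a system with at most around 100 L2 tables needs
on the order of 100 KiB, which is why preallocating the full set is described
as wasteful in the summary.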