Received: by 2002:a05:6a10:6744:0:0:0:0 with SMTP id w4csp3015304pxu; Sat, 10 Oct 2020 16:10:55 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwQYYpmP1mzarFIAt/KykNS2NArHlneqqpk+nLXWPYIelhSDtZaCw/bu8nt3qo7VoWm56tx X-Received: by 2002:a17:906:e15:: with SMTP id l21mr21952970eji.509.1602371455712; Sat, 10 Oct 2020 16:10:55 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1602371455; cv=none; d=google.com; s=arc-20160816; b=p6AGecucA5s7iWzZFutle5AjK+Bl49RsSknwfypazO5SzvmQPABLByuOij7loLydIS po6seUizfMjt6DGQN+j9iKydv/5MMXrq+6LUqGAXgSsKltSzsmqTQCggPSSJIkXasEn4 p9ODpzsI89o+xPy70TZQY2cbA6VdBYia/5ACUU7txqungiQWdV5rKFY2J3/RREuNw4cx 6Rh+b2qaPRE1QSshzjg+dnIEKX44NMwiHpVvtauqCPezXKL/uuBhHq/7+BPYNpWKLstt wi4JDo0LJ4vhdWNqUUNBmQdse6jU069Plmf6JG39xk0HGm6iQZFs6cUJraDDUujkn9xA AtYw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=ONkhJu12mH9rfnLgLGFARK/yH2P90gYRIjMUSqrTwk8=; b=C/l7KI6Qcm5iWDaRRT+IV+ea62pm5kofw4qWQcjmIS7LNdPFDTyPfSwrbqDsr0dwiK 83Wv0SAoVEurASRx7JQDp+3c+KkCxc8sWE2itN71hPMG/wbVmpRWRUD0G+qFWQ5Nru8g Wx9b38IOVnJNUZvtNnQpBlpxFkjLLsNGxUNTeGW+oX1iBDQM4uf0rD9Acv/Ll9ir47gc GtHHIABS9IZ4JbcQHPMRBgjmgx1EYtGHj4xQRYB47TriFYGmT5su4DnSxANUhzo2vkeL ohH5c2dJHQ+RsYOFowimz1rxQ0hPmKhxS7K0XyjCP4MMgUOcZTLoNQ74qACehaMJNvPm HHkg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id l20si6829238edv.312.2020.10.10.16.10.32; Sat, 10 Oct 2020 16:10:55 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2390781AbgJJW7W (ORCPT + 99 others); Sat, 10 Oct 2020 18:59:22 -0400 Received: from mail.kernel.org ([198.145.29.99]:57066 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1732633AbgJJTyj (ORCPT ); Sat, 10 Oct 2020 15:54:39 -0400 Received: from gaia (unknown [95.149.105.49]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id B1DFA21655; Sat, 10 Oct 2020 12:38:57 +0000 (UTC) Date: Sat, 10 Oct 2020 13:38:55 +0100 From: Catalin Marinas To: Nicolas Saenz Julienne Cc: Ard Biesheuvel , Lorenzo Pieralisi , "open list:OPEN FIRMWARE AND FLATTENED DEVICE TREE BINDINGS" , Will Deacon , Linux Kernel Mailing List , Linux Memory Management List , iommu@lists.linux-foundation.org, Rob Herring , linux-rpi-kernel@lists.infradead.org, Frank Rowand , Christoph Hellwig , Linux ARM , Robin Murphy Subject: Re: [PATCH 1/4] of/fdt: Update zone_dma_bits when running in bcm2711 Message-ID: <20201010123854.GA27186@gaia> References: <12f33d487eabd626db4c07ded5a1447795eed355.camel@suse.de> <20201009071013.GA12208@lst.de> <513833810c15b5efeab7c3cbae1963a78c71a79f.camel@suse.de> <20201009152433.GA19953@e121166-lin.cambridge.arm.com> <20201009171051.GL23638@gaia> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, Oct 10, 2020 at 12:53:19PM +0200, Nicolas Saenz Julienne wrote: > On Sat, 2020-10-10 at 12:36 +0200, Ard Biesheuvel wrote: > > On Fri, 9 Oct 2020 at 19:10, Catalin Marinas wrote: > > > On Fri, Oct 09, 2020 at 06:23:06PM +0200, Ard Biesheuvel wrote: > > > > On Fri, 9 Oct 2020 at 17:24, Lorenzo Pieralisi > > > > wrote: > > > > > We can move this check to IORT code and call it from arm64 if it > > > > > can be made to work. > > > > > > > > Finding the smallest value in the IORT, and assigning it to > > > > zone_dma_bits if it is < 32 should be easy. But as I understand it, > > > > having these separate DMA and DMA32 zones is what breaks kdump, no? So > > > > how is this going to fix the underlying issue? > > > > > > If zone_dma_bits is 32, ZONE_DMA32 disappears into ZONE_DMA (GFP_DMA32 > > > allocations fall back to ZONE_DMA). > > > > > > kdump wants DMA-able memory and, without a 30-bit ZONE_DMA, that would > > > be the bottom 32-bit. With the introduction of ZONE_DMA, this suddenly > > > became 1GB. We could change kdump to allocate ZONE_DMA32 but this one > > > may also be small as it lost 1GB to ZONE_DMA. However, the kdump kernel > > > would need to be rebuilt without ZONE_DMA since it won't have any. IIRC > > > (it's been a while since I looked), the kdump allocation couldn't span > > > multiple zones. > > > > > > In a separate thread, we try to fix kdump to use allocations above 4G as > > > a fallback but this only fixes platforms with enough RAM (and maybe it's > > > only those platforms that care about kdump). > > > > > > > One thing that strikes me as odd is that we are applying the same > > shifting logic to ZONE_DMA as we are applying to ZONE_DMA32, i.e., if > > DRAM starts outside of the zone, it is shifted upwards. > > > > On a typical ARM box, this gives me > > > > [ 0.000000] Zone ranges: > > [ 0.000000] DMA [mem 0x0000000080000000-0x00000000bfffffff] > > [ 0.000000] DMA32 [mem 0x00000000c0000000-0x00000000ffffffff] > > [ 0.000000] Normal [mem 0x0000000100000000-0x0000000fffffffff] > > > > i.e., the 30-bit addressable range has bit 31 set, which is weird. > > Yes I agree it's weird, and IMO plain useless. I sent a series this summer to > address this[1], which ultimately triggered the discussion we're having right > now. I don't mind assuming that ZONE_DMA is always from pfn 0 (i.e. no dma_offset for such constrained devices). But if ZONE_DMA32 is squeezed out with ZONE_DMA extended to 4GB, it should allow non-zero upper 32 bits. IIRC we do have SoCs with RAM starting above 4GB. However, your patch didn't completely solve the problem for non-RPi4 platforms as there's hardware with RAM starting at 0 that doesn't need the 1GB ZONE_DMA. We may end up with a combination of the two approaches. > Although with with your latest patch and the DT counterpart, we should be OK. > It would be weird for a HW description to define DMA constraints that are > impossible to reach on that system. I don't remember the difficulties with parsing a DT early for inferring the ZONE_DMA requirements. Could we not check the dma-ranges property in the soc node? I can see bcm2711.dtsi defines a 30-bit address range. We are not interested in the absolute physical/bus addresses, just the size to check whether it's smaller than 32-bit. > > I wonder if it wouldn't be better (and less problematic in the general > > case) to drop this logic for ZONE_DMA, and simply let it remain empty > > unless there is really some memory there. > > From my experience, you can't have empty ZONE_DMA when enabled. Empty > ZONE_DMA32 is OK though. Although I'm sure it's something that can be changed. Indeed, because we still have GFP_DMA around which can't fall back to ZONE_DMA32 (well, unless CONFIG_ZONE_DMA is disabled). -- Catalin