Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754763AbYGDUh4 (ORCPT ); Fri, 4 Jul 2008 16:37:56 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752196AbYGDUhq (ORCPT ); Fri, 4 Jul 2008 16:37:46 -0400 Received: from casper.infradead.org ([85.118.1.10]:46203 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752113AbYGDUhp (ORCPT ); Fri, 4 Jul 2008 16:37:45 -0400 Date: Fri, 4 Jul 2008 13:37:33 -0700 From: Arjan van de Ven To: Pierre Ossman Cc: LKML , linux-mm@kvack.org Subject: Re: How to alloc highmem page below 4GB on i386? Message-ID: <20080704133733.278b6458@infradead.org> In-Reply-To: <20080704222323.68afbe88@mjolnir.drzeus.cx> References: <20080630200323.2a5992cd@mjolnir.drzeus.cx> <20080704195800.4ef6e00a@mjolnir.drzeus.cx> <20080704111224.68266afc@infradead.org> <20080704222323.68afbe88@mjolnir.drzeus.cx> Organization: Intel X-Mailer: Claws Mail 3.3.1 (GTK+ 2.12.10; i386-redhat-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-SRS-Rewrite: SMTP reverse-path rewritten from by casper.infradead.org See http://www.infradead.org/rpr.html Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3325 Lines: 79 On Fri, 4 Jul 2008 22:23:23 +0200 Pierre Ossman wrote: > On Fri, 4 Jul 2008 11:12:24 -0700 > Arjan van de Ven wrote: > > > On Fri, 4 Jul 2008 19:58:00 +0200 > > Pierre Ossman wrote: > > > > > On Mon, 30 Jun 2008 20:03:23 +0200 > > > Pierre Ossman wrote: > > > > > > > Simple question. How do I allocate a page from highmem, that's > > > > still within 32 bits? x86_64 has the DMA32 zone, but i386 has > > > > just HIGHMEM. As most devices can't DMA above 32 bit, I have 3 > > > > GB of memory that's not getting decent usage (or results in > > > > needless bouncing). What to do? > > > > > > > > I tried just enabling CONFIG_DMA32 for i386, but there is some > > > > guard against too many memory zones. I'm assuming this is there > > > > for a good reason? > > > > > > > > > > Anyone? > > > > > > > well... the assumption sort of is that all high-perf devices are 64 > > bit capable. For the rest... well you get what you get. There's > > IOMMU's in modern systems from Intel (and soon AMD) that help you > > avoid the bounce if you really care. > > I was under the impression that the PCI bus was utterly incapable of > any larger address than 32 bits? But perhaps you only consider PCIE > stuff high-perf. :) actually your impression is not correct. There's a difference between how many physical bits the bus has, and the logical data. Specifically, PCI (and PCIE etc) have something that's called "Dual Address Cycle", which is a pci bus transaction that sends the 64 bit address using 2 cycles on the bus even if the buswidth is 32 bit (logically). > > The second assumption sort of is that you don't have 'too much' > > above 4Gb; once you're over 16Gb or so people assume you will run > > the 64 bit kernel instead... > > Unfortunately some proprietary crud keeps migration somewhat annoying. > And in my case it's a 4 GB system, where 1 GB gets mapped up to make > room for devices, so it's not that uncommon. 4Gb systems are entirely reasonably still with 32 bit kernels (I'm typing on one right now ;-); it gets problematic in the 12-16Gb range. > > The strange thing is that I keep getting pages from > 4GB all the > time, even on a loaded system. I would have expected mostly getting > pages below that limit as that's where most of the memory is. Do you > have any insight into which areas tend to fill up first? ok this is tricky and goes way deep into buddy allocator internals. On the highest level (2Mb chunks iirc, but it could be a bit or two bigger now) we allocate top down. But once we split such a top level chunk up, inside the chunk we allocate bottom up (so that the scatter gather IOs tend to group nicer). In addition, the kernel will prefer allocating userspace/pagecache memory from highmem over lowmem, out of an effort to keep memory pressure in the lowmem zones lower. -- If you want to reach me at my work email, use arjan@linux.intel.com For development, discussion and tips for power savings, visit http://www.lesswatts.org -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/