Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757547Ab2FOTkn (ORCPT ); Fri, 15 Jun 2012 15:40:43 -0400 Received: from acsinet15.oracle.com ([141.146.126.227]:49396 "EHLO acsinet15.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1757512Ab2FOTkm convert rfc822-to-8bit (ORCPT ); Fri, 15 Jun 2012 15:40:42 -0400 MIME-Version: 1.0 Message-ID: <9c6c8ae0-0212-402d-a906-0d0c61e5e058@default> Date: Fri, 15 Jun 2012 12:39:13 -0700 (PDT) From: Dan Magenheimer To: Seth Jennings Cc: Nitin Gupta , Peter Zijlstra , Minchan Kim , Greg Kroah-Hartman , linux-kernel@vger.kernel.org, linux-mm@kvack.org, Thomas Gleixner , Ingo Molnar , Tejun Heo , David Howells , x86@kernel.org, Nick Piggin , Konrad Rzeszutek Wilk Subject: RE: [PATCH v2 3/3] x86: Support local_flush_tlb_kernel_range References: <1337133919-4182-1-git-send-email-minchan@kernel.org> <1337133919-4182-3-git-send-email-minchan@kernel.org> <4FB4B29C.4010908@kernel.org> <1337266310.4281.30.camel@twins> <4FDB5107.3000308@linux.vnet.ibm.com> <7e925563-082b-468f-a7d8-829e819eeac0@default> <4FDB66B7.2010803@vflare.org> <10ea9d19-bd24-400c-8131-49f0b4e9e5ae@default> <4FDB8808.9010508@linux.vnet.ibm.com> In-Reply-To: <4FDB8808.9010508@linux.vnet.ibm.com> X-Priority: 3 X-Mailer: Oracle Beehive Extensions for Outlook 2.0.1.6 (510070) [OL 12.0.6607.1000 (x86)] Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 8BIT X-Source-IP: ucsinet22.oracle.com [156.151.31.94] Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2869 Lines: 60 > From: Seth Jennings [mailto:sjenning@linux.vnet.ibm.com] > > The compression code already compresses to a per-cpu page-pair > > already and then that "zpage" is copied into the space allocated > > for it by zsmalloc. For that final copy, if the copy code knows > > the target may cross a page boundary, has both target pages > > kmap'ed, and is smart about doing the copy, the "pair mapping" > > can be avoided for compression. > > The problem is that by "smart" you mean "has access to zsmalloc > internals". zcache, or any user, would need the know the kmapped > address of the first page, the offset to start at within that page, and > the kmapped address of the second page in order to do the smart copy > you're talking about. Then the complexity to do the smart copy that > would have to be implemented in each user. Or simply add a zsmalloc_copy in zsmalloc and require that it be used by the caller (instead of a memcpy). > > The decompression path calls lzo1x directly and it would be > > a huge pain to make lzo1x smart about page boundaries. BUT > > since we know that the decompressed result will always fit > > into a page (actually exactly a page), you COULD do an extra > > copy to the end of the target page (using the same smart- > > about-page-boundaries copying code from above) and then do > > in-place decompression, knowing that the decompression will > > not cross a page boundary. So, with the extra copy, the "pair > > mapping" can be avoided for decompression as well. > > This is an interesting thought. > > But this does result in a copy in the decompression (i.e. page fault) > path, where right now, it is copy free. The compressed data is > decompressed directly from its zsmalloc allocation to the page allocated > in the fault path. The page fault occurs as soon as the lzo1x compression code starts anyway, as do all the cache faults... both just occur earlier, so the only additional cost is the actual cpu instructions to move the sequence of (compressed) bytes from the zsmalloc-allocated area to the end of the target page. TLB operations can be very expensive, not to mention (as the subject of this thread attests) non-portable. > Doing this smart copy stuff would move most of the complexity out of > zsmalloc into the user which defeats the purpose of abstracting the > functionality out in the first place: so the each user that wants to do > something like this doesn't have to reinvent the wheel. See above. It can be buried in zsmalloc. Again, this is all just a suggestion, admittedly from someone who doesn't like pure abstractions in kernel code ;-) Dan -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/