Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1760279Ab1FXCCg (ORCPT ); Thu, 23 Jun 2011 22:02:36 -0400 Received: from mga09.intel.com ([134.134.136.24]:42227 "EHLO mga09.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1760159Ab1FXCCc convert rfc822-to-8bit (ORCPT ); Thu, 23 Jun 2011 22:02:32 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.65,417,1304319600"; d="scan'208";a="19080053" From: "Ma, Ling" To: Ingo Molnar , Andi Kleen CC: "hpa@zytor.com" , "tglx@linutronix.de" , "linux-kernel@vger.kernel.org" Date: Fri, 24 Jun 2011 10:01:28 +0800 Subject: RE: [PATCH RFC] [x86] Optimize copy-page by reducing impact from HW prefetch Thread-Topic: [PATCH RFC] [x86] Optimize copy-page by reducing impact from HW prefetch Thread-Index: Acwxc9zXuTo3YQvYQpKzil1gi1OinwAnn06g Message-ID: References: <1308351117-32452-1-git-send-email-ling.ma@intel.com> <20110623070448.GA25707@elte.hu> In-Reply-To: <20110623070448.GA25707@elte.hu> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 8BIT MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2287 Lines: 65 Sure, I separate two patches ASAP, one is for performance tuning code after some experiments, another code style patch. Thanks Ling > -----Original Message----- > From: Ingo Molnar [mailto:mingo@elte.hu] > Sent: Thursday, June 23, 2011 3:05 PM > To: Andi Kleen > Cc: Ma, Ling; hpa@zytor.com; tglx@linutronix.de; linux- > kernel@vger.kernel.org > Subject: Re: [PATCH RFC] [x86] Optimize copy-page by reducing impact > from HW prefetch > > > * Andi Kleen wrote: > > > ling.ma@intel.com writes: > > > > > impact(DCU prefetcher), and simplify original code. The > > > performance is improved about 15% on core2, 36% on snb > > > respectively. (We use our micro-benchmark, and will do further > > > test according to your requirment) > > > > This doesn't make a lot of sense because neither Core-2 nor SNB use > > the code path you patched. They all use the rep ; movs path > > Ling, mind double checking which one is the faster/better one on SNB, > in cold-cache and hot-cache situations, copy_page or copy_page_c? > > Also, while looking at this file please fix the countless pieces of > style excrements it has before modifying it: > > - non-Linux comment style (and needless two comments - it can > be in one comment block): > > /* Don't use streaming store because it's better when the target > ends up in cache. */ > > /* Could vary the prefetch distance based on SMP/UP */ > > - (there's other non-standard comment blocks in this file as well) > > - The copy_page/copy_page_c naming is needlessly obfuscated, it > should be copy_page, copy_page_norep or so - the _c postfix has no > obvious meaning. > > - all #include's should be at the top > > - please standardize it on the 'instrn %x, %y' pattern that we > generally use in arch/x86/, not 'instrn %x,%y' pattern. > > and do this cleanup patch first and the speedup on top of it, and > keep the two in two separate patches so that the modification to the > assembly code can be reviewed more easily. > > Thanks, > > Ingo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/