Received: by 2002:ac0:a5a6:0:0:0:0:0 with SMTP id m35-v6csp1375469imm; Thu, 6 Sep 2018 22:16:50 -0700 (PDT) X-Google-Smtp-Source: ANB0VdYlxXJh0aWtyu+whUTdxda6DUBotuGLC5jtadcc7tnbdUUBx++NWCoc7t5eCZXhR7X++FhC X-Received: by 2002:a62:fc5:: with SMTP id 66-v6mr6509768pfp.237.1536297410002; Thu, 06 Sep 2018 22:16:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1536297409; cv=none; d=google.com; s=arc-20160816; b=mFhu+FDBW56KeXzCE1+Fgcl8CrOZRy3XcMZl7UWC2WQfvvRqKvhdsJ25dy14j7+6k7 Zu4S7zYJqBR3LLOEwFtGu7nhItNZAIRD2OsCI+mROdu+UYTv3Xc225aaDf+cq0zSOEr6 eqyF9uJ8NUg3Os5u4XOpamNMfFfmPY/3RzxY92wl0mnj9C9GQ7v2PPGb6EqQrYXIOwXu 2NOOVG1BapOfHCLNSvgJctVv2wH8EBPrm30bxwfZAdfWq39X79or9LMQ45fhKY6Ip2nv 1l++cKrxhLIC+m1Dhm/lRhsMAKaWBBj9ReqRg+4V7MiwMoVW/OwGqOf1JrkWA+Gu2hyw Ta2w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=KSQzEQklDpwdrrplGtdNrky8LsMBIkRCrsiHS4s6KjY=; b=gDskFFCJcfHP5hzQAMz4UrZl9oCEyplHVMUsMbqH64Ce8GUrVW8fSJ6vXhJM0TLacH VJZSP0dTfryyCOXbUB/PF03QuRunF6N7cwzD5MOEuezzKw+Iq4gGbX9szidoWoqbvL+3 TlfMbMbmqWrECAvFI63gwSWM6SMhNR2CHA5/yNhUY6qo+26czh26Og02684wmb9D6ZTB 8LjfbIaiZgGvqGLrne+jqikhxyMFY4C4We46lFES/7tfDeFO7tdfbZALZbiyPYXV8cHA gVw/P66GUwXE6iBAv2yMT3xPWtagsc9y9ABpFNd/4WLCf3LssLtBO0j7KPzd1f4VFmI7 vgxA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g23-v6si6602333pgh.308.2018.09.06.22.16.34; Thu, 06 Sep 2018 22:16:49 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727589AbeIGJrZ (ORCPT + 99 others); Fri, 7 Sep 2018 05:47:25 -0400 Received: from smtp2200-217.mail.aliyun.com ([121.197.200.217]:46629 "EHLO smtp2200-217.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725805AbeIGJrZ (ORCPT ); Fri, 7 Sep 2018 05:47:25 -0400 X-Alimail-AntiSpam: AC=CONTINUE;BC=0.07528035|-1;CH=green;FP=0|0|0|0|0|-1|-1|-1;HT=e02c03310;MF=ren_guo@c-sky.com;NM=1;PH=DS;RN=11;RT=11;SR=0;TI=SMTPD_---.CnYzP-y_1536296883; Received: from localhost(mailfrom:ren_guo@c-sky.com fp:SMTPD_---.CnYzP-y_1536296883) by smtp.aliyun-inc.com(10.147.41.138); Fri, 07 Sep 2018 13:08:03 +0800 Date: Fri, 7 Sep 2018 13:08:02 +0800 From: Guo Ren To: Arnd Bergmann Cc: linux-arch , Linux Kernel Mailing List , Thomas Gleixner , Daniel Lezcano , Jason Cooper , c-sky_gcc_upstream@c-sky.com, gnu-csky@mentor.com, Thomas Petazzoni , wbx@uclibc-ng.org, Greentime Hu Subject: Re: [PATCH V3 13/26] csky: Library functions Message-ID: <20180907050801.GA13356@guoren-Inspiron-7460> References: <37f9bd824ede529fdab291a40eef3415f99ec8aa.1536138304.git.ren_guo@c-sky.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Sep 06, 2018 at 04:24:59PM +0200, Arnd Bergmann wrote: > On Wed, Sep 5, 2018 at 2:08 PM Guo Ren wrote: > > > --- /dev/null > > +++ b/arch/csky/abiv1/memset.c > > @@ -0,0 +1,38 @@ > > +// SPDX-License-Identifier: GPL-2.0 > > +// Copyright (C) 2018 Hangzhou C-SKY Microsystems co.,ltd. > > +#include > > + > > +void *memset(void *dest, int c, size_t l) > > +{ > > + char *d = dest; > > + int ch = c; > > + int tmp; > > + > > + if ((long)d & 0x3) > > + while (l--) *d++ = ch; > > + else { > > + ch &= 0xff; > > + tmp = (ch | ch << 8 | ch << 16 | ch << 24); > > + > > + while (l >= 16) { > > + *(((long *)d)) = tmp; > > + *(((long *)d)+1) = tmp; > > + *(((long *)d)+2) = tmp; > > + *(((long *)d)+3) = tmp; > > + l -= 16; > > + d += 16; > > + } > > + > > + while (l > 3) { > > + *(((long *)d)) = tmp; > > + d = d + 4; > > + l -= 4; > > + } > > + > > + while (l) { > > + *d++ = ch; > > + l--; > > + } > > + } > > + return dest; > > +} > > I see that we have a trivial memset() implementation in lib/string.c, but yours > seems to be better optimized. Where did you get it from? We write it for our ck610 to improve the performance, but I think a lot of other arch done it in asm style. > Is this a version > that works particularly well on C-Sky, or is this a generic optimized memset > that others could use as well? We only test it on C-SKY, but I think it will also work better on other arch CPU than current lib/string.c memset implement. I see that in lib/string.c: void *memset(void *s, int c, size_t count) { char *xs = s; while (count--) *xs++ = c; return s; } The most problem is "char *xs;" and it will cause "st.b" in asm. "st.b" is very slow. Our key improvement is: > > + *(((long *)d)) = tmp; > > + *(((long *)d)+1) = tmp; > > + *(((long *)d)+2) = tmp; > > + *(((long *)d)+3) = tmp; It will cause SOC AXI burst transfer. > In the latter case, we could add it to > lib/string.c and let architectures select it in place of the triivial version. Good idea. Guo Ren