Received: by 10.192.165.156 with SMTP id m28csp749849imm; Fri, 13 Apr 2018 07:13:25 -0700 (PDT) X-Google-Smtp-Source: AIpwx4/0+SwYhSES3yIUZpqeqy9INqrMETLLb/w3s/LWDDJzzn233ayq4Vw/Kw7l0Okc2kZshfiy X-Received: by 2002:a17:902:144:: with SMTP id 62-v6mr5423287plb.202.1523628805400; Fri, 13 Apr 2018 07:13:25 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1523628805; cv=none; d=google.com; s=arc-20160816; b=cvPNV8h1wEOE3vvFp3IttPkU/8mSiu6obdCr2+r0EBiDBduUq7soiYh2kYT/kREHdv aeQvFbsYpheTZG9ldVlB339SQ41nGis1+tnmfLWYqaTvJ3caM0U6aMMHeoNwyjYTW7uf +guU7ssKj41MdXW2TBJWHlKVBl6mBbF0aDnaxSUG3dd1wZRfHSCfswlOTHle+pgi/1eR Gof4AIhDjko/VNshg+Oa+cctb8kyXKv9QbE+U/xn81XW6ldrRIsACGowHM4b+7wNxzmJ fcxB+LGSS+HUDhRtXztiQSPFNvGU97A7OLHf6pKhcIvAKdMRfVb/hdFEd7ektBYSqqbi laTg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:to:subject:arc-authentication-results; bh=2oqso7cdb0NxSECeDTee+fvGMlcUtzNaGov9LKpRVfg=; b=wpaZuAs4sEv+8kJl4qLzMVgmFZ8UWpivjSLA1NJ/gOVypOHXQ+OVOjsoDUN9v8UWsT NQrobi9tOBOTAoNizmx3NdboAEp8Qckfa3gyG+MkhakFroaLGAQmtYddea1j9fdiCzE8 9zxfFSPB5WrH0At+iUPSI7fHSv+I3RSD2KVUea37sn9ZPtVH3eUvq/w45T0udu1uvKbJ spqkPtjGZ21ppizW154BUIlydqZ1ikP1gG9z84qlp0vyL6G6f4lG4G/cei+Q0Dktxzrv 9K0Dnlsxibc/tozBmeubdEzGDC0WXXZZOQoailCQDEaFZyTU27axuJdB5egFM8TV/yoq 5NdA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k1si2796663pgo.255.2018.04.13.07.13.09; Fri, 13 Apr 2018 07:13:25 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754830AbeDMOKr (ORCPT + 99 others); Fri, 13 Apr 2018 10:10:47 -0400 Received: from mga11.intel.com ([192.55.52.93]:36295 "EHLO mga11.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754305AbeDMOKq (ORCPT ); Fri, 13 Apr 2018 10:10:46 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga006.fm.intel.com ([10.253.24.20]) by fmsmga102.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 13 Apr 2018 07:10:43 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.48,445,1517904000"; d="scan'208";a="220185419" Received: from haiyuewa-mobl1.ccr.corp.intel.com (HELO [10.255.31.195]) ([10.255.31.195]) by fmsmga006.fm.intel.com with ESMTP; 13 Apr 2018 07:10:43 -0700 Subject: Re: [PATCH ipmi/kcs_bmc v1] ipmi: kcs_bmc: optimize the data buffers allocation To: minyard@acm.org, openipmi-developer@lists.sourceforge.net, linux-kernel@vger.kernel.org References: <1521116452-4993-1-git-send-email-haiyue.wang@linux.intel.com> <1710edd4-277b-1d2e-5885-a070751ddd2a@linux.intel.com> <1483ac91-5f41-e299-6c74-3dd31aa95ad6@gmail.com> From: "Wang, Haiyue" Message-ID: <26f50e9b-acc0-0bb3-5376-cf78ddbab43f@linux.intel.com> Date: Fri, 13 Apr 2018 22:10:42 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 MIME-Version: 1.0 In-Reply-To: <1483ac91-5f41-e299-6c74-3dd31aa95ad6@gmail.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2018-04-13 21:50, Corey Minyard wrote: > On 04/07/2018 02:54 AM, Wang, Haiyue wrote: >> Hi Corey, >> >> Since IPMI 2.0 just defined minimum, no maximum: >> >> ---- >> >> KCS/SMIC Input : Required: 40 bytes IPMI Message, minimum >> >> KCS/SMIC Output : Required: 38 bytes IPMI Message, minimum >> > > Yes, though there are practical maximums that are much smaller than > 1000 bytes. > > >> ---- >> >> We can enlarge the block size for avoiding waste, and make our driver >> >> support most worst message size case. And I think this patch make >> checking >> >> simple (from 3 to 1), and the code clean, this is the biggest reason >> I want to >> >> change. The TLB is just memory management study from book, no data to >> >> support access improvement. :) > > I would argue that the way it is now expresses the intent of the code > better > than one allocation split into three parts.  Expressing your intent is > more > important than the number of checks and a minuscule performance > improvement.  For me it makes the code easier to understand.  If you had > a tool that checked for out-of-bounds memory access, then a single > allocation > might not find an overrun between the parts.  Smaller allocations tend > to result in less memory fragmentation. > When I wrote the commit, I felt that the message was not so professional, and the reason sounded weak. The driver development is a complex work, needs considering more things, not just one. Thanks for your patience. > My preference is to leave it as it is.  However, it's not that > important, and > if you really want this patch, I can include it. > So leave it as it is, abandon this patch. :-) BTW, another patch about KCS BMC chip support: https://lkml.org/lkml/2018/3/22/284 Look forward your reviewing, I've tried my best to make it better. > Thanks, > > -corey > >> >> BR, >> >> Haiyue >> >> >> On 2018-04-07 10:37, Wang, Haiyue wrote: >>> >>> >>> On 2018-04-07 05:47, Corey Minyard wrote: >>>> On 03/15/2018 07:20 AM, Haiyue Wang wrote: >>>>> Allocate a continuous memory block for the three KCS data buffers >>>>> with >>>>> related index assignment. >>>> >>>> I'm finally getting to this. >>>> >>>> Is there a reason you want to do this?  In general, it's better to >>>> not try to >>>> outsmart your base system.  Depending on the memory allocator, in this >>>> case, you might actually use more memory.  You probably won't use any >>>> less. >>>> >>> I got this idea from another code review, but that patch allocates >>> 30 more >>> the same size memory block, reducing the devm_kmalloc call will be >>> better. >>> For KCS only have 3, may be the key point is memory waste. >>> >>>> In the original case, you allocate three 1000 byte buffers, >>>> resulting in 3 >>>> 1024 byte slab allocated. >>>> >>>> In the changed case, you will allocate a 3000 byte buffer, >>>> resulting in >>>> a single 4096 byte slab allocation, wasting 1024 more bytes of memory. >>>> >>> As the kcs has memory copy between in/out/kbuffer, put them in the same >>> page will be better ? Such as the same TLB ? (Well, I just got this >>> from book, >>> no real experience of memory accessing performance. And also, I was >>> told >>> that using space to save the time. :-)). >>> >>> Just my stupid thinking. I'm OK to drop this patch if it doesn't >>> help with >>> performance, or something else. >>> >>> BR. >>> Haiyue >>> >>>> -corey >>>> >>>>> Signed-off-by: Haiyue Wang >>>>> --- >>>>>   drivers/char/ipmi/kcs_bmc.c | 10 ++++++---- >>>>>   1 file changed, 6 insertions(+), 4 deletions(-) >>>>> >>>>> diff --git a/drivers/char/ipmi/kcs_bmc.c >>>>> b/drivers/char/ipmi/kcs_bmc.c >>>>> index fbfc05e..dc19c0d 100644 >>>>> --- a/drivers/char/ipmi/kcs_bmc.c >>>>> +++ b/drivers/char/ipmi/kcs_bmc.c >>>>> @@ -435,6 +435,7 @@ static const struct file_operations >>>>> kcs_bmc_fops = { >>>>>   struct kcs_bmc *kcs_bmc_alloc(struct device *dev, int >>>>> sizeof_priv, u32 channel) >>>>>   { >>>>>       struct kcs_bmc *kcs_bmc; >>>>> +    void *buf; >>>>>         kcs_bmc = devm_kzalloc(dev, sizeof(*kcs_bmc) + >>>>> sizeof_priv, GFP_KERNEL); >>>>>       if (!kcs_bmc) >>>>> @@ -448,11 +449,12 @@ struct kcs_bmc *kcs_bmc_alloc(struct device >>>>> *dev, int sizeof_priv, u32 channel) >>>>>       mutex_init(&kcs_bmc->mutex); >>>>>       init_waitqueue_head(&kcs_bmc->queue); >>>>>   -    kcs_bmc->data_in = devm_kmalloc(dev, KCS_MSG_BUFSIZ, >>>>> GFP_KERNEL); >>>>> -    kcs_bmc->data_out = devm_kmalloc(dev, KCS_MSG_BUFSIZ, >>>>> GFP_KERNEL); >>>>> -    kcs_bmc->kbuffer = devm_kmalloc(dev, KCS_MSG_BUFSIZ, >>>>> GFP_KERNEL); >>>>> -    if (!kcs_bmc->data_in || !kcs_bmc->data_out || >>>>> !kcs_bmc->kbuffer) >>>>> +    buf = devm_kmalloc_array(dev, 3, KCS_MSG_BUFSIZ, GFP_KERNEL); >>>>> +    if (!buf) >>>>>           return NULL; >>>>> +    kcs_bmc->data_in  = buf; >>>>> +    kcs_bmc->data_out = buf + KCS_MSG_BUFSIZ; >>>>> +    kcs_bmc->kbuffer  = buf + KCS_MSG_BUFSIZ * 2; >>>>>         kcs_bmc->miscdev.minor = MISC_DYNAMIC_MINOR; >>>>>       kcs_bmc->miscdev.name = dev_name(dev); >>>> >>>> >>> >> >