Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp1580099imm; Wed, 1 Aug 2018 19:39:47 -0700 (PDT) X-Google-Smtp-Source: AAOMgpfZjLXq3SXUGX13ccW0U8Et5djJHdbP1iFZYlvXhkdnJ4R+zA0Q1S2OSeSQNTBzqjdgEH3v X-Received: by 2002:a63:b256:: with SMTP id t22-v6mr856130pgo.101.1533177587477; Wed, 01 Aug 2018 19:39:47 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533177587; cv=none; d=google.com; s=arc-20160816; b=CPdTVyIu9k8E4ZI01yl99uv5pM3gq3AB81HjQusGB1Xo5/vuFoUnuZr11MBFd67UiM QbjfP9AYzBXXXsuZtZhVEZgomiZ1YWF6NjLOl0yvYmEdwk4Za+KBmE2oL28S+0rI/apH r2NQNPgccWKfjk7OmBDs85TvHQDQyAmSLlc4FlUT0qbmfWy7M9KSko0qpWx1oFT3fo97 vigTpzB58zHnbNaB5uopJltlw6R8vjwcB+DXv6wBQ+rAa/W/mTshfO28TeA2znIPOPk/ LTLZ2ztzIgh+Gg7YKqXMOSeCe3u3oyLo4vfNGVZLRKqXApbptDkd2YmVls++dSqpshb3 i3Og== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from:arc-authentication-results; bh=+XnoUOImRUvuQC+/pE7cqkrAYVlmkZnyE6iZL8CW5vg=; b=FmLGrWbZKGSul69xsPn/BOtERX0IqGPGDflSeBHMiOV0mk0zkY2bbE+kg4058AOs0z Ko1V2+0/pkaZCzilLxaDUAPPhgZI3cFJZBhSBqtpply4FE2UyDA52oiSsTT5tR2ar8o1 qqBYCflcD9OcHFuT1gdeKDpTfrOixr265SkTYzT3a1EhFif0/kJeHCnnmXue2bQ5m/4I F3a9f0E0lk4mfrqI991x81Tdg+HHY81TfoHEh0ZwGTS+fO9b4AbcmXRVJeHHNrcRXDNz 73iQIz5kkIfXhDGcAs8LV5fQfn4RYWEvzYSqelcn6K+lMnA+XpdnPBO4zsc0f+1920Wt GFHw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id p61-v6si512199plb.55.2018.08.01.19.39.33; Wed, 01 Aug 2018 19:39:47 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726268AbeHBE0w (ORCPT + 99 others); Thu, 2 Aug 2018 00:26:52 -0400 Received: from nautica.notk.org ([91.121.71.147]:35053 "EHLO nautica.notk.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726045AbeHBE0w (ORCPT ); Thu, 2 Aug 2018 00:26:52 -0400 Received: by nautica.notk.org (Postfix, from userid 1001) id 71AA5C01A; Thu, 2 Aug 2018 04:37:57 +0200 (CEST) From: Dominique Martinet To: v9fs-developer@lists.sourceforge.net Cc: linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Dominique Martinet , Matthew Wilcox , Greg Kurz , Jun Piao Subject: [PATCH v2 2/2] net/9p: add a per-client fcall kmem_cache Date: Thu, 2 Aug 2018 04:37:32 +0200 Message-Id: <1533177452-2165-2-git-send-email-asmadeus@codewreck.org> X-Mailer: git-send-email 1.7.10.4 In-Reply-To: <1533177452-2165-1-git-send-email-asmadeus@codewreck.org> References: <1532943263-24378-1-git-send-email-asmadeus@codewreck.org> <1533177452-2165-1-git-send-email-asmadeus@codewreck.org> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Dominique Martinet Having a specific cache for the fcall allocations helps speed up allocations a bit, especially in case of non-"round" msizes. The caches will automatically be merged if there are multiple caches of items with the same size so we do not need to try to share a cache between different clients of the same size. Since the msize is negotiated with the server, only allocate the cache after that negotiation has happened - previous allocations or allocations of different sizes (e.g. zero-copy fcall) are made with kmalloc directly. Signed-off-by: Dominique Martinet Cc: Matthew Wilcox Cc: Greg Kurz Cc: Jun Piao --- v2: - Add a pointer to the cache in p9_fcall to make sure a buffer allocated with kmalloc gets freed with kfree and vice-versa This could have been smaller with a bool but this spares having to look at the client so looked a bit cleaner, I'm expecting this patch will need a v3 one way or another so I went for the bolder approach - please say if you think a smaller item is better ; I *think* nothing relies on this being ordered the same way as the data on the wire (struct isn't packed anyway) so we can move id after tag and add another u8 to not have any overhead - added likely() to cache existence check in allocation, but nothing for msize check or free because of zc request being of different size include/net/9p/9p.h | 1 + include/net/9p/client.h | 2 ++ net/9p/client.c | 34 ++++++++++++++++++++++++++++------ net/9p/trans_rdma.c | 2 +- 4 files changed, 32 insertions(+), 7 deletions(-) diff --git a/include/net/9p/9p.h b/include/net/9p/9p.h index e23896116d9a..f1d2ed3cee61 100644 --- a/include/net/9p/9p.h +++ b/include/net/9p/9p.h @@ -558,6 +558,7 @@ struct p9_fcall { size_t offset; size_t capacity; + struct kmem_cache *cache; u8 *sdata; }; diff --git a/include/net/9p/client.h b/include/net/9p/client.h index 4b4ac1362ad5..735f3979d559 100644 --- a/include/net/9p/client.h +++ b/include/net/9p/client.h @@ -123,6 +123,7 @@ struct p9_client { struct p9_trans_module *trans_mod; enum p9_trans_status status; void *trans; + struct kmem_cache *fcall_cache; union { struct { @@ -230,6 +231,7 @@ int p9_client_mkdir_dotl(struct p9_fid *fid, const char *name, int mode, kgid_t gid, struct p9_qid *); int p9_client_lock_dotl(struct p9_fid *fid, struct p9_flock *flock, u8 *status); int p9_client_getlock_dotl(struct p9_fid *fid, struct p9_getlock *fl); +void p9_fcall_fini(struct p9_fcall *fc); struct p9_req_t *p9_tag_lookup(struct p9_client *, u16); void p9_client_cb(struct p9_client *c, struct p9_req_t *req, int status); diff --git a/net/9p/client.c b/net/9p/client.c index bc40bb11b832..0e0f8bb3fd3c 100644 --- a/net/9p/client.c +++ b/net/9p/client.c @@ -231,19 +231,36 @@ static int parse_opts(char *opts, struct p9_client *clnt) return ret; } -static int p9_fcall_init(struct p9_fcall *fc, int alloc_msize) +static int p9_fcall_init(struct p9_client *c, struct p9_fcall *fc, + int alloc_msize) { - fc->sdata = kmalloc(alloc_msize, GFP_NOFS); + if (likely(c->fcall_cache) && alloc_msize == c->msize) { + fc->sdata = kmem_cache_alloc(c->fcall_cache, GFP_NOFS); + fc->cache = c->fcall_cache; + } else { + fc->sdata = kmalloc(alloc_msize, GFP_NOFS); + fc->cache = NULL; + } if (!fc->sdata) return -ENOMEM; fc->capacity = alloc_msize; return 0; } -static void p9_fcall_fini(struct p9_fcall *fc) +void p9_fcall_fini(struct p9_fcall *fc) { - kfree(fc->sdata); + /* sdata can be NULL for interrupted requests in trans_rdma, + * and kmem_cache_free does not do NULL-check for us + */ + if (unlikely(!fc->sdata)) + return; + + if (fc->cache) + kmem_cache_free(fc->cache, fc->sdata); + else + kfree(fc->sdata); } +EXPORT_SYMBOL(p9_fcall_fini); static struct kmem_cache *p9_req_cache; @@ -266,9 +283,9 @@ p9_tag_alloc(struct p9_client *c, int8_t type, unsigned int max_size) if (!req) return NULL; - if (p9_fcall_init(&req->tc, alloc_msize)) + if (p9_fcall_init(c, &req->tc, alloc_msize)) goto free_req; - if (p9_fcall_init(&req->rc, alloc_msize)) + if (p9_fcall_init(c, &req->rc, alloc_msize)) goto free; p9pdu_reset(&req->tc); @@ -950,6 +967,7 @@ struct p9_client *p9_client_create(const char *dev_name, char *options) clnt->trans_mod = NULL; clnt->trans = NULL; + clnt->fcall_cache = NULL; client_id = utsname()->nodename; memcpy(clnt->name, client_id, strlen(client_id) + 1); @@ -986,6 +1004,9 @@ struct p9_client *p9_client_create(const char *dev_name, char *options) if (err) goto close_trans; + clnt->fcall_cache = kmem_cache_create("9p-fcall-cache", clnt->msize, + 0, 0, NULL); + return clnt; close_trans: @@ -1017,6 +1038,7 @@ void p9_client_destroy(struct p9_client *clnt) p9_tag_cleanup(clnt); + kmem_cache_destroy(clnt->fcall_cache); kfree(clnt); } EXPORT_SYMBOL(p9_client_destroy); diff --git a/net/9p/trans_rdma.c b/net/9p/trans_rdma.c index c5cac97df7f7..c60655c90c9e 100644 --- a/net/9p/trans_rdma.c +++ b/net/9p/trans_rdma.c @@ -445,7 +445,7 @@ static int rdma_request(struct p9_client *client, struct p9_req_t *req) if (unlikely(atomic_read(&rdma->excess_rc) > 0)) { if ((atomic_sub_return(1, &rdma->excess_rc) >= 0)) { /* Got one! */ - kfree(req->rc.sdata); + p9_fcall_fini(&req->rc); req->rc.sdata = NULL; goto dont_need_post_recv; } else { -- 2.17.1