Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp4701619imm; Mon, 30 Jul 2018 21:19:12 -0700 (PDT) X-Google-Smtp-Source: AAOMgpf40eBo4FzokKYYHeBDs3fgdnG0g3k8ri5p6AqpieRjmEuno2WFkegzGGtx3snkWRziEoiG X-Received: by 2002:a62:25c5:: with SMTP id l188-v6mr20368162pfl.179.1533010752451; Mon, 30 Jul 2018 21:19:12 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533010752; cv=none; d=google.com; s=arc-20160816; b=cSxV9tEgCo6Ugw/tQRy6IlWt3fduMVoSV36Uw8d9vzYudr1hDHhfnL+Ntj0xqguxdl Ac9c6zRvkBBmUNQ+4eDHh1ERP1f+J198QdFcOw3exkyXcsS8yg76enthwHZIHVq8qh34 p0KxSa1FlKWKtxFtENCA8NevgexArctuNacjT9U/W761Ba8p7aFHUUPaow/zgSl4nSm6 SVerShbDt2SjqVldLs2Ky8jLRnp2NyGtZAeMkcG0Nm+28vBsWfP7E3Up0QA5DT0T5Gms RSKMZqSYqxMVl5bur99GtGWVsGTTWRRcJDUobQnMjI2pTsdGtd/Yr1UcVjfIaRiuEeez a54g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=vzOY4BKuSv7NSnwRps0Fgl/hDd9XORmuYgd8XQW4/Ig=; b=bZQQFsnetp54qoejmv+uvs8FzP6vDIzLlkilpBAPb4FE2DRZQeU2+yFUXCs8fqqNWp VCnKNPieWLhiWBmB2WFcDYqaytiRJVc7btxeYC/zaa46M21TBwj/BgpGmQIBYkFHUb/G gASTaS7KFHb1E3ra0tfZWw1yzTssFJMggju/t0ZPp9ydTzKalU+dfg86dRN1Wn+V1u9X h0WfzPJ5tq/EzvlYluFWaD53dLnKEkSfv8aRfeXS10r+dIyDVd/rmGHwC1CcrmHvZQX/ CgMVUNsSw3W7FZhKTU4rb9IKIFttLNncNwzuJcvSPZU6/IE8WR3sUklIXQBWcxqTpzvj UbsA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k125-v6si12166443pgk.315.2018.07.30.21.18.58; Mon, 30 Jul 2018 21:19:12 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728035AbeGaFzl (ORCPT + 99 others); Tue, 31 Jul 2018 01:55:41 -0400 Received: from nautica.notk.org ([91.121.71.147]:51623 "EHLO nautica.notk.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726071AbeGaFzl (ORCPT ); Tue, 31 Jul 2018 01:55:41 -0400 Received: by nautica.notk.org (Postfix, from userid 1001) id 668F7C009; Tue, 31 Jul 2018 06:17:22 +0200 (CEST) Date: Tue, 31 Jul 2018 06:17:07 +0200 From: Dominique Martinet To: Matthew Wilcox Cc: v9fs-developer@lists.sourceforge.net, Greg Kurz , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH 2/2] net/9p: add a per-client fcall kmem_cache Message-ID: <20180731041707.GA20546@nautica> References: <20180730093101.GA7894@nautica> <1532943263-24378-1-git-send-email-asmadeus@codewreck.org> <1532943263-24378-2-git-send-email-asmadeus@codewreck.org> <20180731024658.GC19692@bombadil.infradead.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20180731024658.GC19692@bombadil.infradead.org> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Matthew Wilcox wrote on Mon, Jul 30, 2018: > On Mon, Jul 30, 2018 at 11:34:23AM +0200, Dominique Martinet wrote: > > -static int p9_fcall_alloc(struct p9_fcall *fc, int alloc_msize) > > +static int p9_fcall_alloc(struct p9_client *c, struct p9_fcall *fc, > > + int alloc_msize) > > { > > - fc->sdata = kmalloc(alloc_msize, GFP_NOFS); > > + if (c->fcall_cache && alloc_msize == c->msize) > > + fc->sdata = kmem_cache_alloc(c->fcall_cache, GFP_NOFS); > > + else > > + fc->sdata = kmalloc(alloc_msize, GFP_NOFS); > > Could you simplify this by initialising c->msize to 0 and then this > can simply be: > > > + if (alloc_msize == c->msize) > ... Hmm, this is rather tricky with the current flow of things; p9_client_version() has multiple uses for that msize field. Basically what happens is: - init client struct, set clip msize to mount option/transport-specific max - p9_client_version() uses current c->msize to send a suggested value to the server - p9_client_rpc() uses current c->msize to allocate that first rpc, this is pretty much hard-coded and will be quite intrusive to make an exception for - p9_client_version() looks at the msize the server suggested and clips c->msize if the reply's is smaller than c->msize I kind of agree it'd be nice to remove that check being done all the time for just startup, but I don't see how to do this easily with the current code. Making p9_client_version take an extra argument would be easy but we'd need to actually hardcode in p9_client_rpc that "if the message type is TVERSION then use [page size or whatever] for allocation" and that kinds of kills the point... The alternative being having p9_client_rpc takes the actual size as argument itself but this once again is pretty intrusive even if it could be done mechanically... I'll think about this some more > > +void p9_fcall_free(struct p9_client *c, struct p9_fcall *fc) > > +{ > > + /* sdata can be NULL for interrupted requests in trans_rdma, > > + * and kmem_cache_free does not do NULL-check for us > > + */ > > + if (unlikely(!fc->sdata)) > > + return; > > + > > + if (c->fcall_cache && fc->capacity == c->msize) > > + kmem_cache_free(c->fcall_cache, fc->sdata); > > + else > > + kfree(fc->sdata); > > +} > > Is it possible for fcall_cache to be allocated before fcall_free is > called? I'm concerned we might do this: > > allocate message A > allocate message B > receive response A > allocate fcall_cache > receive response B > > and then we'd call kmem_cache_free() for something allocated by kmalloc(), > which works with slab and slub, but doesn't work with slob (alas). Bleh, I checked this would work for slab and didn't really check others.. This cannot happen right now because we only return the client struct from p9_client_create after the first message is done (and, right now, freed) but when we start adding refcounting to requests it'd be possible to free the very first response after fcall_cache is allocated with a "bad" server like syzcaller does sending the version reply before the request came in. I can't see any work-around around this other than storing how the fcall was allocated in the struct itself though... I guess I might as well do that now, unless you have a better idea. > > @@ -980,6 +1000,9 @@ struct p9_client *p9_client_create(const char *dev_name, char *options) > > if (err) > > goto close_trans; > > > > + clnt->fcall_cache = kmem_cache_create("9p-fcall-cache", clnt->msize, > > + 0, 0, NULL); > > + > > If we have slab merging turned off, or we have two mounts from servers > with different msizes, we'll end up with two slabs called 9p-fcall-cache. > I'm OK with that, but are you? Yeah, the reason I didn't make it global like p9_req_cache is precisely to get two separate caches if the msizes are different. I actually considered adding msize to the string with snprintf or something but someone looking at it through slabinfo or similar will have the sizes anyway so I don't think this would bring anything, do you know if/think that tools will choke on multiple caches with the same name? I'm not sure about slab merging being disabled though, from the little I understand I do not see why anyone would do that except for debugging, and I'm fine with that. Please let me know if I'm missing something though! Thanks for the review, -- Dominique Martinet