Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp306370pxb; Wed, 20 Jan 2021 07:29:32 -0800 (PST) X-Google-Smtp-Source: ABdhPJzoVF57JuQGbVU+AIHb/3AqXP90rfEk0I9JQ5Dvc6rn1xRlQeJSL5vh8AQPT1Z0ZAGyCrOp X-Received: by 2002:a17:907:a06f:: with SMTP id ia15mr6558824ejc.328.1611156572664; Wed, 20 Jan 2021 07:29:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1611156572; cv=none; d=google.com; s=arc-20160816; b=08nALpXR4Rc83+lZolpqc9VJuaZHiGGwHN/XGs/YbBWTgcnpOCJ5uTYwWJnpiYpNyy zC0wNZWK6gqt+KJLi9RneyFkCx2nt9/kJRyWjogz3McOXGKsccpViX7izK+NkOKRScUx zozHY4VVfUZ0JMhe5sFQgS+w1FHy+8vQLpJERqZIe0kzIX9VrP+uSnr++4J9WX991s/b Q8e+F0iDEfPKyinMXlC04e7wegUBwDxZQmH/wfNUVoJUogiOLjZpD5DE2AU4dt3fRwMi ROyq8ICg/TtsopRR9mVDrDIxyx4qeqPrf8pJpXM1/ueXmNJlg57K0U7KZ2HsKNHkSzBZ D4YQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature:dkim-filter; bh=fes2ideAaPB5Yj0lwoll0Ulkg8QPY896Szrs4WHrXwg=; b=l9ZbUu6uZEJlpe3bcdNRyb3j7cmy5yayC2Y3rqem+MJiMlmz11Ufzp/raegXbnmfVy ZDJKy+pXcKWMR5NvR4wS5jRnxplYiQ+9F9Hq/bT3ImNXBCFDsAcxsO6MBMpgTI6REHbl rn1hgohbpVZn01F2RA+XiGRVTjdW7tKxVfk+S2qxooZutOgjkDDxgfzoFK/X5xTkFGeS 4cY2qPv4qXiuKAB4ZJ9vXmMcB3WVwrxU1l2PeIFMioIz46k3i9quosMdpRXS54JdlYde IySGSn/NvWDobtEsCidg8Us39XE/G5wTMxRr/RqA0yEG84/+F/VcpcXKV9iUVUc6pwta HUbA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@fieldses.org header.s=default header.b=UVyT4ksj; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id f10si736056ejb.622.2021.01.20.07.28.58; Wed, 20 Jan 2021 07:29:32 -0800 (PST) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@fieldses.org header.s=default header.b=UVyT4ksj; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732023AbhATPXd (ORCPT + 99 others); Wed, 20 Jan 2021 10:23:33 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:33160 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729113AbhATPIV (ORCPT ); Wed, 20 Jan 2021 10:08:21 -0500 Received: from fieldses.org (fieldses.org [IPv6:2600:3c00:e000:2f7::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3C5BAC061575 for ; Wed, 20 Jan 2021 07:07:39 -0800 (PST) Received: by fieldses.org (Postfix, from userid 2815) id CB7186C0D; Wed, 20 Jan 2021 10:07:37 -0500 (EST) DKIM-Filter: OpenDKIM Filter v2.11.0 fieldses.org CB7186C0D DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fieldses.org; s=default; t=1611155257; bh=fes2ideAaPB5Yj0lwoll0Ulkg8QPY896Szrs4WHrXwg=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=UVyT4ksjCKkh10Sg37fuGiaBEv1UO8WHIOLFyMlNruYOwbS9Vx39BY2d45aejrPzh qoMYJSvP15DviBbPCMIhLU6c6HQYTDWcqaAcqNbmoEmQe3zsCBLnSEhxbVJV/eH2sS cp+ENQ10fDHzlFIM30njKjouMqUiM+ysNFAz8jaU= Date: Wed, 20 Jan 2021 10:07:37 -0500 From: "bfields@fieldses.org" To: Trond Myklebust Cc: "fsorenso@redhat.com" , "linux-nfs@vger.kernel.org" , "aglo@umich.edu" , "bcodding@redhat.com" , "jshivers@redhat.com" , "chuck.lever@oracle.com" Subject: Re: unsharing tcp connections from different NFS mounts Message-ID: <20210120150737.GA17548@fieldses.org> References: <20201007001814.GA5138@fieldses.org> <57E3293C-5C49-4A80-957B-E490E6A9B32E@redhat.com> <5B5CF80C-494A-42D3-8D3F-51C0277D9E1B@redhat.com> <8ED5511E-25DE-4C06-9E26-A1947383C86A@oracle.com> <20201007140502.GC23452@fieldses.org> <85F496CD-9AAC-451C-A224-FCD138BDC591@oracle.com> <20201007160556.GE23452@fieldses.org> <20210119222229.GA29488@fieldses.org> <2d77534fb8be557c6883c8c386ebf4175f64454a.camel@hammerspace.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <2d77534fb8be557c6883c8c386ebf4175f64454a.camel@hammerspace.com> User-Agent: Mutt/1.5.21 (2010-09-15) Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org On Tue, Jan 19, 2021 at 11:09:55PM +0000, Trond Myklebust wrote: > On Tue, 2021-01-19 at 17:22 -0500, bfields@fieldses.org wrote: > > On Wed, Oct 07, 2020 at 04:50:26PM +0000, Trond Myklebust wrote: > > > As far as I can tell, this thread started with a complaint that > > > performance suffers when we don't allow setups that hack the client > > > by > > > pretending that a multi-homed server is actually multiple different > > > servers. > > > > > > AFAICS Tom Talpey's question is the relevant one. Why is there a > > > performance regression being seen by these setups when they share > > > the > > > same connection? Is it really the connection, or is it the fact > > > that > > > they all share the same fixed-slot session? > > > > > > I did see Igor's claim that there is a QoS issue (which afaics > > > would > > > also affect NFSv3), but why do I care about QoS as a per-mountpoint > > > feature? > > > > Sorry for being slow to get back to this. > > > > Some more details: > > > > Say an NFS server exports /data1 and /data2. > > > > A client mounts both.  Process 'large' starts creating 10G+ files in > > /data1, queuing up a lot of nfs WRITE rpc_tasks. > > > > Process 'small' creates a lot of small files in /data2, which > > requires a > > lot of synchronous rpc_tasks, each of which wait in line with the > > large > > WRITE tasks. > > > > The 'small' process makes painfully slow progress. > > > > The customer previously made things work for them by mounting two > > different server IP addresses, so the "small" and "large" processes > > effectively end up with their own queues. > > > > Frank Sorenson has a test showing the difference; see > > > >         https://bugzilla.redhat.com/show_bug.cgi?id=1703850#c42 > >         https://bugzilla.redhat.com/show_bug.cgi?id=1703850#c43 > > > > In that test, the "small" process creates files at a rate thousands > > of > > times slower when the "large" process is also running. > > > > Any suggestions? > > > > I don't see how this answers my questions above? So mainly: > > > Why is there a performance regression being seen by these setups > > > when they share the same connection? Is it really the connection, > > > or is it the fact that they all share the same fixed-slot session? I don't know. Any pointers how we might go about finding the answer? It's easy to test the case of entirely seperate state & tcp connections. If we want to test with a shared connection but separate slots I guess we'd need to create a separate session for each nfs4_server, and a lot of functions that currently take an nfs4_client would need to take an nfs4_server? --b.