Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932525AbdHVKOh (ORCPT ); Tue, 22 Aug 2017 06:14:37 -0400 Received: from mx1.redhat.com ([209.132.183.28]:54684 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932414AbdHVKOg (ORCPT ); Tue, 22 Aug 2017 06:14:36 -0400 DMARC-Filter: OpenDMARC Filter v1.3.2 mx1.redhat.com 4FBC67E440 Authentication-Results: ext-mx03.extmail.prod.ext.phx2.redhat.com; dmarc=none (p=none dis=none) header.from=redhat.com Authentication-Results: ext-mx03.extmail.prod.ext.phx2.redhat.com; spf=fail smtp.mailfrom=stefanha@redhat.com Date: Tue, 22 Aug 2017 11:14:33 +0100 From: Stefan Hajnoczi To: Dexuan Cui Cc: "davem@davemloft.net" , "netdev@vger.kernel.org" , "devel@linuxdriverproject.org" , KY Srinivasan , Haiyang Zhang , Stephen Hemminger , George Zhang , Jorgen Hansen , Michal Kubecek , Vitaly Kuznetsov , Cathy Avery , "jasowang@redhat.com" , Rolf Neugebauer , Dave Scott , Marcelo Cerri , "apw@canonical.com" , "olaf@aepfle.de" , "joe@perches.com" , "linux-kernel@vger.kernel.org" , Dan Carpenter Subject: Re: [PATCH net-next 3/3] hv_sock: implements Hyper-V transport for Virtual Sockets (AF_VSOCK) Message-ID: <20170822101433.GC16799@stefanha-x1.localdomain> References: <20170817145551.GI5539@stefanha-x1.localdomain> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.8.3 (2017-05-23) X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.27]); Tue, 22 Aug 2017 10:14:36 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2498 Lines: 67 On Fri, Aug 18, 2017 at 10:23:54PM +0000, Dexuan Cui wrote: > > From: Stefan Hajnoczi [mailto:stefanha@redhat.com] > > Sent: Thursday, August 17, 2017 07:56 > > To: Dexuan Cui > > On Tue, Aug 15, 2017 at 10:18:41PM +0000, Dexuan Cui wrote: > > > +static u32 hvs_get_local_cid(void) > > > +{ > > > + return VMADDR_CID_ANY; > > > +} > > > > Interesting concept: the guest never knows its CID. This is nice from a > > live migration perspective. Currently VMCI and virtio adjust listen > > socket local CIDs after migration. > > > > > +static bool hvs_stream_allow(u32 cid, u32 port) > > > +{ > > > + static const u32 valid_cids[] = { > > > + VMADDR_CID_ANY, > > > > Is this for loopback? > > No, we don't support lookback in Linux VM, at least for now. > In our Linux implementation, Linux VM can only connect to the host, and > here when Linux VM calls connect(), I treat VMADDR_CID_ANY > the same as VMADDR_CID_HOST. VMCI and virtio-vsock do not treat connect(VMADDR_CID_ANY) the same as connect(VMADDR_CID_HOST). It is an error to connect to VMADDR_CID_ANY. > > > + VMADDR_CID_HOST, > > > + }; > > > + int i; > > > + > > > + /* The host's port range [MIN_HOST_EPHEMERAL_PORT, 0xFFFFFFFF) > > is > > > + * reserved as ephemeral ports, which are used as the host's ports > > > + * when the host initiates connections. > > > + */ > > > + if (port > MAX_HOST_LISTEN_PORT) > > > + return false; > > > > Without this if statement the guest will attempt to connect. I guess > > there will be no listen sockets above MAX_HOST_LISTEN_PORT, so the > > connection attempt will fail. > > You're correct. > To use the vsock common infrastructure, we have to map Hyper-V's > GUID to int , and hence we must limit > the port range we can listen() on to [0, MAX_LISTEN_PORT], i.e. > we can only use half of the whole 32-bit port space for listen(). > This is detailed in the long comments starting at about Line 100. > > > ...but hardcode this knowledge into the guest driver? > I'd like the guest's connect() to fail immediately here. > IMO this is better than a connect timeout. :-) Thanks for explaining. Perhaps the comment could be updated: /* The host's port range [MIN_HOST_EPHEMERAL_PORT, 0xFFFFFFFF) is * reserved as ephemeral ports, which are used as the host's ports when * the host initiates connections. * * Perform this check in the guest so an immediate error is produced * instead of a timeout. */ Stefan