Received: by 2002:a05:6a10:c604:0:0:0:0 with SMTP id y4csp854234pxt; Thu, 5 Aug 2021 13:25:44 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyRylZs8B+9IzzPM8kYNmm7uPCvXZ9LuFDvIOoBTta7h+LpRGWEZ7gUwdWiTZ147Lj6eMkn X-Received: by 2002:a05:6602:1587:: with SMTP id e7mr218805iow.112.1628195143812; Thu, 05 Aug 2021 13:25:43 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1628195143; cv=none; d=google.com; s=arc-20160816; b=UGE7b9QJisWoOGV8d7R6/4l6KrmH3H8oaFo6Ki4svReMJzeqaEvS8YRumEjtd7lpxt swOK8s4VTEFWrQTVrK3/D9wOuMZJTB8Rlsc3+ul+s/yTOXF8ZQOJrX5AyFGSLYX+i6gM eDhN5+LyY/zGN9YunjjQJHSZNUppIC88teR4bipTVOc+XTg9SEe7wKQ6/eLhzXni9ayN SnXhkGz/lajmBlCTJWEZeThrRhlFxNXME6jqKYN4a4s1yJBvw6yPSTzaI6BPq8iUI5Sm hDNhBzaBrhGzv55NcqWgP0hrb5vIdUyQiSQnzKtwXY0FJyK4o0JhAuhhZWYA8yy6Kn5w 447Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=OJR9WhSXTh5/r+GEiJEc1RI/WSSXlisXNNbFI4X9Mvo=; b=bapWBY8nsLAXRYT9rxG2/oygLxdyYlFenfHER32OFHpC2TQ8fp+jUNuI7E+jICI5b6 oJBV5se3VFKv9bmI48iuxwREkQmgnlFp548mhR2UsbNNjPXXSw484VyoBXXCt5PftO+M uYQmoFG7VLllXYY7874MmcFMX7g4PykbWKHpH8kkuzpZzvRYMEro5FXdxScVqcAn0TzF TGqvkbJBMq1JLtIGItLMVUR2HkxAJkSBzScia1FDbXeU/SExd68CBXwIpfONxHVbXPsV w03/gFtO3stMf/XVmiZl2NB2qR22T6HAdw2gM4BlMQhP+tm5nxpG15T2FvgmBhD/YD8b 2s/w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=ZAfG2mch; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id q18si6923653ile.96.2021.08.05.13.25.31; Thu, 05 Aug 2021 13:25:43 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b=ZAfG2mch; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S241316AbhHESem (ORCPT + 99 others); Thu, 5 Aug 2021 14:34:42 -0400 Received: from mail.kernel.org ([198.145.29.99]:33536 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S241264AbhHESek (ORCPT ); Thu, 5 Aug 2021 14:34:40 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 54E5360F22; Thu, 5 Aug 2021 18:34:25 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linuxfoundation.org; s=korg; t=1628188466; bh=UtExKdknJwWELP42Tar44N0iPU1hu7Jrdi3rzGZLrrc=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=ZAfG2mch9mi5g4KuL8Ia+7debgTpvSZerbxw7ILeONpZ5TbMAHzGYfZMrRlVu44ia pfZUbZwaVbvtzzIabtos6dA0pC3wpowLzlc3bv0o6iFDPX3xhA7Xiybhwa/6573Oax TcWGVSkv3nELwIW6KfKIvKr5cGq9dculb4Mb8Fqg= Date: Thu, 5 Aug 2021 20:34:23 +0200 From: Greg Kroah-Hartman To: Long Li Cc: Bart Van Assche , "longli@linuxonhyperv.com" , "linux-block@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "linux-hyperv@vger.kernel.org" , Jonathan Corbet , KY Srinivasan , Haiyang Zhang , Stephen Hemminger , Wei Liu , Dexuan Cui , Bjorn Andersson , Hans de Goede , "Williams, Dan J" , Maximilian Luz , Mike Rapoport , Ben Widawsky , Jiri Slaby , Andra Paraschiv , Siddharth Gupta , Hannes Reinecke Subject: Re: [Patch v5 0/3] Introduce a driver to support host accelerated access to Microsoft Azure Blob for Azure VM Message-ID: References: <1628146812-29798-1-git-send-email-longli@linuxonhyperv.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Aug 05, 2021 at 06:24:57PM +0000, Long Li wrote: > > Subject: Re: [Patch v5 0/3] Introduce a driver to support host accelerated > > access to Microsoft Azure Blob for Azure VM > > > > On 8/5/21 12:00 AM, longli@linuxonhyperv.com wrote: > > > From: Long Li > > > > > > Azure Blob storage [1] is Microsoft's object storage solution for the > > > cloud. Users or client applications can access objects in Blob storage > > > via HTTP, from anywhere in the world. Objects in Blob storage are > > > accessible via the Azure Storage REST API, Azure PowerShell, Azure > > > CLI, or an Azure Storage client library. The Blob storage interface is > > > not designed to be a POSIX compliant interface. > > > > > > Problem: When a client accesses Blob storage via HTTP, it must go > > > through the Blob storage boundary of Azure and get to the storage > > > server through multiple servers. This is also true for an Azure VM. > > > > > > Solution: For an Azure VM, the Blob storage access can be accelerated > > > by having Azure host execute the Blob storage requests to the backend > > > storage server directly. > > > > > > This driver implements a VSC (Virtual Service Client) for accelerating > > > Blob storage access for an Azure VM by communicating with a VSP > > > (Virtual Service > > > Provider) on the Azure host. Instead of using HTTP to access the Blob > > > storage, an Azure VM passes the Blob storage request to the VSP on the > > > Azure host. The Azure host uses its native network to perform Blob > > > storage requests to the backend server directly. > > > > > > This driver doesn't implement Blob storage APIs. It acts as a fast > > > channel to pass user-mode Blob storage requests to the Azure host. The > > > user-mode program using this driver implements Blob storage APIs and > > > packages the Blob storage request as structured data to VSC. The > > > request data is modeled as three user provided buffers (request, > > > response and data buffers), that are patterned on the HTTP model used > > > by existing Azure Blob clients. The VSC passes those buffers to VSP for Blob > > storage requests. > > > > > > The driver optimizes Blob storage access for an Azure VM in two ways: > > > > > > 1. The Blob storage requests are performed by the Azure host to the > > > Azure Blob backend storage server directly. > > > > > > 2. It allows the Azure host to use transport technologies (e.g. RDMA) > > > available to the Azure host but not available to the VM, to reach to > > > Azure Blob backend servers. > > > > > > Test results using this driver for an Azure VM: > > > 100 Blob clients running on an Azure VM, each reading 100GB Block Blobs. > > > (10 TB total read data) > > > With REST API over HTTP: 94.4 mins > > > Using this driver: 72.5 mins > > > Performance (measured in throughput) gain: 30%. > > > > > > [1] > > > > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdocs > > > .microsoft.com%2Fen-us%2Fazure%2Fstorage%2Fblobs%2Fstorage-blobs- > > intro > > > > > duction&data=04%7C01%7Clongli%40microsoft.com%7C6ba60a78f4e74 > > aeb0b > > > > > b108d95833bf53%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C6376 > > 378015 > > > > > 92577579%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoi > > V2luMzIiL > > > > > CJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=ab5Zl2cQdmUhdT3l > > SotDwMl > > > DQuE0JaY%2B1REPQ0%2FjXa4%3D&reserved=0 > > > > Is the ioctl interface the only user space interface provided by this kernel > > driver? If so, why has this code been implemented as a kernel driver instead > > of e.g. a user space library that uses vfio to interact with a PCIe device? As an > > example, Qemu supports many different virtio device types. > > The Hyper-V presents one such device for the whole VM. This device is used by all processes on the VM. (The test benchmark used 100 processes) > > Hyper-V doesn't support creating one device for each process. We cannot use VFIO in this model. I still think this "model" is totally broken and wrong overall. Again, you are creating a custom "block" layer with a character device, forcing all userspace programs to use a custom library (where is it at?) just to get their data. There's a reason the POSIX model is there, why are you all ignoring it? greg k-h