Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp3527468pxb; Sun, 31 Jan 2021 19:40:31 -0800 (PST) X-Google-Smtp-Source: ABdhPJzyH0w+exHz5j2BYvtPNmJfCVkuuWxlc0/Ubw44v09sArl98GXIi1frThomFlGrGaBdXOtL X-Received: by 2002:a17:906:f6d8:: with SMTP id jo24mr15791080ejb.213.1612150831382; Sun, 31 Jan 2021 19:40:31 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1612150831; cv=none; d=google.com; s=arc-20160816; b=ubf2R0l5RQuFXNH+qGw5C/RQOTxLEKFSWpYKIRC6IHkMGjqO9tqsDSBHdpvN8LS3EA VWrjsN4Y0PtDPgZ3HgRLSV8qcglSQguidhmrZIJ4psnETfJSyzohxXdXwSVKqfHOMHyO xfG2mFy7hL5pR5IUM+sjtS0pg0VgMgBG/zdQHFbMZpcQnZpEACydjS2bry/q7009b8X9 7OHKYm5BMouP1UXGuqIVviwrMp1Dcq7LvW+u011qvKW5wZmQ3Z6+D3lfnxpmuJ+87dgp cJsmWRqhu74EibMDH3KsaCeRjnM1NOgvUp5/llKljgzBfvBToIMkH8pjBn4+QYBl4dLe IUsQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-language:content-transfer-encoding :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:dkim-signature; bh=/gAb6zAwOxu2R4KqWA0oIvyoIahKBpzIeRdzj32BG5w=; b=L6fkab4StA+y8paUJFuf9BS5pEPIUQPmioVdPaYB9QJRJZKyUtCNmNMHN8xRs4etTR xpdXtvO5KhKod2t+PLIY8o7/zub0g/s/RuZjxrJgzR5mivxE4AFTbQ38V2B0RaYTwxss 7uJM0gmc1VYeNJ5jAIoxQ5gct3CsjyzxYnkZEdY0ruS+7e1Hj1hBmkgUm4Y3lqQFsdvY eitPk0Y7AG8lmv7L5av0sPZ67wKb8ev+Vd6a/qhgD5PZrU99vzFX6XoAMUnSJdXScRGM 91nFEqkWvDZjmRWpB4d4f7UQR90JSHM0sHYjdQnIHJbTh6zMGY/i4bTTlHU9x9tjUY5G T3wg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=S6uMOtmy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id gb7si10596186ejc.286.2021.01.31.19.40.07; Sun, 31 Jan 2021 19:40:31 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=S6uMOtmy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231429AbhBADiL (ORCPT + 99 others); Sun, 31 Jan 2021 22:38:11 -0500 Received: from us-smtp-delivery-124.mimecast.com ([216.205.24.124]:31864 "EHLO us-smtp-delivery-124.mimecast.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231393AbhBADiD (ORCPT ); Sun, 31 Jan 2021 22:38:03 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1612150595; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=/gAb6zAwOxu2R4KqWA0oIvyoIahKBpzIeRdzj32BG5w=; b=S6uMOtmy/WFj6fu1viR8Uf+N2x9WshQKQsXWHWD1dvrIDv0J0MJL++r63TFqxpCrXa+gBt izHtbFHlCtKcIqd/hHGdf+H9YuUJENK6wgDhS9CFn7ZY/Oe6ic9VDUyDpdkndNNV15vGYt /k9C6w9S07hKc8yvIkjCVaPxYAxB+gw= Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-389-3I_W8Ol6Ns6EcSdpf565sA-1; Sun, 31 Jan 2021 22:36:31 -0500 X-MC-Unique: 3I_W8Ol6Ns6EcSdpf565sA-1 Received: from smtp.corp.redhat.com (int-mx04.intmail.prod.int.phx2.redhat.com [10.5.11.14]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx01.redhat.com (Postfix) with ESMTPS id 83616801B13; Mon, 1 Feb 2021 03:36:30 +0000 (UTC) Received: from [10.72.13.233] (ovpn-13-233.pek2.redhat.com [10.72.13.233]) by smtp.corp.redhat.com (Postfix) with ESMTP id DEE9D5D9DC; Mon, 1 Feb 2021 03:36:24 +0000 (UTC) Subject: Re: [PATCH 2/2] vdpa/mlx5: Restore the hardware used index after change map To: Eli Cohen Cc: mst@redhat.com, virtualization@lists.linux-foundation.org, netdev@vger.kernel.org, linux-kernel@vger.kernel.org, lulu@redhat.com References: <20210128134130.3051-1-elic@nvidia.com> <20210128134130.3051-3-elic@nvidia.com> <54239b51-918c-3475-dc88-4da1a4548da8@redhat.com> <20210131185536.GA164217@mtl-vdi-166.wap.labs.mlnx> From: Jason Wang Message-ID: <0c99f35c-7644-7201-cd11-7d486389a182@redhat.com> Date: Mon, 1 Feb 2021 11:36:23 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0 MIME-Version: 1.0 In-Reply-To: <20210131185536.GA164217@mtl-vdi-166.wap.labs.mlnx> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US X-Scanned-By: MIMEDefang 2.79 on 10.5.11.14 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2021/2/1 上午2:55, Eli Cohen wrote: > On Fri, Jan 29, 2021 at 11:49:45AM +0800, Jason Wang wrote: >> On 2021/1/28 下午9:41, Eli Cohen wrote: >>> When a change of memory map occurs, the hardware resources are destroyed >>> and then re-created again with the new memory map. In such case, we need >>> to restore the hardware available and used indices. The driver failed to >>> restore the used index which is added here. >>> >>> Fixes 1a86b377aa21 ("vdpa/mlx5: Add VDPA driver for supported mlx5 devices") >>> Signed-off-by: Eli Cohen >> >> A question. Does this mean after a vq is suspended, the hw used index is not >> equal to vq used index? > Surely there is just one "Used index" for a VQ. What I was trying to say > is that after the VQ is suspended, I read the used index by querying the > hardware. The read result is the used index that the hardware wrote to > memory. Just to make sure I understand here. So it looks to me we had two index. The first is the used index which is stored in the memory/virtqueue, the second is the one that is stored by the device. > After the I create the new hardware object, I need to tell it > what is the used index (and the available index) as a way to sync it > with the existing VQ. For avail index I understand that the hardware index is not synced with the avail index stored in the memory/virtqueue. The question is used index, if the hardware one is not synced with the one in the virtqueue. It means after vq is suspended,  some requests is not completed by the hardware (e.g the buffer were not put to used ring). This may have implications to live migration, it means those unaccomplished requests needs to be migrated to the destination and resubmitted to the device. This looks not easy. Thanks > > This sync is especially important when a change of map occurs while the > VQ was already used (hence the indices are likely to be non zero). This > can be triggered by hot adding memory after the VQs have been used. > >> Thanks >> >> >>> --- >>> drivers/vdpa/mlx5/net/mlx5_vnet.c | 7 +++++++ >>> 1 file changed, 7 insertions(+) >>> >>> diff --git a/drivers/vdpa/mlx5/net/mlx5_vnet.c b/drivers/vdpa/mlx5/net/mlx5_vnet.c >>> index 549ded074ff3..3fc8588cecae 100644 >>> --- a/drivers/vdpa/mlx5/net/mlx5_vnet.c >>> +++ b/drivers/vdpa/mlx5/net/mlx5_vnet.c >>> @@ -87,6 +87,7 @@ struct mlx5_vq_restore_info { >>> u64 device_addr; >>> u64 driver_addr; >>> u16 avail_index; >>> + u16 used_index; >>> bool ready; >>> struct vdpa_callback cb; >>> bool restore; >>> @@ -121,6 +122,7 @@ struct mlx5_vdpa_virtqueue { >>> u32 virtq_id; >>> struct mlx5_vdpa_net *ndev; >>> u16 avail_idx; >>> + u16 used_idx; >>> int fw_state; >>> /* keep last in the struct */ >>> @@ -804,6 +806,7 @@ static int create_virtqueue(struct mlx5_vdpa_net *ndev, struct mlx5_vdpa_virtque >>> obj_context = MLX5_ADDR_OF(create_virtio_net_q_in, in, obj_context); >>> MLX5_SET(virtio_net_q_object, obj_context, hw_available_index, mvq->avail_idx); >>> + MLX5_SET(virtio_net_q_object, obj_context, hw_used_index, mvq->used_idx); >>> MLX5_SET(virtio_net_q_object, obj_context, queue_feature_bit_mask_12_3, >>> get_features_12_3(ndev->mvdev.actual_features)); >>> vq_ctx = MLX5_ADDR_OF(virtio_net_q_object, obj_context, virtio_q_context); >>> @@ -1022,6 +1025,7 @@ static int connect_qps(struct mlx5_vdpa_net *ndev, struct mlx5_vdpa_virtqueue *m >>> struct mlx5_virtq_attr { >>> u8 state; >>> u16 available_index; >>> + u16 used_index; >>> }; >>> static int query_virtqueue(struct mlx5_vdpa_net *ndev, struct mlx5_vdpa_virtqueue *mvq, >>> @@ -1052,6 +1056,7 @@ static int query_virtqueue(struct mlx5_vdpa_net *ndev, struct mlx5_vdpa_virtqueu >>> memset(attr, 0, sizeof(*attr)); >>> attr->state = MLX5_GET(virtio_net_q_object, obj_context, state); >>> attr->available_index = MLX5_GET(virtio_net_q_object, obj_context, hw_available_index); >>> + attr->used_index = MLX5_GET(virtio_net_q_object, obj_context, hw_used_index); >>> kfree(out); >>> return 0; >>> @@ -1602,6 +1607,7 @@ static int save_channel_info(struct mlx5_vdpa_net *ndev, struct mlx5_vdpa_virtqu >>> return err; >>> ri->avail_index = attr.available_index; >>> + ri->used_index = attr.used_index; >>> ri->ready = mvq->ready; >>> ri->num_ent = mvq->num_ent; >>> ri->desc_addr = mvq->desc_addr; >>> @@ -1646,6 +1652,7 @@ static void restore_channels_info(struct mlx5_vdpa_net *ndev) >>> continue; >>> mvq->avail_idx = ri->avail_index; >>> + mvq->used_idx = ri->used_index; >>> mvq->ready = ri->ready; >>> mvq->num_ent = ri->num_ent; >>> mvq->desc_addr = ri->desc_addr;