Received: by 2002:a05:6358:d09b:b0:dc:cd0c:909e with SMTP id jc27csp7201801rwb; Tue, 6 Dec 2022 02:34:30 -0800 (PST) X-Google-Smtp-Source: AA0mqf5vmvld9U8T5m6reqyeRsNDqoAJfVog2VE8q7JCYgbCnACGqxPFes7my1rPtKX3v15ycmtr X-Received: by 2002:a17:906:49c4:b0:7c0:d23c:ead5 with SMTP id w4-20020a17090649c400b007c0d23cead5mr12511292ejv.606.1670322869758; Tue, 06 Dec 2022 02:34:29 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1670322869; cv=none; d=google.com; s=arc-20160816; b=oAWYAuRjgQcnHKtG0CEXVpIPUqUo1Q3VA81O/aG9vZvonuUG2QoF0HPhBL9kt2SzZJ bpnTceXYr2DNv4bnBUnj6rEMrSAaQD/RS1vodLDPigF3sw/kwZJZFEEyhymIrW4dYq9V Hkb5v9KVFX7+vBHHfDoWDDObTVAsc9vKTyZ1OmXNUdnkoJ/vYpyXrKM0zVtBB0bKO9C2 OmCmdTgQwJLKvqzMIlDP+moye53P3AQzPcnpe9cQgk/1kOgPCEyAy/CQ7s4SsDWTdBci UEkM0gAKUZxBRJph7rY4YRJL3unFzpeUW79JpDpXsen2eqYj04IEeeBKcp7oQZ08Mwtv 9F5Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:subject:user-agent:mime-version:date:message-id; bh=WBVuDXKiHGfJOKDpU3g131P/D12DuBKwAR9pKWCyHgk=; b=o8UduKHFALPjJ20/BZXQZHAgz+LIlufRPIyyjkQYqovTOUMKF0KQmeXF23kEJqNXI0 3t0pBSk+ftXYWqyf3Mew+S9w+oX+8mOC1T0Q6budntDg5EMtqmZ4jQVa5K3Uq1lYTHde TsdsF+Sp7fwVCmU+/QfRMWtMjNS5wq/yDWJr4sVv8rhTkRr9eoV/YlBWLby6bMg+htbp rj7G/dv4qX2+bLhmOubRW6Sm6onZb+esvF36I2VATry7qVnjhVxAWrfV9NUcWUZ4WEOU GXwKR9nCnHVNsQ23uoipGNnFe2aua4+LtyrfQBdTCBAwwh6jyD2FoVumk4jwjinH8Lgf hlkw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id gt7-20020a1709072d8700b007897abd2d17si10442353ejc.468.2022.12.06.02.34.09; Tue, 06 Dec 2022 02:34:29 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234122AbiLFIqJ (ORCPT + 79 others); Tue, 6 Dec 2022 03:46:09 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42044 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232099AbiLFIqA (ORCPT ); Tue, 6 Dec 2022 03:46:00 -0500 Received: from szxga02-in.huawei.com (szxga02-in.huawei.com [45.249.212.188]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 70D1DEE29 for ; Tue, 6 Dec 2022 00:45:59 -0800 (PST) Received: from kwepemi100025.china.huawei.com (unknown [172.30.72.56]) by szxga02-in.huawei.com (SkyGuard) with ESMTP id 4NRDWT3H8pzFpfF; Tue, 6 Dec 2022 16:45:09 +0800 (CST) Received: from [10.174.148.223] (10.174.148.223) by kwepemi100025.china.huawei.com (7.221.188.158) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.31; Tue, 6 Dec 2022 16:45:42 +0800 Message-ID: Date: Tue, 6 Dec 2022 16:45:42 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.7.0 Subject: Re: [PATCH v2] vdpasim: support doorbell mapping To: Jason Wang CC: , , , , , , , , , References: <20221205135130.2336-1-longpeng2@huawei.com> From: "Longpeng (Mike, Cloud Infrastructure Service Product Dept.)" In-Reply-To: Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: 8bit X-Originating-IP: [10.174.148.223] X-ClientProxiedBy: dggems703-chm.china.huawei.com (10.3.19.180) To kwepemi100025.china.huawei.com (7.221.188.158) X-CFilter-Loop: Reflected X-Spam-Status: No, score=-4.5 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_MED,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 在 2022/12/6 14:59, Jason Wang 写道: > On Mon, Dec 5, 2022 at 9:52 PM Longpeng(Mike) wrote: >> >> From: Longpeng >> >> Support doorbell mapping for vdpasim devices, then we can test the notify >> passthrough feature even if there's no real hardware on hand. >> >> Allocates a dummy page which is used to emulate the notify page of the device, >> all VQs share the same notify register that initiated to 0xffff. A periodic >> work will check whether there're requests need to process ( the value of the >> notify register is 0xffff or not ). >> >> This cap is disabled as default, users can enable it as follow: >> modprobe vdpa_sim notify_passthrough=true >> >> Signed-off-by: Longpeng >> --- >> Changes v1->v2: >> - support both kick mode and passthrough mode. [Jason] >> - poll the notify register first. [Jason, Michael] >> --- >> drivers/vdpa/vdpa_sim/vdpa_sim.c | 77 ++++++++++++++++++++++++++++++++ >> drivers/vdpa/vdpa_sim/vdpa_sim.h | 3 ++ >> 2 files changed, 80 insertions(+) >> >> diff --git a/drivers/vdpa/vdpa_sim/vdpa_sim.c b/drivers/vdpa/vdpa_sim/vdpa_sim.c >> index 7438a89ce939..07096a04dabb 100644 >> --- a/drivers/vdpa/vdpa_sim/vdpa_sim.c >> +++ b/drivers/vdpa/vdpa_sim/vdpa_sim.c >> @@ -14,6 +14,7 @@ >> #include >> #include >> #include >> +#include >> #include >> #include >> #include >> @@ -36,9 +37,16 @@ module_param(max_iotlb_entries, int, 0444); >> MODULE_PARM_DESC(max_iotlb_entries, >> "Maximum number of iotlb entries for each address space. 0 means unlimited. (default: 2048)"); >> >> +static bool notify_passthrough; >> +module_param(notify_passthrough, bool, 0444); >> +MODULE_PARM_DESC(notify_passthrough, >> + "Enable vq notify(doorbell) area mapping. (default: false)"); > > I'm not sure if it's worth doing this, I think we can afford the cost > of periodic work (especially considering it's a simulator). > We can support kick mode and passthgourh mode simultaneously now, it is a little unnecessary. I would remove it in next version. >> + >> #define VDPASIM_QUEUE_ALIGN PAGE_SIZE >> #define VDPASIM_QUEUE_MAX 256 >> #define VDPASIM_VENDOR_ID 0 >> +#define VDPASIM_VRING_POLL_PERIOD 100 /* ms */ >> +#define VDPASIM_NOTIFY_DEFVAL 0xffff >> >> static struct vdpasim *vdpa_to_sim(struct vdpa_device *vdpa) >> { >> @@ -246,6 +254,28 @@ static const struct dma_map_ops vdpasim_dma_ops = { >> static const struct vdpa_config_ops vdpasim_config_ops; >> static const struct vdpa_config_ops vdpasim_batch_config_ops; >> >> +static void vdpasim_notify_work(struct work_struct *work) >> +{ >> + struct vdpasim *vdpasim; >> + u16 *val; >> + >> + vdpasim = container_of(work, struct vdpasim, notify_work.work); >> + >> + if (!(vdpasim->status & VIRTIO_CONFIG_S_DRIVER_OK)) >> + goto out; >> + >> + if (!vdpasim->running) >> + goto out; >> + >> + val = (u16 *)vdpasim->notify; >> + if (xchg(val, VDPASIM_NOTIFY_DEFVAL) != VDPASIM_NOTIFY_DEFVAL) >> + schedule_work(&vdpasim->work); >> + >> +out: >> + schedule_delayed_work(&vdpasim->notify_work, >> + msecs_to_jiffies(VDPASIM_VRING_POLL_PERIOD)); >> +} >> + >> struct vdpasim *vdpasim_create(struct vdpasim_dev_attr *dev_attr, >> const struct vdpa_dev_set_config *config) >> { >> @@ -287,6 +317,18 @@ struct vdpasim *vdpasim_create(struct vdpasim_dev_attr *dev_attr, >> set_dma_ops(dev, &vdpasim_dma_ops); >> vdpasim->vdpa.mdev = dev_attr->mgmt_dev; >> >> + if (notify_passthrough) { >> + INIT_DELAYED_WORK(&vdpasim->notify_work, vdpasim_notify_work); >> + >> + vdpasim->notify = __get_free_page(GFP_KERNEL | __GFP_ZERO); >> + if (!vdpasim->notify) >> + goto err_iommu; >> +#ifdef CONFIG_X86 >> + set_memory_uc(vdpasim->notify, 1); >> +#endif > > I had the same question with version 1, any reason for having this > part uncacheable? It's a hint that we have bugs somewhere. Are we > missing ACCESS/ORDER_PLATFORM or other features? > Oh, I had replied you in that thread. The vm_page_prot of notify mapping VMA is pgprot_noncached (see vhost_vdpa_fault) but the vdpasim->notify is WB, so we should set its memtype to UC here and set it back to WB when releasing the device. Otherwise, the warning in trace_pfn_remap() would be triggered. For example: x86/PAT: CPU 16/KVM:17819 map pfn RAM range req uncached-minus for [mem 0x5151f3000-0x5151f3fff], got write-back >> + *(u16 *)vdpasim->notify = VDPASIM_NOTIFY_DEFVAL; > > WRITE_ONCE()? > Okay. >> + } >> + >> vdpasim->config = kzalloc(dev_attr->config_size, GFP_KERNEL); >> if (!vdpasim->config) >> goto err_iommu; >> @@ -495,6 +537,18 @@ static u8 vdpasim_get_status(struct vdpa_device *vdpa) >> return status; >> } >> >> +static void vdpasim_set_notify_work(struct vdpasim *vdpasim, bool start) >> +{ >> + if (!notify_passthrough) >> + return; > > Only two callers for this function: one is start another is stop. If > we decide to get rid of notify_passthrough, I'd rather opencode the > schedule/cancel_delayed(). > Okay, thanks. > Thanks > >> + >> + if (start) >> + schedule_delayed_work(&vdpasim->notify_work, >> + msecs_to_jiffies(VDPASIM_VRING_POLL_PERIOD)); >> + else >> + cancel_delayed_work_sync(&vdpasim->notify_work); >> +} >> + >> static void vdpasim_set_status(struct vdpa_device *vdpa, u8 status) >> { >> struct vdpasim *vdpasim = vdpa_to_sim(vdpa); >> @@ -502,12 +556,14 @@ static void vdpasim_set_status(struct vdpa_device *vdpa, u8 status) >> spin_lock(&vdpasim->lock); >> vdpasim->status = status; >> spin_unlock(&vdpasim->lock); >> + vdpasim_set_notify_work(vdpasim, status & VIRTIO_CONFIG_S_DRIVER_OK); >> } >> >> static int vdpasim_reset(struct vdpa_device *vdpa, bool clear) >> { >> struct vdpasim *vdpasim = vdpa_to_sim(vdpa); >> >> + vdpasim_set_notify_work(vdpasim, false); >> spin_lock(&vdpasim->lock); >> vdpasim->status = 0; >> vdpasim_do_reset(vdpasim); >> @@ -672,11 +728,24 @@ static int vdpasim_dma_unmap(struct vdpa_device *vdpa, unsigned int asid, >> return 0; >> } >> >> +static struct vdpa_notification_area >> +vdpasim_get_vq_notification(struct vdpa_device *vdpa, u16 qid) >> +{ >> + struct vdpasim *vdpasim = vdpa_to_sim(vdpa); >> + struct vdpa_notification_area notify; >> + >> + notify.addr = virt_to_phys((void *)vdpasim->notify); >> + notify.size = PAGE_SIZE; >> + >> + return notify; >> +} >> + >> static void vdpasim_free(struct vdpa_device *vdpa) >> { >> struct vdpasim *vdpasim = vdpa_to_sim(vdpa); >> int i; >> >> + vdpasim_set_notify_work(vdpasim, false); >> cancel_work_sync(&vdpasim->work); >> >> for (i = 0; i < vdpasim->dev_attr.nvqs; i++) { >> @@ -693,6 +762,12 @@ static void vdpasim_free(struct vdpa_device *vdpa) >> vhost_iotlb_free(vdpasim->iommu); >> kfree(vdpasim->vqs); >> kfree(vdpasim->config); >> + if (vdpasim->notify) { >> +#ifdef CONFIG_X86 >> + set_memory_wb(vdpasim->notify, 1); >> +#endif >> + free_page(vdpasim->notify); >> + } >> } >> >> static const struct vdpa_config_ops vdpasim_config_ops = { >> @@ -704,6 +779,7 @@ static const struct vdpa_config_ops vdpasim_config_ops = { >> .get_vq_ready = vdpasim_get_vq_ready, >> .set_vq_state = vdpasim_set_vq_state, >> .get_vq_state = vdpasim_get_vq_state, >> + .get_vq_notification = vdpasim_get_vq_notification, >> .get_vq_align = vdpasim_get_vq_align, >> .get_vq_group = vdpasim_get_vq_group, >> .get_device_features = vdpasim_get_device_features, >> @@ -737,6 +813,7 @@ static const struct vdpa_config_ops vdpasim_batch_config_ops = { >> .get_vq_ready = vdpasim_get_vq_ready, >> .set_vq_state = vdpasim_set_vq_state, >> .get_vq_state = vdpasim_get_vq_state, >> + .get_vq_notification = vdpasim_get_vq_notification, >> .get_vq_align = vdpasim_get_vq_align, >> .get_vq_group = vdpasim_get_vq_group, >> .get_device_features = vdpasim_get_device_features, >> diff --git a/drivers/vdpa/vdpa_sim/vdpa_sim.h b/drivers/vdpa/vdpa_sim/vdpa_sim.h >> index 0e78737dcc16..0769ccbd3911 100644 >> --- a/drivers/vdpa/vdpa_sim/vdpa_sim.h >> +++ b/drivers/vdpa/vdpa_sim/vdpa_sim.h >> @@ -69,6 +69,9 @@ struct vdpasim { >> bool running; >> /* spinlock to synchronize iommu table */ >> spinlock_t iommu_lock; >> + /* dummy notify page */ >> + unsigned long notify; >> + struct delayed_work notify_work; >> }; >> >> struct vdpasim *vdpasim_create(struct vdpasim_dev_attr *attr, >> -- >> 2.23.0 >> > > .