Subject: Re: [PATCH RFC 0/8] iommu/arm-smmu-v3: add support for ECMDQ register mode
From: "Leizhen (ThunderTown)"
To: Will Deacon
CC: Robin Murphy, Joerg Roedel, linux-arm-kernel, iommu, linux-kernel
Date: Wed, 11 Aug 2021 10:07:27 +0800
References: <20210626110130.2416-1-thunder.leizhen@huawei.com> <20210810183529.GC3296@willie-the-truck>
X-Mailing-List: linux-kernel@vger.kernel.org

On 2021/8/11 2:35, Will Deacon wrote:
> On Sat, Jun 26, 2021 at 07:01:22PM +0800, Zhen Lei wrote:
>> SMMU v3.3 added a new feature, which is Enhanced Command queue interface
>> for reducing contention when submitting Commands to the SMMU, in this
>> patch set, ECMDQ is the abbreviation of Enhanced Command Queue.
>>
>> When the hardware supports ECMDQ and each core can exclusively use one ECMDQ,
>> each core does not need to compete with other cores when using its own ECMDQ.
>> This means that each core can insert commands in parallel. If each ECMDQ can
>> execute commands in parallel, the overall performance may be better. However,
>> our hardware currently does not support multiple ECMDQ execute commands in
>> parallel.
>>
>> In order to reuse existing code, I originally still call arm_smmu_cmdq_issue_cmdlist()
>> to insert commands. Even so, however, there was a performance improvement of nearly 12%
>> in strict mode.
>>
>> The test environment is the EMU, which simulates the connection of the 200 Gbit/s NIC.
>> Number of queues:  passthrough  lazy  strict(ECMDQ)  strict(CMDQ)
>>         6              188       180       162           145      --> 11.7% improvement
>>         8              188       188       184           183      --> 0.55% improvement
>
> Sorry, I don't quite follow the numbers here. Why does the number of queues
> affect the classic "CMDQ" mode? We only have one queue there, right?

Here "queues" refers to the network concurrency, not to SMMU command queues;
I should probably have said channels or threads. 6 means that six threads are
deployed on different cores, each using its own channel to send and receive
network packets.
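
As a sanity check on the table quoted above, the improvement figures can be recomputed from the two strict-mode columns. This is only a minimal sketch in Python: the function name improvement_pct is made up for illustration, and the unit of the throughput values is not stated in this thread.

```python
def improvement_pct(ecmdq: float, cmdq: float) -> float:
    """Relative gain of strict(ECMDQ) over strict(CMDQ), in percent."""
    return (ecmdq - cmdq) / cmdq * 100.0

# Rows from the quoted table: thread count -> (strict(ECMDQ), strict(CMDQ)).
rows = {6: (162, 145), 8: (184, 183)}

for threads, (ecmdq, cmdq) in rows.items():
    print(f"{threads} threads: {improvement_pct(ecmdq, cmdq):.2f}% improvement")
```

With six threads the gain rounds to 11.7%, and with eight threads to 0.55%, which matches the numbers in the table.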
>
>> In recent days, I implemented a new function without competition with other
>> cores to replace arm_smmu_cmdq_issue_cmdlist() when a core can have an ECMDQ.
>> I'm guessing it might get better performance results. Because the EMU is too
>> slow, it will take a while before the relevant data is available.
>
> I'd certainly prefer to wait until we have something we know is
> representative.

Yes, it would be better to have an actual set of performance data. The EMU is
currently being used to analyze hardware problems, and this test has not been
scheduled yet.

> However, I can take the first four prep patches now if you
> respin the second one. At least that's then less for you to carry.

Great. Thank you. I will respin the second one.

>
> I'd also like review from the Arm side on this (and thank you for adopting
> the architecture unlike others seem to have done judging by the patches
> floating around).
>
> Will
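
To illustrate the idea discussed above (a core that owns an ECMDQ submits without competing with other cores, while the remaining cores share the classic CMDQ), here is a toy model in Python. It is only a sketch under stated assumptions: pick_queue and nr_ecmdqs are hypothetical names, the core-to-queue mapping is assumed to be one-to-one by index, and none of this is taken from the driver code.

```python
from typing import Tuple

def pick_queue(cpu: int, nr_ecmdqs: int) -> Tuple[str, int]:
    """Toy model: return (queue kind, queue index) for a given core.

    Assumption: cores 0..nr_ecmdqs-1 each own exactly one ECMDQ; any other
    core falls back to the single shared CMDQ. Hypothetical, not driver code.
    """
    if 0 <= cpu < nr_ecmdqs:
        return ("ecmdq", cpu)   # exclusive queue, no cross-core contention
    return ("cmdq", 0)          # shared queue, submissions must be serialized

for cpu in range(4):
    print(cpu, pick_queue(cpu, nr_ecmdqs=2))
```

In this model only the cores that fall back to the shared queue would still need the contention handling that arm_smmu_cmdq_issue_cmdlist() provides today.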