Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932452AbdLTGo2 (ORCPT ); Wed, 20 Dec 2017 01:44:28 -0500 Received: from mail-ot0-f196.google.com ([74.125.82.196]:43709 "EHLO mail-ot0-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932220AbdLTGo0 (ORCPT ); Wed, 20 Dec 2017 01:44:26 -0500 X-Google-Smtp-Source: ACJfBosoDtX7F9QLYjavGRJNGAL3zdghBCw9YYcnzlISsDDqxFVlPVLWcvVTV3tCXxL10PBukU1jK31gjg2y0RFfkDs= MIME-Version: 1.0 In-Reply-To: <20171219085010.4081-1-ynorov@caviumnetworks.com> References: <20171219085010.4081-1-ynorov@caviumnetworks.com> From: Wanpeng Li Date: Wed, 20 Dec 2017 14:44:25 +0800 Message-ID: Subject: Re: [PATCH v2] IPI performance benchmark To: Yury Norov Cc: linux-kernel@vger.kernel.org, kvm , LAK , Andrew Morton , Ashish Kalra , Christoffer Dall , Geert Uytterhoeven , Konrad Rzeszutek Wilk , Linu Cherian , Shih-Wei Li , Sunil Goutham Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1733 Lines: 36 Hi Yury, 2017-12-19 16:50 GMT+08:00 Yury Norov : > This benchmark sends many IPIs in different modes and measures > time for IPI delivery (first column), and total time, ie including > time to acknowledge the receive by sender (second column). > > The scenarios are: > Dry-run: do everything except actually sending IPI. Useful > to estimate system overhead. > Self-IPI: Send IPI to self CPU. > Normal IPI: Send IPI to some other CPU. > Broadcast IPI: Send broadcast IPI to all online CPUs. > Broadcast lock: Send broadcast IPI to all online CPUs and force them > acquire/release spinlock. > > The raw output looks like this: > [ 155.363374] Dry-run: 0, 2999696 ns > [ 155.429162] Self-IPI: 30385328, 65589392 ns > [ 156.060821] Normal IPI: 566914128, 631453008 ns > [ 158.384427] Broadcast IPI: 0, 2323368720 ns > [ 160.831850] Broadcast lock: 0, 2447000544 ns > > For virtualized guests, sending and reveiving IPIs causes guest exit. > I used this test to measure performance impact on KVM subsystem of > Christoffer Dall's series "Optimize KVM/ARM for VHE systems" [1]. > > Test machine is ThunderX2, 112 online CPUs. Below the results normalized > to host dry-run time, broadcast lock results omitted. Smaller - better. Could you test on a x86 box? I see a lot of calltraces on my haswell client host, there is no calltrace in the guest, however, I can still observe "Invalid parameters" warning when insmod this module. In addition, the x86 box fails to boot when ipi_benchmark is buildin. Regards, Wanpeng Li