Received: by 10.223.164.202 with SMTP id h10csp307604wrb; Tue, 14 Nov 2017 01:41:28 -0800 (PST) X-Google-Smtp-Source: AGs4zMYe/KkWguCaF19735hMWk7J18dyQy6RjhI6ySusVjufoK/DrI7X7kKraq7xJnmT55+7ggyg X-Received: by 10.84.174.4 with SMTP id q4mr12085607plb.233.1510652488351; Tue, 14 Nov 2017 01:41:28 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1510652488; cv=none; d=google.com; s=arc-20160816; b=phIdqBtb5KqZmC//0hrb8wMkUnsyHGP1+vPOeccNtIOJ8TQcYXKjNCdT8RU0T1QBTl N51zlaU2c4xLBQ32nUEAWxFiq/K9z+6Q1+5hF+/+4I8XU/GuAeHVDhS7lHvC5Q2Pnca9 wLCdCY7149Lbt9LiC3IItbqFTzGmzKmsSBaX3TucyF0UTdfxueM9AFQPJQjVsHTMIkcf YBee7EW1KvsGW5P4QoZFdgYDcBQD9nnsk0vKA/zfAOQ78qX23dO3K7w6RFA5PaxoBZHl mro43yk64Q7dJfC+/Qml501L57rbtxnKtjbJH/CESEARTQtqtxIqBqBPhdIMFvsnBr8q +TjA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dkim-signature :arc-authentication-results; bh=MFr2ru+LzvK9rGFyRCbgObpgIFqOSzRG+trTfZhT8w4=; b=yz2w1oGzB01G12wkgS+aYW6zBHzAqvlQPpmWrM9hTYPP6pbQ0rXxzi1M0z9UW1W2v5 FvNoT0NaGBLKqFsnu7E44Qc8oCIRcBtxASgJVOKQ5fO3Lolcfjcm0j5MktDdDu7zffCB AaGM7XB0zSakExldpZO4C9YVZ1PwTCJuUhe6VXYJ8C3bfVjnuWiL6C6VYnnNWzYOEsN4 xeMx/0u2XXUgHkt/1Ylmb9LShVajA9mJqCcgSNqfythEvI38Ybh3Px0NDmeSeoqQ97Vx SzAk7R97o/9TepBduxz+p4uQJ6hNGf2yXTrwxhRO7+zU3wj8Lqz6/cHVKbq2cjD8AcAr VCiA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=RlIv64+j; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id n3si15909114pld.602.2017.11.14.01.41.16; Tue, 14 Nov 2017 01:41:28 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=RlIv64+j; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754030AbdKNJjR (ORCPT + 87 others); Tue, 14 Nov 2017 04:39:17 -0500 Received: from mail-ot0-f196.google.com ([74.125.82.196]:53664 "EHLO mail-ot0-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754243AbdKNJjF (ORCPT ); Tue, 14 Nov 2017 04:39:05 -0500 Received: by mail-ot0-f196.google.com with SMTP id 105so6987913oth.10; Tue, 14 Nov 2017 01:39:05 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding:content-language; bh=MFr2ru+LzvK9rGFyRCbgObpgIFqOSzRG+trTfZhT8w4=; b=RlIv64+jKvqg5Z+K2bUXXQ3I1lvYSAMJW7WLLQPtW3LcD0S26lQh8PEmaMATsD6di/ FlzdQRhO6noIZohkTKyvFAb3uJUbElc+yci0T/DfMhwoJCiYndyjZVyl0fI1ZN/j2Koy SUCX71gLrMit8tUGoLSinbb6jwlOOAlqFvI43t3ekCeB6bjMTCY7p1D2vxeUO/hyOJUh zDF1hbj53WLUprt03HETVQNYe5KYc5HSnu6+lrCzXFlBG8Z+DTQKDGEFux7jKmVBD5KD bHgBi9Z2KgKjwW9XPMZh8khgzDWkXDhDy5lQx3CS4XffzE0sKAun8fxJe5x2jLRvjTXF Kcqg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding :content-language; bh=MFr2ru+LzvK9rGFyRCbgObpgIFqOSzRG+trTfZhT8w4=; b=rvaoIKaRsjCf1Wc3K4dkJgV0SN17Fhh9XHnn/S5koyMX9taU48ZLNACf7bO/tV5Ofr g0kcoPFJvmyA9yO7NarW9gzRvVduRMuvDdH31c58R1jeaimIeSzayWZX9lS65H5Rl9dE SLXELv67D2DP2HqP/+0y2TCHWp3u7zaxRb18o+zzqcYyCgcvleQ4RoMZByKf4Al5g0G7 qzFEXn7IGTW92ChoEhShtjFCsq4PgV+ENQ6OXW9NmXdMJ4hLEA3oNR/4E7Tw++/kl6Yf jNXusQkyNfSyqQI4IBwGLRw5MVuAcnCzUmPq6+xjqIf1hxlGMrjLvmxe4UaNyVKw00Y4 CW8w== X-Gm-Message-State: AJaThX5d30zoIJkK4TRyhgZtAyqbo00cmEAffVurdP/4IOpEjz+bezNS z2KYph+S+jvG7p+F6K4ocjg= X-Received: by 10.157.9.234 with SMTP id 39mr7620549otz.323.1510652345112; Tue, 14 Nov 2017 01:39:05 -0800 (PST) Received: from [0.0.0.0] ([47.89.242.186]) by smtp.gmail.com with ESMTPSA id 62sm8532483otg.41.2017.11.14.01.39.00 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 14 Nov 2017 01:39:04 -0800 (PST) Subject: Re: [PATCH RFC v3 1/6] x86/paravirt: Add pv_idle_ops to paravirt ops To: Juergen Gross , Quan Xu , kvm@vger.kernel.org, linux-doc@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, virtualization@lists.linux-foundation.org, x86@kernel.org, xen-devel@lists.xenproject.org Cc: Yang Zhang , Alok Kataria , Rusty Russell , Thomas Gleixner , Ingo Molnar , "H. Peter Anvin" References: <1510567565-5118-1-git-send-email-quan.xu0@gmail.com> <1510567565-5118-2-git-send-email-quan.xu0@gmail.com> <07fac696-e3d4-8f35-8f3d-764d7ab41204@suse.com> <902da704-1e4f-583b-91c3-1a62ccd6e73d@gmail.com> From: Quan Xu Message-ID: <79dad15c-2d26-bcf3-7283-293e42a161ea@gmail.com> Date: Tue, 14 Nov 2017 17:38:57 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.4.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2017/11/14 15:30, Juergen Gross wrote: > On 14/11/17 08:02, Quan Xu wrote: >> >> On 2017/11/13 18:53, Juergen Gross wrote: >>> On 13/11/17 11:06, Quan Xu wrote: >>>> From: Quan Xu >>>> >>>> So far, pv_idle_ops.poll is the only ops for pv_idle. .poll is called >>>> in idle path which will poll for a while before we enter the real idle >>>> state. >>>> >>>> In virtualization, idle path includes several heavy operations >>>> includes timer access(LAPIC timer or TSC deadline timer) which will >>>> hurt performance especially for latency intensive workload like message >>>> passing task. The cost is mainly from the vmexit which is a hardware >>>> context switch between virtual machine and hypervisor. Our solution is >>>> to poll for a while and do not enter real idle path if we can get the >>>> schedule event during polling. >>>> >>>> Poll may cause the CPU waste so we adopt a smart polling mechanism to >>>> reduce the useless poll. >>>> >>>> Signed-off-by: Yang Zhang >>>> Signed-off-by: Quan Xu >>>> Cc: Juergen Gross >>>> Cc: Alok Kataria >>>> Cc: Rusty Russell >>>> Cc: Thomas Gleixner >>>> Cc: Ingo Molnar >>>> Cc: "H. Peter Anvin" >>>> Cc: x86@kernel.org >>>> Cc: virtualization@lists.linux-foundation.org >>>> Cc: linux-kernel@vger.kernel.org >>>> Cc: xen-devel@lists.xenproject.org >>> Hmm, is the idle entry path really so critical to performance that a new >>> pvops function is necessary? >> Juergen, Here is the data we get when running benchmark netperf: >>  1. w/o patch and disable kvm dynamic poll (halt_poll_ns=0): >>     29031.6 bit/s -- 76.1 %CPU >> >>  2. w/ patch and disable kvm dynamic poll (halt_poll_ns=0): >>     35787.7 bit/s -- 129.4 %CPU >> >>  3. w/ kvm dynamic poll: >>     35735.6 bit/s -- 200.0 %CPU >> >>  4. w/patch and w/ kvm dynamic poll: >>     42225.3 bit/s -- 198.7 %CPU >> >>  5. idle=poll >>     37081.7 bit/s -- 998.1 %CPU >> >> >> >>  w/ this patch, we will improve performance by 23%.. even we could improve >>  performance by 45.4%, if we use w/patch and w/ kvm dynamic poll. also the >>  cost of CPU is much lower than 'idle=poll' case.. > I don't question the general idea. I just think pvops isn't the best way > to implement it. > >>> Wouldn't a function pointer, maybe guarded >>> by a static key, be enough? A further advantage would be that this would >>> work on other architectures, too. >> I assume this feature will be ported to other archs.. a new pvops makes       sorry, a typo.. /other archs/other hypervisors/       it refers hypervisor like Xen, HyperV and VMware).. >> code >> clean and easy to maintain. also I tried to add it into existed pvops, >> but it >> doesn't match. > You are aware that pvops is x86 only? yes, I'm aware.. > I really don't see the big difference in maintainability compared to the > static key / function pointer variant: > > void (*guest_idle_poll_func)(void); > struct static_key guest_idle_poll_key __read_mostly; > > static inline void guest_idle_poll(void) > { > if (static_key_false(&guest_idle_poll_key)) > guest_idle_poll_func(); > } thank you for your sample code :) I agree there is no big difference.. I think we are discussion for two things:  1) x86 VM on different hypervisors  2) different archs VM on kvm hypervisor What I want to do is x86 VM on different hypervisors, such as kvm / xen / hyperv .. > And KVM would just need to set guest_idle_poll_func and enable the > static key. Works on non-x86 architectures, too. > .. referred to 'pv_mmu_ops', HyperV and Xen can implement their own functions for 'pv_mmu_ops'. I think it is the same to pv_idle_ops. with above explaination, do you still think I need to define the static key/function pointer variant? btw, any interest to port it to Xen HVM guest? :) Quan Alibaba Cloud From 1584030927342122693@xxx Tue Nov 14 08:53:31 +0000 2017 X-GM-THRID: 1583947963447418004 X-Gmail-Labels: Inbox,Category Forums,HistoricalUnread