Received: by 10.223.164.202 with SMTP id h10csp416947wrb; Tue, 14 Nov 2017 03:47:37 -0800 (PST) X-Google-Smtp-Source: AGs4zMa9ePRR7hvp5xGCQFBTdyNxIQSlhixa19+pxYz+WWupOMEJ4RD6gG7A2BtIOUV/yWeRXCKr X-Received: by 10.84.138.193 with SMTP id 59mr12280261plp.446.1510660057251; Tue, 14 Nov 2017 03:47:37 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1510660057; cv=none; d=google.com; s=arc-20160816; b=WhSO4A30UqXi1GkKFq8SHFra+4Tb/Sh9cdkFVVZW6+Zp/dKtPgAtzcuQaLlSVBY6pN 81WrfRTr6Az5QCTcdReuHlZnhsEYyUCqoDvUQ6Xh1NVzvO4gkroBEo1gpVN/oAU135PG W7Hanbq/0jx8VBgEF1GF+Y4EFuVGy6G59SsWjkV0TmmJnHmIhP1kFp44YVEVFnR6OUjk ol+ruupFepbtlfUQIQJyEz07GhPG5MTAPAE8Uvx1NENnlq5WK3a2WK2BH2cnLDSYBrop VoGuIvokFgP779/AMGRIDlCiRuhbP8gqA03C9v+fKp1BR4HWpC5qu4qJ96c9//hJq26l DKMA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dkim-signature :arc-authentication-results; bh=1l8xo5hRblAsVbCAqJ/FzXjAujD0Gg694oF0iAHJ+w8=; b=Q5XFLpTRyxu+DO7N2+8VBo5zt6JIdJ6KNhTgDLrDXV65J7vre/yGAw17JOR57YBgZo h3TNMsXG4tsZNDoSqZmrGwhI5EGblvgFiAwoJAwRBrzApnbnsDc78JXaUpqwgrVYH9Pg kxTWvZbYyzRjXcYfvjDCch4hip3ECJeiEm939Gy4MqJqCAMyjLr14cTqt+ky4o5xFKaG R8mvLpFYxhh1JFIOpjT8f7FVBA2OY3+srCEm+0QuH42N4LAwQ567oaPP5ty6Bbh9j41T qT075v5ip1EeiP6zCEr4C8yeoirsZVJYznQA1CoYivWQQ+vtZ8bQND6Tqd8ESJ6sk1YX /g1g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=kRBPwlMa; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id i1si16373802plt.630.2017.11.14.03.47.24; Tue, 14 Nov 2017 03:47:37 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=kRBPwlMa; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754400AbdKNLnV (ORCPT + 88 others); Tue, 14 Nov 2017 06:43:21 -0500 Received: from mail-ot0-f196.google.com ([74.125.82.196]:53707 "EHLO mail-ot0-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752284AbdKNLnQ (ORCPT ); Tue, 14 Nov 2017 06:43:16 -0500 Received: by mail-ot0-f196.google.com with SMTP id 105so7238044oth.10; Tue, 14 Nov 2017 03:43:16 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding:content-language; bh=1l8xo5hRblAsVbCAqJ/FzXjAujD0Gg694oF0iAHJ+w8=; b=kRBPwlMaKF2yOg9iEdIb8sJ/LKH+hzdAj02TQHDglDVS60eoshfN7r51hqFJsxlDny 0LsnNVJY3wUTxR5rE0FJKa7jxxgrbMilo47eQPCYh7g0HHQWqN5UCN6h7+U5HkKAtl61 /XoBIzE2Rt3hF0Y1LA4NooBrW7H4wc6bECdKcK34B0D0gjQfK1toWXu11qmrN7TWU+bw rT6b5nrvJdUQviiBs0EEa6SQf6FfPfRcwBJlv4q8OA/yvyWTHmNx+AeixS1Rjrg+p5mX WD+FskQt1whpuCP9OGIjcrH/EmtCedX6O/D0cQ9xUHLgQd5+384iA5ivX6EF2DcD4r0+ LvPg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding :content-language; bh=1l8xo5hRblAsVbCAqJ/FzXjAujD0Gg694oF0iAHJ+w8=; b=UZPKrOBlIlm7QOMv1phO970CHxfwPaWZ7jpK75gu0OZNYxDbBGjoPwEwq4sPg0H8Ya Uc99d8dFAPdgPHUIjD/FsObOIitf9XWEqTTi5wzDyzif+B0waLKBfVTlcaeVROMR73F8 XBlYtZJrfqbqfKTwIqHKVsb2gZ3Frj8uN9NSvosJkMC5ky2U69+94ouE+6D24xjgnvFh 6n47Ve5ib8nnksdzOqRsTrX2bAnJQdQcm53/pOW3UikIeIVWe/tql91h9vHftsHwCsWb JpB6Iw6I1ueNVU2JH1N+uZlZZbTkqbj23oPM1RDsEJXsew0vMK9RfYUQgjYIAk98D7C3 jDgQ== X-Gm-Message-State: AJaThX66JtS6pQ6sikCn9WvnTKu3Si525OdKvVukVd4hqdH9qPixTPYZ 4m8JerZDTf2H00V0Fj8Vr4A= X-Received: by 10.157.14.84 with SMTP id n20mr6956090otd.329.1510659796229; Tue, 14 Nov 2017 03:43:16 -0800 (PST) Received: from [0.0.0.0] ([47.89.242.186]) by smtp.gmail.com with ESMTPSA id q12sm8537686oti.44.2017.11.14.03.43.10 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 14 Nov 2017 03:43:15 -0800 (PST) Subject: Re: [PATCH RFC v3 1/6] x86/paravirt: Add pv_idle_ops to paravirt ops To: Juergen Gross , Quan Xu , kvm@vger.kernel.org, linux-doc@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, virtualization@lists.linux-foundation.org, x86@kernel.org, xen-devel@lists.xenproject.org Cc: Yang Zhang , Alok Kataria , Rusty Russell , Thomas Gleixner , Ingo Molnar , "H. Peter Anvin" References: <1510567565-5118-1-git-send-email-quan.xu0@gmail.com> <1510567565-5118-2-git-send-email-quan.xu0@gmail.com> <07fac696-e3d4-8f35-8f3d-764d7ab41204@suse.com> <902da704-1e4f-583b-91c3-1a62ccd6e73d@gmail.com> <79dad15c-2d26-bcf3-7283-293e42a161ea@gmail.com> <9a4f53f3-225f-0e99-d9b2-d89656b8fb31@suse.com> From: Quan Xu Message-ID: <41403bbb-bfa9-0618-abf7-dd871a7b783a@gmail.com> Date: Tue, 14 Nov 2017 19:43:07 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.4.0 MIME-Version: 1.0 In-Reply-To: <9a4f53f3-225f-0e99-d9b2-d89656b8fb31@suse.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2017/11/14 18:27, Juergen Gross wrote: > On 14/11/17 10:38, Quan Xu wrote: >> >> On 2017/11/14 15:30, Juergen Gross wrote: >>> On 14/11/17 08:02, Quan Xu wrote: >>>> On 2017/11/13 18:53, Juergen Gross wrote: >>>>> On 13/11/17 11:06, Quan Xu wrote: >>>>>> From: Quan Xu >>>>>> >>>>>> So far, pv_idle_ops.poll is the only ops for pv_idle. .poll is called >>>>>> in idle path which will poll for a while before we enter the real idle >>>>>> state. >>>>>> >>>>>> In virtualization, idle path includes several heavy operations >>>>>> includes timer access(LAPIC timer or TSC deadline timer) which will >>>>>> hurt performance especially for latency intensive workload like >>>>>> message >>>>>> passing task. The cost is mainly from the vmexit which is a hardware >>>>>> context switch between virtual machine and hypervisor. Our solution is >>>>>> to poll for a while and do not enter real idle path if we can get the >>>>>> schedule event during polling. >>>>>> >>>>>> Poll may cause the CPU waste so we adopt a smart polling mechanism to >>>>>> reduce the useless poll. >>>>>> >>>>>> Signed-off-by: Yang Zhang >>>>>> Signed-off-by: Quan Xu >>>>>> Cc: Juergen Gross >>>>>> Cc: Alok Kataria >>>>>> Cc: Rusty Russell >>>>>> Cc: Thomas Gleixner >>>>>> Cc: Ingo Molnar >>>>>> Cc: "H. Peter Anvin" >>>>>> Cc: x86@kernel.org >>>>>> Cc: virtualization@lists.linux-foundation.org >>>>>> Cc: linux-kernel@vger.kernel.org >>>>>> Cc: xen-devel@lists.xenproject.org >>>>> Hmm, is the idle entry path really so critical to performance that a >>>>> new >>>>> pvops function is necessary? >>>> Juergen, Here is the data we get when running benchmark netperf: >>>>   1. w/o patch and disable kvm dynamic poll (halt_poll_ns=0): >>>>      29031.6 bit/s -- 76.1 %CPU >>>> >>>>   2. w/ patch and disable kvm dynamic poll (halt_poll_ns=0): >>>>      35787.7 bit/s -- 129.4 %CPU >>>> >>>>   3. w/ kvm dynamic poll: >>>>      35735.6 bit/s -- 200.0 %CPU >>>> >>>>   4. w/patch and w/ kvm dynamic poll: >>>>      42225.3 bit/s -- 198.7 %CPU >>>> >>>>   5. idle=poll >>>>      37081.7 bit/s -- 998.1 %CPU >>>> >>>> >>>> >>>>   w/ this patch, we will improve performance by 23%.. even we could >>>> improve >>>>   performance by 45.4%, if we use w/patch and w/ kvm dynamic poll. >>>> also the >>>>   cost of CPU is much lower than 'idle=poll' case.. >>> I don't question the general idea. I just think pvops isn't the best way >>> to implement it. >>> >>>>> Wouldn't a function pointer, maybe guarded >>>>> by a static key, be enough? A further advantage would be that this >>>>> would >>>>> work on other architectures, too. >>>> I assume this feature will be ported to other archs.. a new pvops makes >>       sorry, a typo.. /other archs/other hypervisors/ >>       it refers hypervisor like Xen, HyperV and VMware).. >> >>>> code >>>> clean and easy to maintain. also I tried to add it into existed pvops, >>>> but it >>>> doesn't match. >>> You are aware that pvops is x86 only? >> yes, I'm aware.. >> >>> I really don't see the big difference in maintainability compared to the >>> static key / function pointer variant: >>> >>> void (*guest_idle_poll_func)(void); >>> struct static_key guest_idle_poll_key __read_mostly; >>> >>> static inline void guest_idle_poll(void) >>> { >>>     if (static_key_false(&guest_idle_poll_key)) >>>         guest_idle_poll_func(); >>> } >> >> >> thank you for your sample code :) >> I agree there is no big difference.. I think we are discussion for two >> things: >>  1) x86 VM on different hypervisors >>  2) different archs VM on kvm hypervisor >> >> What I want to do is x86 VM on different hypervisors, such as kvm / xen >> / hyperv .. > Why limit the solution to x86 if the more general solution isn't > harder? > > As you didn't give any reason why the pvops approach is better other > than you don't care for non-x86 platforms you won't get an "Ack" from > me for this patch. It just looks a little odder to me. I understand you care about no-x86 arch. Are you aware 'pv_time_ops' for arm64/arm/x86 archs, defined in    - arch/arm64/include/asm/paravirt.h    - arch/x86/include/asm/paravirt_types.h    - arch/arm/include/asm/paravirt.h I am unfamilar with arm code. IIUC, if you'd implement pv_idle_ops for arm/arm64 arch, you'd define a same structure in    - arch/arm64/include/asm/paravirt.h     or    - arch/arm/include/asm/paravirt.h .. instead of static key / fuction. then implement a real function in    - arch/arm/kernel/paravirt.c. Also I wonder HOW/WHERE to define a static key/function, then to benifit x86/no-x86 archs? Quan Alibaba Cloud >>> And KVM would just need to set guest_idle_poll_func and enable the >>> static key. Works on non-x86 architectures, too. >>> >> .. referred to 'pv_mmu_ops', HyperV and Xen can implement their own >> functions for 'pv_mmu_ops'. >> I think it is the same to pv_idle_ops. >> >> with above explaination, do you still think I need to define the static >> key/function pointer variant? >> >> btw, any interest to port it to Xen HVM guest? :) > Maybe. But this should work for Xen on ARM, too. > > > Juergen > From 1584040759952768140@xxx Tue Nov 14 11:29:48 +0000 2017 X-GM-THRID: 1583947963447418004 X-Gmail-Labels: Inbox,Category Forums,HistoricalUnread