Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751794AbdHAMXB (ORCPT ); Tue, 1 Aug 2017 08:23:01 -0400 Received: from mx1.redhat.com ([209.132.183.28]:33678 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751199AbdHAMW7 (ORCPT ); Tue, 1 Aug 2017 08:22:59 -0400 DMARC-Filter: OpenDMARC Filter v1.3.2 mx1.redhat.com 184F410792C Authentication-Results: ext-mx04.extmail.prod.ext.phx2.redhat.com; dmarc=none (p=none dis=none) header.from=redhat.com Authentication-Results: ext-mx04.extmail.prod.ext.phx2.redhat.com; spf=fail smtp.mailfrom=pbonzini@redhat.com Subject: Re: [PATCH v4] KVM: async_pf: make rcu irq exit if not triggered from idle task To: Wanpeng Li , linux-kernel@vger.kernel.org, kvm@vger.kernel.org Cc: =?UTF-8?B?UmFkaW0gS3LEjW3DocWZ?= , Wanpeng Li , "Paul E . McKenney" , stable@vger.kernel.org References: <1501590003-3707-1-git-send-email-wanpeng.li@hotmail.com> From: Paolo Bonzini Message-ID: <03f04a0c-8933-a2b4-fc0a-a3a881b730ed@redhat.com> Date: Tue, 1 Aug 2017 14:22:55 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.2.1 MIME-Version: 1.0 In-Reply-To: <1501590003-3707-1-git-send-email-wanpeng.li@hotmail.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.28]); Tue, 01 Aug 2017 12:22:59 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2731 Lines: 86 On 01/08/2017 14:20, Wanpeng Li wrote: > From: Wanpeng Li > > WARNING: CPU: 5 PID: 1242 at kernel/rcu/tree_plugin.h:323 rcu_note_context_switch+0x207/0x6b0 > CPU: 5 PID: 1242 Comm: unity-settings- Not tainted 4.13.0-rc2+ #1 > RIP: 0010:rcu_note_context_switch+0x207/0x6b0 > Call Trace: > __schedule+0xda/0xba0 > ? kvm_async_pf_task_wait+0x1b2/0x270 > schedule+0x40/0x90 > kvm_async_pf_task_wait+0x1cc/0x270 > ? prepare_to_swait+0x22/0x70 > do_async_page_fault+0x77/0xb0 > ? do_async_page_fault+0x77/0xb0 > async_page_fault+0x28/0x30 > RIP: 0010:__d_lookup_rcu+0x90/0x1e0 > > I encounter this when trying to stress the async page fault in L1 guest w/ > L2 guests running. > > Commit 9b132fbe5419 (Add rcu user eqs exception hooks for async page fault) > adds rcu_irq_enter/exit() to kvm_async_pf_task_wait() to exit cpu idle eqs > when needed, to protect the code that needs use rcu. There is no need to call > this pairs if async page fault is not triggered from idle task. This is not accurate anymore: Commit 9b132fbe5419 (Add rcu user eqs exception hooks for async page fault) adds rcu_irq_enter/exit() to kvm_async_pf_task_wait() to exit cpu idle eqs when needed, to protect the code that needs use rcu. However, we need to call the pair even if the function calls schedule(), as seen from the above backtrace. Can be fixed when committing. > This patch fixes it by informing the RCU subsystem exit/enter the irq > towards/away from idle for both n.halted and !n.halted. > > Cc: Paolo Bonzini > Cc: Radim Krčmář > Cc: Paul E. McKenney > Cc: stable@vger.kernel.org > Signed-off-by: Wanpeng Li > --- > v3 -> v4: > * pull rcu_irq_exit/enter outside the "if" > v2 -> v3: > * add the warning log > v1 -> v2: > * update patch description > > arch/x86/kernel/kvm.c | 6 ++++-- > 1 file changed, 4 insertions(+), 2 deletions(-) > > diff --git a/arch/x86/kernel/kvm.c b/arch/x86/kernel/kvm.c > index 71c17a5..d04e30e 100644 > --- a/arch/x86/kernel/kvm.c > +++ b/arch/x86/kernel/kvm.c > @@ -151,6 +151,8 @@ void kvm_async_pf_task_wait(u32 token) > if (hlist_unhashed(&n.link)) > break; > > + rcu_irq_exit(); > + > if (!n.halted) { > local_irq_enable(); > schedule(); > @@ -159,11 +161,11 @@ void kvm_async_pf_task_wait(u32 token) > /* > * We cannot reschedule. So halt. > */ > - rcu_irq_exit(); > native_safe_halt(); > local_irq_disable(); > - rcu_irq_enter(); > } > + > + rcu_irq_enter(); > } > if (!n.halted) > finish_swait(&n.wq, &wait); > Apart from that, Reviewed-by: Paolo Bonzini