Date: Thu, 10 Jan 2019 09:11:27 +0900
From: Masami Hiramatsu
To: Steven Rostedt
Cc: Ingo Molnar, peterz@infradead.org, Mathieu Desnoyers, linux-kernel, Andrea Righi, stable@vger.kernel.org
Subject: Re: [PATCH v2 3/3] x86/kprobes: Fix to avoid kretprobe recursion
Message-Id: <20190110091127.17e902c318578b192051eeb0@kernel.org>
In-Reply-To:
 <20190109110816.72a7ae85@gandalf.local.home>
References: <154692263564.1133.17363562046971295490.stgit@devbox>
 <154692272225.1133.799265848498671353.stgit@devbox>
 <20190109110816.72a7ae85@gandalf.local.home>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, 9 Jan 2019 11:08:16 -0500
Steven Rostedt wrote:

> On Tue, 8 Jan 2019 13:45:22 +0900
> Masami Hiramatsu wrote:
>
> > Fix to avoid kretprobe recursion loop by setting a dummy
> > kprobes to current_kprobe per-cpu variable.
> >
> > This bug has been introduced with the asm-coded trampoline
> > code, since previously it used another kprobe for hooking
> > the function return placeholder (which only has a nop) and
> > trampoline handler was called from that kprobe.
> >
> > This revives the old lost kprobe again.
> >
> > With this fix, we don't see deadlock anymore.
> >
> >   # echo "r:event_1 __fdget" >> kprobe_events
> >   # echo "r:event_2 _raw_spin_lock_irqsave" >> kprobe_events
> >   # echo 1 > events/kprobes/enable
> >
> > And you can see that all inner-called kretprobe are skipped.
> >
> >   # cat kprobe_profile
> >   event_1                                  235               0
> >   event_2                                19375           19612
> >
> > The 1st column is recorded count and the 2nd is missed count.
> > Above shows (event_1 rec) + (event_2 rec) ~= (event_2 missed)
>
> I don't quite understand the above. Is the miss count because we missed
> event_2 events for both event_1 and event_2?
>
>  trace raw_spin_lock()
>     handler calls raw_spin_lock()
>        trace raw_spin_lock() [ skip ]

Yes, both events (kretprobes) eventually hit event_2 in
trampoline_handler()'s spin_lock_irqsave().

>
> I'm also guessing that the 2 extra (19612 - (235 + 19375) = 2) are
> possibly due to the displaying being racy?

Yes, it can be racy.
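To make the accounting concrete, here is a rough user-space model of the
guard. It is only a sketch, not the kernel code: all model_* names are
made up for illustration, a single global stands in for the per-cpu
current_kprobe, and the probed functions have empty bodies. Each return
handler run takes the (probed) lock once while the dummy probe is
installed, so each recorded return adds one miss to the lock probe.

```c
#include <stddef.h>

/* Hypothetical stand-in for a kretprobe and its kprobe_profile counters. */
struct model_probe {
	unsigned long hits;	/* returns whose handler ran ("recorded") */
	unsigned long missed;	/* hits skipped because a probe was already active */
};

/* Stands in for the per-cpu current_kprobe; this model has one CPU only. */
static struct model_probe *model_current;

/* Stands in for kretprobe_kprobe, the dummy probe added by the patch. */
static struct model_probe model_dummy;

/* Stands in for event_2, the probe on the lock function the handler calls. */
static struct model_probe model_lock_probe;

static void model_probed_call(struct model_probe *probe);

/* The lock taken inside the handler; it is itself a probed function. */
static void model_lock(void)
{
	model_probed_call(&model_lock_probe);
}

/* Return path of a probed function, with the dummy probe as recursion guard. */
static void model_trampoline_handler(struct model_probe *probe)
{
	model_current = &model_dummy;	/* from here on, nested probes are skipped */
	model_lock();			/* nested hit on the lock probe: counted as missed */

	model_current = probe;
	probe->hits++;			/* the user's return handler would run here */
	model_current = &model_dummy;

	/* the matching unlock would go here */
	model_current = NULL;
}

/* Call a probed function with an empty body, then take its return path. */
static void model_probed_call(struct model_probe *probe)
{
	if (model_current != NULL) {	/* some probe is already active on this CPU */
		probe->missed++;
		return;
	}
	model_trampoline_handler(probe);
}
```

Calling model_probed_call() 235 times for an event_1 stand-in and 19375
times for model_lock_probe leaves model_lock_probe.missed at
235 + 19375 = 19610 in this model. The model has no concurrency, so it
cannot reproduce the 2 extra misses in the real profile.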
Thank you,

>
> >
> > Signed-off-by: Masami Hiramatsu
> > Reported-by: Andrea Righi
> > Fixes: c9becf58d935 ("[PATCH] kretprobe: kretprobe-booster")
> > Cc: stable@vger.kernel.org
> > ---
> >  arch/x86/kernel/kprobes/core.c |   22 ++++++++++++++++++++--
> >  1 file changed, 20 insertions(+), 2 deletions(-)
> >
> > diff --git a/arch/x86/kernel/kprobes/core.c b/arch/x86/kernel/kprobes/core.c
> > index 69b6400d1ce2..f4b954ff5b89 100644
> > --- a/arch/x86/kernel/kprobes/core.c
> > +++ b/arch/x86/kernel/kprobes/core.c
> > @@ -749,11 +749,16 @@ asm(
> >  NOKPROBE_SYMBOL(kretprobe_trampoline);
> >  STACK_FRAME_NON_STANDARD(kretprobe_trampoline);
> >
> > +static struct kprobe kretprobe_kprobe = {
> > +	.addr = (void *)kretprobe_trampoline,
> > +};
> > +
> >  /*
> >   * Called from kretprobe_trampoline
> >   */
> >  static __used void *trampoline_handler(struct pt_regs *regs)
> >  {
> > +	struct kprobe_ctlblk *kcb;
> >  	struct kretprobe_instance *ri = NULL;
> >  	struct hlist_head *head, empty_rp;
> >  	struct hlist_node *tmp;
> > @@ -763,6 +768,17 @@ static __used void *trampoline_handler(struct pt_regs *regs)
> >  	void *frame_pointer;
> >  	bool skipped = false;
> >
> > +	preempt_disable();
> > +
> > +	/*
> > +	 * Set a dummy kprobe for avoiding kretprobe recursion.
> > +	 * Since kretprobe never run in kprobe handler, kprobe must not
> > +	 * be running at this point.
> > +	 */
> > +	kcb = get_kprobe_ctlblk();
> > +	__this_cpu_write(current_kprobe, &kretprobe_kprobe);
>
> If an interrupt comes in here, is this still safe, if the interrupt
> handler has a kretprobe too?
>
> -- Steve
>
> > +	kcb->kprobe_status = KPROBE_HIT_ACTIVE;
> > +
> >  	INIT_HLIST_HEAD(&empty_rp);
> >  	kretprobe_hash_lock(current, &head, &flags);
> >  	/* fixup registers */
> > @@ -838,10 +854,9 @@ static __used void *trampoline_handler(struct pt_regs *regs)
> >  		orig_ret_address = (unsigned long)ri->ret_addr;
> >  		if (ri->rp && ri->rp->handler) {
> >  			__this_cpu_write(current_kprobe, &ri->rp->kp);
> > -			get_kprobe_ctlblk()->kprobe_status = KPROBE_HIT_ACTIVE;
> >  			ri->ret_addr = correct_ret_addr;
> >  			ri->rp->handler(ri, regs);
> > -			__this_cpu_write(current_kprobe, NULL);
> > +			__this_cpu_write(current_kprobe, &kretprobe_kprobe);
> >  		}
> >
> >  		recycle_rp_inst(ri, &empty_rp);
> > @@ -857,6 +872,9 @@ static __used void *trampoline_handler(struct pt_regs *regs)
> >
> >  	kretprobe_hash_unlock(current, &flags);
> >
> > +	__this_cpu_write(current_kprobe, NULL);
> > +	preempt_enable();
> > +
> >  	hlist_for_each_entry_safe(ri, tmp, &empty_rp, hlist) {
> >  		hlist_del(&ri->hlist);
> >  		kfree(ri);
>

-- 
Masami Hiramatsu