Date: Tue, 9 Jan 2018 14:26:02 +0100
From: Peter Zijlstra
To: Vince Weaver
Cc: Ingo Molnar, linux-kernel@vger.kernel.org, Ingo Molnar, Arnaldo Carvalho de Melo, Thomas Gleixner
Subject: Re: perf: perf_fuzzer quickly locks up on 4.15-rc7
Message-ID: <20180109132602.GA2369@hirez.programming.kicks-ass.net>
References: <20180108173005.lkglqrixb2ota6g2@gmail.com> <20180109102507.GG6176@hirez.programming.kicks-ass.net>
In-Reply-To: <20180109102507.GG6176@hirez.programming.kicks-ass.net>

On Tue, Jan 09, 2018 at 11:25:07AM +0100, Peter Zijlstra wrote:
> On Mon, Jan 08, 2018 at 03:29:42PM -0500, Vince Weaver wrote:
> > On Mon, 8 Jan 2018, Ingo Molnar wrote:
> >
> > > Note that the page table isolation (PTI) feature has a number of effects on perf
> > > and on NMI handlers, so one of the things to try would be to disable PTI.
> >
> > Yes, it seems to be a KPTI issue.
> >
> > With KPTI disabled I can fuzz for a few hours, no problems.
> >
> > With KPTI enabled the perf_fuzzer on this haswell machine will lock up
> > within minutes.
>
> I'll go take a look...

OK, so I'm running on an IVB-EP with PTI enabled. I insta triggered a
lockdep splat, after fixing that I get NMI watchdog triggers that look
like false positives; and I suspect:

  [ 290.791675] perf: Dynamic interrupt throttling disabled, can hang your system!

is responsible for that.

That is, subsequent stacktraces for the same CPU are unrelated, which
makes it look like the CPU actually makes progress.

So remind me again, how are you running that fuzzer? I'm running
./fast_repro99.sh as root.