Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp683447pxb; Thu, 25 Feb 2021 12:18:45 -0800 (PST) X-Google-Smtp-Source: ABdhPJzsTI9GmTzWFFNxpHN8VkI+0hxOFhVPK705k9f1QrEkoPqZzpaWmoS9tHsgdhro4mCqew9t X-Received: by 2002:a05:6402:84e:: with SMTP id b14mr4927154edz.186.1614284324896; Thu, 25 Feb 2021 12:18:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1614284324; cv=none; d=google.com; s=arc-20160816; b=JPqplrRx8D1IfrXKHf5XKaHcztthcHhv6//ZKO29iZo1xyMWFK2j62hKopAEQ1LyJI jXlr6g22cL4xpMsi8GIWgKrbZ1BoJTldp9CzO4Hk8Kq/ktSScj+42ngpffMnrhKOnj8W /22cWzeNWRAeu/stNEUqDco6JxpHncrU/f5XkSzCNm6rOqZIVh9f+ORXSCjxZR08hD7I 2u4vsMluuVGoQXFTVi1fIBala0k3FSG/iVvDdfdNx1BSiz0HAq0Op2NNn8y++Uf7xt6n UqbDNkobVvIK/gqe+0oWU7EFusjUd1HdahBhNPfCp7ntTMYYWhwLxJAjwgEO56l5BSH6 hzyw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:ironport-sdr:ironport-sdr; bh=LvNmfwf9l/AmqygDwwO8G5llPNU6EszYap2dM4CJjgo=; b=zAr9B6jtJByXWMMMeAAXUE4D6PaIqzK/Wu7FEotWkR+1ZaBsWrp6y857NMuhhNWU3h VuB9IiUDBDiPXYl7eN0/2gj/E/JtoOW6haenUJ56C7Iq6MBacijtC6umt5LSeTXdM/uH bn545rt1FiL8KFLRcns0ZtPOaunkXFMU8fj3HAorkuXRpyNBiu+3LnWzEnK4Smjg9nM1 TOZEoyKuyI6sJEm3KmtMnYtHqjpFouTnmdSe/FyUBNLjP5z1wFh7FO13Y0QEG+9tjRaJ VZ0yzmXCwRk5ZWjcj2FkQ/vdVcBkTPAQ5FkQkI0Ve/BDFHK9YgusgLHrdwm6bPgmkHsT 6OtA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id n12si4112194eja.612.2021.02.25.12.18.21; Thu, 25 Feb 2021 12:18:44 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233159AbhBYUR3 (ORCPT + 99 others); Thu, 25 Feb 2021 15:17:29 -0500 Received: from mga14.intel.com ([192.55.52.115]:38233 "EHLO mga14.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234141AbhBYURP (ORCPT ); Thu, 25 Feb 2021 15:17:15 -0500 IronPort-SDR: J/G7zmWtYflgRYjgqnZSyEiwBnDfMN/x95pe+eDjEOmydL5qrQbzAX/68Xr4jbIzBCTfXeZeSV LNsucaWGHGvQ== X-IronPort-AV: E=McAfee;i="6000,8403,9906"; a="184965109" X-IronPort-AV: E=Sophos;i="5.81,206,1610438400"; d="scan'208";a="184965109" Received: from orsmga003.jf.intel.com ([10.7.209.27]) by fmsmga103.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 25 Feb 2021 12:15:29 -0800 IronPort-SDR: Lp0kxLVj3i3LsaWdmuhDqNRxSPh5IJ7dkLQYetfa1ZtANFwJdVgAqWZBAoXdjBk2jOqS4G7Hf1 1FxUcWP/KMxA== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.81,206,1610438400"; d="scan'208";a="365580912" Received: from linux.intel.com ([10.54.29.200]) by orsmga003.jf.intel.com with ESMTP; 25 Feb 2021 12:15:28 -0800 Received: from [10.254.88.247] (kliang2-MOBL.ccr.corp.intel.com [10.254.88.247]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by linux.intel.com (Postfix) with ESMTPS id B79835804A9; Thu, 25 Feb 2021 12:15:27 -0800 (PST) Subject: Re: [perf] perf_fuzzer causes crash in intel_pmu_drain_pebs_nhm() To: Vince Weaver Cc: Peter Zijlstra , linux-kernel@vger.kernel.org, Ingo Molnar , Arnaldo Carvalho de Melo , Mark Rutland , Alexander Shishkin , Jiri Olsa , Namhyung Kim , Stephane Eranian References: <61a56699-aab4-ef6-ed8d-a22b6bf532d@maine.edu> <7170d3b-c17f-1ded-52aa-cc6d9ae999f4@maine.edu> <2a655469-de9d-c80-dd7f-26436d6f03a@maine.edu> From: "Liang, Kan" Message-ID: <9b3f84e0-e1cc-cebe-43b6-fa062484ad28@linux.intel.com> Date: Thu, 25 Feb 2021 15:15:26 -0500 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.7.1 MIME-Version: 1.0 In-Reply-To: <2a655469-de9d-c80-dd7f-26436d6f03a@maine.edu> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2/11/2021 5:14 PM, Vince Weaver wrote: > On Thu, 11 Feb 2021, Liang, Kan wrote: > >>> On Thu, Jan 28, 2021 at 02:49:47PM -0500, Vince Weaver wrote: >> I'd like to reproduce it on my machine. >> Is this issue only found in a Haswell client machine? >> >> To reproduce the issue, can I use ./perf_fuzzer under perf_event_tests/fuzzer? >> Do I need to apply any parameters with ./perf_fuzzer? >> >> Usually how long does it take to reproduce the issue? > > On my machine if I run the commands > echo 1 > /proc/sys/kernel/nmi_watchdog > echo 0 > /proc/sys/kernel/perf_event_paranoid > echo 1000 > /proc/sys/kernel/perf_event_max_sample_rate > ./perf_fuzzer -s 30000 -r 1611784483 > > it is repeatable within a minute, but because of the nature of the fuzzer > it probably won't work for you because the random events will diverge > based on the different configs of the system. > > I can try to generate a simple reproducer, I've just been extremely busy > here at work and haven't had the chance. > > If you want to try to reproduce it the hard way, run the > ./fast_repro99.sh > script in the perf_fuzzer directory. It will start fuzzing. My machine > turned up the issue within a day or so. > Sorry for the late response. Just want to let you know I'm still trying to reproduce the issue. I only have a Haswell server on hand. I run the ./fast_repro99.sh on the machine for 3 days, but I didn't observe any crash. Now, I'm looking for a HSW client in my lab. I will check if I can reproduce it on a client machine. If you have a simple reproducer, please let me know. Thanks, Kan