Received: by 2002:a25:d7c1:0:0:0:0:0 with SMTP id o184csp5200026ybg; Mon, 21 Oct 2019 23:04:09 -0700 (PDT) X-Google-Smtp-Source: APXvYqzeQdi358J04BvwYStd5KYrB6+/HoiHtSyLyjmZ6iyv7Cmp4jzeA+dqCVJz1ule7SfBH1/o X-Received: by 2002:a17:906:9aa:: with SMTP id q10mr24966530eje.93.1571724249458; Mon, 21 Oct 2019 23:04:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1571724249; cv=none; d=google.com; s=arc-20160816; b=R0LZ3hN8fNUJLpEfr2fyRf8cJNi12EQ9saGVctFoRm5kSLapkpP63aTm9Ke7w3t82i Q4UTgTlbO1LTDSXNsM+TL8AqT4PBIhJtIkH99b7Q3bRROM4QbhFlsDlOQx8sXxV1lm6M iRbGVIQ1pmQlaMgB79kOdJlXasJsq/0TAh6BVFCQzy7aWr9FDZkP0aqnuVfZ7FD8QJvH bVLIRgQ80p/iZotNZQz4nzIuzZvxtVEuEvNGIvl/hPgWEyai2HfYVS+h91lxNiCgstgZ KotR2SnYHqmgdTmjvrRnaqB5nkFwAisDg3gmUuE7WzGfQ+2Gec/5SYltu9XEAsUGSbwz v27A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:organization:references:cc:to:from:subject; bh=ktSAvxeWcmGPxCt6iEzMgmho2GjeDqOum114BUuWEeY=; b=byS5d7UOSTOjWV4Qv/0ENB+e17gJPQYRhZrEusthLpRoM9POVMX4hs4VVSiL+VwD+l kLEpByhPv3Xci/TPL0HM58jMri5wadXzyBmBuMcXnX2PJRRh7WoFtBhfuXdOyjLzKNa+ tqsYBSU2sq6hckCBqkA3diLp8dJOP/eBs7Bq+SgOQYEVDjuRRI6lgVnZBnCNP00xydRo Sr0SvC+4gsriYm5grocRpn4VGsjFx5/XR5VZurS3JvfVYpYfBPw2iJoh+tUmTFY/kc9N uKUrp0v43Zw76aYk+iCg318M8T6z6mrptfiwOXKeUXB2wMGq5YI4v1A39t3md/JxPXMs wFaQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j18si10165091ejv.201.2019.10.21.23.03.45; Mon, 21 Oct 2019 23:04:09 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2387712AbfJVGBR (ORCPT + 99 others); Tue, 22 Oct 2019 02:01:17 -0400 Received: from mga09.intel.com ([134.134.136.24]:31498 "EHLO mga09.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727943AbfJVGBQ (ORCPT ); Tue, 22 Oct 2019 02:01:16 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by orsmga102.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 21 Oct 2019 23:01:16 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.67,326,1566889200"; d="scan'208";a="227580609" Received: from linux.intel.com ([10.54.29.200]) by fmsmga002.fm.intel.com with ESMTP; 21 Oct 2019 23:01:14 -0700 Received: from [10.249.230.171] (abudanko-mobl.ccr.corp.intel.com [10.249.230.171]) by linux.intel.com (Postfix) with ESMTP id 69818580100; Mon, 21 Oct 2019 23:01:12 -0700 (PDT) Subject: [PATCH v4 4/4] perf/core,x86: synchronize PMU task contexts on optimized context switches From: Alexey Budankov To: Peter Zijlstra Cc: Arnaldo Carvalho de Melo , Ingo Molnar , Alexander Shishkin , Jiri Olsa , Namhyung Kim , Andi Kleen , Kan Liang , Stephane Eranian , Ian Rogers , Song Liu , linux-kernel References: Organization: Intel Corp. Message-ID: <4d6320bb-0d15-0028-aefb-a176c986b8db@linux.intel.com> Date: Tue, 22 Oct 2019 09:01:11 +0300 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.9.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Install Intel specific PMU task context synchronization adapter and extend optimized context switch path with PMU specific task context synchronization to fix LBR callstack virtualization on context switches. Signed-off-by: Alexey Budankov --- arch/x86/events/intel/core.c | 7 +++++++ kernel/events/core.c | 13 +++++++++++++ 2 files changed, 20 insertions(+) diff --git a/arch/x86/events/intel/core.c b/arch/x86/events/intel/core.c index bbf6588d47ee..b9f518aa478e 100644 --- a/arch/x86/events/intel/core.c +++ b/arch/x86/events/intel/core.c @@ -3820,6 +3820,12 @@ static void intel_pmu_sched_task(struct perf_event_context *ctx, intel_pmu_lbr_sched_task(ctx, sched_in); } +static void intel_pmu_sync_task_ctx(struct x86_perf_task_context *one, + struct x86_perf_task_context *another) +{ + intel_pmu_lbr_sync_task_ctx(one, another); +} + static int intel_pmu_check_period(struct perf_event *event, u64 value) { return intel_pmu_has_bts_period(event, value) ? -EINVAL : 0; @@ -3955,6 +3961,7 @@ static __initconst const struct x86_pmu intel_pmu = { .guest_get_msrs = intel_guest_get_msrs, .sched_task = intel_pmu_sched_task, + .sync_task_ctx = intel_pmu_sync_task_ctx, .check_period = intel_pmu_check_period, diff --git a/kernel/events/core.c b/kernel/events/core.c index f9a5d4356562..51d4138b06f7 100644 --- a/kernel/events/core.c +++ b/kernel/events/core.c @@ -3204,11 +3204,24 @@ static void perf_event_context_sched_out(struct task_struct *task, int ctxn, raw_spin_lock(&ctx->lock); raw_spin_lock_nested(&next_ctx->lock, SINGLE_DEPTH_NESTING); if (context_equiv(ctx, next_ctx)) { + struct pmu *pmu = ctx->pmu; + WRITE_ONCE(ctx->task, next); WRITE_ONCE(next_ctx->task, task); swap(ctx->task_ctx_data, next_ctx->task_ctx_data); + /* + * PMU specific parts of task perf context can require + * additional synchronization which makes sense only if + * both next_ctx->task_ctx_data and ctx->task_ctx_data + * pointers are allocated. As an example of such + * synchronization see implementation details of Intel + * LBR call stack data profiling; + */ + if (ctx->task_ctx_data && next_ctx->task_ctx_data) + pmu->sync_task_ctx(next_ctx->task_ctx_data, + ctx->task_ctx_data); /* * RCU_INIT_POINTER here is safe because we've not * modified the ctx and the above modification of -- 2.20.1