Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933000AbcDEB76 (ORCPT ); Mon, 4 Apr 2016 21:59:58 -0400 Received: from [217.140.101.70] ([217.140.101.70]:48451 "EHLO foss.arm.com" rhost-flags-FAIL-FAIL-OK-OK) by vger.kernel.org with ESMTP id S932600AbcDEB75 (ORCPT ); Mon, 4 Apr 2016 21:59:57 -0400 Date: Tue, 5 Apr 2016 00:33:39 +0100 From: Mark Rutland To: Tai Tri Nguyen Cc: will.deacon@arm.com, catalin.marinas@arm.com, linux-kernel@vger.kernel.org, devicetree@vger.kernel.org, linux-arm-kernel , patches Subject: Re: [PATCH 3/4] perf: xgene: Add APM X-Gene SoC Performance Monitoring Unit driver Message-ID: <20160404233338.GA1917@svinekod> References: <1459467472-31561-1-git-send-email-ttnguyen@apm.com> <1459467472-31561-4-git-send-email-ttnguyen@apm.com> <20160401121820.GB29876@leverpostej> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2545 Lines: 70 On Mon, Apr 04, 2016 at 04:42:11PM -0700, Tai Tri Nguyen wrote: > On Fri, Apr 1, 2016 at 5:18 AM, Mark Rutland wrote: > >> +static int get_next_avail_cntr(struct xgene_pmu_dev *pmu_dev) > >> +{ > >> + int shift, cntr, retval; > >> + unsigned long flags; > >> + > >> + raw_spin_lock_irqsave(&pmu_dev->lock, flags); > >> + > >> + for (cntr = 0; cntr < pmu_dev->max_counters; cntr++) { > >> + shift = cntr; > >> + if (!(pmu_dev->cntr_assign_mask & (1ULL << shift))) { > >> + pmu_dev->cntr_assign_mask |= (1ULL << shift); > >> + retval = cntr; > >> + goto out; > >> + } > >> + } > >> + retval = -ENOSPC; > > Are the spinlocks necessary? > > > > I thought add and del couldn't be called in parallel for the same > > context, and those are the only users of this mask. > > > > I'm trying to avoid the case where multiple events may claim the same > available counter. > There's a race condition here. I don't think there is, so long as events are all associated with the same CPU, and hence the same ctx. As I mentioned, add and del are the only users of this mask. Both of those are only called with ctx->lock held, so I couldn't see how these could race. Were you considering events in different cpu contexts racing? Is there something I've missed? > >> + hwc->config = config; > >> + if (config1) > >> + hwc->extra_reg.config = config1; > >> + else > >> + /* Enable all Agents */ > >> + hwc->extra_reg.config = 0xFFFFFFFFFFFFFFFFULL; > > > > I'm not sure I follow what's going on here. > > > > It would be good to document precisely what this means. > > These are X-Gene PMU specific for monitoring performance of a specific > data path. > X-Gene PMUs have 2 registers capable of masking the Agents from which > the request come from. If the bit with the bit number corresponding to > the AgentID > is set, the event will be counted only if it is caused by a request > from that agent. > Each PMU has different set of Agents. By default, the event will be counted for > all agent requests. > > I'll have it commented better for next revision of the patch. It might be worth having something under Documentation/ for this, similarly to what we do for CCN in Documentation/arm/CCN.txt. How is the user expected to determine agent IDs? Is there a listing somewhere? Does this change between reivisions? This may be worth documenting. Thanks, Mark.