Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp6903532rwd; Tue, 6 Jun 2023 03:47:31 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ6UHMcSdXqn4e7+8qe1lusUGZ6KwjY6w7phy/SFss8Cgq8qnNZZqSiaBOfuMh1dkbBAXnyR X-Received: by 2002:a05:622a:355:b0:3f5:2faf:7917 with SMTP id r21-20020a05622a035500b003f52faf7917mr1898431qtw.9.1686048451008; Tue, 06 Jun 2023 03:47:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1686048450; cv=none; d=google.com; s=arc-20160816; b=LYArtklFJ9ch2PL7kj/sA5OvBt387DIvqGVVt+R/onYsvBf8vIcMX12bgWrUVCMuYb Er3gupDzxiBZHwdTtYSwU+9xAXO5tCFBHKtTvIB23x8rg1fvdWj/2lSTFjjQQBvhOqD8 OzqHx/tFipjaCEJvfH+e+4FMKlSR7OJ4Q5t2Ku6gS9/Xxu0RwH67o0PcN1TDtgg1rRSF dl/E0LHitwswO9tBFtZkYgMhzjh5vYTGQ0bTKcM+Pi5QnoTkLpp4ER1zUkwVTutuM5Xc A+71iaZrueWFFgbuD6HPcQPw0cwHF5SB83QoAiCvLad4esppCsfA/njzU5zNBz6vkaAB nLIg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id; bh=pNO27YIXeZWvxO9iRph/r9v9iRuskViT6vl8Y6e/pss=; b=GhUIUaImKWQA/djs74598LumLCJEjOGay0I4DrgvyU/3bMp4UO/xvPxp/ZMgtc/yVu O3LoDk0Oe6lhl7s0B9ZoBYTLtayFZqm/Fyk00ARKISilA+O/EYXnh782RCuKRXHfraHH UvOn8jyPYnxfX2CJL05uMfgyT+7J4hDLq5CgxkeS0SRLM/NDHUHM8wExPXxB5ZQBqSRT OSxg0IyIaDBLNZ9b02/bAhBlkJd9jpebDXphBHB8nQAoMKIjkshBfRHU0tlJiEPyk7pi sEGQsSrjHIk8tePuHnrvnBCsUwyM5LxTt957fL7qicdExUCZuAJa5Tig0N143e+NyRYu Xjsg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id s31-20020a05622a1a9f00b003f9a7707278si1989134qtc.64.2023.06.06.03.47.16; Tue, 06 Jun 2023 03:47:30 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236838AbjFFKfa (ORCPT + 99 others); Tue, 6 Jun 2023 06:35:30 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55786 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S237082AbjFFKfF (ORCPT ); Tue, 6 Jun 2023 06:35:05 -0400 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 0356D10E2; Tue, 6 Jun 2023 03:34:47 -0700 (PDT) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 02A8B2F4; Tue, 6 Jun 2023 03:35:32 -0700 (PDT) Received: from [10.163.44.166] (unknown [10.163.44.166]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id EF9B23F587; Tue, 6 Jun 2023 03:34:34 -0700 (PDT) Message-ID: Date: Tue, 6 Jun 2023 16:04:25 +0530 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.11.0 Subject: Re: [PATCH V11 05/10] arm64/perf: Add branch stack support in ARMV8 PMU Content-Language: en-US To: Mark Rutland Cc: linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, will@kernel.org, catalin.marinas@arm.com, Mark Brown , James Clark , Rob Herring , Marc Zyngier , Suzuki Poulose , Peter Zijlstra , Ingo Molnar , Arnaldo Carvalho de Melo , linux-perf-users@vger.kernel.org References: <20230531040428.501523-1-anshuman.khandual@arm.com> <20230531040428.501523-6-anshuman.khandual@arm.com> From: Anshuman Khandual In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-4.3 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_MED,SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 6/5/23 17:35, Mark Rutland wrote: > On Wed, May 31, 2023 at 09:34:23AM +0530, Anshuman Khandual wrote: >> This enables support for branch stack sampling event in ARMV8 PMU, checking >> has_branch_stack() on the event inside 'struct arm_pmu' callbacks. Although >> these branch stack helpers armv8pmu_branch_XXXXX() are just dummy functions >> for now. While here, this also defines arm_pmu's sched_task() callback with >> armv8pmu_sched_task(), which resets the branch record buffer on a sched_in. > > This generally looks good, but I have a few comments below. > > [...] > >> +static inline bool armv8pmu_branch_valid(struct perf_event *event) >> +{ >> + WARN_ON_ONCE(!has_branch_stack(event)); >> + return false; >> +} > > IIUC this is for validating the attr, so could we please name this > armv8pmu_branch_attr_valid() ? Sure, will change the name and updated call sites. > > [...] > >> +static int branch_records_alloc(struct arm_pmu *armpmu) >> +{ >> + struct pmu_hw_events *events; >> + int cpu; >> + >> + for_each_possible_cpu(cpu) { >> + events = per_cpu_ptr(armpmu->hw_events, cpu); >> + events->branches = kzalloc(sizeof(struct branch_records), GFP_KERNEL); >> + if (!events->branches) >> + return -ENOMEM; >> + } >> + return 0; > > This leaks memory if any allocation fails, and the next patch replaces this > code entirely. Okay. > > Please add this once in a working state. Either use the percpu allocation > trick in the next patch from the start, or have this kzalloc() with a > corresponding kfree() in an error path. I will change branch_records_alloc() as suggested in the next patch's thread and fold those changes here in this patch. > >> } >> >> static int armv8pmu_probe_pmu(struct arm_pmu *cpu_pmu) >> @@ -1145,12 +1162,24 @@ static int armv8pmu_probe_pmu(struct arm_pmu *cpu_pmu) >> }; >> int ret; >> >> + ret = armv8pmu_private_alloc(cpu_pmu); >> + if (ret) >> + return ret; >> + >> ret = smp_call_function_any(&cpu_pmu->supported_cpus, >> __armv8pmu_probe_pmu, >> &probe, 1); >> if (ret) >> return ret; >> >> + if (arm_pmu_branch_stack_supported(cpu_pmu)) { >> + ret = branch_records_alloc(cpu_pmu); >> + if (ret) >> + return ret; >> + } else { >> + armv8pmu_private_free(cpu_pmu); >> + } > > I see from the next patch that "private" is four ints, so please just add that > to struct arm_pmu under an ifdef CONFIG_ARM64_BRBE. That'll simplify this, and > if we end up needing more space in future we can consider factoring it out. struct arm_pmu { ........................................ /* Implementation specific attributes */ void *private; } private pointer here creates an abstraction for given pmu implementation to hide attribute details without making it known to core arm pmu layer. Although adding ifdef CONFIG_ARM64_BRBE solves the problem as mentioned above, it does break that abstraction. Currently arm_pmu layer is aware about 'branch records' but not about BRBE in particular which the driver adds later on. I suggest we should not break that abstraction. Instead a global 'static struct brbe_hw_attr' in drivers/perf/arm_brbe.c can be initialized into arm_pmu->private during armv8pmu_branch_probe(), which will also solve the allocation-free problem. Also similar helpers armv8pmu_task_ctx_alloc()/free() could be defined to manage task context cache i.e arm_pmu->pmu.task_ctx_cache independently. But now armv8pmu_task_ctx_alloc() can be called after pmu probe confirms to have arm_pmu->has_branch_stack. > >> + >> return probe.present ? 0 : -ENODEV; >> } > > It also seems odd to ceck probe.present *after* checking > arm_pmu_branch_stack_supported(). I will reorganize as suggested below. > > With the allocation removed I think this can be written more clearly as: > > | static int armv8pmu_probe_pmu(struct arm_pmu *cpu_pmu) > | { > | struct armv8pmu_probe_info probe = { > | .pmu = cpu_pmu, > | .present = false, > | }; > | int ret; > | > | ret = smp_call_function_any(&cpu_pmu->supported_cpus, > | __armv8pmu_probe_pmu, > | &probe, 1); > | if (ret) > | return ret; > | > | if (!probe.present) > | return -ENODEV; > | > | if (arm_pmu_branch_stack_supported(cpu_pmu)) > | ret = branch_records_alloc(cpu_pmu); > | > | return ret; > | }