Received: by 2002:a05:6a11:4021:0:0:0:0 with SMTP id ky33csp203051pxb; Wed, 22 Sep 2021 20:16:34 -0700 (PDT) X-Google-Smtp-Source: ABdhPJykZ04zPt1ztRzX0CX0oNd82oO8b53tQkuZVSGaV7BgXnTQ3hne1YICHdhKz8RQJPaWKwts X-Received: by 2002:a17:906:2418:: with SMTP id z24mr2645040eja.105.1632366994035; Wed, 22 Sep 2021 20:16:34 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1632366994; cv=none; d=google.com; s=arc-20160816; b=qRIxzbrtO10G9+6RJlLANgSpw0oQyU8GKUO+hqYUQt6P2Z4bQ7ttSZsIlbd7p/hPDR 0bClqGg4jYIrwNh7AdjcKwOUnJJPwwes2o3IDciLRit+6O4Eoi0zuA+6+jRD2K8p0XcQ /39Uh2Px3bUp0qear5GENCz7Fy5lOJo7bX+wnZ3Pkb+HahIIpcKiaME3cwFiYcSmaggq zxXuSo3Xs2kDenVcMvRwi1BWOw1sSzC44eg/alUJgSuuRlk6ZqrnyIo3PlqXpQTW2Blf ZzUMImKqzCOeFFosB7zsGxHKrFsesuogyWyuOx/vN+NID0hq/g4gTpVzvne9NVV9JYCj XLHw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=A5ilyySj54GfwDOk1AGZuIWsLXyudIZO6/aUMlyMJYQ=; b=DsmcX51Op3TVLGg3MLYyr/DQGWQZ3OTzoAbQ6vAzd0pS0cApkAReszmBr5uf7WKFR0 S6SHnMUlyqVj4nAX2UaDir/SKkhFqr3vqbsz+/mMMExSDqB43Smgrc84NoPqVeS6Ii7S nvat4TphcVOkeL09NhWT4RjNsIe4Ck4jjolH91Ck8/4vQ+2GOuR2kvZlNa1NlQQUV0q9 vjvhsn1rCRN8AStod3v4sn3WKOVIHQ4HtxKU3COqyw6jAkhTbtl2qiofVz0nLSV+x9D1 ZBFwKbwK04uPG2jGGWZqMB+SlO/uAjAhuOHm51wGyR3/Byp7fZSjEgRUfcUTiUXsg38j HZ7g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id l6si4482489ejq.366.2021.09.22.20.16.10; Wed, 22 Sep 2021 20:16:34 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238190AbhIWDQS (ORCPT + 99 others); Wed, 22 Sep 2021 23:16:18 -0400 Received: from foss.arm.com ([217.140.110.172]:56726 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235623AbhIWDQP (ORCPT ); Wed, 22 Sep 2021 23:16:15 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id C3414D6E; Wed, 22 Sep 2021 20:14:43 -0700 (PDT) Received: from [10.163.73.173] (unknown [10.163.73.173]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 014BB3F59C; Wed, 22 Sep 2021 20:14:39 -0700 (PDT) Subject: Re: [PATCH v2 16/17] coresight: trbe: Work around write to out of range To: Suzuki K Poulose , linux-arm-kernel@lists.infradead.org Cc: linux-kernel@vger.kernel.org, maz@kernel.org, catalin.marinas@arm.com, mark.rutland@arm.com, james.morse@arm.com, leo.yan@linaro.org, mike.leach@linaro.org, mathieu.poirier@linaro.org, will@kernel.org, lcherian@marvell.com, coresight@lists.linaro.org References: <20210921134121.2423546-1-suzuki.poulose@arm.com> <20210921134121.2423546-17-suzuki.poulose@arm.com> From: Anshuman Khandual Message-ID: <6ecb0391-1525-8d65-93ba-b66424dadfd3@arm.com> Date: Thu, 23 Sep 2021 08:45:46 +0530 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0 MIME-Version: 1.0 In-Reply-To: <20210921134121.2423546-17-suzuki.poulose@arm.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 9/21/21 7:11 PM, Suzuki K Poulose wrote: > TRBE implementations affected by Arm erratum (2253138 or 2224489), could > write to the next address after the TRBLIMITR.LIMIT, instead of wrapping > to the TRBBASER. This implies that the TRBE could potentially corrupt : > > - A page used by the rest of the kernel/user (if the LIMIT = end of > perf ring buffer) > - A page within the ring buffer, but outside the driver's range. > [head, head + size]. This may contain some trace data, may be > consumed by the userspace. > > We workaround this erratum by : > - Making sure that there is at least an extra PAGE space left in the > TRBE's range than we normally assign. This will be additional to other > restrictions (e.g, the TRBE alignment for working around > TRBE_WORKAROUND_OVERWRITE_IN_FILL_MODE, where there is a minimum of PAGE_SIZE. > Thus we would have 2 * PAGE_SIZE) > > - Adjust the LIMIT to leave the last PAGE_SIZE out of the TRBE's allowed > range (i.e, TRBEBASER...TRBLIMITR.LIMIT), by : > > TRBLIMITR.LIMIT -= PAGE_SIZE > > Cc: Anshuman Khandual > Cc: Mathieu Poirier > Cc: Mike Leach > Cc: Leo Yan > Signed-off-by: Suzuki K Poulose > --- > drivers/hwtracing/coresight/coresight-trbe.c | 59 +++++++++++++++++++- > 1 file changed, 57 insertions(+), 2 deletions(-) > > diff --git a/drivers/hwtracing/coresight/coresight-trbe.c b/drivers/hwtracing/coresight/coresight-trbe.c > index 02f9e00e2091..ea907345354c 100644 > --- a/drivers/hwtracing/coresight/coresight-trbe.c > +++ b/drivers/hwtracing/coresight/coresight-trbe.c > @@ -86,7 +86,8 @@ struct trbe_buf { > * affects the given instance of the TRBE. > */ > #define TRBE_WORKAROUND_OVERWRITE_FILL_MODE 0 > -#define TRBE_ERRATA_MAX 1 > +#define TRBE_WORKAROUND_WRITE_OUT_OF_RANGE 1 > +#define TRBE_ERRATA_MAX 2 > > /* > * Safe limit for the number of bytes that may be overwritten > @@ -96,6 +97,7 @@ struct trbe_buf { > > static unsigned long trbe_errata_cpucaps[TRBE_ERRATA_MAX] = { > [TRBE_WORKAROUND_OVERWRITE_FILL_MODE] = ARM64_WORKAROUND_TRBE_OVERWRITE_FILL_MODE, > + [TRBE_WORKAROUND_WRITE_OUT_OF_RANGE] = ARM64_WORKAROUND_TRBE_WRITE_OUT_OF_RANGE, > }; > > /* > @@ -279,7 +281,20 @@ trbe_handle_to_cpudata(struct perf_output_handle *handle) > > static u64 trbe_min_trace_buf_size(struct perf_output_handle *handle) > { > - return TRBE_TRACE_MIN_BUF_SIZE; > + u64 size = TRBE_TRACE_MIN_BUF_SIZE; > + struct trbe_cpudata *cpudata = trbe_handle_to_cpudata(handle); > + > + /* > + * When the TRBE is affected by an erratum that could make it > + * write to the next "virtually addressed" page beyond the LIMIT. What if the next "virtually addressed" page is just blocked from future usage in the kernel and never really gets mapped into a physical page ? In that case it would be guaranteed that, a next "virtually addressed" page would not even exist after the LIMIT pointer and hence the errata would not be triggered. Something like there is a virtual mapping cliff right after the LIMIT pointer from the MMU perspective. Although it might be bit tricky. Currently the entire ring buffer gets mapped at once with vmap() in arm_trbe_alloc_buffer(). Just to achieve the above solution, each computation of the LIMIT pointer needs to be followed by a temporary unmapping of next virtual page from existing vmap() buffer. Subsequently it could be mapped back as trbe_buf->pages always contains all the physical pages from the perf ring buffer. > + * We need to make sure there is always a PAGE after the LIMIT, > + * within the buffer. Thus we ensure there is at least an extra > + * page than normal. With this we could then adjust the LIMIT > + * pointer down by a PAGE later. > + */ > + if (trbe_has_erratum(cpudata, TRBE_WORKAROUND_WRITE_OUT_OF_RANGE)) > + size += PAGE_SIZE; > + return size; > } > > /* > @@ -585,6 +600,17 @@ static unsigned long trbe_get_trace_size(struct perf_output_handle *handle, > /* > * If the TRBE has wrapped around the write pointer has > * wrapped and should be treated as limit. > + * > + * When the TRBE is affected by TRBE_WORKAROUND_WRITE_OUT_OF_RANGE, > + * it may write upto 64bytes beyond the "LIMIT". The driver already > + * keeps a valid page next to the LIMIT and we could potentially > + * consume the trace data that may have been collected there. But we > + * cannot be really sure it is available, and the TRBPTR may not > + * indicate the same. Also, affected cores are also affected by another > + * erratum which forces the PAGE_SIZE alignment on the TRBPTR, and thus > + * could potentially pad an entire PAGE_SIZE - 64bytes, to get those > + * 64bytes. Thus we ignore the potential triggering of the erratum > + * on WRAP and limit the data to LIMIT. > */ > if (wrap) > write = get_trbe_limit_pointer(); > @@ -811,6 +837,35 @@ static int trbe_apply_work_around_before_enable(struct trbe_buf *buf) > buf->trbe_write += TRBE_WORKAROUND_OVERWRITE_FILL_MODE_SKIP_BYTES; > } > > + /* > + * TRBE_WORKAROUND_WRITE_OUT_OF_RANGE could cause the TRBE to write to > + * the next page after the TRBLIMITR.LIMIT. For perf, the "next page" > + * may be: > + * - The page beyond the ring buffer. This could mean, TRBE could > + * corrupt another entity (kernel / user) > + * - A portion of the "ring buffer" consumed by the userspace. > + * i.e, a page outisde [head, head + size]. > + * > + * We work around this by: > + * - Making sure that we have at least an extra space of PAGE left > + * in the ring buffer [head, head + size], than we normally do > + * without the erratum. See trbe_min_trace_buf_size(). > + * > + * - Adjust the TRBLIMITR.LIMIT to leave the extra PAGE outside > + * the TRBE's range (i.e [TRBBASER, TRBLIMITR.LIMI] ). > + */ > + if (trbe_has_erratum(buf->cpudata, TRBE_WORKAROUND_WRITE_OUT_OF_RANGE)) { > + s64 space = buf->trbe_limit - buf->trbe_write; > + /* > + * We must have more than a PAGE_SIZE worth space in the proposed > + * range for the TRBE. > + */ > + if (WARN_ON(space <= PAGE_SIZE || > + !IS_ALIGNED(buf->trbe_limit, PAGE_SIZE))) > + return -EINVAL; > + buf->trbe_limit -= PAGE_SIZE; > + } > + > return 0; > } > >