2017-12-01 11:24:57

by Will Deacon

[permalink] [raw]
Subject: Re: [PATCH v4 2/2] arm64: Add software workaround for Falkor erratum 1041

On Mon, Nov 27, 2017 at 05:18:00PM -0600, Shanker Donthineni wrote:
> The ARM architecture defines the memory locations that are permitted
> to be accessed as the result of a speculative instruction fetch from
> an exception level for which all stages of translation are disabled.
> Specifically, the core is permitted to speculatively fetch from the
> 4KB region containing the current program counter 4K and next 4K.
>
> When translation is changed from enabled to disabled for the running
> exception level (SCTLR_ELn[M] changed from a value of 1 to 0), the
> Falkor core may errantly speculatively access memory locations outside
> of the 4KB region permitted by the architecture. The errant memory
> access may lead to one of the following unexpected behaviors.
>
> 1) A System Error Interrupt (SEI) being raised by the Falkor core due
> to the errant memory access attempting to access a region of memory
> that is protected by a slave-side memory protection unit.
> 2) Unpredictable device behavior due to a speculative read from device
> memory. This behavior may only occur if the instruction cache is
> disabled prior to or coincident with translation being changed from
> enabled to disabled.
>
> The conditions leading to this erratum will not occur when either of the
> following occur:
> 1) A higher exception level disables translation of a lower exception level
> (e.g. EL2 changing SCTLR_EL1[M] from a value of 1 to 0).
> 2) An exception level disabling its stage-1 translation if its stage-2
> translation is enabled (e.g. EL1 changing SCTLR_EL1[M] from a value of 1
> to 0 when HCR_EL2[VM] has a value of 1).
>
> To avoid the errant behavior, software must execute an ISB immediately
> prior to executing the MSR that will change SCTLR_ELn[M] from 1 to 0.
>
> Signed-off-by: Shanker Donthineni <[email protected]>
> ---
> Changes since v3:
> Rebased to kernel v4.15-rc1.
> Changes since v2:
> Repost the corrected patches.
> Changes since v1:
> Apply the workaround where it's required.
>
> Documentation/arm64/silicon-errata.txt | 1 +
> arch/arm64/Kconfig | 12 +++++++++++-
> arch/arm64/include/asm/assembler.h | 19 +++++++++++++++++++
> arch/arm64/include/asm/cpucaps.h | 3 ++-
> arch/arm64/kernel/cpu-reset.S | 1 +
> arch/arm64/kernel/cpu_errata.c | 16 ++++++++++++++++
> arch/arm64/kernel/efi-entry.S | 2 ++
> arch/arm64/kernel/head.S | 1 +
> arch/arm64/kernel/relocate_kernel.S | 1 +
> arch/arm64/kvm/hyp-init.S | 1 +

This is an awful lot of code just to add an ISB instruction prior to
disabling the MMU. Why do you need to go through the alternatives framework
for this? Just do it with an #ifdef; this isn't a fastpath.

Will


2017-12-03 13:35:37

by Shanker Donthineni

[permalink] [raw]
Subject: Re: [PATCH v4 2/2] arm64: Add software workaround for Falkor erratum 1041

Hi Will, thanks for your review comments.

On 12/01/2017 05:24 AM, Will Deacon wrote:
> On Mon, Nov 27, 2017 at 05:18:00PM -0600, Shanker Donthineni wrote:
>> The ARM architecture defines the memory locations that are permitted
>> to be accessed as the result of a speculative instruction fetch from
>> an exception level for which all stages of translation are disabled.
>> Specifically, the core is permitted to speculatively fetch from the
>> 4KB region containing the current program counter 4K and next 4K.
>>
>> When translation is changed from enabled to disabled for the running
>> exception level (SCTLR_ELn[M] changed from a value of 1 to 0), the
>> Falkor core may errantly speculatively access memory locations outside
>> of the 4KB region permitted by the architecture. The errant memory
>> access may lead to one of the following unexpected behaviors.
>>
>> 1) A System Error Interrupt (SEI) being raised by the Falkor core due
>> to the errant memory access attempting to access a region of memory
>> that is protected by a slave-side memory protection unit.
>> 2) Unpredictable device behavior due to a speculative read from device
>> memory. This behavior may only occur if the instruction cache is
>> disabled prior to or coincident with translation being changed from
>> enabled to disabled.
>>
>> The conditions leading to this erratum will not occur when either of the
>> following occur:
>> 1) A higher exception level disables translation of a lower exception level
>> (e.g. EL2 changing SCTLR_EL1[M] from a value of 1 to 0).
>> 2) An exception level disabling its stage-1 translation if its stage-2
>> translation is enabled (e.g. EL1 changing SCTLR_EL1[M] from a value of 1
>> to 0 when HCR_EL2[VM] has a value of 1).
>>
>> To avoid the errant behavior, software must execute an ISB immediately
>> prior to executing the MSR that will change SCTLR_ELn[M] from 1 to 0.
>>
>> Signed-off-by: Shanker Donthineni <[email protected]>
>> ---
>> Changes since v3:
>> Rebased to kernel v4.15-rc1.
>> Changes since v2:
>> Repost the corrected patches.
>> Changes since v1:
>> Apply the workaround where it's required.
>>
>> Documentation/arm64/silicon-errata.txt | 1 +
>> arch/arm64/Kconfig | 12 +++++++++++-
>> arch/arm64/include/asm/assembler.h | 19 +++++++++++++++++++
>> arch/arm64/include/asm/cpucaps.h | 3 ++-
>> arch/arm64/kernel/cpu-reset.S | 1 +
>> arch/arm64/kernel/cpu_errata.c | 16 ++++++++++++++++
>> arch/arm64/kernel/efi-entry.S | 2 ++
>> arch/arm64/kernel/head.S | 1 +
>> arch/arm64/kernel/relocate_kernel.S | 1 +
>> arch/arm64/kvm/hyp-init.S | 1 +
>
> This is an awful lot of code just to add an ISB instruction prior to
> disabling the MMU. Why do you need to go through the alternatives framework
> for this? Just do it with an #ifdef; this isn't a fastpath.
>

We can avoid changes to only two files cpu_errata.c and cpucaps.h without using
the alternatives framework. Even though it's in slow path, cpu-errata.c changes
provides a nice debug message which indicates the erratum E1041 is applied.

Erratum log information would be very useful to conform our customers using the
right kernel with E1014 patch by looking at dmesg. Other than that I don't have
any other strong opinion to avoid alternatives and handle using #idef.

Should I go head and post v5 patch without alternatives?

> Will
>
> _______________________________________________
> linux-arm-kernel mailing list
> [email protected]
> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
>

--
Shanker Donthineni
Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc.
Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.

2017-12-04 20:09:57

by Shanker Donthineni

[permalink] [raw]
Subject: Re: [PATCH v4 2/2] arm64: Add software workaround for Falkor erratum 1041

Hi Will,

On 12/03/2017 07:35 AM, Shanker Donthineni wrote:
> Hi Will, thanks for your review comments.
>
> On 12/01/2017 05:24 AM, Will Deacon wrote:
>> On Mon, Nov 27, 2017 at 05:18:00PM -0600, Shanker Donthineni wrote:
>>> The ARM architecture defines the memory locations that are permitted
>>> to be accessed as the result of a speculative instruction fetch from
>>> an exception level for which all stages of translation are disabled.
>>> Specifically, the core is permitted to speculatively fetch from the
>>> 4KB region containing the current program counter 4K and next 4K.
>>>
>>> When translation is changed from enabled to disabled for the running
>>> exception level (SCTLR_ELn[M] changed from a value of 1 to 0), the
>>> Falkor core may errantly speculatively access memory locations outside
>>> of the 4KB region permitted by the architecture. The errant memory
>>> access may lead to one of the following unexpected behaviors.
>>>
>>> 1) A System Error Interrupt (SEI) being raised by the Falkor core due
>>> to the errant memory access attempting to access a region of memory
>>> that is protected by a slave-side memory protection unit.
>>> 2) Unpredictable device behavior due to a speculative read from device
>>> memory. This behavior may only occur if the instruction cache is
>>> disabled prior to or coincident with translation being changed from
>>> enabled to disabled.
>>>
>>> The conditions leading to this erratum will not occur when either of the
>>> following occur:
>>> 1) A higher exception level disables translation of a lower exception level
>>> (e.g. EL2 changing SCTLR_EL1[M] from a value of 1 to 0).
>>> 2) An exception level disabling its stage-1 translation if its stage-2
>>> translation is enabled (e.g. EL1 changing SCTLR_EL1[M] from a value of 1
>>> to 0 when HCR_EL2[VM] has a value of 1).
>>>
>>> To avoid the errant behavior, software must execute an ISB immediately
>>> prior to executing the MSR that will change SCTLR_ELn[M] from 1 to 0.
>>>
>>> Signed-off-by: Shanker Donthineni <[email protected]>
>>> ---
>>> Changes since v3:
>>> Rebased to kernel v4.15-rc1.
>>> Changes since v2:
>>> Repost the corrected patches.
>>> Changes since v1:
>>> Apply the workaround where it's required.
>>>
>>> Documentation/arm64/silicon-errata.txt | 1 +
>>> arch/arm64/Kconfig | 12 +++++++++++-
>>> arch/arm64/include/asm/assembler.h | 19 +++++++++++++++++++
>>> arch/arm64/include/asm/cpucaps.h | 3 ++-
>>> arch/arm64/kernel/cpu-reset.S | 1 +
>>> arch/arm64/kernel/cpu_errata.c | 16 ++++++++++++++++
>>> arch/arm64/kernel/efi-entry.S | 2 ++
>>> arch/arm64/kernel/head.S | 1 +
>>> arch/arm64/kernel/relocate_kernel.S | 1 +
>>> arch/arm64/kvm/hyp-init.S | 1 +
>>
>> This is an awful lot of code just to add an ISB instruction prior to
>> disabling the MMU. Why do you need to go through the alternatives framework
>> for this? Just do it with an #ifdef; this isn't a fastpath.
>>
>
> We can avoid changes to only two files cpu_errata.c and cpucaps.h without using
> the alternatives framework. Even though it's in slow path, cpu-errata.c changes
> provides a nice debug message which indicates the erratum E1041 is applied.
>
> Erratum log information would be very useful to conform our customers using the
> right kernel with E1014 patch by looking at dmesg. Other than that I don't have
> any other strong opinion to avoid alternatives and handle using #idef.
>
> Should I go ahead and post v5 patch without alternatives?
>


Please provide your thoughts on next step. We would like to merge this erratum
to v4.15 kernel.

>> Will
>>
>> _______________________________________________
>> linux-arm-kernel mailing list
>> [email protected]
>> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
>>
>

--
Shanker Donthineni
Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc.
Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.