Received: by 2002:a05:6358:7058:b0:131:369:b2a3 with SMTP id 24csp581025rwp; Wed, 12 Jul 2023 19:27:11 -0700 (PDT) X-Google-Smtp-Source: APBJJlF+r0qcBQ0fGFXcAyLwyqvbtWBVrCZpXKY4tPIV3ZHEI3DH2GxfodjAR58DZZ8u8fmjPjQa X-Received: by 2002:a05:6a20:13da:b0:132:c07c:f042 with SMTP id ho26-20020a056a2013da00b00132c07cf042mr89161pzc.15.1689215230810; Wed, 12 Jul 2023 19:27:10 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689215230; cv=none; d=google.com; s=arc-20160816; b=yBzUL9Yh8H1BqS/8Z3eurErukC+LDpuH0KLK9FaZ9IT+smWX9ycCSXrR849cjofbbe HmPTgX7M4Kb10n8DkLGvLhFiPzTDmwwp7rkqx3i2zCUiTX+L/D0bh8eJHDJUODFnLMEA X2Ko0eg4MwjKmvWRczJ5Of/dp33UGfeE4R6iELO8AtSai57ZWY3Xx0jTQ2+LQacQuJOZ r/A3wN6ECOtXiIy7zIBKUpr/9DNWEpeNeu1y22qVumgHN5H0zz/+qk5PmHAVd0T0k/yx R0b+f7YJxlZG32UC54u/9Ow2NqEgW9rc1LFY2/SjTJR40i47bc9ktDT3luoizk9HCoLn hGQQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:subject:user-agent:mime-version:date:message-id :dkim-signature; bh=CYrBCdIbGLhs5PrJjQ2ca042r/AYgAnEwWB5RHs7viU=; fh=w/4poKBFZw6yesctBPHdRVITyzEBziWDOeAk169hzHU=; b=aNboTosceCOgl5Rh9aZi1O9nalWYMfGaPyGDsmzSkQ74TQJ6FpSDBYpad6277734LS OBFbs7cXvYJqt8Q0gc8fVJ6bxo0MyLiEmwd+1Aa2+8Zfn2A5ONe6zsdbA8LpCjRm9zIR zZygR0aBNZs86P/xk0AFGr7dvzh+z4HH3lSo7CH5I9qS7POJMsrvygO+qz08jmV157HF QV5LAHGBhESeN+TcD9wideCdIGYzafFH0P74aWGPO3GWCyrnnbv0F9/L2ZVDb21QGeyU nRwHFpp/T7//5jnVcrEs3KkBfOmcJh9M92K7S0VeTvN2I77l8r+jjmp1wr3T5PgncqTo prMA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcppdkim1 header.b=eYNH+jgy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id q8-20020a170902a3c800b001b0163c9715si4227900plb.318.2023.07.12.19.26.59; Wed, 12 Jul 2023 19:27:10 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcppdkim1 header.b=eYNH+jgy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233599AbjGMCYv (ORCPT + 99 others); Wed, 12 Jul 2023 22:24:51 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50532 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232547AbjGMCYu (ORCPT ); Wed, 12 Jul 2023 22:24:50 -0400 Received: from mx0b-0031df01.pphosted.com (mx0b-0031df01.pphosted.com [205.220.180.131]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E853312D; Wed, 12 Jul 2023 19:24:47 -0700 (PDT) Received: from pps.filterd (m0279873.ppops.net [127.0.0.1]) by mx0a-0031df01.pphosted.com (8.17.1.19/8.17.1.19) with ESMTP id 36D2Igia005681; Thu, 13 Jul 2023 02:24:31 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=quicinc.com; h=message-id : date : mime-version : subject : to : cc : references : from : in-reply-to : content-type : content-transfer-encoding; s=qcppdkim1; bh=CYrBCdIbGLhs5PrJjQ2ca042r/AYgAnEwWB5RHs7viU=; b=eYNH+jgy4o8Q3VTYafY4oXW/HE1rTNrumJzBWF6IuRJtDCe5ffrHeBFdVzp6U9hGfFk2 xm1dMQi6lg6oMz1L+GfoneGh3Vg/lYmZJ7dyMWQ2acETncAj8qKM0EI8KYgnYn0umKME 6nGkefzXKGZuLSByjZhY+qiNsT3Ksd12426a1wlByQq7yLjC26Y/uMep8VcSXJjgh7jG /7mm/DAnFT70cVrfNKqFCup7UnsIJF2eFqSLIYLZybPVJ4fFXHrOn95R+xL7i3GO7YwU 3d4SNEAwu36czxgh4d9hkL1mm5BmB0KQmgS271RymNQhcsY67Iw5uYuLuCkrlhU6OoMu ig== Received: from nasanppmta05.qualcomm.com (i-global254.qualcomm.com [199.106.103.254]) by mx0a-0031df01.pphosted.com (PPS) with ESMTPS id 3rshyuat7e-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Thu, 13 Jul 2023 02:24:31 +0000 Received: from nasanex01a.na.qualcomm.com (nasanex01a.na.qualcomm.com [10.52.223.231]) by NASANPPMTA05.qualcomm.com (8.17.1.5/8.17.1.5) with ESMTPS id 36D2OUJm023121 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Thu, 13 Jul 2023 02:24:30 GMT Received: from [10.239.133.73] (10.80.80.8) by nasanex01a.na.qualcomm.com (10.52.223.231) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1118.30; Wed, 12 Jul 2023 19:24:27 -0700 Message-ID: <604ac52d-4336-744f-2ab8-44d1c93fbaa8@quicinc.com> Date: Thu, 13 Jul 2023 10:24:24 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.11.2 Subject: Re: [PATCH] arm64: Add the arm64.nolse_atomics command line option To: Mark Rutland CC: Will Deacon , , , , , , , , , , , References: <20230710055955.36551-1-quic_aiquny@quicinc.com> <20230710093751.GC32673@willie-the-truck> <5cf15f85-0397-96f7-4110-13494551b53b@quicinc.com> <20230711082226.GA1554@willie-the-truck> <84f0994a-26de-c20a-a32f-ec8fe41df3a3@quicinc.com> <20230711102510.GA1809@willie-the-truck> <67c2621f-4cad-2495-9785-7737246d3e90@quicinc.com> From: "Aiqun(Maria) Yu" In-Reply-To: Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: 7bit X-Originating-IP: [10.80.80.8] X-ClientProxiedBy: nasanex01b.na.qualcomm.com (10.46.141.250) To nasanex01a.na.qualcomm.com (10.52.223.231) X-QCInternal: smtphost X-Proofpoint-Virus-Version: vendor=nai engine=6200 definitions=5800 signatures=585085 X-Proofpoint-GUID: XVYfbToG5UFlZZTNlJeDPdYI_6IcNUrz X-Proofpoint-ORIG-GUID: XVYfbToG5UFlZZTNlJeDPdYI_6IcNUrz X-Proofpoint-Virus-Version: vendor=baseguard engine=ICAP:2.0.254,Aquarius:18.0.957,Hydra:6.0.591,FMLib:17.11.176.26 definitions=2023-07-12_17,2023-07-11_01,2023-05-22_02 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 impostorscore=0 malwarescore=0 bulkscore=0 spamscore=0 mlxlogscore=999 phishscore=0 clxscore=1011 mlxscore=0 adultscore=0 suspectscore=0 lowpriorityscore=0 priorityscore=1501 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2305260000 definitions=main-2307130018 X-Spam-Status: No, score=-2.2 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 7/12/2023 3:36 PM, Mark Rutland wrote: > On Wed, Jul 12, 2023 at 11:09:10AM +0800, Aiqun(Maria) Yu wrote: >> On 7/11/2023 6:25 PM, Will Deacon wrote: >>> On Tue, Jul 11, 2023 at 06:15:49PM +0800, Aiqun(Maria) Yu wrote: >>>> On 7/11/2023 4:22 PM, Will Deacon wrote: >>>>> On Tue, Jul 11, 2023 at 12:02:22PM +0800, Aiqun(Maria) Yu wrote: >>>>>> On 7/10/2023 5:37 PM, Will Deacon wrote: >>>>>>> On Mon, Jul 10, 2023 at 01:59:55PM +0800, Maria Yu wrote: >>>>>>>> In order to be able to disable lse_atomic even if cpu >>>>>>>> support it, most likely because of memory controller >>>>>>>> cannot deal with the lse atomic instructions, use a >>>>>>>> new idreg override to deal with it. >>>>>>> >>>>>>> This should not be a problem for cacheable memory though, right? >>>>>>> >>>>>>> Given that Linux does not issue atomic operations to non-cacheable mappings, >>>>>>> I'm struggling to see why there's a problem here. >>>>>> >>>>>> The lse atomic operation can be issued on non-cacheable mappings as well. >>>>>> Even if it is cached data, with different CPUECTLR_EL1 setting, it can also >>>>>> do far lse atomic operations. >>>>> >>>>> Please can you point me to the place in the kernel sources where this >>>>> happens? The architecture doesn't guarantee that atomics to non-cacheable >>>>> mappings will work, see "B2.2.6 Possible implementation restrictions on >>>>> using atomic instructions". Linux, therefore, doesn't issue atomics >>>>> to non-cacheable memory. >>>> >>>> We encounter the issue on third party kernel modules and third party apps >>>> instead of linux kernel itself. >>> >>> Great, so there's nothing to do in the kernel then! >>> >>> The third party code needs to be modified not to use atomic instructions >>> with non-cacheable mappings. No need to involve us with that. >> >>>> This is a tradeoff of performance and stability. Per my understanding, >>>> options can be used to enable the lse_atomic to have the most performance >>>> cared system, and disable the lse_atomic by stability cared most system. >>> >>> Where do livelock and starvation fit in with "stability"? Disabling LSE >>> atomics for things like qspinlock and the scheduler just because of some >>> badly written third-party code isn't much of a tradeoff. > >> We also have requirement to have cpus/system fully support lse atomic and >> cpus/system not fully support lse atomic with a generic kernel image. > > Who *specifically* has this requirement (i.e. what does 'we' mean here)? The I can use other word to describe the requirement instead of "we". There is requirements like android google gki. It request different cpu arch system to use same generic kernel Image. > upstream kernel does not require that atomics work on non-cacheable memory, and The same issue the system can be down of lse atomic not supported for cachable memory when there need far atomic. > saying "The company I work for want this" doesn't change that. > > AFAICT the system here is architecturally compliant, and what you're relying > upon something that the architecture doesn't guarantee, and Linux doesn't > guarantee. It is not also only our company's problem: To support the atomic instructions added in the Armv8.1 architecture, CHI-B provides Atomic Transactions. while Atomic Transactions support is also *optional* from CHI-B. So far atomic cannot fully supported by ARMv8.1 cpu + CHI-B system as well. from: https://developer.arm.com/documentation/102407/0100/Atomic-operations?lang=en So only cpu support atomic cannot garantee the system support lse atomic > >> Same kernel module wanted to be used by lse atomic fully support cpu and not >> fully support cpu/system as well. > > Which kernel modules *specifically* need to do atomics to non-cacheable memory? The driver want to always do far atomic(no speculatively) and allow a read-modify-write non-interruptible sequence in a single instruction. > >> That's why we want to have a runtime option here. > > As per other replies, a runtime option doesn't solve the issue you have > described, and it will adversely affect the system in other ways (e.g. the > livelock and starvation issues will mentioned, which we have seen with > LDXR+STXR atomics). I myself also have encounter issues from livelock because of LDXR+STXR atomics unfairness before. More likely happened when different performance cpu. So myself also glad to using atomics instead of exclusive access. So if there is a way to fully utilize the atomic instructions for current hardware, and also support the far atomic, that can be much better solution than currently disable the feature. > > Thanks, > Mark. Pls feel free to comments. It would lead to a reasonable and usable solution from our discussions. -- Thx and BRs, Aiqun(Maria) Yu