Date: Fri, 1 Dec 2023 23:34:40 +0800
Subject: Re: [GIT PULL] Pin control fixes for v6.7
From: "Aiqun(Maria) Yu"
To: Linus Walleij
CC: Linus Torvalds, "open list:GPIO SUBSYSTEM", linux-kernel, Charles Keepax, Chester Lin
In-Reply-To: <9714f76f-0cf3-4f54-8e0d-fe3ca2990eed@quicinc.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On 12/1/2023 6:06 PM, Aiqun(Maria) Yu wrote:
> On 12/1/2023 4:10 PM, Linus Walleij wrote:
>> Hi Nathan, Nick,
>>
>> (just picking some LLVM compiler people I know of... and trust)
>>
>> Context is this patch:
>> https://lore.kernel.org/linux-gpio/20231115102824.23727-1-quic_aiquny@quicinc.com/
>>
>> On Thu, Nov 30, 2023 at 6:37 AM Aiqun(Maria) Yu wrote:
>>> On 11/29/2023 11:08 PM, Linus Walleij wrote:
>>>> On Wed, Nov 29, 2023 at 3:56 PM Linus Torvalds wrote:
>>>>> On Wed, 29 Nov 2023 at 04:09, Linus Walleij wrote:
>>>>>>
>>>>>> The most interesting patch is the list iterator fix in the core by Maria
>>>>>> Yu, it took a while for me to realize what was going on there.
>>>>>
>>>>> That commit message still doesn't explain what the problem was.
>>>>>
>>>>> Why is p->state volatile there?
>>>>> It seems to be a serious locking bug
>>>>> if p->state can randomly change there, and the READ_ONCE() looks like
>>>>> a "this hides the problem" rather than an actual real fix.
>>>
>>> This is indeed an interesting issue. Thx for the comment, Linus.
>>> **Let me explain how "p->state becomes volatile in the list iterator"
>>> and "why READ_ONCE is suggested".
>>>
>>> The current critical code is:
>>>     list_for_each_entry(setting, &p->state->settings, node)
>>>
>>> After expanding the list_for_each_entry macro, the critical code
>>> becomes:
>>>     for (setting = list_first_entry(&p->state->settings, typeof(*setting), node);
>>>          &setting->node != (&p->state->settings);
>>>          setting = list_next_entry(setting, node))
>>>
>>> The asm code (result from Clang version 10.0) clearly shows where
>>> p->state is reloaded:
>>> loop:
>>>     ldr x22,[x22]          ; x22 = list_next_entry(setting, node)
>>>     add x9,x8,#0x18        ; x9 = &p->state->settings
>>>     cmp x22,x9             ; setting, x9
>>>     b.eq 0xFFFFFF9B24483530
>>>
>>>     ldr w9,[x22,#0x10]     ; w9, [setting,#16]
>>>     cmp w9,#0x2            ; w9, #2
>>>     b.ne 0xFFFFFF9B24483504
>>>
>>>     mov x0,x22             ; x0, setting
>>>     bl 0xFFFFFF9B24486048  ; pinmux_disable_setting
>>>
>>>     ldr x8,[x19,#0x28]     ; x19 = p, x8 = p->state, *reload p->state*
>>>     b loop
>>>
>>> The *reload p->state* inside the loop is not needed for the check and
>>> can cause a possible infinite loop. So READ_ONCE is highly suggested even
>>> if p->state does not change randomly. The unnecessary "ldr
>>> x8,[x19,#0x28]" can then be removed from the loop code above.
>>>
>>> **Comments about the locking bug:
>>> Currently pinctrl_select_state is an exported symbol and has no
>>> effective reentrancy protection. That is why the infinite loop
>>> issue was observed when a customer called pinctrl_select_state from
>>> multiple threads without lock protection. pinctrl_select_state
>>> relies entirely on the calling driver module to prevent reentrant calls.
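A minimal userspace sketch of the re-read problem described above, assuming simplified stand-ins for the kernel list helpers. The struct layouts, walk_rereading(), walk_snapshot() and the demo_* functions are hypothetical names for illustration, not kernel code. Swapping p->state inside the loop imitates an unprotected concurrent pinctrl_select_state() call, and MAX_STEPS exists only so the demonstration terminates.

```c
#include <assert.h>
#include <stddef.h>

/* Simplified stand-in for the kernel's struct list_head (illustrative only). */
struct list_head { struct list_head *next, *prev; };

static void list_init(struct list_head *h) { h->next = h->prev = h; }

static void list_add_tail(struct list_head *n, struct list_head *h)
{
    n->prev = h->prev;
    n->next = h;
    h->prev->next = n;
    h->prev = n;
}

struct state   { struct list_head settings; };
struct setting { struct list_head node; int id; };
struct pinctrl { struct state *state; };

#define MAX_STEPS 16 /* safety bound so the sketch always terminates */

/* Termination check re-reads p->state on every pass, like the reloaded
 * "ldr x8,[x19,#0x28]" in the quoted asm. Returns the number of nodes
 * visited, or -1 if the safety bound was hit. */
static int walk_rereading(struct pinctrl *p, struct state *swap_to)
{
    int steps = 0;
    struct list_head *pos;

    for (pos = p->state->settings.next;
         pos != &p->state->settings;   /* p->state re-read here */
         pos = pos->next) {
        if (++steps >= MAX_STEPS)
            return -1;                 /* would spin forever without the bound */
        p->state = swap_to;            /* imitates a concurrent state switch */
    }
    return steps;
}

/* Same walk, but the list head is taken from a local snapshot read once. */
static int walk_snapshot(struct pinctrl *p, struct state *swap_to)
{
    struct state *old_state = p->state; /* single read, kept for the whole loop */
    int steps = 0;
    struct list_head *pos;

    for (pos = old_state->settings.next;
         pos != &old_state->settings;
         pos = pos->next) {
        if (++steps >= MAX_STEPS)
            return -1;
        p->state = swap_to;
    }
    return steps;
}

/* Builds state a with n settings, an empty state b, and selects a. */
static void build(struct pinctrl *p, struct state *a, struct state *b,
                  struct setting *s, int n)
{
    assert(n > 0);
    list_init(&a->settings);
    list_init(&b->settings);
    for (int i = 0; i < n; i++) {
        s[i].id = i;
        list_add_tail(&s[i].node, &a->settings);
    }
    p->state = a;
}

int demo_reread(void)
{
    struct pinctrl p; struct state a, b; struct setting s[2];
    build(&p, &a, &b, s, 2);
    return walk_rereading(&p, &b);
}

int demo_snapshot(void)
{
    struct pinctrl p; struct state a, b; struct setting s[2];
    build(&p, &a, &b, s, 2);
    return walk_snapshot(&p, &b);
}
```

In this sketch the re-reading walk compares the cursor against the new state's list head while still following the old state's list, so it never meets its termination condition; the snapshot walk stops after the two settings. The snapshot only keeps the loop finite; it does not make concurrent callers safe.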
>>>
>>> Usually the suggested usage for a driver that uses pinctrl
>>> would be:
>>> LOCK;
>>> pinctrl_select_state();
>>> gpio pulling;
>>> udelay();
>>> check state;
>>> other hardware behaviors;
>>> UNLOCK;
>>>
>>> So I have told the customer to fix the locking bug in their own
>>> driver, since a single pinctrl_select_state call usually cannot
>>> complete the hardware state transition on its own.
>>>
>>> I am also fine with adding a small per-pinctrl lock that only protects
>>> against the current pinctrl_select_state->pinctrl_commit_state reentrancy
>>> issue. Pls could any pinctrl maintainer comment on whether that is wanted?
>>> I can prepare a change as well.
>>
>> Luckily I am the pin control maintainer :D
>> And I also had my morning coffee and looked over the patch again.
>>
>> So tilting the compiler to generate code that is less prone to race
>> conditions with READ_ONCE() isn't really the solution, is it? We need
>> to introduce a proper lock that stops this from happening if it is
>> a problem people are facing.
>>
>> Can you try to make a patch that removes READ_ONCE()
>> and introduces a lock instead?
>>
>> Racing is rarely an issue in pin control for reasons explained
>> in another context here:
>> https://lore.kernel.org/linux-gpio/CACRpkdZ0cnJpYuzU=47-oW-7N_YGMo2vXpKOeXeNi5PhPY7QMA@mail.gmail.com/
>>
>> ...but if people still manage to run into it, we better have a lock
>> there. Just make sure it is not just an issue with out-of-tree code,
>> but a real problem?
>>
>> The change that changes the code to use the old_state variable
>> should stay in, it makes the code more readable, it's just the
>> READ_ONCE() macro which is dubious.
> Thx for confirming. I am preparing the new change now.
> :)
The change is uploaded here:
https://lore.kernel.org/linux-gpio/20231201152931.31161-1-quic_aiquny@quicinc.com/
>
> READ_ONCE can only avoid the possible infinite loop and keep the
> whole kernel from crashing, while the lock is needed so that multiple
> parallel calls of the pinctrl_commit_state API produce a consistent,
> atomic hardware result as well.
>>
>> Yours,
>> Linus Walleij
>
--
Thx and BRs,
Aiqun(Maria) Yu
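
The per-pinctrl lock asked for in this thread could look roughly like the userspace sketch below. It is an illustration under assumptions, not the uploaded patch: pthread_mutex_t stands in for a kernel struct mutex, and the struct layout, the commits counter, and the pinctrl_sketch_* names are hypothetical.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>

struct state { int id; };

struct pinctrl {
    pthread_mutex_t lock;  /* hypothetical per-pinctrl lock */
    struct state *state;   /* currently selected state */
    int commits;           /* counts real state changes, for illustration */
};

void pinctrl_sketch_init(struct pinctrl *p, struct state *initial)
{
    pthread_mutex_init(&p->lock, NULL);
    p->state = initial;
    p->commits = 0;
}

/* Selects a new state. The whole read-modify-write of p->state happens
 * under the lock, so a second caller cannot change p->state while the
 * first one is still working with the old state. Always returns 0 here. */
int pinctrl_sketch_select_state(struct pinctrl *p, struct state *next)
{
    pthread_mutex_lock(&p->lock);
    if (p->state != next) {
        struct state *old_state = p->state;

        /* A real implementation would walk old_state's settings to
         * disable them and then apply next's settings at this point. */
        (void)old_state;

        p->state = next;
        p->commits++;
    }
    pthread_mutex_unlock(&p->lock);
    return 0;
}
```

With the lock held across the whole commit, selecting the state that is already active is a no-op, and parallel callers are serialized instead of observing a half-switched p->state. A driver that needs the longer LOCK ... UNLOCK sequence quoted earlier would still need its own lock around that sequence.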