Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp454049pxj; Wed, 16 Jun 2021 06:21:29 -0700 (PDT) X-Google-Smtp-Source: ABdhPJw0zhxXrR798ZakKNV9oKjocPpUM0MEzfOkAwH0lylTfCQI1MSJMjhd1O8sECYJmK73Q51n X-Received: by 2002:a92:c748:: with SMTP id y8mr3626865ilp.41.1623849689014; Wed, 16 Jun 2021 06:21:29 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1623849689; cv=none; d=google.com; s=arc-20160816; b=zbq/IsovERrNYTyPKX2RK4gsDYcYN++rdNyn0QE2tRC4Rzdad47zuSDtILk0xTSWKS gngtXgX1zcCOIjqQNpoIUnAgivRSgnda6kvGfOO1KZZlC7W7kuKFPeTAjPc3OUZ9gPla eW/dfWOqRnNaIfKto/U1VzAk/o0nbI1FDVV6WKQVG4AN/i3iDacqnI7Njo29oFlcigNI 1o3ihOi7RZ0a1TTwen39O1FRs0NffYK5pKcUmTMtnoto7qmihyAs728OBbLkrFrDhu7X yBUTiw6ZL0mVJ/F4P69xYVReQIX8Z3pT2B6Aq09//PDyawf9ThVENn8RPLfdMbBdMOC3 rx9Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:references:to :from:subject; bh=a4DgXdZztHIZy+Z+14KU/QTM1xUY+v68SOecv6whdLE=; b=skrA0xfN3iBwCX7Z5PJMiqxIxfK8yLT8CGTKcmMI22di6YNlk4xp+/Al30fdqNeOQr IyvNWUKi/vDApy12r3ZjppIjsgbEHP3SELoB6P9uLGnc2FqQppv0VybnednHjWtd0usd 3bGvYOOoqktC5CYzfO0YCyo1ekoQWuEpe20v16OVOkB0HOpSHo6qHQ3ebUenOWsWAf5X /cmhsxt0O/GwkZqyWIPs5tk8sZW1uUNq7diFvBZSjv6ZH19uvKQLSDkiZAuQhaDbH8Gh wyPREzpqDfmkSCkv/VFzZVK4etekaC8o+2Bytd6MDsNjvpH5umhrIAtokfKiTN2hotJA yCPg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=huawei.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id b14si2395322ioj.18.2021.06.16.06.21.17; Wed, 16 Jun 2021 06:21:28 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232521AbhFPMAw (ORCPT + 99 others); Wed, 16 Jun 2021 08:00:52 -0400 Received: from szxga03-in.huawei.com ([45.249.212.189]:7335 "EHLO szxga03-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231808AbhFPMAw (ORCPT ); Wed, 16 Jun 2021 08:00:52 -0400 Received: from dggemv703-chm.china.huawei.com (unknown [172.30.72.53]) by szxga03-in.huawei.com (SkyGuard) with ESMTP id 4G4kBX5T4gz6yJV; Wed, 16 Jun 2021 19:54:44 +0800 (CST) Received: from dggpemm500006.china.huawei.com (7.185.36.236) by dggemv703-chm.china.huawei.com (10.3.19.46) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2176.2; Wed, 16 Jun 2021 19:58:43 +0800 Received: from [127.0.0.1] (10.174.179.0) by dggpemm500006.china.huawei.com (7.185.36.236) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2176.2; Wed, 16 Jun 2021 19:58:43 +0800 Subject: Re: [PATCH 1/3] scripts: add spelling_sanitizer.sh script From: "Leizhen (ThunderTown)" To: Joe Perches , Andrew Morton , Nicolas Dichtel , Jason Baron , Stefani Seibold , Jacob Keller , Thomas Graf , Herbert Xu , Jens Axboe , Petr Mladek , Sergey Senozhatsky , "Andy Shevchenko" , Rasmus Villemoes , linux-kernel , Colin Ian King , Kees Cook References: <20210611071241.16728-1-thunder.leizhen@huawei.com> <20210611071241.16728-2-thunder.leizhen@huawei.com> <6cff5719-9548-a49e-6c47-8bc92c5bd6b8@huawei.com> Message-ID: Date: Wed, 16 Jun 2021 19:58:41 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.7.0 MIME-Version: 1.0 In-Reply-To: <6cff5719-9548-a49e-6c47-8bc92c5bd6b8@huawei.com> Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.174.179.0] X-ClientProxiedBy: dggems702-chm.china.huawei.com (10.3.19.179) To dggpemm500006.china.huawei.com (7.185.36.236) X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2021/6/15 15:01, Leizhen (ThunderTown) wrote: > > > On 2021/6/11 23:36, Joe Perches wrote: >> On Fri, 2021-06-11 at 15:12 +0800, Zhen Lei wrote: >>> The file scripts/spelling.txt recorded a large number of >>> "mistake||correction" pairs. These entries are currently maintained in >>> order, but the results are not strict. In addition, when someone wants to >>> add some new pairs, he either sort them manually or write a script, which >>> is clearly a waste of labor. >> >> Try using lintian's make sort >> >> https://salsa.debian.org/lintian/lintian I installed lintian and found no option to support sort. Can anyone give me more specific instructions on how to use it? Although I don't understand the perl language, after reading commit 66b47b4a9dad ("checkpatch: look for common misspellings"), it seems to match from top to bottom. So, as Andy Shevchenko says, they should be sorted by frequency of the word usage. I really don't know the details of the implementation of scripts/checkpatch.pl --types=typo_spelling. Are only misspelled words involved in spelling.txt matching? Otherwise, if correctly spelled words are also traversed, sorting by frequency makes no sense. Because the correct number of words is far more than the wrong number of words. If that's the case, then my modified script could come in handy. And if only misspelled words involved in spelling.txt matching, do we really need spelling.txt? Just output the misspelled words is enough. I don't think anyone needs to follow the tips to complete the fix. >> >> > > Okay, I'll try it > >> >> . >>