Received: by 2002:a25:683:0:0:0:0:0 with SMTP id 125csp343012ybg; Wed, 10 Jun 2020 02:13:23 -0700 (PDT) X-Google-Smtp-Source: ABdhPJygr5ofgjcEy9P2kkMxmtuTexJ+LBNiG6YUl0jpR6jOG89A8Ty2M3Ylv0IT37yliTj+AEHf X-Received: by 2002:a50:f052:: with SMTP id u18mr1604153edl.16.1591780403254; Wed, 10 Jun 2020 02:13:23 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1591780403; cv=none; d=google.com; s=arc-20160816; b=plExLWnShqeEqHAv66GSn2fNGM3noOWwF9coUL1pfngd8C++GZTuKAitsWW4dNc82h oHJuRMGdzH8324EYlo1W66t8bMmrc5OAzHzHnfSdV328Zv/RGoX2rgKstT8TVrN3lvW6 syhL9zFqguPkbd4gDccV7b3ttGlU1ga4BSZpccbLtuaoyqquRv76Ufa1VEHcMsiXDXR3 TJ12KG+G5A63cPJdoUYZXGGnDgrs64G8hfwiyFg3ahPSVFpk5Yth8b16J/FfsG5P3Yz2 nOWkFQD016OWulEzG6tgVx7O3usIkJ3IvxAOl6I+H8zsPawNdBteb7wzdVwZ8sZn8wcC JaWQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:in-reply-to:message-id:date :subject:cc:to:from:ironport-sdr:dkim-signature; bh=KPPDIbWDIIi7NcSqSfqP/mK+TVSC91eXbOcvGk49e8c=; b=yaFquwvPnfFCeA7xAraq8ZrRysKSZs2RjMQfWO2bETjyO7ejc0ZdePLbBYWPmzeBk5 B5w6v41HsgLjvNQ+KBy3LcXyaB4X952JSGNr5+DokBdNUXFZfLXMg50v7XksnlArWPKj edufF3CGp0wDo+K7bnMsiiF1D2Miu5LRoc0suxe8vaZVqCHRKjqv5Ffn+bZ18AG9hcYg vTIemAquaJDIQf3UZDPZ2RauYYOgU+k2DzE1QTkBUuz0azhOFH3CWgL3T3W42DDkRaI2 0O9laKT4CSRY4seSNQZ9gdmD0ygcHNEOTcWHL914iw5WM2oeYn6rh7hrJVf8GfpXIz2N rvlA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@amazon.com header.s=amazon201209 header.b=nWWIozjE; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=amazon.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id y7si10118467edo.557.2020.06.10.02.12.59; Wed, 10 Jun 2020 02:13:23 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@amazon.com header.s=amazon201209 header.b=nWWIozjE; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=amazon.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727827AbgFJJJ4 (ORCPT + 99 others); Wed, 10 Jun 2020 05:09:56 -0400 Received: from smtp-fw-9101.amazon.com ([207.171.184.25]:38221 "EHLO smtp-fw-9101.amazon.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726961AbgFJJJz (ORCPT ); Wed, 10 Jun 2020 05:09:55 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amazon.com; i=@amazon.com; q=dns/txt; s=amazon201209; t=1591780194; x=1623316194; h=from:to:cc:subject:date:message-id:in-reply-to: mime-version; bh=KPPDIbWDIIi7NcSqSfqP/mK+TVSC91eXbOcvGk49e8c=; b=nWWIozjEJsaZvpDBF2HNiKSMhlCwlvVppEcAPoeYdms8p7OqPnAKrjZT pH3nD7ITiU1FsW5fe27zLB2ySOQg6EZkWZ8voLa0rqWOrUIXiZGhp9aj1 kJMn329fAGMiXLRkwvjDaAdic0swnjLNGRKTsMPtvU+eqEIdpeUHrPts/ k=; IronPort-SDR: 0edSOZcR2gKDa4qC/LHclU+V8vU3A6CUjYH5h59O4SLzvbQaROCJoduC9+yJqjIvWrXFweMGjp zPrGWzlyVzkw== X-IronPort-AV: E=Sophos;i="5.73,495,1583193600"; d="scan'208";a="42946930" Received: from sea32-co-svc-lb4-vlan3.sea.corp.amazon.com (HELO email-inbound-relay-2a-d0be17ee.us-west-2.amazon.com) ([10.47.23.38]) by smtp-border-fw-out-9101.sea19.amazon.com with ESMTP; 10 Jun 2020 09:09:51 +0000 Received: from EX13MTAUEA002.ant.amazon.com (pdx4-ws-svc-p6-lb7-vlan3.pdx.amazon.com [10.170.41.166]) by email-inbound-relay-2a-d0be17ee.us-west-2.amazon.com (Postfix) with ESMTPS id E9D9FA255A; Wed, 10 Jun 2020 09:09:50 +0000 (UTC) Received: from EX13D31EUA001.ant.amazon.com (10.43.165.15) by EX13MTAUEA002.ant.amazon.com (10.43.61.77) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Wed, 10 Jun 2020 09:09:50 +0000 Received: from u886c93fd17d25d.ant.amazon.com (10.43.162.53) by EX13D31EUA001.ant.amazon.com (10.43.165.15) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Wed, 10 Jun 2020 09:09:43 +0000 From: SeongJae Park To: Joe Perches CC: SeongJae Park , , , , , , SeongJae Park Subject: Re: Re: Re: [PATCH v3 1/2] checkpatch: support deprecated terms checking Date: Wed, 10 Jun 2020 11:09:27 +0200 Message-ID: <20200610090927.2383-1-sjpark@amazon.com> X-Mailer: git-send-email 2.17.1 In-Reply-To: <9fcbcf91beb884ef8eac64471fe769edebce660c.camel@perches.com> (raw) MIME-Version: 1.0 Content-Type: text/plain X-Originating-IP: [10.43.162.53] X-ClientProxiedBy: EX13D02UWC003.ant.amazon.com (10.43.162.199) To EX13D31EUA001.ant.amazon.com (10.43.165.15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 10 Jun 2020 01:45:41 -0700 Joe Perches wrote: > On Wed, 2020-06-10 at 10:01 +0200, SeongJae Park wrote: > > On Wed, 10 Jun 2020 00:13:42 -0700 Joe Perches wrote: > [] > > > This is a direct copy of the spelling dictionary > > > loading code, so maybe these could be consolidated. > > > > Agreed, how about below one? > [] > > diff --git a/scripts/checkpatch.pl b/scripts/checkpatch.pl > > index 524df88f9364..226f24e1f1f3 100755 > [] > > +sub read_word_corrections { > > + my ($file) = @_; > > + my $suspects; > > + my %fixes; > > Right. > > But I think this should take a hash reference > as the second argument so the complete hash > isn't created and returned. > > [] > > > +# Load deprecated terms and build regular expression list. > > +my %deprecated_terms_fix = read_word_corrections($deprecated_terms_file); > > So this might be something like: > > my %deprecated_terms; > read_word_corrections($deprecated_terms_file, \%deprecated_terms); > > etc... Appreciate your nice suggestion! So, I updated the patch as below: ================================= >8 ========================================== From 0bcba551f429b0ccec4183437098b3b961d0a724 Mon Sep 17 00:00:00 2001 From: SeongJae Park Date: Wed, 10 Jun 2020 07:11:57 +0200 Subject: [PATCH] checkpatch: support deprecated terms checking Some terms could be deprecated for various reasons, but it is hard to update the entire old usages. That said, we could at least encourage new patches to use the suggested replacements. This commit adds check of deprecated terms in the 'checkpatch.pl' for that. The script will get deprecated terms and suggested replacements of those from 'scripts/deprecated_terms.txt' file and warn if the deprecated terms are used. The mechanism and the format of the file are almost the same as that of 'spelling.txt'. For the reason, this commit modularizes the read of the 'spelling.txt' and reuses. Signed-off-by: SeongJae Park --- scripts/checkpatch.pl | 60 +++++++++++++++++++++++++++--------- scripts/deprecated_terms.txt | 5 +++ 2 files changed, 50 insertions(+), 15 deletions(-) create mode 100644 scripts/deprecated_terms.txt diff --git a/scripts/checkpatch.pl b/scripts/checkpatch.pl index 524df88f9364..c672091932bb 100755 --- a/scripts/checkpatch.pl +++ b/scripts/checkpatch.pl @@ -57,6 +57,7 @@ my $max_line_length = 100; my $ignore_perl_version = 0; my $minimum_perl_version = 5.10.0; my $min_conf_desc_length = 4; +my $deprecated_terms_file = "$D/deprecated_terms.txt"; my $spelling_file = "$D/spelling.txt"; my $codespell = 0; my $codespellfile = "/usr/share/codespell/dictionary.txt"; @@ -692,29 +693,40 @@ our $allowed_asm_includes = qr{(?x: )}; # memory.h: ARM has a custom one -# Load common spelling mistakes and build regular expression list. -my $misspellings; -my %spelling_fix; +sub read_word_corrections { + my ($file, $fixesRef) = @_; + my $suspects; -if (open(my $spelling, '<', $spelling_file)) { - while (<$spelling>) { - my $line = $_; + if (open(my $corrections, '<', $file)) { + while (<$corrections>) { + my $line = $_; - $line =~ s/\s*\n?$//g; - $line =~ s/^\s*//g; + $line =~ s/\s*\n?$//g; + $line =~ s/^\s*//g; - next if ($line =~ m/^\s*#/); - next if ($line =~ m/^\s*$/); + next if ($line =~ m/^\s*#/); + next if ($line =~ m/^\s*$/); - my ($suspect, $fix) = split(/\|\|/, $line); + my ($suspect, $fix) = split(/\|\|/, $line); - $spelling_fix{$suspect} = $fix; + $fixesRef->{$suspect} = $fix; + } + close($corrections); + } else { + warn "No correction will be found - file '$file': $!\n"; } - close($spelling); -} else { - warn "No typos will be found - file '$spelling_file': $!\n"; } +# Load deprecated terms and build regular expression list. +my %deprecated_terms_fix; +read_word_corrections($deprecated_terms_file, \%deprecated_terms_fix); +my $deprecated_terms = join("|", sort keys %deprecated_terms_fix) if keys %deprecated_terms_fix; + +# Load common spelling mistakes and build regular expression list. +my $misspellings; +my %spelling_fix; +read_word_corrections($spelling_file, \%spelling_fix); + if ($codespell) { if (open(my $spelling, '<', $codespellfile)) { while (<$spelling>) { @@ -2957,6 +2969,24 @@ sub process { } } +# Check for deprecated terms + if (defined($deprecated_terms) && + ($in_commit_log || $line =~ /^(?:\+|Subject:)/i)) { + while ($rawline =~ /(?:^|[^a-z@])($deprecated_terms)(?:\b|$|[^a-z@])/gi) { + my $deprecated_term = $1; + my $suggested = $deprecated_terms_fix{lc($deprecated_term)}; + $suggested = ucfirst($suggested) if ($deprecated_term=~ /^[A-Z]/); + $suggested = uc($suggested) if ($deprecated_term =~ /^[A-Z]+$/); + my $msg_level = \&WARN; + $msg_level = \&CHK if ($file); + if (&{$msg_level}("DEPRECATED_TERM", + "Use of '$deprecated_term' is deprecated, please '$suggested', instead.\n" . $herecurr) && + $fix) { + $fixed[$fixlinenr] =~ s/(^|[^A-Za-z@])($deprecated_term)($|[^A-Za-z@])/$1$suggested$3/; + } + } + } + # Check for various typo / spelling mistakes if (defined($misspellings) && ($in_commit_log || $line =~ /^(?:\+|Subject:)/i)) { diff --git a/scripts/deprecated_terms.txt b/scripts/deprecated_terms.txt new file mode 100644 index 000000000000..6faa06451c3d --- /dev/null +++ b/scripts/deprecated_terms.txt @@ -0,0 +1,5 @@ +# License: GPLv2 +# +# The format of each line is: +# deprecated||suggested +# -- 2.17.1