Received: by 2002:a05:6358:489b:b0:bb:da1:e618 with SMTP id x27csp1887472rwn; Fri, 16 Sep 2022 02:19:07 -0700 (PDT) X-Google-Smtp-Source: AMsMyM4ilzjASWcVDRVFyCnWyrSDFGHYo+6UylsKjq8P6lHS4BFNYfzKstEUSahwRhfnMq6qcIhd X-Received: by 2002:a05:6402:b6c:b0:44e:2faf:1ba4 with SMTP id cb12-20020a0564020b6c00b0044e2faf1ba4mr3184461edb.191.1663319947149; Fri, 16 Sep 2022 02:19:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1663319947; cv=none; d=google.com; s=arc-20160816; b=fkmodh9zLkA886Kv/dj59uq9KjQ0ZUYblYrF/6dGsv5re7x14Pa6qXYYoAZKrOr9lh WiaDMyNwcPDceEayVA11Ovlyzr14bvFp782gf2Y/pXNizgW2t/B+rn7xZ6RPq97UTXAS fFo28c0CO4EuflnhiANxaEl9XLR+hoVALnAi5ZITBmbfRgseU/NE2MVddJZeSnWiOLT8 zO4fb2aT96zX5sgZq1K394xuSufSRI/okIqNy/E9YA/5lYFoHVg+jx2aYU3kke8cxett 1zuJpF+n60gznsmkSIucJx4zrkfISsLERqQpXl5oJ5z+AHzcLgoyF4U+7KA1xuycWifX KSCw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from; bh=Fj35Lif6yL3Bu2ZJ3dQsLMendSeBLlPysxkQLPSmAuk=; b=xPJhxlOWKMxe5cgDRxg2jN/i2NVhbXR/QTqDpg5WmWyPuuO+zyKS7PUsfhGK9N/eu7 ysPX4NoaeILulaTHKIImwV1vQZk9W6o5g8CF9KqP9o0DzAsLbBbee1DAZQ72M2dgPNB1 0L5sWvxKTle4+DXeJXJ+cwjwESsr5Py338uJDiVeR9fDVNnulVV7rJgOs9z3APCXlt/a BaHZCtELO9VwM/wjE58wwrP+mT73tj5IKYLmr+MazV/ENcUPV9pb7tW4T5Gx6pqOUxFG C3lIaBPFcy7SOLYN5yOlQ73JCarxoqtW/j0KyeVXkRf21b6sHCa+2f77b72oi+/TaLts MSIw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id p4-20020a170906604400b007309350f2f9si14621057ejj.555.2022.09.16.02.18.42; Fri, 16 Sep 2022 02:19:07 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230428AbiIPIrW (ORCPT + 99 others); Fri, 16 Sep 2022 04:47:22 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:51918 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230195AbiIPIrQ (ORCPT ); Fri, 16 Sep 2022 04:47:16 -0400 Received: from soltyk.jannau.net (soltyk.jannau.net [144.76.91.90]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C94B37AC1C for ; Fri, 16 Sep 2022 01:47:14 -0700 (PDT) Received: from robin.home.jannau.net (unknown [91.200.110.112]) by soltyk.jannau.net (Postfix) with ESMTPSA id D20AF26F070; Fri, 16 Sep 2022 10:47:12 +0200 (CEST) From: Janne Grunau To: Joe Perches Cc: linux-kernel@vger.kernel.org Subject: [PATCH] get_maintainer: Extend matched name characters in maintainers_in_file() Date: Fri, 16 Sep 2022 10:47:12 +0200 Message-Id: <20220916084712.84411-1-j@jannau.net> X-Mailer: git-send-email 2.35.1 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,SPF_HELO_NONE, SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Extend the regexp matching name characters to cover Unicode blocks Latin Extended-A and Extended-B. Fixes 'scripts/get_maintainer.pl -f' for 'Documentation/devicetree/bindings/clock/apple,nco.yaml'. Signed-off-by: Janne Grunau --- This still excludes Greek and Cyrilic characters which should be expected in names as well. I tried to use '\p{L}' to match all Unicode letters but couldn't get it to work. Feel free understand this as bug report with an incomplete fix. best regards, Janne --- scripts/get_maintainer.pl | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/scripts/get_maintainer.pl b/scripts/get_maintainer.pl index ab123b498fd9..7c06f06dcbfa 100755 --- a/scripts/get_maintainer.pl +++ b/scripts/get_maintainer.pl @@ -442,7 +442,7 @@ sub maintainers_in_file { my $text = do { local($/) ; <$f> }; close($f); - my @poss_addr = $text =~ m$[A-Za-zÀ-ÿ\"\' \,\.\+-]*\s*[\,]*\s*[\(\<\{]{0,1}[A-Za-z0-9_\.\+-]+\@[A-Za-z0-9\.-]+\.[A-Za-z0-9]+[\)\>\}]{0,1}$g; + my @poss_addr = $text =~ m$[A-Za-zÀ-ɏ\"\' \,\.\+-]*\s*[\,]*\s*[\(\<\{]{0,1}[A-Za-z0-9_\.\+-]+\@[A-Za-z0-9\.-]+\.[A-Za-z0-9]+[\)\>\}]{0,1}$g; push(@file_emails, clean_file_emails(@poss_addr)); } } @@ -2460,7 +2460,7 @@ sub clean_file_emails { $name = ""; } - my @nw = split(/[^A-Za-zÀ-ÿ\'\,\.\+-]/, $name); + my @nw = split(/[^A-Za-zÀ-ɏ\'\,\.\+-]/, $name); if (@nw > 2) { my $first = $nw[@nw - 3]; my $middle = $nw[@nw - 2]; -- 2.35.1