Received: by 2002:a25:c593:0:0:0:0:0 with SMTP id v141csp822565ybe; Thu, 5 Sep 2019 06:31:06 -0700 (PDT) X-Google-Smtp-Source: APXvYqw5UnxXKBKST5mxqSruUCBzBaGzTgJqHu5uJ+xcVBRjzix8gkZcOc4dgz+7y9/kbYk2jEBn X-Received: by 2002:a63:4c5a:: with SMTP id m26mr3122112pgl.270.1567690266077; Thu, 05 Sep 2019 06:31:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1567690266; cv=none; d=google.com; s=arc-20160816; b=H6oSMHM5UtO1CF9jQAJWp9yC8mkI2c4QPVghZYf/WhIivq+oXJlNxJZBin8qxN/lO/ Sbn+8PtHkGkCeW/NwROPt/YmUzLFtNAenulyJuVkLuhfckdJyGvNlaCT5wVIiuL89LCR mF0ZZRTh50zlMSXnCzi3YqwamFR8I75SN1QkbXpgppMDqVlTjCWgwIae8yzQ/LK5Liuc SA98Gr5zjrBiivjdbro3jQZXtExrbAhAzzN/yaAGfuOtp2NY1W/8eAEU7D+czErIyJTU uv2ZjCpfmZKqvNVX3k5KSbE+hPzIil8C//Yh8JXHCcrSByhxxokn1xQJColbQu8ZWu4D 0ozA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=+M5pUPd0CSd5IqfYJSlkUN21WzkPkfZPQNuEhTOUOgo=; b=aQubdl4fKDmYxqt1Kxe7vkgw+jsmHKp7pO3tVXirTKSyOVcqBZifAB4RVQya8mHQU1 RfjhuCeailSly5XBprgMWUkUMoVneNloH9DUTflaoMfmYnvQtxL4QFyF9j3nvegps8VE 6Y6REXzlD8+FrAjmEd+JLGFyfWJ+kTp63h4TGvYFVO+JOKOSFRC7trCWzPB1T6bi944L pEMUXgAd15tTN9rjHSn9KUyr37yvSk6PKNVRYtveJKZbpmlZeXvkHhzIg0KRXJ+hcL6d ymNzRmAopUywm00rnMARzvFQhQHMsrnWa5VCL7tVpsyjloiUoulce0HURl2a64z3UH+1 UszQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@infradead.org header.s=bombadil.20170209 header.b=XJxqYaSD; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a65si2173233pfb.153.2019.09.05.06.30.49; Thu, 05 Sep 2019 06:31:06 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=fail header.i=@infradead.org header.s=bombadil.20170209 header.b=XJxqYaSD; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2388548AbfIEMHJ (ORCPT + 99 others); Thu, 5 Sep 2019 08:07:09 -0400 Received: from bombadil.infradead.org ([198.137.202.133]:59870 "EHLO bombadil.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1732374AbfIEMHJ (ORCPT ); Thu, 5 Sep 2019 08:07:09 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=bombadil.20170209; h=Content-Transfer-Encoding: Content-Type:MIME-Version:References:In-Reply-To:Message-ID:Subject:Cc:To: From:Date:Sender:Reply-To:Content-ID:Content-Description:Resent-Date: Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=+M5pUPd0CSd5IqfYJSlkUN21WzkPkfZPQNuEhTOUOgo=; b=XJxqYaSDJ0FmgiWHpkI6ft+c3 xpR0PYf82uONs7qbR16qRqH2gjeCTKZcuij6XuYltzRPRBM9aSEx8Lctmhs40s8h4IMaLeu7sqt9X NDqPjvyqhaVfOvVDdCKGb1L2LeSLkunvHggzBLtmw+Uf9c3H+j1f8Eic5cRAwYLPP9Ta6ZnLoJKRM mDAKSnHhoRibTk3TY1j+uaOxMx8JGf/zIFe8yDRjPLVwnAFoasHLZzwIItaVLTwB52kA7QCJbkugF A06FClA44XGCu5sYW3lcm/PAUHIEDnVWs7w9pmr/MNrXK7Fz+w1WNvGXoo/OJ5VqQsm1siPTobWAG p7bsRQUGg==; Received: from [177.159.253.249] (helo=coco.lan) by bombadil.infradead.org with esmtpsa (Exim 4.92 #3 (Red Hat Linux)) id 1i5qXb-0001t5-MF; Thu, 05 Sep 2019 12:07:08 +0000 Date: Thu, 5 Sep 2019 09:07:01 -0300 From: Mauro Carvalho Chehab To: Greg Kroah-Hartman Cc: Linux Media Mailing List , Mauro Carvalho Chehab , Joe Perches , linux-kernel@vger.kernel.org, Jonathan Corbet , Jessica Yu , Federico Vaga , Thomas Gleixner , linux-doc@vger.kernel.org Subject: Re: [PATCH] docs: license-rules.txt: cover SPDX headers on Python scripts Message-ID: <20190905090701.37dcf75e@coco.lan> In-Reply-To: <20190905092703.GA30899@kroah.com> References: <20190905055614.7958918b@coco.lan> <88e638eb959095ab6657d295f9f8c27169569bf2.1567675272.git.mchehab+samsung@kernel.org> <20190905092703.GA30899@kroah.com> X-Mailer: Claws Mail 3.17.3 (GTK+ 2.24.32; x86_64-redhat-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Em Thu, 5 Sep 2019 11:27:03 +0200 Greg Kroah-Hartman escreveu: > On Thu, Sep 05, 2019 at 06:23:13AM -0300, Mauro Carvalho Chehab wrote: > > The author of the license-rules.rst file wanted to be very restrict > > with regards to the location of the SPDX header. It says that > > the SPDX header "shall be added at the first possible line in > > a file which can contain a comment". Not happy with this already > > restrictive requiement, it goes further: > > > > "For the majority of files this is the first line, except for > > scripts", opening an exception to have the SPDX header at the > > second line, if the first line starts with "#!". > > > > Well, it turns that this is too restrictive for Python scripts, > > and may cause regressions if this would be enforced. > > > > As mentioned on: > > https://stackoverflow.com/questions/728891/correct-way-to-define-python-source-code-encoding > > > > Python's PEP-263 [1] dictates that an script that needs to default to > > UTF-8 encoding has to follow this rule: > > > > 'Python will default to ASCII as standard encoding if no other > > encoding hints are given. > > > > To define a source code encoding, a magic comment must be placed > > into the source files either as first or second line in the file' > > > > And: > > 'More precisely, the first or second line must match the following > > regular expression: > > > > ^[ \t\f]*#.*?coding[:=][ \t]*([-_.a-zA-Z0-9]+)' > > > > [1] https://www.python.org/dev/peps/pep-0263/ > > > > If a script has both "#!" and the charset encoding line, we can't place > > a SPDX tag without either violating license-rules.rst or breaking the > > script by making it crash with non-ASCII characters. > > > > So, add a sort notice saying that, for Python scripts, the SPDX > > header may be up to the third line, in order to cover the case > > where both "#!" and "# .*coding.*UTF-8" lines are found. > > > > Signed-off-by: Mauro Carvalho Chehab > > --- > > Documentation/process/license-rules.rst | 7 +++++-- > > 1 file changed, 5 insertions(+), 2 deletions(-) > > > > diff --git a/Documentation/process/license-rules.rst b/Documentation/process/license-rules.rst > > index 2ef44ada3f11..5d23e3498b1c 100644 > > --- a/Documentation/process/license-rules.rst > > +++ b/Documentation/process/license-rules.rst > > @@ -64,9 +64,12 @@ License identifier syntax > > possible line in a file which can contain a comment. For the majority > > of files this is the first line, except for scripts which require the > > '#!PATH_TO_INTERPRETER' in the first line. For those scripts the SPDX > > - identifier goes into the second line. > > + identifier goes into the second line\ [1]_. > > > > -| > > +.. [1] Please notice that Python scripts may also need an encoding rule > > + as defined on PEP-263, which should be defined either at the first > > + or the second line. So, for such scripts, the SPDX identifier may > > + go up to the third line. > > > > 2. Style: > > > > If you are going to do this, can you also fix up scripts/spdxcheck.py to > properly catch this For completeness, just added a check for it, and a "stats" mode at the script that will tell on what line the first SPDX tag occurs. I'll probably rework at the patch later, in order to disable the pedantic mode by default. There are currently 227 files that don't complain with the "up to line 3" rule, including COPYING (with should probably be excluded from the check). Patches are at: https://git.linuxtv.org/mchehab/experimental.git/log/?h=spdx_pedantic Btw, most violations are due to: /* * SPDX... Regards, Mauro