Received: by 2002:a05:6902:102b:0:0:0:0 with SMTP id x11csp3104225ybt; Mon, 22 Jun 2020 15:15:31 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyVqAuN9SjPZ7TT8TvjwMSxGtUVtWS4TUZHMViqr4nE/UdPMG0syJaYOzTiZFlvinBk2/Uo X-Received: by 2002:a17:906:1b45:: with SMTP id p5mr4661167ejg.51.1592864130812; Mon, 22 Jun 2020 15:15:30 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1592864130; cv=none; d=google.com; s=arc-20160816; b=NuaE11ZaxFdJkrjN+lKg3yyBPVFc5554h6e1irRHDoHGwLaRogGe+d8kLrNh5Z8gnj VO6fPxpxFcEemO36enMMml+GZQ6u8lgEVNWaDjBXasxC6UDpqEGAKGAW1GUoIPrB1Mv0 JhhpeP/vKqTllD7atkErGTTJT2RKslHevXLItbPe71zVpHTgRHNVRbma3JwYRkDiqy0X zu42LGmQf9LHyxFj5goMCpRkp6WywvODFusmSZJuMpGcUV4HSKHRh2vzpfAMT2bNdyTl 8DFEtzNv7z7poCpX5Zdpjt3/69V1joYh1ZiPCqSTb7X77UuIaJPgCv1kqWWHjua33YcK wCnA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=OZuyls1cMAwtwJjsM2adYow5pWrQPEerf0VMA+Z2UtA=; b=pzv+JOfT6Jyh9wbCpZOBYuHsHoLeDw4/XEOV+2husluqAkix7UMEegChTfpsR5hIVY bzn7nJTtFXYM4tPoXDqWgs/VwoaI3FawLo1erT8b1/mjv+JDsMbkStlQlZNMm4dftDgi 507wUXmGBx5LzkxBwboU4XeQGRKv6BHAZQhg5AsFVFLpp5tvr/15VWRXWa/3nUnva+DA QpgApBoBt1q2I8XqZPJz5di1HrIVdAztH6CJBzJEOEh8M4VfAkqs8V7kJT0WYjsBvJdz t/pD96SBWtWdXXj1yUQfiOQLdEdQ4xNUaiWvK2aiWkJdxePC0IBs/B1IZvPdEyncuKRV kXsw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id cm2si9378982edb.370.2020.06.22.15.15.07; Mon, 22 Jun 2020 15:15:30 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730812AbgFVWLS (ORCPT + 99 others); Mon, 22 Jun 2020 18:11:18 -0400 Received: from mail-lf1-f52.google.com ([209.85.167.52]:33727 "EHLO mail-lf1-f52.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727018AbgFVWLS (ORCPT ); Mon, 22 Jun 2020 18:11:18 -0400 Received: by mail-lf1-f52.google.com with SMTP id g2so10568677lfb.0 for ; Mon, 22 Jun 2020 15:11:15 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references:mime-version:content-transfer-encoding; bh=OZuyls1cMAwtwJjsM2adYow5pWrQPEerf0VMA+Z2UtA=; b=bP/S+wVWC3AYeh977nxMh3mDl7sXOSwzggKyoDv9sRgXOeufFXpIwzsrYdRtRtF8sj ptNYN8wNBp8i9kSh6OUUkc68Psn3lVR95zSUxZIiujlSJ5M335MIAWSwhPTuyPdkN8Cs RTofefq8osvxz/1chRQmzezxIG3ca0GW9hj2/HS7aY1OE/oBig6miGFBhPK+MsycDkv9 riQBiwVST9aRjnCw29Mo4hmiEDkwT40oAR9PAxDpAxBoyoD2MW8XUcm5n1jMuTvWZ2+H dtCos9KIGK9mLij+WcJ4PtN3eVxzSdmOOwO05+kJzSOVi8y2GpOsHst2EHVvyVv26h7b UCrA== X-Gm-Message-State: AOAM53017y5ZuBmT2NSrhlgiVra6jUPQ+6k+DL9JizIQO2RWuDArmqRi QAlBmHu9VzhRhHsaSUyX6yJqgUWbUTAuzg== X-Received: by 2002:a19:e346:: with SMTP id c6mr11129516lfk.177.1592863874519; Mon, 22 Jun 2020 15:11:14 -0700 (PDT) Received: from localhost.localdomain (broadband-37-110-38-130.ip.moscow.rt.ru. [37.110.38.130]) by smtp.googlemail.com with ESMTPSA id a23sm3738767lfb.10.2020.06.22.15.11.12 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 22 Jun 2020 15:11:13 -0700 (PDT) From: Denis Efremov To: Julia Lawall Cc: Denis Efremov , cocci@systeme.lip6.fr, linux-kernel@vger.kernel.org Subject: [PATCH v4] coccinelle: misc: add array_size_dup script to detect missed overflow checks Date: Tue, 23 Jun 2020 01:10:56 +0300 Message-Id: <20200622221056.34241-1-efremov@linux.com> X-Mailer: git-send-email 2.26.2 In-Reply-To: <20200615102045.4558-1-efremov@linux.com> References: <20200615102045.4558-1-efremov@linux.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Detect an opencoded expression that is used before or after array_size()/array3_size()/struct_size() to compute the same size. Signed-off-by: Denis Efremov --- Changes in v2: - python rules moved next to SmPL patterns - assignment operator used - struct_size patterns fixed to check only E3, since E1, E2 are sizeofs of a structure and a member of a structure Changes in v3: - s/overlow/overflow/ typo fixed (thanks, Markus) - \(&E1\|&E2\) changed to &\(E1\|E2\) - print strings breaks removed Changes in v4: - duplicates warning removed - python2 compatability in report&&org modes added - s/down the code/later/ warning changed - \(E1\|E2\|subE1\|subE2\) patterns simplified to \(subE1\|subE2\) scripts/coccinelle/misc/array_size_dup.cocci | 209 +++++++++++++++++++ 1 file changed, 209 insertions(+) create mode 100644 scripts/coccinelle/misc/array_size_dup.cocci diff --git a/scripts/coccinelle/misc/array_size_dup.cocci b/scripts/coccinelle/misc/array_size_dup.cocci new file mode 100644 index 000000000000..d3d635b2d4fc --- /dev/null +++ b/scripts/coccinelle/misc/array_size_dup.cocci @@ -0,0 +1,209 @@ +// SPDX-License-Identifier: GPL-2.0-only +/// +/// Check for array_size(), array3_size(), struct_size() duplicates. +/// Three types of patterns for these functions: +/// 1. An opencoded expression is used before array_size() to compute the same size +/// 2. An opencoded expression is used after array_size() to compute the same size +/// From security point of view only first case is relevant. These functions +/// perform arithmetic overflow check. Thus, if we use an opencoded expression +/// before a call to the *_size() function we can miss an overflow. +/// +// Confidence: High +// Copyright: (C) 2020 Denis Efremov ISPRAS +// Options: --no-includes --include-headers --no-loops + +virtual context +virtual report +virtual org + +@as@ +expression E1, E2; +@@ + +array_size(E1, E2) + +@as_next@ +expression subE1 <= as.E1; +expression subE2 <= as.E2; +expression as.E1, as.E2, E3; +assignment operator aop; +position p1, p2; +@@ + +* E1 * E2@p1 + ... when != \(subE1\|subE2\) aop E3 + when != &\(subE1\|subE2\) +* array_size(E1, E2)@p2 + +@script:python depends on report@ +p1 << as_next.p1; +p2 << as_next.p2; +@@ + +msg = "WARNING: array_size is used later (line %s) to compute the same size" % (p2[0].line) +coccilib.report.print_report(p1[0], msg) + +@script:python depends on org@ +p1 << as_next.p1; +p2 << as_next.p2; +@@ + +msg = "WARNING: array_size is used later (line %s) to compute the same size" % (p2[0].line) +coccilib.org.print_todo(p1[0], msg) + +@as_prev@ +expression subE1 <= as.E1; +expression subE2 <= as.E2; +expression as.E1, as.E2, E3; +assignment operator aop; +position p1, p2; +@@ + +* array_size(E1, E2)@p1 + ... when != \(subE1\|subE2\) aop E3 + when != &\(subE1\|subE2\) +* E1 * E2@p2 + +@script:python depends on report@ +p1 << as_prev.p1; +p2 << as_prev.p2; +@@ + +msg = "WARNING: array_size is already used (line %s) to compute the same size" % (p1[0].line) +coccilib.report.print_report(p2[0], msg) + +@script:python depends on org@ +p1 << as_prev.p1; +p2 << as_prev.p2; +@@ + +msg = "WARNING: array_size is already used (line %s) to compute the same size" % (p1[0].line) +coccilib.org.print_todo(p2[0], msg) + +@as3@ +expression E1, E2, E3; +@@ + +array3_size(E1, E2, E3) + +@as3_next@ +expression subE1 <= as3.E1; +expression subE2 <= as3.E2; +expression subE3 <= as3.E3; +expression as3.E1, as3.E2, as3.E3, E4; +assignment operator aop; +position p1, p2; +@@ + +* E1 * E2 * E3@p1 + ... when != \(subE1\|subE2\|subE3\) aop E4 + when != &\(subE1\|subE2\|subE3\) +* array3_size(E1, E2, E3)@p2 + +@script:python depends on report@ +p1 << as3_next.p1; +p2 << as3_next.p2; +@@ + +msg = "WARNING: array3_size is used later (line %s) to compute the same size" % (p2[0].line) +coccilib.report.print_report(p1[0], msg) + +@script:python depends on org@ +p1 << as3_next.p1; +p2 << as3_next.p2; +@@ + +msg = "WARNING: array3_size is used later (line %s) to compute the same size" % (p2[0].line) +coccilib.org.print_todo(p1[0], msg) + +@as3_prev@ +expression subE1 <= as3.E1; +expression subE2 <= as3.E2; +expression subE3 <= as3.E3; +expression as3.E1, as3.E2, as3.E3, E4; +assignment operator aop; +position p1, p2; +@@ + +* array3_size(E1, E2, E3)@p1 + ... when != \(subE1\|subE2\|subE3\) aop E4 + when != &\(subE1\|subE2\|subE3\) +* E1 * E2 * E3@p2 + +@script:python depends on report@ +p1 << as3_prev.p1; +p2 << as3_prev.p2; +@@ + +msg = "WARNING: array3_size is already used (line %s) to compute the same size" % (p1[0].line) +coccilib.report.print_report(p2[0], msg) + +@script:python depends on org@ +p1 << as3_prev.p1; +p2 << as3_prev.p2; +@@ + +msg = "WARNING: array3_size is already used (line %s) to compute the same size" % (p1[0].line) +coccilib.org.print_todo(p2[0], msg) + +@ss@ +expression E1, E2, E3; +@@ + +struct_size(E1, E2, E3) + +@ss_next@ +expression subE3 <= ss.E3; +expression ss.E1, ss.E2, ss.E3, E4; +assignment operator aop; +position p1, p2; +@@ + +* E1 * E2 + E3@p1 + ... when != subE3 aop E4 + when != &subE3 +* struct_size(E1, E2, E3)@p2 + +@script:python depends on report@ +p1 << ss_next.p1; +p2 << ss_next.p2; +@@ + +msg = "WARNING: struct_size is used later (line %s) to compute the same size" % (p2[0].line) +coccilib.report.print_report(p1[0], msg) + +@script:python depends on org@ +p1 << ss_next.p1; +p2 << ss_next.p2; +@@ + +msg = "WARNING: struct_size is used later (line %s) to compute the same size" % (p2[0].line) +coccilib.org.print_todo(p1[0], msg) + +@ss_prev@ +expression subE3 <= ss.E3; +expression ss.E1, ss.E2, ss.E3, E4; +assignment operator aop; +position p1, p2; +@@ + +* struct_size(E1, E2, E3)@p1 + ... when != subE3 aop E4 + when != &subE3 +* E1 * E2 + E3@p2 + +@script:python depends on report@ +p1 << ss_prev.p1; +p2 << ss_prev.p2; +@@ + +msg = "WARNING: struct_size is already used (line %s) to compute the same size" % (p1[0].line) +coccilib.report.print_report(p2[0], msg) + +@script:python depends on org@ +p1 << ss_prev.p1; +p2 << ss_prev.p2; +@@ + +msg = "WARNING: struct_size is already used (line %s) to compute the same size" % (p1[0].line) +coccilib.org.print_todo(p2[0], msg) -- 2.26.2