Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 54B31C74A44 for ; Sat, 11 Mar 2023 20:59:38 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229914AbjCKU7h (ORCPT ); Sat, 11 Mar 2023 15:59:37 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:45058 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229561AbjCKU7f (ORCPT ); Sat, 11 Mar 2023 15:59:35 -0500 Received: from ams.source.kernel.org (ams.source.kernel.org [145.40.68.75]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 851A432E4A; Sat, 11 Mar 2023 12:59:34 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 3B543B80B34; Sat, 11 Mar 2023 20:59:33 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id B1DE7C433EF; Sat, 11 Mar 2023 20:59:31 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1678568371; bh=Pg2HjLx8GdJVuQLgMAqmAvFJR9XFOR9C+clZu8j8+ao=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=uEP1JfXPci4WKn0FuQulm/BYvNFdBPL+5XsZTXIzaMg6ZuPPYfgDW3bG2zYVOKDO5 blhzjXAS08vNVsbE8rfX13KldetiH39N5NJpc8D/THypfStHj34DoopNHNxPneTk/C 0QI8AvWZ93C2vVyKLFzRK1Gnuz/nrrPBr08h5mHvJlcbvvFKJi+FPAEvAPKsKlCvN3 01ipVjM3kk2BZr/6+/NmfWYnshaZJWm/GxmjFmimCGk7D98E0kMc5GHacwqW6T/Vue bcuJz8/kuajttlYhndbFKNf/MEC8tUZiOEprQ2GXS6gdup4ryzAMv/AZ7TN+LAumCv OhqcHD3y7pmVA== Date: Sat, 11 Mar 2023 15:59:30 -0500 From: Sasha Levin To: Willy Tarreau Cc: Eric Biggers , Theodore Ts'o , Matthew Wilcox , Pavel Machek , linux-kernel@vger.kernel.org, stable@vger.kernel.org, viro@zeniv.linux.org.uk, linux-fsdevel@vger.kernel.org Subject: Re: AUTOSEL process Message-ID: References: <20230311161644.GH860405@mit.edu> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, Mar 11, 2023 at 09:19:54PM +0100, Willy Tarreau wrote: >On Sat, Mar 11, 2023 at 11:46:05AM -0800, Eric Biggers wrote: >> (And please note, the key word here is *confidence*. We all agree that it's >> never possible to be absolutely 100% sure whether a commit is appropriate for >> stable or not. That's a red herring. > >In fact even developers themselves sometimes don't know, and even when they >know, sometimes they know after committing it. Many times we've found that >a bug was accidently resolved by a small change. Just for this it's important >to support a post-merge analysis. >> And I would assume, or at least hope, that the neural network thing being used >> for AUTOSEL outputs a confidence rating and not just a yes/no answer. If it >> actually just outputs yes/no, well how is anyone supposed to know that and fix >> that, given that it does not seem to be an open source project?) > >Honestly I don't know. I ran a few experiments with natural language >processors such as GPT-3 on commit messages which contained human-readable >instructions, and asking "what am I expected to do with these patches", and >seeing the bot respond "you should backport them to this version, change >this and that in that version, and preliminary take that patch". It >summarized extremely well the instructions delivered by the developer, >which is awesome, but was not able to provide any form of confidence >level. I don't know what Sasha uses but wouldn't be surprised it shares >some such mechanisms and that it might not always be easy to get such a >confidence level. But I could be wrong. It's actually pretty stupid: it uses the existence of ~10k of the most common words in commit messages + metrics from cqmetrics (github.com/dspinellis/cqmetrics) as input. Although I get a score, which is already set pretty high, confidence is really non-existant here: at the end it depends mostly on the writing style of said commit author more than anything. -- Thanks, Sasha