Received: by 2002:a25:868d:0:0:0:0:0 with SMTP id z13csp1423583ybk; Thu, 14 May 2020 08:38:23 -0700 (PDT) X-Google-Smtp-Source: ABdhPJx8OvYjDPUnWntKzHDo46AIY7Gy755Mu3pEe+qxoCnNMZWXLqBnNRkhL0THKv+qb3Uj3BpF X-Received: by 2002:a05:6402:7d6:: with SMTP id u22mr4422514edy.149.1589470703560; Thu, 14 May 2020 08:38:23 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1589470703; cv=none; d=google.com; s=arc-20160816; b=rTsexKNKL/D5Xb9WwjpYXi1InCQis/iTLo3SJAY67Eyhvj9CJHN3rw/r5q5kU60VDi to6RuDSxhpzGoWzSOgMKHFkCNbsAu0k0J7SHsG0F7RH8jGKfQgpR/K3oN5/JXc/5FElb U0W8kIk7uxY9Vn3Xv98NoUXpsABjKkVsyVZW2nkprOFH2VdM8D5qQHCGct79467neqig qv+7yoie9tFeoSGGs3x/l4L4L8IcPgwuKzm4h9mihK3CCEHozGIpiVscWBSPA7H7mgRk /IjoRPxY3PNBtKPcjFhGJEEf+GgsJPllW+RNfJSkddS84nNQzK2nfO8C2q2VRrjdDcmC EEjg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=3Mei1KcbI0ookMBaHr7GQag+yZzPrXFoTYa4Y4rxHFA=; b=ApwxI0gM+IeMKXjE6QMnex7qaAIBh1oJNvSSlLHtTJNcDV3ZVB6KUdSS94g6y7KmFu d2pFOFi8jRiUpqihfJ2bf3CqCVmzEijzA/vra3AOZ4XwVE51UP4rqdA2nfinGJF/Nh9f lLBEvS5kdS8QxtObAqBlcSpqaNY+DSPbbkX2E7J1XGYL1gHLLudLt5sKxKAqtxNx0GBF dVnMf+UQaMLjUBJF22bdHiblD2tSj0WIo05rMvYCwdePwfV4muQecXY3T1LNuX2TkgKB LsoAO7XM2wi48XqRUyg6oA6g9VGVrYt4ql6zXcjcycLFXhSph5DbxCY3vizJ2/XbHESb yQ+g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=FIDjMi9L; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id w5si2129696eja.332.2020.05.14.08.38.00; Thu, 14 May 2020 08:38:23 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=FIDjMi9L; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727873AbgENPeB (ORCPT + 99 others); Thu, 14 May 2020 11:34:01 -0400 Received: from mail.kernel.org ([198.145.29.99]:48420 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726056AbgENPeB (ORCPT ); Thu, 14 May 2020 11:34:01 -0400 Received: from paulmck-ThinkPad-P72.home (unknown [50.39.105.78]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id AC10C206A5; Thu, 14 May 2020 15:34:00 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1589470440; bh=c1I6NVD00/l8JyhbZoKN9BhNoOXboGGBJlhOK92sQnc=; h=Date:From:To:Cc:Subject:Reply-To:References:In-Reply-To:From; b=FIDjMi9Lctz3XO1na65x+KHBIKFfPyFg7a3xT6OfYJBxR7pU/Z/mASDJhENqsfxt9 QwtAF7Nkrp1SHNJxptfm53l35itQ2iB52uXu/fe6i6ctiGnVyFsbunm+ZOKEt+hPez Z2CSnLSKskbWfaXoLtrBmGo9hRVDRAO/JF0tnfXE= Received: by paulmck-ThinkPad-P72.home (Postfix, from userid 1000) id 3687235229C5; Thu, 14 May 2020 08:34:00 -0700 (PDT) Date: Thu, 14 May 2020 08:34:00 -0700 From: "Paul E. McKenney" To: Qian Cai Cc: Stephen Rothwell , Linux Next Mailing List , Linux Kernel Mailing List , Madhuparna Bhowmik , Amol Grover , Dmitry Vyukov Subject: Re: Default enable RCU list lockdep debugging with PROVE_RCU Message-ID: <20200514153400.GJ2869@paulmck-ThinkPad-P72> Reply-To: paulmck@kernel.org References: <20200514222535.259cb69e@canb.auug.org.au> <20200514133328.GG2869@paulmck-ThinkPad-P72> <20200514135402.GI2869@paulmck-ThinkPad-P72> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, May 14, 2020 at 10:03:21AM -0400, Qian Cai wrote: > > > > On May 14, 2020, at 9:54 AM, Paul E. McKenney wrote: > > > > On Thu, May 14, 2020 at 09:44:28AM -0400, Qian Cai wrote: > >> > >> > >>> On May 14, 2020, at 9:33 AM, Paul E. McKenney wrote: > >>> > >>> On Thu, May 14, 2020 at 08:31:13AM -0400, Qian Cai wrote: > >>>> > >>>> > >>>>> On May 14, 2020, at 8:25 AM, Stephen Rothwell wrote: > >>>>> > >>>>> Hi Paul, > >>>>> > >>>>> This patch in the rcu tree > >>>>> > >>>>> d13fee049fa8 ("Default enable RCU list lockdep debugging with PROVE_RCU") > >>>>> > >>>>> is causing whack-a-mole in the syzbot testing of linux-next. Because > >>>>> they always do a debug build of linux-next, no testing is getting done. :-( > >>>>> > >>>>> Can we find another way to find all the bugs that are being discovered > >>>>> (very slowly)? > >>>> > >>>> Alternatively, could syzbot to use PROVE_RCU=n temporarily because it can’t keep up with it? I personally found PROVE_RCU_LIST=y is still useful for my linux-next testing, and don’t want to lose that coverage overnight. > >>> > >>> The problem is that PROVE_RCU is exactly PROVE_LOCKING, and asking people > >>> to test without PROVE_LOCKING is a no-go in my opinion. But of course > >>> on the other hand if there is no testing of RCU list lockdep debugging, > >>> those issues will never be found, let alone fixed. > >>> > >>> One approach would be to do as Stephen asks (either remove d13fee049fa8 > >>> or pull it out of -next) and have testers force-enable the RCU list > >>> lockdep debugging. > >>> > >>> Would that work for you? > >> > >> Alternatively, how about having > >> > >> PROVE_RCU_LIST=n if DEBUG_AID_FOR_SYZBOT > >> > >> since it is only syzbot can’t keep up with it? > > > > Sound good to me, assuming that this works for the syzkaller guys. > > Or could there be a "select PROVE_RCU_LIST" for the people who would > > like to test it. > > > > Alternatively, if we revert d13fee049fa8 from -next, I could provide > > you a script that updates your .config to set both RCU_EXPERT and > > PROVE_RCU_LIST. > > > > There are a lot of ways to appraoch this. > > > > So what would work best for everyone? > > > If PROVE_RCU_LIST=n if DEBUG_AID_FOR_SYZBOT works for syzbot guys, that would be great, so other testing agents could still report/fix those RCU-list bugs and then pave a way for syzbot to return back once all those false positives had been sorted out. On that, I must defer to the syzbot guys. > Otherwise, “select PROVE_RCU_LIST” *might* be better than buried into RCU_EXPERT where we will probably never saw those false positives been addressed since my configs does not cover a wide range of subsystems and probably not many other bots would enable RCU_EXPERT. Yet another option would be to edit your local kernel/rcu/Kconfig.debug and change the code to the following: config PROVE_RCU_LIST def_bool y help Enable RCU lockdep checking for list usages. It is default enabled with CONFIG_PROVE_RCU. Removing the RCU_EXPERT dependency would not go over at all well with some people whose opinions are difficult to ignore. ;-) Thanx, Paul