Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp976619pxu; Mon, 23 Nov 2020 08:39:07 -0800 (PST) X-Google-Smtp-Source: ABdhPJwyVWsfvrmdgvY+2Fk9fFPj2n8HLv/URDhPnOUgvB2DHyD3FOecN2yj0TIqG+a2uLvTN4+K X-Received: by 2002:a05:6402:150d:: with SMTP id f13mr24532edw.119.1606149547142; Mon, 23 Nov 2020 08:39:07 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1606149547; cv=none; d=google.com; s=arc-20160816; b=Tr16xEKzi5BgVvjYl+3vRjORg41QvmNNXekAYVRiGptlcCocKZA0L8aNJ/wExrEPR1 mGz3idZw1JbmwrfaPOkNlFsFHBij9AHvFyA1rGbkz7Akq7tZZz3ECH3/bFyx7T6yetNT EZS4aX/mUBwfb9p+5FDgzOcpQ13V8F0JpXJV07qk4CzoHnjsWFhS0GESliCd+G3UMGeM y1dNaXnEFvHoweEzHDWLSCsc45VOZB3C9IIt/xojiq3Qag2clr5nAkIkMJW7Ztem5abq HPiFM4uAQCieBCcGCKW8/sKpS5kUyayF3ccyJPKH+gE+i+V3RyiNmWZVILKUdMNgT+zn 1zVQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date; bh=w02U79IgXoVR1Ne73goXhCfX1BoyC+I10BFjRBFSuHo=; b=ZuMPF/OlbStfA4ZAppAmhHQTYS0+KjKTnV2Xpp8IL7q1oNzysfu0CIBq7kR904lI2b EE7zDCnzdGYAznGl1gDPDrzQuXfL5lScU3ouqv1CNCb8JQDREYb6nxHNyaWxqnm5kfAk DZ0szM9KFa747/7g5hT+aMH9psdbMHQs1M8zhqZR5/YJOB0zpd5HR0AsHlhvkSmU7t2F bFUUbnCZI+zJZ1GmUh1/lxXgoAm+qpQejEBeGUdRnr4yTWNyfFKa4Vb64A8sIg5lcMxa YkHkzpgBsDmdmuPxADFGMvkhlSyZY+qsT4rDnietBdH3UMjQ72BF88vjDf5+oePCU6kE Wucg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id p13si6799874ejz.147.2020.11.23.08.38.43; Mon, 23 Nov 2020 08:39:07 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2390119AbgKWQhE (ORCPT + 99 others); Mon, 23 Nov 2020 11:37:04 -0500 Received: from mail.kernel.org ([198.145.29.99]:57982 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1732535AbgKWQhD (ORCPT ); Mon, 23 Nov 2020 11:37:03 -0500 Received: from gandalf.local.home (cpe-66-24-58-225.stny.res.rr.com [66.24.58.225]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 649D220665; Mon, 23 Nov 2020 16:37:01 +0000 (UTC) Date: Mon, 23 Nov 2020 11:36:59 -0500 From: Steven Rostedt To: Marco Elver Cc: "Paul E. McKenney" , Anders Roxell , Andrew Morton , Alexander Potapenko , Dmitry Vyukov , Jann Horn , Mark Rutland , Linux Kernel Mailing List , Linux-MM , kasan-dev , rcu@vger.kernel.org, Peter Zijlstra , Tejun Heo , Lai Jiangshan Subject: Re: [PATCH] kfence: Avoid stalling work queue task without allocations Message-ID: <20201123113659.3d1fd866@gandalf.local.home> In-Reply-To: <20201123112812.19e918b3@gandalf.local.home> References: <20201112161439.GA2989297@elver.google.com> <20201112175406.GF3249@paulmck-ThinkPad-P72> <20201113175754.GA6273@paulmck-ThinkPad-P72> <20201117105236.GA1964407@elver.google.com> <20201117182915.GM1437@paulmck-ThinkPad-P72> <20201118225621.GA1770130@elver.google.com> <20201118233841.GS1437@paulmck-ThinkPad-P72> <20201119125357.GA2084963@elver.google.com> <20201120142734.75af5cd6@gandalf.local.home> <20201123152720.GA2177956@elver.google.com> <20201123112812.19e918b3@gandalf.local.home> X-Mailer: Claws Mail 3.17.3 (GTK+ 2.24.32; x86_64-pc-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 23 Nov 2020 11:28:12 -0500 Steven Rostedt wrote: > I noticed: > > > [ 237.650900] enabling event benchmark_event > > In both traces. Could you disable CONFIG_TRACEPOINT_BENCHMARK and see if > the issue goes away. That event kicks off a thread that spins in a tight > loop for some time and could possibly cause some issues. > > It still shouldn't break things, we can narrow it down if it is the culprit. And it probably is the issue because that thread will never sleep! It runs a loop of: static int benchmark_event_kthread(void *arg) { /* sleep a bit to make sure the tracepoint gets activated */ msleep(100); while (!kthread_should_stop()) { trace_do_benchmark(); /* * We don't go to sleep, but let others run as well. * This is basically a "yield()" to let any task that * wants to run, schedule in, but if the CPU is idle, * we'll keep burning cycles. * * Note the tasks_rcu_qs() version of cond_resched() will * notify synchronize_rcu_tasks() that this thread has * passed a quiescent state for rcu_tasks. Otherwise * this thread will never voluntarily schedule which would * block synchronize_rcu_tasks() indefinitely. */ cond_resched_tasks_rcu_qs(); } return 0; } Did something change, where that "cond_resched_tasks_rcu_qs()" doesn't let things progress on ARM64? I noticed that you have PREEMPT enabled so this will only be preempted when its schedule time runs out and something else wants to run. How would that affect other threads? -- Steve