Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp1375466rwd; Wed, 7 Jun 2023 15:29:16 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ5e6eD4fd6VJVf6Gqvzz63row4u5XvEBMAe/h2o3Ky+lQfW4Yvc6aVYcP2OyqM+zRNctbg1 X-Received: by 2002:a17:90a:c503:b0:259:cd69:39e6 with SMTP id k3-20020a17090ac50300b00259cd6939e6mr1433729pjt.23.1686176956676; Wed, 07 Jun 2023 15:29:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1686176956; cv=none; d=google.com; s=arc-20160816; b=hRHFPLV7NZ7Sg1o506KtP3Zh62Qtg84GndNs49oQvR+y0om1WTG8ZHe8jlSlHLogPV iojmZxl7NwFM1hMi5plL8IQ0z70Sx1QX8psMIx8w+emp4TpjpAUzo5KHolS2EzuR+AWQ UYHBLOeMHUGjn9Az5f3661N7SU+HBBUJClYqaKg/iKkQiymwMvotTpOEpdTXd0PZGTB5 HaFQvnBm7n3pttwd0NASMEqZfjw3Os4CyUjPKlCduF4iLvDsMze1bKxI5cm2iinQ6uxy 9lsei6r2TH1FKuDT+8FZgr7RZNpKU2kYtl1CcqYTTEeft/JbUuXe/uuuNLU4JI/VVmn+ FYTQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:sender:dkim-signature; bh=BVOwXBe+gi9U6MELpPhRwwMitOlIbz1D83FR4yFNyP8=; b=xJw4lXc+iC6A1hBytKHTZvbdYbbQKQeZJQLKHIBMrzsiXe9Ez+zgk9djGJGGbNRA9T g3d2cOYQpZ2L3KIFu/Tv7f+E1X/3kgy7HegjQndAXPjrrc8I242AFJCBQHQu/EaBOZBh u1X6F5kuRIzWEq1r7nNOWxeQca4WbqyVNUgXZTtz87xv30uUP57U7rKN9Vhfu6InfNh5 Y3Zw4fb2X/KfyqhrcXvlN4so+f5fNPcnI81IhwiaLjuVfEAI1VjwTgdUL2fHMv77NWem xMlhuaATMpzyZkSl0+iy2Y5TW8JLUeZx0mroTD+F26ZHasTQcN0TKsAwktrG0UYoTsPg EOhg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20221208 header.b=S9fpjN7Y; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id p4-20020a17090a4f0400b002566f56e9aesi1720555pjh.105.2023.06.07.15.29.02; Wed, 07 Jun 2023 15:29:16 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20221208 header.b=S9fpjN7Y; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230450AbjFGWNa (ORCPT + 99 others); Wed, 7 Jun 2023 18:13:30 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60900 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229517AbjFGWN3 (ORCPT ); Wed, 7 Jun 2023 18:13:29 -0400 Received: from mail-pf1-x434.google.com (mail-pf1-x434.google.com [IPv6:2607:f8b0:4864:20::434]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 42C3F2118 for ; Wed, 7 Jun 2023 15:13:28 -0700 (PDT) Received: by mail-pf1-x434.google.com with SMTP id d2e1a72fcca58-655fce0f354so2695163b3a.0 for ; Wed, 07 Jun 2023 15:13:28 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20221208; t=1686176008; x=1688768008; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:sender:from:to:cc:subject:date:message-id :reply-to; bh=BVOwXBe+gi9U6MELpPhRwwMitOlIbz1D83FR4yFNyP8=; b=S9fpjN7YodN2+IBus8fJhXHiiVEQ1SHuADo3yFRDKlHBgTqHWAdz40R5mJXOedC3Gh z3/JafZ4X3ESPwC4WZpGSxByZ9loZZF5oLTkv9v8MEMnWf737vB766X2YKBdgTLaRFvN ApTzOtMsyIn8NnRszwmegGGfkC2ahGCNulfmm8IV/7H2MamCZjr+ZDRCzFBPeOJP0rDX YXrRoqE5Yshv98UsRj6GGzZrrQNL9iw1veBMjdu86DAp4TmOegvF5IcyrKpDz8w3i+Z2 I4PMtEweX4jkmLeC0BKtiq5Bi8lUN+lyjybZHJFeCmxQdqi5GzM/5oyukGScqQaeGvJr XmdQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1686176008; x=1688768008; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:sender:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=BVOwXBe+gi9U6MELpPhRwwMitOlIbz1D83FR4yFNyP8=; b=jI7gBPNroYDDmRHGBYAx342g5iUjkpudHthDN3RnDidbG3XD4mk+TKl2+uLphljaA6 +pQz2L1heH9qFPca7/UuOUG+XKlFNMeE+ce0GSeL+F3pMuE9rGxHnr5jnGCrCd2Jhyft QIbHBi2nS+8O82iTfklMSItwp6D3596M/ptkFVKplWeMn6gFfgEkSTGwhZZbEFIRt6wP 3/fhdDoNLL/+0jxQBFsZRN7dPUgVU4na+XtPxc5yCna+9u9VHrkdZoFs+F6bOmvHs1/b 8lmOPMNgRj48gm5ihTA6rMhOV0iP334VRb0hBR8QXv6kQnJQbI8wMdNPBL8X6jJZM/DV FpgA== X-Gm-Message-State: AC+VfDzNoC/OKxeCULuTzlSppqpE0Se8oKyMoTmCy19VfumXpdJ6hDtA SqsYdk+k4IR1ESkfwIgk/LZHrslcI5fqrQ== X-Received: by 2002:a05:6a20:9596:b0:117:d81d:f170 with SMTP id iu22-20020a056a20959600b00117d81df170mr2525214pzb.28.1686176007418; Wed, 07 Jun 2023 15:13:27 -0700 (PDT) Received: from localhost (dhcp-72-235-13-41.hawaiiantel.net. [72.235.13.41]) by smtp.gmail.com with ESMTPSA id q14-20020a65494e000000b00530914c3bc1sm8607463pgs.21.2023.06.07.15.13.26 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 07 Jun 2023 15:13:26 -0700 (PDT) Sender: Tejun Heo Date: Wed, 7 Jun 2023 12:13:27 -1000 From: Tejun Heo To: K Prateek Nayak Cc: Sandeep Dhavale , jiangshanlai@gmail.com, torvalds@linux-foundation.org, peterz@infradead.org, linux-kernel@vger.kernel.org, kernel-team@meta.com, joshdon@google.com, brho@google.com, briannorris@chromium.org, nhuck@google.com, agk@redhat.com, snitzer@kernel.org, void@manifault.com, kernel-team@android.com Subject: Re: [PATCH 14/24] workqueue: Generalize unbound CPU pods Message-ID: References: <20230519001709.2563-1-tj@kernel.org> <20230519001709.2563-15-tj@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-1.5 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_EF,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM, HEADER_FROM_DIFFERENT_DOMAINS,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hello, On Wed, May 31, 2023 at 05:44:57PM +0530, K Prateek Nayak wrote: ... > The RIP points to dereferencing sd_llc_shared->has_idle_cores > > $ scripts/faddr2line vmlinux select_task_rq_fair+0x9bd > select_task_rq_fair+0x9bd/0x2570: > test_idle_cores at kernel/sched/fair.c:6830 > (inlined by) select_idle_sibling at kernel/sched/fair.c:7189 > (inlined by) select_task_rq_fair at kernel/sched/fair.c:7710 Hmm... the only thing I can think of is workqueue setting ->wake_cpu to something invalid. > My kernel is somewhat stable (I have not seen a panic for ~45min but I > was not stress testing the system either during that time) with the > following changes: > > diff --git a/kernel/workqueue.c b/kernel/workqueue.c > index b2e914655f05..a279cc9c2248 100644 > --- a/kernel/workqueue.c > +++ b/kernel/workqueue.c > @@ -2247,7 +2247,7 @@ static void unbind_worker(struct worker *worker) > if (cpumask_intersects(wq_unbound_cpumask, cpu_active_mask)) > WARN_ON_ONCE(set_cpus_allowed_ptr(worker->task, wq_unbound_cpumask) < 0); > else > - WARN_ON_ONCE(set_cpus_allowed_ptr(worker->task, cpu_possible_mask) < 0); > + WARN_ON_ONCE(set_cpus_allowed_ptr(worker->task, cpu_active_mask) < 0); > } I'm not sure why changing the cpus_allowed_ptr would make a difference here. Maybe the chain of events involves CPUs going offline and the above migrate the tasks resetting their ->wake_cpu. Can you please try the following branch and see if any of the warnings triggers? git://git.kernel.org/pub/scm/linux/kernel/git/tj/wq.git affinity-scopes-dbg-invalid-cpu Thanks. -- tejun