Received: by 2002:a05:6a10:6d10:0:0:0:0 with SMTP id gq16csp29488pxb; Tue, 12 Apr 2022 15:55:35 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwz3FfWeNBMk2ky+2pmtXseEMGsopxe/QdcjA5wZQ7Ap2aAmYM9KhwsFZ6/+a8UKWvGGE5k X-Received: by 2002:a17:902:b092:b0:156:a40f:f716 with SMTP id p18-20020a170902b09200b00156a40ff716mr39543635plr.72.1649804135538; Tue, 12 Apr 2022 15:55:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1649804135; cv=none; d=google.com; s=arc-20160816; b=IiwGeUwgHMS5kdAwdIoNTBV4NhfQ/A1XIt5ZWNfHvbE7GozDfA94N2OwYGVXhUw5oZ zQvUMRj2C6xCDf1C1tj0+TauOTt3V0alWjQhHQ/MBrt9Yq5dq+QWpQyCtqcVdJkVuros /dXH8+pyBp6cx7fktLPJw17kAZuvMGfu15XlcWgQTsveInZ1aQf5bfvHJRpu06pRcI0W 4+ZF0UgpFUAXBWCZFlLEvw+1qRSQcr6Izz5lwDCxN+Xh54/UoRG8gI0ba2LJFX03tWX5 PSRD0dKg35UCq8/gkoDKvvbpNbOewyVaBT3nJwsKSP5+ccksadtO5pkPh8eNhdzLEZI6 oj6w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:date:message-id:organization:from :references:cc:to:subject; bh=0LypIjk/DjMkW6XG5n+CxNent/YxcNVv1HBjR63WCxM=; b=YlApjhTM7F0nwhS6wIQMWsTmzWLL9RCAOwhtZiYy6JJdhgGx8rWeE8uXKjbEO7cXCw Mne07cwha2OCdKO2cF39mCAzTa0pKJ+rc7+9cv+hvrNxwPYY/QKwvJdPbLN7at9AN8rj EOvZWYkeJA8WM4GzXUIyjJY4VgshxZGGw+vuKxPn7stGMGRd1eXa3xGF3fOK5WERXp06 hITf24dcQfXHB0svKuggRAJ0QjMnPu8iOpwsXCE+9dVWdM2bCQ++8R2w2WK5K6hfj5Kc MxgU9Cxtm7/Y0u1yVYTIReJCF6CEQk6SP6I9t6BxYvdb/f2j2GUt8ABXqL8YrD5LFV05 jUSQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=applied-asynchrony.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id u17-20020a170903125100b00153ee8b0c9dsi13109468plh.176.2022.04.12.15.55.35 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 12 Apr 2022 15:55:35 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=applied-asynchrony.com Received: from out1.vger.email (out1.vger.email [IPv6:2620:137:e000::1:20]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 5A00611BCDB; Tue, 12 Apr 2022 14:37:22 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229849AbiDKIIG (ORCPT + 99 others); Mon, 11 Apr 2022 04:08:06 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:46012 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1343605AbiDKIHf (ORCPT ); Mon, 11 Apr 2022 04:07:35 -0400 Received: from mail.itouring.de (mail.itouring.de [85.10.202.141]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5E7E11AD9E; Mon, 11 Apr 2022 01:05:21 -0700 (PDT) Received: from tux.applied-asynchrony.com (p5ddd7616.dip0.t-ipconnect.de [93.221.118.22]) by mail.itouring.de (Postfix) with ESMTPSA id D21EB124EC0; Mon, 11 Apr 2022 10:05:19 +0200 (CEST) Received: from [192.168.100.221] (hho.applied-asynchrony.com [192.168.100.221]) by tux.applied-asynchrony.com (Postfix) with ESMTP id 9A4F4F01601; Mon, 11 Apr 2022 10:05:19 +0200 (CEST) Subject: Re: [tip: sched/core] sched/tracing: Don't re-read p->state when emitting sched_switch event To: Greg KH Cc: Qais Yousef , linux-kernel@vger.kernel.org, linux-tip-commits@vger.kernel.org, Abhijeet Dharmapurikar , Valentin Schneider , "Peter Zijlstra (Intel)" , "Steven Rostedt (Google)" , x86@kernel.org, stable@vger.kernel.org References: <20220120162520.570782-2-valentin.schneider@arm.com> <164614827941.16921.4995078681021904041.tip-bot2@tip-bot2> <20220308180240.qivyjdn4e3te3urm@wubuntu> <20220308185138.ldxfqd242uxowymd@wubuntu> <20220409233829.o2s6tffuzujkx6w2@airbuntu> <20220410220608.cdf6hmf5mwcqzwun@airbuntu> From: =?UTF-8?Q?Holger_Hoffst=c3=a4tte?= Organization: Applied Asynchrony, Inc. Message-ID: Date: Mon, 11 Apr 2022 10:05:19 +0200 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,RDNS_NONE, SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2022-04-11 09:28, Greg KH wrote: > On Mon, Apr 11, 2022 at 09:18:19AM +0200, Holger Hoffstätte wrote: >> On 2022-04-11 01:22, Holger Hoffstätte wrote: >>> On 2022-04-11 00:06, Qais Yousef wrote: >>>> On 04/10/22 00:38, Qais Yousef wrote: >>>>> On 03/08/22 18:51, Qais Yousef wrote: >>>>>> On 03/08/22 19:10, Greg KH wrote: >>>>>>> On Tue, Mar 08, 2022 at 06:02:40PM +0000, Qais Yousef wrote: >>>>>>>> +CC stable >>>>>>>> >>>>>>>> On 03/01/22 15:24, tip-bot2 for Valentin Schneider wrote: >>>>>>>>> The following commit has been merged into the sched/core branch of tip: >>>>>>>>> >>>>>>>>> Commit-ID:     fa2c3254d7cfff5f7a916ab928a562d1165f17bb >>>>>>>>> Gitweb:        https://git.kernel.org/tip/fa2c3254d7cfff5f7a916ab928a562d1165f17bb >>>>>>>>> Author:        Valentin Schneider >>>>>>>>> AuthorDate:    Thu, 20 Jan 2022 16:25:19 >>>>>>>>> Committer:     Peter Zijlstra >>>>>>>>> CommitterDate: Tue, 01 Mar 2022 16:18:39 +01:00 >>>>>>>>> >>>>>>>>> sched/tracing: Don't re-read p->state when emitting sched_switch event >>>>>>>>> >>>>>>>>> As of commit >>>>>>>>> >>>>>>>>>    c6e7bd7afaeb ("sched/core: Optimize ttwu() spinning on p->on_cpu") >>>>>>>>> >>>>>>>>> the following sequence becomes possible: >>>>>>>>> >>>>>>>>>               p->__state = TASK_INTERRUPTIBLE; >>>>>>>>>               __schedule() >>>>>>>>>             deactivate_task(p); >>>>>>>>>    ttwu() >>>>>>>>>      READ !p->on_rq >>>>>>>>>      p->__state=TASK_WAKING >>>>>>>>>             trace_sched_switch() >>>>>>>>>               __trace_sched_switch_state() >>>>>>>>>                 task_state_index() >>>>>>>>>                   return 0; >>>>>>>>> >>>>>>>>> TASK_WAKING isn't in TASK_REPORT, so the task appears as TASK_RUNNING in >>>>>>>>> the trace event. >>>>>>>>> >>>>>>>>> Prevent this by pushing the value read from __schedule() down the trace >>>>>>>>> event. >>>>>>>>> >>>>>>>>> Reported-by: Abhijeet Dharmapurikar >>>>>>>>> Signed-off-by: Valentin Schneider >>>>>>>>> Signed-off-by: Peter Zijlstra (Intel) >>>>>>>>> Reviewed-by: Steven Rostedt (Google) >>>>>>>>> Link: https://lore.kernel.org/r/20220120162520.570782-2-valentin.schneider@arm.com >>>>>>>> >>>>>>>> Any objection to picking this for stable? I'm interested in this one for some >>>>>>>> Android users but prefer if it can be taken by stable rather than backport it >>>>>>>> individually. >>>>>>>> >>>>>>>> I think it makes sense to pick the next one in the series too. >>>>>>> >>>>>>> What commit does this fix in Linus's tree? >>>>>> >>>>>> It should be this one: c6e7bd7afaeb ("sched/core: Optimize ttwu() spinning on p->on_cpu") >>>>> >>>>> Should this be okay to be picked up by stable now? I can see AUTOSEL has picked >>>>> it up for v5.15+, but it impacts v5.10 too. >>>> >>>> commit: fa2c3254d7cfff5f7a916ab928a562d1165f17bb >>>> subject: sched/tracing: Don't re-read p->state when emitting sched_switch event >>>> >>>> This patch has an impact on Android 5.10 users who experience tooling breakage. >>>> Is it possible to include in 5.10 LTS please? >>>> >>>> It was already picked up for 5.15+ by AUTOSEL and only 5.10 is missing. >>>> >>> >>> https://lore.kernel.org/stable/Yk2PQzynOVOzJdPo@kroah.com/ >>> >>> However, since then further investigation (still in progress) has shown that this >>> may have been the fault of the tool in question, so if you can verify that tracing >>> sched still works for you with this patch in 5.15.x then by all means >>> let's merge it. >> >> So it turns out the lockup is indeed the fault of the tool, which contains multiple >> kernel-version dependent tracepoint definitions and now fails with this >> patch. > > What tools is this? sysdig - which uses a helper kernel module which accesses tracepoints, but of course (as I just found) with copypasta'd TP definitions, which broke with this patch due to the additional parameter in the function signature. It's been prone to breakage forever because of a lack of a stable kernel ABI. Took me a while to find/figure out, but IMHO better safe than sorry. We've had autoselected scheduler patches before that looked fine but really were not. > >> Greg, please re-enqueue this patch where necessary (5.10, 5.15+) > > If I queue it up again, will the tools keep breaking? Yes, but that's their problem with an out-of-tree module; a few more #ifdefs are not going to make a big difference. thanks Holger