Received: by 2002:a05:6358:16cc:b0:ea:6187:17c9 with SMTP id r12csp11937620rwl; Tue, 3 Jan 2023 06:56:24 -0800 (PST) X-Google-Smtp-Source: AMrXdXvL5R0h25Vw0Cnx5ZpGbI33cEpUSpXfhqRzsrXl4qC9kqMpdvFTEmzTRldhbejcpPemSD/i X-Received: by 2002:a17:90b:3555:b0:219:e38b:5079 with SMTP id lt21-20020a17090b355500b00219e38b5079mr52116245pjb.38.1672757784595; Tue, 03 Jan 2023 06:56:24 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1672757784; cv=none; d=google.com; s=arc-20160816; b=zUkUUkF52SNwMN6dayGU0yerjumWQ4wdrvzuf1TZf/TOwE8PG2n65p9aBxtQFAirNt GXyeHJPEFc5ev6aF48FaiWj+vxuJHi33Urmx9OvzAQPaqwfFbsm1LdVLyCN4hcPz4OYF yQs0S7JkOjO7cPPlKg4y/6UWIw/sVB0hNjECeYqh+1OfHAihpdI+NJILt6prI2sMHACU cvyCKU0TAzgXQQtdryPzEPv3ndlRq9nh0m/SDv6J0xChBHSP2AJ7zACNjnMGRViXiDCg l7wwzMvWwGdCe34ezRi/ztyV7kUtJuHBhhVq91WbqUR1ZmqQ8zQIo7+TNM1RZv985b0w 1fzQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=QMZBZ22IGBh45Cf/NuWC5djX29Cio7BfdtoICqFLw8c=; b=KyuZzOx+Ic6LJchMQkodTytE2g7WkpPOPNbMb/Nth65gEWy3vhQt0WgG+IZpCmcnAS 0I8jT5lCKlPFP7iug1dAnj/sXypz3FUuWL/suWAdFpa7c3qg/sWgmYjDIdyQRQSrjIez bgc+4iS9Y2kfWFe5BulUtXn8rPpg0YlEKPjiZRm2r6SN54USLfLGDBbWbdLJcj56l9wz 7hQeOh6mG10Hvze0wka0vDf9q3mdLzf9jJroHSv9Re7ITR8+yoS24yYb8Rd3LHBCetUL lD61hMBJqPAmLT+i8L7voOAN67VEptHxMf7C6p3g8vqrumBWatoWk9Mxg50CH1qsiN80 zoQg== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail (test mode) header.i=@armlinux.org.uk header.s=pandora-2019 header.b=vzoZM0xw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=armlinux.org.uk Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id s31-20020a63525f000000b00478aa71aa99si16434124pgl.717.2023.01.03.06.56.16; Tue, 03 Jan 2023 06:56:24 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=fail (test mode) header.i=@armlinux.org.uk header.s=pandora-2019 header.b=vzoZM0xw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=armlinux.org.uk Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237756AbjACOei (ORCPT + 60 others); Tue, 3 Jan 2023 09:34:38 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:47612 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230214AbjACOed (ORCPT ); Tue, 3 Jan 2023 09:34:33 -0500 Received: from pandora.armlinux.org.uk (pandora.armlinux.org.uk [IPv6:2001:4d48:ad52:32c8:5054:ff:fe00:142]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A6725EE27 for ; Tue, 3 Jan 2023 06:34:31 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=armlinux.org.uk; s=pandora-2019; h=Sender:In-Reply-To:Content-Type: MIME-Version:References:Message-ID:Subject:Cc:To:From:Date:Reply-To: Content-Transfer-Encoding:Content-ID:Content-Description:Resent-Date: Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=QMZBZ22IGBh45Cf/NuWC5djX29Cio7BfdtoICqFLw8c=; b=vzoZM0xwgbok6IYxVXqs12zdCr TuabIQ9nU3oiOJcpn9227fxmFE4ePZADHDcPzbXP9D4grvwld0YCZER64IK63BKVAL5Nri8yfouF5 UozlfO+LfJiiOOCzwcuhQ0Ps85nz22Bii8JwmXzQW8TZ8xzxRI145iD+s2S50qh/ed/fZsjfhukqA 84NvaCAM9zCsi2EUdYd+esPEM1WXtvVCvi15lC30pD97myua40DT4pM/82lz/YSb781nVP3v28Qts C5hg2a5MKZeQNYrm7Z+LdV3NhgU71wYvzsi2WgPF14ceGCYvTa6HYqdghaNekCBRPW2nt7hPI1NMv 60ptpCyA==; Received: from shell.armlinux.org.uk ([fd8f:7570:feb6:1:5054:ff:fe00:4ec]:35932) by pandora.armlinux.org.uk with esmtpsa (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1pCiN2-0005QQ-2I; Tue, 03 Jan 2023 14:34:27 +0000 Received: from linux by shell.armlinux.org.uk with local (Exim 4.94.2) (envelope-from ) id 1pCiMz-00028E-7l; Tue, 03 Jan 2023 14:34:25 +0000 Date: Tue, 3 Jan 2023 14:34:25 +0000 From: "Russell King (Oracle)" To: Jens Axboe Cc: Hui Tang , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org Subject: Re: [bug-report] possible performance problem in ret_to_user_from_irq Message-ID: References: <7ecb8f3c-2aeb-a905-0d4a-aa768b9649b5@huawei.com> <50a5ebdb-4107-26cc-a2f6-da551d99ff38@kernel.dk> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <50a5ebdb-4107-26cc-a2f6-da551d99ff38@kernel.dk> Sender: Russell King (Oracle) X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Jan 03, 2023 at 07:25:26AM -0700, Jens Axboe wrote: > On 1/3/23 3:06?AM, Russell King (Oracle) wrote: > > On Mon, Dec 26, 2022 at 04:45:20PM +0800, Hui Tang wrote: > >> hi folks. > >> > >> I found a performance problem which is introduced by commit > >> 32d59773da38 ("arm: add support for TIF_NOTIFY_SIGNAL"). > >> After the commit, any bit in the range of 0..15 will cause > >> do_work_pending() to be invoked. More frequent do_work_pending() > >> invoked possible result in worse performance. > >> > >> Some of the tests I've done? as follows: > >> lmbench test base with patch > >> ./lat_ctx -P 1 -s 0 2 7.3167 11.04 > >> ./lat_ctx -P 1 -s 16 2 8.0467 14.5367 > >> ./lat_ctx -P 1 -s 64 2 7.8667 11.43 > >> ./lat_ctx -P 1 -s 16 16 16.47 18.3667 > >> ./lat_pipe -P 1 28.1671 44.7904 > >> > >> libMicro-0.4.1 test base with patch > >> ./cascade_cond -E -C 200\ > >> -L -S -W -N "c_cond_1" -I 100 286.3333 358 > >> > >> When I adjust test bit, the performance problem gone. > >> - movs r1, r1, lsl #16 > >> + ldr r2, =#_TIF_WORK_MASK > >> + tst r1, r2 > >> > >> Does anyone have a good suggestion for this problem? > >> should just test _TIF_WORK_MASK, as before? > > > > I think it should be fine - but I would suggest re-organising the > > TIF definitions so that those TIF bits that shouldn't trigger > > do_work_pending are not in the first 16 bits. > > > > Note that all four bits in _TIF_SYSCALL_WORK need to stay within > > an 8-bit even-bit-aligned range, so the value is suitable for an > > immediate assembly constant. > > > > I'd suggest moving the TIF definitions for 20 to 19, and 4..7 to > > 20..23, and then 8 to 4. > > Like this? > > diff --git a/arch/arm/include/asm/thread_info.h b/arch/arm/include/asm/thread_info.h > index aecc403b2880..7f092cb55a41 100644 > --- a/arch/arm/include/asm/thread_info.h > +++ b/arch/arm/include/asm/thread_info.h > @@ -128,15 +128,16 @@ extern int vfp_restore_user_hwstate(struct user_vfp *, > #define TIF_NEED_RESCHED 1 /* rescheduling necessary */ > #define TIF_NOTIFY_RESUME 2 /* callback before returning to user */ > #define TIF_UPROBE 3 /* breakpointed or singlestepping */ > -#define TIF_SYSCALL_TRACE 4 /* syscall trace active */ > -#define TIF_SYSCALL_AUDIT 5 /* syscall auditing active */ > -#define TIF_SYSCALL_TRACEPOINT 6 /* syscall tracepoint instrumentation */ > -#define TIF_SECCOMP 7 /* seccomp syscall filtering active */ > -#define TIF_NOTIFY_SIGNAL 8 /* signal notifications exist */ > +#define TIF_NOTIFY_SIGNAL 4 /* signal notifications exist */ > > #define TIF_USING_IWMMXT 17 > #define TIF_MEMDIE 18 /* is terminating due to OOM killer */ > -#define TIF_RESTORE_SIGMASK 20 > +#define TIF_RESTORE_SIGMASK 19 > +#define TIF_SYSCALL_TRACE 20 /* syscall trace active */ > +#define TIF_SYSCALL_AUDIT 21 /* syscall auditing active */ > +#define TIF_SYSCALL_TRACEPOINT 22 /* syscall tracepoint instrumentation */ > +#define TIF_SECCOMP 23 /* seccomp syscall filtering active */ > + > > #define _TIF_SIGPENDING (1 << TIF_SIGPENDING) > #define _TIF_NEED_RESCHED (1 << TIF_NEED_RESCHED) Yep, LGTM, thanks. -- RMK's Patch system: https://www.armlinux.org.uk/developer/patches/ FTTP is here! 40Mbps down 10Mbps up. Decent connectivity at last!