Date: Sat, 10 Nov 2018 10:41:52 -0800
From: Greg Kroah-Hartman
To: Daniel Wagner
Cc: stable@vger.kernel.org, linux-kernel@vger.kernel.org, Daniel Wagner, Sebastian Andrzej Siewior, Peter Zijlstra, Thomas Gleixner
Subject: Re: [PATCH v4.4] x86/kconfig: Fall back to ticket spinlocks
Message-ID: <20181110184152.GA890@kroah.com>
References: <20181031081458.13996-1-wagi@monom.org>
In-Reply-To: <20181031081458.13996-1-wagi@monom.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Oct 31, 2018 at 09:14:58AM +0100, Daniel Wagner wrote:
> From: Daniel Wagner
>
> Sebastian writes:
>
> """
> We reproducibly observe cache line starvation on a Core2Duo E6850 (2
> cores), an i5-6400 SKL (4 cores) and on an NXP LS2044A ARM Cortex-A72 (4
> cores).
>
> The problem can be triggered with a v4.9-RT kernel by starting
>
>     cyclictest -S -p98 -m -i2000 -b 200
>
> and as "load"
>
>     stress-ng --ptrace 4
>
> The reported maximal latency is usually less than 60us. If the problem
> triggers then values around 400us, 800us or even more are reported. The
> upper limit is the -i parameter.
>
> Reproduction with 4.9-RT is almost immediate on Core2Duo, ARM64 and SKL,
> but it took 7.5 hours to trigger on v4.14-RT on the Core2Duo.
>
> Instrumentation always shows the same picture:
>
>   CPU0                                        CPU1
>   => do_syscall_64                            => do_syscall_64
>   => SyS_ptrace                                 => syscall_slow_exit_work
>   => ptrace_check_attach                        => ptrace_do_notify / rt_read_unlock
>   => wait_task_inactive                            rt_spin_lock_slowunlock()
>      -> while task_running()                       __rt_mutex_unlock_common()
>     /   check_task_state()                         mark_wakeup_next_waiter()
>     |     raw_spin_lock_irq(&p->pi_lock);          raw_spin_lock(&current->pi_lock);
>     |     .                                          .
>     |     raw_spin_unlock_irq(&p->pi_lock);          .
>      \  cpu_relax()                                  .
>       -                                              .
>     *IRQ*
>
> In the error case we observe that the while() loop is repeated more than
> 5000 times, which indicates that the pi_lock can be acquired. CPU1, on the
> other side, does not make progress waiting for the same lock with interrupts
> disabled.
>
> This continues until an IRQ hits CPU0. Once CPU0 starts processing the IRQ
> the other CPU is able to acquire pi_lock and the situation relaxes.
> """
>
> This matches the observation for v4.4-rt on a Core2Duo E6850:
>
> CPU 0:
>
> - no progress for a very long time in rt_mutex_dequeue_pi:
>
>   stress-n-1931 0d..11 5060.891219: function: __try_to_take_rt_mutex
>   stress-n-1931 0d..11 5060.891219: function: rt_mutex_dequeue
>   stress-n-1931 0d..21 5060.891220: function: rt_mutex_enqueue_pi
>   stress-n-1931 0....2 5060.891220: signal_generate: sig=17 errno=0 code=262148 comm=stress-ng-ptrac pid=1928 grp=1 res=1
>   stress-n-1931 0d..21 5060.894114: function: rt_mutex_dequeue_pi
>   stress-n-1931 0d.h11 5060.894115: local_timer_entry: vector=239
>
> CPU 1:
>
> - IRQ at 5060.894114 on CPU 1 followed by the IRQ on CPU 0
>
>   stress-n-1928 1....0 5060.891215: sys_enter: NR 101 (18, 78b, 0, 0, 17, 788)
>   stress-n-1928 1d..11 5060.891216: function: __try_to_take_rt_mutex
>   stress-n-1928 1d..21 5060.891216: function: rt_mutex_enqueue_pi
>   stress-n-1928 1d..21 5060.891217: function: rt_mutex_dequeue_pi
>   stress-n-1928 1....1 5060.891217: function: rt_mutex_adjust_prio
>   stress-n-1928 1d..11 5060.891218: function: __rt_mutex_adjust_prio
>   stress-n-1928 1d.h10 5060.894114: local_timer_entry: vector=239
>
> Thomas writes:
>
> """
> This has nothing to do with RT. RT is merely exposing the
> problem in an observable way. The same issue happens with upstream; it's
> harder to trigger and it's harder to observe for obvious reasons.
>
> If you read through the discussions [see the links below] then you
> really see that there is an upstream issue with the x86 qrlock
> implementation and Peter has posted fixes which resolve it, both at
> the practical and the theoretical level.
> """
>
> Backporting all qspinlock-related patches is very likely to introduce
> regressions on v4.4. Therefore, the solution recommended by Peter and
> Thomas is to drop back to ticket spinlocks for v4.4.
>
> Link: https://lkml.kernel.org/r/20180921120226.6xjgr4oiho22ex75@linutronix.de
> Link: https://lkml.kernel.org/r/20180926110117.405325143@infradead.org
> Cc: Sebastian Andrzej Siewior
> Cc: Peter Zijlstra
> Cc: Thomas Gleixner
> Signed-off-by: Daniel Wagner
> ---
>
> Thomas suggests the following plan for fixing the issues on the various
> stable trees:
>
> 4.4: Trivial by switching back to ticket locks.

Now queued up, thanks,

greg k-h
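
For readers unfamiliar with the term, the FIFO property that makes ticket
locks attractive for the case quoted above can be sketched in userspace C11.
This is only an illustration under stated assumptions: the names
(struct ticket_lock, ticket_lock_acquire() and so on) are hypothetical, it is
not the kernel's arch_spinlock_t code, and it does not model the qspinlock
problem itself. Each waiter takes a numbered ticket, so a CPU that releases
the lock and immediately re-acquires it (like the wait_task_inactive() loop
in the report) queues behind any CPU that is already waiting.

```c
#include <assert.h>
#include <stdatomic.h>

/* Hypothetical userspace sketch of a ticket lock; not kernel code. */
struct ticket_lock {
	atomic_uint next;	/* next ticket to hand out */
	atomic_uint owner;	/* ticket that may hold the lock now */
};

static void ticket_lock_init(struct ticket_lock *l)
{
	atomic_init(&l->next, 0);
	atomic_init(&l->owner, 0);
}

/* Nonzero while some ticket has been handed out but not yet released. */
static int ticket_lock_is_locked(struct ticket_lock *l)
{
	return atomic_load_explicit(&l->next, memory_order_relaxed) !=
	       atomic_load_explicit(&l->owner, memory_order_relaxed);
}

static void ticket_lock_acquire(struct ticket_lock *l)
{
	/* Tickets are handed out in arrival order, one per caller. */
	unsigned int me = atomic_fetch_add_explicit(&l->next, 1,
						    memory_order_relaxed);

	/* Spin until our ticket is served; earlier tickets go first. */
	while (atomic_load_explicit(&l->owner, memory_order_acquire) != me)
		;
}

static void ticket_lock_release(struct ticket_lock *l)
{
	unsigned int cur;

	assert(ticket_lock_is_locked(l));

	/* Only the holder writes owner, so a plain load + store suffices. */
	cur = atomic_load_explicit(&l->owner, memory_order_relaxed);
	atomic_store_explicit(&l->owner, cur + 1, memory_order_release);
}
```

A caller would pair ticket_lock_acquire() and ticket_lock_release() around
its critical section. The busy-wait loop is deliberately bare; a real
implementation would add a pause/cpu_relax()-style hint inside it.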