Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp5479103ybl; Tue, 14 Jan 2020 09:34:59 -0800 (PST) X-Google-Smtp-Source: APXvYqytrBEljl4XRLAERPtWScksisefdRmeHwz5mNHup+hbMCXh17l5X8EhzoaXrPETVF7F8JBS X-Received: by 2002:aca:e189:: with SMTP id y131mr16613572oig.111.1579023299587; Tue, 14 Jan 2020 09:34:59 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1579023299; cv=none; d=google.com; s=arc-20160816; b=EBhgu9V32gruoAPctH5t5LPrVjNrwECzTUos9X94JmqyDQ/9H3/3sG/y2PYTxaU4qv jF1PbPYE1iMpM7x6sMOzKPi8Cen1izOGXts9h5DOUhzs2Zn6qrv5Ro5KujsSThKyoE+A /5/VW03aPNLwl84DxSy9k9YHbVyUiDUQzaYZyeJNjQBdb9+n5/QGvhvqT89JbFUrYgNo P7WnAQecrPMRLUkwsaHDiQtHoVKNPbGj1pr13/n5IUa9ydIGHXBJRrKvCtY9pJ/uEyYo 1dYJ4Gk6HaEeLIHbHePkBhAYocrPy6zhVY7wWTPOiNAdET+3xLGL+w2cQtEciSeEx1mG k5qg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :content-language:accept-language:in-reply-to:references:message-id :date:thread-index:thread-topic:subject:cc:to:from; bh=kFI2MRVZqaLfSs8CAizWsofes/JL/wgehHsnSzeKq+w=; b=SsaOvInDszB8vKeAgtibN9e+7IKg7p2vbqDwUH5/fMw0GOwjUznSKEtZLvO0AwikrS 5HRTTMDnksT8REIHiJWdktNnsd+4Bq9+Y4PIiR6NW/7sMh8AP4hyzob6593jSkAVkPLG CnkyfsF6C/CsEAEjeEmM5w8XjnqSG6n2odm5JLCM1U2aWO9XaXshH2N61vuswKszNbiK CNUjG7yXvmQVl0aJSjE121/oqlqwfY4eZ9CFOUHCCpLQb2JjhyID6Ac/XWKlKZ9OgT2x taJM/b68bOQoKXa5XbOzV1qwOFVYgvhLbwZ/QZlyuyugRRRj82sTcF4I/4QPy9kRWv75 s7+w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w18si10036269otl.54.2020.01.14.09.34.48; Tue, 14 Jan 2020 09:34:59 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728794AbgANReA convert rfc822-to-8bit (ORCPT + 99 others); Tue, 14 Jan 2020 12:34:00 -0500 Received: from eu-smtp-delivery-151.mimecast.com ([207.82.80.151]:37887 "EHLO eu-smtp-delivery-151.mimecast.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726270AbgANReA (ORCPT ); Tue, 14 Jan 2020 12:34:00 -0500 Received: from AcuMS.aculab.com (156.67.243.126 [156.67.243.126]) (Using TLS) by relay.mimecast.com with ESMTP id uk-mta-48-ZhPI99vXO2Klxv0T-F0ggw-1; Tue, 14 Jan 2020 17:33:51 +0000 Received: from AcuMS.Aculab.com (fd9f:af1c:a25b:0:43c:695e:880f:8750) by AcuMS.aculab.com (fd9f:af1c:a25b:0:43c:695e:880f:8750) with Microsoft SMTP Server (TLS) id 15.0.1347.2; Tue, 14 Jan 2020 17:33:51 +0000 Received: from AcuMS.Aculab.com ([fe80::43c:695e:880f:8750]) by AcuMS.aculab.com ([fe80::43c:695e:880f:8750%12]) with mapi id 15.00.1347.000; Tue, 14 Jan 2020 17:33:51 +0000 From: David Laight To: 'Steven Rostedt' CC: 'Vincent Guittot' , Peter Zijlstra , Viresh Kumar , Ingo Molnar , Juri Lelli , Dietmar Eggemann , Ben Segall , Mel Gorman , linux-kernel Subject: RE: sched/fair: scheduler not running high priority process on idle cpu Thread-Topic: sched/fair: scheduler not running high priority process on idle cpu Thread-Index: AdXK8cUFXa7JpPXmQNq7oQ32S9fYHAACik4AAADJLkA= Date: Tue, 14 Jan 2020 17:33:50 +0000 Message-ID: <5ba2ae2d426c4058b314c20c25a9b1d0@AcuMS.aculab.com> References: <212fabd759b0486aa8df588477acf6d0@AcuMS.aculab.com> <20200114115906.22f952ff@gandalf.local.home> In-Reply-To: <20200114115906.22f952ff@gandalf.local.home> Accept-Language: en-GB, en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-ms-exchange-transport-fromentityheader: Hosted x-originating-ip: [10.202.205.107] MIME-Version: 1.0 X-MC-Unique: ZhPI99vXO2Klxv0T-F0ggw-1 X-Mimecast-Spam-Score: 0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8BIT Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Steven Rostedt > Sent: 14 January 2020 16:59 > > On Tue, 14 Jan 2020 16:50:43 +0000 > David Laight wrote: > > > I've a test that uses four RT priority processes to process audio data every 10ms. > > One process wakes up the other three, they all 'beaver away' clearing a queue of > > jobs and the last one to finish sleeps until the next tick. > > Usually this takes about 0.5ms, but sometimes takes over 3ms. > > > > AFAICT the processes are normally woken on the same cpu they last ran on. > > There seems to be a problem when the selected cpu is running a (low priority) > > process that is looping in kernel [1]. > > I'd expect my process to be picked up by one of the idle cpus, but this > > doesn't happen. > > Instead the process sits in state 'waiting' until the active processes sleeps > > (or calls cond_resched()). > > > > Is this really the expected behaviour????? > > It is with CONFIG_PREEMPT_VOLUNTARY. I think you want to recompile your > kernel with CONFIG_PREEMPT. The idea is that the RT task will continue > to run on the CPU it last ran on, and would push off the lower priority > task to the idle CPU. But CONFIG_PREEMPT_VOLUNTARY means that this > will have to wait for the running task to not be in kernel context or > hit a cond_resched() which is the "voluntary" scheduling point. I have added a cond_resched() to the offending loop, but a close look implies that code is called with a lock held in another (less common) path so that can't be directly committed and so CONFIG_PREEMPT won't help. Indeed requiring CONFIG_PREEMPT doesn't help when customers are running the application, nor (probably) on AWS since I doubt it is ever the default. Does the same apply to non-RT tasks? I can select almost any priority, but RT ones are otherwise a lot better. I've also seen RT processes delayed by the network stack 'bh' that runs in a softint from the hardware interrupt. That can take a while (clearing up tx and refilling rx) and I don't think we have any control over the cpu it runs on? The cost of ftrace function call entry/exit (about 200 clocks) makes it rather unsuitable for any performance measurements unless only a very few functions are traced - which rather requires you know what the code is doing :-( David - Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK Registration No: 1397386 (Wales)