Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp6438617ybl; Wed, 15 Jan 2020 04:46:33 -0800 (PST) X-Google-Smtp-Source: APXvYqzdSBUFVmzzz9CnNIGOE2D6gISGM5TWTQVltDQw20Af+yHwdDaliYxmEY0bedFaIUOEu98Z X-Received: by 2002:a05:6808:aac:: with SMTP id r12mr21196612oij.59.1579092393447; Wed, 15 Jan 2020 04:46:33 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1579092393; cv=none; d=google.com; s=arc-20160816; b=gn/sYC9yBa4fjek49LnVZ9JsTgTDOoUYEYk9hhabJ9KUCga7Ouh5yxoT8oBNoHdFLX +sjmEW5IbOCLhmdA/P9CipoeQlUtjDGAGNbuCJN7QILct4foLF0OJf0Y+iYG1PIGD9vi M4k3xJ1FlIN0OkS0FzEnwGnpJAgYAjJcOZX/ZPOPhdhP9IbRk5Mnp/ieHioV8alKbt7x aZ9N6l0ZWi4ND5hkVsv9iF1nLGyhgHVgYz2AuLdudC5lIsE+sHW1PmEJz1/fNaXmaunv XryVl53BPMXmqv3i/hld0kojYPg3LjC0KFDZFv+jsYOh9PSDahayATtdNaQg6dlbWV/D JlMA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :content-language:accept-language:in-reply-to:references:message-id :date:thread-index:thread-topic:subject:cc:to:from; bh=VvK5NYnA0d9R9z9RUOqnfKchq4JxkqJRZq02E2ox61U=; b=Gu77c4yZmMLtNqlr/PDWP50ZaRAxzU0LZkX56wAZv4f+90enQTM6dXJG52sOOe6STh Gb2m11MACbnGZhQZ+SGM+hu6XHSh1tmGhvJq2dvoqzgtoviomZVZS7oXIxzQ40SoEn1F 3yQs/GI+3YatXVt9iHUvBHkBNMwKzw2Uq7uD/825UGqoFcdZyCJLFBpe5+s75EYfpgjK fE6B2p58nyPHwW3mHFQKAX9FCoAmIKQqKJaLN2yYb16KDsLnPs7Hv2qAnL4RDdzwqJ7L /4bvdGEsLMJDFVMUjA+7Yen6WFXw5GUpcXByJ1A2iCaYSCXBRvTzbCA8YH96SFYpn4v6 WR/Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f203si9231065oib.56.2020.01.15.04.46.20; Wed, 15 Jan 2020 04:46:33 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729145AbgAOMoZ convert rfc822-to-8bit (ORCPT + 99 others); Wed, 15 Jan 2020 07:44:25 -0500 Received: from eu-smtp-delivery-151.mimecast.com ([207.82.80.151]:35849 "EHLO eu-smtp-delivery-151.mimecast.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725999AbgAOMoY (ORCPT ); Wed, 15 Jan 2020 07:44:24 -0500 Received: from AcuMS.aculab.com (156.67.243.126 [156.67.243.126]) (Using TLS) by relay.mimecast.com with ESMTP id uk-mta-10-dGzufp_XNGKEFWd_Sp1x8w-1; Wed, 15 Jan 2020 12:44:20 +0000 Received: from AcuMS.Aculab.com (fd9f:af1c:a25b:0:43c:695e:880f:8750) by AcuMS.aculab.com (fd9f:af1c:a25b:0:43c:695e:880f:8750) with Microsoft SMTP Server (TLS) id 15.0.1347.2; Wed, 15 Jan 2020 12:44:19 +0000 Received: from AcuMS.Aculab.com ([fe80::43c:695e:880f:8750]) by AcuMS.aculab.com ([fe80::43c:695e:880f:8750%12]) with mapi id 15.00.1347.000; Wed, 15 Jan 2020 12:44:19 +0000 From: David Laight To: 'Steven Rostedt' CC: 'Vincent Guittot' , Peter Zijlstra , Viresh Kumar , Ingo Molnar , Juri Lelli , Dietmar Eggemann , Ben Segall , Mel Gorman , linux-kernel Subject: RE: sched/fair: scheduler not running high priority process on idle cpu Thread-Topic: sched/fair: scheduler not running high priority process on idle cpu Thread-Index: AdXK8cUFXa7JpPXmQNq7oQ32S9fYHAACik4AAADJLkAAAO3PAAAmXEgg Date: Wed, 15 Jan 2020 12:44:19 +0000 Message-ID: <878a35a6642d482aa0770a055506bd5e@AcuMS.aculab.com> References: <212fabd759b0486aa8df588477acf6d0@AcuMS.aculab.com> <20200114115906.22f952ff@gandalf.local.home> <5ba2ae2d426c4058b314c20c25a9b1d0@AcuMS.aculab.com> <20200114124812.4d5355ae@gandalf.local.home> In-Reply-To: <20200114124812.4d5355ae@gandalf.local.home> Accept-Language: en-GB, en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-ms-exchange-transport-fromentityheader: Hosted x-originating-ip: [10.202.205.107] MIME-Version: 1.0 X-MC-Unique: dGzufp_XNGKEFWd_Sp1x8w-1 X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: aculab.com Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8BIT Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Steven Rostedt > Sent: 14 January 2020 17:48 > > On Tue, 14 Jan 2020 17:33:50 +0000 > David Laight wrote: > > > I have added a cond_resched() to the offending loop, but a close look implies > > that code is called with a lock held in another (less common) path so that > > can't be directly committed and so CONFIG_PREEMPT won't help. > > > > Indeed requiring CONFIG_PREEMPT doesn't help when customers are running > > the application, nor (probably) on AWS since I doubt it is ever the default. > > > > Does the same apply to non-RT tasks? > > I can select almost any priority, but RT ones are otherwise a lot better. > > > > I've also seen RT processes delayed by the network stack 'bh' that runs > > in a softint from the hardware interrupt. > > That can take a while (clearing up tx and refilling rx) and I don't think we > > have any control over the cpu it runs on? > > Yes, even with CONFIG_PREEMPT, Linux has no guarantees of latency for > any task regardless of priority. If you have latency requirements, then > you need to apply the PREEMPT_RT patch (which may soon make it to > mainline this year!), which spin locks and bh wont stop a task from > scheduling (unless they need the same lock) We're not trying to do anything life-threatening. So the latency requirements are only moderate - failures mess up telephone audio quality. There is also allowance for jitter elsewhere. OTOH not running a high priority process when there are idle cpu seems 'sub-optimal'. Code that runs with a spin-lock held (or otherwise disables preemption) for significant periods probably ought to be detected and warned. I'm not sure of a suitable limit, 100us is probably excessive on x86. IIUC PREEMPT_RT adds overhead to quite a bit of code and is unlikely to get enabled in 'distro' kernels. Especially since they've not enabled CONFIG_PREEMPT which probably has a lower impact - provided the cv+mutex wakeup has been arranged to avoid the treble process switch. Running the driver bh (which is often significant) from a high priority worker thread instead of a softint (which isn't much different to the 'hardint' it is scheduled from) probably doesn't cost much (in-kernel process switches shouldn't be much more than a stack switch). That would benefit RT processes since they could be higher priority than the bh code. Although you'd probably want a 'strongly preferred' cpu for them. David - Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK Registration No: 1397386 (Wales)