Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752873AbdLSQBx (ORCPT ); Tue, 19 Dec 2017 11:01:53 -0500 Received: from resqmta-ch2-06v.sys.comcast.net ([69.252.207.38]:43720 "EHLO resqmta-ch2-06v.sys.comcast.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750898AbdLSQBs (ORCPT ); Tue, 19 Dec 2017 11:01:48 -0500 Date: Tue, 19 Dec 2017 10:01:46 -0600 (CST) From: Christopher Lameter X-X-Sender: cl@nuc-kabylake To: Peter Zijlstra cc: Frederic Weisbecker , LKML , Chris Metcalf , Thomas Gleixner , Luiz Capitulino , "Paul E . McKenney" , Ingo Molnar , Wanpeng Li , Mike Galbraith , Rik van Riel Subject: Re: [PATCH 4/5] sched/isolation: Residual 1Hz scheduler tick offload In-Reply-To: <20171219091911.tg2k4w7mgv2bcmeb@hirez.programming.kicks-ass.net> Message-ID: References: <1513653838-31314-1-git-send-email-frederic@kernel.org> <1513653838-31314-5-git-send-email-frederic@kernel.org> <20171219091911.tg2k4w7mgv2bcmeb@hirez.programming.kicks-ass.net> User-Agent: Alpine 2.20 (DEB 67 2015-01-07) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII X-CMAE-Envelope: MS4wfN45HgzzLQy18ueHQ8EPqixFqeW312/cq8tKXOMqebCKMmvssi2bn4XKakloS5LE7/pzmriI9zY6l8Ja2RGrfz4tSW34trMJvzvVmi30tZPld9z0aZq2 a5XW9By5dHdVokeXrU+Bvj6uOt+5IrxZFpbSunluongvDYYLhCuW+Qzp9R3Pd61VPQ9AJaOHWu5H2uJyMJ3Po5xM6gb35WjW8AfNpx141GFOmutrhfAJuEXe J8eNmqO6Y4Kvk7fU4UA8guBF1QvfplXqG8tNFG1H19wPXeU9MhS83M5ZQXp0rYEsY4h41L1Bhp9k/NgrUe1rzz2ywLyfbhUH6suonYDzlVP14geGBujaS5HM s50PjDc19SabTRBsXLrk9vUTOi71UJvMZmXaHXSxPF8Qi2GIZoCmh82ljD2M/3e7gyAfOzrlgzBxCym6Egt/KWeSQ9GdDWFsqCk+Ie0cuJsTlQVnozQZ4pa3 NIY1wQAA7GU5uIaQ Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1422 Lines: 29 On Tue, 19 Dec 2017, Peter Zijlstra wrote: > On Tue, Dec 19, 2017 at 04:23:57AM +0100, Frederic Weisbecker wrote: > > When a CPU runs in full dynticks mode, a 1Hz tick remains in order to > > keep the scheduler stats alive. However this residual tick is a burden > > for Real-Time tasks that can't stand no interruption at all. > > I'm not sure that is accurate. RT doesn't necessarily have anything much > to so with this. The tick is per definition very deterministic and thus > should not be a problem. Depends what one means by RT. Certainly if you want bare metal performance then the tick is a problem. There are numerous loads sensitive to cpu denial through the scheduler tick for a few microseconds. MPI and software that rendevouz frequently in HPC are one example. Here at my workplace we regular monitor these effects and would like a system that runs as clean as possible. > > Adding the boot parameter "isolcpus=nohz_offload" will now outsource > > these scheduler ticks to the global workqueue so that a housekeeping CPU > > handles that tick remotely. > > The global workqueue sounds horrific; surely you want at least one such > housekeeping CPU per node or something ? Well that determines how many processors you can free from the OS noise. Having one workqueue per node sacrifices one core more to the operating system. But it would increase OS performance. So maybe we can have that configurable?