Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756927AbdHYOiB (ORCPT ); Fri, 25 Aug 2017 10:38:01 -0400 Received: from mail.santannapisa.it ([193.205.80.98]:53583 "EHLO mail.santannapisa.it" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755900AbdHYOh7 (ORCPT ); Fri, 25 Aug 2017 10:37:59 -0400 Date: Fri, 25 Aug 2017 16:37:54 +0200 From: Luca Abeni To: Mathieu Poirier Cc: Ingo Molnar , Peter Zijlstra , tj@kernel.org, vbabka@suse.cz, Li Zefan , akpm@linux-foundation.org, weiyongjun1@huawei.com, Juri Lelli , Steven Rostedt , Claudio Scordino , Daniel Bristot de Oliveira , "linux-kernel@vger.kernel.org" , Tommaso Cucinotta Subject: Re: [PATCH 0/7] sched/deadline: fix cpusets bandwidth accounting Message-ID: <20170825163754.08bda23f@luca> In-Reply-To: References: <1502918443-30169-1-git-send-email-mathieu.poirier@linaro.org> <20170822142136.3604336e@luca> X-Mailer: Claws Mail 3.13.2 (GTK+ 2.24.30; x86_64-pc-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2003 Lines: 52 Hi Mathieu, On Wed, 23 Aug 2017 13:47:13 -0600 Mathieu Poirier wrote: > On 22 August 2017 at 06:21, Luca Abeni wrote: > > Hi Mathieu, > > Good day to you, > > > > > On Wed, 16 Aug 2017 15:20:36 -0600 > > Mathieu Poirier wrote: > > > >> This is a renewed attempt at fixing a problem reported by Steve Rostedt [1] > >> where DL bandwidth accounting is not recomputed after CPUset and CPUhotplug > >> operations. When CPUhotplug and some CUPset manipulation take place root > >> domains are destroyed and new ones created, loosing at the same time DL > >> accounting pertaining to utilisation. > > > > Thanks for looking at this longstanding issue! I am just back from > > vacations; in the next days I'll try your patches. > > Do you have some kind of scripts for reproducing the issue > > automatically? (I see that in the original email Steven described how > > to reproduce it manually; I just wonder if anyone already scripted the > > test). > > I didn't bother scripting it since it is so easy to do. I'm eager to > see how things work out on your end. I ran some tests with your patchset, and I confirm that it fixes the issue originally pointed out by Steven. But I still need to run some more tests (I'll continue on Monday). I think I found an issue by: 1) creating two disjoint cpusets (CPUs 0 and 1 in the first cpuset, CPUs 2 and 3 in the second one) and setting sched_load_balance to 0 2) starting a task in one of the two cpusets, and making it SCHED_DEADLINE <--- up to here, everything looks fine 3) setting sched_load_balance to 1 <--- At this point, I think there is a bug: the system has only one root domain, and the task utilization is summed to it... But the task affinity mask is still the one of the "old root domain" that was associated with the cpuset where the task is executing. I still need to run some experiments about this. Thanks, Luca