From: Vincent Guittot
Date: Wed, 23 Jun 2021 18:55:07 +0200
Subject: Re: [powerpc][next-20210621] WARNING at kernel/sched/fair.c:3277 during boot
To: Sachin Sant
Cc: Odin Ugedal, Linux Next Mailing List, linuxppc-dev@lists.ozlabs.org, open list

On Wed, 23 Jun 2021 at 18:46, Sachin Sant wrote:
> >
> > Ok. This becomes even more weird. Could you share your config file and more details about
> > your setup?
> >
> > Have you applied the patch below?
> > https://lore.kernel.org/lkml/20210621174330.11258-1-vincent.guittot@linaro.org/
> >
> > Regarding the load_avg warning, I can see a possible problem during attach. Could you add
> > the patch below. The load_avg warning seems to happen during boot and sched_entity
> > creation.
> >
>
> Here is a summary of my testing.
>
> I have a POWER box with PowerVM hypervisor. On this box I have a logical partition (LPAR) or guest
> (allocated with 32 cpus 90G memory) running linux-next.
>
> I started with a clean slate.
> Moved to linux-next 5.13.0-rc7-next-20210622 as base code.
> Applied patch #1 from Vincent which contains changes to dequeue_load_avg()
> Applied patch #2 from Vincent which contains changes to enqueue_load_avg()
> Applied patch #3 from Vincent which contains changes to attach_entity_load_avg()
> Applied patch #4 from https://lore.kernel.org/lkml/20210621174330.11258-1-vincent.guittot@linaro.org/
>
> With these changes applied I was still able to recreate the issue. I could see a kernel warning
> during boot.
>
> I then applied patch #5 from Odin which contains changes to update_cfs_rq_load_avg()
>
> With all the 5 patches applied I was able to boot the kernel without any warning messages.
> I also ran scheduler related tests from ltp (./runltp -f sched). All tests including cfs_bandwidth01
> ran successfully. No kernel warnings were observed.

OK, so Odin's patch fixes the problem, which highlights that we either overestimate _sum or
don't sync _avg and _sum correctly.

I'm going to look at this further.

>
> Have also attached .config in case it is useful. config has CONFIG_HZ_100=y

Thanks, I will have a look.

>
> Thanks
> -Sachin
>
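
To make the "_avg and _sum out of sync" point concrete, here is a toy sketch in plain C. It is not the kernel code and not any of the patches in this thread: struct toy_avg, sub_clamped(), toy_in_sync(), the two detach helpers and all the numbers are hypothetical, chosen only to show how subtracting an overestimated _sum independently of _avg can leave a zero _sum next to a non-zero _avg, and how re-deriving _sum from _avg (one possible approach, assumed here for illustration) avoids that state.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Toy stand-in for an _avg/_sum pair; NOT the kernel's struct sched_avg. */
struct toy_avg {
	uint64_t load_sum;	/* accumulated sum */
	uint64_t load_avg;	/* expected to track load_sum / divider */
};

/* Subtract without wrapping below zero. */
static uint64_t sub_clamped(uint64_t value, uint64_t delta)
{
	return value > delta ? value - delta : 0;
}

/* Invariant under discussion: a zero _sum must come with a zero _avg. */
static bool toy_in_sync(const struct toy_avg *a)
{
	return a->load_sum != 0 || a->load_avg == 0;
}

/* Remove a contribution by subtracting from _avg and _sum independently. */
static void toy_detach_independent(struct toy_avg *a, uint64_t avg, uint64_t sum)
{
	a->load_avg = sub_clamped(a->load_avg, avg);
	a->load_sum = sub_clamped(a->load_sum, sum);
}

/* Remove a contribution from _avg, then re-derive _sum from _avg. */
static void toy_detach_resync(struct toy_avg *a, uint64_t avg, uint64_t divider)
{
	a->load_avg = sub_clamped(a->load_avg, avg);
	a->load_sum = a->load_avg * divider;
}

/* Hypothetical numbers: the removed contribution's _sum is overestimated. */
void toy_demo(void)
{
	struct toy_avg a = { .load_sum = 1000, .load_avg = 2 };
	struct toy_avg b = a;

	toy_detach_independent(&a, 1, 1000);	/* _sum hits 0, _avg stays at 1 */
	assert(!toy_in_sync(&a));

	toy_detach_resync(&b, 1, 500);		/* _sum becomes 1 * 500 */
	assert(toy_in_sync(&b));
}
```

Calling toy_demo() from a small main() exercises both paths; the first leaves the pair in the inconsistent state, the second does not.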