From: Suren Baghdasaryan
Date: Thu, 8 Jul 2021 13:37:40 -0700
Subject: Re: [PATCH v3 1/1] psi: stop relying on timer_pending for poll_work rescheduling
To: Johannes Weiner
Cc: Peter Zijlstra, Ingo Molnar, Juri Lelli, Vincent Guittot, Dietmar Eggemann, Steven Rostedt, Benjamin Segall, Mel Gorman, Daniel Bristot de Oliveira, matthias.bgg@gmail.com, Minchan Kim, Tim Murray, YT Chang, Wenju Xu (许文举), Jonathan JMChen (陳家明), LKML, linux-arm-kernel@lists.infradead.org, linux-mediatek@lists.infradead.org, kernel-team, SH Chen
References: <20210707023933.1691149-1-surenb@google.com>
List-ID: linux-kernel@vger.kernel.org

On Thu, Jul 8, 2021 at 12:55 PM Suren Baghdasaryan wrote:
>
> On Thu, Jul 8, 2021 at 11:38 AM Johannes Weiner wrote:
> >
> > On Thu, Jul 08, 2021 at 08:54:56AM -0700, Suren Baghdasaryan wrote:
> > > On Thu, Jul 8, 2021 at 7:44 AM Johannes Weiner wrote:
> > > > On Wed, Jul 07, 2021 at 03:43:48PM -0700, Suren Baghdasaryan wrote:
> > > > > On Wed, Jul 7, 2021 at 6:39 AM Johannes Weiner wrote:
> > > > > > This looks good to me now code wise. Just a comment on the comments:
> > > > > >
> > > > > > On Tue, Jul 06, 2021 at 07:39:33PM -0700, Suren Baghdasaryan wrote:
> > > > > > > @@ -559,18 +560,14 @@ static u64 update_triggers(struct psi_group *group, u64 now)
> > > > > > >  	return now + group->poll_min_period;
> > > > > > >  }
> > > > > > >
> > > > > > > -/* Schedule polling if it's not already scheduled. */
> > > > > > > -static void psi_schedule_poll_work(struct psi_group *group, unsigned long delay)
> > > > > > > +/* Schedule polling if it's not already scheduled or forced. */
> > > > > > > +static void psi_schedule_poll_work(struct psi_group *group, unsigned long delay,
> > > > > > > +				   bool force)
> > > > > > >  {
> > > > > > >  	struct task_struct *task;
> > > > > > >
> > > > > > > -	/*
> > > > > > > -	 * Do not reschedule if already scheduled.
> > > > > > > -	 * Possible race with a timer scheduled after this check but before
> > > > > > > -	 * mod_timer below can be tolerated because group->polling_next_update
> > > > > > > -	 * will keep updates on schedule.
> > > > > > > -	 */
> > > > > > > -	if (timer_pending(&group->poll_timer))
> > > > > > > +	/* xchg should be called even when !force to set poll_scheduled */
> > > > > > > +	if (atomic_xchg(&group->poll_scheduled, 1) && !force)
> > > > > > >  		return;
> > > > > >
> > > > > > This explains what the code does, but not why.
> > > > > > It would be good to explain the ordering with poll_work, here or
> > > > > > there. But both sides should mention each other.
> > > > >
> > > > > How about this:
> > > > >
> > > > > /*
> > > > >  * atomic_xchg should be called even when !force to always set poll_scheduled
> > > > >  * and to provide a memory barrier (see the comment inside psi_poll_work).
> > > > >  */
> > > >
> > > > The memory barrier part makes sense, but the first part says what the
> > > > code does and the message is unclear to me. Are you worried somebody
> > > > might turn this around in the future and only conditionalize on
> > > > poll_scheduled when !force? Essentially, I don't see the downside of
> > > > dropping that. But maybe I'm missing something.
> > >
> > > Actually you are right. Originally I was worried that there might be a
> > > case when poll_scheduled==0 and force==true, and if someone flips the
> > > conditions we will reschedule the timer but will not set
> > > poll_scheduled back to 1.
> >
> > Oh I see.
> >
> > Right, flipping the condition doesn't make sense because we need
> > poll_scheduled to be set when we go ahead - whether we're forcing or
> > not. I.e. if we were in a locked section, we'd write it like this:
> >
> > 	if (poll_scheduled)
> > 		if (!force)
> > 			return;
> > 	else
> > 		poll_scheduled = 1;
> >
> > > However I don't think this condition is possible. We set force=true
> > > only when we skipped resetting poll_scheduled to 0, and on initial
> > > wakeup we always reset poll_scheduled. How about changing the comment
> > > to this:
> > >
> > > /*
> > >  * atomic_xchg should be called even when !force to provide a
> > >  * full memory barrier (see the comment inside psi_poll_work).
> > >  */
> >
> > Personally, I still find this more confusing than no comment on
> > !force, because when you read it, it sort of raises the question what
> > the alternatives would be. And the alternatives appear to be
> > nonsensical code rather than legitimate options.
> >
> > But I won't insist if you prefer to leave it in. Your call.
>
> I would like to keep it as a precaution, if you don't mind. In case
> someone in the future thinks about "optimizing" this by flipping the
> condition, hopefully the comment will give them pause to think about
> it :)
> > > >
> > > > /*
> > > >  * A task change can race with the poll worker that is supposed to
> > > >  * report on it. To avoid missing events, ensure ordering between
> > > >  * poll_scheduled and the task state accesses, such that if the poll
> > > >  * worker misses the state update, the task change is guaranteed to
> > > >  * reschedule the poll worker:
> > > >  *
> > > >  * poll worker:
> > > >  *   atomic_set(poll_scheduled, 0)
> > > >  *   smp_mb()
> > > >  *   LOAD states
> > > >  *
> > > >  * task change:
> > > >  *   STORE states
> > > >  *   if atomic_xchg(poll_scheduled, 1) == 0:
> > > >  *     schedule poll worker
> > > >  *
> > > >  * The atomic_xchg() implies a full barrier.
> > > >  */
> > > > smp_mb();
> > > >
> > > > This gives a high-level view of what's happening but it can still be
> > > > mapped to the code by following the poll_scheduled variable.
> > >
> > > This looks really good to me.
> > > If you agree on the first comment modification, should I respin the
> > > next version?
> >
> > Yeah, sounds good to me!
>
> Thanks! I'll post an update shortly.

v4 is posted at https://lore.kernel.org/patchwork/patch/1455172/
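
To make the scheme settled on above concrete, the following is a minimal
userspace sketch of the task-change side of the protocol, written with C11
atomics. It is an illustration, not the kernel code: struct group,
schedule_poll_work() and wake_poll_worker() are invented stand-ins for
struct psi_group, psi_schedule_poll_work() and the poll kthread wakeup, and
C11's sequentially consistent atomic_exchange() plays the role of the
kernel's fully ordered atomic_xchg().

#include <stdatomic.h>
#include <stdbool.h>
#include <stdio.h>

/* Invented stand-in for struct psi_group; only the field involved in
 * the scheduling protocol is modeled. */
struct group {
	atomic_int poll_scheduled;	/* 1 while poll work is queued */
};

/* Invented stand-in for waking the poll worker kthread. */
static void wake_poll_worker(struct group *g)
{
	printf("poll worker scheduled\n");
}

/*
 * Task-change side. The exchange runs unconditionally, even when force
 * is true: it marks the work as scheduled and, being fully ordered,
 * orders the caller's earlier state stores against the poll worker's
 * state loads (see the worker side below).
 */
static void schedule_poll_work(struct group *g, bool force)
{
	if (atomic_exchange(&g->poll_scheduled, 1) && !force)
		return;	/* already scheduled and no forced reschedule */

	wake_poll_worker(g);
}

Per the thread above, force=true is only passed from paths that skipped
resetting poll_scheduled to 0, which is why the poll_scheduled==0 &&
force==true combination cannot arise.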
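
And the matching poll-worker side, continuing the same sketch: clearing
poll_scheduled is ordered before the state reads by an explicit fence, the
userspace analogue of the smp_mb() in psi_poll_work. The state reload is
left as a placeholder comment; the real worker also re-evaluates triggers
and may reschedule itself with force=true.

/*
 * Pairing (mirroring the comment proposed above):
 *
 *   poll worker:                  task change:
 *     poll_scheduled = 0            STORE states
 *     full barrier                  if xchg(poll_scheduled, 1) == 0:
 *     LOAD states                       schedule poll worker
 *
 * Either the worker sees the new states, or the task change finds
 * poll_scheduled == 0 and reschedules the worker; events cannot be
 * missed.
 */
static void poll_work(struct group *g)
{
	atomic_store_explicit(&g->poll_scheduled, 0, memory_order_relaxed);
	atomic_thread_fence(memory_order_seq_cst);	/* smp_mb() analogue */

	/* ... reload task states and evaluate triggers here ... */
}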