Received: by 2002:a05:6602:18e:0:0:0:0 with SMTP id m14csp1144287ioo; Fri, 27 May 2022 02:27:26 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwS9esJ/bK6vlrsjL6MJ9kCWrRxDcmExF4DdlhUdijNiuk/BwRANk4z2Z5wHL6vEo1idUHV X-Received: by 2002:a05:6402:3046:b0:42b:505a:4f26 with SMTP id bs6-20020a056402304600b0042b505a4f26mr29674771edb.183.1653643646207; Fri, 27 May 2022 02:27:26 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1653643646; cv=none; d=google.com; s=arc-20160816; b=NNTOZS3J7GiXU8t8YpIJ6n6fD4D+Rwb0TJnspktOrCNfNk/ebFIz30Dp/OJ3eqgTJk GkDe1gU3jCfwa4NyP78pek4d33xjmBmxwjJU5ZF/GzbiSUR46VvkwMmQ4rA7LUEgTWYv z9BYxcMtfBRvTAMfAc/xmGa54z1LX6lqC9Qx4OtKD6ZlLmhAyf+yBLHeGv2Y8nVRiwlt LXyp5Uu4etuDIFVJ3KzOmL/PAX4PIzK7J+u95Ch4woMFor+gETYepWSs1ScMaUp9RE3p vPkr5muN6xykvbgrcpeC5puv/2PhP5Mykl+Kf0Euns3lS/6gb6who86NOQA48A5jROpr p/Tw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=UhmIdgybmPNo1CiSIcVuHOaZlcl+c/C91j2RNLaULfE=; b=g0XVb7EFFC1pRnbpVd6/GBlC/mCpqBKOztWTVLz4Xd0z8r2XhD152UwRvYbyMObv6Y B3e6AhAfLDRiX2NnIC3zgQ/oJoNTZWBFJA+FsrZ894WZhHRtADuxFubH5PD/DpCC0fWR ypEmy5E16NHkrpNqXM/zdJsFCA9JLOG0i0URj6tJnna1aDJOtCjksGRBrKEuCbkubSOO u58RO0zuFDzm3+EUthsB3TUHZT/elbXUeRZFdx/NbU/vb6lg2/zyVcAX82hJw1RkaMAE 0IHk+jpNIgUlIrf3U84HCU1DX8YFiC6YycEn5gaTvPTb9fr5zaXnp80fP8ywQw3ZGJ5x 0Ivg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id mu31-20020a1709068a9f00b006df79ec22fdsi2022499ejc.567.2022.05.27.02.26.59; Fri, 27 May 2022 02:27:26 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1344171AbiE0Ij5 (ORCPT + 99 others); Fri, 27 May 2022 04:39:57 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59454 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1349814AbiE0Ijz (ORCPT ); Fri, 27 May 2022 04:39:55 -0400 Received: from outbound-smtp21.blacknight.com (outbound-smtp21.blacknight.com [81.17.249.41]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 1D1A15FF3 for ; Fri, 27 May 2022 01:39:51 -0700 (PDT) Received: from mail.blacknight.com (pemlinmail04.blacknight.ie [81.17.254.17]) by outbound-smtp21.blacknight.com (Postfix) with ESMTPS id 57240CCBC7 for ; Fri, 27 May 2022 09:39:49 +0100 (IST) Received: (qmail 13188 invoked from network); 27 May 2022 08:39:49 -0000 Received: from unknown (HELO techsingularity.net) (mgorman@techsingularity.net@[84.203.198.246]) by 81.17.254.9 with ESMTPSA (AES256-SHA encrypted, authenticated); 27 May 2022 08:39:49 -0000 Date: Fri, 27 May 2022 09:39:46 +0100 From: Mel Gorman To: Qian Cai Cc: Andrew Morton , Nicolas Saenz Julienne , Marcelo Tosatti , Vlastimil Babka , Michal Hocko , LKML , Linux-MM Subject: Re: [PATCH 0/6] Drain remote per-cpu directly v3 Message-ID: <20220527083946.GF3441@techsingularity.net> References: <20220512085043.5234-1-mgorman@techsingularity.net> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, May 26, 2022 at 01:19:38PM -0400, Qian Cai wrote: > On Thu, May 12, 2022 at 09:50:37AM +0100, Mel Gorman wrote: > > Changelog since v2 > > o More conversions from page->lru to page->[pcp_list|buddy_list] > > o Additional test results in changelogs > > > > Changelog since v1 > > o Fix unsafe RT locking scheme > > o Use spin_trylock on UP PREEMPT_RT > > > > This series has the same intent as Nicolas' series "mm/page_alloc: Remote > > per-cpu lists drain support" -- avoid interference of a high priority > > task due to a workqueue item draining per-cpu page lists. While many > > workloads can tolerate a brief interruption, it may be cause a real-time > > task runnning on a NOHZ_FULL CPU to miss a deadline and at minimum, > > the draining in non-deterministic. > > > > Currently an IRQ-safe local_lock protects the page allocator per-cpu lists. > > The local_lock on its own prevents migration and the IRQ disabling protects > > from corruption due to an interrupt arriving while a page allocation is > > in progress. The locking is inherently unsafe for remote access unless > > the CPU is hot-removed. > > > > This series adjusts the locking. A spinlock is added to struct > > per_cpu_pages to protect the list contents while local_lock_irq continues > > to prevent migration and IRQ reentry. This allows a remote CPU to safely > > drain a remote per-cpu list. > > > > This series is a partial series. Follow-on work should allow the > > local_irq_save to be converted to a local_irq to avoid IRQs being > > disabled/enabled in most cases. Consequently, there are some TODO comments > > highlighting the places that would change if local_irq was used. However, > > there are enough corner cases that it deserves a series on its own > > separated by one kernel release and the priority right now is to avoid > > interference of high priority tasks. > > > > Patch 1 is a cosmetic patch to clarify when page->lru is storing buddy pages > > and when it is storing per-cpu pages. > > > > Patch 2 shrinks per_cpu_pages to make room for a spin lock. Strictly speaking > > this is not necessary but it avoids per_cpu_pages consuming another > > cache line. > > > > Patch 3 is a preparation patch to avoid code duplication. > > > > Patch 4 is a simple micro-optimisation that improves code flow necessary for > > a later patch to avoid code duplication. > > > > Patch 5 uses a spin_lock to protect the per_cpu_pages contents while still > > relying on local_lock to prevent migration, stabilise the pcp > > lookup and prevent IRQ reentrancy. > > > > Patch 6 remote drains per-cpu pages directly instead of using a workqueue. > > Mel, we saw spontanous "mm_percpu_wq" crash on today's linux-next tree > while running CPU offlining/onlining, and wondering if you have any > thoughts? > Do you think it's related to the series and if so why? From the warning, it's not obvious to me why it would be given that it's a warning about a task not being inactive when it is expected to be. -- Mel Gorman SUSE Labs