Received: by 2002:a05:6a10:17d3:0:0:0:0 with SMTP id hz19csp1084960pxb; Thu, 15 Apr 2021 13:41:14 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyO7D8QI6cxgL8IFDM+JeNUu35A9/+WMALkB03AlUyPsTv625J0Gmv1BveghRO0+ZUw4xJH X-Received: by 2002:a05:6402:3079:: with SMTP id bs25mr6420815edb.369.1618519274724; Thu, 15 Apr 2021 13:41:14 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1618519274; cv=none; d=google.com; s=arc-20160816; b=mtWTA1I+ypyjnd+EvosWax9GQIJazHG4fIMQQXhIbUtZvOoQsmmUM29TPV1kRmoPfs xQh+P1+Ov1vtR5P06Eb9QiZT8YOom/UbF5lo2XDQ9fHQR9JC8b4a5exuw8PtnP44AQG/ AewyYUH6JU19BvlCynQ5geiM6yCHP8oMwBjuEvp6aTjNONJ9Oen2wpdHkGqeEDx5wpQJ aFpoh2s3ZSLTq4a2sCctWmDxNFoG4sDdYXPC3TbN9J/7mkOJvth8voKRxp7SnQKPZ8FR m6C9Ge3iO60X71izDcT+n6wUwRk4WeeyL2ZhGFSKeYOU+aIIM1C68nuVqt1dNXmDVlhq K0Mg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=olbuZ/mNJfDWZdGKBFNMZr0U0A3cz9Cq6tnv1KxE8ZQ=; b=SoIJOE1ml4y4ZoPHM1EEdChoM6PXsBpCWZNbCpFu0P2vFIIOXYMtuqdJ6UziDcKtCZ sw29wY9EUvMmrLeKeyJ6uzONm3qNQWgiu9intPGCdzRMEmyRtU3r1FjDT5IT3G+3Kkii Qvneqg+3Wpy0F8duiVVRTDsk3vd2uar1GzhjwnBOIpXsRx2N2FKv0r93qfoRWv3p92kr fWz5rixrDGXyAykQkqKUuZCc9qr5tNxULNwIzoCR+56FdJcG92dOwUHDQOgyCw0DzEM0 OIsnwF/U4b4vKN6O8SR9Qty5viarfWR1LlPLtv/0gyibu3R50UuY92ovVpAvEuCKFIrm GckQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=U2o1zir6; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id do13si2869872ejc.87.2021.04.15.13.40.50; Thu, 15 Apr 2021 13:41:14 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=U2o1zir6; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235443AbhDOUZp (ORCPT + 99 others); Thu, 15 Apr 2021 16:25:45 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60086 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234748AbhDOUZo (ORCPT ); Thu, 15 Apr 2021 16:25:44 -0400 Received: from mail-io1-xd33.google.com (mail-io1-xd33.google.com [IPv6:2607:f8b0:4864:20::d33]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3B8ABC061574 for ; Thu, 15 Apr 2021 13:25:20 -0700 (PDT) Received: by mail-io1-xd33.google.com with SMTP id x16so25552114iob.1 for ; Thu, 15 Apr 2021 13:25:20 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=olbuZ/mNJfDWZdGKBFNMZr0U0A3cz9Cq6tnv1KxE8ZQ=; b=U2o1zir6cWC/gUvJMIzysxsi1RHpyOWNV33sZRt0cR6oKbqe9RFQ1w2sN8E27CvYeP E+8yKaYclrlZORYN8O06PLW9TsNcut21OGQpmOUn21FxUaWMAQobTMWDeHU7vYR77n6s QZjw7Akf3rOol62Brs0jtlXoGTYy3jEnxA+TVbA3uA68/ANFd0ak1TKtiwdJIP+JPsky g9uO3ZvPLTzyr675+tsXEb1OVGYuTpzl6tRFDjDhxV/0+lUKalkOU3Y3Zcol5oeKswKQ Yzy2Edk5uD4Pr+umrlmPZ6Ny5M6mDivqNN0V1Pk8dXQjSUts5Kh9hwIj7/Q1bGv5fME8 Cggw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=olbuZ/mNJfDWZdGKBFNMZr0U0A3cz9Cq6tnv1KxE8ZQ=; b=W54zsz52/Q7wMpBO4BefYMFgsnvXQidGCYb/jYXl9aVOFiigdJepXUIY5s3uMVHuOh 8Z2A1ISVGjgH4pAyQTb6DuZvW0S9vPKDYcy/6eeA0XRXKu5jMVERY+d71/wgF2Tu5E7O EVT+V6/k5INYj7/pBadEeq0FQdmtufNeqvQ2rURq4aT1VkPM4AY0aWjQpwhZTzzV1voe k8zU6eocv2mWsjTrWqyTQM/i5kNMBjbmUDAcI/DLmDCQVTTQVYNcUol+8sDc8y0FYcEC Bk1RUsLj+2CkdQIi/B9WUc/DHNbxCHWrz4amvpKK/xqIdNXckTw0nIALNpDl/Oy4cvGZ MXAQ== X-Gm-Message-State: AOAM531DTsIEaIuifelOsEkSEsdktI8tvSRMho239KX5kEQ8WVyfe2L9 r7KB2F54mkMeZj+IAS3+kYDNJlm8VcRbfyi2DZt0WUm2rkJshQ== X-Received: by 2002:a5d:9c03:: with SMTP id 3mr802184ioe.32.1618518319512; Thu, 15 Apr 2021 13:25:19 -0700 (PDT) MIME-Version: 1.0 References: <20210401183216.443C4443@viggo.jf.intel.com> <20210401183219.DC1928FA@viggo.jf.intel.com> <20210414080849.GA20886@linux> <6215a690-d14a-de7e-72cb-1aa4e2822f2e@intel.com> In-Reply-To: <6215a690-d14a-de7e-72cb-1aa4e2822f2e@intel.com> From: Wei Xu Date: Thu, 15 Apr 2021 13:25:08 -0700 Message-ID: Subject: Re: [PATCH 02/10] mm/numa: automatically generate node migration order To: Dave Hansen Cc: Oscar Salvador , Dave Hansen , Linux MM , Linux Kernel Mailing List , Yang Shi , David Rientjes , Huang Ying , Dan Williams , David Hildenbrand Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Apr 15, 2021 at 8:35 AM Dave Hansen wrote: > > This can help enable more flexible demotion policies to be > > configured, such as to allow a cgroup to allocate from all fast tier > > nodes, but only demote to a local slow tier node. Such a policy can > > reduce memory stranding at the fast tier (compared to if memory > > hardwall is used) and still allow demotion from all fast tier nodes > > without incurring the expensive random accesses to the demoted pages > > if they were demoted to remote slow tier nodes. > > Could you explain this stranding effect in a bit more detail? I'm not > quite following. By memory stranding, I mean that memory on a machine (or a NUMA node) cannot be utilized even under extremely high work loads. Memory stranding happens usually due to mismatches between job/machine shapes as well as resource fragmentation resulted from bin-packing scheduling. It is an important problem for cloud resource efficiency. If NUMA hardwalling is used, we effectively split a single machine into multiple smaller machines based on NUMA nodes. This changes the machine shapes and also makes memory more fragmented, which can lead to more memory being stranded. Here is a simple example: Suppose that each machine has 2 NUMA nodes, each with 4 cores and 5GB RAM, and all the jobs have the shape of 2 CPUs and 3GB memory. Without NUMA memory hardwalling, we can pack 3 jobs onto each machine, which leaves 1GB memory and 2 cores in stranding. However, with NUMA memory hardwalling enabled, we can then only pack 2 jobs onto each machine (one job on each NUMA node), which increases the resource stranding to 4GB memory and 4 cores.