Received: by 2002:a25:d7c1:0:0:0:0:0 with SMTP id o184csp4035749ybg; Fri, 25 Oct 2019 12:23:37 -0700 (PDT) X-Google-Smtp-Source: APXvYqwWyrYIJYNe4cKhLniBMxtdqpEjobDwPCfFL6bl3O6jvE5dlHwBFpjO4f1zcGCAGu4DTEZT X-Received: by 2002:a50:fd03:: with SMTP id i3mr3922163eds.70.1572031416950; Fri, 25 Oct 2019 12:23:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1572031416; cv=none; d=google.com; s=arc-20160816; b=b7ljD1qROqHpvCL/HCnDk+s6lC8mOL5eE/V2RlivmDXg/UktBSf2zMaZf0thdOKfjJ WB2ppmFyF8SBSyoLlhLfAWtQCG62dufdSz1Z6Vwoh9IbpuWTt2pWMHmrzSUFPCe0Gf5H nOjoO/6mXhyaWEkFF2CLPeD05gFgg2YeEiS5CLX/AvR5XRwpr0rUw6bdKHG86UDnvdzY hVKG3b4gddT2jtWs+/P2yMtjD0oN+mU8cJ0DMEJ+qoYWO2aM5NfK6h4OMO+e6zZTUhbW cnEE1vt7EwxRg790ELhIgja9uyLuS355WoTi9x97T9Ipng05IHhkRlofSLkbeKfE09CK H7IQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:message-id :in-reply-to:date:references:subject:cc:to:from; bh=U333r8D2qaSAKAAfnpnaWvc5NXgw5mBKrNnwzYti7uE=; b=CxqRVnXcLD58pLYZNLnxGgE0VMzPAkOm8zw/PEl7WAczZxHyDk61uN+qjbKThNTiXY bnR0lf2B750KwkFMsQUwfdHX7qgvzxsStDFMZSs2CZuKHDHZAX0H2pfY1y8kv4YSwQit KbEeP46k8YlNLYDmJI0s5go0gbi+AJ4yph77r6orxtwib6czL2Z/c/ABO9EQ2HONdnRK rmg0UqAeRTwQp92Z3lWJau5oYOMU4FwZypkvjvlQsqFJUcysToQBjEolQER8jsOjOBq5 ZwQaGY6U5D4eIACxXVbYc0JK3OaObkt3I2wRNYyBtk+4vvfzb3FXdV36b+7nzcOHkZTP bm2w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o8si2378529edi.34.2019.10.25.12.23.13; Fri, 25 Oct 2019 12:23:36 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2392095AbfJYDat (ORCPT + 99 others); Thu, 24 Oct 2019 23:30:49 -0400 Received: from mga01.intel.com ([192.55.52.88]:25912 "EHLO mga01.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2390875AbfJYDas (ORCPT ); Thu, 24 Oct 2019 23:30:48 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by fmsmga101.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 24 Oct 2019 20:30:48 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.68,227,1569308400"; d="scan'208";a="398638072" Received: from yhuang-dev.sh.intel.com (HELO yhuang-dev) ([10.239.159.29]) by fmsmga005.fm.intel.com with ESMTP; 24 Oct 2019 20:30:46 -0700 From: "Huang\, Ying" To: Dave Hansen Cc: Jonathan Adams , Linux-MM , LKML , "Williams\, Dan J" , "Verma\, Vishal L" , Wu Fengguang Subject: Re: [RFC] Memory Tiering References: Date: Fri, 25 Oct 2019 11:30:46 +0800 In-Reply-To: (Dave Hansen's message of "Thu, 24 Oct 2019 09:33:01 -0700") Message-ID: <87o8y5h57d.fsf@yhuang-dev.intel.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=ascii Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Dave Hansen writes: > On 10/23/19 4:11 PM, Jonathan Adams wrote: >> we would have a bidirectional attachment: >> >> A is marked "move cold pages to" B >> B is marked "move hot pages to" A >> C is marked "move cold pages to" D >> D is marked "move hot pages to" C >> >> By using autonuma for moving PMEM pages back to DRAM, you avoid >> needing the B->A & D->C links, at the cost of migrating the pages >> back synchronously at pagefault time (assuming my understanding of how >> autonuma works is accurate). >> >> Our approach still lets you have multiple levels of hierarchy for a >> given socket (you could imaging an "E" node with the same relation to >> "B" as "B" has to "A"), but doesn't make it easy to represent (say) an >> "E" which was equally close to all sockets (which I could imagine for >> something like remote memory on GenZ or what-have-you), since there >> wouldn't be a single back link; there would need to be something like >> your autonuma support to achieve that. >> >> Does that make sense? > > Yes, it does. We've actually tried a few other approaches separate from > autonuma-based ones for promotion. For some of those, we have a > promotion path which is separate from the demotion path. > > That said, I took a quick look to see what the autonuma behavior was and > couldn't find anything obvious. Ying, when moving a slow page due to > autonuma, do we move it close to the CPU that did the access, or do we > promote it to the DRAM close to the slow memory where it is now? Now in autonuma, the slow page will be moved to the CPU that did the access. So I think Jonathan's requirement has been covered already. Best Regards, Huang, Ying