Received: by 2002:a05:6358:45e:b0:b5:b6eb:e1f9 with SMTP id 30csp1533722rwe; Thu, 1 Sep 2022 22:13:23 -0700 (PDT) X-Google-Smtp-Source: AA6agR57S/GTlEvyB1NxGLwNXTfhpMaG+CIuFF7cNPEloEpCyxDbPAri1zBYAVuxrqkIgel73cpR X-Received: by 2002:a17:907:1690:b0:731:56b6:fded with SMTP id hc16-20020a170907169000b0073156b6fdedmr26388451ejc.119.1662095603212; Thu, 01 Sep 2022 22:13:23 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1662095603; cv=none; d=google.com; s=arc-20160816; b=g9YdMaMhcxHlnjbvtl+F8NhInEr4+8NyOXQziv9JoVbR7jzKIihPj8yJ3KqpR3huJi HP5WDsZE4KLpiepqvAnMUvQBQZhD0uOWDUrxTVWExbQJaO585HpFbitXG+EeODSoRFo5 Oam+MmphfGVT0DLf69Ub9clYH4P8oQS9AWWXeWqGkYsXTTUH9JOrkVuMkXX0jTlsEznX RVNkNM9y1/QL9s0xIOQMUpA4S3FGzUxyfE8ECrCtGfO7+edmUv44PCebkioPMRYFZTdI bps5dsFAqLuc2O+qL0OxK10FebxmWiUWeK+Sq9AWeWzwysIAHhIdRoQbKRNEP0ThFsYt 1kZw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=PmZ+jeHheCTitnwYvU2IM5Mh5fQ9dWuFRDxNtLNgxGM=; b=gkRpfpPFtyI45t2aX+Bpp5DKrEO+IxHNLTsySdtasm4349EX/eUpcPvfuDWVJjcms4 a5EQoclamoPE3u/hVwRQWax7A7D1VPh/qccBnzRU2OWXuojAeLN0IcpEUyD5xB5z0l6y X8jM0+kWAvPbL0LDjQzbgFKwLWz7U6c/dsUOXNfr+7/kSU5tEmRJ4SAepa8dDeKPTrE3 kzs/ud0WzdZ3mTPUAtrXJ5U7P9vUdymwIrcHYetqkEfgO9p+dbHSR/WfOUcLXgSSYBu9 YRxZO8mdecmIbvLYV8ukBjQ88xCIpq0rITDJLUpLaxVVrT+NOE9IBrdLrh04HhekrX9B rgZQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=kGybGFCn; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id c3-20020aa7df03000000b00448176995cfsi780562edy.500.2022.09.01.22.12.57; Thu, 01 Sep 2022 22:13:23 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=kGybGFCn; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230295AbiIBFJ1 (ORCPT + 99 others); Fri, 2 Sep 2022 01:09:27 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39182 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229528AbiIBFJZ (ORCPT ); Fri, 2 Sep 2022 01:09:25 -0400 Received: from mail-pj1-x1032.google.com (mail-pj1-x1032.google.com [IPv6:2607:f8b0:4864:20::1032]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E43BCB5E60 for ; Thu, 1 Sep 2022 22:09:24 -0700 (PDT) Received: by mail-pj1-x1032.google.com with SMTP id q3so962791pjg.3 for ; Thu, 01 Sep 2022 22:09:24 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date; bh=PmZ+jeHheCTitnwYvU2IM5Mh5fQ9dWuFRDxNtLNgxGM=; b=kGybGFCn0r9hnHDD+XlTAUGy+rtoNiikm2mL5bvqn+0h4z1q4yzz3Ns7ajlCHdrdD7 g0dI/EU/xfSodqQS67E+p+MEscy/R40NUQ6JBx/HWVFnHy3NznELkLnyoGTOVzfThNfN XtUV1XgOogkXv8XVLpMF05vdye3kpjHgo0mQCxh2ik+qs/Fir1/o/Fs0PIyRUbXul2GF WpSSXSA+kfvF8yT/K0X1P7doQGU6kb6+o2xNezqlefMyIE6/nIfSKGiucjP/yRhloqs7 J42GoAmP2NFJbOIkChC90ot3MlQapEROsuhYzYMwxk2VNmJGTAj+NBikTfRICRZDlX2v B4gQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date; bh=PmZ+jeHheCTitnwYvU2IM5Mh5fQ9dWuFRDxNtLNgxGM=; b=ocrOqE8wEg/jUvcYbV9SZBegRhmlfI/X9v46Wbj9tn5ikhaKOg/E/DS7zi8YzLOW0g 0nhvoOmj7AnydElnMHHVe3w0IXSZUt86LzQpDTRPime2tvSAncYdHyvYyptPU1xaR1+S cEPkrRCGAuIc+2ZoaeDNkln8XqHvuvFnaVuLGtGBcPfPRVcafQt1Ls1HMgnawJ9sIVSJ 7VdPLH6O3WvT10ILrMEAdmRXTV8/knF+KNwvKIlpaC/LboRxoB3CJNb4sXgAc+fMhAiO kFHsl6wjjKT5Ib0c6ACa/74NWpsCvYBzcu3BKrRyfIxM7VTufmgtlioxm5WJbIG0ZLEj ynxQ== X-Gm-Message-State: ACgBeo0u8U7o3uUOe/SgucmIqN+7M+7YmpYuFucW4jkB/abwLcKW7iJA yD7j5x6aR1UwFUz19Kz25852yf+bWY/AcxpH2jwNyQ== X-Received: by 2002:a17:90b:3142:b0:1fa:f533:3191 with SMTP id ip2-20020a17090b314200b001faf5333191mr2851172pjb.193.1662095364249; Thu, 01 Sep 2022 22:09:24 -0700 (PDT) MIME-Version: 1.0 References: <20220830081736.119281-1-aneesh.kumar@linux.ibm.com> <87tu5rzigc.fsf@yhuang6-desk2.ccr.corp.intel.com> <87pmgezkhp.fsf@yhuang6-desk2.ccr.corp.intel.com> In-Reply-To: <87pmgezkhp.fsf@yhuang6-desk2.ccr.corp.intel.com> From: Wei Xu Date: Thu, 1 Sep 2022 22:09:13 -0700 Message-ID: Subject: Re: [PATCH v3 updated] mm/demotion: Expose memory tier details via sysfs To: "Huang, Ying" Cc: Aneesh Kumar K V , Johannes Weiner , Linux MM , Andrew Morton , Yang Shi , Davidlohr Bueso , Tim C Chen , Michal Hocko , Linux Kernel Mailing List , Hesham Almatary , Dave Hansen , Jonathan Cameron , Alistair Popple , Dan Williams , jvgediya.oss@gmail.com, Bharata B Rao , Greg Thelen Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Spam-Status: No, score=-17.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL,USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Sep 1, 2022 at 5:33 PM Huang, Ying wrote: > > Aneesh Kumar K V writes: > > > On 9/1/22 12:31 PM, Huang, Ying wrote: > >> "Aneesh Kumar K.V" writes: > >> > >>> This patch adds /sys/devices/virtual/memory_tiering/ where all memory= tier > >>> related details can be found. All allocated memory tiers will be list= ed > >>> there as /sys/devices/virtual/memory_tiering/memory_tierN/ > >>> > >>> The nodes which are part of a specific memory tier can be listed via > >>> /sys/devices/virtual/memory_tiering/memory_tierN/nodes > >> > >> I think "memory_tier" is a better subsystem/bus name than > >> memory_tiering. Because we have a set of memory_tierN devices inside. > >> "memory_tier" sounds more natural. I know this is subjective, just my > >> preference. > >> > >>> > >>> A directory hierarchy looks like > >>> :/sys/devices/virtual/memory_tiering$ tree memory_tier4/ > >>> memory_tier4/ > >>> =E2=94=9C=E2=94=80=E2=94=80 nodes > >>> =E2=94=9C=E2=94=80=E2=94=80 subsystem -> ../../../../bus/memory_tieri= ng > >>> =E2=94=94=E2=94=80=E2=94=80 uevent > >>> > >>> All toptier nodes are listed via > >>> /sys/devices/virtual/memory_tiering/toptier_nodes > >>> > >>> :/sys/devices/virtual/memory_tiering$ cat toptier_nodes > >>> 0,2 > >>> :/sys/devices/virtual/memory_tiering$ cat memory_tier4/nodes > >>> 0,2 > >> > >> I don't think that it is a good idea to show toptier information in us= er > >> space interface. Because it is just a in kernel implementation > >> details. Now, we only promote pages from !toptier to toptier. But > >> there may be multiple memory tiers in toptier and !toptier, we may > >> change the implementation in the future. For example, we may promote > >> pages from DRAM to HBM in the future. > >> > > > > > > In the case you describe above and others, we will always have a list o= f > > NUMA nodes from which memory promotion is not done. > > /sys/devices/virtual/memory_tiering/toptier_nodes shows that list. > > I don't think we will need that interface if we don't restrict promotion > in the future. For example, he can just check the memory tier with > smallest number. > > TBH, I don't know why do we need that interface. What is it for? We > don't want to expose unnecessary information to restrict our in kernel > implementation in the future. > > So, please remove that interface at least before we discussing it > thoroughly. I have asked for this interface to allow the userspace to query a list of top-tier nodes as the targets of userspace-driven promotions. The idea is that demotion can gradually go down tier by tier, but we promote hot pages directly to the top-tier and bypass the immediate tiers. Certainly, this can be viewed as a policy choice. Given that now we have a clearly defined memory tier hierarchy in sysfs and the toptier_nodes content can be constructed from this memory tier hierarchy and other information from the node sysfs interfaces, I am fine if we want to remove toptier_nodes and keep the current memory tier sysfs interfaces to the minimal. Wei Xu > >> Do we need a way to show the default memory tier in sysfs? That is, t= he > >> memory tier that the DRAM nodes belong to. > >> > > > > I will hold adding that until we have support for modifying memory tier= details from > > userspace. That is when userspace would want to know about the default = memory tier. > > > > For now, the user interface is a simpler hierarchy of memory tiers, it'= s associated > > nodes and the list of nodes from which promotion is not done. > > OK. > > Best Regards, > Huang, Ying