Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp588591iob; Thu, 28 Apr 2022 08:18:39 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxcLh7gI3ZcX7rm9XfVEFImSJKIu0BHdg6j9Ol5uAQ6oCiv2mNkn19FvRwdqhP6iZ1uo6qp X-Received: by 2002:a2e:3207:0:b0:24f:11aa:4017 with SMTP id y7-20020a2e3207000000b0024f11aa4017mr13910038ljy.380.1651159119387; Thu, 28 Apr 2022 08:18:39 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1651159119; cv=none; d=google.com; s=arc-20160816; b=DN38ydrcQ98KbjxFFTdVI1rlQjKad3wKUM5y5wgcue9s4T3ljG7PXOzyKpFTFtOgBn FwOJIQcv0W2HHxyPJXMV1ZP1t34nu1ziZ+tZNITutpcq0XSpRUup9gslJyt9TwFw6jwZ iJr/5HqJK6hT4T8Ft9pX4tvndLY4qCQgqahB/dU1QVvyW6TsjsioGrQNRVW7wn6TJgWN Wf0OO+7/jjL3BxLldd930mFiH+sPLHEGAGZesFQOMEs4NC8MAXcGmDB7n9vjGB279/Pl yHVMIefHmUxibprC5J/3aqfl3JqQWateWCvPpZohcovzrMxyBZsHw+HDjqaq4GEoCHnS QXkA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=eSrTpAY01c2sAeG8S3OyerXPiSesjlWTEqP60Y4j5NQ=; b=nNbWN6QdHiD+hclPw7AA925ckKvyoE9nfAh9w/MahgCD9K3JptLgX1BySDJO5/9oZK O4bRXP1TkkE0iTnUtGt/G25KZfLVjllR7Sal0Bfwfq6MUb05VPqwfSkFwoV+papm+eU+ cyyUZlbQoqRtJ9mHsZpkYIIuiNN8tmb/6QcNBQPThWNCvJxVFCEn41BOCsP7Hry29MYw kNenusnU4mmXctTzA9sohWiE26boo8tcFDM82Du+0WgPGLrCosqYnmjzuQ0G0EKy7UX8 ch4yk2IfC1IoB8vyDWNQZk0zYLnLbK7NviHERkzst6eKISQpa1XQCahj0bK9M7mR81CG jcUw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=S2VSAmd8; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id a3-20020a2eb163000000b0024f0f692a21si4698629ljm.511.2022.04.28.08.18.09; Thu, 28 Apr 2022 08:18:39 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=S2VSAmd8; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1347711AbiD1Nxg (ORCPT + 99 others); Thu, 28 Apr 2022 09:53:36 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:51480 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1347710AbiD1Nxe (ORCPT ); Thu, 28 Apr 2022 09:53:34 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id D2389DFFF for ; Thu, 28 Apr 2022 06:50:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1651153817; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=eSrTpAY01c2sAeG8S3OyerXPiSesjlWTEqP60Y4j5NQ=; b=S2VSAmd8IDBW4FF3CW19t0VYpS5j8B0LOdcDBj3bg0Sfw9QvIuwPglZJsOAuEPpp3JQUZy Asmhza+x+s4SFvQz+TZQbVXJYgplumxA679fSAMqTE5vrUbB6Ai+AJmYv9W7cRxw7asAnk qFPTQ2COBrjNdgAuAuCm3yVIUjH/T5E= Received: from mail-il1-f198.google.com (mail-il1-f198.google.com [209.85.166.198]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-210-POftygYiPf2kk2rUqt_T9Q-1; Thu, 28 Apr 2022 09:50:15 -0400 X-MC-Unique: POftygYiPf2kk2rUqt_T9Q-1 Received: by mail-il1-f198.google.com with SMTP id x1-20020a056e020f0100b002c98fce9c13so1779594ilj.3 for ; Thu, 28 Apr 2022 06:50:15 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=eSrTpAY01c2sAeG8S3OyerXPiSesjlWTEqP60Y4j5NQ=; b=Q4bDhboru0by1/tTRtmCN7gtTi8+byQBnM1wRPqBHuHaK+FVibrPCj12hp8ko98GHE SRxXGB1Q2oJrfOGoFtY+SX3bPMVyi+EqOM5L0fh0AKJJ1KJrKGr8iYlrkEf19v++IUNe lI/XiiTCQdlmrUnpjFUhBxnE+QoNsvRPos0OPpsO0VAqo2TalKic2BTobQAIcbbR+8mk OGCBiXe1HZN8vUuc+tCm+ZCJ7lFHFTn8eTA2/JCLBkbbz86t8vjiGmAF2Qtsd3cN9OJy i1NGBwc+/g+HvaTiucytEiP9qv9kvgHrdlO70nLKjQO9A4vPu7l8q2rvo1L7LeJrTDp2 YegA== X-Gm-Message-State: AOAM530g38CmDz4pl7X7CQzL6+7z8pYbzcZIGpKFfXXVHnH1vGVoYuAv gxqR4JALD3bms1/4PBWaoFIWhBAggh2klccC6a0C1ny/I1izddzT7LfdXff/bMJQPVkJIT7qmPA AOqrGJq8LMHrfH5xebRGx2DFw X-Received: by 2002:a05:6638:dc9:b0:323:68d6:e15a with SMTP id m9-20020a0566380dc900b0032368d6e15amr13889548jaj.238.1651153814152; Thu, 28 Apr 2022 06:50:14 -0700 (PDT) X-Received: by 2002:a05:6638:dc9:b0:323:68d6:e15a with SMTP id m9-20020a0566380dc900b0032368d6e15amr13889543jaj.238.1651153813906; Thu, 28 Apr 2022 06:50:13 -0700 (PDT) Received: from xz-m1.local (cpec09435e3e0ee-cmc09435e3e0ec.cpe.net.cable.rogers.com. [99.241.198.116]) by smtp.gmail.com with ESMTPSA id n4-20020a6b4104000000b006549fdd0780sm12944272ioa.51.2022.04.28.06.50.12 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 28 Apr 2022 06:50:13 -0700 (PDT) Date: Thu, 28 Apr 2022 09:50:12 -0400 From: Peter Xu To: Bibo Mao Cc: Andrew Morton , linux-mm@kvack.org, linux-kernel@vger.kernel.org, David Hildenbrand , Yang Shi Subject: Re: [PATCH v3] mm/khugepaged: sched to numa node when collapse huge page Message-ID: References: <20220317065024.2635069-1-maobibo@loongson.cn> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20220317065024.2635069-1-maobibo@loongson.cn> X-Spam-Status: No, score=-3.4 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_LOW, SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, Bibo, On Thu, Mar 17, 2022 at 02:50:24AM -0400, Bibo Mao wrote: > collapse huge page will copy huge page from general small pages, > dest node is calculated from most one of source pages, however > THP daemon is not scheduled on dest node. The performance may be > poor since huge page copying across nodes, also cache is not used > for target node. With this patch, khugepaged daemon switches to > the same numa node with huge page. It saves copying time and makes > use of local cache better. > > With this patch, specint 2006 base performance is improved with 6% > on Loongson 3C5000L platform with 32 cores and 8 numa nodes. Totally not familiar with specint, so a pure question is whether it'll make a real difference in real-world workloads? As I assume in real world the memory affinity to the processors should change relatively slow on tuned systems, so even if khugepaged copied a bit slower then it'll not affect much on the real workload after the movement completes? The other question is if it makes sense, whether it's applicable to file thps too (collapse_file)? Thanks, > > Signed-off-by: Bibo Mao > --- > changelog: > V2: remove node record for thp daemon > V3: remove unlikely statement > --- > mm/khugepaged.c | 8 ++++++++ > 1 file changed, 8 insertions(+) > > diff --git a/mm/khugepaged.c b/mm/khugepaged.c > index 131492fd1148..b3cf0885f5a2 100644 > --- a/mm/khugepaged.c > +++ b/mm/khugepaged.c > @@ -1066,6 +1066,7 @@ static void collapse_huge_page(struct mm_struct *mm, > struct vm_area_struct *vma; > struct mmu_notifier_range range; > gfp_t gfp; > + const struct cpumask *cpumask; > > VM_BUG_ON(address & ~HPAGE_PMD_MASK); > > @@ -1079,6 +1080,13 @@ static void collapse_huge_page(struct mm_struct *mm, > * that. We will recheck the vma after taking it again in write mode. > */ > mmap_read_unlock(mm); > + > + /* sched to specified node before huage page memory copy */ > + if (task_node(current) != node) { > + cpumask = cpumask_of_node(node); > + if (!cpumask_empty(cpumask)) > + set_cpus_allowed_ptr(current, cpumask); > + } > new_page = khugepaged_alloc_page(hpage, gfp, node); > if (!new_page) { > result = SCAN_ALLOC_HUGE_PAGE_FAIL; -- Peter Xu