Received: by 2002:a05:7412:e794:b0:fa:551:50a7 with SMTP id o20csp2393657rdd; Fri, 12 Jan 2024 08:08:40 -0800 (PST) X-Google-Smtp-Source: AGHT+IFfgi3O2EqLcuGalFG14nye/gcMKDfAaRY5nkndZhi1EMa5Gdbs1iqp7bwg98FRMYP8kapB X-Received: by 2002:a05:6214:d62:b0:681:245b:c594 with SMTP id 2-20020a0562140d6200b00681245bc594mr1097668qvs.18.1705075720026; Fri, 12 Jan 2024 08:08:40 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1705075720; cv=none; d=google.com; s=arc-20160816; b=jsRhsAyReiIbo6vGcxPrkq4lhypNAniVCg5Mf9InA0wXrdMyRYWVp6X75jGhjOhHbv 1B5bOO8dx3Gcaiz2vbW7UwMDvE9H1ZNCGzfV/qqv9xXq6ajz/aqohJ6uXOKLLZKkNnkh x57EEcXHpwX+Mfrv+rZ8edqoxHT3/QgE5Q5WanyqzuPbhwH9ObDogQSbcQAjzURv/uPO Fk9aXsY4GSDp4um5HOy1gD6V9dTy+leoyJrgRJThMLfZ+GkkqYMkPGbPRwZl8eauukfP /M9IZjuun3q1N7H5JKCM79qiGo/MhKDXP2WAYDHag6HVBgsDwqh1zCRKh5YQyIZmPPZy 1hsw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:date:message-id; bh=bEPAXXjH5KC2IoBAP+LyevfUT2wsF8VFR9Py4S3M9T4=; fh=5FwSu8NITXdaOqsBIynVvkCEMTuUDpkoNtI/Cl4IDas=; b=DdF7v8zSsQU27QhrgubVQwrJnJUJjPCRMkM15ePGdAQ1q4Vpmrdw6cupD+BK3KKcsP Zvq/3dyR4NP7Nd8BP4hWz1B5XWn+7qyPUmtw7ytTtnglUdgnrHX8Vc27SZoeVTU1n7kk OWFusMJ6eNa6w2Gpl5OI2B/MA4Trd0c0xiEEMJTOayBrZXRz7ReAe5etcjCBKWB6Fnh/ NO0RCxgfeVlKMa8pOtPzA32ehdlhl+cvCemLzon1ki7SSUR8Ita2AaHfgbQGkLm3ZbeM xAAf56gACR15k6XIEcX5+yNn+FEc6VFFGR+HFUhzHJKISYhWzsmVMFYU2kMBMGzNc9hl rzRQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel+bounces-24844-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-24844-linux.lists.archive=gmail.com@vger.kernel.org" Return-Path: Received: from ny.mirrors.kernel.org (ny.mirrors.kernel.org. [2604:1380:45d1:ec00::1]) by mx.google.com with ESMTPS id u7-20020a0c8dc7000000b0067f9f28533asi2987371qvb.505.2024.01.12.08.08.39 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 12 Jan 2024 08:08:40 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-24844-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) client-ip=2604:1380:45d1:ec00::1; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel+bounces-24844-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-24844-linux.lists.archive=gmail.com@vger.kernel.org" Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ny.mirrors.kernel.org (Postfix) with ESMTPS id C630A1C21A0B for ; Fri, 12 Jan 2024 16:08:39 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 950337319D; Fri, 12 Jan 2024 16:08:15 +0000 (UTC) Received: from proxmox-new.maurer-it.com (proxmox-new.maurer-it.com [94.136.29.106]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id C8BAA73165; Fri, 12 Jan 2024 16:08:11 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=proxmox.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=proxmox.com Received: from proxmox-new.maurer-it.com (localhost.localdomain [127.0.0.1]) by proxmox-new.maurer-it.com (Proxmox) with ESMTP id 46C69490CC; Fri, 12 Jan 2024 17:08:09 +0100 (CET) Message-ID: <533e4b73-105c-401d-b496-25d20eba2d76@proxmox.com> Date: Fri, 12 Jan 2024 17:08:08 +0100 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: Temporary KVM guest hangs connected to KSM and NUMA balancer Content-Language: en-US To: Sean Christopherson Cc: kvm@vger.kernel.org, Paolo Bonzini , Andrew Morton , linux-kernel@vger.kernel.org, linux-mm@kvack.org References: <832697b9-3652-422d-a019-8c0574a188ac@proxmox.com> From: Friedrich Weber In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit On 11/01/2024 17:00, Sean Christopherson wrote: > This is a known issue. It's mostly a KVM bug[1][2] (fix posted[3]), but I suspect > that a bug in the dynamic preemption model logic[4] is also contributing to the > behavior by causing KVM to yield on preempt models where it really shouldn't. Thanks a lot for the pointers and the proposed fixes! I still see the same temporary hangs with [3] applied on top of 6.7 (0dd3ee31). However, with [4] applied in addition, I have not seen any temporary hangs yet. As the v1 of [3] was reported to fix the reported bug [2] and looks very similar to the v2 I tried, I wonder whether I might be seeing a slightly different kind of hangs than the one reported in [2] -- also because the reproducer relies heavily on KSM and AFAICT, KSM was entirely disabled in [2]. I'll try to run a few more tests next week. FWIW, the kernel config relevant to preemption: CONFIG_PREEMPT_BUILD=y # CONFIG_PREEMPT_NONE is not set CONFIG_PREEMPT_VOLUNTARY=y # CONFIG_PREEMPT is not set CONFIG_PREEMPT_COUNT=y CONFIG_PREEMPTION=y CONFIG_PREEMPT_DYNAMIC=y CONFIG_PREEMPT_RCU=y CONFIG_HAVE_PREEMPT_DYNAMIC=y CONFIG_HAVE_PREEMPT_DYNAMIC_CALL=y CONFIG_PREEMPT_NOTIFIERS=y CONFIG_DRM_I915_PREEMPT_TIMEOUT=640 CONFIG_DRM_I915_PREEMPT_TIMEOUT_COMPUTE=7500 # CONFIG_DEBUG_PREEMPT is not set # CONFIG_PREEMPT_TRACER is not set # CONFIG_PREEMPTIRQ_DELAY_TEST is not set Thanks again! Friedrich > [1] https://lore.kernel.org/all/ZNnPF4W26ZbAyGto@yzhao56-desk.sh.intel.com > [2] https://lore.kernel.org/all/bug-218259-28872@https.bugzilla.kernel.org%2F > [3] https://lore.kernel.org/all/20240110012045.505046-1-seanjc@google.com > [4] https://lore.kernel.org/all/20240110214723.695930-1-seanjc@google.com