Received: by 2002:ab2:6991:0:b0:1f7:f6c3:9cb1 with SMTP id v17csp1167507lqo; Thu, 9 May 2024 06:42:48 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCWk6OeT5kKRWD4IsH8mcrXC84SbkrJJT3jjwZj5tGOSwz5jbVWlHBm9J5zRq+vmaw/jwHq6+Pqz/OK90RedA8zBmN8h8M/U+vW32cRGwQ== X-Google-Smtp-Source: AGHT+IHpgjINzVEkSVEWa4Gy4rvn84Z1yUVzXHbHTMIG4dU4cjWw6iwudQuy0Q7qkNTzQafxKlYi X-Received: by 2002:ac8:7f46:0:b0:43a:ee41:f581 with SMTP id d75a77b69052e-43dbf74ea68mr63495781cf.57.1715262167918; Thu, 09 May 2024 06:42:47 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1715262167; cv=pass; d=google.com; s=arc-20160816; b=yJjWw7AuSd2h4WIgY1hu8q2ck5yr7ZqhqeiolNVlm9IU0anYIByB1rtqfOL8125taS 5M/CyB9938wjySlVu051C8O+zRoYt4tJdrzjVSQ7JnEUW5ntZsFeh52TE9iEROY2pHXM deNK0DhlLNqgo32WsV3vVxKGnpaVhMzX23tUrlosloR14zX3AEEANacxde1waqEg3nvu +YdTa2JF43UjUxvmvyJPTXu6fLcJmfn2KCmFWORrD2x/mzZIALDxq5V+w5MitS9dFvxi q7NtEFXg6g5bfeKm4vxlt+p1dsfd3EgiacQbfmEuAXTipr66YAnRF4GxLhcbFKPXeHLt OoFQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:mime-version:list-unsubscribe:list-subscribe:list-id :precedence:user-agent:content-transfer-encoding:references :in-reply-to:date:cc:to:from:subject:message-id; bh=3SzFTWiRHb3CLDtYFfiRADNKHA8TW0PS4sStlljhHKI=; fh=lGRxVZMXzzIsXEGo3hYa7d2WgT5kUuxrsTp4UjPiqd8=; b=DEp86mRiNAl1vHOChtiwJPxqFYi1g5fn23nd2AsZWbGmZgAw5TPzjrreQ5JahY9Ao4 PhlfggOy1qUcbo2yuoJ0umXzvtTrJMGxGa6MwCR2iXz5S20hs5vru+3b38KsHPGBJSK/ GGa18s5alPR9KOvge/r45lG7aL2wVPVzKVDDIoKXeilH7te2M7yuBTni/A3Hlg52vUw1 HHuAjeB2YmHdwH6PPKJGZOMLL7HEIdUVWQXoqHtLanDJK6iot4e7DHxnQ7Q1M6VfIElL n4J2fajZZIIkNlbq4eKu9ygT/QaGstqZZBtJbxXKKUmR2RvbO+PIIIjMxdzoNT0q70Ad q+hg==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; arc=pass (i=1 spf=pass spfdomain=shelob.surriel.com); spf=pass (google.com: domain of linux-kernel+bounces-174547-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-174547-linux.lists.archive=gmail.com@vger.kernel.org" Return-Path: Received: from ny.mirrors.kernel.org (ny.mirrors.kernel.org. [2604:1380:45d1:ec00::1]) by mx.google.com with ESMTPS id d75a77b69052e-43df54a1258si13214661cf.50.2024.05.09.06.42.47 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 09 May 2024 06:42:47 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-174547-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) client-ip=2604:1380:45d1:ec00::1; Authentication-Results: mx.google.com; arc=pass (i=1 spf=pass spfdomain=shelob.surriel.com); spf=pass (google.com: domain of linux-kernel+bounces-174547-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-174547-linux.lists.archive=gmail.com@vger.kernel.org" Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ny.mirrors.kernel.org (Postfix) with ESMTPS id A21561C21256 for ; Thu, 9 May 2024 13:42:47 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id AF3F715B112; Thu, 9 May 2024 13:42:40 +0000 (UTC) Received: from shelob.surriel.com (shelob.surriel.com [96.67.55.147]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 26D49147C85; Thu, 9 May 2024 13:42:28 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=96.67.55.147 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1715262160; cv=none; b=GBe/lG4SjVYIyYAUclwCOi6bxfBUs/mE6RoVozBYpgNtiLjyTgu+0GDMEdB441ogee7XhyrWUHW4SIDq6pGoHp0gVypDGpCEiljBb9xB6aCeUlsuCo44HM57gWrLLKQSk7YA64d5BVv3dGJgAAuNavp7CL3yVwhMM7TsFo/tTyQ= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1715262160; c=relaxed/simple; bh=3SzFTWiRHb3CLDtYFfiRADNKHA8TW0PS4sStlljhHKI=; h=Message-ID:Subject:From:To:Cc:Date:In-Reply-To:References: Content-Type:MIME-Version; b=qjKv6Ja954of/tHKqV3YaAaDgAOI48nj37va17wvwolx/sIiNajSiU57uKXoDVUkm8/bj/Peca7i+3N5OijgUOv6s4/lABmmUgXS1flfA/llg4e+HujZn7l/Pzn743OCMJ7dnTZ07iUOB+1kAu8agLeHjKa+d9FGLrvwvVIIaWs= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=surriel.com; spf=pass smtp.mailfrom=shelob.surriel.com; arc=none smtp.client-ip=96.67.55.147 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=surriel.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=shelob.surriel.com Received: from [2601:18c:9101:a8b6:6e0b:84ff:fee2:98bb] (helo=imladris.surriel.com) by shelob.surriel.com with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.97.1) (envelope-from ) id 1s541X-0000000063q-16NF; Thu, 09 May 2024 09:41:27 -0400 Message-ID: Subject: Re: [PATCH] fs/proc: fix softlockup in __read_vmcore From: Rik van Riel To: Baoquan He Cc: akpm@linux-foundation.org, Vivek Goyal , Dave Young , kexec@lists.infradead.org, linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org, kernel-team@meta.com Date: Thu, 09 May 2024 09:41:27 -0400 In-Reply-To: References: <20240507091858.36ff767f@imladris.surriel.com> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable User-Agent: Evolution 3.46.4 (3.46.4-1.fc37) Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Sender: riel@surriel.com On Thu, 2024-05-09 at 11:52 +0800, Baoquan He wrote: > Hi, >=20 > On 05/07/24 at 09:18am, Rik van Riel wrote: > > While taking a kernel core dump with makedumpfile on a larger > > system, > > softlockup messages often appear. > >=20 > > While softlockup warnings can be harmless, they can also interfere > > with things like RCU freeing memory, which can be problematic when > > the kdump kexec image is configured with as little memory as > > possible. > >=20 > > Avoid the softlockup, and give things like work items and RCU a > > chance to do their thing during __read_vmcore by adding a > > cond_resched. >=20 > Thanks for fixing this. >=20 > By the way, is it easy to reproduce? And should we add some trace of > the > softlockup into log so that people can search for it and confirm when > encountering it? It is pretty easy to reproduce, but it does not happen all the time. With millions of systems, even rare errors are common :) However, we have been running with this fix for long enough (we deployed it in order to test it) that I don't think we have the=C2=A0 warning stored any more. Those logs were rotated out long ago. kind regards, Rik --=20 All Rights Reversed.