Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp1492056pxa; Thu, 13 Aug 2020 09:34:10 -0700 (PDT) X-Google-Smtp-Source: ABdhPJydtpDcARgryVRQ54JHyyD4EAr+Apv6TPkKAwdY8kvjK+riM0e7VdRJivOezu/LsDx1SgWS X-Received: by 2002:a17:906:496:: with SMTP id f22mr5524875eja.180.1597336449883; Thu, 13 Aug 2020 09:34:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1597336449; cv=none; d=google.com; s=arc-20160816; b=hUtt9p6Bklm1wPLTcik6zz9wir1Eii2/uEqtrJqDyFiacJzqenB72bLwLUr0gfafeY BI2jJNvOSgjpqZF8ippXX0rBrENV6N1/Q+6I6ivKJIyMG/QWWLRoWbLLDK9teoyytTcM 7b+M16voEW68mCq/Cqwi9DPn1o5IbmWkg6aS6o3fiiVvT4vaS1a6C3VdnUoBuPHFw6pG ObPtsx5Yai4ndwxa5RbrRYNqrwgqr4MovAgehkXwLCedAHJ673tNZ+y//hMcXnXTIszW 6NMiAthDnN0eRHVRNTgjRCUWoFjarI3exW4zqycDbZPdOVhYAUzKTx8B8I5V+DwgjqDf 1nzQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=AymXWB48xx8kFEo1cQg+lMU5MCnO7OqYKs9gZkiaeHw=; b=fjsfhxzcU42Ev6FIta8IFBIksEnVMG0+R7zInlXEM73ki/lAoTeQeUhezq7Ma1eBAH lYCfhWuQvVad6lz+8dhmi4vQiPcbCrtrLNFjBCyRlACE4zCSHSqbANrFeDQMSwIlxuZ5 BhP5SnQEFPa9C/pR/tDt4CRs+HTuoOiagXokaiqBssknoAX1vGbXg/MBAKT1g9DgBkYq vJTg7ktFNu9Sgb9chLfDe2Va73Y2Q4b/40KcyKbukxkK9DkP9Rgk78mOrv9x5fQhyZ7z MujXhFdvKOCdXOnPcO/rdMHYp95O+zzvFdQGWTb0Zx/zcud3qoxw+HKLVefHyKntgG/e xjgw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id rk3si3727539ejb.12.2020.08.13.09.33.46; Thu, 13 Aug 2020 09:34:09 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726533AbgHMQa6 (ORCPT + 99 others); Thu, 13 Aug 2020 12:30:58 -0400 Received: from mx2.suse.de ([195.135.220.15]:47484 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726142AbgHMQa4 (ORCPT ); Thu, 13 Aug 2020 12:30:56 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id 8B28CAD63; Thu, 13 Aug 2020 16:31:17 +0000 (UTC) Date: Thu, 13 Aug 2020 18:30:54 +0200 From: Michal Hocko To: Charan Teja Kalla Cc: akpm@linux-foundation.org, vbabka@suse.cz, david@redhat.com, rientjes@google.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org, vinmenon@codeaurora.org Subject: Re: [PATCH V2] mm, page_alloc: fix core hung in free_pcppages_bulk() Message-ID: <20200813163054.GR9477@dhcp22.suse.cz> References: <1597150703-19003-1-git-send-email-charante@codeaurora.org> <20200813114105.GI9477@dhcp22.suse.cz> <9ca76893-dfe8-9a46-f2ec-6b3c663e848e@codeaurora.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <9ca76893-dfe8-9a46-f2ec-6b3c663e848e@codeaurora.org> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu 13-08-20 21:51:29, Charan Teja Kalla wrote: > Thanks Michal for comments. > > On 8/13/2020 5:11 PM, Michal Hocko wrote: > > On Tue 11-08-20 18:28:23, Charan Teja Reddy wrote: > > [...] > >> diff --git a/mm/page_alloc.c b/mm/page_alloc.c > >> index e4896e6..839039f 100644 > >> --- a/mm/page_alloc.c > >> +++ b/mm/page_alloc.c > >> @@ -1304,6 +1304,11 @@ static void free_pcppages_bulk(struct zone *zone, int count, > >> struct page *page, *tmp; > >> LIST_HEAD(head); > >> > >> + /* > >> + * Ensure proper count is passed which otherwise would stuck in the > >> + * below while (list_empty(list)) loop. > >> + */ > >> + count = min(pcp->count, count); > >> while (count) { > >> struct list_head *list; > > > > > > How does this prevent the race actually? > > This doesn't prevent the race. This only fixes the core hung(as this is > called with spin_lock_irq()) caused by the race condition. This core > hung is because of incorrect count value is passed to the > free_pcppages_bulk() function. Let me ask differently. What does enforce that the count and lists do not get out of sync in the loop. Your changelog says that the fix is to use the proper value without any specifics. -- Michal Hocko SUSE Labs