From: Wei Yang
To: hannes@cmpxchg.org, mhocko@kernel.org, vdavydov.dev@gmail.com, akpm@linux-foundation.org
Cc: cgroups@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, kirill.shutemov@linux.intel.com, yang.shi@linux.alibaba.com, alexander.duyck@gmail.com, rientjes@google.com, Wei Yang
Subject: [Patch v2] mm: thp: grab the lock before manipulation defer list
Date: Thu, 9 Jan 2020 22:30:54 +0800
Message-Id: <20200109143054.13203-1-richardw.yang@linux.intel.com>
X-Mailer: git-send-email 2.17.1
X-Mailing-List: linux-kernel@vger.kernel.org

As in all the other places, grab the lock before manipulating the
deferred split list. The current implementation may hit a race
condition.

For example, the potential race would be:

    CPU1                      CPU2
    mem_cgroup_move_account   split_huge_page_to_list
      !list_empty
                                lock
                                !list_empty
                                list_del
                                unlock
      lock
      # !list_empty might not hold anymore
      list_del_init
      unlock

When this sequence happens, the list_del_init() in
mem_cgroup_move_account() would crash if CONFIG_DEBUG_LIST is enabled,
since the page has already been removed by list_del() in
split_huge_page_to_list().

Fixes: 87eaceb3faa5 ("mm: thp: make deferred split shrinker memcg aware")
Signed-off-by: Wei Yang
Acked-by: David Rientjes

---
v2:
  * move the check on compound outside, suggested by Alexander
  * add an example of the race condition, suggested by Michal
---
 mm/memcontrol.c | 18 +++++++++++-------
 1 file changed, 11 insertions(+), 7 deletions(-)

diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index bc01423277c5..1492eefe4f3c 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -5368,10 +5368,12 @@ static int mem_cgroup_move_account(struct page *page,
 	}
 
 #ifdef CONFIG_TRANSPARENT_HUGEPAGE
-	if (compound && !list_empty(page_deferred_list(page))) {
+	if (compound) {
 		spin_lock(&from->deferred_split_queue.split_queue_lock);
-		list_del_init(page_deferred_list(page));
-		from->deferred_split_queue.split_queue_len--;
+		if (!list_empty(page_deferred_list(page))) {
+			list_del_init(page_deferred_list(page));
+			from->deferred_split_queue.split_queue_len--;
+		}
 		spin_unlock(&from->deferred_split_queue.split_queue_lock);
 	}
 #endif
@@ -5385,11 +5387,13 @@ static int mem_cgroup_move_account(struct page *page,
 	page->mem_cgroup = to;
 
 #ifdef CONFIG_TRANSPARENT_HUGEPAGE
-	if (compound && list_empty(page_deferred_list(page))) {
+	if (compound) {
 		spin_lock(&to->deferred_split_queue.split_queue_lock);
-		list_add_tail(page_deferred_list(page),
-			      &to->deferred_split_queue.split_queue);
-		to->deferred_split_queue.split_queue_len++;
+		if (list_empty(page_deferred_list(page))) {
+			list_add_tail(page_deferred_list(page),
+				      &to->deferred_split_queue.split_queue);
+			to->deferred_split_queue.split_queue_len++;
+		}
 		spin_unlock(&to->deferred_split_queue.split_queue_lock);
 	}
 #endif
-- 
2.17.1
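
For readers who want to experiment outside the kernel, the
check-under-lock pattern applied by the patch can be sketched in user
space. This is only an illustration under stated assumptions, not the
kernel code: struct node, struct queue and the queue_*() helpers are
hypothetical names, the list is a hand-rolled stand-in for struct
list_head, and a pthread mutex stands in for split_queue_lock.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for the kernel's struct list_head. */
struct node {
	struct node *prev, *next;
};

/* Hypothetical stand-in for struct deferred_split. */
struct queue {
	pthread_mutex_t lock;	/* plays the role of split_queue_lock */
	struct node head;	/* plays the role of split_queue */
	size_t len;		/* plays the role of split_queue_len */
};

/* A node that points at itself is not on any list. */
static void node_init(struct node *n)
{
	n->prev = n->next = n;
}

static bool node_unlinked(const struct node *n)
{
	return n->next == n;
}

static void queue_init(struct queue *q)
{
	pthread_mutex_init(&q->lock, NULL);
	node_init(&q->head);
	q->len = 0;
}

/*
 * Remove n from q if it is still queued. The emptiness test is done
 * only after taking the lock, so a concurrent removal cannot slip in
 * between the test and the unlink. Returns true if n was removed.
 */
static bool queue_del_if_linked(struct queue *q, struct node *n)
{
	bool removed = false;

	pthread_mutex_lock(&q->lock);
	if (!node_unlinked(n)) {
		assert(q->len > 0);
		n->prev->next = n->next;
		n->next->prev = n->prev;
		node_init(n);	/* leave n self-linked, like list_del_init() */
		q->len--;
		removed = true;
	}
	pthread_mutex_unlock(&q->lock);
	return removed;
}

/*
 * Append n to the tail of q unless it is already on a list. Again the
 * test happens under the lock. Returns true if n was added.
 */
static bool queue_add_if_unlinked(struct queue *q, struct node *n)
{
	bool added = false;

	pthread_mutex_lock(&q->lock);
	if (node_unlinked(n)) {
		n->prev = q->head.prev;
		n->next = &q->head;
		q->head.prev->next = n;
		q->head.prev = n;
		q->len++;
		added = true;
	}
	pthread_mutex_unlock(&q->lock);
	return added;
}
```

Calling queue_del_if_linked() twice on the same node is harmless in
this sketch: the second call sees the self-linked node under the lock
and does nothing, which is the behaviour the patch restores for the
deferred split list.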