Received: by 10.223.185.111 with SMTP id b44csp227419wrg; Fri, 9 Mar 2018 04:06:36 -0800 (PST) X-Google-Smtp-Source: AG47ELueQhuhVjkK8mhcUam+LEjLkr9EnoPZsp0oTcFPY8RcRp3v0waElwg5Qc+jc+8oY4pOtioK X-Received: by 2002:a17:902:8c83:: with SMTP id t3-v6mr28041916plo.310.1520597196712; Fri, 09 Mar 2018 04:06:36 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1520597196; cv=none; d=google.com; s=arc-20160816; b=zuVj44b0Ck7E4K13zFbBcDhXu5V5INt5zvip4xT4lx54lAw0o5bSCFcLdNBeG7WW1d W1iCK9QvGohFFxXtE5GaEDsQrSe2zZVDiSd7ocC1XH5e5g5AOyMaJGyz0y3twitOZIyr 1CUg+xePkcDSku1D/5sbJkHE7W1yZKS+NeVHhpdKEmN+D7fv1vrYZD8peXIXDRXGtm1q Mol3llAR9lWujkA/XvBp9iZfgjRvT5eayifWkCtlkDWo1XWGmN4YgQ7RIM5YuErBMcJC ftTL4nwDX0vyCWZNe/ZOH9ODaasdGldKuxAd4RAzJy6LS36w+6jbqMiEdWIQRxyiuguO KdXg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dmarc-filter :dkim-signature:dkim-signature:arc-authentication-results; bh=p45UElXxEUTY55BZUneGCUrVHdIF16KZx2b0x5wAaao=; b=0Vp77rkXycMBYCfVhxCUyxkHceqd6dBTBDhL2GlDgB6AyWppomzPa6f11/n5l7Po9U ytQavmAMU8zcADHe5lQgxlMGBCsSrO8Z8yAQgbvbXqOe8iR9EpiWT9F+AXhIx2bMsCUa Knp3/4NYL5o3LFZ4c2NCzyGSaK/2JPmquvAPqikrLeiGUTJSu+s64L8BOiR5IsSl2SF5 x58UKFiCI9PKBTBf4KscwczSh+JyKNa8qLtd9Tw32ew/YRu8LvGRuFo8gOJ62dlJuTNY IfQlJQ2r4DciCxPIeo8AkqNtnullKvzyX6YscrS7OXXE7gN0nMDZv0YeFR25R1GrJl8D VRSA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@codeaurora.org header.s=default header.b=Nkq5twca; dkim=pass header.i=@codeaurora.org header.s=default header.b=Nkq5twca; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id z3si661732pgs.71.2018.03.09.04.06.11; Fri, 09 Mar 2018 04:06:36 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@codeaurora.org header.s=default header.b=Nkq5twca; dkim=pass header.i=@codeaurora.org header.s=default header.b=Nkq5twca; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751106AbeCIMEr (ORCPT + 99 others); Fri, 9 Mar 2018 07:04:47 -0500 Received: from smtp.codeaurora.org ([198.145.29.96]:53002 "EHLO smtp.codeaurora.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750937AbeCIMEp (ORCPT ); Fri, 9 Mar 2018 07:04:45 -0500 Received: by smtp.codeaurora.org (Postfix, from userid 1000) id C44B46038E; Fri, 9 Mar 2018 12:04:44 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=codeaurora.org; s=default; t=1520597084; bh=9NEZHjcU5IslCsiEthjlMHLqlhPkdbVJlR9OjbBPJFA=; h=Subject:To:Cc:References:From:Date:In-Reply-To:From; b=Nkq5twcah7D9FWwfBEx40HBFo4fasByki2OzOavbaVjxvqdE3iJAGiBTWf7AZMrYX MBL2LZ9ODPYm8qxAkyIE8CMoAmakJw7d9+Xp9GZKJ0ckD8UmzzEy8jVFeZ9zriLBfP gKZrI60S1De7B3YwVSyucRVrVhXKNv+UdrM1byxU= X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on pdx-caf-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.8 required=2.0 tests=ALL_TRUSTED,BAYES_00, DKIM_SIGNED,T_DKIM_INVALID autolearn=no autolearn_force=no version=3.4.0 Received: from [10.204.79.20] (blr-c-bdr-fw-01_globalnat_allzones-outside.qualcomm.com [103.229.19.19]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) (Authenticated sender: gkohli@smtp.codeaurora.org) by smtp.codeaurora.org (Postfix) with ESMTPSA id BBC566038E; Fri, 9 Mar 2018 12:04:41 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=codeaurora.org; s=default; t=1520597084; bh=9NEZHjcU5IslCsiEthjlMHLqlhPkdbVJlR9OjbBPJFA=; h=Subject:To:Cc:References:From:Date:In-Reply-To:From; b=Nkq5twcah7D9FWwfBEx40HBFo4fasByki2OzOavbaVjxvqdE3iJAGiBTWf7AZMrYX MBL2LZ9ODPYm8qxAkyIE8CMoAmakJw7d9+Xp9GZKJ0ckD8UmzzEy8jVFeZ9zriLBfP gKZrI60S1De7B3YwVSyucRVrVhXKNv+UdrM1byxU= DMARC-Filter: OpenDMARC Filter v1.3.2 smtp.codeaurora.org BBC566038E Authentication-Results: pdx-caf-mail.web.codeaurora.org; dmarc=none (p=none dis=none) header.from=codeaurora.org Authentication-Results: pdx-caf-mail.web.codeaurora.org; spf=none smtp.mailfrom=gkohli@codeaurora.org Subject: Re: [PATCH] mm: oom: Fix race condition between oom_badness and do_exit of task To: Tetsuo Handa , rientjes@google.com Cc: akpm@linux-foundation.org, mhocko@suse.com, kirill.shutemov@linux.intel.com, aarcange@redhat.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org, linux-arm-msm@vger.kernel.org References: <1520427454-22813-1-git-send-email-gkohli@codeaurora.org> <22ebd655-ece4-37e5-5a98-e9750cb20665@codeaurora.org> <14ba6c44-d444-bd0a-0bac-0c6851b19344@codeaurora.org> <201803091948.FBC21396.LHOMSFFOVFtQJO@I-love.SAKURA.ne.jp> From: "Kohli, Gaurav" Message-ID: Date: Fri, 9 Mar 2018 17:34:39 +0530 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0 MIME-Version: 1.0 In-Reply-To: <201803091948.FBC21396.LHOMSFFOVFtQJO@I-love.SAKURA.ne.jp> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 3/9/2018 4:18 PM, Tetsuo Handa wrote: > Kohli, Gaurav wrote: >>> t->alloc_lock is still held when leaving find_lock_task_mm(), which means >>> that t->mm != NULL. But nothing prevents t from setting t->mm = NULL at >>> exit_mm() from do_exit() and calling exit_creds() from __put_task_struct(t) >>> after task_unlock(t) is called. Seems difficult to trigger race window. Maybe >>> something has preempted because oom_badness() becomes outside of RCU grace >>> period upon leaving find_lock_task_mm() when called from proc_oom_score(). >> Hi Tetsuo, >> >> Yes it is not easy to reproduce seen twice till now and i agree with >> your analysis. But David has already fixing this in different way, >> So that also looks better to me: >> >> https://patchwork.kernel.org/patch/10265641/ >> > Yes, I'm aware of that patch. > >> But if need to keep that code, So we have to bump up the task >> reference that's only i can think of now. > I don't think so, for I think it is safe to call > has_capability_noaudit(p) with p->alloc_lock held. > > diff --git a/mm/oom_kill.c b/mm/oom_kill.c > index f2e7dfb..4efcfb8 100644 > --- a/mm/oom_kill.c > +++ b/mm/oom_kill.c > @@ -222,7 +222,6 @@ unsigned long oom_badness(struct task_struct *p, struct mem_cgroup *memcg, > */ > points = get_mm_rss(p->mm) + get_mm_counter(p->mm, MM_SWAPENTS) + > mm_pgtables_bytes(p->mm) / PAGE_SIZE; > - task_unlock(p); > > /* > * Root processes get 3% bonus, just like the __vm_enough_memory() > @@ -230,6 +229,7 @@ unsigned long oom_badness(struct task_struct *p, struct mem_cgroup *memcg, > */ > if (has_capability_noaudit(p, CAP_SYS_ADMIN)) > points -= (points * 3) / 100; > + task_unlock(p); Earlier i have thought the same to post this, but this may create problem if there are sleeping calls in has_capability_noaudit ? > > /* Normalize to oom_score_adj units */ > adj *= totalpages / 1000; > -- Qualcomm India Private Limited, on behalf of Qualcomm Innovation Center, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.