Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp6432184iob; Tue, 10 May 2022 19:21:17 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwtVi/uHcZOB9K0HHRo+WcruuCs07IKKcIO72nuA5nwMWe7h658cr/8NtPWs+vJgL5tXD3T X-Received: by 2002:a17:907:3c81:b0:6e6:cf3e:6e14 with SMTP id gl1-20020a1709073c8100b006e6cf3e6e14mr22402710ejc.181.1652235677697; Tue, 10 May 2022 19:21:17 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1652235677; cv=none; d=google.com; s=arc-20160816; b=ELnug+SKFPfB1UXzsJXRIDjjvV9g0AhzIQacSQJkb8k62GRiWqDRfugl68X/Acu0lS YZcq3qrvKi0irtwbUo+SxSIbzq0PtF5h6NBJnQlsxZkrUteCpKtzYrGidzobIK/1ZC6N SIcHeIMw8Q8rzoV/YYNlFtjjGCZN+nWgPUXYgbz2JQrGAGa7mDS8OppxdoN+LOVuu4X+ vJ037j7thaipEBLTQ3lYCyWrh7ICYiN8VmJa22fV083JfpXJ6TkVVmSoCQx1wcw4aPTf q3Iu8/CvqibQOx1seQz178KOQtvJy2tR4PKL3kwLqYSbRYQXP7kGRlf5htcHjm0FPJU7 Zvkw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=oLOHdpuCYzAccJ6ajaHotiOG99qDB43cAS4y17f1Oec=; b=o24/4m0gdp/LOPOMYoFudnhH1bqZnYzWoCKjXhhtoUlbpWOKYU+V05pFgLtGfIZxPz 9Ot6T6Qhlg7smMro+mL441dfp/FHytClgzZJGCO/A4ALGGRuE+8ASECiuBgrepyZe0Gs DcrHnaZxJUNk8H7g+fXroGfhLnNd4nnCPUdrGKWcIGyYG9yexkYQEwvNMfcRE9sFGcBO UGJXon1EKsH2gRDW+1Scv4XDsuLbwXwVTYpYbpCpTuWEUq+ySUdU9XCZq2ABeonaxQyb rfEqVYrC+lo/nV1yC2OBOXzeEpQVMm4gwTb/F5mUKrN9tXFSt2obmTncF6xu5svKECNJ ZMww== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=mDJpnWdL; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=suse.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id b17-20020a1709063f9100b006f3be42074fsi1313148ejj.112.2022.05.10.19.20.51; Tue, 10 May 2022 19:21:17 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=mDJpnWdL; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=suse.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232443AbiEJUxv (ORCPT + 99 others); Tue, 10 May 2022 16:53:51 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55744 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229532AbiEJUxt (ORCPT ); Tue, 10 May 2022 16:53:49 -0400 Received: from smtp-out1.suse.de (smtp-out1.suse.de [195.135.220.28]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B13D721E326; Tue, 10 May 2022 13:53:48 -0700 (PDT) Received: from relay2.suse.de (relay2.suse.de [149.44.160.134]) by smtp-out1.suse.de (Postfix) with ESMTP id 4F7C221BC9; Tue, 10 May 2022 20:53:47 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=susede1; t=1652216027; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=oLOHdpuCYzAccJ6ajaHotiOG99qDB43cAS4y17f1Oec=; b=mDJpnWdLo/vamrPVU9LuFUdyYMMMU2py1rUwVE/l5ixKrRbwJ7ncUbVlbywXkEfQ/VKoGI i8yYR6YctJu+b+ooPUYbEvYemkKMlTqe45XeHTn/s5TnCUrMCNNDbvinQGQzFQhem7fjKV I3dTbizO+oqVAStb3iNU29aTuXgFDko= Received: from suse.cz (unknown [10.100.201.86]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by relay2.suse.de (Postfix) with ESMTPS id C98972C141; Tue, 10 May 2022 20:53:46 +0000 (UTC) Date: Tue, 10 May 2022 22:53:43 +0200 From: Michal Hocko To: Suren Baghdasaryan Cc: Andrew Morton , David Rientjes , Matthew Wilcox , Johannes Weiner , Roman Gushchin , Minchan Kim , "Kirill A. Shutemov" , Andrea Arcangeli , Christian Brauner , Christoph Hellwig , Oleg Nesterov , David Hildenbrand , Jann Horn , Shakeel Butt , Peter Xu , John Hubbard , shuah@kernel.org, LKML , linux-mm , linux-kselftest@vger.kernel.org, kernel-team Subject: Re: [PATCH 2/3] mm: drop oom code from exit_mmap Message-ID: References: <20220510030014.3842475-1-surenb@google.com> <20220510030014.3842475-2-surenb@google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-4.4 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 10-05-22 09:31:50, Suren Baghdasaryan wrote: > On Tue, May 10, 2022 at 6:06 AM Michal Hocko wrote: > > > > On Mon 09-05-22 20:00:13, Suren Baghdasaryan wrote: > > > With the oom-killer being able to operate on locked pages, exit_mmap > > > does not need to ensure that oom_reap_task_mm is done before it can > > > proceed. Instead it can rely on mmap_lock write lock to prevent > > > oom-killer from operating on the vma tree while it's freeing page > > > tables. exit_mmap can hold mmap_lock read lock when unmapping vmas > > > and then take mmap_lock write lock before freeing page tables. > > > > The changelog is rather light on nasty details which might be good but > > for the sake of our future us let's be more verbose so that we do not > > have to reinvent the prior history each time we are looking into this > > code. I would go with something like this instead: > > " > > The primary reason to invoke the oom reaper from the exit_mmap path used > > to be a prevention of an excessive oom killing if the oom victim exit > > races with the oom reaper (see 212925802454 ("mm: oom: let oom_reap_task > > and exit_mmap run concurrently") for more details. The invocation has > > moved around since then because of the interaction with the munlock > > logic but the underlying reason has remained the same (see 27ae357fa82b > > ("mm, oom: fix concurrent munlock and oom reaper unmap, v3"). > > > > Munlock code is no longer a problem since a213e5cf71cb ("mm/munlock: > > delete munlock_vma_pages_all(), allow oomreap") and there shouldn't be > > any blocking operation before the memory is unmapped by exit_mmap so > > the oom reaper invocation can be dropped. The unmapping part can be done > > with the non-exclusive mmap_sem and the exclusive one is only required > > when page tables are freed. > > > > Remove the oom_reaper from exit_mmap which will make the code easier to > > read. This is really unlikely to make any observable difference although > > some microbenchmarks could benefit from one less branch that needs to be > > evaluated even though it almost never is true. > > " > > Looks great! Thanks for collecting all the history. Will update the description. Please make sure you double check the story. This is mostly my recollection and brief reading through the said commits. I might misremember here and there. -- Michal Hocko SUSE Labs