Received: by 2002:a05:6902:102b:0:0:0:0 with SMTP id x11csp1023199ybt; Tue, 7 Jul 2020 06:08:46 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzTv1J8y+LD7dffYhG8Fi8W9+GhxdXuekkxIkP9ocPuJjLVYEipgFtCDermxhbkgRwFI4Zq X-Received: by 2002:aa7:c504:: with SMTP id o4mr60304647edq.311.1594127326048; Tue, 07 Jul 2020 06:08:46 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1594127326; cv=none; d=google.com; s=arc-20160816; b=P0ZcnELr536ITwB86+AAMT8AKFBcwfF14sqykxHmI0IZ0lqybfWV0p1kycQC01EGga +Jir5x0ACupVlQKWNNuLmUc1ia1ALvGO9G5+P0bqNWBPlIq+0GaU8PeyALVndeKH++oB OcCJOLtqaDvXsytNP52os2dG0eHFxFa85p+PYhVeeZ0Jv+a4wJ5iHtG/+Fi9blHqQqkF iqYeJMorbvx191EhfSa+E1tMB2Qsq8Bv53nxWxh+1TUMgAyVnQf85m7h2AoX37nTCunX qg7+eMuMLdJWx9iChSahWJOWNA0NcirfEX0wXTqnOjpQF5G9HIK8+21Y3XKA2flIvxHr owVQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=3vCyH9yces12BRj3OiWTj9K/S80Ld9W3fkzJn+Hp168=; b=wSJDjoBuKt9SuOoLG3C+ZQdFv04XCghP+Hk/spXsNczbkhXeX94lj+g2ALmYaTEScp 2dyBxps9ZC1L79MrxHMG2gWl50i2RNDD2UNlKeoRkiiZQNRp3V/7gK1sx4K0KkujQwks 0fzwaY9B1D/ktTuOUZkrQaznPOjKiaQxPih8qP/YNZ2S1/QaH0b5Sts/AnewcOieCpBP s/Xz/Kf2KZJwVEtZ+YHni92b/Ni3i9YhTuDUDm0Rfk+DLTPOJcvBQ4+2MwLx6LbelQf4 g0n5F6RWng4BTBffQfbIlvoWVmfD5brE4duN3uD2MVWFKafG3deKYOvC0Skx+MINQ7wT rPag== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@lca.pw header.s=google header.b=oAhYJb3Y; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id i25si15282493edr.503.2020.07.07.06.08.23; Tue, 07 Jul 2020 06:08:46 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@lca.pw header.s=google header.b=oAhYJb3Y; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727827AbgGGNEp (ORCPT + 99 others); Tue, 7 Jul 2020 09:04:45 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39206 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727800AbgGGNEo (ORCPT ); Tue, 7 Jul 2020 09:04:44 -0400 Received: from mail-qv1-xf44.google.com (mail-qv1-xf44.google.com [IPv6:2607:f8b0:4864:20::f44]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 8123CC061755 for ; Tue, 7 Jul 2020 06:04:44 -0700 (PDT) Received: by mail-qv1-xf44.google.com with SMTP id m9so18779473qvx.5 for ; Tue, 07 Jul 2020 06:04:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=lca.pw; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:content-transfer-encoding:in-reply-to; bh=3vCyH9yces12BRj3OiWTj9K/S80Ld9W3fkzJn+Hp168=; b=oAhYJb3YIr+QqdyG66wYt2hOm1J8utDn7lO7TMYNOFmAxXD3xRaj6WQei4kOIM1PB6 P9mNqreAczhhJJ6l/zJ82QwVtL56a8dmu41XvcO4HJlrBCVWsYVFpkEGnLEk5uYjVRVs 5dudlShmnAvX3n4CN002ngEIkhCNxAEshUH16pnG2VnMqw89/YIRalCc5ksx3d8a0s8Q U4VbFgTwdK/uoXhWmTBasiSMrss6abx1SJI8kiy2r3HB46fbsm6uCboAx6gp0pbkLGMJ 7BA6lnPlkDYM0NNI8c0IgRSMATZ0lGYSj+ieFIfEb8XGvm855TXq2kvpUzlE7ayk30fg kiEQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:content-transfer-encoding :in-reply-to; bh=3vCyH9yces12BRj3OiWTj9K/S80Ld9W3fkzJn+Hp168=; b=kLk19HarKf54Wshw+tn8fWvVh6mzwhKizoYL5T1J43gjCt570YmoTaOUcJgW34RCVf +zkN/tfSIFd2/2RbdBt6SKJHVuk5tVx6YTZEhK5IhlYP9xGsTFLJXzb6DQrwAJCC9II9 4lG728KoJAux5hBap15gw1b0Ztz6wVZ5c7noKEBGSRFGuuLUdETo2Mf0yyX9bUeDAV5b RRHrBd3KHpe70gOTIj3MihZu5l/DnNPq9j2fZozy4toRR0t357jc3xHr3JTNn0EXDM/m T0YhK+LpBbybu1dGcb7GASMvhAewnQ+XJAaTRq4OQXCNaVFC9ADJDDTJoeYDu0p1Dx7O QFJQ== X-Gm-Message-State: AOAM530UbKhR40TXa0p/aks+Fxvm1V//djCTKC0rCJmrgmJR6sgfC/Br 06/ep94KLMIy/Uk3nN9KrkL3IA== X-Received: by 2002:a0c:a4e3:: with SMTP id x90mr22521958qvx.36.1594127083631; Tue, 07 Jul 2020 06:04:43 -0700 (PDT) Received: from lca.pw (pool-71-184-117-43.bstnma.fios.verizon.net. [71.184.117.43]) by smtp.gmail.com with ESMTPSA id x29sm19429137qtx.74.2020.07.07.06.04.41 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 07 Jul 2020 06:04:42 -0700 (PDT) Date: Tue, 7 Jul 2020 09:04:36 -0400 From: Qian Cai To: Michal Hocko Cc: Feng Tang , kernel test robot , Andrew Morton , Johannes Weiner , Matthew Wilcox , Mel Gorman , Kees Cook , Luis Chamberlain , Iurii Zaikin , andi.kleen@intel.com, tim.c.chen@intel.com, dave.hansen@intel.com, ying.huang@intel.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org, lkp@lists.01.org Subject: Re: [mm] 4e2c82a409: ltp.overcommit_memory01.fail Message-ID: <20200707130436.GA992@lca.pw> References: <20200707120619.GO5913@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20200707120619.GO5913@dhcp22.suse.cz> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Jul 07, 2020 at 02:06:19PM +0200, Michal Hocko wrote: > On Tue 07-07-20 07:43:48, Qian Cai wrote: > > > > > > > On Jul 7, 2020, at 6:28 AM, Michal Hocko wrote: > > > > > > Would you have any examples? Because I find this highly unlikely. > > > OVERCOMMIT_NEVER only works when virtual memory is not largerly > > > overcommited wrt to real memory demand. And that tends to be more of > > > an exception rather than a rule. "Modern" userspace (whatever that > > > means) tends to be really hungry with virtual memory which is only used > > > very sparsely. > > > > > > I would argue that either somebody is running an "OVERCOMMIT_NEVER" > > > friendly SW and this is a permanent setting or this is not used at all. > > > At least this is my experience. > > > > > > So I strongly suspect that LTP test failure is not something we should > > > really lose sleep over. It would be nice to find a way to flush existing > > > batches but I would rather see a real workload that would suffer from > > > this imprecision. > > > > I hear you many times that you really don’t care about those use > > cases unless you hear exactly people are using in your world. > > > > For example, when you said LTP oom tests are totally artificial last > > time and how less you care about if they are failing, and I could only > > enjoy their efficiencies to find many issues like race conditions > > and bad error accumulation handling etc that your “real world use > > cases” are going to take ages or no way to flag them. > > Yes, they are effective at hitting corner cases and that is fine. I > am not dismissing their usefulness. I have tried to explain that many > times but let me try again. Seeing a corner case and think about a > potential fix is one thing. On the other hand it is not really ideal to > treat such a failure a hard regression and consider otherwise useful Well, terms like "corner cases" and "hard regression" are rather subjective. > functionality/improvement to be reverted without a proper cost benefit > analysis. Sure having corner cases is not really nice but really, look > at this example again. Overcommit setting is a global thing, it is hard > to change it during runtime nilly willy. Because that might have really > detrimental side effects on all workloads running. So it is quite > reasonable to expect that this is either early after the boot or when > the system is in quiescent state when almost nothing but very core > services are running and likelihood that the mode of operation changes. Not really convinced that is only way people will use those tunables. > > > There are just too many valid use cases in this wild world. The > > difference is that I admit that I don’t know or even aware all the > > use cases, and I don’t believe you do as well. > > Me neither and I am not claiming that. All I am saying is that a real > risk of a regression is reasonably low that I wouldn't lose sleep over > that. It is perfectly fine to address this pro-actively if the fix is > reasonably maintainable. I was mostly reacting to your pushing for a > revert solely based on LTP results. > > LTP is a very useful tool to raise awareness of potential problems but > you shouldn't really follow those results just blindly. You must think I am a newbie tester to give me this piece of advice then.