Received: by 2002:a17:90a:1609:0:0:0:0 with SMTP id n9csp852776pja; Wed, 1 Apr 2020 09:52:00 -0700 (PDT) X-Google-Smtp-Source: ADFU+vuk7nzRLh45slKL8UdYQ+oTShLEOJQPy3K3bzznsJCbfJLVFlN/FurOips7A+iJM2sm3rlQ X-Received: by 2002:a05:6830:1104:: with SMTP id w4mr8214114otq.54.1585759920264; Wed, 01 Apr 2020 09:52:00 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1585759920; cv=none; d=google.com; s=arc-20160816; b=JIEopyzRVSaxoTx3Z8E5ZM9aSG02ThG5rfDK33DZN9VizVl2+JkrFIQtnk8f/vZI1l rlwbfsxZEePeduzWYZrmXHPxiZwVxg+C1w/bGPT++FV6mZ3S3E3Ynns24IU0uMD3R0Q0 /yooDTv5Wx8Zp7jMM+WT+xlyWz8EftPyOsBIDx8bQOz99Y0T2U3C44E/A6EQSDnICxJt Jb4iYPwuljkbvfIWvta0BcpKLwFM4oNG7q0lS83076vm2xZR7M7WhTO/D4rB5NgIwBQ+ FL2tNnGlKydcNGVNXB81f0+iDuY2n+/R5XIF6kSy0W4aQMC3y58NxuGE377/lojWVqYY /WgA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=RHntT7rz/8kXjZW+chO0kzb7YsyM5bEPqAG2duuo+dY=; b=ksnkbrcmio1VkAKQjP2NxV05oCVoJTB5d7rZjwRF2R8j5t6e9jDU1G9kbNflKdnheG nZ7LlJG9+Q8+7UD+B4d0k3nWtC8KBmG4uWUbLcEYW735mWEqwVtnuDBCQCYLEvzwYQ05 84OJp4kWgvGzJGYYiMB2rLkYg23HWTGOYZNdtoi5j8ag1SV8Z+JBRIx1qesuYATjRRMu d1UMzMR+fLpb0Y3plHo+IzBSWG6MuuzGQnBL2sFm1WLs9kvKGg4jaw4AnaImDyEZDqiQ Y6hZc4G4F1XpHl0PFREwueVVXHigofaDmCFWIHzeROPju8jxTTEhp6VhWRpUCoqLyQ5u hE6Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@soleen.com header.s=google header.b=ACyT1dui; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d125si1108400oif.236.2020.04.01.09.51.47; Wed, 01 Apr 2020 09:52:00 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@soleen.com header.s=google header.b=ACyT1dui; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1733224AbgDAQtv (ORCPT + 99 others); Wed, 1 Apr 2020 12:49:51 -0400 Received: from mail-ed1-f68.google.com ([209.85.208.68]:44865 "EHLO mail-ed1-f68.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2389262AbgDAQl2 (ORCPT ); Wed, 1 Apr 2020 12:41:28 -0400 Received: by mail-ed1-f68.google.com with SMTP id i16so645497edy.11 for ; Wed, 01 Apr 2020 09:41:27 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=soleen.com; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=RHntT7rz/8kXjZW+chO0kzb7YsyM5bEPqAG2duuo+dY=; b=ACyT1duiYos03DooLNAqRqKdl9uYfMkBhAjMLzLUDyqTzqG0o5wbU245Y1BVSFDZ+U SkhJVSNuTisEqh60Y4jSkj9bT6NAylbj0ahmFFBUH/aQtjBV7OdqQHh12FSdE31N+NFc +CVM/nVEA/OMUdd6uqmOO1rd/muhSlIklPt82IEeRVW18Fmz0hHB7AwgnUPew0W9RM0D WxMky9mYO94zrKnER/yTYvcGUa2QEwd2j7eeFysVgvhYVIWqUMFaNY+JptZG/1u3VXR/ CPXoJJNF8otuaWjQ0WRdvI2upXO2S2ptQ0aewluNXHuBu+M8aYpLj0IwNCMnLDBvOBGq lpBA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=RHntT7rz/8kXjZW+chO0kzb7YsyM5bEPqAG2duuo+dY=; b=Lo0oVFOqc70kBwZCfbGUortMpQZtnHN9fuHQ9S4vPjalHWPZ0oP59HBfjWX0qH1Tuw LHQlTUdsCGLibyMS2e5H0F6qbV1PiifXiaTCsN5RqlGnDIROlVUwEgJylAGmCgHZMJ5d ceWP6bBRFVDrHiAaICENgUU2hCxIMs1AXn8nXVLyx0k4Ip58zOQrvnUSToxd7I4sEahW L5Nr1hYrokdvk+uqZZV+oSnLsjVrsKpjjO1pchby4jtBAV1dZfUrC6dc230rnbULJ5j7 LX8nTDCmzgQ9SMvf2SOaZrY2R6lcJS6tr/lkhi92CPvP+k5fiq8L+36PrgG+d0LnwfYi Csrw== X-Gm-Message-State: ANhLgQ07/wFGJYpf7o48wfLTOwDtHOiLvLz9OI2/4/ew3UKq8kMTA4IP YcF7doyinhocZGRrqi4qa1M7U504xGJrv0c35EmDEg== X-Received: by 2002:a05:6402:c88:: with SMTP id cm8mr22183019edb.142.1585759286419; Wed, 01 Apr 2020 09:41:26 -0700 (PDT) MIME-Version: 1.0 References: <20200311123848.118638-1-shile.zhang@linux.alibaba.com> <20200401154217.GQ22681@dhcp22.suse.cz> <20200401160048.GU22681@dhcp22.suse.cz> <20200401160929.jwekhr24tb44odea@ca-dmjordan1.us.oracle.com> <20200401161243.GW22681@dhcp22.suse.cz> <20200401161810.xvqikca2x46yqrlx@ca-dmjordan1.us.oracle.com> <20200401162655.GX22681@dhcp22.suse.cz> In-Reply-To: <20200401162655.GX22681@dhcp22.suse.cz> From: Pavel Tatashin Date: Wed, 1 Apr 2020 12:41:13 -0400 Message-ID: Subject: Re: [PATCH v3] mm: fix tick timer stall during deferred page init To: Michal Hocko Cc: Daniel Jordan , Vlastimil Babka , David Hildenbrand , Shile Zhang , Andrew Morton , Kirill Tkhai , linux-mm , LKML Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Apr 1, 2020 at 12:26 PM Michal Hocko wrote: > > On Wed 01-04-20 12:18:10, Daniel Jordan wrote: > > On Wed, Apr 01, 2020 at 06:12:43PM +0200, Michal Hocko wrote: > > > On Wed 01-04-20 12:09:29, Daniel Jordan wrote: > > > > On Wed, Apr 01, 2020 at 06:00:48PM +0200, Michal Hocko wrote: > > > > > On Wed 01-04-20 17:50:22, David Hildenbrand wrote: > > > > > > On 01.04.20 17:42, Michal Hocko wrote: > > > > > > > This needs a double checking but I strongly believe that the lock can be > > > > > > > simply dropped in this path. > > > > > > > > This is what my fix does, it limits the time the resize lock is held. > > > > > > Just remove it from the deferred intialization and add a comment that we > > > deliberately not taking the lock here because abc > > > > I think it has to be a little more involved because of the window where > > interrupts might allocate during deferred init, as Vlastimil pointed out a few > > years ago when the change was made. > > I do not remember any details but do we have any actual real allocation > failure or was this mostly a theoretical concern. Vlastimil? For your > context we are talking about 3a2d7fa8a3d5 ("mm: disable interrupts while > initializing deferred pages") I do not remember seeing any real failures, this was a theoretical window. So, we could potentially simply remove these locks until we see a real boot failure in some interrupt thread. The allocation has to be rather large as well. Pasha