Received: by 2002:a25:ca44:0:0:0:0:0 with SMTP id a65csp565943ybg; Sun, 26 Jul 2020 13:42:03 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzHCNwry9v+p3j/jWRP3s1f2o5QeqWXDVVKZcXwIPB0Lx8rO29g+QlgiOvYKsnOLgANwgfS X-Received: by 2002:a17:906:2e06:: with SMTP id n6mr18115903eji.34.1595796123233; Sun, 26 Jul 2020 13:42:03 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1595796123; cv=none; d=google.com; s=arc-20160816; b=u9Bob3n/KdA6spiOcvPT9hRbeVjGcAyaByfLYTUe5TRNn58E8URtW58p+umNpopZ8t x3uOthZIOPVaOAhLAqbeR1wTYjiu6mvtgTXNT/vaPzxqdqChK2gE96PIHdjD/Aa2+/uv tjH7AU30MZzixdIfSLpVwtu2+6BhL5Hl1E9T7mVCdqhR5GPhW+mrKnUEBsixdYza98/Q PS5D4SBm3f49FkQvZLx4yJOGg+mhKUGUSxIp4De/x7k4/RFDnk3AyiQIlwy7vdksgiEQ Ru17ZGC/3V0QKadnCV+wjrDYNiM4RbN4hKewpU6tNhCOi/anIXI3dbIc602Pk2DIwy84 RXpA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=zMAqM19mk6Z9hrysbVvmXBiZmKyuJZ/Q/akYm6Ar5O8=; b=NaXCKziP7WlDGDoWStPbYbrfOmBNj2OLkaYD9skMQTSfWOfXdbTZ3mXmnMGD9cSUIX SzY6Pv8upBCFeok8zjLouEVKx7t8MzRBDGTRdMk8SmPx47gfg0SFBLjN7KzioEf5qYNd MV3PICmMTtqYQTPNInayXT3P7BhN960uG/2UrSCRRK/SdIAnNH0qxQyhXb6FX0eciI1i ydUbxji2Io0l+7gICDsa5pzuUQ3+J12k6LovdguvG2OFbp8ztMoZsgI17kotvRy5n+za Th1kNRjZz2BsW0DtDePPot6peAYTtOQhK93hTUt08k40w7XrHF/fjTp45vyMtpqPEdjK kFyw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linux-foundation.org header.s=google header.b=VgsUmhcZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id z16si4189585eju.554.2020.07.26.13.41.40; Sun, 26 Jul 2020 13:42:03 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linux-foundation.org header.s=google header.b=VgsUmhcZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726730AbgGZUlh (ORCPT + 99 others); Sun, 26 Jul 2020 16:41:37 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42722 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726292AbgGZUlh (ORCPT ); Sun, 26 Jul 2020 16:41:37 -0400 Received: from mail-lf1-x143.google.com (mail-lf1-x143.google.com [IPv6:2a00:1450:4864:20::143]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id EA43BC0619D2 for ; Sun, 26 Jul 2020 13:41:35 -0700 (PDT) Received: by mail-lf1-x143.google.com with SMTP id m15so7217076lfp.7 for ; Sun, 26 Jul 2020 13:41:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux-foundation.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=zMAqM19mk6Z9hrysbVvmXBiZmKyuJZ/Q/akYm6Ar5O8=; b=VgsUmhcZ3TmL8PgzB6o7dC+JS3frPjbw6MQDMCMiEzv9oMbMxuQYkleMes2lYNtFRX ZDWnCDaWTL+RvDudxDYAUTOCn1MqyzZQxhxZU0etSaJR+Qj6T+1p4T9CjrG6CK524+F5 FyyAoED45cuGDWZQmmbxBFVESvc0nq6NlSqsQ= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=zMAqM19mk6Z9hrysbVvmXBiZmKyuJZ/Q/akYm6Ar5O8=; b=Ay63IjnjUI1E7cQQEkSHRafFThuPkcsaLHj0IH2pE0EatIvxvSBT6R/aAXMT/5fUmw dlFMiuqiWFjqTdBMAhCOoYpnoNFdDiTk7i/gvxYgYAwjY9bLQRjwA87i6bUSeSDh4nZX XRwUedvg6B9RDoT1KsW0aUrBBq0xY7NPkxmvaL5VZJ3IQUIAYSu1gc6iLR6Htgb+IY0L ivynykM6fXATR2yFa88bNyzuC3k13tpRv5zh0FxxKLGGbnXyezlMt3hVseEepKxgqp/I B1zXg/HWtxFivLMxM7IxijbxLWrHDicNtwgSDQFyC/Bt8E5D1o60T0YcbF76iRVwBeES X4oA== X-Gm-Message-State: AOAM533+TBAHZ/wQz3mkxqWrJOrOIz9wFChEx7UvYl0nYvTQpvpc1Ej2 X75G0tENl2XtmHMBDg5765KNFYkQebM= X-Received: by 2002:a05:6512:312b:: with SMTP id p11mr9458825lfd.202.1595796093988; Sun, 26 Jul 2020 13:41:33 -0700 (PDT) Received: from mail-lj1-f174.google.com (mail-lj1-f174.google.com. [209.85.208.174]) by smtp.gmail.com with ESMTPSA id c14sm1955733ljj.112.2020.07.26.13.41.32 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Sun, 26 Jul 2020 13:41:32 -0700 (PDT) Received: by mail-lj1-f174.google.com with SMTP id d17so15030446ljl.3 for ; Sun, 26 Jul 2020 13:41:32 -0700 (PDT) X-Received: by 2002:a2e:9b42:: with SMTP id o2mr8304055ljj.102.1595796091999; Sun, 26 Jul 2020 13:41:31 -0700 (PDT) MIME-Version: 1.0 References: <20200723124749.GA7428@redhat.com> <20200724152424.GC17209@redhat.com> <20200725101445.GB3870@redhat.com> In-Reply-To: From: Linus Torvalds Date: Sun, 26 Jul 2020 13:41:16 -0700 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [RFC PATCH] mm: silence soft lockups from unlock_page To: Hugh Dickins Cc: Oleg Nesterov , Michal Hocko , Linux-MM , LKML , Andrew Morton , Tim Chen , Michal Hocko Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Jul 26, 2020 at 1:30 PM Hugh Dickins wrote: > > I've deduced nothing useful from the logs, will have to leave that > to others here with more experience of them. But my assumption now > is that you have successfully removed one bottleneck, so the tests > get somewhat further and now stick in the next bottleneck, whatever > that may be. Which shows up as "failure", where the unlock_page() > wake_up_page_bit() bottleneck had allowed the tests to proceed in > a more serially sedate way. Well, that's the very optimistic reading. As the optimistic and happy person I am (hah!) I'm going to agree with you, and plan on just merging that patch early in the next merge window. It may fix a real bug in the current trere, but it's much too late to apply right now, particularly with your somewhat ambiguous results. Oleg's theoretical race has probably never been seen, and while the watchdog triggering is clearly a real bug, it's also extreme enough not to really be a strong argument for merging this out-of-window.. > The xhci handle_cmd_completion list_del bugs (on an older version > of the driver): weird, nothing to do with page wakeups, I'll just > have to assume that it's some driver bug exposed by the greater > stress allowed down, and let driver people investigate (if it > still manifests) when we take in your improvements. Do you have the bug-report, just to google against anybody else reporting something simialr> > One nice thing from the comparison runs without your patches: > watchdog panic did crash one of those with exactly the unlock_page() > wake_up_page_bit() softlockup symptom we've been fighting, that did > not appear with your patches. So although the sample size is much > too small to justify a conclusion, it does tend towards confirming > your changes. You win some, you lose some. But yes, I'll take that as a tentative success and that the approach is valid. Thanks, Linus