Received: by 10.223.185.116 with SMTP id b49csp3912794wrg; Tue, 13 Feb 2018 09:37:40 -0800 (PST) X-Google-Smtp-Source: AH8x224OMlTHkNXAMg9kH2sihQPlXp0KUhpQvGHsMPcy+bvbV6zZ++BjyTjeSF/W6Vv91jWfSKoT X-Received: by 10.101.76.143 with SMTP id m15mr1591168pgt.445.1518543460744; Tue, 13 Feb 2018 09:37:40 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1518543460; cv=none; d=google.com; s=arc-20160816; b=Wx0Efb1vJNfwoy1/QHn84ImVcCVgsd1vAJOHFiS+ATCjCP+clROqU048KTMLb0ElGF 599+m0xmnVkOaRNxNGKjn3WQM0YNdzZo3kkKgGw1HsoxPmogf4gviMxOabsEl8E8Otdk 0TLjDcLrauyFQYyqEQ5XIeXehE8TAxZvQHOa4j4m1E+sMiAVXRm5KCiSyaKvdVf1+Q2N 5i7aGn6knxb1LA+oWMBwZ4e1AGwiG3dYRjweH4kUyNDvlPcD/4I0BSAJwvbzrZkwHTt0 lzGXudmhGF1j62pCyjHthnL5UST/JoEZUaHybVKB6fstYbvcRy7c2KAbxbrEgZX2RFYy H28Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature:arc-authentication-results; bh=ng9WY7QJFnE6bCPyvZAYZ3VY+u8SxI25DSCTC+Sr0Cw=; b=OL71cjVXgP0dkwaxjYw2+DJRmzuFn4/2z8PN9V0NRrd0d6dWRbufBzodwqB3Nt1LQ2 V6VdLw8Rj+HDVW1dM4u7hafHCIXw5NdTBMMklYQHgl45vR9GtUYpqKjsxz1jRB8+uEvU XQ84uR8QAVHGfEL61aHbsazqZBdWRUJUVB7BS6k33Bxlx4LCYJ8znwmBw0qLBGa3iyOR sTFyns9WxSvO4LYesHZJu3NGFk++XPbUOwstaeMsZqj7LgvAyLCI4oxAEbh4IkLPp4+O Q0OMxv9G8VpNaRIenM7qsaRiUb7FF/WQE0hGRZm8EgF0jdYQ8pW7PehWL7V0jgnK8dqo 0Zdg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=LkSVI16E; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id i70si1409262pgc.148.2018.02.13.09.37.26; Tue, 13 Feb 2018 09:37:40 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=LkSVI16E; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S965379AbeBMRe6 (ORCPT + 99 others); Tue, 13 Feb 2018 12:34:58 -0500 Received: from mail-it0-f48.google.com ([209.85.214.48]:38198 "EHLO mail-it0-f48.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S965099AbeBMRe4 (ORCPT ); Tue, 13 Feb 2018 12:34:56 -0500 Received: by mail-it0-f48.google.com with SMTP id k6so11738126ita.3; Tue, 13 Feb 2018 09:34:55 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=ng9WY7QJFnE6bCPyvZAYZ3VY+u8SxI25DSCTC+Sr0Cw=; b=LkSVI16EIXk0oaXN3P7hbDI5WPoaaiMbpJ85D4UVs/zfmWijXVwuL1g84vsKnCfaGh zw6itd4cTyJS+ltZ5HQRlwzA6LGEEdL7+JHqhNqwjY3Oubm6S8o6uG0RuJCgHGglwJKq gGvQHDbfJ4xIcNgU1FKHWTLfHODC+lF5WIwNK+E9qT9zKIiVRXBdtY1SV4SWIkGT/ffL yWTOqxTMVGZ7nimwv7sIs8FoCJymI1X1ELi/Jkmr7OyxTcZg9K/ZgpQeDr+ahJp+q6o7 sUQFP5zodCue7Mh+9d+jAgBSLbk50eWGpQf/Wb1c1Q5+apOd9XJAXpm4lwGItzXd6VQV 6Jpg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=ng9WY7QJFnE6bCPyvZAYZ3VY+u8SxI25DSCTC+Sr0Cw=; b=RWfWwNe7f31yhNLPH+qsBPf4bELukRp8PqolMNcqnqfMH4K4PoNJ3vcRMg8Zr8gAdJ /7NtCDBUoFo/i0hB9nRTj+HxxHuKi4Sl3vF6o6KCfjnCeywxrlYdih8JHZhYJPPCFEpR JT9a8ICj1EzhpcOJzc5Gtzq9v9naja/6iLXFoHWhEQcn7zOHUJJKdY1eT+lJNLHwdKzf Op6zHkYdfvGcqEI1zHNfkZ1UqGdNNBCWZmyNDvUiFHjUuHN2NA90wPceeNxJy7V6IFpI H8svpSPq/5hxH3hFWA68EUOnbv1JBQ4rd1RALW62EWdxocbfkbPl6k/8Qsat3LN+ZTjG w+yQ== X-Gm-Message-State: APf1xPDWUk0S04xEFd2IF1cM/WXKfmCQOx5lg5DFYhmy+VvDjXPSnMWc p9W2tAk98IPgByJ8UKoD8YU= X-Received: by 10.36.153.2 with SMTP id a2mr2644961ite.71.1518543295280; Tue, 13 Feb 2018 09:34:55 -0800 (PST) Received: from localhost.uwnet.wisc.edu (dyn-72-33-2-110.uwnet.wisc.edu. [72.33.2.110]) by smtp.gmail.com with ESMTPSA id n73sm15496501ioi.84.2018.02.13.09.34.54 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 13 Feb 2018 09:34:54 -0800 (PST) Date: Tue, 13 Feb 2018 11:34:38 -0600 From: Dennis Zhou To: Eric Dumazet Cc: Tejun Heo , Daniel Borkmann , Dmitry Vyukov , syzbot , Alexei Starovoitov , netdev , LKML , syzkaller-bugs@googlegroups.com Subject: Re: lost connection to test machine (4) Message-ID: <20180213173438.GA60641@localhost.uwnet.wisc.edu> References: <001a113f8734783e94056505f8fd@google.com> <00c45ca8-305d-1818-e974-a9903c8494b8@iogearbox.net> <20180212170325.GW695913@devbig577.frc2.facebook.com> <20180212200548.GG695913@devbig577.frc2.facebook.com> <1518528926.3715.173.camel@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1518528926.3715.173.camel@gmail.com> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Eric, On Tue, Feb 13, 2018 at 05:35:26AM -0800, Eric Dumazet wrote: > > Also I would consider using this fix as I had warnings of cpus being > stuck there for more than 50 ms : > > > diff --git a/mm/percpu-vm.c b/mm/percpu-vm.c > index 9158e5a81391ced4e268e3d5dd9879c2bc7280ce..6309b01ceb357be01e857e5f899429403836f41f 100644 > --- a/mm/percpu-vm.c > +++ b/mm/percpu-vm.c > @@ -92,6 +92,7 @@ static int pcpu_alloc_pages(struct pcpu_chunk *chunk, > *pagep = alloc_pages_node(cpu_to_node(cpu), gfp, 0); > if (!*pagep) > goto err; > + cond_resched(); > } > } > return 0; > > This function gets called from pcpu_populate_chunk while holding the pcpu_alloc_mutex and is called from two scenarios. First, when an allocation occurs to a place without backing pages, and second when the workqueue item is scheduled to replenish the number of empty pages. So, I don't think this is a good idea. My understanding is if we're seeing warnings here, that means we're struggling to find backing pages. I believe adding __GFP_NORETRY on the workqueue path as Tejun mentioned above would help with warnings as well, but not if they are caused by the allocation path. Thanks, Dennis