Date: Tue, 31 Mar 2020 11:10:53 -0700
From: Davidlohr Bueso
To: Randy Dunlap
Cc: Omar Kilani, linux-kernel@vger.kernel.org
Subject: Re: Weird issue with epoll and kernel >= 5.0
Message-ID: <20200331181053.qyt32mgraa4q6iep@linux-p48b>
References: <34206eb5-1280-4aac-9a50-76f967646ca1@infradead.org>
In-Reply-To: <34206eb5-1280-4aac-9a50-76f967646ca1@infradead.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Sat, 28 Mar 2020, Randy Dunlap wrote:

>On 3/28/20 11:10 AM, Omar Kilani wrote:
>> Hi there,
>>
>> I've observed an issue with epoll and kernels 5.0 and above when a
>> system is generating a lot of epoll events.
>>
>> I see this issue with nginx and jvm / netty based apps (using the
>> jvm's native epoll support as well as netty's own optimized epoll
>> support) but *not* with haproxy (?).
>>
>> I'm not really sure what the actual problem is (nginx complains about
>> epoll_wait with a generic error), but it doesn't happen on 4.19.x and
>> lower.
>>
>> I thought it was a netty problem at first and opened this ticket:
>>
>> https://github.com/netty/netty/issues/8999
>>
>> But then saw the same issue in nginx.
>>
>> I haven't debugged a kernel issue in something like 20 years so I'm
>> not really sure where to start myself.
>>
>> I'd be more than happy to provide my test case that has a very quick
>> repro to anyone who needs it.
>
>Hi,
>Please do.
>
>> Also happy to provide a VM/machine with enough CPUs to trigger it
>> easily (it seems to happen quicker with more CPUs present) to test
>> with.

Yeah, more than a VM, an actual reproducer would be very welcome here.

>
>
>There have been around 10 changes in fs/eventpoll.c since v5.0 was
>released in March, 2019, so it would be helpful if you could test
>the latest mainline kernel to see if the problem is still present.
>
>Hm, it looks like you have identified this commit:
>https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v5.1-rc5&id=c5a282e9635e9c7382821565083db5d260085e3e
>as the/a problem.

Has this been bisected down to this commit? As you mention, there are more
commits in there that are dependent on each other, so I'd like to be
certain this is actually the broken change.

Thanks,
Davidlohr