Date: Wed, 24 Apr 2019 14:34:15 -0700
From: Davidlohr Bueso
To: Eric Wong
Cc: linux-kernel@vger.kernel.org, Omar Kilani
Subject: Re: Strange issues with epoll since 5.0
Message-ID: <20190424213415.7muqd4tkuxsgwigz@linux-r8p5>
References: <20190424193903.swlfmfuo6cqnpkwa@dcvr>
In-Reply-To: <20190424193903.swlfmfuo6cqnpkwa@dcvr>

On Wed, 24 Apr 2019, Eric Wong wrote:
>Omar Kilani wrote:
>> Hi there,
>>
>> I'm still trying to piece together a reproducible test that triggers
>> this, but I wanted to post in case someone goes "hmmm... change X
>> might have done this".
>
>Maybe Davidlohr knows, since he's responsible for most of the
>epoll changes in 5.0.

Not really, I have not been made aware of any issues until now.

>
>> Basically, something's broken (or at least, has changed enough to
>> cause problems in user space) in epoll since 5.0. It's still broken in
>> 5.1-rc5.
>>
>> It doesn't happen 100% of the time. It's sort of hard to pin down but
>> I've observed the following:
>>
>> * nginx not accepting connections under load
>> * A java app which uses netty / NIO having strange writability
>> semantics on channels, which confuses netty / java enough to not
>> properly flush written data on the socket.
>>
>> I went and tested these Linux kernels:
>>
>> 4.20.17
>> 4.19.32
>> 4.14.111
>>
>> And the issue(s) do not show up there.
>>
>> I'm still actively chasing this up, and will report back - I haven't
>> touched kernel code in 15 years so I'm a little rusty. :)

A bisection and/or workload that triggers the issue would be great
to see what's going on.

Thanks,
Davidlohr