Received: by 2002:a25:c593:0:0:0:0:0 with SMTP id v141csp4049383ybe; Mon, 9 Sep 2019 03:25:27 -0700 (PDT) X-Google-Smtp-Source: APXvYqwqVSQxoRkOhclLreBR9p00arCzRCs/+dL8rurgp6M46km12TLzAwBa2oZUv8Fm/UfBigSK X-Received: by 2002:a17:906:a895:: with SMTP id ha21mr18743686ejb.291.1568024727216; Mon, 09 Sep 2019 03:25:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1568024727; cv=none; d=google.com; s=arc-20160816; b=P5/meLkp1Gp29q9MBLkpYSqEtaxIsq/7xFWV50ga7ypg9X4YBDaSUaXJaixvIiAXHP ZdDJ//7aq/En2koNG2twz2JPHcGkw96vaGrf4qQEYX3xA4tIzbUtGAsUS0u11bfTFwv4 kPdmLkWN80w4hb43THKdQs5RnLGyglJ2hslDDiY1/zpzshEOpk9+MJtOwbpoi314Q9+S 2bVCX6VQwH9wRmG+OQWSxo3OLBl3CfZZQSqoqPVUwGTfqEJjdMydjzxaDE7zpCETadVG mcQhMFT3TnRZOUD6OHKHgIrIk36uJZPdcj3b3lCT7P/utydpIiTKath+2mx0GCv3aIg4 u6wQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=9dUXE1qcRb0NU/Ki8UfC+PSPK1pnIXHn3XCZVGYIysQ=; b=bsLhZTh9iTk3fzFm7kQRQcRMiRRlBSS/WRbFsQrAZa8cPDZlWga6+Dr4FQMWrOGd98 HIo++93/HbAr9pkBDbaR0YAkdeH1g/YVqdhdFNtNudlWSIfWHIGLtT8orMsMs8w6kzXl azJcLVE28C8cKEuCU+74efKfp4WmbOzoBNhac5ywDHYSUCd/GRJWkWlkoud2EGXXA1C/ 8NUQVSRTLtR8ABRicoer7zNHyykjXgpU1BpyLT6GCoOkEEnPs7LSE+j6arqD87WAYj+H 7SqsvvjiwgGZH0FtH07rMjRgwWBYdkH9R6rmzeCzOAY/vK2nXNF6IUc3YRIVANcGaVPh K59w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=P4LzuvUk; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id cd13si7093829ejb.160.2019.09.09.03.25.03; Mon, 09 Sep 2019 03:25:27 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=P4LzuvUk; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1731637AbfIHMtl (ORCPT + 99 others); Sun, 8 Sep 2019 08:49:41 -0400 Received: from mail.kernel.org ([198.145.29.99]:39708 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1731594AbfIHMti (ORCPT ); Sun, 8 Sep 2019 08:49:38 -0400 Received: from localhost (unknown [62.28.240.114]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 2350B21924; Sun, 8 Sep 2019 12:49:36 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1567946977; bh=W1koJkvkMYnVS+GM1B7GyDGkTgqgk2lOuB3FVKbM4qY=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=P4LzuvUkoZiaBcsek8gTNhAEcc1H6X4lwHh+R2p6xCfHMnzzbZPSGgchH7/LvCk4V hr7OYMgO9I46bRTSQmVyFeYAPsASVrLSwNfQGpQbggNieVuvdFYHIJuDcfGSd12p3C fSy9vV/vJID3ZhKA2C1uJ9Z/34TZzSHdcEwucSZw= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Eric Dumazet , Jason Baron , Vladimir Rutsky , Soheil Hassas Yeganeh , Neal Cardwell , "David S. Miller" Subject: [PATCH 5.2 07/94] tcp: remove empty skb from write queue in error cases Date: Sun, 8 Sep 2019 13:41:03 +0100 Message-Id: <20190908121150.642000321@linuxfoundation.org> X-Mailer: git-send-email 2.23.0 In-Reply-To: <20190908121150.420989666@linuxfoundation.org> References: <20190908121150.420989666@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Eric Dumazet [ Upstream commit fdfc5c8594c24c5df883583ebd286321a80e0a67 ] Vladimir Rutsky reported stuck TCP sessions after memory pressure events. Edge Trigger epoll() user would never receive an EPOLLOUT notification allowing them to retry a sendmsg(). Jason tested the case of sk_stream_alloc_skb() returning NULL, but there are other paths that could lead both sendmsg() and sendpage() to return -1 (EAGAIN), with an empty skb queued on the write queue. This patch makes sure we remove this empty skb so that Jason code can detect that the queue is empty, and call sk->sk_write_space(sk) accordingly. Fixes: ce5ec440994b ("tcp: ensure epoll edge trigger wakeup when write queue is empty") Signed-off-by: Eric Dumazet Cc: Jason Baron Reported-by: Vladimir Rutsky Cc: Soheil Hassas Yeganeh Cc: Neal Cardwell Acked-by: Soheil Hassas Yeganeh Acked-by: Neal Cardwell Signed-off-by: David S. Miller Signed-off-by: Greg Kroah-Hartman --- net/ipv4/tcp.c | 30 ++++++++++++++++++++---------- 1 file changed, 20 insertions(+), 10 deletions(-) --- a/net/ipv4/tcp.c +++ b/net/ipv4/tcp.c @@ -935,6 +935,22 @@ static int tcp_send_mss(struct sock *sk, return mss_now; } +/* In some cases, both sendpage() and sendmsg() could have added + * an skb to the write queue, but failed adding payload on it. + * We need to remove it to consume less memory, but more + * importantly be able to generate EPOLLOUT for Edge Trigger epoll() + * users. + */ +static void tcp_remove_empty_skb(struct sock *sk, struct sk_buff *skb) +{ + if (skb && !skb->len) { + tcp_unlink_write_queue(skb, sk); + if (tcp_write_queue_empty(sk)) + tcp_chrono_stop(sk, TCP_CHRONO_BUSY); + sk_wmem_free_skb(sk, skb); + } +} + ssize_t do_tcp_sendpages(struct sock *sk, struct page *page, int offset, size_t size, int flags) { @@ -1064,6 +1080,7 @@ out: return copied; do_error: + tcp_remove_empty_skb(sk, tcp_write_queue_tail(sk)); if (copied) goto out; out_err: @@ -1388,18 +1405,11 @@ out_nopush: sock_zerocopy_put(uarg); return copied + copied_syn; +do_error: + skb = tcp_write_queue_tail(sk); do_fault: - if (!skb->len) { - tcp_unlink_write_queue(skb, sk); - /* It is the one place in all of TCP, except connection - * reset, where we can be unlinking the send_head. - */ - if (tcp_write_queue_empty(sk)) - tcp_chrono_stop(sk, TCP_CHRONO_BUSY); - sk_wmem_free_skb(sk, skb); - } + tcp_remove_empty_skb(sk, skb); -do_error: if (copied + copied_syn) goto out; out_err: