Received: by 2002:a05:6a10:22f:0:0:0:0 with SMTP id 15csp3699207pxk; Tue, 29 Sep 2020 04:09:13 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwIptyrwgCEPpZ8JNAcESXVdXIoG/D9ka9SYRvAOaexpcgZK6FEZHMU/J+sB9ggx8euhR5+ X-Received: by 2002:a17:906:8401:: with SMTP id n1mr3132699ejx.215.1601377753664; Tue, 29 Sep 2020 04:09:13 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1601377753; cv=none; d=google.com; s=arc-20160816; b=ETWe/7fRDef6ZyU7JYhGr+l8VrUnUdL5PeEvkIRlWL7b+Y6kSbTPXG+fkQBnjDERge THNxdiZQZIVF11vYTIGdSxcLUMPwCDVyZ+M8urs7pMIm790qLRlfCwScSUBC0oBQvTRb DY8aFC5en3VZeMhjX8AipLwWJTDtUoVt5ojWUXYuIugyABepPV2dpQ2a1X6A5n///x8F tLuQxGWklWuQrQwG32j02vOlHlh42FqY6sWvqHWy5vrirNAr9evdvD17l7HjaU29Bw6P /wLYX0quQHcTQFt7x/l7HScXpHxSbtNmVwsCnWF/MgiZ4XOYtCmaBk9xAtvTMKWZ6GTS 1xWw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=BnsOemEA8dyqPj0Jb/QNK5umoBceSPIzLJv/ePvDLB8=; b=wR5h98/jK8UpawHiVZ1WzQMeFzUlGBA6tF0RS0ooiE8+Gzw2PPDS2Mm/b/CYBu0CS9 hYKw8CaDfzbivrNpgAac0CiE248K6e6emVC/ybTQq+ETm6r29VDKtmND9XfEfyBPf/1B gRTmHrphmSTpMaHOwz6Poy1XIV/wo74PgU8fBFTpJDdg1egpoEwJZLxQ2dQXLla64xig /J28tYlmVgGt+UAm1LMGHWJURFcKcef7WQkLcum6G8888lD4FMXp0GuwzGM7VPggeYp/ RV49NaEGoh6IuPV/GiafGraaUfiOHRRzPgGzehxNmB+OFYpcdeVwVfA0Zgdwn9P/pO42 b0Ew== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=NbB5MVnP; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id d2si2722734ejm.716.2020.09.29.04.08.50; Tue, 29 Sep 2020 04:09:13 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=NbB5MVnP; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728840AbgI2LGk (ORCPT + 99 others); Tue, 29 Sep 2020 07:06:40 -0400 Received: from mail.kernel.org ([198.145.29.99]:41564 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728297AbgI2LFP (ORCPT ); Tue, 29 Sep 2020 07:05:15 -0400 Received: from localhost (83-86-74-64.cable.dynamic.v4.ziggo.nl [83.86.74.64]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 89C5621734; Tue, 29 Sep 2020 11:05:12 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1601377512; bh=uQsG3jFKRmd236heYix3rvHt30j7s0Zb8CG8ugHrr6A=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=NbB5MVnPgkupiRzC3E3Q3fmKrnqioy8KVUNKy7Gj/UO45/AAf/c1QJt/kwLAcm4IL IgkREmoD6k/VkvvH1aYVZxmW4mESJj+MsOdqpcO1RDXYYua1uti67vxXA8TJMxUpLN m9AY/m5DU3VlN3raHC7fuEsvl84olw/9Ld2rvDf0= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Qian Cai , "David S. Miller" , Sasha Levin Subject: [PATCH 4.4 32/85] skbuff: fix a data race in skb_queue_len() Date: Tue, 29 Sep 2020 12:59:59 +0200 Message-Id: <20200929105929.830354639@linuxfoundation.org> X-Mailer: git-send-email 2.28.0 In-Reply-To: <20200929105928.198942536@linuxfoundation.org> References: <20200929105928.198942536@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Qian Cai [ Upstream commit 86b18aaa2b5b5bb48e609cd591b3d2d0fdbe0442 ] sk_buff.qlen can be accessed concurrently as noticed by KCSAN, BUG: KCSAN: data-race in __skb_try_recv_from_queue / unix_dgram_sendmsg read to 0xffff8a1b1d8a81c0 of 4 bytes by task 5371 on cpu 96: unix_dgram_sendmsg+0x9a9/0xb70 include/linux/skbuff.h:1821 net/unix/af_unix.c:1761 ____sys_sendmsg+0x33e/0x370 ___sys_sendmsg+0xa6/0xf0 __sys_sendmsg+0x69/0xf0 __x64_sys_sendmsg+0x51/0x70 do_syscall_64+0x91/0xb47 entry_SYSCALL_64_after_hwframe+0x49/0xbe write to 0xffff8a1b1d8a81c0 of 4 bytes by task 1 on cpu 99: __skb_try_recv_from_queue+0x327/0x410 include/linux/skbuff.h:2029 __skb_try_recv_datagram+0xbe/0x220 unix_dgram_recvmsg+0xee/0x850 ____sys_recvmsg+0x1fb/0x210 ___sys_recvmsg+0xa2/0xf0 __sys_recvmsg+0x66/0xf0 __x64_sys_recvmsg+0x51/0x70 do_syscall_64+0x91/0xb47 entry_SYSCALL_64_after_hwframe+0x49/0xbe Since only the read is operating as lockless, it could introduce a logic bug in unix_recvq_full() due to the load tearing. Fix it by adding a lockless variant of skb_queue_len() and unix_recvq_full() where READ_ONCE() is on the read while WRITE_ONCE() is on the write similar to the commit d7d16a89350a ("net: add skb_queue_empty_lockless()"). Signed-off-by: Qian Cai Signed-off-by: David S. Miller Signed-off-by: Sasha Levin --- include/linux/skbuff.h | 14 +++++++++++++- net/unix/af_unix.c | 11 +++++++++-- 2 files changed, 22 insertions(+), 3 deletions(-) diff --git a/include/linux/skbuff.h b/include/linux/skbuff.h index 2528f712b8c0b..95feb153fe9a8 100644 --- a/include/linux/skbuff.h +++ b/include/linux/skbuff.h @@ -1438,6 +1438,18 @@ static inline __u32 skb_queue_len(const struct sk_buff_head *list_) return list_->qlen; } +/** + * skb_queue_len_lockless - get queue length + * @list_: list to measure + * + * Return the length of an &sk_buff queue. + * This variant can be used in lockless contexts. + */ +static inline __u32 skb_queue_len_lockless(const struct sk_buff_head *list_) +{ + return READ_ONCE(list_->qlen); +} + /** * __skb_queue_head_init - initialize non-spinlock portions of sk_buff_head * @list: queue to initialize @@ -1641,7 +1653,7 @@ static inline void __skb_unlink(struct sk_buff *skb, struct sk_buff_head *list) { struct sk_buff *next, *prev; - list->qlen--; + WRITE_ONCE(list->qlen, list->qlen - 1); next = skb->next; prev = skb->prev; skb->next = skb->prev = NULL; diff --git a/net/unix/af_unix.c b/net/unix/af_unix.c index b5e2ef242efe7..ac78c5ac82846 100644 --- a/net/unix/af_unix.c +++ b/net/unix/af_unix.c @@ -191,11 +191,17 @@ static inline int unix_may_send(struct sock *sk, struct sock *osk) return unix_peer(osk) == NULL || unix_our_peer(sk, osk); } -static inline int unix_recvq_full(struct sock const *sk) +static inline int unix_recvq_full(const struct sock *sk) { return skb_queue_len(&sk->sk_receive_queue) > sk->sk_max_ack_backlog; } +static inline int unix_recvq_full_lockless(const struct sock *sk) +{ + return skb_queue_len_lockless(&sk->sk_receive_queue) > + READ_ONCE(sk->sk_max_ack_backlog); +} + struct sock *unix_peer_get(struct sock *s) { struct sock *peer; @@ -1792,7 +1798,8 @@ restart_locked: * - unix_peer(sk) == sk by time of get but disconnected before lock */ if (other != sk && - unlikely(unix_peer(other) != sk && unix_recvq_full(other))) { + unlikely(unix_peer(other) != sk && + unix_recvq_full_lockless(other))) { if (timeo) { timeo = unix_wait_for_peer(other, timeo); -- 2.25.1