References: <20220928040014.76884-1-kuniyu@amazon.com>
In-Reply-To: <20220928040014.76884-1-kuniyu@amazon.com>
From: Eric Dumazet
Date: Tue, 27 Sep 2022 21:10:22 -0700
Subject: Re: [PATCH v2 net 3/5] tcp/udp: Call inet6_destroy_sock() in IPv6 sk->sk_destruct().
To: Kuniyuki Iwashima
Cc: David Miller, David Ahern, Jakub Kicinski, Kuniyuki Iwashima, LKML, netdev, Paolo Abeni, syzkaller-bugs
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, Sep 27, 2022 at 9:00 PM Kuniyuki Iwashima wrote:
>
> From: Eric Dumazet
> Date: Tue, 27 Sep 2022 20:43:51 -0700
> > On Tue, Sep 27, 2022 at 5:29 PM Kuniyuki Iwashima wrote:
> > >
> > > Originally, inet6_sk(sk)->XXX were changed under lock_sock(), so we were
> > > able to clean them up by calling inet6_destroy_sock() during the IPv6 ->
> > > IPv4 conversion by IPV6_ADDRFORM. However, commit 03485f2adcde ("udpv6:
> > > Add lockless sendmsg() support") added a lockless memory allocation path,
> > > which could cause a memory leak:
> > >
> > > setsockopt(IPV6_ADDRFORM)                 sendmsg()
> > > +-----------------------+                 +-------+
> > > - do_ipv6_setsockopt(sk, ...)             - udpv6_sendmsg(sk, ...)
> > >   - lock_sock(sk)                           ^._ called via udpv6_prot
> > >   - WRITE_ONCE(sk->sk_prot, &tcp_prot)          before WRITE_ONCE()
> > >   - inet6_destroy_sock()
> > >   - release_sock(sk)                        - ip6_make_skb(sk, ...)
> > >                                               ^._ lockless fast path for
> > >                                                   the non-corking case
> > >
> > >                                               - __ip6_append_data(sk, ...)
> > >                                                 - ipv6_local_rxpmtu(sk, ...)
> > >                                                   - xchg(&np->rxpmtu, skb)
> > >                                                     ^._ rxpmtu is never freed.
> > >
> > >                                             - lock_sock(sk)
> > >
> > > For now, rxpmtu is only the case, but let's call inet6_destroy_sock()
> > > in IPv6 sk->sk_destruct() not to miss the future change and a similar
> > > bug fixed in commit e27326009a3d ("net: ping6: Fix memleak in
> > > ipv6_renew_options().")
> >
> > I do not see how your patches prevent rxpmtu to be created at the time
> > of IPV6_ADDRFORM ?
> >
> > There seem to be races.
> >
> > lockless UDP sendmsg() is a disaster really.
>
> I think we are never able to prevent it and races exist unless we remove
> the lockless path itself, so the patch makes sure to free rxpmtu at least
> when we close() the socket. Currently, we can not even free it.

I am saying your patches do not guarantee the rxpmtu is freed at close() time.

Once the v6 socket has been transformed to IPv4 one,
inet6_sock_destruct() is not going to be called.

> >
> > > We can now remove all inet6_destroy_sock() calls from IPv6 protocol
> > > specific ->destroy() functions, but such changes are invasive to
> > > backport. So they can be posted as a follow-up later for net-next.
> > >
> > > Fixes: 03485f2adcde ("udpv6: Add lockless sendmsg() support")
> > > Signed-off-by: Kuniyuki Iwashima
> > > ---
> > >  include/net/ipv6.h  |  1 +
> > >  include/net/udp.h   |  2 +-
> > >  net/ipv4/udp.c      |  8 ++++++--
> > >  net/ipv6/af_inet6.c |  9 ++++++++-
> > >  net/ipv6/udp.c      | 15 ++++++++++++++-
> > >  5 files changed, 30 insertions(+), 5 deletions(-)
> > >
> > > diff --git a/include/net/ipv6.h b/include/net/ipv6.h
> > > index de9dcc5652c4..11f1a9a8b066 100644
> > > --- a/include/net/ipv6.h
> > > +++ b/include/net/ipv6.h
> > > @@ -1178,6 +1178,7 @@ void ipv6_icmp_error(struct sock *sk, struct sk_buff *skb, int err, __be16 port,
> > >  void ipv6_local_error(struct sock *sk, int err, struct flowi6 *fl6, u32 info);
> > >  void ipv6_local_rxpmtu(struct sock *sk, struct flowi6 *fl6, u32 mtu);
> > >
> > > +void inet6_sock_destruct(struct sock *sk);
> > >  int inet6_release(struct socket *sock);
> > >  int inet6_bind(struct socket *sock, struct sockaddr *uaddr, int addr_len);
> > >  int inet6_getname(struct socket *sock, struct sockaddr *uaddr,
> > > diff --git a/include/net/udp.h b/include/net/udp.h
> > > index 5ee88ddf79c3..fee053bcd17c 100644
> > > --- a/include/net/udp.h
> > > +++ b/include/net/udp.h
> > > @@ -247,7 +247,7 @@ static inline bool udp_sk_bound_dev_eq(struct net *net, int bound_dev_if,
> > >  }
> > >
> > >  /* net/ipv4/udp.c */
> > > -void udp_destruct_sock(struct sock *sk);
> > > +void udp_destruct_common(struct sock *sk);
> > >  void skb_consume_udp(struct sock *sk, struct sk_buff *skb, int len);
> > >  int __udp_enqueue_schedule_skb(struct sock *sk, struct sk_buff *skb);
> > >  void udp_skb_destructor(struct sock *sk, struct sk_buff *skb);
> > > diff --git a/net/ipv4/udp.c b/net/ipv4/udp.c
> > > index 560d9eadeaa5..a84ae44db7e2 100644
> > > --- a/net/ipv4/udp.c
> > > +++ b/net/ipv4/udp.c
> > > @@ -1598,7 +1598,7 @@ int __udp_enqueue_schedule_skb(struct sock *sk, struct sk_buff *skb)
> > >  }
> > >  EXPORT_SYMBOL_GPL(__udp_enqueue_schedule_skb);
> > >
> > > -void udp_destruct_sock(struct sock *sk)
> > > +void udp_destruct_common(struct sock *sk)
> > >  {
> > >  	/* reclaim completely the forward allocated memory */
> > >  	struct udp_sock *up = udp_sk(sk);
> > > @@ -1611,10 +1611,14 @@ void udp_destruct_sock(struct sock *sk)
> > >  		kfree_skb(skb);
> > >  	}
> > >  	udp_rmem_release(sk, total, 0, true);
> > > +}
> > > +EXPORT_SYMBOL_GPL(udp_destruct_common);
> > >
> > > +static void udp_destruct_sock(struct sock *sk)
> > > +{
> > > +	udp_destruct_common(sk);
> > >  	inet_sock_destruct(sk);
> > >  }
> > > -EXPORT_SYMBOL_GPL(udp_destruct_sock);
> > >
> > >  int udp_init_sock(struct sock *sk)
> > >  {
> > > diff --git a/net/ipv6/af_inet6.c b/net/ipv6/af_inet6.c
> > > index dbb1430d6cc2..0774cff62f2d 100644
> > > --- a/net/ipv6/af_inet6.c
> > > +++ b/net/ipv6/af_inet6.c
> > > @@ -109,6 +109,13 @@ static __inline__ struct ipv6_pinfo *inet6_sk_generic(struct sock *sk)
> > >  	return (struct ipv6_pinfo *)(((u8 *)sk) + offset);
> > >  }
> > >
> > > +void inet6_sock_destruct(struct sock *sk)
> > > +{
> > > +	inet6_destroy_sock(sk);
> > > +	inet_sock_destruct(sk);
> > > +}
> > > +EXPORT_SYMBOL_GPL(inet6_sock_destruct);
> > > +
> > >  static int inet6_create(struct net *net, struct socket *sock, int protocol,
> > >  			int kern)
> > >  {
> > > @@ -201,7 +208,7 @@ static int inet6_create(struct net *net, struct socket *sock, int protocol,
> > >  			inet->hdrincl = 1;
> > >  	}
> > >
> > > -	sk->sk_destruct		= inet_sock_destruct;
> > > +	sk->sk_destruct		= inet6_sock_destruct;
> > >  	sk->sk_family		= PF_INET6;
> > >  	sk->sk_protocol		= protocol;
> > >
> > > diff --git a/net/ipv6/udp.c b/net/ipv6/udp.c
> > > index 3366d6a77ff2..a5256f7184ab 100644
> > > --- a/net/ipv6/udp.c
> > > +++ b/net/ipv6/udp.c
> > > @@ -56,6 +56,19 @@
> > >  #include
> > >  #include "udp_impl.h"
> > >
> > > +static void udpv6_destruct_sock(struct sock *sk)
> > > +{
> > > +	udp_destruct_common(sk);
> > > +	inet6_sock_destruct(sk);
> > > +}
> > > +
> > > +static int udpv6_init_sock(struct sock *sk)
> > > +{
> > > +	skb_queue_head_init(&udp_sk(sk)->reader_queue);
> > > +	sk->sk_destruct = udpv6_destruct_sock;
> > > +	return 0;
> > > +}
> > > +
> > >  static u32 udp6_ehashfn(const struct net *net,
> > >  			const struct in6_addr *laddr,
> > >  			const u16 lport,
> > > @@ -1723,7 +1736,7 @@ struct proto udpv6_prot = {
> > >  	.connect		= ip6_datagram_connect,
> > >  	.disconnect		= udp_disconnect,
> > >  	.ioctl			= udp_ioctl,
> > > -	.init			= udp_init_sock,
> > > +	.init			= udpv6_init_sock,
> > >  	.destroy		= udpv6_destroy_sock,
> > >  	.setsockopt		= udpv6_setsockopt,
> > >  	.getsockopt		= udpv6_getsockopt,
> > > --
> > > 2.30.2
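
For readers following the objection above, here is a rough userspace sketch of the concern, not kernel code. It assumes (as the reply implies) that the IPV6_ADDRFORM conversion leaves the socket with the IPv4 destructor, so an rxpmtu stored by a late lockless sendmsg() has nobody to free it at close() time. Every toy_* name is hypothetical and only stands in for the kernel function named in its comment.

```c
#include <stdlib.h>

/* Toy stand-in for struct sock; all names here are hypothetical. */
struct toy_sock {
	void (*destruct)(struct toy_sock *sk);	/* models sk->sk_destruct */
	void *rxpmtu;				/* models np->rxpmtu */
};

static int live_allocs;	/* rxpmtu buffers allocated and not yet freed */

/* Models the rxpmtu part of inet6_destroy_sock(). */
static void toy_free_rxpmtu(struct toy_sock *sk)
{
	if (sk->rxpmtu) {
		free(sk->rxpmtu);
		sk->rxpmtu = NULL;
		live_allocs--;
	}
}

/* Models inet_sock_destruct(): knows nothing about IPv6 state. */
static void toy_v4_destruct(struct toy_sock *sk)
{
	(void)sk;
}

/* Models inet6_sock_destruct() from the patch. */
static void toy_v6_destruct(struct toy_sock *sk)
{
	toy_free_rxpmtu(sk);
	toy_v4_destruct(sk);
}

/*
 * Models setsockopt(IPV6_ADDRFORM).
 * ASSUMPTION: the conversion switches the destructor to the IPv4 one.
 */
static void toy_addrform(struct toy_sock *sk)
{
	toy_free_rxpmtu(sk);
	sk->destruct = toy_v4_destruct;
}

/* Models a lockless sendmsg() that stores rxpmtu after the conversion. */
static void toy_late_sendmsg(struct toy_sock *sk)
{
	void *new_buf = malloc(16);
	void *old_buf = sk->rxpmtu;

	if (!new_buf)
		return;
	sk->rxpmtu = new_buf;		/* models xchg(&np->rxpmtu, skb) */
	live_allocs++;
	if (old_buf) {
		free(old_buf);
		live_allocs--;
	}
}

/* Returns how many rxpmtu buffers are still allocated after "close()". */
static int toy_run(int convert_to_v4)
{
	struct toy_sock sk = { toy_v6_destruct, NULL };
	int leaked;

	live_allocs = 0;
	if (convert_to_v4)
		toy_addrform(&sk);
	toy_late_sendmsg(&sk);
	sk.destruct(&sk);		/* models the sk_destruct call at close() */
	leaked = live_allocs;
	free(sk.rxpmtu);		/* tidy up so the toy itself does not leak */
	return leaked;
}
```

In this toy, toy_run(0) ends with nothing outstanding because the IPv6 destructor runs, while toy_run(1) ends with one buffer still allocated because only the IPv4 destructor runs. Whether the real conversion path behaves this way depends on the kernel code, which the sketch does not reproduce.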