Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp594932pxb; Wed, 27 Jan 2021 16:10:50 -0800 (PST) X-Google-Smtp-Source: ABdhPJyy+Vf+84WjFw3crpLE3XOMn+9rCGnwVBmZPa8wM+yWKhLkn1E+5qoHcB6UNahJKRetbxUk X-Received: by 2002:a17:906:4050:: with SMTP id y16mr3234031ejj.43.1611792650227; Wed, 27 Jan 2021 16:10:50 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1611792650; cv=none; d=google.com; s=arc-20160816; b=nneqNLn7hEp5aVVo5sxsyyJy2djKirYJQe/LDJRDjViqe1xOlpc2nMyPAb7ESc7h5C vi82wBynESoIxYIFO7YlBiosW/uDR+06tKF1AxUTQfXnu8GxF2FeFj83nihSrpZt7Ohl EZ0D/aHwqXP8xP2SSpm43WUbonbDwEby3NeRaDu5/6yV1oxc49M+rN41sRJiLHyI9rb2 LGPc+sd+cTP/9lsM5E8BQLcKS1Dm1W9UuZAxwwd2ZVb0FSf7MpSKqsLnPlKm2UmIoMm2 91rgRVf6QZTR1NM0AV/bC96vsSv7qKgO3mxysFf0zGvBgPVk5cJkexm2Vi39z3srDax+ pi7A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=VJZP8MOLyoKM/GkUK5rpElN/cwuUFmfUf42GSpkbWeQ=; b=I+uOAfSDg0FbThiWIGOzh2Ee6jqUULjmI/XeBGyQNtJuw7kuYGjq6oEPo6aoOxKLI0 Lo+D/mtGMjp8Iay05ofe3OD3vkLlSrRPwaCJpeJGAHgGDxfURKRHuIJ3eqAwt4zvml5L QdHDheiiPdtJLi1H0u71dT3qsq4h0vIaLwhz8/umEE3kth8Csax78dfcKtHniSjme/Hy Jr8Cnca2DG41z0hF6nCGZM0MbQbv70pK3PVUd8DEVB4rbRSoYPiPsNr6292XD3xhsTMU QHpnndCHFOxOwUimfhWnpv9gkJ8X01j7Uck0MF3y24uGWzoun4p1LoCzfJHpg2OoOq2Y caxQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=G6K4nAYU; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id lz2si1073229ejb.81.2021.01.27.16.10.26; Wed, 27 Jan 2021 16:10:50 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=G6K4nAYU; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1344390AbhA0SIq (ORCPT + 99 others); Wed, 27 Jan 2021 13:08:46 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35438 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1343941AbhA0SIo (ORCPT ); Wed, 27 Jan 2021 13:08:44 -0500 Received: from mail-il1-x12e.google.com (mail-il1-x12e.google.com [IPv6:2607:f8b0:4864:20::12e]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 111BEC061573 for ; Wed, 27 Jan 2021 10:08:04 -0800 (PST) Received: by mail-il1-x12e.google.com with SMTP id q5so2710466ilc.10 for ; Wed, 27 Jan 2021 10:08:04 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=VJZP8MOLyoKM/GkUK5rpElN/cwuUFmfUf42GSpkbWeQ=; b=G6K4nAYUowzQBptAwFLHdUeTSXDxQ54VE64+B+o9EdF/N5hAzv+6VdaAacC5LrLzb+ 1p4I+J1M/Q237uxVzWEVqbLdTkuGkqsSIrrq2fUZmO5VF1TV0/0FwS5l8+Q6BI24YMXZ IB4eQ4Y6RAE74hDHDgP6G32hhr7Z9FwsAAQ+UrGp17wZcO0mIo/1JFOx4utQz5ej7B3U o6cgbOyVy5urVnvQ6jdrgqnfPMslVoUIGCskY2o1dTTWvg435vsn27i78SRQv5g7sIs+ P4ikxGlSxL/Abi47SRFOHxu7OtF2NzdotAjMNCStlIWnvgvFk6NpUbgXn+ecbrWDjZ6q siWQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=VJZP8MOLyoKM/GkUK5rpElN/cwuUFmfUf42GSpkbWeQ=; b=SZzL9wU0LEg/e/B2QgEmZ5i9FEMTEbRUvN4Zvay9DujIEUSGpITGaDZSESnQYPTRxb 6V9eq0+jkkUcYJ8eR6csPQAe/69Ren2l2Y7EYAt41lMImNVbhlgzWOVJWscj7v4k+qSV 6N2nSjIQRXbd0k+E8gYwkHQg9VeUdqysN6vLJKdc2VcNa/Z6IB8XpchAg3NfIj9niCqx arSp5JvwR6C+uUZ3Auohjkt1M47bYa3uvfimHe1+oQaeTEEmoIkW/MyUVjbDNP3+dtzK Mkq4SMFrs3E7vdDGRVNofai/ehhcjPJnz4NUsfkWpGZKUxoQmrtWVFJUqATIUfgQRKgC YBYQ== X-Gm-Message-State: AOAM533P24LsJb0hyQo60enVgxMyUUJvFq0pgazn/jBw8aAsS8uLhm/t aiLKcJUqTyPIws8bWRhMmPXm3dOmo7KKYLOZ84+Rgg== X-Received: by 2002:a05:6e02:1d0e:: with SMTP id i14mr9422982ila.69.1611770883172; Wed, 27 Jan 2021 10:08:03 -0800 (PST) MIME-Version: 1.0 References: <20210127175611.62871-1-kuniyu@amazon.co.jp> In-Reply-To: <20210127175611.62871-1-kuniyu@amazon.co.jp> From: Eric Dumazet Date: Wed, 27 Jan 2021 19:07:51 +0100 Message-ID: Subject: Re: [PATCH net] net: Remove redundant calls of sk_tx_queue_clear(). To: Kuniyuki Iwashima Cc: Amit Shah , Boris Pismenny , David Miller , Jakub Kicinski , Kuniyuki Iwashima , LKML , netdev , Tariq Toukan Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jan 27, 2021 at 6:56 PM Kuniyuki Iwashima wrote: > > From: Eric Dumazet > Date: Wed, 27 Jan 2021 18:34:35 +0100 > > On Wed, Jan 27, 2021 at 6:32 PM Kuniyuki Iwashima wrote: > > > > > > From: Eric Dumazet > > > Date: Wed, 27 Jan 2021 18:05:24 +0100 > > > > On Wed, Jan 27, 2021 at 5:52 PM Kuniyuki Iwashima wrote: > > > > > > > > > > From: Eric Dumazet > > > > > Date: Wed, 27 Jan 2021 15:54:32 +0100 > > > > > > On Wed, Jan 27, 2021 at 1:50 PM Kuniyuki Iwashima wrote: > > > > > > > > > > > > > > The commit 41b14fb8724d ("net: Do not clear the sock TX queue in > > > > > > > sk_set_socket()") removes sk_tx_queue_clear() from sk_set_socket() and adds > > > > > > > it instead in sk_alloc() and sk_clone_lock() to fix an issue introduced in > > > > > > > the commit e022f0b4a03f ("net: Introduce sk_tx_queue_mapping"). However, > > > > > > > the original commit had already put sk_tx_queue_clear() in sk_prot_alloc(): > > > > > > > the callee of sk_alloc() and sk_clone_lock(). Thus sk_tx_queue_clear() is > > > > > > > called twice in each path currently. > > > > > > > > > > > > Are you sure ? > > > > > > > > > > > > I do not clearly see the sk_tx_queue_clear() call from the cloning part. > > > > > > > > > > > > Please elaborate. > > > > > > > > > > If sk is not NULL in sk_prot_alloc(), sk_tx_queue_clear() is called [1]. > > > > > Also the callers of sk_prot_alloc() are only sk_alloc() and sk_clone_lock(). > > > > > If they finally return not NULL pointer, sk_tx_queue_clear() is called in > > > > > each function [2][3]. > > > > > > > > > > In the cloning part, sock_copy() is called after sk_prot_alloc(), but > > > > > skc_tx_queue_mapping is defined between skc_dontcopy_begin and > > > > > skc_dontcopy_end in struct sock_common [4]. So, sock_copy() does not > > > > > overwrite skc_tx_queue_mapping, and thus we can initialize it in > > > > > sk_prot_alloc(). > > > > > > > > That is a lot of assumptions. > > > > > > > > What guarantees do we have that skc_tx_queue_mapping will never be > > > > moved out of this section ? > > > > AFAIK it was there by accident, for cache locality reasons, that might > > > > change in the future as we add more stuff in socket. > > > > > > > > I feel this optimization is risky for future changes, for a code path > > > > that is spending thousands of cycles anyway. > > > > > > If someone try to move skc_tx_queue_mapping out of the section, should > > > they take care about where it is used ? > > I'm sorry if it might be misleading, I would like to mean someone/they is > the author of a patch to move skc_tx_queue_mapping. > > > > Certainly not. You hide some knowledge, without a comment or some runtime check. > > It was my bad, I should have written about sock_copy() in the changelog. I think you also want to add some compile time check. BUILD_BUG_ON( skc_tx_queue_mapping is in the no copy area) Because maintainers do not remember changelogs in their mind. > > > > You can not ask us (maintainers) to remember thousands of tricks. > > I'll keep this in mind. > > > > > > > > But I agree that we should not write error-prone code. > > > > > > Currently, sk_tx_queue_clear() is the only initialization code in > > > sk_prot_alloc(). So, does it make sense to remove sk_tx_queue_clear() in > > > sk_prot_alloc() so that it does only allocation and other fields are > > > initialized in each caller ? > > Can I ask what you think about this ? Yes, this would be fine. > > > > > > > > > > > > [1] sk_prot_alloc > > > > > https://github.com/torvalds/linux/blob/master/net/core/sock.c#L1693 > > > > > > > > > > [2] sk_alloc > > > > > https://github.com/torvalds/linux/blob/master/net/core/sock.c#L1762 > > > > > > > > > > [3] sk_clone_lock > > > > > https://github.com/torvalds/linux/blob/master/net/core/sock.c#L1986 > > > > > > > > > > [4] struct sock_common > > > > > https://github.com/torvalds/linux/blob/master/include/net/sock.h#L218-L240 > > > > > > > > > > > > > > > > In any case, this seems to be a candidate for net-next, this is not > > > > > > fixing a bug, > > > > > > this would be an optimization at most, and potentially adding a bug. > > > > > > > > > > > > So if you resend this patch, you can mention the old commit in the changelog, > > > > > > but do not add a dubious Fixes: tag > > > > > > > > > > I see. > > > > > > > > > > I will remove the tag and resend this as a net-next candidate. > > > > > > > > > > Thank you, > > > > > Kuniyuki > > > > > > > > > > > > > > > > > > > > > > > > This patch removes the redundant calls of sk_tx_queue_clear() in sk_alloc() > > > > > > > and sk_clone_lock(). > > > > > > > > > > > > > > Fixes: 41b14fb8724d ("net: Do not clear the sock TX queue in sk_set_socket()") > > > > > > > CC: Tariq Toukan > > > > > > > CC: Boris Pismenny > > > > > > > Signed-off-by: Kuniyuki Iwashima > > > > > > > Reviewed-by: Amit Shah > > > > > > > --- > > > > > > > net/core/sock.c | 2 -- > > > > > > > 1 file changed, 2 deletions(-) > > > > > > > > > > > > > > diff --git a/net/core/sock.c b/net/core/sock.c > > > > > > > index bbcd4b97eddd..5c665ee14159 100644 > > > > > > > --- a/net/core/sock.c > > > > > > > +++ b/net/core/sock.c > > > > > > > @@ -1759,7 +1759,6 @@ struct sock *sk_alloc(struct net *net, int family, gfp_t priority, > > > > > > > cgroup_sk_alloc(&sk->sk_cgrp_data); > > > > > > > sock_update_classid(&sk->sk_cgrp_data); > > > > > > > sock_update_netprioidx(&sk->sk_cgrp_data); > > > > > > > - sk_tx_queue_clear(sk); > > > > > > > } > > > > > > > > > > > > > > return sk; > > > > > > > @@ -1983,7 +1982,6 @@ struct sock *sk_clone_lock(const struct sock *sk, const gfp_t priority) > > > > > > > */ > > > > > > > sk_refcnt_debug_inc(newsk); > > > > > > > sk_set_socket(newsk, NULL); > > > > > > > - sk_tx_queue_clear(newsk); > > > > > > > RCU_INIT_POINTER(newsk->sk_wq, NULL); > > > > > > > > > > > > > > if (newsk->sk_prot->sockets_allocated) > > > > > > > -- > > > > > > > 2.17.2 (Apple Git-113) > > > > > > >