From: Denis Kirjanov
Date: Mon, 16 Nov 2020 13:13:45 +0300
Subject: Re: [PATCH] xsk: add cq event
To: Xuan Zhuo
Cc: netdev@vger.kernel.org, Björn Töpel, Magnus Karlsson, Jonathan Lemon,
 "David S. Miller", Jakub Kicinski, Alexei Starovoitov, Daniel Borkmann,
 Jesper Dangaard Brouer, John Fastabend, bpf@vger.kernel.org,
 linux-kernel@vger.kernel.org
In-Reply-To: <5fb245b1.1c69fb81.e2685.976dSMTPIN_ADDED_MISSING@mx.google.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On 11/16/20, Xuan Zhuo wrote:
> On Mon, 16 Nov 2020 12:13:21 +0300, Denis Kirjanov wrote:
>> On 11/16/20, Xuan Zhuo wrote:
>> > When we write all cq items to tx, we have to wait for a new event based
>> > on poll to indicate that it is writable. But the current writability is
>> > triggered based on whether tx is full or not, and in fact, when tx is
>> > not full, the user may not necessarily get a cq item, because it may
>> > still be occupied by the network card. In this case, we need to know
>> > when cq is available, so this patch adds a socket option. When the user
>> > configures this option using setsockopt, a readable event is generated
>> > for all xsk bound to this umem whenever cq is available.
>> >
>> > I can't find a better description of this event.
>> > I think it can also be 'readable', although it is indeed different from
>> > the 'readable' of the new data. But the overhead of xsk checking
>> > whether cq or rx is readable is small.
>> >
>> > Signed-off-by: Xuan Zhuo
>> > ---
>> >  include/net/xdp_sock.h      |  1 +
>> >  include/uapi/linux/if_xdp.h |  1 +
>> >  net/xdp/xsk.c               | 28 ++++++++++++++++++++++++++++
>> >  3 files changed, 30 insertions(+)
>> >
>> > diff --git a/include/net/xdp_sock.h b/include/net/xdp_sock.h
>> > index 1a9559c..faf5b1a 100644
>> > --- a/include/net/xdp_sock.h
>> > +++ b/include/net/xdp_sock.h
>> > @@ -49,6 +49,7 @@ struct xdp_sock {
>> >  	struct xsk_buff_pool *pool;
>> >  	u16 queue_id;
>> >  	bool zc;
>> > +	bool cq_event;
>> >  	enum {
>> >  		XSK_READY = 0,
>> >  		XSK_BOUND,
>> > diff --git a/include/uapi/linux/if_xdp.h b/include/uapi/linux/if_xdp.h
>> > index a78a809..2dba3cb 100644
>> > --- a/include/uapi/linux/if_xdp.h
>> > +++ b/include/uapi/linux/if_xdp.h
>> > @@ -63,6 +63,7 @@ struct xdp_mmap_offsets {
>> >  #define XDP_UMEM_COMPLETION_RING	6
>> >  #define XDP_STATISTICS			7
>> >  #define XDP_OPTIONS			8
>> > +#define XDP_CQ_EVENT			9
>> >
>> >  struct xdp_umem_reg {
>> >  	__u64 addr; /* Start of packet data area */
>> > diff --git a/net/xdp/xsk.c b/net/xdp/xsk.c
>> > index cfbec39..0c53403 100644
>> > --- a/net/xdp/xsk.c
>> > +++ b/net/xdp/xsk.c
>> > @@ -285,7 +285,16 @@ void __xsk_map_flush(void)
>> >
>> >  void xsk_tx_completed(struct xsk_buff_pool *pool, u32 nb_entries)
>> >  {
>> > +	struct xdp_sock *xs;
>> > +
>> >  	xskq_prod_submit_n(pool->cq, nb_entries);
>> > +
>> > +	rcu_read_lock();
>> > +	list_for_each_entry_rcu(xs, &pool->xsk_tx_list, tx_list) {
>> > +		if (xs->cq_event)
>> > +			sock_def_readable(&xs->sk);
>> > +	}
>> > +	rcu_read_unlock();
>> >  }
>> >  EXPORT_SYMBOL(xsk_tx_completed);
>> >
>> > @@ -495,6 +504,9 @@ static __poll_t xsk_poll(struct file *file, struct socket *sock,
>> >  			__xsk_sendmsg(sk);
>> >  	}
>> >
>> > +	if (xs->cq_event && pool->cq && !xskq_prod_is_empty(pool->cq))
>> > +		mask |= EPOLLIN | EPOLLRDNORM;
>> > +
>> >  	if (xs->rx && !xskq_prod_is_empty(xs->rx))
>> >  		mask |= EPOLLIN | EPOLLRDNORM;
>> >  	if (xs->tx && !xskq_cons_is_full(xs->tx))
>> > @@ -882,6 +894,22 @@ static int xsk_setsockopt(struct socket *sock, int level, int optname,
>> >  		mutex_unlock(&xs->mutex);
>> >  		return err;
>> >  	}
>> > +	case XDP_CQ_EVENT:
>> > +	{
>> > +		int cq_event;
>> > +
>> > +		if (optlen < sizeof(cq_event))
>> > +			return -EINVAL;
>> > +		if (copy_from_sockptr(&cq_event, optval, sizeof(cq_event)))
>> > +			return -EFAULT;
>> > +
>> > +		if (cq_event)
>> > +			xs->cq_event = true;
>> > +		else
>> > +			xs->cq_event = false;
>>
>> It's false by default, isn't it?
>
> I added cq_event to "xdp_sock", which is allocated by sk_alloc; that calls
> sk_prot_alloc with __GFP_ZERO. So I think it is false.

Right, I meant: what's the point of setting it explicitly to 'false'?

>
> Thanks.
>
>>
>> > +
>> > +		return 0;
>> > +	}
>> >  	default:
>> >  		break;
>> >  	}
>> > --
>> > 1.8.3.1
>> >
>> >
>
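
For readers following the exchange about the explicit 'false' assignment, here is a minimal user-space sketch (not the patch itself, and not a position either participant stated) of how the XDP_CQ_EVENT case could fold the if/else into one assignment while still letting a caller switch the option off again. The names struct fake_xdp_sock and set_cq_event() are hypothetical, and memcpy() stands in for the kernel's copy_from_sockptr(), so the -EFAULT path is not modelled.

```c
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for the kernel's struct xdp_sock; only the
 * field discussed in the thread is modelled. */
struct fake_xdp_sock {
	bool cq_event;	/* zero-initialised, so false by default */
};

/* Models the XDP_CQ_EVENT case of xsk_setsockopt() in user space.
 * Assumption: optval points to at least optlen readable bytes. */
int set_cq_event(struct fake_xdp_sock *xs, const void *optval, size_t optlen)
{
	int cq_event;

	if (optlen < sizeof(cq_event))
		return -EINVAL;

	/* Stand-in for copy_from_sockptr(); memcpy() cannot fail. */
	memcpy(&cq_event, optval, sizeof(cq_event));

	/* Any non-zero value enables the event, zero disables it. */
	xs->cq_event = !!cq_event;
	return 0;
}
```

Under this reading, the write of 'false' matters only when a caller passes 0 after having enabled the option earlier; on a freshly allocated socket the field is already false.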