Received: by 2002:a05:6358:7058:b0:131:369:b2a3 with SMTP id 24csp5077247rwp; Sun, 16 Jul 2023 19:00:13 -0700 (PDT) X-Google-Smtp-Source: APBJJlHdKjsyPe5yfl037z7+pBH62VTdzgA6eZeizHjbWyay4MYmpEPgQQh47saie/ydhh0h+Tyu X-Received: by 2002:a05:622a:1a98:b0:3e4:e2ce:526f with SMTP id s24-20020a05622a1a9800b003e4e2ce526fmr13178536qtc.39.1689559213240; Sun, 16 Jul 2023 19:00:13 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689559213; cv=none; d=google.com; s=arc-20160816; b=b/aJxAZHFF+wqz5ux4XVdkrSQCgYrXzlfsyCbZNYQBJyf9tlF1uQ+FL9Hb60fdYavp 5c2o7cXsxsn4RDJWSGW+3gjKiJPnTWWUrKd4JBjPnoQ0VC8TvnRnFKDF/v/gQ8H7Vrjl sXkY3IA8lGwCpPIWo7qBrGV4PRhdgir1LLCF9ynIOuAHrSAY5w+GRsAli6GHOdbPeNtw ExTIZ/1OPPuU81pfaR+uKKaMSlbBHHfwoB3y+T9SzPOxohNSm9QDrhoPR3rkoDkY7j9e BCluQgwMkhO04NFvnntV6k7H6ScRXbT5LorNGhsdnauMchS1RNhZ7EgcY9Y6WiVZFrRA 6aUw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=0/6R0t0MHhqco+8UDZucjwl/aaBXGVqtQLdiuoDLwso=; fh=okiuQYhv4y3qb8iue6P6LpchsdYOXoaJQSirA6wRgFo=; b=ubilq1Uf1tqwlblzF7tHaWfiI6jNA1ecE4eG2/sp6HRwki+Ivx6obVBvtNYzdDAxOF Bz1Ih881JqmGZeZQhbSM9qzsb2kXLOPQfAFXZbexVgwPwkEPWqThcUl98etvoDX94Ao9 KoCvzgHXWWwPo6lj4h6UXVPehi43pyucM65lJMSsxX2JZz2/pI5dbdEuqoMUAT1DlC2i dmT75hdCAUNhUq4pU1Wj9ESMP1m/8hOItz5wdedbue2R9MOvkLune6wYpC9h5z2t+vLi dZUH9+uBJCjUFtfxhJuSeoMOPO0ccw9db/x3YG2uplbPZCtgoGbkLAX20Hl8ToFxhLu7 uVsw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20221208 header.b=RWXeODKe; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id q3-20020a170902e30300b001b8944aa948si10648837plc.518.2023.07.16.19.00.00; Sun, 16 Jul 2023 19:00:13 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20221208 header.b=RWXeODKe; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230263AbjGQByA (ORCPT + 99 others); Sun, 16 Jul 2023 21:54:00 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42868 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230204AbjGQBx6 (ORCPT ); Sun, 16 Jul 2023 21:53:58 -0400 Received: from mail-vs1-xe33.google.com (mail-vs1-xe33.google.com [IPv6:2607:f8b0:4864:20::e33]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E337E122 for ; Sun, 16 Jul 2023 18:53:56 -0700 (PDT) Received: by mail-vs1-xe33.google.com with SMTP id ada2fe7eead31-44504b2141bso1073002137.2 for ; Sun, 16 Jul 2023 18:53:56 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20221208; t=1689558836; x=1692150836; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=0/6R0t0MHhqco+8UDZucjwl/aaBXGVqtQLdiuoDLwso=; b=RWXeODKevMU++AlYxd4ioslmKJzbf68u3bThrYPVbGcxKhv8otUEa6+hbfY96qiER5 yG9zwdlkb9SErmueuzE9OXzmhX3XF70Uh3CVUci1zN7irgoJZCY2yTYuHF04CTbC5fHt Pc/GRORNx+QAvX5IXD+KIh2mW27JSn+VxVyXd0luS3z0aDbDsFyEF2foCzIfOmSLv/te 9EWKMbfGQPjZ9PahTM7UMRyEhmKjXqzwcSvdyiudRatiEGgs4vByaTvCQ6gTCn0Erf+H sZWZqUW06jMRXBDzX3FIu9FV3ztdZxqBBPUjYsdjVsyS3aku/QgvFp1ej+qjjR7UmkpC B8JA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1689558836; x=1692150836; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=0/6R0t0MHhqco+8UDZucjwl/aaBXGVqtQLdiuoDLwso=; b=GnnxugBemE6oPII5q941DZZ0OIoLYu0c5fOVW3EpQZxzZqplvzOVZh6R1OdxiAM+42 rjENZIGKuhVyh1NQz+9jw2l3iEQFV5RaHvwhjB2os6uEH9JZKO504V+vg7YJRPnn2SqM eYv7XhUHkQsDBaz4KXiQ699Yduy0vecQocvGjT3tpJkVlGImKfJohYVbE+a+sIQf/S5B ANin2CRTg4fXCVJA9wI38aNP9oBGZBYBtCpDrnUIkpv66alwzpU4iwapjsbwDQiF3wTJ iVAJaT94UkaaOkI6AjfjpflOMp+5MHnxNdf1JuBsqidF11xyJjWfkanG9ysVJHIjC7M+ mAwQ== X-Gm-Message-State: ABy/qLZwwEOZYQtkFs0do4+/k220jz7+gK58IF4U/+JHs1tKvkQV24eZ OABC+xhwets0Ede0usOxG/AGfOhX3mNqmh6SlNd7ig== X-Received: by 2002:a05:6102:a37:b0:445:209:cac7 with SMTP id 23-20020a0561020a3700b004450209cac7mr4642927vsb.27.1689558835901; Sun, 16 Jul 2023 18:53:55 -0700 (PDT) MIME-Version: 1.0 References: <04187826-8dad-d17b-2469-2837bafd3cd5@kernel.org> <20230711093224.1bf30ed5@kernel.org> <20230711133915.03482fdc@kernel.org> <2263ae79-690e-8a4d-fca2-31aacc5c9bc6@kernel.org> <143a7ca4-e695-db98-9488-84cf8b78cf86@amd.com> In-Reply-To: From: Mina Almasry Date: Sun, 16 Jul 2023 18:53:44 -0700 Message-ID: Subject: Re: Memory providers multiplexing (Was: [PATCH net-next v4 4/5] page_pool: remove PP_FLAG_PAGE_FRAG flag) To: Jason Gunthorpe Cc: =?UTF-8?Q?Christian_K=C3=B6nig?= , Hari Ramakrishnan , David Ahern , Samiullah Khawaja , Willem de Bruijn , Jakub Kicinski , Christoph Hellwig , John Hubbard , Dan Williams , Jesper Dangaard Brouer , brouer@redhat.com, Alexander Duyck , Yunsheng Lin , davem@davemloft.net, pabeni@redhat.com, netdev@vger.kernel.org, linux-kernel@vger.kernel.org, Lorenzo Bianconi , Yisen Zhuang , Salil Mehta , Eric Dumazet , Sunil Goutham , Geetha sowjanya , Subbaraya Sundeep , hariprasad , Saeed Mahameed , Leon Romanovsky , Felix Fietkau , Ryder Lee , Shayne Chen , Sean Wang , Kalle Valo , Matthias Brugger , AngeloGioacchino Del Regno , Jesper Dangaard Brouer , Ilias Apalodimas , linux-rdma@vger.kernel.org, linux-wireless@vger.kernel.org, linux-arm-kernel@lists.infradead.org, linux-mediatek@lists.infradead.org, Jonathan Lemon , logang@deltatee.com, Bjorn Helgaas Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Spam-Status: No, score=-17.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL,USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Jul 14, 2023 at 8:55=E2=80=AFAM Jason Gunthorpe wrot= e: > > On Fri, Jul 14, 2023 at 07:55:15AM -0700, Mina Almasry wrote: > > > Once the skb frags with struct new_abstraction are in the TCP stack, > > they will need some special handling in code accessing the frags. But > > my RFC already addressed that somewhat because the frags were > > inaccessible in that case. In this case the frags will be both > > inaccessible and will not be struct pages at all (things like > > get_page() will not work), so more special handling will be required, > > maybe. > > It seems sort of reasonable, though there will be interesting concerns > about coherence and synchronization with generial purpose DMABUFs that > will need tackling. > > Still it is such a lot of churn and weridness in the netdev side, I > think you'd do well to present an actual full application as > justification. > > Yes, you showed you can stick unordered TCP data frags into GPU memory > sort of quickly, but have you gone further with this to actually show > it is useful for a real world GPU centric application? > > BTW your cover letter said 96% utilization, the usual server > configuation is one NIC per GPU, so you were able to hit 1500Gb/sec of > TCP BW with this? > I do notice that the number of NICs is missing from our public documentation so far, so I will refrain from specifying how many NICs are on those A3 VMs until the information is public. But I think I can confirm that your general thinking is correct, the perf that we're getting is 96.6% line rate of each GPU/NIC pair, and scales linearly for each NIC/GPU pair we've tested with so far. Line rate of each NIC/GPU pair is 200 Gb/sec. So if we have 8 NIC/GPU pairs we'd be hitting 96.6% * 200 * 8 =3D 1545 GB/s= ec. If we have, say, 2 NIC/GPU pairs, we'd be hitting 96.6% * 200 * 2 =3D 384 G= B/sec ... etc. --=20 Thanks, Mina