Received: by 2002:a05:6a10:2785:0:0:0:0 with SMTP id ia5csp239214pxb; Thu, 14 Jan 2021 04:54:52 -0800 (PST) X-Google-Smtp-Source: ABdhPJyLyZbDUG6U+GdmgLi2zzyFI1fnvqfxAo6KDmIScuIVDEp6RUgBBIITLeSwQPcFzZcSPuKy X-Received: by 2002:a50:d4d9:: with SMTP id e25mr5695705edj.243.1610628891795; Thu, 14 Jan 2021 04:54:51 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1610628891; cv=none; d=google.com; s=arc-20160816; b=YVgBq5ZYSurOkgYKybv9DS44hjq/KopmeqJBSZPsa2bHbqd6I/X+Ots+xLVkuntV72 WNkuZK/yDsbCvpDhck8mkWdk3CewVLB/HjF0KVvlZFPALGe88Y/pPdUSGUxK2z29CLI1 mkVv095jANXKTQMjfYdMZ7QaY0uOcAK1R7OQu6xK8HWm2KksWgpoOQpSQCfJys5PmxAr HWy4QVcAKPix+em2Fn97iwI861c+eWGaThWrSBZBXa3I2WWKYDC/NW4tB416M8FryxZF dfzjXgwLPzTJF4MHaUOnn07pMAvLm4yiw+20vQPZ2hOyJ9yxrUmEFeLj7Vl+AsHBBZ59 gG4w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=b0xwzUQ09tQNNwHy+qgRNeihhGrvkVw2IceHHCVee2s=; b=L2Wwzk58+XUZH2k9VvZ6BkxW2vmonLrdHE6UW9Q64P4i4dTS7CpvJjpmILBiWlv7NL 7KaaQBBqTV0vE47Qa+ftfXIBL6jlY12nhe/SzWCNouicmgx6LNfRDfCp/pdFhSGSjcdD HvNZGKaB3N+HMZpaUu5vScdLH1H6Gqzm+RaySGWiYgDKscueNg4WmLCndAuJlfdNn/7y QQZoyWxJJEpXyjlUCl5aWx/3UoidZ/ijZP7BYC1i69eytilmj+DL4OvNCVy1yBi3XAQH wQCZH7yZdmC33rM+2Q8GV7qxQ5+cbi6kFH4yFfzk7APJZFHqxBuVTyg7Re1WUlV4F6Rr /KnA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=rVpL7l0B; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id lh17si2354778ejb.328.2021.01.14.04.54.26; Thu, 14 Jan 2021 04:54:51 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=rVpL7l0B; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726427AbhANMwh (ORCPT + 99 others); Thu, 14 Jan 2021 07:52:37 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44014 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725955AbhANMwg (ORCPT ); Thu, 14 Jan 2021 07:52:36 -0500 Received: from mail-qk1-x72a.google.com (mail-qk1-x72a.google.com [IPv6:2607:f8b0:4864:20::72a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 8A21DC061757 for ; Thu, 14 Jan 2021 04:51:56 -0800 (PST) Received: by mail-qk1-x72a.google.com with SMTP id v126so7446694qkd.11 for ; Thu, 14 Jan 2021 04:51:56 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=b0xwzUQ09tQNNwHy+qgRNeihhGrvkVw2IceHHCVee2s=; b=rVpL7l0BbhnTd+dgj69NKSfaqOPMz1TQEKO/BxztQi257jHhLSVhhnMCAJMIzbe8gQ l2ExqJb3dT6JIOD2FvhWiH8GhokBKkpFSGi5Jtwc9pb4cON15WjNzSImVliRBDZZgmsm Hy/vE0rW9VsT49cQZeLRMrSBKg8NX+4UDa4Sa6uzRKCLVqNozo3cuP7elccVeBnnkMVU rnUlQvNpuh7UI56kwO8JpuRRvfuW46SVkqixs95bgpyqWNa8OayAzQsslSxit1aiEjMn gA1aldEqQbaEFNDFbCypRMVdCsttM4VsCxXI7pE4kz102YZNtOjk6sjvIPZg9AA2Tjf1 3bzA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=b0xwzUQ09tQNNwHy+qgRNeihhGrvkVw2IceHHCVee2s=; b=jA9bOx4jGOafhm5P9G8b/xaidzJJSMmAtZ9epwiakaBEF72HexxKr4QGPe3OcueWIP jh3aJHvVFuCAiD/Tf3K72CEb2tSvSJn41unLBdQkgFBUTTj4rDrAJafMJlCzzDfnLmE9 8ksIA2B8G8sJfGV86kr8NHYYtCHdhBWQpMU5PFX9tTtXeecdqUK0En8Jw81QHvyglEwW 8eRbwLMVQpw+Jde/02DtU0YjeOOdhIS2fOMgW0VV00kguw0R/hMpob5ukCr979ZzCK+k cwM3hF9h0ZN3h2FTykscHR21q052RMBipv0h1CbaZrd0Io5ty6fYmfhGr9Vj38DKuVH1 veDg== X-Gm-Message-State: AOAM5328QTGtepZg+Al/i4WQ1uEY2DFP+JPVWq82Oh8AN3OrpZl4F8on FCfQr1vYBxfhCih8X2AO2L1qxLiKqgBwybSODE5i1w== X-Received: by 2002:a37:9a97:: with SMTP id c145mr6769847qke.350.1610628715515; Thu, 14 Jan 2021 04:51:55 -0800 (PST) MIME-Version: 1.0 References: <20210113133523.39205-1-alobakin@pm.me> <20210113133635.39402-1-alobakin@pm.me> <20210113133635.39402-2-alobakin@pm.me> <20210114114046.7272-1-alobakin@pm.me> <20210114124406.9049-1-alobakin@pm.me> In-Reply-To: From: Dmitry Vyukov Date: Thu, 14 Jan 2021 13:51:44 +0100 Message-ID: Subject: Re: [PATCH v2 net-next 2/3] skbuff: (re)use NAPI skb cache on allocation path To: Alexander Lobakin Cc: Eric Dumazet , "David S. Miller" , Jakub Kicinski , Edward Cree , Jonathan Lemon , Willem de Bruijn , Miaohe Lin , Steffen Klassert , Guillaume Nault , Yadu Kishore , Al Viro , netdev , LKML Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jan 14, 2021 at 1:50 PM Dmitry Vyukov wrote: > > On Thu, Jan 14, 2021 at 1:44 PM Alexander Lobakin wrote: > > > > From: Dmitry Vyukov > > Date: Thu, 14 Jan 2021 12:47:31 +0100 > > > > > On Thu, Jan 14, 2021 at 12:41 PM Alexander Lobakin wrote: > > >> > > >> From: Eric Dumazet > > >> Date: Wed, 13 Jan 2021 15:36:05 +0100 > > >> > > >>> On Wed, Jan 13, 2021 at 2:37 PM Alexander Lobakin wrote: > > >>>> > > >>>> Instead of calling kmem_cache_alloc() every time when building a NAPI > > >>>> skb, (re)use skbuff_heads from napi_alloc_cache.skb_cache. Previously > > >>>> this cache was only used for bulk-freeing skbuff_heads consumed via > > >>>> napi_consume_skb() or __kfree_skb_defer(). > > >>>> > > >>>> Typical path is: > > >>>> - skb is queued for freeing from driver or stack, its skbuff_head > > >>>> goes into the cache instead of immediate freeing; > > >>>> - driver or stack requests NAPI skb allocation, an skbuff_head is > > >>>> taken from the cache instead of allocation. > > >>>> > > >>>> Corner cases: > > >>>> - if it's empty on skb allocation, bulk-allocate the first half; > > >>>> - if it's full on skb consuming, bulk-wipe the second half. > > >>>> > > >>>> Also try to balance its size after completing network softirqs > > >>>> (__kfree_skb_flush()). > > >>> > > >>> I do not see the point of doing this rebalance (especially if we do not change > > >>> its name describing its purpose more accurately). > > >>> > > >>> For moderate load, we will have a reduced bulk size (typically one or two). > > >>> Number of skbs in the cache is in [0, 64[ , there is really no risk of > > >>> letting skbs there for a long period of time. > > >>> (32 * sizeof(sk_buff) = 8192) > > >>> I would personally get rid of this function completely. > > >> > > >> When I had a cache of 128 entries, I had worse results without this > > >> function. But seems like I forgot to retest when I switched to the > > >> original size of 64. > > >> I also thought about removing this function entirely, will test. > > >> > > >>> Also it seems you missed my KASAN support request ? > > >> I guess this is a matter of using kasan_unpoison_range(), we can ask for help. > > >> > > >> I saw your request, but don't see a reason for doing this. > > >> We are not caching already freed skbuff_heads. They don't get > > >> kmem_cache_freed before getting into local cache. KASAN poisons > > >> them no earlier than at kmem_cache_free() (or did I miss someting?). > > >> heads being cached just get rid of all references and at the moment > > >> of dropping to the cache they are pretty the same as if they were > > >> allocated. > > > > > > KASAN should not report false positives in this case. > > > But I think Eric meant preventing false negatives. If we kmalloc 17 > > > bytes, KASAN will detect out-of-bounds accesses beyond these 17 bytes. > > > But we put that data into 128-byte blocks, KASAN will miss > > > out-of-bounds accesses beyond 17 bytes up to 128 bytes. > > > The same holds for "logical" use-after-frees when object is free, but > > > not freed into slab. > > > > > > An important custom cache should use annotations like > > > kasan_poison_object_data/kasan_unpoison_range. > > > > As I understand, I should > > kasan_poison_object_data(skbuff_head_cache, skb) and then > > kasan_unpoison_range(skb, sizeof(*skb)) when putting it into the > > cache? > > I think it's the other way around. It should be _un_poisoned when used. > If it's fixed size, then unpoison_object_data should be a better fit: > https://elixir.bootlin.com/linux/v5.11-rc3/source/mm/kasan/common.c#L253 Variable-size poisoning/unpoisoning would be needed for the skb data itself: https://bugzilla.kernel.org/show_bug.cgi?id=199055