Received: by 2002:a05:7412:b10a:b0:f3:1519:9f41 with SMTP id az10csp41524rdb; Wed, 29 Nov 2023 19:18:44 -0800 (PST) X-Google-Smtp-Source: AGHT+IGeCogQYmyY4Tx1v4VuM6MSrsOGCyNsEvmydA/GLS98uE3KNXnaqDj2re+6xqTuvrTYnDXR X-Received: by 2002:a05:6870:9605:b0:1fa:16f4:7f3f with SMTP id d5-20020a056870960500b001fa16f47f3fmr16820544oaq.43.1701314324341; Wed, 29 Nov 2023 19:18:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1701314324; cv=none; d=google.com; s=arc-20160816; b=Nh1xQw/4YuXqkXjnchQez2fdtnvG3Ggn3Ax4EeNauPUddUy955zPl47O0UsHK+oeAP tjG/tXWNz9CH8Wk4RIFb/neihRcCCt0vRPyqOePwJPU4jncbFtcbdNWhx9mW3nibzwE+ I1/krGQJY2sPNrBFPlGZEytyt7BRHcLDzvfNP632wNyASvI56ASEfLXJg+xTLionm3JV 7VTJV5u4Gu1EoCsnZtuAhD5x0TqOBIdyK5WpJ8CwXeV921mE34om+5Gp9AadWtblSsnz s5FinA402QyEH98y7IzZ4VaO0b95R+79vem/WnxV6df0zS4G0sEQ+MypF7vawQn0rjaK lCFg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:subject:user-agent:mime-version:date:message-id :dkim-signature; bh=fpc8g0lA/UAskNB5LDyII3L0XJyWCtne1XEN1G8Bsqo=; fh=uTQNGWPcSoguiTKdRqkzbBU3S0NZDnggiXPdYbGJMfE=; b=dllY/OhzT6Tz/jJwLzV6Pt8SIZX3UPuwqHs503pnrO8kqYf7j3RBwFK5XpYd6IgZ/I QOuDCtz7f3DCzNTIMiYnP9r9LE898a0S47v98iTaPc5W0KROHtkDVgnkQbhKMLx5CqJ3 s+z1Aa+BVmjC0lMPuat6e2YQF9rvcCxpJxRPrlbCaa9tIsMuZSYTB8Vtq+rMw0H6RNSj POweMngUmybu/9DOp7tF2l0zA8/qdFm5B84Y5GfhFH1PPMEYhdKxga1x5KrptjSkvqNU VAtQbtSIQoaXJhozvlUnBLobmUgwv/4Qk4fCPInayUwKl2jCOx3JyH6hD80ug+fHrRHq Yd/w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@126.com header.s=s110527 header.b=kKkUpaRc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.32 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=126.com Return-Path: Received: from agentk.vger.email (agentk.vger.email. [23.128.96.32]) by mx.google.com with ESMTPS id ch4-20020a056a0208c400b005bd2c1c6f60si327062pgb.459.2023.11.29.19.18.43 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 29 Nov 2023 19:18:44 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.32 as permitted sender) client-ip=23.128.96.32; Authentication-Results: mx.google.com; dkim=pass header.i=@126.com header.s=s110527 header.b=kKkUpaRc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.32 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=126.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by agentk.vger.email (Postfix) with ESMTP id DE8F6803A6E5; Wed, 29 Nov 2023 19:18:41 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.11 at agentk.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231639AbjK3DST (ORCPT + 99 others); Wed, 29 Nov 2023 22:18:19 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50018 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230393AbjK3DSS (ORCPT ); Wed, 29 Nov 2023 22:18:18 -0500 Received: from m126.mail.126.com (m126.mail.126.com [220.181.12.35]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 9EA37C4; Wed, 29 Nov 2023 19:18:23 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=126.com; s=s110527; h=Message-ID:Date:MIME-Version:Subject:From: Content-Type; bh=fpc8g0lA/UAskNB5LDyII3L0XJyWCtne1XEN1G8Bsqo=; b=kKkUpaRcwCpyW69JN2drALmh3HxkHXtTo+p3opjL6qHu+B/zjmHCsGR9gCG0vT wrm2gFv+RJKc3NQ0Kd3HQVRXZAycPBQsuFh+DPAXVl2boOmLjoaq995xX2e7wTz1 gcS69XR5B635kEp3tXm/y0cbjE7MjDDns+0g36EYQwEJE= Received: from [172.23.69.7] (unknown [121.32.254.146]) by zwqz-smtp-mta-g3-0 (Coremail) with SMTP id _____wCHzzgT_WdlKyv3DA--.60934S2; Thu, 30 Nov 2023 11:10:12 +0800 (CST) Message-ID: <6da55a6b-7852-4832-9541-3f3c45a09a3a@126.com> Date: Thu, 30 Nov 2023 11:10:08 +0800 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v2] net/mlx5e: Fix a race in command alloc flow To: Moshe Shemesh , saeedm@nvidia.com, leon@kernel.org, davem@davemloft.net, edumazet@google.com, kuba@kernel.org, pabeni@redhat.com, jackm@dev.mellanox.co.il, ogerlitz@mellanox.com, roland@purestorage.com, eli@mellanox.com Cc: dinghui@sangfor.com.cn, netdev@vger.kernel.org, linux-rdma@vger.kernel.org, linux-kernel@vger.kernel.org References: <20231121115251.588436-1-lishifeng1992@126.com> From: Shifeng Li In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-CM-TRANSID: _____wCHzzgT_WdlKyv3DA--.60934S2 X-Coremail-Antispam: 1Uf129KBjvJXoWxCr1ruF45Kr43CF48Cw4xCrg_yoW5ZFy8pr W8Gw17AFn5Krsxtrn7WF4jq3W8Jrs7Gw1UGF1v9r1xWwn8Aa4kA34Ikr4jg34UX3yjqFyU JayqqFy8Xr4fX3JanT9S1TB71UUUUUUqnTZGkaVYY2UrUUUUjbIjqfuFe4nvWSU5nxnvy2 9KBjDUYxBIdaVFxhVjvjDU0xZFpf9x07jog4fUUUUU= X-Originating-IP: [121.32.254.146] X-CM-SenderInfo: xolvxx5ihqwiqzzsqiyswou0bp/1S2mtgE4r1pD4XyoMgABs- X-Spam-Status: No, score=-0.6 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on agentk.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (agentk.vger.email [0.0.0.0]); Wed, 29 Nov 2023 19:18:42 -0800 (PST) On 2023/11/27 0:13, Moshe Shemesh wrote: > > > On 11/21/2023 1:52 PM, Shifeng Li wrote: >> Fix a cmd->ent use after free due to a race on command entry. >> Such race occurs when one of the commands releases its last refcount and >> frees its index and entry while another process running command flush >> flow takes refcount to this command entry. The process which handles >> commands flush may see this command as needed to be flushed if the other >> process allocated a ent->idx but didn't set ent to cmd->ent_arr in >> cmd_work_handler(). Fix it by moving the assignment of cmd->ent_arr into >> the spin lock. >> >> [70013.081955] BUG: KASAN: use-after-free in mlx5_cmd_trigger_completions+0x1e2/0x4c0 [mlx5_core] >> [70013.081967] Write of size 4 at addr ffff88880b1510b4 by task kworker/26:1/1433361 >> [70013.081968] >> [70013.081989] CPU: 26 PID: 1433361 Comm: kworker/26:1 Kdump: loaded Tainted: G           OE     4.19.90-25.17.v2101.osc.sfc.6.10.0.0030.ky10.x86_64+debug #1 >> [70013.082001] Hardware name: SANGFOR 65N32-US/ASERVER-G-2605, BIOS SSSS5203 08/19/2020 >> [70013.082028] Workqueue: events aer_isr >> [70013.082053] Call Trace: >> [70013.082067]  dump_stack+0x8b/0xbb >> [70013.082086]  print_address_description+0x6a/0x270 >> [70013.082102]  kasan_report+0x179/0x2c0 >> [70013.082133]  ? mlx5_cmd_trigger_completions+0x1e2/0x4c0 [mlx5_core] >> [70013.082173]  mlx5_cmd_trigger_completions+0x1e2/0x4c0 [mlx5_core] >> [70013.082213]  ? mlx5_cmd_use_polling+0x20/0x20 [mlx5_core] >> [70013.082223]  ? kmem_cache_free+0x1ad/0x1e0 >> [70013.082267]  mlx5_cmd_flush+0x80/0x180 [mlx5_core] >> [70013.082304]  mlx5_enter_error_state+0x106/0x1d0 [mlx5_core] >> [70013.082338]  mlx5_try_fast_unload+0x2ea/0x4d0 [mlx5_core] >> [70013.082377]  remove_one+0x200/0x2b0 [mlx5_core] >> [70013.082390]  ? __pm_runtime_resume+0x58/0x70 >> [70013.082409]  pci_device_remove+0xf3/0x280 >> [70013.082426]  ? pcibios_free_irq+0x10/0x10 >> [70013.082439]  device_release_driver_internal+0x1c3/0x470 >> [70013.082453]  pci_stop_bus_device+0x109/0x160 >> [70013.082468]  pci_stop_and_remove_bus_device+0xe/0x20 >> [70013.082485]  pcie_do_fatal_recovery+0x167/0x550 >> [70013.082493]  aer_isr+0x7d2/0x960 >> [70013.082510]  ? aer_get_device_error_info+0x420/0x420 >> [70013.082526]  ? __schedule+0x821/0x2040 >> [70013.082536]  ? strscpy+0x85/0x180 >> [70013.082543]  process_one_work+0x65f/0x12d0 >> [70013.082556]  worker_thread+0x87/0xb50 >> [70013.082563]  ? __kthread_parkme+0x82/0xf0 >> [70013.082569]  ? process_one_work+0x12d0/0x12d0 >> [70013.082571]  kthread+0x2e9/0x3a0 >> [70013.082579]  ? kthread_create_worker_on_cpu+0xc0/0xc0 >> [70013.082592]  ret_from_fork+0x1f/0x40 >> >> Fixes: e126ba97dba9 ("mlx5: Add driver for Mellanox Connect-IB adapters") >> Signed-off-by: Shifeng Li > > Fixes tag should be : > Fixes: 50b2412b7e78 ("net/mlx5: Avoid possible free of command entry while timeout comp handler") > > Reviewed-by: Moshe Shemesh > I have sent v3. Thanks! > Thanks!