Received: by 2002:a6b:fb09:0:0:0:0:0 with SMTP id h9csp427851iog; Thu, 30 Jun 2022 03:36:49 -0700 (PDT) X-Google-Smtp-Source: AGRyM1vguEbHq2zmboXPPPqCR4Gv2nVdii55kpra4Yss2ESg2w2S6J0/Fze+7vfqW0n7KzqQTNK5 X-Received: by 2002:a17:90a:d98f:b0:1ec:943c:4fbe with SMTP id d15-20020a17090ad98f00b001ec943c4fbemr9356874pjv.161.1656585409501; Thu, 30 Jun 2022 03:36:49 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1656585409; cv=none; d=google.com; s=arc-20160816; b=fTnUwjNmD3gPxsrI4Ksw5ho701MJdJnQDfSD9gotp8A+9ggQCnJHcjniFQ/Iod7sM3 DCVLgSKGaOOkf7hhG3uHC1kL/STkhKNxSVxqHZeTmyx+9HGXRdj72fLk0IaXjmZNNwTC xSc3GVIeza51gnnMQ7Xmp4b6NsOqvnnCUJd8x6N424ystHBcsRewND+w+7JzSqV93YHE Wcp2o03Z4EkjNZG16ZyKQevUzIC1IYCNHNg9O7dc3gYrNtqOtW9GhVJh1B1dchO623Pu emchlFF0HKQBDqOAtX0YLbGtZCBGtFI5OHL0oQaPZMLAVq+g2HU8axPst08TC0KCisI5 9BEQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=cbnlfACHSMD2kFp2nkR9i85EQl5tAVPGzguGHV0sdHk=; b=pGOIw29TRlPr49nABFEq6ZC2qi99aOkdEgLx+bWQA5kzcVxFNwEeUPDUiuXLS9G2hQ m90iecK+Z4a02BeFVvCdL7d7cm/N0Ry90E76HJVQ0K93mqVJuf/iBBsYtzqAR22WhZNP rIvhNS9wd6ln2Cyd7BvqO1HJiYCAXB8nRf3WDa0z8XZza1UAloQ++1uKdd8tcs2GmGh0 5JVbBNMH0KA03wHPR0Vd/76M5mxQjtfdozlNR6amvA2eNzzOrbJ/3sradmNJ5KqdBV8h lNOqeNpdOKUvErbPqfrADOSq4YBxrIwx24KJfLc3K9I6TSEUjlkmXc5wsZLLSWtU0tV8 U9NA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=e2jXjAyJ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id d65-20020a633644000000b0040d3073018fsi27981281pga.391.2022.06.30.03.36.38; Thu, 30 Jun 2022 03:36:49 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=e2jXjAyJ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232927AbiF3KY5 (ORCPT + 99 others); Thu, 30 Jun 2022 06:24:57 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:36214 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233146AbiF3KYU (ORCPT ); Thu, 30 Jun 2022 06:24:20 -0400 Received: from mail-yw1-x1136.google.com (mail-yw1-x1136.google.com [IPv6:2607:f8b0:4864:20::1136]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 907637E02F for ; Thu, 30 Jun 2022 03:23:39 -0700 (PDT) Received: by mail-yw1-x1136.google.com with SMTP id 00721157ae682-317710edb9dso174764287b3.0 for ; Thu, 30 Jun 2022 03:23:39 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=cbnlfACHSMD2kFp2nkR9i85EQl5tAVPGzguGHV0sdHk=; b=e2jXjAyJXC6mG92CBxrGO58oqtaV37I1fe1bSVpuNM4nnb9BHM2jyyXedb503XdFQH QJdkVKBuiwGJUmiBJksvpW6BUFmf3dMzzgyz/A3eri5sXm0kOe7d8RNB+/CrPIQB+YB+ GWzeXAJ69dc1n39+VA0ypOSyXDjEfFGeemHj9sLtH1TfEzaBPmlkJHG1on5A0IhJdfRJ Rlx2SYgl0eMegOeO5UPL+GRJGV3Yx4p9JlcI4jiMSgX7mV0eLszZLbwX4g7MTyvSCZww Ac0dSlotFBBwqO8VeZDCa00cTJzo8ic8jmE/yrkETJixIG1BclYrvPKEeDDjrFs7NyZY YhCw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=cbnlfACHSMD2kFp2nkR9i85EQl5tAVPGzguGHV0sdHk=; b=h9TM3jiQ4EYo7ivJqSQrntRlLCHdaPwepzqJU/FYuoxN1Zxae9fORCeBF7MLoViZSb cvYOz3ejQifWKnWnIV9rFJlqxb9masVNiY56cHhLbInYCLRiXKoZ7tLxPcY0w307SD7K YKF4Fb9OrDYx2PUqg342/+lIj0YchbRgwDaecwecQUobjsdWnGlcFf6jyLN6BcAs8nKB OzDmyNzxROvYwXIopJKj0FDRgQwaiC+W0cEGMp8Kduwp0glf94qWpFMwwKw/SQjuHh14 8+eU2IRnlPGSi1Yr3A5/hJvKOhID+RxGv9E0QxaeZLnrbSWAhusce7xB5SYNNW/Ir9aR juag== X-Gm-Message-State: AJIora/SfznZLjbnkdu3gITxJy77zZI9+wwzzK5FwmOyont90f/ZTptX mrI6dmfEfj5b22rPdNJp6i7EzlbKkF1Z8Z8nORLryA== X-Received: by 2002:a81:1809:0:b0:317:c014:f700 with SMTP id 9-20020a811809000000b00317c014f700mr9808354ywy.255.1656584618094; Thu, 30 Jun 2022 03:23:38 -0700 (PDT) MIME-Version: 1.0 References: <07736c2b7019b6883076a06129e06e8f7c5f7154.1656487154.git.mqaio@linux.alibaba.com> In-Reply-To: <07736c2b7019b6883076a06129e06e8f7c5f7154.1656487154.git.mqaio@linux.alibaba.com> From: Eric Dumazet Date: Thu, 30 Jun 2022 12:23:26 +0200 Message-ID: Subject: Re: [PATCH] net: hinic: avoid kernel hung in hinic_get_stats64() To: Qiao Ma Cc: David Miller , Jakub Kicinski , Paolo Abeni , gustavoars@kernel.org, cai.huoqing@linux.dev, Aviad Krawczyk , zhaochen6@huawei.com, netdev , LKML Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-17.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL,USER_IN_DEF_SPF_WL autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jun 29, 2022 at 9:28 AM Qiao Ma wrote: > > When using hinic device as a bond slave device, and reading device stats of > master bond device, the kernel may hung. > > The kernel panic calltrace as follows: > Kernel panic - not syncing: softlockup: hung tasks > Call trace: > native_queued_spin_lock_slowpath+0x1ec/0x31c > dev_get_stats+0x60/0xcc > dev_seq_printf_stats+0x40/0x120 > dev_seq_show+0x1c/0x40 > seq_read_iter+0x3c8/0x4dc > seq_read+0xe0/0x130 > proc_reg_read+0xa8/0xe0 > vfs_read+0xb0/0x1d4 > ksys_read+0x70/0xfc > __arm64_sys_read+0x20/0x30 > el0_svc_common+0x88/0x234 > do_el0_svc+0x2c/0x90 > el0_svc+0x1c/0x30 > el0_sync_handler+0xa8/0xb0 > el0_sync+0x148/0x180 > > And the calltrace of task that actually caused kernel hungs as follows: > __switch_to+124 > __schedule+548 > schedule+72 > schedule_timeout+348 > __down_common+188 > __down+24 > down+104 > hinic_get_stats64+44 [hinic] > dev_get_stats+92 > bond_get_stats+172 [bonding] > dev_get_stats+92 > dev_seq_printf_stats+60 > dev_seq_show+24 > seq_read_iter+964 > seq_read+220 > proc_reg_read+164 > vfs_read+172 > ksys_read+108 > __arm64_sys_read+28 > el0_svc_common+132 > do_el0_svc+40 > el0_svc+24 > el0_sync_handler+164 > el0_sync+324 > > When getting device stats from bond, kernel will call bond_get_stats(). > It first holds the spinlock bond->stats_lock, and then call > hinic_get_stats64() to collect hinic device's stats. > However, hinic_get_stats64() calls `down(&nic_dev->mgmt_lock)` to > protect its critical section, which may schedule current task out. > And if system is under high pressure, the task cannot be woken up > immediately, which eventually triggers kernel hung panic. > > Fixes: edd384f682cc ("net-next/hinic: Add ethtool and stats") > Signed-off-by: Qiao Ma > --- > drivers/net/ethernet/huawei/hinic/hinic_dev.h | 1 + > drivers/net/ethernet/huawei/hinic/hinic_main.c | 7 +++---- > 2 files changed, 4 insertions(+), 4 deletions(-) > > diff --git a/drivers/net/ethernet/huawei/hinic/hinic_dev.h b/drivers/net/ethernet/huawei/hinic/hinic_dev.h > index fb3e89141a0d..1fb343d03fd5 100644 > --- a/drivers/net/ethernet/huawei/hinic/hinic_dev.h > +++ b/drivers/net/ethernet/huawei/hinic/hinic_dev.h > @@ -97,6 +97,7 @@ struct hinic_dev { > > struct hinic_txq_stats tx_stats; > struct hinic_rxq_stats rx_stats; > + spinlock_t stats_lock; > > u8 rss_tmpl_idx; > u8 rss_hash_engine; > diff --git a/drivers/net/ethernet/huawei/hinic/hinic_main.c b/drivers/net/ethernet/huawei/hinic/hinic_main.c > index 56a89793f47d..32a3d700ad26 100644 > --- a/drivers/net/ethernet/huawei/hinic/hinic_main.c > +++ b/drivers/net/ethernet/huawei/hinic/hinic_main.c > @@ -125,11 +125,13 @@ static void update_nic_stats(struct hinic_dev *nic_dev) > { > int i, num_qps = hinic_hwdev_num_qps(nic_dev->hwdev); > > + spin_lock(&nic_dev->stats_lock); > for (i = 0; i < num_qps; i++) > update_rx_stats(nic_dev, &nic_dev->rxqs[i]); > > for (i = 0; i < num_qps; i++) > update_tx_stats(nic_dev, &nic_dev->txqs[i]); > + spin_unlock(&nic_dev->stats_lock); > } > > /** > @@ -859,13 +861,9 @@ static void hinic_get_stats64(struct net_device *netdev, > nic_rx_stats = &nic_dev->rx_stats; > nic_tx_stats = &nic_dev->tx_stats; > > - down(&nic_dev->mgmt_lock); > - > if (nic_dev->flags & HINIC_INTF_UP)) > update_nic_stats(nic_dev); > > - up(&nic_dev->mgmt_lock); > - Note: The following is racy, because multiple threads can call hinic_get_stats64() at the same time. It needs a loop, see include/linux/u64_stats_sync.h for detail. > stats->rx_bytes = nic_rx_stats->bytes; > stats->rx_packets = nic_rx_stats->pkts; > stats->rx_errors = nic_rx_stats->errors; Same remark for nic_tx_stats->{bytes|pkts|errors} > @@ -1239,6 +1237,7 @@ static int nic_dev_init(struct pci_dev *pdev) > > u64_stats_init(&tx_stats->syncp); > u64_stats_init(&rx_stats->syncp); > + spin_lock_init(&nic_dev->stats_lock); > > nic_dev->vlan_bitmap = devm_bitmap_zalloc(&pdev->dev, VLAN_N_VID, > GFP_KERNEL); > -- > 1.8.3.1 >