From: Song Liu
Date: Fri, 25 May 2018 13:10:50 -0700
Subject: Re: WARNING and PANIC in irq_matrix_free
To: Thomas Gleixner
Cc: Tariq Toukan, Dmitry Safonov <0x7f454c46@gmail.com>, open list, Maor Gottlieb, kernel-team@fb.com
References: <16f47fa4-1555-cddb-3dfb-7d56fb992ea1@mellanox.com>
X-Mailing-List: linux-kernel@vger.kernel.org

Hi,

We are seeing something that is probably related. We run ethtool on a
system with a Broadcom NIC to increase the number of combined queues.

[root@ ~]# ethtool -l eth0
Channel parameters for eth0:
Pre-set maximums:
RX:             9
TX:             8
Other:          0
Combined:       17
Current hardware settings:
RX:             0
TX:             0
Other:          0
Combined:       8

[root@ ~]# ethtool -L eth0 combined 16

The last command panics the kernel easily (5 out of 5 in my tests).
I haven't had much luck capturing console output; the only line I got is:

[ 504.727865] WARNING: CPU: 10 PID: 0 at kernel/irq/matrix.c:371 irq_matrix_free+0x32/0xd0

The NIC we have is a Broadcom Limited BCM57302 NetXtreme-C 10Gb/25Gb
Ethernet Controller.

Thanks,
Song

On Wed, May 23, 2018 at 1:49 AM, Thomas Gleixner wrote:
> On Wed, 23 May 2018, Tariq Toukan wrote:
>> On 19/05/2018 2:20 PM, Thomas Gleixner wrote:
>> > On Fri, 18 May 2018, Dmitry Safonov wrote:
>> > > I'm not entirely sure that it's the same fault, but at least backtrace
>> > > looks resembling.
>> >
>> > Yes, it's similar, but not the same issue. I'll stare at the code ...
>> >
>> > Thanks,
>> >
>> > tglx
>> >
>>
>> We still see the issue in our daily regression runs.
>> I have your patch merged into my internal branch, it prints the following:
>>
>> [ 4898.226258] Trying to clear prev_vector: 0
>> [ 4898.226439] Trying to clear prev_vector: 0
>>
>> i.e. vector(0) is lower than FIRST_EXTERNAL_VECTOR.
>
> Could you please enable the vector and irq matrix trace points and capture
> the trace when this happens?
>
> Thanks,
>
> tglx
>