Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751681AbeAPPfj (ORCPT + 1 other); Tue, 16 Jan 2018 10:35:39 -0500 Received: from mail-qt0-f177.google.com ([209.85.216.177]:34088 "EHLO mail-qt0-f177.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751131AbeAPPfd (ORCPT ); Tue, 16 Jan 2018 10:35:33 -0500 X-Google-Smtp-Source: ACJfBov9IXLYNGSjoWM6a8KXzOZjOdDKDvYpylSYw0h67ttkwppPrnUVY3/Pq2cxpOYL2THpo1BR4A== Message-ID: <1516116931.9574.3.camel@redhat.com> Subject: Re: [PATCH 0/2] genirq/affinity: try to make sure online CPU is assgined to irq vector From: Laurence Oberman To: Don Brace , Thomas Gleixner , Ming Lei Cc: Christoph Hellwig , Jens Axboe , "linux-block@vger.kernel.org" , "linux-kernel@vger.kernel.org" , Mike Snitzer Date: Tue, 16 Jan 2018 10:35:31 -0500 In-Reply-To: References: <20180115160345.2611-1-ming.lei@redhat.com> <20180115174036.GA20191@infradead.org> <20180116013043.GA3213@ming.t460p> <1516109317.9574.1.camel@redhat.com> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.22.6 (3.22.6-10.el7) Mime-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Return-Path: On Tue, 2018-01-16 at 15:22 +0000, Don Brace wrote: > > -----Original Message----- > > From: Laurence Oberman [mailto:loberman@redhat.com] > > Sent: Tuesday, January 16, 2018 7:29 AM > > To: Thomas Gleixner ; Ming Lei > .com> > > Cc: Christoph Hellwig ; Jens Axboe > >; > > linux-block@vger.kernel.org; linux-kernel@vger.kernel.org; Mike > > Snitzer > > ; Don Brace > > Subject: Re: [PATCH 0/2] genirq/affinity: try to make sure online > > CPU is assgined > > to irq vector > > > > > > It is because of irq_create_affinity_masks(). > > > > > > That still does not answer the question. If the interrupt for a > > > queue > > > is > > > assigned to an offline CPU, then the queue should not be used and > > > never > > > raise an interrupt. That's how managed interrupts have been > > > designed. > > > > > > Thanks, > > > > > >       tglx > > > > > > > > > > > > > > > > I captured a full boot log for this issue for Microsemi, I will > > send it > > to Don Brace. > > I enabled all the HPSA debug and here is snippet > > > > > > .. > > .. > > .. > >   246.751135] INFO: task systemd-udevd:413 blocked for more than > > 120 > > seconds. > > [  246.788008]       Tainted: G          I      4.15.0-rc4.noming+ > > #1 > > [  246.822380] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" > > disables this message. > > [  246.865594] systemd-udevd   D    0   413    411 0x80000004 > > [  246.895519] Call Trace: > > [  246.909713]  ? __schedule+0x340/0xc20 > > [  246.930236]  schedule+0x32/0x80 > > [  246.947905]  schedule_timeout+0x23d/0x450 > > [  246.970047]  ? find_held_lock+0x2d/0x90 > > [  246.991774]  ? wait_for_completion_io+0x108/0x170 > > [  247.018172]  io_schedule_timeout+0x19/0x40 > > [  247.041208]  wait_for_completion_io+0x110/0x170 > > [  247.067326]  ? wake_up_q+0x70/0x70 > > [  247.086801]  hpsa_scsi_do_simple_cmd+0xc6/0x100 [hpsa] > > [  247.114315]  hpsa_scsi_do_simple_cmd_with_retry+0xb7/0x1c0 > > [hpsa] > > [  247.146629]  hpsa_scsi_do_inquiry+0x73/0xd0 [hpsa] > > [  247.174118]  hpsa_init_one+0x12cb/0x1a59 [hpsa] > > This trace comes from internally generated discovery commands. No > SCSI devices have > been presented to the SML yet. > > At this point we should be running on only one CPU. These commands > are meant to use > reply queue 0 which are tied to CPU 0. It's interesting that the > patch helps. > > However, I was wondering if you could inspect the iLo IML logs and > send the > AHS logs for inspection. > > Thanks, > Don Brace > ESC - Smart Storage > Microsemi Corporation Hello Don I took two other dl380 g7's and ran the same kernel and it hangs in the identical place. Its absolutely consistent here. I doubt all three have hardware issues. Nothing is logged of interest in the IML. Ming will have more to share on specifically why it helps. I think he sent that along to you already. Regards Laurence