Received: by 2002:a05:6a10:16a7:0:0:0:0 with SMTP id gp39csp381257pxb; Thu, 12 Nov 2020 06:17:17 -0800 (PST) X-Google-Smtp-Source: ABdhPJybY5j0Pyta0tXZtkoOfS5k9qXZNg9X59KNp45l+MdEHkBJhZQhHfidU1KtO/fMME/OCyCZ X-Received: by 2002:a17:906:911:: with SMTP id i17mr29894376ejd.40.1605190637415; Thu, 12 Nov 2020 06:17:17 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1605190637; cv=none; d=google.com; s=arc-20160816; b=osvplxnnVntPc1kCKnQL2ZkUrZfp6vHmm8Pi2pqdiVmlwzmD4yxs9rmAf151ZY/Ewn WSh1U6De48TAJuwgoXdjRpP9aABnU7T8yCTtFgNPAoIKhnu3jRKffU/wnT+Ot7Zh9pjQ jAt+iWGcZREMgki/4/Bp0yU9bBYaX10R0wJhcTy4IngAwav9obBKoOqcJ1+BRW159fFw EHQv35XwrJ9mM+5TTcKQfmYMkYevhmp3hMWquDwsoMxx8oy5e6oqkhy1Bk/gEaWPb6G2 DOBy8esIrZmMFqnMjV3IOr11oacHgqBPDm2Yb51pgW9QcogAxvjV2KUM5X+DhHbH43ZG aI7g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:references:in-reply-to:subject:cc:to:dkim-signature :dkim-signature:from; bh=JDqpKAUrItpHlbE5XWwW32ZAnSVkMvadrlti6qz8qYg=; b=wTHFzOto3EI0LWAF20AtfbbqkdCeRKVBfzHHBjPVihctoeYIjyI+sKC5ezlhnEfQkH 8+Z45kbZizgJTKYbF915osiLzOPv2zFUr5hhgws457vL4l0Ti/A4oRK/U5Y8fotePpBX 0ES0sU1QMEPqDHmmOtd+QnLqBh5arZTrDX4aEQHr0OxNcCe11TC5SIvHHL/Z+u+DeN5S ry9zGZx0IxR6nAHPvopMBLnZYqIknN7VlAuyys63+QjEFF4Y4WIRoodVBGouHnz7s5bY rWIclRM053sHJsLrx5nFvTLSs45+oCXN6MLx1mFO9S8jZG3+vSiA7TGpyy1X0fQOW/Ye tafA== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@linutronix.de header.s=2020 header.b=VoYtF8BI; dkim=neutral (no key) header.i=@linutronix.de header.s=2020e header.b=rHy7YOjn; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id o16si3536660eja.345.2020.11.12.06.16.51; Thu, 12 Nov 2020 06:17:17 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=fail header.i=@linutronix.de header.s=2020 header.b=VoYtF8BI; dkim=neutral (no key) header.i=@linutronix.de header.s=2020e header.b=rHy7YOjn; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728088AbgKLOPX (ORCPT + 99 others); Thu, 12 Nov 2020 09:15:23 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:56916 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727035AbgKLOPX (ORCPT ); Thu, 12 Nov 2020 09:15:23 -0500 Received: from galois.linutronix.de (Galois.linutronix.de [IPv6:2a0a:51c0:0:12e:550::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 52E67C0613D1; Thu, 12 Nov 2020 06:15:23 -0800 (PST) From: Thomas Gleixner DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020; t=1605190521; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=JDqpKAUrItpHlbE5XWwW32ZAnSVkMvadrlti6qz8qYg=; b=VoYtF8BI7RLcnvT3P0hzVSJo32ng+9NyUF+JXWg9Pn+C/ZBbgYLROeCmx7sFdyFUT46rJ8 CMgvz9C2LahmdePRNxCLbS0p64AoaEaNgwcwXHWd1keRONGI8MFHJ3BTtaKmpHo4I0TWZ2 vAwHFQ7z9N2nz7aWk4kT7ClUADpdz1eCBBAilOODAeqO0AZvlmYSQAnqlKbuHY09FRhMD2 leUg6Jii4k/Ama8r+Y8t9/x03N3KxlA/ZHF/uovVa8YZaKOZmppCgQROOI9hCdwFYpcd5N /gbLBb+JfH8XDmNEGF6gpfuInPulOkmBNtKHQDfQhVyp4CGcjfioiM8ZPO62Vw== DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020e; t=1605190521; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=JDqpKAUrItpHlbE5XWwW32ZAnSVkMvadrlti6qz8qYg=; b=rHy7YOjnh3ZoFKx73oTAYHW5ZZpTYL59dAo5H0GGx5XK+D91l/tZPlDTpXltpPSMTfhgqa 5GykZaJPuBPhNnCQ== To: Jason Gunthorpe , Ziyad Atiyyeh , Itay Aveksis , Moshe Shemesh Cc: LKML , x86@kernel.org, Joerg Roedel , iommu@lists.linux-foundation.org, linux-pci@vger.kernel.org, Bjorn Helgaas , Marc Zyngier , David Woodhouse Subject: Re: REGRESSION: Re: [patch V2 00/46] x86, PCI, XEN, genirq ...: Prepare for device MSI In-Reply-To: <20201112125531.GA873287@nvidia.com> References: <20200826111628.794979401@linutronix.de> <20201112125531.GA873287@nvidia.com> Date: Thu, 12 Nov 2020 15:15:21 +0100 Message-ID: <87mtzmmzk6.fsf@nanos.tec.linutronix.de> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Jason, (trimmed CC list a bit) On Thu, Nov 12 2020 at 08:55, Jason Gunthorpe wrote: > On Wed, Aug 26, 2020 at 01:16:28PM +0200, Thomas Gleixner wrote: > They were unable to bisect further into the series because some of the > interior commits don't boot :( > > When we try to load the mlx5 driver on a bare metal VF it gets this: > > [Thu Oct 22 08:54:51 2020] DMAR: DRHD: handling fault status reg 2 > [Thu Oct 22 08:54:51 2020] DMAR: [INTR-REMAP] Request device [42:00.2] fa= ult index 1600 [fault reason 37] Blocked a compatibility format interrupt r= equest > [Thu Oct 22 08:55:04 2020] mlx5_core 0000:42:00.1 eth4: Link down > [Thu Oct 22 08:55:11 2020] mlx5_core 0000:42:00.1 eth4: Link up > [Thu Oct 22 08:55:54 2020] mlx5_core 0000:42:00.2: mlx5_cmd_eq_recover:26= 4:(pid 3390): Recovered 1 EQEs on cmd_eq > [Thu Oct 22 08:55:54 2020] mlx5_core 0000:42:00.2: wait_func_handle_exec_= timeout:1051:(pid 3390): cmd0: CREATE_EQ(0=C3=83=C2=97301) recovered after = timeout > [Thu Oct 22 08:55:54 2020] DMAR: DRHD: handling fault status reg 102 > [Thu Oct 22 08:55:54 2020] DMAR: [INTR-REMAP] Request device [42:00.2] fa= ult index 1600 [fault reason 37] Blocked a compatibility format interrupt r= equest > > If you have any idea Ziyad and Itay can run any debugging you like. > > I suppose it is because this series is handing out compatability > addr/data pairs while the IOMMU is setup to only accept remap ones > from SRIOV VFs? So the issue seems to be that the VF device has the default irq domain assigned and not the remapping domain. Let me stare into the code to see how these VF devices are set up and registered with the IOMMU/remap unit. Thanks, tglx