Received: by 2002:a05:6a10:22f:0:0:0:0 with SMTP id 15csp3284140pxk; Mon, 5 Oct 2020 06:08:08 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyi1FL7u605jdvyjlpSQbrs6X+z+U6MDFRgi3mLuREHPmR63iPrByWqUhpTHaJ8czTtxKOl X-Received: by 2002:a17:906:f28f:: with SMTP id gu15mr3635241ejb.291.1601903288508; Mon, 05 Oct 2020 06:08:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1601903288; cv=none; d=google.com; s=arc-20160816; b=seW4bWtcA4loowag00/xAFfbaPjgmDabuWI8AiExyRZDkzbmjIlC7uePaLVYPaKIsD d2Gr94oMSoAHPyPCXVWlH6tLDI3xcW2KEFiTDB+N87S48FSpN2kWShqDl9l9u5MdvBMq 2qJAR9phLNIwnSr4UpuG3m+SEpnjR2QHUkBMGnoOOEFYYZ45LJYfaxV4/h70dSJbcmpT whP1HP+qDpjtxDzR8GD1uCuNl0/FAPL6SYZwy5QLjXizdwlwftsfyp+QKXCcM3tWx/nV oP52fKToxMBDk6DX0NzQ0j0i8Ayo/KNTWuF6/8NYs8IG+YSAbuKdQtjW9K2Uo0AfkOS4 YDBg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:message-id:user-agent:references:in-reply-to :subject:cc:to:from:date:content-transfer-encoding:mime-version :dkim-signature; bh=q9wv59ORm6aP0IMjqBRQ6HTJOI8ic15yZLLWnGbs9wc=; b=YkwEU7eGriPAaFJqDu+htWjX8wlX2I9NTq8Y+gkUI2D6npKizpALIDqysWVJGIQHmQ ZVfVI+GOfRVGpxmqOppoF1LGh5/CPdNvwr/ZylkplCSaKU5pXANpJW7VARg87fEfGPeZ hHEcToKYgmNhwCK3CdWnCRq3GLrzAfllwESGGbJcAro4Ndkb2Sgruk6+OL23u6uyqOpJ b1OcHCkacr4nxmX26bw0AWg75sj99QeN4jVlTJDViPZbQAlPuseW/nKpIk/r4thNzZX3 uJfYEmi4AOT7d0zewIFP69EJf4pTZrkG85tT3XZgCiUmxDbxX8CA3cZv0tRpH39NJSBm /CKg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=F0c9qkT7; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id t12si4973099eje.412.2020.10.05.06.07.45; Mon, 05 Oct 2020 06:08:08 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=F0c9qkT7; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726214AbgJENGl (ORCPT + 99 others); Mon, 5 Oct 2020 09:06:41 -0400 Received: from mail.kernel.org ([198.145.29.99]:58278 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725982AbgJENGk (ORCPT ); Mon, 5 Oct 2020 09:06:40 -0400 Received: from disco-boy.misterjones.org (disco-boy.misterjones.org [51.254.78.96]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id CCEF42074F; Mon, 5 Oct 2020 13:06:39 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1601903199; bh=RPyoRkGcSIwCw/JC5a1gyC2ZdvAZhruEz6S7VcUnUd0=; h=Date:From:To:Cc:Subject:In-Reply-To:References:From; b=F0c9qkT7rPBmxbCG4hUsilQQSlj/yIKOVGDR2x20MYkTe35IMwaqM5dLyIxiKdFc5 CX9uQZv6NMT81YkyvpGR7Njf0H06umzwtVeDtTaThtwzbu9RiwHFBlZ+eVLUzdYvBK gldqT8zg9FjqLAQoQpM74zRpJcKP0YUjHKGXZouA= Received: from disco-boy.misterjones.org ([51.254.78.96] helo=www.loen.fr) by disco-boy.misterjones.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1kPQCL-00HNFd-R0; Mon, 05 Oct 2020 14:06:37 +0100 MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII; format=flowed Content-Transfer-Encoding: 7bit Date: Mon, 05 Oct 2020 14:06:37 +0100 From: Marc Zyngier To: Thierry Reding Cc: linux-tegra@vger.kernel.org, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, Jonathan Hunter , Dmitry Osipenko , Sowjanya Komatineni , Venkat Reddy Talla , Thomas Gleixner , kernel-team@android.com Subject: Re: [PATCH 0/3] soc/tegra: Prevent the PMC driver from corrupting interrupt routing In-Reply-To: <20201005112217.GR425362@ulmo> References: <20201005111443.1390096-1-maz@kernel.org> <20201005112217.GR425362@ulmo> User-Agent: Roundcube Webmail/1.4.8 Message-ID: X-Sender: maz@kernel.org X-SA-Exim-Connect-IP: 51.254.78.96 X-SA-Exim-Rcpt-To: thierry.reding@gmail.com, linux-tegra@vger.kernel.org, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, jonathanh@nvidia.com, digetx@gmail.com, skomatineni@nvidia.com, vreddytalla@nvidia.com, tglx@linutronix.de, kernel-team@android.com X-SA-Exim-Mail-From: maz@kernel.org X-SA-Exim-Scanned: No (on disco-boy.misterjones.org); SAEximRunCond expanded to false Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020-10-05 12:22, Thierry Reding wrote: > On Mon, Oct 05, 2020 at 12:14:40PM +0100, Marc Zyngier wrote: >> Jon recently reported that one of the Tegra systems (Jetson TX2, aka >> tegra186) stopped booting with the introduction of the "IPI as IRQs" >> series. After a few weeks of head scratching and complete puzzlement, >> I obtained a board and started looking at what was happening. >> >> The interrupt hierarchy looks like this: >> >> [DEVICE] -A-> [PMC] -B-> [GIC] >> >> which seems simple enough. However, not all the devices attached to >> the PMC follow this hierarchy, and in some cases, the 'B' link isn't >> present in the HW. In other cases, neither 'A' nor 'B' are present. >> And yet the PMC driver creates such linkages using random hwirq values >> for the non-existent links, potentially overriding existing mappings >> in the process. "What could possibly go wrong?" > > Yes, that would've been my fault. It seemed like the right thing to do > at the time, but the way you describe it makes it obvious that it was > not. I can't say I understand why this would've worked prior to the > rework that made this surface, though. Because until these IPI patches, the range 0-7 never ever appeared as actual hwirqs in the GIC domain. SGIs were handled in the GIC code, behind the core kernel's back. As soon as we start using an actual domain mapping for hwirq 0, the PMC driver starts messing with it. >> It turns out that for the 'B' link, the PMC driver uses hwirq 0, which >> is SGI0 for the GIC, and used as the rescheduling IPI. Obviously, this >> doesn't go very well, nor very far, as the IPI gets routed to random >> drivers. Also, as the handling flow has been overridden, this >> interrupt never gets deactivated and can't fire anymore. Yes, this is >> bad. >> >> The 'A' link is less problematic, but the hwirq value is still out of >> the irqdomain range, and gets remapped every time a new 'A'-less >> driver comes up. >> >> Instead, let's trim the unused hierarchy levels as needed. This >> requires some checks in the upper levels of the hierarchy as we now >> have optional levels, but this looks a lot saner than what we >> currently have. With this, tegra186 is back booting on -next. >> >> I haven't tested any wake-up stuff, nor any other nvidia system (this >> is the only one I have). If people agree to these changes, I can take >> them via the irqchip tree so that they make it into the next merge >> window. > > Yeah, it sounds like this needs to go in ideally before the rework that > caused this to surface in order to preserve bisectibility. But if it > goes in afterwards that's probably fine as well. It's easy to take it as part of the same pull request as the IPI stuff. Not fully bisectable for this platform, but close enough. I may even be able to merge this in *before* the IPI patches (I'd need to rebuild irq/irqchip-next, but that won't alter the commit ids of the individual patches as they are on separate branches). > Let Jon and myself do a bit of testing with this to verify that the > wake > up paths are still working. Sure. Let me know what you find. Thanks, M. -- Jazz is not dead. It just smells funny...