Received: by 2002:a05:7412:d1aa:b0:fc:a2b0:25d7 with SMTP id ba42csp1601744rdb; Wed, 31 Jan 2024 03:53:23 -0800 (PST) X-Google-Smtp-Source: AGHT+IF8Lx/s3OOob8RkGiD8k+zaFIST3igsjSIPTrcYyA3GfNRv83EZcYUGEAp8P+kP/YITS6ND X-Received: by 2002:a17:906:3796:b0:a36:8802:ffc5 with SMTP id n22-20020a170906379600b00a368802ffc5mr646471ejc.7.1706702003040; Wed, 31 Jan 2024 03:53:23 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1706702003; cv=pass; d=google.com; s=arc-20160816; b=kMAN9kBChsNCjlG1njiChVf4oco/556B03BoStlYd0CdSDcUswcW/7nItMeFvzCLXY QvXBHePHJSb8lNsdoAaW6a9PviKuRAjh8WRhXxpNeCsRNvhGRew7D5rNVjeTNm0n8YeA MSoNP1UVufp0miPKKoGB72A+etmBoFIvMCJ5tmViqqMj6MrIVxw2sTs0pIv8FXcQy/Ns UtbSTWBPW+mCWJr459FdRYnMM8q0ZMe+/8SVBmk8TFHc+hACJhAR4n8FVmq1Z/3vk9r4 my56XN7l5W09OMsDyOaRpLU+dANJeZ1yuwPAkPSlJeRXoEv1TSXY4prnSJfSg/qASL7s 4f0w== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=in-reply-to:content-disposition:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:references:message-id:subject:cc :to:from:date; bh=iPlwXxaVvpfnvbKdAJRcGLr3pzJy0oS3C5ABqO1focQ=; fh=Z/eqaGoQ5+SN+7v1V+1TLzTCvsGFP3ZyLx26gTNC2PE=; b=zx77H9sPhWiNKQrHoR/ZHT1s2fHGL2Ai6Io4B3ZkBfbAXPOP8A+ZWGcFiKWJySkmcO ZaN5KsKj8OhZcbiqbCohQDjycQvVICQQb5R2AN+xtpwTr0kIUp1FTwiOxNApOFvgP82a Cqciaqnmjm+ShHq63XAmDd8sVJlILQWxENuBlB6v9LeQpgn9hH1woIkEoArGIBALIb3Q oGsm+UsIOVsOYgObCRjAOcEhypYuGHmda4TqMzXCwWPZVBU9ft2QXv6f5E/TzDLWJAvo wsnqiOU7rBtyQqXMC1IJrCfqJtyVXpYGq/195rQ10RPV5zJm32yEkrtNer90IAgJmbX0 x3Fw==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; arc=pass (i=1 spf=pass spfdomain=arm.com dmarc=pass fromdomain=arm.com); spf=pass (google.com: domain of linux-kernel+bounces-46385-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) smtp.mailfrom="linux-kernel+bounces-46385-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com X-Forwarded-Encrypted: i=1; AJvYcCUOcHpuKSXEf6UabsazdiTxGQZJUjZTsj2pruQDohzlzcc4M9rTrVqvi0U5jjzeV6TNxBCMlEVAuDq9q6O21Hldd7GuxKed1FDA1pJbQA== Return-Path: Received: from am.mirrors.kernel.org (am.mirrors.kernel.org. [2604:1380:4601:e00::3]) by mx.google.com with ESMTPS id gx15-20020a170906f1cf00b00a368893a4ffsi270437ejb.521.2024.01.31.03.53.22 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 31 Jan 2024 03:53:23 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-46385-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) client-ip=2604:1380:4601:e00::3; Authentication-Results: mx.google.com; arc=pass (i=1 spf=pass spfdomain=arm.com dmarc=pass fromdomain=arm.com); spf=pass (google.com: domain of linux-kernel+bounces-46385-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) smtp.mailfrom="linux-kernel+bounces-46385-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by am.mirrors.kernel.org (Postfix) with ESMTPS id 576CC1F303A8 for ; Wed, 31 Jan 2024 11:53:22 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 51AE978689; Wed, 31 Jan 2024 11:53:13 +0000 (UTC) Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by smtp.subspace.kernel.org (Postfix) with ESMTP id A0B8E69E08; Wed, 31 Jan 2024 11:53:09 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=217.140.110.172 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1706701991; cv=none; b=l7hh7d9NKbJYiRrQgiShXF2LgMyzJ8aZmaHwFRFyuhI45obrWmbNwYrasCnh2s8UisXJGcmsDkdeBnyI25tbMbZdx2AIC2Q+BSeqtGAOSS+zMgchn3n8f5zvS02NNuXrTxJ+t+HVFgBLXKU+f/dhh3zDmkGFh98PQYQM+XLUJSQ= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1706701991; c=relaxed/simple; bh=9VM+eyWUkqOvsQE3IzsAiMOfeUXSINHE8tEah9vrrxs=; h=Date:From:To:Cc:Subject:Message-ID:References:MIME-Version: Content-Type:Content-Disposition:In-Reply-To; b=CqSV2qarcHYvl5FeexPE7r9YrCSIFTnWY0PPZU7vRRXQYDs5OuImr9h9IzuAylI656l2efDz+MzkhQvs7QK64rRrmy+LZHsjrMYcBQC6s8cKYdYbuCOejuGDeyB3+StZd2iD6xAXmTsPHsEBLgc2htYp4kae8l43JKYEWaT4rDw= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=arm.com; spf=pass smtp.mailfrom=arm.com; arc=none smtp.client-ip=217.140.110.172 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=arm.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=arm.com Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 91335DA7; Wed, 31 Jan 2024 03:53:52 -0800 (PST) Received: from bogus (unknown [10.57.78.35]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id B7BDB3F738; Wed, 31 Jan 2024 03:53:07 -0800 (PST) Date: Wed, 31 Jan 2024 11:53:04 +0000 From: Sudeep Holla To: Ulf Hansson Cc: Cristian Marussi , Sudeep Holla , linux-arm-kernel@lists.infradead.org, linux-pm@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH] pmdomain: arm: Fix NULL dereference on scmi_perf_domain removal Message-ID: <20240131115304.35hmjgq2xbmzw2v4@bogus> References: <20240125191756.868860-1-cristian.marussi@arm.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: On Wed, Jan 31, 2024 at 12:35:56PM +0100, Ulf Hansson wrote: > On Tue, 30 Jan 2024 at 21:07, Cristian Marussi wrote: > > > > On Tue, Jan 30, 2024 at 02:09:20PM +0100, Ulf Hansson wrote: > > > On Thu, 25 Jan 2024 at 20:18, Cristian Marussi wrote: > > > > > > > > On unloading of the scmi_perf_domain module got the below splat, when in > > > > the DT provided to the system under test the '#power-domain-cells' property > > > > was missing. > > > > Indeed, this particular setup causes the probe to bail out early without > > > > giving any error, so that, then, the removal code is run on unload, but > > > > without all the expected initialized structures in place. > > > > > > > > Add a check and bail out early on remove too. > > > > > > Thanks for spotting this! > > > > > > > > > > > Unable to handle kernel NULL pointer dereference at virtual address 0000000000000008 > > > > Mem abort info: > > > > ESR = 0x0000000096000004 > > > > EC = 0x25: DABT (current EL), IL = 32 bits > > > > SET = 0, FnV = 0 > > > > EA = 0, S1PTW = 0 > > > > FSC = 0x04: level 0 translation fault > > > > Data abort info: > > > > ISV = 0, ISS = 0x00000004, ISS2 = 0x00000000 > > > > CM = 0, WnR = 0, TnD = 0, TagAccess = 0 > > > > GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0 > > > > user pgtable: 4k pages, 48-bit VAs, pgdp=00000001076e5000 > > > > [0000000000000008] pgd=0000000000000000, p4d=0000000000000000 > > > > Internal error: Oops: 0000000096000004 [#1] PREEMPT SMP > > > > Modules linked in: scmi_perf_domain(-) scmi_module scmi_core > > > > CPU: 0 PID: 231 Comm: rmmod Not tainted 6.7.0-00084-gb4b1f27d3b83-dirty #15 > > > > Hardware name: linux,dummy-virt (DT) > > > > pstate: 61400005 (nZCv daif +PAN -UAO -TCO +DIT -SSBS BTYPE=--) > > > > pc : scmi_perf_domain_remove+0x28/0x70 [scmi_perf_domain] > > > > lr : scmi_perf_domain_remove+0x28/0x70 [scmi_perf_domain] > > > > sp : ffff80008393bc10 > > > > x29: ffff80008393bc10 x28: ffff0000875a8000 x27: 0000000000000000 > > > > x26: 0000000000000000 x25: 0000000000000000 x24: 0000000000000000 > > > > x23: ffff00008030c090 x22: ffff00008032d490 x21: ffff80007b287050 > > > > x20: 0000000000000000 x19: ffff00008032d410 x18: 0000000000000000 > > > > x17: 0000000000000000 x16: 0000000000000000 x15: 0000000000000000 > > > > x14: 8ba0696d05013a2f x13: 0000000000000000 x12: 0000000000000002 > > > > x11: 0101010101010101 x10: ffff00008510cff8 x9 : ffff800080a6797c > > > > x8 : 0101010101010101 x7 : 7f7f7f7f7f7f7f7f x6 : fefefeff6364626d > > > > x5 : 8080808000000000 x4 : 0000000000000020 x3 : 00000000553a3dc1 > > > > x2 : ffff0000875a8000 x1 : ffff0000875a8000 x0 : ffff800082ffa048 > > > > Call trace: > > > > scmi_perf_domain_remove+0x28/0x70 [scmi_perf_domain] > > > > scmi_dev_remove+0x28/0x40 [scmi_core] > > > > device_remove+0x54/0x90 > > > > device_release_driver_internal+0x1dc/0x240 > > > > driver_detach+0x58/0xa8 > > > > bus_remove_driver+0x78/0x108 > > > > driver_unregister+0x38/0x70 > > > > scmi_driver_unregister+0x28/0x180 [scmi_core] > > > > scmi_perf_domain_driver_exit+0x18/0xb78 [scmi_perf_domain] > > > > __arm64_sys_delete_module+0x1a8/0x2c0 > > > > invoke_syscall+0x50/0x128 > > > > el0_svc_common.constprop.0+0x48/0xf0 > > > > do_el0_svc+0x24/0x38 > > > > el0_svc+0x34/0xb8 > > > > el0t_64_sync_handler+0x100/0x130 > > > > el0t_64_sync+0x190/0x198 > > > > Code: a90153f3 f9403c14 f9414800 955f8a05 (b9400a80) > > > > ---[ end trace 0000000000000000 ]--- > > > > > > > > Cc: Sudeep Holla > > > > Cc: Ulf Hansson > > > > Fixes: 2af23ceb8624 ("pmdomain: arm: Add the SCMI performance domain") > > > > Signed-off-by: Cristian Marussi > > > > --- > > > > I suppose the probe does NOT bail out with an error because this DT config has > > > > to be supported, right ? > > > > > > Actually, no. It's a mistake by me, the probe should bail out with an > > > error code. > > > > > > > Ok. I suppose any old platform like JUNO that missed this will have to > > update their DT to use the new scmi_perf_domain...well it should have > > anyway really, it is just that now it is silently failing. > > I don't think it's failing. The old binding for SCMI perf (using > clock-cells) is still supported the way they were before, which is > only for cpufreq. > > But, yes you are right, both the DT and the consumer driver would need > to be updated to support SCMI perf. > Not sure if you want to flag an error on platforms that doesn't use this. IMO probe succeeding doing nothing seems right. Won't returning the error from probe gets flagged as error during boot or module loading though it is harmless on the platform since it doesn't use it. > In fact, there is also one additional similar problem in probe, when > the number of perf-domains are zero. In that case, we should also > return an error code, rather than returning 0. > > > > > > In fact, there is also one additional similar problem in probe, when > > > the number of perf-domains are zero. In that case, we should also > > > return an error code, rather than returning 0. > > > > > > Would you mind updating the patch to cover both problems - or if you > > > are too busy, just let me know and I can help out. > > > > No problem, I can do it next week, but regarding the zero domain case, > > I remember I used to do the same on regulator/voltage driver and bail out > > when no domains were found, but we were asked by some customer to support > > instead the very useless and funny case of zero domains for some of their > > testing setup scenarios .. i.e. allowing the driver to load with zero domains > > (and do nothing) and then unload cleanly avoiding harms while unloading ...) > > > > Thoughts about this ? Can fix as you prefer . > > In my opinion, there is no point having a module/driver loaded to do > nothing. I would prefer to just return an error code. > IIRC we had this in one of the driver but there was a request to keep it this way as it is useful in SCMI f/w bringup/testing. Not all info/features need to be ready. That said I am fine if pmdomain prefers to flag 0 domains as error. -- Regards, Sudeep