Date: Mon, 30 Jan 2023 15:09:41 +0000
From: Conor Dooley
To: Changbin Du
CC: Paul Walmsley, Palmer Dabbelt, Albert Ou, Hui Wang, Changbin Du, Guo Ren, Zong Li
Subject: Re: [PATCH v3] riscv: patch: Fixup lockdep warning in stop_machine
References: <20230130232659.3374212-1-changbin.du@huawei.com>
In-Reply-To: <20230130232659.3374212-1-changbin.du@huawei.com>
X-Mailing-List: linux-kernel@vger.kernel.org

Hey Changbin,

On Tue, Jan 31, 2023 at 07:26:59AM +0800, Changbin Du wrote:
> From: Changbin Du
>
> The task of ftrace_arch_code_modify(_post)_prepare() caller is
> stop_machine, whose caller and work thread are of different tasks. The
> lockdep checker needs the same task context, or it's wrong. That means
> it's a bug here to use lockdep_assert_held because we don't guarantee
> the same task context.
>
> kernel/locking/lockdep.c:
> int __lock_is_held(const struct lockdep_map *lock, int read)
> {
>         struct task_struct *curr = current;
>         int i;
>
>         for (i = 0; i < curr->lockdep_depth; i++) {
>                         ^^^^^^^^^^^^^^^^^^^
>                 struct held_lock *hlock = curr->held_locks + i;
>                                           ^^^^^^^^^^^^^^^^
>                 if (match_held_lock(hlock, lock)) {
>                         if (read == -1 || !!hlock->read == read)
>                                 return LOCK_STATE_HELD;
>
> The __lock_is_held depends on current held_locks records; if
> stop_machine makes the checker running on another task, that's wrong.
>
> Here is the log:
> [ 15.761523] ------------[ cut here ]------------
> [ 15.762125] WARNING: CPU: 0 PID: 15 at arch/riscv/kernel/patch.c:63 patch_insn_write+0x72/0x364
> [ 15.763258] Modules linked in:
> [ 15.764154] CPU: 0 PID: 15 Comm: migration/0 Not tainted 6.1.0-rc1-00014-g66924be85884-dirty #377
> [ 15.765339] Hardware name: riscv-virtio,qemu (DT)
> [ 15.765985] Stopper: multi_cpu_stop+0x0/0x192 <- stop_cpus.constprop.0+0x90/0xe2
> [ 15.766711] epc : patch_insn_write+0x72/0x364
> [ 15.767011] ra : patch_insn_write+0x70/0x364
> [ 15.767276] epc : ffffffff8000721e ra : ffffffff8000721c sp : ff2000000067bca0
> [ 15.767622] gp : ffffffff81603f90 tp : ff60000002432a00 t0 : 7300000000000000
> [ 15.767919] t1 : 0000000000000000 t2 : 73695f6b636f6c5f s0 : ff2000000067bcf0
> [ 15.768238] s1 : 0000000000000008 a0 : 0000000000000000 a1 : 0000000000000000
> [ 15.768537] a2 : 0000000000000000 a3 : 0000000000000000 a4 : 0000000000000000
> [ 15.768837] a5 : 0000000000000000 a6 : 0000000000000000 a7 : 0000000000000000
> [ 15.769139] s2 : ffffffff80009faa s3 : ff2000000067bd10 s4 : ffffffffffffffff
> [ 15.769447] s5 : 0000000000000001 s6 : 0000000000000001 s7 : 0000000000000003
> [ 15.769740] s8 : 0000000000000002 s9 : 0000000000000004 s10: 0000000000000003
> [ 15.770027] s11: 0000000000000002 t3 : 0000000000000000 t4 : ffffffff819af097
> [ 15.770323] t5 : ffffffff819af098 t6 : ff2000000067ba28
> [ 15.770574] status: 0000000200000100 badaddr: 0000000000000000 cause: 0000000000000003
> [ 15.771102] [] patch_text_nosync+0x10/0x3a
> [ 15.771421] [] ftrace_update_ftrace_func+0x74/0x10a
> [ 15.771704] [] ftrace_modify_all_code+0xb0/0x16c
> [ 15.771958] [] __ftrace_modify_code+0x12/0x1c
> [ 15.772196] [] multi_cpu_stop+0x14a/0x192
> [ 15.772454] [] cpu_stopper_thread+0x96/0x14c
> [ 15.772699] [] smpboot_thread_fn+0xf8/0x1cc
> [ 15.772945] [] kthread+0xe2/0xf8
> [ 15.773160] [] ret_from_exception+0x0/0x14
> [ 15.773471] ---[ end trace 0000000000000000 ]---

FWIW, you can always crop the [15.192321] stuff out of commit messages,
as it just adds noise.

> By the way, this also fixes the same issue for patch_text().
>
> Fixes: 0ff7c3b33127 ("riscv: Use text_mutex instead of patch_lock")
> Co-developed-by: Guo Ren
> Signed-off-by: Guo Ren
> Cc: Zong Li
> Cc: Palmer Dabbelt
> Signed-off-by: Changbin Du
> ---
> Changes in v3:
>  - denote this also fixes function patch_text().
>
> Changes in v2:
>  - Rewrite commit log with lockdep explanation [Guo Ren]
>  - Rebase on v6.1 [Guo Ren]
>
> v1:
> https://lore.kernel.org/linux-riscv/20210417023532.354714-1-changbin.du@gmail.com/
> ---
>  arch/riscv/kernel/patch.c | 7 -------
>  1 file changed, 7 deletions(-)
>
> diff --git a/arch/riscv/kernel/patch.c b/arch/riscv/kernel/patch.c
> index 765004b60513..8619706f8dfd 100644
> --- a/arch/riscv/kernel/patch.c
> +++ b/arch/riscv/kernel/patch.c
> @@ -55,13 +55,6 @@ static int patch_insn_write(void *addr, const void *insn, size_t len)
>  	bool across_pages = (((uintptr_t) addr & ~PAGE_MASK) + len) > PAGE_SIZE;
>  	int ret;
>
> -	/*
> -	 * Before reaching here, it was expected to lock the text_mutex
> -	 * already, so we don't need to give another lock here and could
> -	 * ensure that it was safe between each cores.
> -	 */
> -	lockdep_assert_held(&text_mutex);

I must admit, patches like this do concern me a little, as someone
unfamiliar with the world of probing and tracing. Seeing an explicit
check that the lock was held leads me to believe that the original
author (Zong Li I think) thought that the text_mutex lock was
insufficient. Do you think that their fear is unfounded? Explaining why
it is safe to remove this assertion in the commit message would go a
long way towards easing my anxiety!

Also, why delete the comment altogether? The comment provides some
information that doesn't appear to become invalid, even with the
assertion removed?

Thanks,
Conor.

> -
>  	if (across_pages)
>  		patch_map(addr + len, FIX_TEXT_POKE1);
>
> --
> 2.25.1
>
>
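
FWIW, the per-task behaviour described in the quoted lockdep explanation
can be mimicked in userspace. What follows is only an analogy sketched
under assumptions, not kernel code: tracked_lock(), tracked_unlock(),
tracked_is_held() and the two held_as_seen_by_*() helpers are
hypothetical names, a pthread mutex stands in for text_mutex, and a
thread-local array stands in for current->held_locks. The second thread
plays the role of the stopper thread that stop_machine runs the callback
on.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_HELD 8

/* Per-thread record of held locks, standing in for current->held_locks. */
static _Thread_local const void *held_locks[MAX_HELD];
static _Thread_local int held_depth;

/* Stand-in for text_mutex (assumption: any mutex shows the same effect). */
static pthread_mutex_t text_lock = PTHREAD_MUTEX_INITIALIZER;

/* Take the mutex and record it in the calling thread's list. */
static void tracked_lock(pthread_mutex_t *m)
{
    pthread_mutex_lock(m);
    assert(held_depth < MAX_HELD);
    held_locks[held_depth++] = m;
}

/* Drop the record and release the mutex; sketch assumes LIFO unlock order. */
static void tracked_unlock(pthread_mutex_t *m)
{
    assert(held_depth > 0 && held_locks[held_depth - 1] == m);
    held_depth--;
    pthread_mutex_unlock(m);
}

/* Like __lock_is_held(): consults only the calling thread's records. */
static bool tracked_is_held(const pthread_mutex_t *m)
{
    for (int i = 0; i < held_depth; i++)
        if (held_locks[i] == m)
            return true;
    return false;
}

/* Runs on the second thread and reports what the check sees there. */
static void *worker(void *result)
{
    *(bool *)result = tracked_is_held(&text_lock);
    return NULL;
}

/* The owner takes the lock and performs the check itself. */
bool held_as_seen_by_owner(void)
{
    tracked_lock(&text_lock);
    bool seen = tracked_is_held(&text_lock);
    tracked_unlock(&text_lock);
    return seen;
}

/* The owner takes the lock, but a different thread performs the check. */
bool held_as_seen_by_worker(void)
{
    pthread_t t;
    bool seen = true;

    tracked_lock(&text_lock);
    pthread_create(&t, NULL, worker, &seen);
    pthread_join(t, NULL);
    tracked_unlock(&text_lock);
    return seen;
}
```

In this sketch held_as_seen_by_owner() reports the lock as held, while
held_as_seen_by_worker() reports it as not held even though the mutex is
locked for the whole lifetime of the worker. That is the shape of the
false positive described in the quoted commit message; it says nothing by
itself about whether the real callers do hold text_mutex.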