Received: by 2002:a05:6358:d09b:b0:dc:cd0c:909e with SMTP id jc27csp15181186rwb; Mon, 28 Nov 2022 08:46:26 -0800 (PST) X-Google-Smtp-Source: AA0mqf4sqiZ+qjhv+qJnfUh7BvIicWdyrUNRn4+Hklj/ARsiG1a7FVHO2t8P3t92QBaUE+0xX2Qx X-Received: by 2002:a05:6402:528d:b0:468:dc9:ec08 with SMTP id en13-20020a056402528d00b004680dc9ec08mr33286899edb.17.1669653986651; Mon, 28 Nov 2022 08:46:26 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1669653986; cv=none; d=google.com; s=arc-20160816; b=YfEEGrPFY+Lr/9DqsexDMor2Exv8wOsGvbdEyMXJv9cFZF88vwRd792XnZGwvr4TPA eeLs54kED4LMiYuS0B5EuFLWYZ+3MrLbIIr4M8l/6J06ujnxu1vfdpdIOrYfjACU5DpB 8LGeUvbNd7dwG+kY2+9+dRr9C0bBNs3HneE+nxe2DEQ5vPMIwOdhSYU+WvuVM5WU/o19 PJFnXGZY8euvIA2wUHnA7vTstkPjqAXdBKNnGGYfr1T2FZBOBycsI9ptj4PQr4e9Tlqd SWrxWy37KsdSKmYucJTuo3ewSEkladMCbRoIpHhM8ylKP2Yuzsgqv9WNRTZespSy41qg iDNQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date:dkim-signature; bh=TEFyTvkkLzJNxzZCsVqvDnbISJ8MmcrWuphUWeVd8Hs=; b=kVoLLOxOgu83k9at6et/1xB8xkNNJQbiSOsKN7wvcq70sdZObB9S2FIo0m2KTRdmxn w5oCnJu9625+bKPDe1wW56gwQiraO+ZyKa5kbmB2mx6xE/vFV/cLqCQV/OkgzJRY6/e6 yF32mFPp9rO7XL7jlfA4tlwdxQLT6reWhlHiNMKCy4Cg+ONHPOk+fxm33+UihKShzhe+ k1W24++yRcDSmCMkBfXq+nS/3p1FnzvPW0SV8nAT1sKIGz+PT/Seg2DVjf+q1hvwsh9Z c8MwiyDym2fkBELdL0I8Ojw7gTwnr8IhttyVQ6ULm+gew2Cdk6PnbvBpXvGWns5DwdlJ Mldg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=gwI8U7rZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id br13-20020a170906d14d00b00783c42d2da1si8310774ejb.425.2022.11.28.08.46.04; Mon, 28 Nov 2022 08:46:26 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=gwI8U7rZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232698AbiK1Q3z (ORCPT + 85 others); Mon, 28 Nov 2022 11:29:55 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:57362 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232071AbiK1Q3y (ORCPT ); Mon, 28 Nov 2022 11:29:54 -0500 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D9F871F9F9 for ; Mon, 28 Nov 2022 08:29:04 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1669652944; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=TEFyTvkkLzJNxzZCsVqvDnbISJ8MmcrWuphUWeVd8Hs=; b=gwI8U7rZdj5rRPz05Zi/DawgM79QzMjKdHjJ8L0+ZS5ONDRB0xVe9baonjJbiBGx/M7ZRv NIHqG3EaQOG1ZcbLhnRWE5bkH2EKjFfGApzjflusG1PTxEmlR4gUJG6OvnKx39XyY6qTpG EpdCIj+d8TxviWjbNpS0BSDkyQcucsI= Received: from mimecast-mx02.redhat.com (mx3-rdu2.redhat.com [66.187.233.73]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-428-as453O6IOWKccPlGmCpSRg-1; Mon, 28 Nov 2022 11:29:02 -0500 X-MC-Unique: as453O6IOWKccPlGmCpSRg-1 Received: from smtp.corp.redhat.com (int-mx06.intmail.prod.int.rdu2.redhat.com [10.11.54.6]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id 91CD41C0515F; Mon, 28 Nov 2022 16:29:01 +0000 (UTC) Received: from rotkaeppchen (unknown [10.39.195.61]) by smtp.corp.redhat.com (Postfix) with ESMTP id CF9F22166B2E; Mon, 28 Nov 2022 16:28:59 +0000 (UTC) Date: Mon, 28 Nov 2022 17:28:55 +0100 From: Philipp Rudo To: Ricardo Ribalda Cc: Eric Biederman , Jonathan Corbet , Sergey Senozhatsky , linux-kernel@vger.kernel.org, kexec@lists.infradead.org, Ross Zwisler , linux-doc@vger.kernel.org, "Joel Fernandes (Google)" , Steven Rostedt Subject: Re: [PATCH v1 2/2] kexec: Introduce kexec_reboot_disabled Message-ID: <20221128172347.629a75c2@rotkaeppchen> In-Reply-To: References: <20221114-disable-kexec-reset-v1-0-fb51d20cf871@chromium.org> <20221114-disable-kexec-reset-v1-2-fb51d20cf871@chromium.org> <20221117160650.16e06b37@rotkaeppchen> <20221121150948.6f7c1f1f@rotkaeppchen> <20221124124000.5af23cad@rotkaeppchen> <20221124160115.23ae7928@rotkaeppchen> Organization: Red Hat inc. MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 3.1 on 10.11.54.6 X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2,SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Ricardo, On Thu, 24 Nov 2022 23:32:34 +0100 Ricardo Ribalda wrote: > Hi Philipp > > > On Thu, 24 Nov 2022 at 16:01, Philipp Rudo wrote: > > > > On Thu, 24 Nov 2022 13:52:58 +0100 > > Ricardo Ribalda wrote: > > > > > On Thu, 24 Nov 2022 at 12:40, Philipp Rudo wrote: > > > > > > > > Hi Ricardo, > > > > > > > > On Wed, 23 Nov 2022 09:58:08 +0100 > > > > Ricardo Ribalda wrote: > > > > > > > > > Hi Philipp > > > > > > > > > > Thanks for your review. > > > > > > > > > > My scenario is a trusted system, where even if you are root, your > > > > > access to the system is very limited. > > > > > > > > > > Let's assume LOADPIN and verity are enabled. > > > > > > > > My point is that on such systems I expect that a sysadmin also wants to > > > > control the crash kernel including its initramfs (which also has to be part > > > > of the signed kernel?). But if that's the case a sysadmin can simply arm > > > > kdump early during boot and then toggle kexec_load_disabled. With that > > > > LINUX_REBOOT_CMD_KEXEC also gets disabled as no kexec kernel can be loaded > > > > while kdump works. Thus there is no need to add the new interface. Or am > > > > I missing anything? > > > > > > Let's say that you have a script that does something like this > > > > > > > > > kexec -p dump_kernel > > > echo 1 > /proc/sys/kernel/kexec_load_disabled > > > > > > If an attacker can DDos the system and make that script crash... then > > > kexec is still accessible > > > > > > On the other hand, if you load the kernel with the commandline > > > > > > sysctl.kernel.kexec_load_disabled=1 > > ^^^^ > > reboot? > > yes :) thanks! > > > Otherwise you shouldn't be able to load the crash kernel at all. > > > > > Then even if the script crashes, the only way to abuse kexec is by > > > panicing the running kernel.... > > > > True. But when an attacker can DDos the system the final workload is > > already running. So wouldn't it be enough to make sure that the script > > above has finished before starting you workload. E.g. by setting an > > appropriate Before=/After= in the systemd.unit? > > What if the kexec binary crashes and the unit will never succeed? Then there are options like OnFailure= or FailureAction= the sysadmin can use do what ever he seems appropriate. > Or worse, your distro does not use systemd !!! In that case there are other ways to achieve the same. The two options are only examples. Why can't this be used as a replacement? > > Furthermore, I don't think that restricting kexec reboot alone is > > sufficient when the attacker can still control the crash kernel. At > > least my assumption is that triggering a panic instead of just > > rebooting is just a mild inconvenience for somebody who is able to pull > > off an attack like that. > > The attacker does not control the crash kernel completely. loadpin is > still in place. > Yes, they can downgrade the whole system to a vulnerable kernel image. > But the choices are limited :) > > With physical access to the device panicing a kernel is easily doable > (but not trivial). But remotely, it is more challenging. Well the same holds for kexec. So the only difference is triggering the panic where I'm still not convinced it's a huge obstacle for someone who is able to pull off all the steps before for such an attack. To be honest I don't think we make a progress here at the moment. I would like to hear from others what they think about this. Thanks Philipp > > > > > > Would it make you more comfortable if I model this as a kernel config > > > instead of a runtime option? > > > > No, I think the implementation is fine. I'm currently only struggling > > to understand what problem kexec_reboot_disabled solves that cannot be > > solved by kexec_load_disabled. > > > > > Thanks! > > > > Happy to help. > > > > Thanks > > Philipp > > > > > > > > > > > > > > > > Thanks > > > > Philipp > > > > > > > > > > > > > > On Mon, 21 Nov 2022 at 15:10, Philipp Rudo wrote: > > > > > > > > > > > > Hi Ricardo, > > > > > > > > > > > > On Thu, 17 Nov 2022 16:15:07 +0100 > > > > > > Ricardo Ribalda wrote: > > > > > > > > > > > > > Hi Philipp > > > > > > > > > > > > > > Thanks for your review! > > > > > > > > > > > > happy to help. > > > > > > > > > > > > > > > > > > > > On Thu, 17 Nov 2022 at 16:07, Philipp Rudo wrote: > > > > > > > > > > > > > > > > Hi Ricardo, > > > > > > > > > > > > > > > > all in all I think this patch makes sense. However, there is one point > > > > > > > > I don't like... > > > > > > > > > > > > > > > > On Mon, 14 Nov 2022 14:18:39 +0100 > > > > > > > > Ricardo Ribalda wrote: > > > > > > > > > > > > > > > > > Create a new toogle that disables LINUX_REBOOT_CMD_KEXEC, reducing the > > > > > > > > > attack surface to a system. > > > > > > > > > > > > > > > > > > Without this toogle, an attacker can only reboot into a different kernel > > > > > > > > > if they can create a panic(). > > > > > > > > > > > > > > > > > > Signed-off-by: Ricardo Ribalda > > > > > > > > > > > > > > > > > > diff --git a/Documentation/admin-guide/sysctl/kernel.rst b/Documentation/admin-guide/sysctl/kernel.rst > > > > > > > > > index 97394bd9d065..25d019682d33 100644 > > > > > > > > > --- a/Documentation/admin-guide/sysctl/kernel.rst > > > > > > > > > +++ b/Documentation/admin-guide/sysctl/kernel.rst > > > > > > > > > @@ -462,6 +462,17 @@ altered. > > > > > > > > > Generally used together with the `modules_disabled`_ sysctl. > > > > > > > > > > > > > > > > > > > > > > > > > > > +kexec_reboot_disabled > > > > > > > > > +===================== > > > > > > > > > + > > > > > > > > > +A toggle indicating if ``LINUX_REBOOT_CMD_KEXEC`` has been disabled. > > > > > > > > > +This value defaults to 0 (false: ``LINUX_REBOOT_CMD_KEXEC`` enabled), > > > > > > > > > +but can be set to 1 (true: ``LINUX_REBOOT_CMD_KEXEC`` disabled). > > > > > > > > > +Once true, kexec can no longer be used for reboot and the toggle > > > > > > > > > +cannot be set back to false. > > > > > > > > > +This toggle does not affect the use of kexec during a crash. > > > > > > > > > + > > > > > > > > > + > > > > > > > > > kptr_restrict > > > > > > > > > ============= > > > > > > > > > > > > > > > > > > diff --git a/include/linux/kexec.h b/include/linux/kexec.h > > > > > > > > > index 41a686996aaa..15c3fad8918b 100644 > > > > > > > > > --- a/include/linux/kexec.h > > > > > > > > > +++ b/include/linux/kexec.h > > > > > > > > > @@ -407,6 +407,7 @@ extern int kimage_crash_copy_vmcoreinfo(struct kimage *image); > > > > > > > > > extern struct kimage *kexec_image; > > > > > > > > > extern struct kimage *kexec_crash_image; > > > > > > > > > extern int kexec_load_disabled; > > > > > > > > > +extern int kexec_reboot_disabled; > > > > > > > > > > > > > > > > > > #ifndef kexec_flush_icache_page > > > > > > > > > #define kexec_flush_icache_page(page) > > > > > > > > > diff --git a/kernel/kexec.c b/kernel/kexec.c > > > > > > > > > index cb8e6e6f983c..43063f803d81 100644 > > > > > > > > > --- a/kernel/kexec.c > > > > > > > > > +++ b/kernel/kexec.c > > > > > > > > > @@ -196,6 +196,10 @@ static inline int kexec_load_check(unsigned long nr_segments, > > > > > > > > > if (!capable(CAP_SYS_BOOT) || kexec_load_disabled) > > > > > > > > > return -EPERM; > > > > > > > > > > > > > > > > > > + /* Check if the system admin has disabled kexec reboot. */ > > > > > > > > > + if (!(flags & KEXEC_ON_CRASH) && kexec_reboot_disabled) > > > > > > > > > + return -EPERM; > > > > > > > > > > > > > > > > ... Allowing to load a crashkernel doesn't make sense in my opinion. If > > > > > > > > an attacker is capable of creating a malicious kernel, planting it on > > > > > > > > the victims system and then find a way to boot it via kexec this > > > > > > > > attacker also knows how to load the malicious kernel as crashkernel and > > > > > > > > trigger a panic. So you haven't really gained anything. That's why I > > > > > > > > would simply drop this hunk (and the corresponding one from > > > > > > > > kexec_file_load) and let users who worry about this use a combination of > > > > > > > > kexec_load_disabled and kexec_reboot_disabled. > > > > > > > > > > > > > > If for whatever reason your sysadmin configured kexec_reboot_disabed > > > > > > > it can be nice that when a user try to load it they get a warning. > > > > > > > It is easier to debug than waiting two steps later when they run kexec -e.... > > > > > > > > > > > > I'm having second thoughts about this patch. My main problem is that I > > > > > > don't see a real use case where kexec_reboot_disabled is advantageous > > > > > > over kexec_load_disabled. The point is that disabling > > > > > > LINUX_REBOOT_CMD_KEXEC is almost identical to toggling kexec_load_disabled without > > > > > > a loaded kernel (when you don't have a kernel loaded you cannot reboot > > > > > > into it). With this the main use case of kexec_reboot_disabled is > > > > > > already covered by kexec_load_disabled. > > > > > > > > > > > > > > > > > However, there are two differences > > > > > > > > > > > > 1) with kexec_reboot_disable you can still (re-)load a crash kernel > > > > > > e.g. to update the initramfs after a config change. But as discussed in > > > > > > my first mail this comes on the cost that an attacker could still load a > > > > > > malicious crash kernel and then 'panic into it'. > > > > > > > > > > That crash kernel must be already in the signed malicious kernel. > > > > > which reduces the chances of attack. > > > > > Plus an attacker must be able to panic the current kernel at will, > > > > > instead of just call reset. > > > > > > > > > > > > > > > > > 2) kexec_load_disabled also prevents unloading of a loaded kernel. So > > > > > > once loaded kexec_load_disabled cannot prevent the reboot into this > > > > > > kernel. > > > > > > > > > > > > > > > > > > For 1) I doubt that this is desired at all. My expectation is that on > > > > > > systems where a sysadmin restricts a user to reboot via kexec the > > > > > > sysadmin also wants to prevent the user to load an arbitrary crash > > > > > > kernel. Especially as this still keeps the loophole open you are trying > > > > > > to close. > > > > > > > > > > > > So only 2) is left as real benefit. But that is an extremely specific > > > > > > scenario. How often does this scenario happen in real life? What > > > > > > problem does kexec_reboot_disable solve different implementation > > > > > > (also in userspace) cannot? > > > > > > > > > > > > Sorry about being this pedantic but you want to introduce some new uapi > > > > > > which will be hard if not impossible to change once introduced. That's > > > > > > why I want to be a 100% sure it is really needed. > > > > > > > > > > No worries. Completely understand :). Thanks for taking this seriously.. > > > > > > > > > > > > > > > Best regards! > > > > > > > > > > > > Thanks > > > > > > Philipp > > > > > > > > > > > > > > > > > > > That is why I added it. But i am also ok removing it > > > > > > > > > > > > > > > > > > > > > > > Thanks > > > > > > > > Philipp > > > > > > > > > > > > > > > > > + > > > > > > > > > /* Permit LSMs and IMA to fail the kexec */ > > > > > > > > > result = security_kernel_load_data(LOADING_KEXEC_IMAGE, false); > > > > > > > > > if (result < 0) > > > > > > > > > diff --git a/kernel/kexec_core.c b/kernel/kexec_core.c > > > > > > > > > index ca2743f9c634..fe82e2525705 100644 > > > > > > > > > --- a/kernel/kexec_core.c > > > > > > > > > +++ b/kernel/kexec_core.c > > > > > > > > > @@ -929,6 +929,7 @@ int kimage_load_segment(struct kimage *image, > > > > > > > > > struct kimage *kexec_image; > > > > > > > > > struct kimage *kexec_crash_image; > > > > > > > > > int kexec_load_disabled; > > > > > > > > > +int kexec_reboot_disabled; > > > > > > > > > #ifdef CONFIG_SYSCTL > > > > > > > > > static struct ctl_table kexec_core_sysctls[] = { > > > > > > > > > { > > > > > > > > > @@ -941,6 +942,16 @@ static struct ctl_table kexec_core_sysctls[] = { > > > > > > > > > .extra1 = SYSCTL_ONE, > > > > > > > > > .extra2 = SYSCTL_ONE, > > > > > > > > > }, > > > > > > > > > + { > > > > > > > > > + .procname = "kexec_reboot_disabled", > > > > > > > > > + .data = &kexec_reboot_disabled, > > > > > > > > > + .maxlen = sizeof(int), > > > > > > > > > + .mode = 0644, > > > > > > > > > + /* only handle a transition from default "0" to "1" */ > > > > > > > > > + .proc_handler = proc_dointvec_minmax, > > > > > > > > > + .extra1 = SYSCTL_ONE, > > > > > > > > > + .extra2 = SYSCTL_ONE, > > > > > > > > > + }, > > > > > > > > > { } > > > > > > > > > }; > > > > > > > > > > > > > > > > > > @@ -1138,7 +1149,7 @@ int kernel_kexec(void) > > > > > > > > > > > > > > > > > > if (!kexec_trylock()) > > > > > > > > > return -EBUSY; > > > > > > > > > - if (!kexec_image) { > > > > > > > > > + if (!kexec_image || kexec_reboot_disabled) { > > > > > > > > > error = -EINVAL; > > > > > > > > > goto Unlock; > > > > > > > > > } > > > > > > > > > diff --git a/kernel/kexec_file.c b/kernel/kexec_file.c > > > > > > > > > index 45637511e0de..583fba6de5cb 100644 > > > > > > > > > --- a/kernel/kexec_file.c > > > > > > > > > +++ b/kernel/kexec_file.c > > > > > > > > > @@ -333,6 +333,11 @@ SYSCALL_DEFINE5(kexec_file_load, int, kernel_fd, int, initrd_fd, > > > > > > > > > if (!capable(CAP_SYS_BOOT) || kexec_load_disabled) > > > > > > > > > return -EPERM; > > > > > > > > > > > > > > > > > > + /* Check if the system admin has disabled kexec reboot. */ > > > > > > > > > + if (!(flags & (KEXEC_FILE_ON_CRASH | KEXEC_FILE_UNLOAD)) > > > > > > > > > + && kexec_reboot_disabled) > > > > > > > > > + return -EPERM; > > > > > > > > > + > > > > > > > > > /* Make sure we have a legal set of flags */ > > > > > > > > > if (flags != (flags & KEXEC_FILE_FLAGS)) > > > > > > > > > return -EINVAL; > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > >