Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 22569C433FE for ; Mon, 6 Dec 2021 05:42:31 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237360AbhLFFp5 (ORCPT ); Mon, 6 Dec 2021 00:45:57 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37452 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229448AbhLFFp4 (ORCPT ); Mon, 6 Dec 2021 00:45:56 -0500 Received: from mail-io1-xd31.google.com (mail-io1-xd31.google.com [IPv6:2607:f8b0:4864:20::d31]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 068C2C061354 for ; Sun, 5 Dec 2021 21:42:28 -0800 (PST) Received: by mail-io1-xd31.google.com with SMTP id k21so11582916ioh.4 for ; Sun, 05 Dec 2021 21:42:28 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=chromium.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=iEZ5nj17qGAGUubbHNSG6UOoRRyHT8z0s5qC8Ef+yZU=; b=SDz6TSIyiLDUUQwJZPlKIGGwKJKb2d+eHmzwo5iVAn58dF0KUIWCViVQCAvrB7XtfG utCJNlX/nfcAcuFUZTy+bl0ekteXOO4kDYbBwQKR3Lp4z0oQ/IYVMdvIPiMq08D9wMX3 aRj0qJjivdRFQOmACUODtDicv0LsPPH7N+WKo= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=iEZ5nj17qGAGUubbHNSG6UOoRRyHT8z0s5qC8Ef+yZU=; b=2GYp7f9kFRX6QhO8VsEPcKOkWaxzhQ+7LeBM6/8PA3n29MMNLBEZoe4w7UpdrqmUKQ 0fgKC57rgVaqqds92B8qo7smKxI6qmbax5CGARxjWtfRd23ER4F/hOdlvbtaww6jOubh 8vl3xSGoieFZ2JvCGoSbOtYylnqS6aql34T5YQBX8dnSLJGBmVWifIx754yunemo+p1r mSEcnMsJYQvlnYpLhAHs5Bsxsxs0I2jIG1tMOb+OGD8pv2CAywPhOCNIbNAa1nQeIlRJ kgUmBDvrGVDAPM5Aeb79gQnNuC8YMQPA+ijguSI32gJdX3d8IhBYW5t5k3KgeP+EHWY9 FGyw== X-Gm-Message-State: AOAM530/7E6W2HSg1EJtf1RnDhbhtR4t1h9YzsOl0NU2h6c0SBFhj9pa 6fkNMbyRj6dvmRJ/eowvmDJrw5l3kVZSBme66JwYZ5WAQr1XqQ== X-Google-Smtp-Source: ABdhPJxuyhgzjHDgkXVztn7kvt1I/DHsv8mtGgJSRmE/6rKC/T6IgkKmSrI3iupiIVvuy/3xteP1dqvz0qQV4Weiric= X-Received: by 2002:a05:6602:2e81:: with SMTP id m1mr29550143iow.55.1638769347298; Sun, 05 Dec 2021 21:42:27 -0800 (PST) MIME-Version: 1.0 References: <20211012082708.121931-1-iivanov@suse.de> In-Reply-To: From: Hsin-Yi Wang Date: Mon, 6 Dec 2021 13:42:01 +0800 Message-ID: Subject: Re: [PATCH v4] random: fix crash on multiple early calls to add_bootloader_randomness() To: Dominik Brodowski Cc: "Jason A. Donenfeld" , "Theodore Ts'o" , "Ivan T. Ivanov" , Ard Biesheuvel , linux-efi@vger.kernel.org, LKML Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Dec 3, 2021 at 3:59 PM Dominik Brodowski wrote: > > Hi Jason, > > Am Thu, Dec 02, 2021 at 11:55:10AM -0500 schrieb Jason A. Donenfeld: > > Thanks for the patch. One trivial nit and one question: > > Thanks for your review! > > > On Thu, Dec 2, 2021 at 6:35 AM Dominik Brodowski > > wrote: > > > + /* We cannot do much with the input pool until it is set up in > > > + * rand_initalize(); therefore just mix into the crng state. > > > > I think you meant "rand_initialize()" here (missing 'i'). > > Indeed, sorry about that. > > > > If the added entropy suffices to increase crng_init to 1, future calls > > > to add_bootloader_randomness() or add_hwgenerator_randomness() used to > > > progress to credit_entropy_bits(). However, if the input pool is not yet > > > properly set up, the cmpxchg call within that function can lead to an > > > infinite recursion. > > > > I see what this patch does with crng_global_init_time, and that seems > > probably sensible, but I didn't understand this part of the reasoning > > in the commit message; I might just be a bit slow here. Where's the > > recursion exactly? Or even an infinite loop? > > On arm64, it was actually a NULL pointer dereference reported by Ivan T. > Ivanov; see > > https://lore.kernel.org/lkml/20211012082708.121931-1-iivanov@suse.de/ > > Trying to reproduce this rather bluntly on x86/qemu by multiple manual calls > to add_bootloader_randomness(), I mis-interpreted the symptoms to point to an > infinite recursion. The real problem seems to be that crng_reseed() isn't > ready to be called too early in the boot process, in particular before > workqueues are ready (see the call to numa_crng_init()). > > However, there seem be additional issues with add_bootloader_randomness() > not yet addressed (or worsened) by my patch: > > - If CONFIG_RANDOM_TRUST_BOOTLOADER is enabled and crng_init==0, > add_hwgenerator_randomness() calls crng_fast_load() and returns > immediately. If it is disabled and crng_init==0, > add_device_randnomness() calls crng_slow_load() but still > continues to call _mix_pool_bytes(). That means the seed is > used more extensively if CONFIG_RANDOM_TRUST_BOOTLOADER is not > set! If called by the crng_slow_load(), it's mixed into the pool but we're not trusting it. But in crng_fast_load() we're using it to init crng. > > - If CONFIG_RANDOM_TRUST_BOOTLOADER is enabled and crng_init==0, > the entropy is not credited -- same as if > CONFIG_RANDOM_TRUST_BOOTLOADER is not set. Only subsequent calls In crng_fast_load(), the seed would be mixed to primary_crng.state[4], and then crng_init will be 1 if the added seed is enough. rng-seed in dt (called in early_init_dt_scan_chosen()) also needs to use this function to init crng. With the patch, we're seeing [ 0.000000] random: get_random_u64 called from __kmem_cache_create+0x34/0x270 with crng_init=0 While before it should be [ 0.000000] random: get_random_u64 called from __kmem_cache_create+0x34/0x280 with crng_init=1 > to add_bootloader_randomness() would credit entropy, but that > causes the issue NULL pointer dereference or the hang... > > - As crng_fast_load() returns early, that actually means that my > patch causes the additional entropy submitted to > add_hwgenerator_randomness() by subsequent calls to be completely > lost. Only when crng_init==0, if crng is initialized, it would continue with credit_entropy_bits(). > > - For add_bootloader_randomness(), it makes no sense at all to call > wait_event_interruptible(). > > Therefore, it might make more sense to > > - modify add_bootloader_randomness() to always call > add_device_randomness(), and if CONFIG_RANDOM_TRUST_BOOTLOADER is > enabled, to call credit_entropy_bits(). > > - update credit_entropy_bits() to not call credit_entropy_bits() > if crng_global_init_time==0, as workqueues (and possibly other > infrastructure) might not be available at that time. > > What do you think? Draft patch below. @Ivan: Could you re-test on your > system, please? > > Thanks, > Dominik > > --- > > Currently, if CONFIG_RANDOM_TRUST_BOOTLOADER is enabled, mutliple calls > to add_bootloader_randomness() are broken and can cause a NULL pointer > dereference, as noted by Ivan T. Ivanov. This is not only a hypothetical > problem, as qemu on arm64 may provide bootloader entropy via EFI and via > devicetree. > > On the first call to add_hwgenerator_randomness(), crng_fast_load() is > executed, and if the seed is long enough, crng_init will be set to 1. > However, no entropy is currently credited for that, even though the > name and description of CONFIG_RANDOM_TRUST_BOOTLOADER states otherwise. > > On subsequent calls to add_bootloader_randomness() and then to > add_hwgenerator_randomness(), crng_fast_load() will be skipped. Instead, > wait_event_interruptible() (which makes no sense for the init process) > and then credit_entropy_bits() will be called. If the entropy count for > that second seed is large enough, that proceeds to crng_reseed(). > However, crng_reseed() may depend on workqueues being available, which > is not the case early during boot. > > To fix these issues, unconditionally call add_device_randomness() but not > add_hwgenerator_randomness() in add_bootloader_randomness(). This has the > additional advantage that the seed provided by the first call to > add_bootloader_randomness() is not only used by crng_{fast,slow}_load(), > but also mixed into the input pool. If CONFIG_RANDOM_TRUST_BOOTLOADER is > set, explicitly credit the entropy. However, avoid a call to crng_reseed() > too early during boot. It is safe to be called after rand_initialize(), > so use crng_global_init_time (which is set to != 0 in that function) to > determine which branch to take. > > Reported-by: Ivan T. Ivanov > Fixes: 18b915ac6b0a ("efi/random: Treat EFI_RNG_PROTOCOL output as bootloader randomness") > Signed-off-by: Dominik Brodowski > > --- > v3->v4: complete rewrite > v2->v3: only one unlikely (Ard Biesheuvel) > v1->v2: fix commit message; unmerge Reported-and-tested-by-tag (Ard Biesheuvel) > > > diff --git a/drivers/char/random.c b/drivers/char/random.c > index 605969ed0f96..d8614b426dfb 100644 > --- a/drivers/char/random.c > +++ b/drivers/char/random.c > @@ -722,7 +722,8 @@ static void credit_entropy_bits(struct entropy_store *r, int nbits) > if (r == &input_pool) { > int entropy_bits = entropy_count >> ENTROPY_SHIFT; > > - if (crng_init < 2 && entropy_bits >= 128) > + if (crng_init < 2 && entropy_bits >= 128 && > + crng_global_init_time > 0) > crng_reseed(&primary_crng, r); > } > } > @@ -1763,8 +1764,8 @@ static void __init init_std_data(struct entropy_store *r) > } > > /* > - * Note that setup_arch() may call add_device_randomness() > - * long before we get here. This allows seeding of the pools > + * add_device_randomness() or add_bootloader_randomness() may be > + * called long before we get here. This allows seeding of the pools > * with some platform dependent data very early in the boot > * process. But it limits our options here. We must use > * statically allocated structures that already have all > @@ -2291,15 +2292,13 @@ void add_hwgenerator_randomness(const char *buffer, size_t count, > EXPORT_SYMBOL_GPL(add_hwgenerator_randomness); > > /* Handle random seed passed by bootloader. > - * If the seed is trustworthy, it would be regarded as hardware RNGs. Otherwise > - * it would be regarded as device data. > + * If the seed is trustworthy, its entropy will be credited. > * The decision is controlled by CONFIG_RANDOM_TRUST_BOOTLOADER. > */ > void add_bootloader_randomness(const void *buf, unsigned int size) > { > + add_device_randomness(buf, size); > if (IS_ENABLED(CONFIG_RANDOM_TRUST_BOOTLOADER)) > - add_hwgenerator_randomness(buf, size, size * 8); > - else > - add_device_randomness(buf, size); > + credit_entropy_bits(&input_pool, size * 8); > } > EXPORT_SYMBOL_GPL(add_bootloader_randomness);