X-Received: by 2002:a17:90a:67c9:b0:1b9:51d5:6c13 with SMTP id g9-20020a17090a67c900b001b951d56c13mr11825068pjm.216.1645664073088; Wed, 23 Feb 2022 16:54:33 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1645664073; cv=none; d=google.com; s=arc-20160816; b=gL3zXYCPxSA48LOqgW8fa5jqI1Pkhnm2Um+e3fiw/rqziau+tSU57fEKomUcIGyKL6 Gu5xGGcWeRJzGzel3j8qNDRVlAvDf1qkiPAAJEmcF9BYSMb6dXsCMhrowWfhptjtQiDJ N9OxEgjb9bUU1Trk/cZLuKgcMGmmT04fN+JwV4ybCvYRhPSXufILnKda1CvzbsYHsMoX wFEJNJTLBzcli3p/ywrgEO0ci/fOqzYS2r4d+mbpT3UAGVonPmc7yfcUXw3HCtb0b7mI jVF8mRdraZ1GvbD8pS5wUWIUVnAwHckmQ/zXmazP1UxVy2BVAJ5DJun2+R4yVoEnhaj1 La+g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=RrdEVsjsD+nqXmQqBvUEIQg+R+7+UWVmueHZbvxi/fY=; b=mGGdsoRmG+hR8JHeuDm3R+sgjZ3ppJfW97/7zw+vdKFqnA0qJ6XpxnWbO2vjF7Ogjx /MCK2UHxxl/diVJWzzn8Xe7As3IXKMZxRaAQ4m+dr/a1PcEa3lONfCLwPx/zSlG5gYSx pdtoUWgPaPejjFryMkhozThiE7nb5Ici/AXnpTXW3Xie2MJ9GS8j36e/qUfCw309lxOt mvUCeUh3TGucrr4c+KEvtwC3NMwro5xrWKcDPOxKIt3/REKZfMuM3Va9xUVbl0QdJbkA xqBH6SSIAn4WN3FV2g6AIKmNvmoMM14A4oRRCL6Cgdv2LLLjF5yopviKNcpfsnnIWVK3 aObQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=HrqUskcT; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id v8si931857pjl.158.2022.02.23.16.54.32 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 23 Feb 2022 16:54:33 -0800 (PST) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=HrqUskcT; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id AE5C39A99A; Wed, 23 Feb 2022 16:47:53 -0800 (PST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S243539AbiBWRvx (ORCPT + 99 others); Wed, 23 Feb 2022 12:51:53 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44870 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S237008AbiBWRvt (ORCPT ); Wed, 23 Feb 2022 12:51:49 -0500 Received: from mail-lj1-x22b.google.com (mail-lj1-x22b.google.com [IPv6:2a00:1450:4864:20::22b]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id F074826CA for ; Wed, 23 Feb 2022 09:51:20 -0800 (PST) Received: by mail-lj1-x22b.google.com with SMTP id o6so25823401ljp.3 for ; Wed, 23 Feb 2022 09:51:20 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=RrdEVsjsD+nqXmQqBvUEIQg+R+7+UWVmueHZbvxi/fY=; b=HrqUskcTi7myVyE20lUHGxJ5Cbqhrcee6XGl5vkm8NnzpACCfPRFm/+h5jJFvLnJIH J09n5gnVJYBhqvTpCNt326dmBcHXVR9YUWVcgJOUx74XOdS76Di+Wn+9quQYy+pi6ymr AsBJ+RxttBSyKX71ZYQ6bNvnrcXsiJ1bMJZvziOq0mEDMa/C6SnANiHyPjCmlEDftBo/ /dmao04eFUAwrM4TWxdQ2+Jv6uDUNIJ/Y7xhQ5+9z5AGuZ7My4MfYahlXf0wQULDAW0S WUl4lShCV5B1BEGM+itp8+KvXaDTINCwBbb3TuBr8OUEIxfhmCSDYRJNrwhzTrZlOXR8 AoMQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=RrdEVsjsD+nqXmQqBvUEIQg+R+7+UWVmueHZbvxi/fY=; b=n+ItAJm6nux2N6k2MyL4YR6YWbCHvqaajXDk6OxQ67SdC2qenUjuLmnqEc9KpPlQ94 zVKuQt9dDGBM2HP0kNVUyQ68AcF9C8A7eSOA83I+5dVEMlbo4XZA1wXQPbqUe4pVZ/ie r/n6KG5cDOfIdeI5eVc/+oAcCyg7HnMnZsmxutDPkOkBZT6yCtEESBXTMB3cUs6R8P7C xmWCPBCOfj04WHUDJ19FnPmdNVdBN+5nWO8Kg7WplVXxWzCIAzNTCzOLFom/fOjgwMrH YLP8/DSwBGBmBjEtHBQFwp2DB+cTdrpgTSgu6Wed1F9NmpTYmgBMU+L5uJV7JrLslrzP knGA== X-Gm-Message-State: AOAM530seRPk0tK/hN00J4flKrnxvtSZBRvfsxe9pPw9WSOig9IEvNE9 Q3Ll19gZ3ynxSAvajKjQb+GQ1vWF1U27YD9UJLq5hw== X-Received: by 2002:a2e:b78f:0:b0:246:6331:c1bc with SMTP id n15-20020a2eb78f000000b002466331c1bcmr378483ljo.188.1645638679148; Wed, 23 Feb 2022 09:51:19 -0800 (PST) MIME-Version: 1.0 References: <20220218181950.1438236-1-jannh@google.com> <8D85619E-99BD-4DB5-BDDB-A205B057C910@chromium.org> In-Reply-To: <8D85619E-99BD-4DB5-BDDB-A205B057C910@chromium.org> From: Jann Horn Date: Wed, 23 Feb 2022 18:50:52 +0100 Message-ID: Subject: Re: [PATCH v2] pstore: Don't use semaphores in always-atomic-context code To: Kees Cook Cc: Anton Vorontsov , Colin Cross , Tony Luck , Ard Biesheuvel , Sebastian Andrzej Siewior , linux-efi@vger.kernel.org, linux-kernel@vger.kernel.org, stable@vger.kernel.org Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-9.5 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE, USER_IN_DEF_DKIM_WL autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Feb 23, 2022 at 8:50 AM Kees Cook wrote: > On February 18, 2022 10:19:50 AM PST, Jann Horn wrote: > >pstore_dump() is *always* invoked in atomic context (nowadays in an RCU > >read-side critical section, before that under a spinlock). > >It doesn't make sense to try to use semaphores here. > > Ah, very nice. Thanks for the analysis! > > >[...] > >-static bool pstore_cannot_wait(enum kmsg_dump_reason reason) > >+bool pstore_cannot_block_path(enum kmsg_dump_reason reason) > > Why the rename, That's one of the parts of commit ea84b580b955 that I included in the revert. "wait" in the name is not accurate, since "wait" in the kernel normally refers to scheduling away until some condition is fulfilled. (Though I guess "block" also isn't the best name either... idk.) The place where we might want to have different behavior depending on whether we're handling a kernel crash are spinlocks; during a kernel crash, we shouldn't deadlock on them, but otherwise, AFAIK it's fine to block on them. > extern, and EXPORT? This appears to still only have the same single caller? Also part of the revert. I figured it might make sense to also revert that part because: With this commit applied, the EFI code will always take the "nonblock" path for now, but that's kinda suboptimal; on some platforms the "blocking" path uses a semaphore, so we really can't take that, but on x86 it uses a spinlock, which we could block on if we're not oopsing. We could avoid needlessly losing non-crash dmesg dumps there; I don't know whether we care about that though. So I figured that we might want to start adding new callers to this later on. But if you want, I'll remove that part of the revert and resend? > > [...] > >- pr_err("dump skipped in %s path: may corrupt error record\n", > >- in_nmi() ? "NMI" : why); > >- return; > >- } > >- if (down_interruptible(&psinfo->buf_lock)) { > >- pr_err("could not grab semaphore?!\n"); > >+ if (pstore_cannot_block_path(reason)) { > >+ if (!spin_trylock_irqsave(&psinfo->buf_lock, flags)) { > >+ pr_err("dump skipped in %s path because of concurrent dump\n" > >+ , in_nmi() ? "NMI" : why); > > The pr_err had the comma following the format string moved, Ah, whoops, that was also part of the revert, but I guess I should have left that part out... > and the note about corruption removed. Is that no longer accurate? There should be no more corruption since commit 959217c84c27 ("pstore: Actually give up during locking failure") - if we're bailing out, we can't be causing corruption, I believe?