Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp547013pxb; Thu, 25 Feb 2021 08:53:57 -0800 (PST) X-Google-Smtp-Source: ABdhPJwSRrkHG8c4xD6XaGiaZGV+U7U/aOi8RVhKMEf/Hwi6413e/jsTPAZ0LXbpCaWdP5C992xb X-Received: by 2002:a17:906:b747:: with SMTP id fx7mr3592625ejb.474.1614272037206; Thu, 25 Feb 2021 08:53:57 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1614272037; cv=none; d=google.com; s=arc-20160816; b=GeoVwLBaA5iYXkvT1Z4cTrqpa4Po6dqj0sGmRsmTQBuKs3/frrop0MZKihEkJTdPBS 8P9/FosOn5VUUpbjFwLNMjMMYQafCxEEDoc9dr6Y5XUiSlWjYyZkYp1W8BZXCDVO4FHV mMkVXqsYgphBtFXZTFGWDFM+bylqquA8YQbh/DjjD625e6MNkJwF9x/M/jrmdItsuz6H dKjLOrokbZeLiBWuU5KL4AwqZRGGNrm2jpPVmvMHMKegUMEAME2FuzsOqsnGog6tIHge IQl2XC28pqsjt75wnhLhGxzCadcFrFfMCYdZcW13E2+HVroEfvbWgftHCyMAEGTxNVWn UDOw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature :dkim-filter; bh=wlRHhMDhdnPoOFiC5N9w8GK8FzHuH7hrSm93HlcK0Lk=; b=gLNxbopeZWssOz8vfICMQ4KGKWrPaWcu5MG0u4u95uQltUwO78Opj1ohZLXLs2fXrD rKQdAaW2sw1OqQHzF4MdB/yZKHiGzZmq7RiGh4rL1b25mw8VEFLjwswXTjllLRlYZ0B7 L4DTq/B0T/Ed7aE1SSqi3tkMrK2NIIFrtoVVsDziiKBQwFvRAY5eyIgeSzv6qCKPpp+K N5+PulieJZBf0c48qhACU2rHtmg6rrPknn384vZqDklqkkJaWs4GP+lIVzO5FTzLgST9 FIY0DC/kYbPCZCvUOsTsthh1TJPZ/IQqqVOzCj/2qYipp66i9m5wuWA//+6eUEidn7WI Gp4w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linux.microsoft.com header.s=default header.b=RYEpC4Rb; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.microsoft.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id p62si3625006edp.242.2021.02.25.08.53.34; Thu, 25 Feb 2021 08:53:57 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linux.microsoft.com header.s=default header.b=RYEpC4Rb; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.microsoft.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234135AbhBYQu0 (ORCPT + 99 others); Thu, 25 Feb 2021 11:50:26 -0500 Received: from linux.microsoft.com ([13.77.154.182]:40634 "EHLO linux.microsoft.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233864AbhBYQqi (ORCPT ); Thu, 25 Feb 2021 11:46:38 -0500 Received: from sequoia (162-237-133-238.lightspeed.rcsntx.sbcglobal.net [162.237.133.238]) by linux.microsoft.com (Postfix) with ESMTPSA id A1D1920B6C40; Thu, 25 Feb 2021 08:45:55 -0800 (PST) DKIM-Filter: OpenDKIM Filter v2.11.0 linux.microsoft.com A1D1920B6C40 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.microsoft.com; s=default; t=1614271556; bh=wlRHhMDhdnPoOFiC5N9w8GK8FzHuH7hrSm93HlcK0Lk=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=RYEpC4RbxF1x4ktJidIkRrotkXepMC9his7/DJxkAzyGquo/8KPx9wknKuEmhPe13 Y/lBsSoDiJrB77FAFbGwMEHgmJjgtYlvAGIcUBLD8hIz0UgcfS7bFpPpTjJDs591eh 69Kd1+1mIdx5NkHV4Pu1+mWqd3h2J2w1MuPCu4nQ= Date: Thu, 25 Feb 2021 10:45:53 -0600 From: Tyler Hicks To: Ondrej Mosnacek Cc: Paul Moore , Stephen Smalley , SElinux list , Linux kernel mailing list Subject: Re: [BUG] Race between policy reload sidtab conversion and live conversion Message-ID: <20210225164553.GG6000@sequoia> References: <20210223214346.GB6000@sequoia> <20210223215054.GC6000@sequoia> <20210223223652.GD6000@sequoia> <20210224143651.GE6000@sequoia> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2021-02-25 17:38:25, Ondrej Mosnacek wrote: > On Wed, Feb 24, 2021 at 3:43 PM Tyler Hicks wrote: > > On 2021-02-24 10:33:46, Ondrej Mosnacek wrote: > > > On Tue, Feb 23, 2021 at 11:37 PM Tyler Hicks > > > wrote: > > > > On 2021-02-23 15:50:56, Tyler Hicks wrote: > > > > > On 2021-02-23 15:43:48, Tyler Hicks wrote: > > > > > > I'm seeing a race during policy load while the "regular" sidtab > > > > > > conversion is happening and a live conversion starts to take place in > > > > > > sidtab_context_to_sid(). > > > > > > > > > > > > We have an initial policy that's loaded by systemd ~0.6s into boot and > > > > > > then another policy gets loaded ~2-3s into boot. That second policy load > > > > > > is what hits the race condition situation because the sidtab is only > > > > > > partially populated and there's a decent amount of filesystem operations > > > > > > happening, at the same time, which are triggering live conversions. > > > > > > > > Hmm, perhaps this is the same problem that's fixed by Ondrej's proposed > > > > change here: > > > > > > > > https://lore.kernel.org/selinux/20210212185930.130477-3-omosnace@redhat.com/ > > > > > > > > I'll put these changes through a validation run (the only place that I > > > > can seem to reproduce this crash) and see how it looks. > > > > > > Hm... I think there is actually another race condition introduced by > > > the switch from rwlock to RCU [1]... Judging from the call trace you > > > may be hitting that. > > > > I believe your patches above fixed the race I was seeing. I was able to > > make it through a full validation run without any crashes. Without those > > patches applied, I would see several crashes resulting from this race > > over the course of a validation run. > > Hm... okay so probably you were indeed running into that bug. I tried > to reproduce the other race (I added a BUG_ON to help detect it), but > wasn't able to reproduce it with my (pretty aggressive) stress test. I > only managed to trigger it by adding a conditional delay in the right > place. So I now know the second bug is really there, though it' seems > to be very unlikely to be hit in practice (might be more likely on > systems with many CPU cores, though). The first bug, OTOH, is > triggered almost instantly by my stress test. > > Unless someone objects, I'll start working on a patch to switch back > to read-write lock for now. If all goes well, I'll send it sometime > next week. > > > > > I'll continue to test with your changes and let you know if I end up > > running into the other race you spotted. > > Thanks, but given the results of my testing it's probably not worth trying :) Those changes have now survived through several validation runs. I can confidently say that they fix the race I was seeing. Tyler > > > > > Tyler > > > > > > > > Basically, before the switch the sidtab swapover worked like this: > > > 1. Start live conversion of new entries. > > > 2. Convert existing entries. > > > [Still only the old sidtab is visible to readers here.] > > > 3. Swap sidtab under write lock. > > > 4. Now only the new sidtab is visible to readers, so the old one can > > > be destroyed. > > > > > > After the switch to RCU, we now have: > > > 1. Start live conversion of new entries. > > > 2. Convert existing entries. > > > 3. RCU-assign the new policy pointer to selinux_state. > > > [!!! Now actually both old and new sidtab may be referenced by > > > readers, since there is no synchronization barrier previously provided > > > by the write lock.] > > > 4. Wait for synchronize_rcu() to return. > > > 5. Now only the new sidtab is visible to readers, so the old one can > > > be destroyed. > > > > > > So the race can happen between 3. and 5., if one thread already sees > > > the new sidtab and adds a new entry there, and a second thread still > > > has the reference to the old sidtab and also tires to add a new entry; > > > live-converting to the new sidtab, which it doesn't expect to change > > > by itself. Unfortunately I failed to realize this when reviewing the > > > patch :/ > > > > > > I think the only two options to fix it are A) switching back to > > > read-write lock (the easy and safe way; undoing the performance > > > benefits of [1]), or B) implementing a safe two-way live conversion of > > > new sidtab entries, so that both tables are kept in sync while they > > > are both available (more complicated and with possible tricky > > > implications of different interpretations of contexts by the two > > > policies). > > > > > > [1] 1b8b31a2e612 ("selinux: convert policy read-write lock to RCU") > > > > > > -- > > > Ondrej Mosnacek > > > Software Engineer, Linux Security - SELinux kernel > > > Red Hat, Inc. > > > > > > > > -- > Ondrej Mosnacek > Software Engineer, Linux Security - SELinux kernel > Red Hat, Inc. >