Date: Mon, 9 Nov 2020 09:49:38 -0800
From: Jakub Kicinski
To: Thadeu Lima de Souza Cascardo
Cc: Kleber Sacilotto de Souza, Eric Dumazet, netdev@vger.kernel.org, Gerrit Renker, "David S. Miller", "Gustavo A. R. Silva", "Alexander A.
Klimov", Kees Cook, Alexey Kodanev, dccp@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH 1/2] dccp: ccid: move timers to struct dccp_sock
Message-ID: <20201109094938.45b230c9@kicinski-fedora-pc1c0hjn.dhcp.thefacebook.com>
In-Reply-To: <20201109114828.GP595944@mussarela>
References: <20201013171849.236025-1-kleber.souza@canonical.com>
 <20201013171849.236025-2-kleber.souza@canonical.com>
 <20201016153016.04bffc1e@kicinski-fedora-pc1c0hjn.dhcp.thefacebook.com>
 <20201109114828.GP595944@mussarela>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, 9 Nov 2020 08:48:28 -0300 Thadeu Lima de Souza Cascardo wrote:
> On Fri, Oct 16, 2020 at 03:30:16PM -0700, Jakub Kicinski wrote:
> > On Tue, 13 Oct 2020 19:18:48 +0200 Kleber Sacilotto de Souza wrote:
> > > From: Thadeu Lima de Souza Cascardo
> > >
> > > When dccps_hc_tx_ccid is freed, ccid timers may still trigger. The reason
> > > del_timer_sync can't be used is because this relies on keeping a reference
> > > to struct sock. But as we keep a pointer to dccps_hc_tx_ccid and free that
> > > during disconnect, the timer should really belong to struct dccp_sock.
> > >
> > > This addresses CVE-2020-16119.
> > >
> > > Fixes: 839a6094140a (net: dccp: Convert timers to use timer_setup())
> > > Signed-off-by: Thadeu Lima de Souza Cascardo
> > > Signed-off-by: Kleber Sacilotto de Souza
> >
> > I've been mulling over this fix.
> >
> > The layering violation really doesn't sit well.
> >
> > We're reusing the timer object. What if we are really unlucky, the
> > timer fires and gets blocked by a cosmic ray just as it's about to try to
> > lock the socket, then user manages to reconnect, and timer starts
> > again. Potentially with a different CCID algo altogether?
> >
> > Is disconnect ever called under the BH lock?
> > Maybe plumb a bool argument through to ccid*_hc_tx_exit() and do a
> > sk_stop_timer_sync() when called from disconnect()?
> >
> > Or do refcounting on ccid_priv so that the timer holds both the socket
> > and the priv?
>
> Sorry for the late response. I was on vacation, then came back and spent a
> couple of days testing this further, and had to switch to other tasks.
>
> So, while testing this, I had to resort to tricks like having a very small
> expire and enqueuing on a different CPU. Then, after some minutes, I hit a UAF.
> That's with or without the first or the second patch.
>
> I also tried to refcount ccid instead of the socket, keeping the timer on the
> ccid, but that still hit the UAF, and that's when I had to switch tasks.

Hm, not instead of the socket, in addition to it. I think trying to cancel
the timer with the _sync variant from the disconnect path would be the
simplest solution, though.

> Oh, and in the meantime, I found one or two other fixes that we
> should apply, will send them shortly.
>
> But I would argue that we should apply the revert as it addresses the
> CVE, without really regressing the other UAF, as I argued. Does that
> make sense?

We can. It's always a little strange to go from one bug to a different one
without a fix, but is the justification that, while the previous UAF
required a race condition, the new one is actually worse because it can be
triggered reliably?
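
To make the refcounting option a bit more concrete, here is a rough
userspace sketch in C. It is not kernel code: every model_* name is made up
for illustration and none of them is a kernel API. The idea it tries to show
is that the timer takes a reference on both the socket and the priv it was
armed against, so a disconnect followed by a reconnect cannot free the state
the handler is about to use, and a synchronous cancel just drops those
references early. It is single-threaded, so it only models the lifetime
rules, not the locking or the race itself.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical userspace model; none of these names are kernel APIs. */
struct model_priv {            /* stands in for the per-CCID private state */
	int refcnt;
	int algo;
};

struct model_sock {
	int refcnt;
	struct model_priv *priv;   /* swapped on disconnect/reconnect */
};

struct model_timer {
	bool pending;
	struct model_sock *sk;     /* pinned while pending */
	struct model_priv *priv;   /* pinned while pending */
};

static int model_privs_freed;  /* bookkeeping for the demonstration only */

static struct model_priv *model_priv_new(int algo)
{
	struct model_priv *p = malloc(sizeof(*p));

	assert(p != NULL);
	p->refcnt = 1;
	p->algo = algo;
	return p;
}

static void model_priv_put(struct model_priv *p)
{
	assert(p->refcnt > 0);
	if (--p->refcnt == 0) {
		free(p);
		model_privs_freed++;
	}
}

static struct model_sock *model_sock_new(int algo)
{
	struct model_sock *sk = malloc(sizeof(*sk));

	assert(sk != NULL);
	sk->refcnt = 1;
	sk->priv = model_priv_new(algo);
	return sk;
}

static void model_sock_put(struct model_sock *sk)
{
	assert(sk->refcnt > 0);
	if (--sk->refcnt == 0) {
		if (sk->priv)
			model_priv_put(sk->priv);
		free(sk);
	}
}

/* Arming pins both objects the handler will later dereference. */
static void model_timer_arm(struct model_timer *t, struct model_sock *sk)
{
	assert(!t->pending && sk->priv != NULL);
	t->sk = sk;
	t->priv = sk->priv;
	sk->refcnt++;
	t->priv->refcnt++;
	t->pending = true;
}

/* Drop whatever the timer pinned; shared by fire and cancel. */
static void model_timer_release(struct model_timer *t)
{
	t->pending = false;
	model_priv_put(t->priv);
	model_sock_put(t->sk);
}

/* Handler: reads the pinned priv, never sk->priv, then unpins. */
static int model_timer_fire(struct model_timer *t)
{
	int algo;

	assert(t->pending);
	algo = t->priv->algo;
	model_timer_release(t);
	return algo;
}

/* Synchronous cancel: returns true if a pending timer was stopped. */
static bool model_timer_cancel(struct model_timer *t)
{
	if (!t->pending)
		return false;
	model_timer_release(t);
	return true;
}

/* Disconnect and reconnect only touch the socket's own reference. */
static void model_disconnect(struct model_sock *sk)
{
	model_priv_put(sk->priv);
	sk->priv = NULL;
}

static void model_connect(struct model_sock *sk, int algo)
{
	assert(sk->priv == NULL);
	sk->priv = model_priv_new(algo);
}
```

In this model, arming the timer, disconnecting, reconnecting with a
different algo and only then firing leaves the handler looking at the old
priv, which is freed once the timer drops its reference. Calling
model_timer_cancel() from the disconnect path instead releases both
references before the priv is swapped.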