Received: by 2002:ac0:a582:0:0:0:0:0 with SMTP id m2-v6csp5049029imm; Tue, 16 Oct 2018 04:31:05 -0700 (PDT) X-Google-Smtp-Source: ACcGV632TSWN6AxcmziK/a5AihO18tRj9LRK7MAhQPLH4rdjbfHFDnLKgsmeIyYh50PWDBQx7UcM X-Received: by 2002:a63:4cb:: with SMTP id 194-v6mr19942916pge.183.1539689465338; Tue, 16 Oct 2018 04:31:05 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1539689465; cv=none; d=google.com; s=arc-20160816; b=miTrIjYmg9PNKHyJ67BXpaJmhaf0wUMli4zsR26K/GxfvvEX1MJLidqFYUU6aZ8f7a wLyrEkEITp4whPCkys6vNlVuRxmlZhgeZjgpfxF806a0z/YO2SXZ6rvJ3Czk2XQcTnDN lQZy5waTE0c0d7rrTRL27NcEPbOyLboMnE/MumA/V6Yee+QEtp9h3PbIncrobxwrBMFu k4KytIXBGcaMpX9Jx+FMbZzvv3a3bI/eiKuB5K94tcrWn3QyH3EBG80Qg5GiqBlq7k24 5Vi73vkpXNWC7RSF1zyFnu7/QU+ad54mzE4CzoLVl8Ub9qya601NGg/cRcsSXsja0AyJ 6jsQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=vEfYmk8geDjHqnQvBzi68epmD2K6RG5jkowPpBEAAns=; b=W1DyAWzuXhFuvSpJXFK1v8863g9Wmp7/NKfEmgXlixOuNlZ3O6m20jmA3By4NkWz/E 1G+TQfJczG7QyCE1zwcXWq0Fu0Iy/aLylnr0i6qRjy+DNoz4wzwlv102W6qgNtLpzVk6 2Hg5nHrvcnGKnVCHtG0lWwfd9jpXyh43LXXTuh2J9Mqf5+RJ8+WPnqD1+mBmTs6KOqHM ETHJIHS9q5nbV6nCZTsopAe21mBTJpqgc3DYgXbObScAr191iziLqUFfROlPk9ftnVjY 0iaiwf0TA+XwadsbcvXrCHi0HUmODxCu/NrvKrQQ2eOcrKZ4mK10p2sgXWDe36pTp2j0 hOQQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o9-v6si13917628pfe.283.2018.10.16.04.30.49; Tue, 16 Oct 2018 04:31:05 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727071AbeJPTTK (ORCPT + 99 others); Tue, 16 Oct 2018 15:19:10 -0400 Received: from charlotte.tuxdriver.com ([70.61.120.58]:59557 "EHLO smtp.tuxdriver.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726861AbeJPTTK (ORCPT ); Tue, 16 Oct 2018 15:19:10 -0400 Received: from cpe-2606-a000-111b-40fe-f1c6-eb46-ccca-fca5.dyn6.twc.com ([2606:a000:111b:40fe:f1c6:eb46:ccca:fca5] helo=localhost) by smtp.tuxdriver.com with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1gC7dT-0003Aq-23; Mon, 15 Oct 2018 14:30:43 -0400 Date: Tue, 16 Oct 2018 07:28:17 -0400 From: Neil Horman To: Marcelo Ricardo Leitner Cc: Dmitry Vyukov , syzbot , David Miller , LKML , linux-sctp@vger.kernel.org, netdev , syzkaller-bugs , Vladislav Yasevich Subject: Re: KASAN: use-after-free Read in sctp_id2assoc Message-ID: <20181015112330.GA31030@hmswarspite.think-freely.org> References: <0000000000007e767d05776336da@google.com> <20181005145855.GB6761@localhost.localdomain> <20181010181325.GD6761@localhost.localdomain> <20181010184011.GE6761@localhost.localdomain> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20181010184011.GE6761@localhost.localdomain> User-Agent: Mutt/1.10.1 (2018-07-13) X-Spam-Score: 0.3 (/) X-Spam-Status: No Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Oct 10, 2018 at 03:40:11PM -0300, Marcelo Ricardo Leitner wrote: > On Wed, Oct 10, 2018 at 08:28:22PM +0200, Dmitry Vyukov wrote: > > On Wed, Oct 10, 2018 at 8:13 PM, Marcelo Ricardo Leitner > > wrote: > > > On Wed, Oct 10, 2018 at 05:28:12PM +0200, Dmitry Vyukov wrote: > > >> On Fri, Oct 5, 2018 at 4:58 PM, Marcelo Ricardo Leitner > > >> wrote: > > >> > On Thu, Oct 04, 2018 at 01:48:03AM -0700, syzbot wrote: > > >> >> Hello, > > >> >> > > >> >> syzbot found the following crash on: > > >> >> > > >> >> HEAD commit: 4e6d47206c32 tls: Add support for inplace records encryption > > >> >> git tree: net-next > > >> >> console output: https://syzkaller.appspot.com/x/log.txt?x=13834b81400000 > > >> >> kernel config: https://syzkaller.appspot.com/x/.config?x=e569aa5632ebd436 > > >> >> dashboard link: https://syzkaller.appspot.com/bug?extid=c7dd55d7aec49d48e49a > > >> >> compiler: gcc (GCC) 8.0.1 20180413 (experimental) > > >> >> > > >> >> Unfortunately, I don't have any reproducer for this crash yet. > > >> >> > > >> >> IMPORTANT: if you fix the bug, please add the following tag to the commit: > > >> >> Reported-by: syzbot+c7dd55d7aec49d48e49a@syzkaller.appspotmail.com > > >> >> > > >> >> netlink: 'syz-executor1': attribute type 1 has an invalid length. > > >> >> ================================================================== > > >> >> BUG: KASAN: use-after-free in sctp_id2assoc+0x3a7/0x3e0 > > >> >> net/sctp/socket.c:276 > > >> >> Read of size 8 at addr ffff880195b3eb20 by task syz-executor2/15454 > > >> >> > > >> >> CPU: 1 PID: 15454 Comm: syz-executor2 Not tainted 4.19.0-rc5+ #242 > > >> >> Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS > > >> >> Google 01/01/2011 > > >> >> Call Trace: > > >> >> __dump_stack lib/dump_stack.c:77 [inline] > > >> >> dump_stack+0x1c4/0x2b4 lib/dump_stack.c:113 > > >> >> print_address_description.cold.8+0x9/0x1ff mm/kasan/report.c:256 > > >> >> kasan_report_error mm/kasan/report.c:354 [inline] > > >> >> kasan_report.cold.9+0x242/0x309 mm/kasan/report.c:412 > > >> >> __asan_report_load8_noabort+0x14/0x20 mm/kasan/report.c:433 > > >> >> sctp_id2assoc+0x3a7/0x3e0 net/sctp/socket.c:276 > > >> > > > >> > I'm not seeing yet how this could happen. > > >> > All sockopts here are serialized by sock_lock. > > >> > do_peeloff here would create another socket, but the issue was > > >> > triggered before that. > > >> > The same function that freed this memory, also removes the entry from > > >> > idr mapping, so this entry shouldn't be there anymore. > > >> > > > >> > I have only two theories so far: > > >> > - an issue with IDR/RCU. > > >> > - something else happened that just the call stacks are not revealing. > > >> > > >> The "asoc->base.sk != sk" check after idr_find suggests that we don't > > >> actually know what sock it belongs to. And if we don't know then > > > > > > Right. The check is more because the IDR is global and not per socket > > > (and we don't want sockets accessing asocs from other sockets), and not > > > that the asoc may move to another socket in between, but it also > > > protects from such cases, yes. > > > > > >> locking this sock can't help keeping another sock association alive. > > >> Am I missing something obvious here? Should we take assoc ref while we > > > > > > Not sure. Maybe I am. Thanks for looking into this, btw. > > > > > >> are still holding sctp_assocs_id_lock? > > > > > > Shouldn't be needed. > > > > > > Solely by the call stacks: > > > - we tried to establish a new asoc from a sctp_connect() call, > > > blocking one. > > > - it slept waiting for the connect > > > - (something closed the asoc in between the sleeps, because it freed > > > the asoc right when waking up on sctp_wait_for_connect()) > > > - it freed the asoc after sleeping on it on sctp_wait_for_connect [A] > > > - another thread tried to peeloff that asoc [B] > > > > > > For [B] to access the asoc in question, it had to take the same sock > > > lock [A] had taken, and then the idr should not return an asoc in > > > sctp_i2asoc(). Note that we can't peeloff an asoc twice, thus why > > > the certainty here. > > > > > > If [B] actually kicked in before the sleep resumed, that should have > > > been fine because it took the same sock lock [A] would have to > > > re-take. In this case an asoc would have been returned by > > > sctp_id2asoc(), the asoc would have been moved to a new socket, but > > > all while holding the original socket sock lock. > > > > But why A and B use the same lock? > > > > sctp_assocs_id is global, so it contains asocs from all sockets, right? > > assoc id comes straight from userspaces. > > So isn't it possible that B uses completely different sock but passes > > assoc id from the A sock? Then B should find assoc in sctp_assocs_id, > > and at the point of "asoc->base.sk != sk" check the assoc can be > > already freed. > > That explains it. Somehow I was thinking the issue was for reading > ->dead instead. Now it's pretty clear. Then this should be the patch > we want. Can you please give it a spin? (only compile tested) > > While holding the spinlock, an entry cannot be removed from the idr > and thus it cannot be freed. So even if the app uses an id from > another socket, it will still be there. > > ---8<--- > > diff --git a/net/sctp/socket.c b/net/sctp/socket.c > index f73e9d38d5ba..a7722f43aa69 100644 > --- a/net/sctp/socket.c > +++ b/net/sctp/socket.c > @@ -271,11 +271,10 @@ struct sctp_association *sctp_id2assoc(struct sock *sk, sctp_assoc_t id) > > spin_lock_bh(&sctp_assocs_id_lock); > asoc = (struct sctp_association *)idr_find(&sctp_assocs_id, (int)id); > + if (asoc && (asoc->base.sk != sk || asoc->base.dead)) > + asoc = NULL; > spin_unlock_bh(&sctp_assocs_id_lock); > > - if (!asoc || (asoc->base.sk != sk) || asoc->base.dead) > - return NULL; > - > return asoc; > } > > Marcello, can you post this with a proper changelog commit please? Based on the bug report, and description of the problem, I think we can all agree this is a sane fix Thanks Neil