Received: by 2002:a05:6a10:af89:0:0:0:0 with SMTP id iu9csp1360476pxb; Fri, 21 Jan 2022 16:30:04 -0800 (PST) X-Google-Smtp-Source: ABdhPJy+GcxXn4rNrMk7SyfZ4XEiq+kpRMpggujODr6oKK11BrCQQZmk8XsYl67Hl9yBquKmZLfN X-Received: by 2002:a63:3808:: with SMTP id f8mr4498650pga.186.1642811404546; Fri, 21 Jan 2022 16:30:04 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1642811404; cv=none; d=google.com; s=arc-20160816; b=OlU3jGvEp2tVcZXRTo8ijh0OKEiiutpmNbIu8KlD++R9ETfo4DZKcv7lAGjesfKEo4 82Vi8AABeNtJ9fIwFq4Xp6uV3C+u8CmidR3kI2tWuDPfyiAXEfGllIvQ+b0rkFyKw/WE 3RvoCi9W7xr2+0mUryDkK2PTmdG7Khbjbvv/9cmgPgt2uHBDvaKzZMbk7ctTMVV2JHiZ GzUjADvsSXuoln7XL0fCusVMntQDUDxHUeCCiUtHbMTZRkmbuG4jIb/7q2T2CLMeyort Qbw9mCXQyx7R99ZAGuNZuJ9Ias2BFOGwq8FvuDY4y5fVmkObyCSthbCbPG4QoH2fd1qu cMaA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=9jCAGYTiVW0K9poO3OseNMRHMxhJjCtQQmvS8GzBapY=; b=DJQXRGdneMILZDGpeg/c8YFVErHx5xijpZBYjw7eA1kmNAzlF6XTW7jR/0RDYXICU7 VpKl91L+e9gcOIh8mQW9jusjQeMGl3l2lmnrqt4/Wh+DstxU7MXjZVcnrSEjA4fjHEwv mVAoH9krVeFUL3iMc73tVV0Rn+tcAHZ6N6k22Y5nu2Bm0LvrQ8K1Re8ahmYIqJ3wKzWH CvexKZqwuopUHvzcnlOWaKD1yonNbw9/vcwkb0vnuY+xPV1LfokiUHo8zSt2z4Rkhxi7 M+ptyfh4axOlUvRcahBgIppkmBe1JzqghfV+srJCpWvLk3zNZyEUAWCIcchQjf/WC1bq lSWg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id s1si7978241plg.23.2022.01.21.16.29.52; Fri, 21 Jan 2022 16:30:04 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1378140AbiAUBum (ORCPT + 99 others); Thu, 20 Jan 2022 20:50:42 -0500 Received: from szxga01-in.huawei.com ([45.249.212.187]:16727 "EHLO szxga01-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S245241AbiAUBul (ORCPT ); Thu, 20 Jan 2022 20:50:41 -0500 Received: from canpemm500006.china.huawei.com (unknown [172.30.72.57]) by szxga01-in.huawei.com (SkyGuard) with ESMTP id 4Jg2L32dzPzZfJv; Fri, 21 Jan 2022 09:46:51 +0800 (CST) Received: from [10.174.179.200] (10.174.179.200) by canpemm500006.china.huawei.com (7.192.105.130) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2308.21; Fri, 21 Jan 2022 09:50:38 +0800 Subject: Re: [PATCH net] can: isotp: isotp_rcv_cf(): fix so->rx race problem To: Oliver Hartkopp , CC: , , , , References: <20220117120102.2395157-1-william.xuanziyang@huawei.com> <53279d6d-298c-5a85-4c16-887c95447825@hartkopp.net> <280e10c1-d1f4-f39e-fa90-debd56f1746d@huawei.com> <890d8209-f400-a3b0-df9c-3e198e3834d6@huawei.com> <1fb4407a-1269-ec50-0ad5-074e49f91144@hartkopp.net> <2aba02d4-0597-1d55-8b3e-2c67386f68cf@huawei.com> <64695483-ff75-4872-db81-ca55763f95cf@hartkopp.net> From: "Ziyang Xuan (William)" Message-ID: Date: Fri, 21 Jan 2022 09:50:38 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.11.0 MIME-Version: 1.0 In-Reply-To: <64695483-ff75-4872-db81-ca55763f95cf@hartkopp.net> Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 8bit X-Originating-IP: [10.174.179.200] X-ClientProxiedBy: dggems701-chm.china.huawei.com (10.3.19.178) To canpemm500006.china.huawei.com (7.192.105.130) X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > > On 20.01.22 12:28, Ziyang Xuan (William) wrote: >>> >>> On 20.01.22 07:24, Ziyang Xuan (William) wrote: >>> >>>> I have reproduced the syz problem with Marc's commit, the commit can not fix the panic problem. >>>> So I tried to find the root cause for panic and gave my solution. >>>> >>>> Marc's commit just fix the condition that packet size bigger than INT_MAX which trigger >>>> tpcon::{idx,len} integer overflow, but the packet size is 4096 in the syz problem. >>>> >>>> so->rx.len is 0 after the following logic in isotp_rcv_ff(): >>>> >>>> /* get the FF_DL */ >>>> so->rx.len = (cf->data[ae] & 0x0F) << 8; >>>> so->rx.len += cf->data[ae + 1]; >>>> >>>> so->rx.len is 4096 after the following logic in isotp_rcv_ff(): >>>> >>>> /* FF_DL = 0 => get real length from next 4 bytes */ >>>> so->rx.len = cf->data[ae + 2] << 24; >>>> so->rx.len += cf->data[ae + 3] << 16; >>>> so->rx.len += cf->data[ae + 4] << 8; >>>> so->rx.len += cf->data[ae + 5]; >>>> >>> >>> In these cases the values 0 could be the minimum value in so->rx.len - but e.g. the value 0 can not show up in isotp_rcv_cf() as this function requires so->rx.state to be ISOTP_WAIT_DATA. >> >> Consider the scenario that isotp_rcv_cf() and isotp_rcv_cf() are concurrent for the same isotp_sock as following sequence: > > o_O > > Sorry but the receive path is not designed to handle concurrent receptions that would run isotp_rcv_cf() and isotp_rcv_ff() simultaneously. > >> isotp_rcv_cf() >> if (so->rx.state != ISOTP_WAIT_DATA) [false] >>                         isotp_rcv_ff() >>                         so->rx.state = ISOTP_IDLE >>                         /* get the FF_DL */ [so->rx.len == 0] >> alloc_skb() [so->rx.len == 0] >>                         /* FF_DL = 0 => get real length from next 4 bytes */ [so->rx.len == 4096] >> skb_put(nskb, so->rx.len) [so->rx.len == 4096] >> skb_over_panic() >> > > Even though this case is not possible with a real CAN bus due to the CAN frame transmission times we could introduce some locking (or dropping of concurrent CAN frames) in isotp_rcv() - but this code runs in net softirq context ... > I thought the kernel code logic should make sure the kernel availability no matter what happens in user space code. And tx path has considered so->tx race condition actually but rx path for so->rx. > Regards, > Oliver > > >>> >>> And when so->rx.len is 0 in isotp_rcv_ff() this check >>> >>> if (so->rx.len + ae + off + ff_pci_sz < so->rx.ll_dl) >>>          return 1; >>> >>> will return from isotp_rcv_ff() before ISOTP_WAIT_DATA is set at the end. So after that above check we are still in ISOTP_IDLE state. >>> >>> Or did I miss something here? >>> >>>> so->rx.len is 0 before alloc_skb() and is 4096 after alloc_skb() in isotp_rcv_cf(). The following >>>> skb_put() will trigger panic. >>>> >>>> The following log is my reproducing log with Marc's commit and my debug modification in isotp_rcv_cf(). >>>> >>>> [  150.605776][    C6] isotp_rcv_cf: before alloc_skb so->rc.len: 0, after alloc_skb so->rx.len: 4096 >>> >>> >>> But so->rx_len is not a value that is modified by alloc_skb(): >>> >>>                  nskb = alloc_skb(so->rx.len, gfp_any()); >>>                  if (!nskb) >>>                          return 1; >>> >>>                  memcpy(skb_put(nskb, so->rx.len), so->rx.buf, >>>                         so->rx.len); >>> >>> >>> Can you send your debug modification changes please? >> >> My reproducing debug as attachment and following: >> >> diff --git a/net/can/isotp.c b/net/can/isotp.c >> index df6968b28bf4..8b12d63b4d59 100644 >> --- a/net/can/isotp.c >> +++ b/net/can/isotp.c >> @@ -119,8 +119,8 @@ enum { >>   }; >> >>   struct tpcon { >> -       int idx; >> -       int len; >> +       unsigned int idx; >> +       unsigned int len; >>          u32 state; >>          u8 bs; >>          u8 sn; >> @@ -505,6 +505,7 @@ static int isotp_rcv_cf(struct sock *sk, struct canfd_frame *cf, int ae, >>          struct isotp_sock *so = isotp_sk(sk); >>          struct sk_buff *nskb; >>          int i; >> +       bool unexpection = false; >> >>          if (so->rx.state != ISOTP_WAIT_DATA) >>                  return 0; >> @@ -562,11 +563,13 @@ static int isotp_rcv_cf(struct sock *sk, struct canfd_frame *cf, int ae, >>                                  sk_error_report(sk); >>                          return 1; >>                  } >> - >> +               if (so->rx.len == 0) >> +                       unexpection = true; >>                  nskb = alloc_skb(so->rx.len, gfp_any()); >>                  if (!nskb) >>                          return 1; >> - >> +               if (unexpection) >> +                       printk("%s: before alloc_skb so->rc.len: 0, after alloc_skb so->rx.len: %u\n", __func__, so->rx.len); >>                  memcpy(skb_put(nskb, so->rx.len), so->rx.buf, >>                         so->rx.len); >> >> >>> >>> Best regards, >>> Oliver >>> >>>> [  150.611477][    C6] skbuff: skb_over_panic: text:ffffffff881ff7be len:4096 put:4096 head:ffff88807f93a800 data:ffff88807f93a800 tail:0x1000 end:0xc0 dev: >>>> [  150.615837][    C6] ------------[ cut here ]------------ >>>> [  150.617238][    C6] kernel BUG at net/core/skbuff.c:113! >>>> >>> >>> . > .