Received: by 2002:a05:6358:701b:b0:131:369:b2a3 with SMTP id 27csp86066rwo; Fri, 21 Jul 2023 08:54:27 -0700 (PDT) X-Google-Smtp-Source: APBJJlH3hOfzOBvpJEthCDu1VUoKr5cJ/L+hGw6snTYqfrZ7OZLEmRpozNvDVtMx0+b6u7JT7lDr X-Received: by 2002:a05:6a00:1944:b0:668:6445:8931 with SMTP id s4-20020a056a00194400b0066864458931mr554779pfk.29.1689954867170; Fri, 21 Jul 2023 08:54:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689954867; cv=none; d=google.com; s=arc-20160816; b=TsIdUqkRoKfPUkgOnBqdW2ULVGmARjVqBruAUJNBA4jOkPFPtzsAIz/Aa6Gbr1kG1u 3Hh8Q3scPxH+Ya0dYl7lI/N3oH1ErZpZCQwAkOgPRc9WmPT/FKLO7GfXAtls33zRZupr HSXHncXoN3W8A8pjLOFcj3fdPYJXaFWxxxPVNW9XXQbG0HJNNUeVo4KYifoc8xCqlrvV Z9/tmY7Ffo8frvM5QFS7eMbciYI1mrz6y/LpZjUUPWcPNYs92G8ScOhpYbxp7XibGTBp AiLymIod+apfpp/l7ySblHaeR4NgzIuk7784gX/kjLzJzi95altztutlH445elhbydxk Beog== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature:dkim-filter; bh=O0b5d9evA9lwBqNidYtPEYBZ7CVLeRcJ2bV1Mfo3SIU=; fh=quOT+YZRnveYVN5nKy2StSR4LT/pceg7p2+5/KLhEeg=; b=acqY5+QoK04IiEtbbjI27iCIdmyhvmNHsFgug5n/20W9liwIv4/PySuV3yS/tzF0jq quVO2SH2q9SfOKg2uiRKRjRfWEDVvGW70IRPbX9Xd/M5CfIhtKkD1s7l+kvcXLeLqP9h QHuwrDMav9ruxwut5esfRbqIK7WHagdUhphGzzZO+MUNQVrkmH9YYLyeIlJOCDcdBosu FnTGyV6u7S2adrFGQLhLqtr4yXNTLwYyP+tOvGQatRfCUBQdvntCz2uExlD0S4xkdVwG TgXoGVG3LmsZVC75jIO5HK5NcsJbduENJ+DmhsuhtLTjOVvhnmxlj+eFUv/eZgrpUldQ LEsw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ispras.ru header.s=default header.b=S0i1WqiR; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=ispras.ru Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id o10-20020a635a0a000000b0055b12581c75si3028673pgb.675.2023.07.21.08.54.10; Fri, 21 Jul 2023 08:54:27 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@ispras.ru header.s=default header.b=S0i1WqiR; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=ispras.ru Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231834AbjGUPN7 (ORCPT + 99 others); Fri, 21 Jul 2023 11:13:59 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49424 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231679AbjGUPNw (ORCPT ); Fri, 21 Jul 2023 11:13:52 -0400 X-Greylist: delayed 463 seconds by postgrey-1.37 at lindbergh.monkeyblade.net; Fri, 21 Jul 2023 08:13:43 PDT Received: from mail.ispras.ru (mail.ispras.ru [83.149.199.84]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id CE9DA30DB; Fri, 21 Jul 2023 08:13:43 -0700 (PDT) Received: from tp-owlcat.intra.ispras.ru (unknown [10.10.165.6]) by mail.ispras.ru (Postfix) with ESMTPSA id EB21640B27AF; Fri, 21 Jul 2023 15:05:57 +0000 (UTC) DKIM-Filter: OpenDKIM Filter v2.11.0 mail.ispras.ru EB21640B27AF DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ispras.ru; s=default; t=1689951958; bh=O0b5d9evA9lwBqNidYtPEYBZ7CVLeRcJ2bV1Mfo3SIU=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=S0i1WqiRtXiHhhkHUV6IlKrtQZAUvVRZe2S/M7HAqcGQgRINg/yP+yIxGj8NDuExT HcMpCxo3hW8I+RUP3qIk0iiZlFAaKzD7oZxLMLwymxhgd6UzhMKP3qolInJfOGDCU5 7J0zU+X38yJMauQdaYJe2v5R7i2uN6fQrQ93EuIE= From: Anton Gusev To: stable@vger.kernel.org, Greg Kroah-Hartman Cc: Anton Gusev , Doug Ledford , Jason Gunthorpe , Leon Romanovsky , Sasha Levin , Mark Zhang , =?UTF-8?q?H=C3=A5kon=20Bugge?= , Michael Guralnik , Roland Dreier , Sean Hefty , linux-rdma@vger.kernel.org, linux-kernel@vger.kernel.org, lvc-project@linuxtesting.org, syzbot+dc3dfba010d7671e05f5@syzkaller.appspotmail.com, Jason Gunthorpe Subject: [PATCH 5.10 1/1] RDMA/cma: Ensure rdma_addr_cancel() happens before issuing more requests Date: Fri, 21 Jul 2023 18:05:33 +0300 Message-ID: <20230721150535.191318-2-aagusev@ispras.ru> X-Mailer: git-send-email 2.41.0 In-Reply-To: <20230721150535.191318-1-aagusev@ispras.ru> References: <20230721150535.191318-1-aagusev@ispras.ru> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_BLOCKED, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org commit 305d568b72f17f674155a2a8275f865f207b3808 upstream. The FSM can run in a circle allowing rdma_resolve_ip() to be called twice on the same id_priv. While this cannot happen without going through the work, it violates the invariant that the same address resolution background request cannot be active twice. CPU 1 CPU 2 rdma_resolve_addr(): RDMA_CM_IDLE -> RDMA_CM_ADDR_QUERY rdma_resolve_ip(addr_handler) #1 process_one_req(): for #1 addr_handler(): RDMA_CM_ADDR_QUERY -> RDMA_CM_ADDR_BOUND mutex_unlock(&id_priv->handler_mutex); [.. handler still running ..] rdma_resolve_addr(): RDMA_CM_ADDR_BOUND -> RDMA_CM_ADDR_QUERY rdma_resolve_ip(addr_handler) !! two requests are now on the req_list rdma_destroy_id(): destroy_id_handler_unlock(): _destroy_id(): cma_cancel_operation(): rdma_addr_cancel() // process_one_req() self removes it spin_lock_bh(&lock); cancel_delayed_work(&req->work); if (!list_empty(&req->list)) == true ! rdma_addr_cancel() returns after process_on_req #1 is done kfree(id_priv) process_one_req(): for #2 addr_handler(): mutex_lock(&id_priv->handler_mutex); !! Use after free on id_priv rdma_addr_cancel() expects there to be one req on the list and only cancels the first one. The self-removal behavior of the work only happens after the handler has returned. This yields a situations where the req_list can have two reqs for the same "handle" but rdma_addr_cancel() only cancels the first one. The second req remains active beyond rdma_destroy_id() and will use-after-free id_priv once it inevitably triggers. Fix this by remembering if the id_priv has called rdma_resolve_ip() and always cancel before calling it again. This ensures the req_list never gets more than one item in it and doesn't cost anything in the normal flow that never uses this strange error path. Link: https://lore.kernel.org/r/0-v1-3bc675b8006d+22-syz_cancel_uaf_jgg@nvidia.com Cc: stable@vger.kernel.org Fixes: e51060f08a61 ("IB: IP address based RDMA connection manager") Reported-by: syzbot+dc3dfba010d7671e05f5@syzkaller.appspotmail.com Signed-off-by: Jason Gunthorpe Signed-off-by: Anton Gusev --- drivers/infiniband/core/cma.c | 23 +++++++++++++++++++++++ drivers/infiniband/core/cma_priv.h | 1 + 2 files changed, 24 insertions(+) diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c index db24f7dfa00f..805678f6fe57 100644 --- a/drivers/infiniband/core/cma.c +++ b/drivers/infiniband/core/cma.c @@ -1792,6 +1792,14 @@ static void cma_cancel_operation(struct rdma_id_private *id_priv, { switch (state) { case RDMA_CM_ADDR_QUERY: + /* + * We can avoid doing the rdma_addr_cancel() based on state, + * only RDMA_CM_ADDR_QUERY has a work that could still execute. + * Notice that the addr_handler work could still be exiting + * outside this state, however due to the interaction with the + * handler_mutex the work is guaranteed not to touch id_priv + * during exit. + */ rdma_addr_cancel(&id_priv->id.route.addr.dev_addr); break; case RDMA_CM_ROUTE_QUERY: @@ -3401,6 +3409,21 @@ int rdma_resolve_addr(struct rdma_cm_id *id, struct sockaddr *src_addr, if (dst_addr->sa_family == AF_IB) { ret = cma_resolve_ib_addr(id_priv); } else { + /* + * The FSM can return back to RDMA_CM_ADDR_BOUND after + * rdma_resolve_ip() is called, eg through the error + * path in addr_handler(). If this happens the existing + * request must be canceled before issuing a new one. + * Since canceling a request is a bit slow and this + * oddball path is rare, keep track once a request has + * been issued. The track turns out to be a permanent + * state since this is the only cancel as it is + * immediately before rdma_resolve_ip(). + */ + if (id_priv->used_resolve_ip) + rdma_addr_cancel(&id->route.addr.dev_addr); + else + id_priv->used_resolve_ip = 1; ret = rdma_resolve_ip(cma_src_addr(id_priv), dst_addr, &id->route.addr.dev_addr, timeout_ms, addr_handler, diff --git a/drivers/infiniband/core/cma_priv.h b/drivers/infiniband/core/cma_priv.h index caece96ebcf5..b53f4fa5e3fb 100644 --- a/drivers/infiniband/core/cma_priv.h +++ b/drivers/infiniband/core/cma_priv.h @@ -89,6 +89,7 @@ struct rdma_id_private { u8 reuseaddr; u8 afonly; u8 timeout; + u8 used_resolve_ip; enum ib_gid_type gid_type; /* -- 2.41.0