Received: by 2002:a05:7412:cfc7:b0:fc:a2b0:25d7 with SMTP id by7csp1469993rdb; Mon, 19 Feb 2024 17:55:44 -0800 (PST) X-Forwarded-Encrypted: i=3; AJvYcCUF2Tf0hD1+3T4rviX3PSLGP4eaiog/0qjH+Tk84OMpvyMNLr5l6mevAvmw8tQKd8g2WghiRdhGUsg3xL9cdIXZABK1wbD64SyMy+eMOw== X-Google-Smtp-Source: AGHT+IHSGpcD5BHpgVnjd7Uj72xfmqxW5OFF2dMmeRuo60M0XFXgtGwxX8t1T3koLchslLq/cQ/t X-Received: by 2002:a50:fa84:0:b0:564:c519:554e with SMTP id w4-20020a50fa84000000b00564c519554emr219614edr.16.1708394144240; Mon, 19 Feb 2024 17:55:44 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1708394144; cv=pass; d=google.com; s=arc-20160816; b=P014DPCbc8iBDH4aguOXWb9G38CxdHgwoztcEbXBTv7so2dEVl/KE/qDyR8VJ0GArH srx0cfb+JfIqaXvBiNgdwO14bvcreB5ogggFd7w35rWP1lmdPAQkY6/KOlhHWbPFTIqD 9eu5XW/cVH50Rr8iNep7Yt2XIflD7xWn1mG18W5QnbH6u4oojzYJrAqDZCWvbAGATZ6J Xu5K3S8O9Br79Bdh0XhUuXeBgc8GB0Lmsci7F+UIf7FiFT35TZZIbftrGCKAc0O+Xmoi mTMGe94uJOfQdjRZ06IhEvl1idibVLpwVNLAgGHTXiQt2hMFajg64z+xP1hSwmIW9Y5x EWPA== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:references:cc:to:subject :user-agent:mime-version:list-unsubscribe:list-subscribe:list-id :precedence:date:message-id:dkim-signature; bh=PZKp5Almpu55EEXLxa5YMez+rRWeNGVSQ+CadfPuu7s=; fh=R2LJav7NUBRX2CoWPDckhn3zRz8dtV6E4Qed6PygHq8=; b=dBRNu1mjqu+cJBkw4VZWOLFMrEyr4wBsRvVSer3lbWHwf3GOzGb3D8q1Pkv80FDeaU WBjnIo8y3X+PFHz/7To4wMfgRscbaEmXUc9fos7gMHdwbLsAWJ5FBaDElmFOm48u2CxS fTMTD6RjtmU5yF0qe3TlAIGNaeBOOGRXot3h8riKJp7kxLYsM6WU+n2R7wPkdv/d9yZ7 Wj324TXwiXbFdP99+25WvQC52p5FTxcS4xjdoZyQ3KbnoTC7s2z9/4pioXQ4oOZSVnjA OsokFtEhOsd7Gt8mpaXV2RUxZDoFAfSKSU9IS3QM7NhQcp63X2/j1FBEa8hN2UjhA6nY QxoQ==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@linux.alibaba.com header.s=default header.b="w/GlcZhW"; arc=pass (i=1 spf=pass spfdomain=linux.alibaba.com dkim=pass dkdomain=linux.alibaba.com dmarc=pass fromdomain=linux.alibaba.com); spf=pass (google.com: domain of linux-kernel+bounces-72193-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) smtp.mailfrom="linux-kernel+bounces-72193-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.alibaba.com Return-Path: Received: from am.mirrors.kernel.org (am.mirrors.kernel.org. [147.75.80.249]) by mx.google.com with ESMTPS id q8-20020a056402518800b0056445f4ade6si2111229edd.234.2024.02.19.17.55.44 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 19 Feb 2024 17:55:44 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-72193-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) client-ip=147.75.80.249; Authentication-Results: mx.google.com; dkim=pass header.i=@linux.alibaba.com header.s=default header.b="w/GlcZhW"; arc=pass (i=1 spf=pass spfdomain=linux.alibaba.com dkim=pass dkdomain=linux.alibaba.com dmarc=pass fromdomain=linux.alibaba.com); spf=pass (google.com: domain of linux-kernel+bounces-72193-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) smtp.mailfrom="linux-kernel+bounces-72193-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.alibaba.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by am.mirrors.kernel.org (Postfix) with ESMTPS id 95A001F21DBB for ; Tue, 20 Feb 2024 01:55:43 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 86F2B3B2A8; Tue, 20 Feb 2024 01:55:32 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=linux.alibaba.com header.i=@linux.alibaba.com header.b="w/GlcZhW" Received: from out30-111.freemail.mail.aliyun.com (out30-111.freemail.mail.aliyun.com [115.124.30.111]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 7E85C11CA1; Tue, 20 Feb 2024 01:55:27 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=115.124.30.111 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1708394131; cv=none; b=cJcaJOADxNIqbEcUTECI6kofWawG0cXiedeKxz8TosNzQY6TtKN19nu06fjrYR9aQwtaF5bhp7j+FmvQpbRb6q7BOii3JefgHZudtkPSQyODMlqcnqC3dRuMGO349nGSBAcKgBGc55esmjLZHhytqRYb2lv7eum+i+Rbz7NvtQ8= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1708394131; c=relaxed/simple; bh=bRUVcLp0mVBd3IHB74HQVByUPb0l0kuvm2r+vEVwZYc=; h=Message-ID:Date:MIME-Version:Subject:To:Cc:References:From: In-Reply-To:Content-Type; b=X4vLTmZ7O4R3NDtn2aUhAFP4faKLVl6sDsdwkysS5Ii8aanqBQlXAYOB5Z0fHBg3N0k0QA4pO/RoUXvLycO7GWx7cO46U+WgSZtwK6GCdNGMBanJAu5xRSCT7i2OtQaLTGn4wFn7AZZjNMqaa3I3eBKFLlkvHAjNWK21S6LlTDs= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.alibaba.com; spf=pass smtp.mailfrom=linux.alibaba.com; dkim=pass (1024-bit key) header.d=linux.alibaba.com header.i=@linux.alibaba.com header.b=w/GlcZhW; arc=none smtp.client-ip=115.124.30.111 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.alibaba.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=linux.alibaba.com DKIM-Signature:v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.alibaba.com; s=default; t=1708394119; h=Message-ID:Date:MIME-Version:Subject:To:From:Content-Type; bh=PZKp5Almpu55EEXLxa5YMez+rRWeNGVSQ+CadfPuu7s=; b=w/GlcZhWXlH6BF38RelJrJUyUtUFANfJZQ9BEiSrb1T+Tdkx/oUgUvdibil10EGsKapukdXFhpzs2B4C+120enIEJeeDykW8Pra7dHkNDtZBiQV7S3KzHh97Y545rEJtdVVT0k/Ve98ZOXHyt22LZ5tFI6c/VstjwnC0ekggh78= X-Alimail-AntiSpam:AC=PASS;BC=-1|-1;BR=01201311R591e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018045192;MF=guwen@linux.alibaba.com;NM=1;PH=DS;RN=18;SR=0;TI=SMTPD_---0W0uqlG1_1708394117; Received: from 30.221.128.233(mailfrom:guwen@linux.alibaba.com fp:SMTPD_---0W0uqlG1_1708394117) by smtp.aliyun-inc.com; Tue, 20 Feb 2024 09:55:18 +0800 Message-ID: Date: Tue, 20 Feb 2024 09:55:16 +0800 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH net-next 06/15] net/smc: implement DMB-related operations of loopback-ism To: Wenjia Zhang , wintera@linux.ibm.com, hca@linux.ibm.com, gor@linux.ibm.com, agordeev@linux.ibm.com, davem@davemloft.net, edumazet@google.com, kuba@kernel.org, pabeni@redhat.com, jaka@linux.ibm.com, Gerd Bayer Cc: borntraeger@linux.ibm.com, svens@linux.ibm.com, alibuda@linux.alibaba.com, tonylu@linux.alibaba.com, linux-s390@vger.kernel.org, netdev@vger.kernel.org, linux-kernel@vger.kernel.org References: <20240111120036.109903-1-guwen@linux.alibaba.com> <20240111120036.109903-7-guwen@linux.alibaba.com> From: Wen Gu In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit On 2024/2/16 22:13, Wenjia Zhang wrote: > > > On 11.01.24 13:00, Wen Gu wrote: >> This implements DMB (un)registration and data move operations of >> loopback-ism device. >> >> Signed-off-by: Wen Gu >> --- >>   net/smc/smc_cdc.c      |   6 ++ >>   net/smc/smc_cdc.h      |   1 + >>   net/smc/smc_loopback.c | 133 ++++++++++++++++++++++++++++++++++++++++- >>   net/smc/smc_loopback.h |  13 ++++ >>   4 files changed, 150 insertions(+), 3 deletions(-) >> >> diff --git a/net/smc/smc_cdc.c b/net/smc/smc_cdc.c >> index 3c06625ceb20..c820ef197610 100644 >> --- a/net/smc/smc_cdc.c >> +++ b/net/smc/smc_cdc.c >> @@ -410,6 +410,12 @@ static void smc_cdc_msg_recv(struct smc_sock *smc, struct smc_cdc_msg *cdc) >>   static void smcd_cdc_rx_tsklet(struct tasklet_struct *t) >>   { >>       struct smc_connection *conn = from_tasklet(conn, t, rx_tsklet); >> + >> +    smcd_cdc_rx_handler(conn); >> +} >> + >> +void smcd_cdc_rx_handler(struct smc_connection *conn) >> +{ >>       struct smcd_cdc_msg *data_cdc; >>       struct smcd_cdc_msg cdc; >>       struct smc_sock *smc; >> diff --git a/net/smc/smc_cdc.h b/net/smc/smc_cdc.h >> index 696cc11f2303..11559d4ebf2b 100644 >> --- a/net/smc/smc_cdc.h >> +++ b/net/smc/smc_cdc.h >> @@ -301,5 +301,6 @@ int smcr_cdc_msg_send_validation(struct smc_connection *conn, >>                    struct smc_wr_buf *wr_buf); >>   int smc_cdc_init(void) __init; >>   void smcd_cdc_rx_init(struct smc_connection *conn); >> +void smcd_cdc_rx_handler(struct smc_connection *conn); >>   #endif /* SMC_CDC_H */ >> diff --git a/net/smc/smc_loopback.c b/net/smc/smc_loopback.c >> index 353d4a2d69a1..f72e7b24fc1a 100644 >> --- a/net/smc/smc_loopback.c >> +++ b/net/smc/smc_loopback.c >> @@ -15,11 +15,13 @@ >>   #include >>   #include >> +#include "smc_cdc.h" >>   #include "smc_ism.h" >>   #include "smc_loopback.h" >>   #if IS_ENABLED(CONFIG_SMC_LO) >>   #define SMC_LO_V2_CAPABLE    0x1 /* loopback-ism acts as ISMv2 */ >> +#define SMC_DMA_ADDR_INVALID    (~(dma_addr_t)0) >>   static const char smc_lo_dev_name[] = "loopback-ism"; >>   static struct smc_lo_dev *lo_dev; >> @@ -50,6 +52,97 @@ static int smc_lo_query_rgid(struct smcd_dev *smcd, struct smcd_gid *rgid, >>       return 0; >>   } >> +static int smc_lo_register_dmb(struct smcd_dev *smcd, struct smcd_dmb *dmb, >> +                   void *client_priv) >> +{ >> +    struct smc_lo_dmb_node *dmb_node, *tmp_node; >> +    struct smc_lo_dev *ldev = smcd->priv; >> +    int sba_idx, order, rc; >> +    struct page *pages; >> + >> +    /* check space for new dmb */ >> +    for_each_clear_bit(sba_idx, ldev->sba_idx_mask, SMC_LO_MAX_DMBS) { >> +        if (!test_and_set_bit(sba_idx, ldev->sba_idx_mask)) >> +            break; >> +    } >> +    if (sba_idx == SMC_LO_MAX_DMBS) >> +        return -ENOSPC; >> + >> +    dmb_node = kzalloc(sizeof(*dmb_node), GFP_KERNEL); >> +    if (!dmb_node) { >> +        rc = -ENOMEM; >> +        goto err_bit; >> +    } >> + >> +    dmb_node->sba_idx = sba_idx; >> +    order = get_order(dmb->dmb_len); >> +    pages = alloc_pages(GFP_KERNEL | __GFP_NOWARN | >> +                __GFP_NOMEMALLOC | __GFP_COMP | >> +                __GFP_NORETRY | __GFP_ZERO, >> +                order); >> +    if (!pages) { >> +        rc = -ENOMEM; >> +        goto err_node; >> +    } >> +    dmb_node->cpu_addr = (void *)page_address(pages); >> +    dmb_node->len = dmb->dmb_len; >> +    dmb_node->dma_addr = SMC_DMA_ADDR_INVALID; >> + >> +again: >> +    /* add new dmb into hash table */ >> +    get_random_bytes(&dmb_node->token, sizeof(dmb_node->token)); >> +    write_lock(&ldev->dmb_ht_lock); >> +    hash_for_each_possible(ldev->dmb_ht, tmp_node, list, dmb_node->token) { >> +        if (tmp_node->token == dmb_node->token) { >> +            write_unlock(&ldev->dmb_ht_lock); >> +            goto again; >> +        } >> +    } >> +    hash_add(ldev->dmb_ht, &dmb_node->list, dmb_node->token); >> +    write_unlock(&ldev->dmb_ht_lock); >> + > The write_lock_irqsave()/write_unlock_irqrestore() and read_lock_irqsave()/read_unlock_irqrestore()should be used > instead of write_lock()/write_unlock() and read_lock()/read_unlock() in order to keep the lock irq-safe. > dmb_ht_lock won't be hold in an interrupt or sockirq context. The dmb_{register|unregister}, dmb_{attach|detach} and data_move are all on the process context. So I think write_(un)lock and read_(un)lock is safe here. >> +    dmb->sba_idx = dmb_node->sba_idx; >> +    dmb->dmb_tok = dmb_node->token; >> +    dmb->cpu_addr = dmb_node->cpu_addr; >> +    dmb->dma_addr = dmb_node->dma_addr; >> +    dmb->dmb_len = dmb_node->len; >> + >> +    return 0; >> + >> +err_node: >> +    kfree(dmb_node); >> +err_bit: >> +    clear_bit(sba_idx, ldev->sba_idx_mask); >> +    return rc; >> +} >> + >> +static int smc_lo_unregister_dmb(struct smcd_dev *smcd, struct smcd_dmb *dmb) >> +{ >> +    struct smc_lo_dmb_node *dmb_node = NULL, *tmp_node; >> +    struct smc_lo_dev *ldev = smcd->priv; >> + >> +    /* remove dmb from hash table */ >> +    write_lock(&ldev->dmb_ht_lock); >> +    hash_for_each_possible(ldev->dmb_ht, tmp_node, list, dmb->dmb_tok) { >> +        if (tmp_node->token == dmb->dmb_tok) { >> +            dmb_node = tmp_node; >> +            break; >> +        } >> +    } >> +    if (!dmb_node) { >> +        write_unlock(&ldev->dmb_ht_lock); >> +        return -EINVAL; >> +    } >> +    hash_del(&dmb_node->list); >> +    write_unlock(&ldev->dmb_ht_lock); >> + >> +    clear_bit(dmb_node->sba_idx, ldev->sba_idx_mask); >> +    kfree(dmb_node->cpu_addr); >> +    kfree(dmb_node); >> + >> +    return 0; >> +} >> + >>   static int smc_lo_add_vlan_id(struct smcd_dev *smcd, u64 vlan_id) >>   { >>       return -EOPNOTSUPP; >> @@ -76,6 +169,38 @@ static int smc_lo_signal_event(struct smcd_dev *dev, struct smcd_gid *rgid, >>       return 0; >>   } >> +static int smc_lo_move_data(struct smcd_dev *smcd, u64 dmb_tok, >> +                unsigned int idx, bool sf, unsigned int offset, >> +                void *data, unsigned int size) >> +{ >> +    struct smc_lo_dmb_node *rmb_node = NULL, *tmp_node; >> +    struct smc_lo_dev *ldev = smcd->priv; >> + >> +    read_lock(&ldev->dmb_ht_lock); >> +    hash_for_each_possible(ldev->dmb_ht, tmp_node, list, dmb_tok) { >> +        if (tmp_node->token == dmb_tok) { >> +            rmb_node = tmp_node; >> +            break; >> +        } >> +    } >> +    if (!rmb_node) { >> +        read_unlock(&ldev->dmb_ht_lock); >> +        return -EINVAL; >> +    } >> +    read_unlock(&ldev->dmb_ht_lock); >> + >> +    memcpy((char *)rmb_node->cpu_addr + offset, data, size); >> + > > Should this read_unlock be placed behind memcpy()? > dmb_ht_lock is used to ensure safe access to the DMB hash table of loopback-ism. The DMB hash table could be accessed by all the connections on loopback-ism, so it should be protected. But a certain DMB is only used by one connection, and the move_data process is protected by conn->send_lock (see smcd_tx_sndbuf_nonempty()), so the memcpy(rmb_node) here is safe and no race with other. Thanks! > <...>