Received: by 2002:a05:7412:d8a:b0:e2:908c:2ebd with SMTP id b10csp2910525rdg; Mon, 16 Oct 2023 20:49:22 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEq9hgSCFv6QuUKOF8n8ds/PT8H3ql1kX1QYxsiZYXg7XZh0zS52eQhWthZTt/Eib8j5M62 X-Received: by 2002:a05:6358:9f94:b0:166:a6e3:dbe6 with SMTP id fy20-20020a0563589f9400b00166a6e3dbe6mr1528111rwb.3.1697514561968; Mon, 16 Oct 2023 20:49:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1697514561; cv=none; d=google.com; s=arc-20160816; b=ltb+xtMypyFl11GadEHDUShlWI6RRC2N9NrDk6yEST7xITusq3GWyW0TWLK4uNex+C Vl1DzHMVwxlmackWW9U23QTXJuCOE+RO8FtNE+qtTR/bZjPo9y8wvwMcYx8vSeNOnh1d pATe4dxCkkO0QfE623yGwcaFJe1bfoK+N9fXMxZg94m+TgWy6ns7FEFhb4hpwjcWax9A 01uDlbAWLxcE/RD/M4he4axtgbBKRm/IioFx+9iPu1AF9jHWI0NrWK4+iQZeKD8nRMWj YThZBanaZUaYrhZirG4yZ4rCAOnz/xP6LDOvH1IndjHKOh0po8pkWr7GviIzmT9rWXdA ZIiw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:references :cc:to:from:subject:user-agent:mime-version:date:message-id; bh=ieJxDscOt6tfSE1Mg1tJsr0vas9lZRLthlZ6nos92pU=; fh=CDnVxfjYxEe4hIAMsX4+JEorjyrJNmrnosuD/iewpDQ=; b=h1h1tATBb7/jjMtZ8o3flHqgxHcVMQOCrCI8Gnks7JDW7+aN+l3EbUmTM5nnko1XUh ywQQKpd0ms7CtPnbbS1A/8mvLd+/2Oia62dv/Z75MnjzW+fPFghaoz5tmi5haA66uIgt WEZKELxwq5JeuYfTAMLeO18AxkGjYWOSxTuG/AgydWHVyZhlHrdYBtip6zQaqThx/XRI 9oXZeDLKJIriCzuMGnfUG6rarMscau0d1iChwYb5BN0bkrjuMKKQ6UCLgqUbfNL1ouGp RERM2SNzJiXbA6oaKW7KnHajpH68bUhPAJlgvfFPQfEEZSs4q7P1W5GTJhUg+tfdNRsn pyUg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:2 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from agentk.vger.email (agentk.vger.email. [2620:137:e000::3:2]) by mx.google.com with ESMTPS id n15-20020a6546cf000000b005859c255ce8si864672pgr.819.2023.10.16.20.49.21 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 16 Oct 2023 20:49:21 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:2 as permitted sender) client-ip=2620:137:e000::3:2; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:2 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by agentk.vger.email (Postfix) with ESMTP id 6EF8E803FF92; Mon, 16 Oct 2023 20:49:19 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at agentk.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234070AbjJQDtN (ORCPT + 99 others); Mon, 16 Oct 2023 23:49:13 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:57200 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231856AbjJQDtM (ORCPT ); Mon, 16 Oct 2023 23:49:12 -0400 Received: from out30-132.freemail.mail.aliyun.com (out30-132.freemail.mail.aliyun.com [115.124.30.132]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A67BE83; Mon, 16 Oct 2023 20:49:09 -0700 (PDT) X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R701e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018046050;MF=guwen@linux.alibaba.com;NM=1;PH=DS;RN=17;SR=0;TI=SMTPD_---0VuLP0Gd_1697514545; Received: from 30.221.128.209(mailfrom:guwen@linux.alibaba.com fp:SMTPD_---0VuLP0Gd_1697514545) by smtp.aliyun-inc.com; Tue, 17 Oct 2023 11:49:06 +0800 Message-ID: <49847786-9914-b615-56d6-f39fbc6e03c2@linux.alibaba.com> Date: Tue, 17 Oct 2023 11:49:04 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.15.1 Subject: Re: [PATCH net-next v4 00/18] net/smc: implement virtual ISM extension and loopback-ism From: Wen Gu To: Niklas Schnelle , kgraul@linux.ibm.com, wenjia@linux.ibm.com, jaka@linux.ibm.com, davem@davemloft.net, edumazet@google.com, kuba@kernel.org, pabeni@redhat.com Cc: wintera@linux.ibm.com, gbayer@linux.ibm.com, pasic@linux.ibm.com, alibuda@linux.alibaba.com, tonylu@linux.alibaba.com, dust.li@linux.alibaba.com, linux-s390@vger.kernel.org, netdev@vger.kernel.org, linux-kernel@vger.kernel.org References: <1695568613-125057-1-git-send-email-guwen@linux.alibaba.com> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-4.1 required=5.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,NICE_REPLY_A,SPF_HELO_NONE,SPF_PASS, UNPARSEABLE_RELAY autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on agentk.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (agentk.vger.email [0.0.0.0]); Mon, 16 Oct 2023 20:49:19 -0700 (PDT) On 2023/10/8 15:19, Wen Gu wrote: > > > On 2023/10/5 16:21, Niklas Schnelle wrote: > >> >> Hi Wen Gu, >> >> I've been trying out your series with iperf3, qperf, and uperf on >> s390x. I'm using network namespaces with a ConnectX VF from the same >> card in each namespace for the initial TCP/IP connection i.e. initially >> it goes out to a real NIC even if that can switch internally. All of >> these look great for streaming workloads both in terms of performance >> and stability. With a Connect-Request-Response workload and uperf >> however I've run into issues. The test configuration I use is as >> follows: >> >> Client Command: >> >> # host=$ip_server ip netns exec client smc_run uperf -m tcp_crr.xml >> >> Server Command: >> >> # ip netns exec server smc_run uperf -s &> /dev/null >> >> Uperf tcp_crr.xml: >> >> >> >>          >>                  >>                          >>                          >>                          >>                          >>                  >>          >> >> >> The workload first runs fine but then after about 4 GB of data >> transferred fails with "Connection refused" and "Connection reset by >> peer" errors. The failure is not permanent however and re-running >> the streaming workloads run fine again (with both uperf server and >> client restarted). So I suspect something gets stuck in either the >> client or server sockets. The same workload runs fine with TCP/IP of >> course. >> >> Thanks, >> Niklas >> >> > > Hi Niklas, > > Thank you very much for the test. With the test example you provided, I've > reproduced the issue in my VM. And moreover, sometimes the test complains > with 'Error saying goodbye with ' > > I'll figure out what's going on here. > > Thanks! > Wen Gu I think that there is a common issue for SMC-R and SMC-D. I also reproduce 'connection reset by peer' and 'Error saying goodbye with ' when using SMC-R under the same test condition. They occur at the end of the test. When the uperf test time ends, some signals are sent. At this point there are usually some SMC connections doing CLC handshake. I catch some -EINTR(-4) in client and -ECONNRESET(-104) in server returned from smc_clc_wait_msg, (correspondingly handshake error counts also increase) and TCP RST packets sent to terminate the CLC TCP connection(clcsock). I am not sure if this should be considered as a bydesign or a bug of SMC. From an application perspective, the conn reset behavior only happens when using SMC. @Wenjia, could you please take a look at this? Thanks, Wen Gu