Received: by 2002:a05:6358:701b:b0:131:369:b2a3 with SMTP id 27csp626286rwo; Fri, 21 Jul 2023 18:50:34 -0700 (PDT) X-Google-Smtp-Source: APBJJlGm0cq+7pG0J+g4pSA94iwlAm+pjXNU1YchPM5oQN3hH9SfJd/Sk7aVy1ZBu5dIfvGdrDZ0 X-Received: by 2002:a05:6808:30d:b0:3a3:6f89:5de1 with SMTP id i13-20020a056808030d00b003a36f895de1mr4267250oie.49.1689990634583; Fri, 21 Jul 2023 18:50:34 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689990634; cv=none; d=google.com; s=arc-20160816; b=K4dJagMw90jfy3F4F0syovH4tY2IWHlkGuaO49d0Bpq1AxQBbizpQdVvLMG2rr76RX gDH5nkuROKbeiWLLKxFU44I9r2MsE0aLbAA1f6QlR7nb0G40R2NQXAcimsq1Hnyi/roH efVvV0UM3iMpOpOo1IyzDSihHwTFLf+TqgmUJfdgiXeKDdWLcZwaVn4pKc1yrqR8A9Yf MDYkJ0KK3rOSvz6iUip7dc3TYdOs0agKynH0xSrhGEj6LyVGsHXguDQiKyI5yaC918UH HlaaazA7inhSnzGqLhybrvWJYmFfT4Cz6nFhDEWBkCL0+lrHtXa6DKGj5K7QJoOxPKI3 d7mQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:references :cc:to:from:content-language:subject:user-agent:mime-version:date :message-id:dkim-signature:dkim-signature; bh=0uv6PROEPdk4v+oC0SzHPh8GcCQnYhhGtPEobTIVpdI=; fh=Q/SD1Kd4radrKudgdjEiR0YRLtzulTIY/7CGMJvVRxc=; b=FE9LUtZjCBedxcgStix234XaATFaJTAfPbb8uC5Yq/q81j1Zuj2RqXjH+eFdxyTQx0 CyI9qk0mgSJLVMO6dxdlducqEqdGjEd5kHpZLvjvYjuVaSyrrnX4MC+1Z8PU86bpeiPg cOi3kOj+oLhZVDq4awYbnt1EOOMdXQKkjxr8YMb50AWG/9FGB4JM/pBvpG1DxZSlPnGL jvrrC3MDe+BsY8V4xpLi7PqaGXx1v0DmmsfTosVCNmmSpuYuTW0UqCSrODnUDt6KZrlX ytqHINb0Yoo+yBfLatHY608SFt05bUD8mcyuFgMmwDcyI6443xah4SCXFd3NEMy8QsPl iKkQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@alu.unizg.hr header.s=mail header.b=UQIMmCib; dkim=fail header.i=@alu.unizg.hr header.s=mail header.b=aq9pvsqr; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alu.unizg.hr Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id a62-20020a639041000000b0055b28f930a3si4455694pge.546.2023.07.21.18.50.22; Fri, 21 Jul 2023 18:50:34 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=fail header.i=@alu.unizg.hr header.s=mail header.b=UQIMmCib; dkim=fail header.i=@alu.unizg.hr header.s=mail header.b=aq9pvsqr; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alu.unizg.hr Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230230AbjGVAzH (ORCPT + 99 others); Fri, 21 Jul 2023 20:55:07 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59160 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229534AbjGVAzF (ORCPT ); Fri, 21 Jul 2023 20:55:05 -0400 Received: from domac.alu.hr (domac.alu.unizg.hr [IPv6:2001:b68:2:2800::3]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2F35C2704; Fri, 21 Jul 2023 17:55:03 -0700 (PDT) Received: from localhost (localhost [127.0.0.1]) by domac.alu.hr (Postfix) with ESMTP id E09BC6016E; Sat, 22 Jul 2023 02:55:01 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=alu.unizg.hr; s=mail; t=1689987301; bh=YaLq25haSxSDHd97nw2meIEN+HYydB/cvPOZ+ZMccX0=; h=Date:Subject:From:To:Cc:References:In-Reply-To:From; b=UQIMmCibIbnUKQ52ndtiAVOH6EsWykUCJHh4GLc/MHpIFIdB1FX6GMYEgXhDL/5cV h8AgAuLoq5tfIm+V9waXZv7h00timRZmqXwsDtxjRLVaFtXBausXWbotD3ha54XYxf uAL1YVmBSGoE/2phBpXGW7WuZ5X7k7nVjZ9t61YmYs/RPCGIQd9xJd0W9HMJVIEbUw j5ypyT+YknGwTrDy8rLONTrYfsAWcBsonmc6b9h8El+5smCFoiFBWkUdFmmj69RzNS 1MzFsjiKr5jnI8l0+fndei7XdU2/kH/eo+Iy212gnvGUTSBcxHERtSOCObr2aYqmLG ilFi/NiyPxJCQ== X-Virus-Scanned: Debian amavisd-new at domac.alu.hr Received: from domac.alu.hr ([127.0.0.1]) by localhost (domac.alu.hr [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id kbXq_SrhBLUc; Sat, 22 Jul 2023 02:54:58 +0200 (CEST) Received: from [192.168.1.6] (unknown [94.250.191.183]) by domac.alu.hr (Postfix) with ESMTPSA id 8540D6015F; Sat, 22 Jul 2023 02:54:44 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=alu.unizg.hr; s=mail; t=1689987298; bh=YaLq25haSxSDHd97nw2meIEN+HYydB/cvPOZ+ZMccX0=; h=Date:Subject:From:To:Cc:References:In-Reply-To:From; b=aq9pvsqrLRTenwgmmxgqccRprH4uSwEMwYsb/b67zkkfkAK3GsuoOtbtKDD66iWTK +b0HKsflGUhiUcHV6RAaWFQYSsGq8Wr5/pkSzMdg4vsKZcYpCga9lGYnVHc7J6Ibgd b5elweFA6bP2nyaIMGWDM95L9oDV+8z0ee2+alnWupuhk0YpilosbaCnKgN9h+0xM5 /vfoFUVkOLaWcZa4v+o/DezIwgg0SN5L7WwhzzPO34X6NK/hFohPz80663LJbyio3w YpEOKnJh2PtE4VDBSz9D1xfz7mPstVeO870eupEX1tqGCbrvWRitzX69THohv3oxsu AjtzunnGrqwfQ== Message-ID: Date: Sat, 22 Jul 2023 02:54:44 +0200 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.13.0 Subject: Re: [PROBLEM] [ADDITIONAL DIAG] seltests: net/forwarding/sch_ets.sh [HANG] Content-Language: en-US From: Mirsad Todorovac To: Petr Machata Cc: netdev@vger.kernel.org, linux-kernel@vger.kernel.org, "David S. Miller" , Eric Dumazet , Jakub Kicinski , Paolo Abeni , Shuah Khan , linux-kselftest@vger.kernel.org, Ido Schimmel References: <759fe934-2e43-e9ff-8946-4fd579c09b05@alu.unizg.hr> <87cz0m9a3n.fsf@nvidia.com> <580b9f28-7a68-e618-b2d5-b8663386aa12@alu.unizg.hr> <6c871d89-390d-41ae-ef48-6cd12b99fd74@alu.unizg.hr> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,LOTS_OF_MONEY,NICE_REPLY_A, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE, URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 7/22/23 02:31, Mirsad Todorovac wrote: > On 7/20/23 18:25, Mirsad Todorovac wrote: >> On 7/20/23 18:07, Mirsad Todorovac wrote: >>> On 7/20/23 11:43, Petr Machata wrote: >>>> >>>> Mirsad Todorovac writes: >>>> >>>>> Using the same config for 6.5-rc2 on Ubuntu 22.04 LTS and 22.10, the execution >>>>> stop at the exact same line on both boxes (os I reckon it is more than an >>>>> accident): >>>>> >>>>> # selftests: net/forwarding: sch_ets.sh >>>>> # TEST: ping vlan 10                                                  [ OK ] >>>>> # TEST: ping vlan 11                                                  [ OK ] >>>>> # TEST: ping vlan 12                                                  [ OK ] >>>>> # Running in priomap mode >>>>> # Testing ets bands 3 strict 3, streams 0 1 >>>>> # TEST: band 0                                                        [ OK ] >>>>> # INFO: Expected ratio >95% Measured ratio 100.00 >>>>> # TEST: band 1                                                        [ OK ] >>>>> # INFO: Expected ratio <5% Measured ratio 0 >>>>> # Testing ets bands 3 strict 3, streams 1 2 >>>>> # TEST: band 1                                                        [ OK ] >>>>> # INFO: Expected ratio >95% Measured ratio 100.00 >>>>> # TEST: band 2                                                        [ OK ] >>>>> # INFO: Expected ratio <5% Measured ratio 0 >>>>> # Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 0 1 >>>>> # TEST: band 0                                                        [ OK ] >>>>> # INFO: Expected ratio >95% Measured ratio 100.00 >>>>> # TEST: band 1                                                        [ OK ] >>>>> # INFO: Expected ratio <5% Measured ratio 0 >>>>> # Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 1 2 >>>>> # TEST: bands 1:2                                                     [ OK ] >>>>> # INFO: Expected ratio 2.00 Measured ratio 1.99 >>>>> # Testing ets bands 3 quanta 3300 3300 3300, streams 0 1 2 >>>>> # TEST: bands 0:1                                                     [ OK ] >>>>> # INFO: Expected ratio 1.00 Measured ratio .99 >>>>> # TEST: bands 0:2                                                     [ OK ] >>>>> # INFO: Expected ratio 1.00 Measured ratio 1.00 >>>>> # Testing ets bands 3 quanta 5000 3500 1500, streams 0 1 2 >>>>> # TEST: bands 0:1                                                     [ OK ] >>>>> # INFO: Expected ratio 1.42 Measured ratio 1.42 >>>>> # TEST: bands 0:2                                                     [ OK ] >>>>> # INFO: Expected ratio 3.33 Measured ratio 3.33 >>>>> # Testing ets bands 3 quanta 5000 8000 1500, streams 0 1 2 >>>>> # TEST: bands 0:1                                                     [ OK ] >>>>> # INFO: Expected ratio 1.60 Measured ratio 1.59 >>>>> # TEST: bands 0:2                                                     [ OK ] >>>>> # INFO: Expected ratio 3.33 Measured ratio 3.33 >>>>> # Testing ets bands 2 quanta 5000 2500, streams 0 1 >>>>> # TEST: bands 0:1                                                     [ OK ] >>>>> # INFO: Expected ratio 2.00 Measured ratio 1.99 >>>>> # Running in classifier mode >>>>> # Testing ets bands 3 strict 3, streams 0 1 >>>>> # TEST: band 0                                                        [ OK ] >>>>> # INFO: Expected ratio >95% Measured ratio 100.00 >>>>> # TEST: band 1                                                        [ OK ] >>>>> # INFO: Expected ratio <5% Measured ratio 0 >>>>> # Testing ets bands 3 strict 3, streams 1 2 >>>>> # TEST: band 1                                                        [ OK ] >>>>> # INFO: Expected ratio >95% Measured ratio 100.00 >>>>> # TEST: band 2                                                        [ OK ] >>>>> # INFO: Expected ratio <5% Measured ratio 0 >>>>> # Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 0 1 >>>>> >>>>> I tried to run 'set -x' enabled version standalone, but that one finished >>>>> correctly (?). >>>>> >>>>> It could be something previous scripts left, but right now I don't have a clue. >>>>> I can attempt to rerun all tests with sch_ets.sh bash 'set -x' enabled later today. >>>> >>>> If you run it standalone without set -x, does it finish as well? >>> >>> Added that. Yes, standlone run finishes correctly, with or without 'set -x': >>> >>> root@defiant:/home/marvin/linux/kernel/linux_torvalds/tools/testing/selftests/net/forwarding# ./sch_ets.sh >>> TEST: ping vlan 10                                                  [ OK ] >>> TEST: ping vlan 11                                                  [ OK ] >>> TEST: ping vlan 12                                                  [ OK ] >>> Running in priomap mode >>> Testing ets bands 3 strict 3, streams 0 1 >>> TEST: band 0                                                        [ OK ] >>> INFO: Expected ratio >95% Measured ratio 100.00 >>> TEST: band 1                                                        [ OK ] >>> INFO: Expected ratio <5% Measured ratio 0 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 3 strict 3, streams 1 2 >>> TEST: band 1                                                        [ OK ] >>> INFO: Expected ratio >95% Measured ratio 100.00 >>> TEST: band 2                                                        [ OK ] >>> INFO: Expected ratio <5% Measured ratio 0 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 0 1 >>> TEST: band 0                                                        [ OK ] >>> INFO: Expected ratio >95% Measured ratio 100.00 >>> TEST: band 1                                                        [ OK ] >>> INFO: Expected ratio <5% Measured ratio 0 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 1 2 >>> TEST: bands 1:2                                                     [ OK ] >>> INFO: Expected ratio 2.00 Measured ratio 1.99 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 3 quanta 3300 3300 3300, streams 0 1 2 >>> TEST: bands 0:1                                                     [ OK ] >>> INFO: Expected ratio 1.00 Measured ratio 1.00 >>> TEST: bands 0:2                                                     [ OK ] >>> INFO: Expected ratio 1.00 Measured ratio .99 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 3 quanta 5000 3500 1500, streams 0 1 2 >>> TEST: bands 0:1                                                     [ OK ] >>> INFO: Expected ratio 1.42 Measured ratio 1.42 >>> TEST: bands 0:2                                                     [ OK ] >>> INFO: Expected ratio 3.33 Measured ratio 3.33 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 3 quanta 5000 8000 1500, streams 0 1 2 >>> TEST: bands 0:1                                                     [ OK ] >>> INFO: Expected ratio 1.60 Measured ratio 1.59 >>> TEST: bands 0:2                                                     [ OK ] >>> INFO: Expected ratio 3.33 Measured ratio 3.33 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 2 quanta 5000 2500, streams 0 1 >>> TEST: bands 0:1                                                     [ OK ] >>> INFO: Expected ratio 2.00 Measured ratio 1.99 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Running in classifier mode >>> Testing ets bands 3 strict 3, streams 0 1 >>> TEST: band 0                                                        [ OK ] >>> INFO: Expected ratio >95% Measured ratio 100.00 >>> TEST: band 1                                                        [ OK ] >>> INFO: Expected ratio <5% Measured ratio 0 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 3 strict 3, streams 1 2 >>> TEST: band 1                                                        [ OK ] >>> INFO: Expected ratio >95% Measured ratio 100.00 >>> TEST: band 2                                                        [ OK ] >>> INFO: Expected ratio <5% Measured ratio 0 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 0 1 >>> TEST: band 0                                                        [ OK ] >>> INFO: Expected ratio >95% Measured ratio 100.00 >>> TEST: band 1                                                        [ OK ] >>> INFO: Expected ratio <5% Measured ratio 0 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 1 2 >>> TEST: bands 1:2                                                     [ OK ] >>> INFO: Expected ratio 2.00 Measured ratio 1.99 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 3 quanta 3300 3300 3300, streams 0 1 2 >>> TEST: bands 0:1                                                     [ OK ] >>> INFO: Expected ratio 1.00 Measured ratio .99 >>> TEST: bands 0:2                                                     [ OK ] >>> INFO: Expected ratio 1.00 Measured ratio .99 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 3 quanta 5000 3500 1500, streams 0 1 2 >>> TEST: bands 0:1                                                     [ OK ] >>> INFO: Expected ratio 1.42 Measured ratio 1.42 >>> TEST: bands 0:2                                                     [ OK ] >>> INFO: Expected ratio 3.33 Measured ratio 3.33 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 3 quanta 5000 8000 1500, streams 0 1 2 >>> TEST: bands 0:1                                                     [ OK ] >>> INFO: Expected ratio 1.60 Measured ratio 1.60 >>> TEST: bands 0:2                                                     [ OK ] >>> INFO: Expected ratio 3.33 Measured ratio 3.33 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> Testing ets bands 2 quanta 5000 2500, streams 0 1 >>> TEST: bands 0:1                                                     [ OK ] >>> INFO: Expected ratio 2.00 Measured ratio 2.00 >>> killing MZ >>> killed MZ >>> killing MZ >>> killed MZ >>> root@defiant:/home/marvin/linux/kernel/linux_torvalds/tools/testing/selftests/net/forwarding# >>> >>>> That would imply that the reproducer needs to include the previous tests as >>>> well. >>> >>> This is entirely possible, as timeouts and CTRL+C events do not seem to be caught >>> and the cleanup is not done ... >>> >>> sch_ets_core.sh:    trap cleanup EXIT >>> >>> Some tests time out even after settings:timeout=240, so IMHO this should be taken into account. >>> >>> Best regards, >>> Mirsad Todorovac >>> >>>> It looks like the test is stuck in ets_test_mixed in classifier_mode. >>>> A way to run just this test would be: >>>> >>>>      TESTS="classifier_mode ets_test_mixed" ./sch_ets.sh >>>> >>>> Looking at the code, the only place that I can see that waits on >>>> anything is the "{ kill %% && wait %%; } 2>/dev/null" line in >>>> stop_traffic() (in lib.sh). Maybe something like this would let >>>> us see if that's the case: >>>> >>>> modified   tools/testing/selftests/net/forwarding/lib.sh >>>> @@ -1468,8 +1470,10 @@ start_tcp_traffic() >>>>   stop_traffic() >>>>   { >>>> +    echo killing MZ >>>>       # Suppress noise from killing mausezahn. >>>>       { kill %% && wait %%; } 2>/dev/null >>>> +    echo killed MZ >>>>   } >> >> FYI, this is the [incomplete] list of the tests that time out even after >> being assigned long timeout of 240s instead of default. >> >> marvin@defiant:~/linux/kernel/linux_torvalds$ grep TIMEOUT ../kselftest-6.5-rc2-net-forwarding-8.log >> not ok 13 selftests: net/forwarding: custom_multipath_hash.sh # TIMEOUT 240 seconds >> not ok 18 selftests: net/forwarding: gre_custom_multipath_hash.sh # TIMEOUT 240 seconds >> not ok 19 selftests: net/forwarding: gre_inner_v4_multipath.sh # TIMEOUT 240 seconds >> not ok 21 selftests: net/forwarding: gre_multipath_nh_res.sh # TIMEOUT 240 seconds >> not ok 22 selftests: net/forwarding: gre_multipath_nh.sh # TIMEOUT 240 seconds >> not ok 27 selftests: net/forwarding: ip6gre_custom_multipath_hash.sh # TIMEOUT 240 seconds >> not ok 34 selftests: net/forwarding: ip6gre_inner_v4_multipath.sh # TIMEOUT 240 seconds >> not ok 58 selftests: net/forwarding: no_forwarding.sh # TIMEOUT 240 seconds >> not ok 67 selftests: net/forwarding: router_mpath_nh_res.sh # TIMEOUT 240 seconds >> not ok 68 selftests: net/forwarding: router_mpath_nh.sh # TIMEOUT 240 seconds >> not ok 70 selftests: net/forwarding: router_multipath.sh # TIMEOUT 240 seconds >> marvin@defiant:~/linux/kernel/linux_torvalds$ > > The test seem to be stuck despite adding SIGTERM cleanup to all tests > that timed out: > > # selftests: net/forwarding: sch_ets.sh > # TEST: ping vlan 10                                                  [ OK ] > # TEST: ping vlan 11                                                  [ OK ] > # TEST: ping vlan 12                                                  [ OK ] > # Running in priomap mode > # Testing ets bands 3 strict 3, streams 0 1 > # TEST: band 0                                                        [ OK ] > # INFO: Expected ratio >95% Measured ratio 100.00 > # TEST: band 1                                                        [ OK ] > # INFO: Expected ratio <5% Measured ratio 0 > # Testing ets bands 3 strict 3, streams 1 2 > # TEST: band 1                                                        [ OK ] > # INFO: Expected ratio >95% Measured ratio 100.00 > # TEST: band 2                                                        [ OK ] > # INFO: Expected ratio <5% Measured ratio 0 > # Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 0 1 > # TEST: band 0                                                        [ OK ] > # INFO: Expected ratio >95% Measured ratio 100.00 > # TEST: band 1                                                        [ OK ] > # INFO: Expected ratio <5% Measured ratio 0 > # Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 1 2 > # TEST: bands 1:2                                                     [ OK ] > # INFO: Expected ratio 2.00 Measured ratio 2.00 > # Testing ets bands 3 quanta 3300 3300 3300, streams 0 1 2 > # TEST: bands 0:1                                                     [ OK ] > # INFO: Expected ratio 1.00 Measured ratio .99 > # TEST: bands 0:2                                                     [ OK ] > # INFO: Expected ratio 1.00 Measured ratio .99 > # Testing ets bands 3 quanta 5000 3500 1500, streams 0 1 2 > # TEST: bands 0:1                                                     [ OK ] > # INFO: Expected ratio 1.42 Measured ratio 1.42 > # TEST: bands 0:2                                                     [ OK ] > # INFO: Expected ratio 3.33 Measured ratio 3.33 > # Testing ets bands 3 quanta 5000 8000 1500, streams 0 1 2 > # TEST: bands 0:1                                                     [ OK ] > # INFO: Expected ratio 1.60 Measured ratio 1.59 > # TEST: bands 0:2                                                     [ OK ] > # INFO: Expected ratio 3.33 Measured ratio 3.33 > # Testing ets bands 2 quanta 5000 2500, streams 0 1 > # TEST: bands 0:1                                                     [ OK ] > # INFO: Expected ratio 2.00 Measured ratio 1.99 > # Running in classifier mode > # Testing ets bands 3 strict 3, streams 0 1 > # TEST: band 0                                                        [ OK ] > # INFO: Expected ratio >95% Measured ratio 100.00 > # TEST: band 1                                                        [ OK ] > # INFO: Expected ratio <5% Measured ratio 0 > # Testing ets bands 3 strict 3, streams 1 2 > # TEST: band 1                                                        [ OK ] > # INFO: Expected ratio >95% Measured ratio 100.00 > # TEST: band 2                                                        [ OK ] > # INFO: Expected ratio <5% Measured ratio 0 > # Testing ets bands 4 strict 1 quanta 5000 2500 1500, streams 0 1 > > What is happening is a stuck mousezahn loop: > > root      558266    2732  0 01:17 pts/2    00:00:23 mausezahn veth0.10 -p 8000 -A 192.0.2.1 -B 192.0.2.2 -c 0 -a own -b ba:33:37:81:dc:5 > root      558273    2732  0 01:17 pts/2    00:00:23 mausezahn veth0.11 -p 8000 -A 192.0.2.17 -B 192.0.2.18 -c 0 -a own -b ba:33:37:81:dc > > $ sudo strace -p 558266 > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > ioctl(5, SIOCGIFINDEX, {ifr_name="veth0.10"}) = -1 ENODEV (No such device) > > I am not sure if this will ever complete, or is it a runaway loop? > > Probably it is stuck, for nothing was written to the test log for 71 minutes. > > I hope we are coming closer to the cause. > > Previously I didn't figure out what hanged. > > FWIW, I will post the patches for the scripts that timed out for your review. Hi, More homework: I can trace the infinite mausezahn mode to here: lib.sh 1420 __start_traffic() 1421 { 1422 local pktsize=$1; shift 1423 local proto=$1; shift 1424 local h_in=$1; shift # Where the traffic egresses the host 1425 local sip=$1; shift 1426 local dip=$1; shift 1427 local dmac=$1; shift 1428 1429 $MZ $h_in -p $pktsize -A $sip -B $dip -c 0 \ 1430 -a own -b $dmac -t "$proto" -q "$@" & 1431 sleep 1 1432 } but I don't know why in this test it runs away for "veth0.10" in an infinite loop and is obviously nowhere killed for hours. I seem to have no new ideas now so I guess I should take a break. Kind regards, Mirsad