Received: by 2002:a05:6602:18e:0:0:0:0 with SMTP id m14csp1814614ioo; Mon, 23 May 2022 04:04:54 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwNezir9o6opiYLyKzPi5xNjrrTOi+gFEA89gvnqgIT0erMDItghnyG7RsbpP6yTBotJSDV X-Received: by 2002:a17:90a:9282:b0:1dc:4a1b:ea55 with SMTP id n2-20020a17090a928200b001dc4a1bea55mr25434794pjo.24.1653303894377; Mon, 23 May 2022 04:04:54 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1653303894; cv=none; d=google.com; s=arc-20160816; b=fqK47ZY5a3DWcIZK2k2KjLmQ+T9r2tQlnjP403OUcXMMFN8iSWmDrZjRjSjGFuJJno VFWWQ8IU75e8KOh+lZc9ETllMjcCWblb/esXNLKq5hRJHCE7V2x7OWxQsLbjKx1JGhq0 4Uei9Ok6DGBumkgpUoUx+r7I/NE8hSc4dZEKOsyXyYHBa6HWDvmXNIksPFAJ1EFjwl6b tHujpPldY7B9xKfeyyFIocBhYpqgf244uUizZm5ho8R2kiv9JzSHENyoNy7WOokSEpcR ezylW1hhieqMKFuOfuSv0kBInM4EWXnVzZQSXEnlX8Pg43yJKGKeDDc4I4VAotPtUUFY 6sew== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:subject:user-agent:mime-version:date:message-id :dkim-signature; bh=7Gj0myDG7Upk1Lm/yIITVfKpkM1xaCwFPb15m2KrOlA=; b=euRki+1aMhp64xpqfAW10YZB+RKlD+sbPyb6i6ba49cADc225TVLw1iHsyd4O5utQq jEIbn3VKlOdOsqP6+D044Y31322SJzeAcM3ujjr3vAqshIEaSLsx4v121RHA3sqRmRdy Xg/BvsYaHOd5ixekkhSkBzsc0ZohzKniQHFxcwAFZFn9GNYf8+BC/B7SNZ+2pJEmgZkG EpMIEDHZZa/d/KpYpMv0lF/xu7NV7Byg7QUYLxu+1RVMexlm4pniRuycgzuyhWodJ4fs uwYkH45HFDc1jwYPVO8Qzz54ARR8ZZvt+YfXCEitBv2z90eEb1BuHet91p+5r8zMsGUl Y0xQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@raspberrypi.com header.s=google header.b=SJc2DPtd; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=raspberrypi.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id m18-20020a635812000000b003db17cfd6d0si9623080pgb.564.2022.05.23.04.04.54 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 23 May 2022 04:04:54 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; dkim=pass header.i=@raspberrypi.com header.s=google header.b=SJc2DPtd; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=raspberrypi.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 3392717E33; Mon, 23 May 2022 04:01:35 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234641AbiEWLBY (ORCPT + 99 others); Mon, 23 May 2022 07:01:24 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:48844 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234513AbiEWLBK (ORCPT ); Mon, 23 May 2022 07:01:10 -0400 Received: from mail-wr1-x436.google.com (mail-wr1-x436.google.com [IPv6:2a00:1450:4864:20::436]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4B86A1AA for ; Mon, 23 May 2022 04:01:05 -0700 (PDT) Received: by mail-wr1-x436.google.com with SMTP id u27so19971007wru.8 for ; Mon, 23 May 2022 04:01:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=raspberrypi.com; s=google; h=message-id:date:mime-version:user-agent:subject:to:cc:references :from:in-reply-to:content-transfer-encoding; bh=7Gj0myDG7Upk1Lm/yIITVfKpkM1xaCwFPb15m2KrOlA=; b=SJc2DPtddEaRZtuiQIVwI9U6g2Kqmyg3NynK+fD7QTldXAL63VcMRo+aMpJExYDpea U61wuSKnmZudN9/aPDkW+8q2i6sI42HdU+daRbOBydwTH/40cNUSgVlse0Y2aAyMf2c5 M84vKj+sPSwuuOPh+WMeLIkLYg5u0mh0YUqnX4r/H0MM7ztqP4pL9mX9GZMSDnCcC/Eb JGCq9NAZ7xkWCQurfiXYTq3ZBPfihEm7fptSmT/Td81FKIlPYQCJRIcieZ7zpDRZQWaW wJJlN3j/UIGTboshXaa5yTkQil2BreC2vL3dzzRVBrCLzj0GOELRnczy6unvHZKSluZg mP2A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:message-id:date:mime-version:user-agent:subject :to:cc:references:from:in-reply-to:content-transfer-encoding; bh=7Gj0myDG7Upk1Lm/yIITVfKpkM1xaCwFPb15m2KrOlA=; b=SKJ4zHIGTUK8h9tpQeWcMZeKLmDvulxxWXRweB+n8QV6tt1klgde1n2bnYyZ2LTSy4 RVwZS198saTvIjmsTRxgD2oYlXR05K+heCPJ6WmeHtFiJ4SnPSM5pLdsa8EcfZBC14mf nOn8XuxA4+0BL2FE/p2vSuyyOUDsZAfLed6KnN6957PnYJ+9yMSU9NrpUTYxMmz8+QCZ rhm2LQ3lipyjlWYg5YN3WvG7HZEe5mXSFh73kRHUvlYFuBOephNGk+Zbe5w3OYW9ByRv HAG+XcSo4/k6uH4jO56V94Wg0bviaWLjLZrf+5XoOK4cIM3AcqL/oSVe3lOjEZenq15m RK2A== X-Gm-Message-State: AOAM530GCIw46sg1C2jZ1pHen+yspwhozDYcx5ZTkNo8P/f1SmyrzSO7 gBErK6qFQ+45qdw8meIeV1SYAA== X-Received: by 2002:a5d:6406:0:b0:20e:5bf9:8e31 with SMTP id z6-20020a5d6406000000b0020e5bf98e31mr18094381wru.432.1653303663852; Mon, 23 May 2022 04:01:03 -0700 (PDT) Received: from ?IPV6:2a00:1098:3142:14:3110:d736:2a7:6aff? ([2a00:1098:3142:14:3110:d736:2a7:6aff]) by smtp.gmail.com with ESMTPSA id g10-20020adfa58a000000b0020e60baffd1sm6477332wrc.52.2022.05.23.04.01.03 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 23 May 2022 04:01:03 -0700 (PDT) Message-ID: Date: Mon, 23 May 2022 12:01:03 +0100 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.9.1 Subject: Re: vchiq: Performance regression since 5.18-rc1 To: Stefan Wahren , paulmck@kernel.org Cc: Marcelo Tosatti , Andrew Morton , Nicolas Saenz Julienne , Borislav Petkov , Minchan Kim , Mel Gorman , Juri Lelli , Thomas Gleixner , Sebastian Andrzej Siewior , linux-kernel@vger.kernel.org, linux-mm@kvack.org, Linux ARM , regressions@lists.linux.dev, riel@surriel.com, viro@zeniv.linux.org.uk References: <77d6d498-7dd9-03eb-60f2-d7e682bb1b20@i2se.com> <20220521234616.GO1790663@paulmck-ThinkPad-P17-Gen-1> <20220523044818.GS1790663@paulmck-ThinkPad-P17-Gen-1> <58cb7fbb-d317-83e6-0427-d3f3944b24b8@raspberrypi.com> From: Phil Elwell In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-5.3 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,NICE_REPLY_A,RDNS_NONE,SPF_HELO_NONE, T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Stefan, On 23/05/2022 11:48, Stefan Wahren wrote: > Hi Phil, > > Am 23.05.22 um 11:29 schrieb Phil Elwell: >> Hi Stefan, >> >> On 23/05/2022 07:19, Stefan Wahren wrote: >>> Hi Paul, >>> >>> Am 23.05.22 um 06:48 schrieb Paul E. McKenney: >>>> On Sun, May 22, 2022 at 05:11:36PM +0200, Stefan Wahren wrote: >>>>> Hi Paul, >>>>> >>>>> Am 22.05.22 um 01:46 schrieb Paul E. McKenney: >>>>>> On Sun, May 22, 2022 at 01:22:00AM +0200, Stefan Wahren wrote: >>>>>>> Hi, >>>>>>> >>>>>>> while testing the staging/vc04_services/interface/vchiq_arm driver with my >>>>>>> Raspberry Pi 3 B+ (multi_v7_defconfig) i noticed a huge performance >>>>>>> regression since [ff042f4a9b050895a42cae893cc01fa2ca81b95c] mm: >>>>>>> lru_cache_disable: replace work queue synchronization with synchronize_rcu >>>>>>> >>>>>>> Usually i run "vchiq_test -f 1" to see the driver is still working [1]. >>>>>>> >>>>>>> Before commit: >>>>>>> >>>>>>> real    0m1,500s >>>>>>> user    0m0,068s >>>>>>> sys    0m0,846s >>>>>>> >>>>>>> After commit: >>>>>>> >>>>>>> real    7m11,449s >>>>>>> user    0m2,049s >>>>>>> sys    0m0,023s >>>>>>> >>>>>>> Best regards >>>>>>> >>>>>>> [1] - https://github.com/raspberrypi/userland >>>>>> Please feel free to try the patch shown below.  Or the pair of patches >>>>>> from Rik here: >>>>>> >>>>>> https://lore.kernel.org/lkml/20220218183114.2867528-2-riel@surriel.com/ >>>>>> https://lore.kernel.org/lkml/20220218183114.2867528-3-riel@surriel.com/ >>>>> I tried your patch and Rik's patches but in both cases vchiq_test runs 7 >>>>> minutes instead of ~ 1 second. >>>> That is surprising.  Do you boot with rcupdate.rcu_normal=1? >>> No, not explicit. >>>>    That would >>>> nullify my patch, but I would expect that Rik's patch would still provide >>>> increased performance even in that case. >>> I will retest with a fresh SD card image. >>>> >>>> Could you please characterize where the slowdown is occurring? >>> >>> Unfortunately i don't have a deep insight into driver and vchiq_test tool. >>> Just a user view. >>> >>> Do you think an strace would be a good starting point? >>> >>> @Phil Any advices to analyse this issue? >> >> Sending many small control packets: >> >>    vchiq_test -c 1 10000 >> >> essentially tests interrupt latency. Using a small number of large bulk >> transfers: >> >>    vchiq_test -b 10000 1 >> >> becomes a test of how long it takes to lock down pages. It also tests DMA >> transfer speeds, but since the DMA is run by the firmware (which you aren't >> changing), I think you can rule that. > Thanks i will try. >> >> You may also find it helpful to include "force_turbo=1" in config.txt for more >> predictable results. >> >> By the way, running our 5.18-rc7-based branch on a 3B+ I'm not seeing any >> performance problems: > I assume you are using arm/bcm2709_defconfig and not arm/multi_v7_defconfig as me? That's correct. Simply switching to multi_v7_defconfig breaks vchiq completely, presumably because it doesn't define CONFIG_BCM2835_VCHIQ. Phil >> >> pi@raspberrypi:~$ time vchiq_test -f 1 >> Functional test - iters:1 >> ======== iteration 1 ======== >> Testing bulk transfer for alignment. >> Testing bulk transfer at PAGE_SIZE. >> >> real    0m0.512s >> user    0m0.042s >> sys     0m0.165s >> >> Phil