Received: by 2002:a05:7412:b995:b0:f9:9502:5bb8 with SMTP id it21csp6931889rdb; Tue, 2 Jan 2024 20:57:46 -0800 (PST) X-Google-Smtp-Source: AGHT+IFIT9OA46BUK6IAT82qBjoqhIVmDO+U2Dx5Or6grTbT0a5jVmluo+2KrT+nxvX+AR9W/Pxq X-Received: by 2002:a05:6214:194f:b0:680:c152:77b7 with SMTP id q15-20020a056214194f00b00680c15277b7mr2577799qvk.46.1704257866581; Tue, 02 Jan 2024 20:57:46 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1704257866; cv=none; d=google.com; s=arc-20160816; b=a7rGmc8sYtTh4lOlhjm3LteUtEpMTnmpbEvVUYyhKaue3lw0rmLC8wiEaUFUyNpopi e/VrIDymngNfcl33i2L1lAN1fmflvzw2do4D4HdSlSXGtrL/vjxfbQMnSP7ZKRt/O3ng QCBFkRs/4kCd9sOnRjUc4aH9so0c2QNvBHabJoDlX56gd6avnZmhtR02X33A5+Vlz/bw Cz71TtKF0s5EysScR1DqtbnOrir9I8iquIjjYmvgs1lJfFy4EDta9elda27Ex3OhKpM1 wBGkzSCnX0O7BBZDqJQ0abfKdz+jE7JD0KDHWTaFkXfVBOP/YaEDJ7WlVPzDJ3LYDFSR 4Gaw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:references:cc:to:from:subject :user-agent:mime-version:list-unsubscribe:list-subscribe:list-id :precedence:date:message-id:dkim-signature; bh=TzTbubuBJGoYiT6YmsWHMnhj67g34sIfk7l8kUJzaD4=; fh=eQ+qIczSSRvz7jVYFyfdSAdSJF6uapvWNDKrT1+uBFM=; b=f0sJ24IhcRAA7BXywyY3gwEjd9uoO7f4aNvHl/ektJZfcn97EnmvzQzPqXreE0yr2W 44QJtd3f57VXhg2lglkihkptglwpGYcpRvauEaJ49E3o3cUaKGvvK1aBIKp4yUsQ4Mos SpeaA1kZ9iaojuNQcGXJwKqOYERIg475oeEhS3vkJawHvs/HllAAgZJJ1lPeqnJrxcuV 6KgKmdGyP6Z3Vv10wWOQqxPyKDtUKRuXneE4NnFTN1xpqgETLzMLCUuX9dez50kjhqb5 QqfQb9/nVK8122Jk5G0WUR8c8BSjMrGM9HjJu1fV4CAYeuEUUJf1nFGzDf6x+edc9T+N nxkQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=f3uI0s4s; spf=pass (google.com: domain of linux-kernel+bounces-15157-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-15157-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from ny.mirrors.kernel.org (ny.mirrors.kernel.org. [2604:1380:45d1:ec00::1]) by mx.google.com with ESMTPS id w16-20020a0cf710000000b0067f827b4971si26885304qvn.262.2024.01.02.20.57.46 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 02 Jan 2024 20:57:46 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-15157-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) client-ip=2604:1380:45d1:ec00::1; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=f3uI0s4s; spf=pass (google.com: domain of linux-kernel+bounces-15157-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-15157-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ny.mirrors.kernel.org (Postfix) with ESMTPS id 2EEDE1C22E49 for ; Wed, 3 Jan 2024 04:57:46 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 577E715EBB; Wed, 3 Jan 2024 04:57:39 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b="f3uI0s4s" X-Original-To: linux-kernel@vger.kernel.org Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.12]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 4DC99171CA for ; Wed, 3 Jan 2024 04:57:37 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.intel.com Authentication-Results: smtp.subspace.kernel.org; spf=none smtp.mailfrom=linux.intel.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1704257857; x=1735793857; h=message-id:date:mime-version:subject:from:to:cc: references:in-reply-to:content-transfer-encoding; bh=4KY36EA/Zapbzcjcb2oCAr/P47qDCCgD+zqKdSA9XQw=; b=f3uI0s4so9CC8yOPy1GSyYMV2DEkp76n0QEbfis2vrrLq5+KnfNSr7qg eW9ULPAAoGXIVROhsO6151El0ocVwgWo9TuUV7s7376IRoaMGFsrE48gN WPOIWc1bDNjeTeMfnjYCLX0bg+4ihHp/Odiu4jmgthg2DZB/Jd6ci5Aan yG5+JmH6ClCr1SGhpKY3jLNFJE5l68ClXEKUHY3pozRgzmjuEMhRBXFcB MmaM3uPJrSd5dqtXlASIFufkRiQ5aQ4l+m8ZJr0X2O51Mxx3w2R2Qx85A P9BGpDlJiVXOf0sERp2wUiG8oBvGtrcc6p8xqQhVCCelbI5oDejte6uIG A==; X-IronPort-AV: E=McAfee;i="6600,9927,10941"; a="4303169" X-IronPort-AV: E=Sophos;i="6.04,326,1695711600"; d="scan'208";a="4303169" Received: from orviesa002.jf.intel.com ([10.64.159.142]) by orvoesa104.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 02 Jan 2024 20:57:37 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.04,326,1695711600"; d="scan'208";a="21682378" Received: from zhaohaif-mobl.ccr.corp.intel.com (HELO [10.93.16.94]) ([10.93.16.94]) by orviesa002-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 02 Jan 2024 20:57:35 -0800 Message-ID: Date: Wed, 3 Jan 2024 12:57:33 +0800 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [Regression] [iommu/iova] iova_rbtree_lock contended seriously causing performance bottleneck (bisect done; commit found) From: Ethan Zhao To: robin.murphy@arm.com Cc: iommu@lists.linux.dev, linux-kernel@vger.kernel.org, regressions@lists.linux.dev References: <20240103045506.1077586-1-haifeng.zhao@linux.intel.com> In-Reply-To: <20240103045506.1077586-1-haifeng.zhao@linux.intel.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit On 1/3/2024 12:55 PM, Ethan Zhao wrote: part of the mail got lost after sent past here Issue: When network throughput is large (>1GB/s), the performance of application nginx hits bottleneck, most of cpu cycles are cost by spin lock, it takes a lot of cycles to aquire iova_rbtree_lock for  __alloc_and_insert_iova_range(). > flame graph > > > queued_spin_lock_slowpath-------------\ > do_raw_spin_lock \ > _raw_spin_lock_irqsave \ > __alloc_and_insert_iova_range \ > alloc_iova_fast \ > iommu_dma_alloc_iova \ > __iommu_dma_map \ > iommu_dma_map_page \ > ice_tx_map.isra.0 \ > ice_xmit_frame_ring \ > dev_hard_start_xmit \ > sch_direct_xmit \ > __dev_xmit_skb \ > __dev_queue_xmit \ > ip_finish_output2 \ > __ip_queue_xmit \ > __tcp_transmit_skb \ > tcp_write_xmit \ > __tcp_push_pending_frames \ > tcp_sendmsg_locked \ > tcp_sendmsg \ > tcp_sendmsg \ > sock_write_iter \ > do_iter_readv_writev \ > do_iter_write \ > vfs_writev \ > do_writev \ > do_syscall_64 \ > entry_SYSCALL_64_after_hwframe \ > __GI___writev \ > [nginx] \ > ngx_linux_sendfile_chain \ > ngx_http_write_filter \ > ngx_output_chain \ > ngx_http_send_response \ > ngx_http_script_return_code \ > [nginx] \ > ngx_http_core_run_phases \ > ngx_http_process_request \ > > Setup: > 1. configure server nginx with following nginx.conf (appended to the tail) > > 2. request server with WRK > ./wrk -t 64 -c 1024 -d 40 --latency http://$server_ip:10802/1KB.json > > Debugging summary: > Bisect identified "commit 371d7955e310 iommu/iova: Improve restart logic" > as cause. > > nginx.conf > user nginx; > worker_processes 16; > error_log /var/log/nginx/error.log crit; > pid /var/run/nginx.pid; > events { > worker_connections 4000; > use epoll; > multi_accept on; > } > http { > include /etc/nginx/mime.types; > default_type application/octet-stream; > log_format main '$remote_addr - $remote_user [$time_local] "$request" ' > '$status $body_bytes_sent "$http_referer" ' > '"$http_user_agent" "$http_x_forwarded_for"'; > access_log off; > sendfile on; > tcp_nopush on; > tcp_nodelay on; > keepalive_timeout 65; > keepalive_requests 20480; > gzip_min_length 10240; > gzip_comp_level 1; > gzip_vary on; > gzip_disable msie6; > gzip_proxied expired no-cache no-store private auth; > gzip_types > # text/html is always compressed by HttpGzipModule > text/css > text/javascript > text/xml > text/plain > text/x-component > application/javascript > application/x-javascript > application/json > application/xml > application/rss+xml > application/atom+xml > font/truetype > font/opentype > application/vnd.ms-fontobject > image/svg+xml; > > reset_timedout_connection on; > client_body_timeout 10; > send_timeout 2; > include /etc/nginx/conf.d/*.conf; > server { > listen 10802; > server_name localhost; > location / { > root /usr/share/nginx/html; > index index.html index.htm; > } > error_page 500 502 503 504 /50x.html; > location = /50x.html { > #root /usr/share/nginx/html; > root /ramdisk; > } > location /1KB.json { > return 202 '{"status":"success","result":"\ > Hello from NGINX, 2KB test\ > Nanchang, which was the capital of Yuzhang Prefecture during the HanDynasty, \ > now falls under the jurisdiction of Hongzhou. It straddles the borderof the \ > influence of the Ye and Zhen constellations , and is adjacent to theHeng \ > and the Lu mountains . The three rivers enfold it like the frontpart \ > of a garment and the five lakes encircle it like a girdle. Itcontrols \ > nature’s jewels. The radiance of its legendary sword shootsdirectly upward \ > between the constellations Niu and Dou. Its talented peopleare outstanding,\ > and the spirit of intelligence pervades the place. This wasthe place where Xu \ > Ru spent the night on his visit to Chen Fan (10). The mightyHongzhou spreads \ > meteors chasing one another.\ > "}'; > } > } > } Thanks, Ethan