Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp895160ybl; Wed, 11 Dec 2019 09:10:16 -0800 (PST) X-Google-Smtp-Source: APXvYqy9yVenwIn5dY9nWk8DAHn5Clt+xlYJw6YXnhet7XL3JL96LNSIsUMk+fEnd2S/pccvS071 X-Received: by 2002:a05:6808:8ec:: with SMTP id d12mr3422055oic.131.1576084216197; Wed, 11 Dec 2019 09:10:16 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1576084216; cv=none; d=google.com; s=arc-20160816; b=I+BluvV7f2oa1rGZM7Nd/eq/K7zWZLt+ubQXuH9NzpxVImvamsOsz73HbqOZsS+Zlz d5CZJH+X5aU4jNLu3AGWVoymmnsBwkUhnsD4RwJlsBvG43kkew8JQe652LkRHa6W+tJd WjbXU1KDhjUN9ACNYFbvSoHse06kF0ehxk7IgjeO/rec9CezFt8CFgLIStRaj5PjL1E5 yaZIpRpfegnslP6xv4oSeXznOcjbhK7eqqPk4uCGKG9pJsSY1ZsGvjmLR/rENWUlAK+d 3q8mOqk9mXFH4TxwV/bdgcgCGi6hmguqJZP8kmJcKb9gOu7eeF+/U+3I3WcDPvGa6lnp rWFQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=JdZWREz6G2wmNMMaK1f9eUfx5lnnYL4pIixt8ppKprA=; b=aGkypEs6n0Hz5Jb0kbock1lOJIMbELytlJt73Eth0P8guyvtwWhONyZ7bpf4GHpZfa /Z7hw7Y403Xplmp8rq7sO7wctCg5MtsAX7bdGgg7jBJ3ebAVx1da0uE7q1+IwrcM2JCW zzAwK6V4WE9pXI65glLQO/H2r3/Nf7oA27TULZ+o8Rl5oEAvXIhC0IY957Y98djO8zxp 29w436i1unIsBfiJVoQVwikUAKOrgB9JEKXzz4tEL/RmglhHbHDt4tbGw7TpEXnW7sbU FjbAohIJdEsenBbox5eCkL9h8tGJ5ouZwLh2+4K+kEUSGd/avq+6ur4nVxDAh7eC/Dw9 wQ6Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id n65si1511233oib.262.2019.12.11.09.10.02; Wed, 11 Dec 2019 09:10:16 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730788AbfLKRJW (ORCPT + 99 others); Wed, 11 Dec 2019 12:09:22 -0500 Received: from lhrrgout.huawei.com ([185.176.76.210]:2181 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1729512AbfLKRJW (ORCPT ); Wed, 11 Dec 2019 12:09:22 -0500 Received: from LHREML710-CAH.china.huawei.com (unknown [172.18.7.107]) by Forcepoint Email with ESMTP id 0AD9CE01575F78853AA3; Wed, 11 Dec 2019 17:09:20 +0000 (GMT) Received: from lhreml724-chm.china.huawei.com (10.201.108.75) by LHREML710-CAH.china.huawei.com (10.201.108.33) with Microsoft SMTP Server (TLS) id 14.3.408.0; Wed, 11 Dec 2019 17:09:19 +0000 Received: from [127.0.0.1] (10.202.226.46) by lhreml724-chm.china.huawei.com (10.201.108.75) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.1713.5; Wed, 11 Dec 2019 17:09:19 +0000 Subject: Re: [PATCH RFC 1/1] genirq: Make threaded handler use irq affinity for managed interrupt To: Ming Lei CC: , , , , , , , , , , References: <1575642904-58295-1-git-send-email-john.garry@huawei.com> <1575642904-58295-2-git-send-email-john.garry@huawei.com> <20191207080335.GA6077@ming.t460p> <78a10958-fdc9-0576-0c39-6079b9749d39@huawei.com> <20191210014335.GA25022@ming.t460p> From: John Garry Message-ID: <0ad37515-c22d-6857-65a2-cc28256a8afa@huawei.com> Date: Wed, 11 Dec 2019 17:09:18 +0000 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.1.2 MIME-Version: 1.0 In-Reply-To: <20191210014335.GA25022@ming.t460p> Content-Type: text/plain; charset="utf-8"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.202.226.46] X-ClientProxiedBy: lhreml728-chm.china.huawei.com (10.201.108.79) To lhreml724-chm.china.huawei.com (10.201.108.75) X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 10/12/2019 01:43, Ming Lei wrote: >>>> For when the interrupt is managed, allow the threaded part to run on all >>>> cpus in the irq affinity mask. >>> I remembered that performance drop is observed by this approach in some >>> test. >> From checking the thread about the NVMe interrupt swamp, just switching to >> threaded handler alone degrades performance. I didn't see any specific >> results for this change from Long Li -https://lkml.org/lkml/2019/8/21/128 Hi Ming, > I am pretty clear the reason for Azure, which is caused by aggressive interrupt > coalescing, and this behavior shouldn't be very common, and it can be > addressed by the following patch: I am running some NVMe perf tests with Marc's patch. I see this almost always eventually (with or without that patch): [ 66.018140] rcu: INFO: rcu_preempt self-detected stall on CPU2% done] [5058MB/0KB/0KB /s] [1295K/0/0 iops] [eta 01m:39s] [ 66.023885] rcu: 12-....: (5250 ticks this GP) idle=182/1/0x4000000000000004 softirq=517/517 fqs=2529 [ 66.033306] (t=5254 jiffies g=733 q=2241) [ 66.037394] Task dump for CPU 12: [ 66.040696] fio R running task 0 798 796 0x00000002 [ 66.047733] Call trace: [ 66.050173] dump_backtrace+0x0/0x1a0 [ 66.053823] show_stack+0x14/0x20 [ 66.057126] sched_show_task+0x164/0x1a0 [ 66.061036] dump_cpu_task+0x40/0x2e8 [ 66.064686] rcu_dump_cpu_stacks+0xa0/0xe0 [ 66.068769] rcu_sched_clock_irq+0x6d8/0xaa8 [ 66.073027] update_process_times+0x2c/0x50 [ 66.077198] tick_sched_handle.isra.14+0x30/0x50 [ 66.081802] tick_sched_timer+0x48/0x98 [ 66.085625] __hrtimer_run_queues+0x120/0x1b8 [ 66.089968] hrtimer_interrupt+0xd4/0x250 [ 66.093966] arch_timer_handler_phys+0x28/0x40 [ 66.098398] handle_percpu_devid_irq+0x80/0x140 [ 66.102915] generic_handle_irq+0x24/0x38 [ 66.106911] __handle_domain_irq+0x5c/0xb0 [ 66.110995] gic_handle_irq+0x5c/0x148 [ 66.114731] el1_irq+0xb8/0x180 [ 66.117858] efi_header_end+0x94/0x234 [ 66.121595] irq_exit+0xd0/0xd8 [ 66.124724] __handle_domain_irq+0x60/0xb0 [ 66.128806] gic_handle_irq+0x5c/0x148 [ 66.132542] el0_irq_naked+0x4c/0x54 [ 97.152870] rcu: INFO: rcu_preempt self-detected stall on CPU8% done] [4736MB/0KB/0KB /s] [1212K/0/0 iops] [eta 01m:08s] [ 97.158616] rcu: 8-....: (1 GPs behind) idle=08e/1/0x4000000000000002 softirq=462/505 fqs=2621 [ 97.167414] (t=5253 jiffies g=737 q=5507) [ 97.171498] Task dump for CPU 8: [pu_task+0x40/0x2e8 [ 97.198705] rcu_dump_cpu_stacks+0xa0/0xe0 [ 97.202788] rcu_sched_clock_irq+0x6d8/0xaa8 [ 97.207046] update_process_times+0x2c/0x50 [ 97.211217] tick_sched_handle.isra.14+0x30/0x50 [ 97.215820] tick_sched_timer+0x48/0x98 [ 97.219644] __hrtimer_run_queues+0x120/0x1b8 [ 97.223989] hrtimer_interrupt+0xd4/0x250 [ 97.227987] arch_timer_handler_phys+0x28/0x40 [ 97.232418] handle_percpu_devid_irq+0x80/0x140 [ 97.236935] generic_handle_irq+0x24/0x38 [ 97.240931] __handle_domain_irq+0x5c/0xb0 [ 97.245015] gic_handle_irq+0x5c/0x148 [ 97.248751] el1_irq+0xb8/0x180 [ 97.251880] find_busiest_group+0x18c/0x9e8 [ 97.256050] load_balance+0x154/0xb98 [ 97.259700] rebalance_domains+0x1cc/0x2f8 [ 97.263783] run_rebalance_domains+0x78/0xe0 [ 97.268040] efi_header_end+0x114/0x234 [ 97.271864] run_ksoftirqd+0x38/0x48 [ 97.275427] smpboot_thread_fn+0x16c/0x270 [ 97.279511] kthread+0x118/0x120 [ 97.282726] ret_from_fork+0x10/0x18 [ 97.286289] Task dump for CPU 12: [ 97.289591] kworker/12:1 R running task 0 570 2 0x0000002a [ 97.296634] Workqueue: 0x0 (mm_percpu_wq) [ 97.300718] Call trace: [ 97.303152] __switch_to+0xbc/0x218 [ 97.306632] page_wait_table+0x1500/0x1800 Would this be the same interrupt "swamp" issue? > > http://lists.infradead.org/pipermail/linux-nvme/2019-November/028008.html > What is the status of these patches? I did not see them in mainline. > Then please share your lockup story, such as, which HBA/drivers, test steps, > if you complete IOs from multiple disks(LUNs) on single CPU, if you have > multiple queues, how many active LUNs involved in the test, ... > > Thanks, John