Received: by 2002:a25:c593:0:0:0:0:0 with SMTP id v141csp719953ybe; Fri, 6 Sep 2019 06:22:33 -0700 (PDT) X-Google-Smtp-Source: APXvYqzmdqAjWTFM2Nv4ALZpZA6InLxSAgzI3w0RQw7X06ORlkqm4wvccj+fyAuwdA0aFmFMezS2 X-Received: by 2002:a17:90a:c70c:: with SMTP id o12mr8137231pjt.50.1567776153580; Fri, 06 Sep 2019 06:22:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1567776153; cv=none; d=google.com; s=arc-20160816; b=k1EeioDqDIzukj9P99cAM+TNOW8qjWo7TC7VoXo3ZGxUMq2poN8u4azYgEcI0Kbgde OwM2F8UZ/WyBBMTuzVAe438ns6BoRHNLxQtfzy/hCCdMn20WPFgQ9r7pcsF14thvDdkb zII9XkNFmQ0Mxd8JDDoKG74JZVr+VrYk5H7m/uDamJRXPMORHLvgHpRB3QhNTP0A7oCB DfituyxnUB2QokYUbtOhWBfLYZWKd277c5PbV3Z0Hk/sMbNsHRqZ/P4CNZo0YyEx+cGE ihIBFtmvFmprCzB8e3u8u9O50ChQisjg/XIu6KkQgviceBjT7Czq6UvVwRgbpKCuSOBo P0/Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:autocrypt:openpgp:from:references:cc:to:subject :dkim-signature; bh=7CsW3TppKngdBwILFW1AivCZPL5hHqz4o7k82Kvz7yk=; b=tn/szBXKDoCUg65tDpavd330CEHon+frShOGKjHJ5K7kMZkaY32kDqGkuHhTCgbnpn u3Xdl3ONX6jVgF1GkZN8kLJZ3oMrMp1ymh3pZAQzOyHBUk2KszeWFPFezrflG/iTLOQS f8qjpcKNcayE9Tpv5tzAWkCi4PjojMACTqBh2Pg3ojlp81YZPl1NubUe8jC0KZ7qNzfi Ox2YyHCsX+yyG3basAsvnMJwKoogYSNu+sWfF4yVqsZP0GkOztBEDXBUxnnZ9AoQdpC4 t4UfWUW27PjQbe1etF9Mx5YGJ3tBsDy/irfeR5FijA+mriSw164Nv/2FrprfIRAjpLJh xAXQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=vMdENFAR; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id z62si4225916pgd.472.2019.09.06.06.22.16; Fri, 06 Sep 2019 06:22:33 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=vMdENFAR; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2392306AbfIFFOW (ORCPT + 99 others); Fri, 6 Sep 2019 01:14:22 -0400 Received: from mail-wm1-f66.google.com ([209.85.128.66]:36416 "EHLO mail-wm1-f66.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2387551AbfIFFOW (ORCPT ); Fri, 6 Sep 2019 01:14:22 -0400 Received: by mail-wm1-f66.google.com with SMTP id p13so5502395wmh.1 for ; Thu, 05 Sep 2019 22:14:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=subject:to:cc:references:from:openpgp:autocrypt:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=7CsW3TppKngdBwILFW1AivCZPL5hHqz4o7k82Kvz7yk=; b=vMdENFAR1u1iwNLpm3uTpo5yhrAL19/DvOmxc2B9Rr/ZKxjFvqIOErerFUDh4MiFR2 AD5P+3aTT9qOqUE0Z2u54Ilty3puZYTz/jNh6/NsHDGyu2HPcMi1jyWI/7HVYVi+mxeH cpV2jufh4c3Qs8WHhzi1tV3siAVTMuxnHkGShnmF8FyS6nNB6HWoFVw58I3TyK081YT4 //8Xif4HZZekZ90p+2lzzbfN10aaFxfNVI3oIqeydhhXgJh8wU0YSp6cncl/xz0gs7Km cTAe0f7nzSsFkkL6f4+r/HWku7eovFJY6NnVMpKTZx76rwu3Jv4gUv+6gJ1gdjVZoiYG JV9A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:openpgp:autocrypt :message-id:date:user-agent:mime-version:in-reply-to :content-language:content-transfer-encoding; bh=7CsW3TppKngdBwILFW1AivCZPL5hHqz4o7k82Kvz7yk=; b=GuLqnCJmz0LMyvO1i/1fQM6Pplpf3/SrD2ISzi35NZcHElxw17EGYFGVYWjZAPEE0H jwlqYSmLhQWCf3dGb4B4GP7KmpJ6nZML59hQXKm4rMubZBdwHlzKZSZuEERy+K8hWq52 ZoNwfI+YGTGmCgvdaZoigsyyKKF8w4nbHw65HcCUAjLy1ToHA15jj5rQu8RWaTHHGKS/ TaAM4c7Mh3LkpGP9WzLPKok5dR/Nbw+kxNGr+T4GFP0xgqDcEl3H+sC9AiIzgOyy8bbW pFTjRfXK1sl4benP87MZpxl48XZuJ8rryxYlCp/A4E61zo6g19jnvKF+hcUxw+GLkJkK UsHw== X-Gm-Message-State: APjAAAWuQdErucixZ64YFZXcG5xqdgTqVdHBNH6Z1Kz/zN7A/eS5RgVN EnzUnDQuyZF3REYFuQUulggqaIn5wjE= X-Received: by 2002:a05:600c:2105:: with SMTP id u5mr6040493wml.150.1567746858892; Thu, 05 Sep 2019 22:14:18 -0700 (PDT) Received: from ?IPv6:2a01:e34:ed2f:f020:a0b5:f0e4:ddb5:dee4? ([2a01:e34:ed2f:f020:a0b5:f0e4:ddb5:dee4]) by smtp.googlemail.com with ESMTPSA id a13sm9501606wrf.73.2019.09.05.22.14.15 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Thu, 05 Sep 2019 22:14:18 -0700 (PDT) Subject: Re: [PATCH 1/4] softirq: implement IRQ flood detection mechanism To: Ming Lei Cc: Keith Busch , Hannes Reinecke , Bart Van Assche , linux-scsi@vger.kernel.org, Peter Zijlstra , Long Li , John Garry , LKML , linux-nvme@lists.infradead.org, Jens Axboe , Ingo Molnar , Thomas Gleixner , Christoph Hellwig , Sagi Grimberg References: <20190903033001.GB23861@ming.t460p> <299fb6b5-d414-2e71-1dd2-9d6e34ee1c79@linaro.org> <20190903063125.GA21022@ming.t460p> <6b88719c-782a-4a63-db9f-bf62734a7874@linaro.org> <20190903072848.GA22170@ming.t460p> <6f3b6557-1767-8c80-f786-1ea667179b39@acm.org> <2a8bd278-5384-d82f-c09b-4fce236d2d95@linaro.org> <20190905090617.GB4432@ming.t460p> <6a36ccc7-24cd-1d92-fef1-2c5e0f798c36@linaro.org> <20190906014819.GB27116@ming.t460p> From: Daniel Lezcano Openpgp: preference=signencrypt Autocrypt: addr=daniel.lezcano@linaro.org; prefer-encrypt=mutual; keydata= mQINBFv/yykBEADDdW8RZu7iZILSf3zxq5y8YdaeyZjI/MaqgnvG/c3WjFaunoTMspeusiFE sXvtg3ehTOoyD0oFjKkHaia1Zpa1m/gnNdT/WvTveLfGA1gH+yGes2Sr53Ht8hWYZFYMZc8V 2pbSKh8wepq4g8r5YI1XUy9YbcTdj5mVrTklyGWA49NOeJz2QbfytMT3DJmk40LqwK6CCSU0 9Ed8n0a+vevmQoRZJEd3Y1qXn2XHys0F6OHCC+VLENqNNZXdZE9E+b3FFW0lk49oLTzLRNIq 0wHeR1H54RffhLQAor2+4kSSu8mW5qB0n5Eb/zXJZZ/bRiXmT8kNg85UdYhvf03ZAsp3qxcr xMfMsC7m3+ADOtW90rNNLZnRvjhsYNrGIKH8Ub0UKXFXibHbafSuq7RqyRQzt01Ud8CAtq+w P9EftUysLtovGpLSpGDO5zQ++4ZGVygdYFr318aGDqCljKAKZ9hYgRimPBToDedho1S1uE6F 6YiBFnI3ry9+/KUnEP6L8Sfezwy7fp2JUNkUr41QF76nz43tl7oersrLxHzj2dYfWUAZWXva wW4IKF5sOPFMMgxoOJovSWqwh1b7hqI+nDlD3mmVMd20VyE9W7AgTIsvDxWUnMPvww5iExlY eIC0Wj9K4UqSYBOHcUPrVOKTcsBVPQA6SAMJlt82/v5l4J0pSQARAQABtCpEYW5pZWwgTGV6 Y2FubyA8ZGFuaWVsLmxlemNhbm9AbGluYXJvLm9yZz6JAlcEEwEIAEECGwEFCwkIBwIGFQoJ CAsCBBYCAwECHgECF4ACGQEWIQQk1ibyU76eh+bOW/SP9LjScWdVJwUCXAkeagUJDRnjhwAK CRCP9LjScWdVJ+vYEACStDg7is2JdE7xz1PFu7jnrlOzoITfw05BurgJMqlvoiFYt9tEeUMl zdU2+r0cevsmepqSUVuUvXztN8HA/Ep2vccmWnCXzlE56X1AK7PRRdaQd1SK/eVsJVaKbQTr ii0wjbs6AU1uo0LdLINLjwwItnQ83/ttbf1LheyN8yknlch7jn6H6J2A/ORZECTfJbG4ecVr 7AEm4A/G5nyPO4BG7dMKtjQ+crl/pSSuxV+JTDuoEWUO+YOClg6azjv8Onm0cQ46x9JRtahw YmXdIXD6NsJHmMG9bKmVI0I7o5Q4XL52X6QxkeMi8+VhvqXXIkIZeizZe5XLTYUvFHLdexzX Xze0LwLpmMObFLifjziJQsLP2lWwOfg6ZiH8z8eQJFB8bYTSMqmfTulB61YO0mhd676q17Y7 Z7u3md3CLH7rh61wU1g7FcLm9p5tXXWWaAud9Aa2kne2O3sirO0+JhsKbItz3d9yXuWgv6w3 heOIF0b91JyrY6tjz42hvyjxtHywRr4cdAEQa2S7HeQkw48BQOG6PqQ9d3FYU34pt3WFJ19V A5qqAiEjqc4N0uPkC79W32yLGdyg0EEe8v0Uhs3CxM9euGg37kr5fujMm+akMtR1ENITo+UI fgsxdwjBD5lNb/UGodU4QvPipB/xx4zz7pS5+2jGimfLeoe7mgGJxrkBDQRb/8z6AQgAvSkg 5w7dVCSbpP6nXc+i8OBz59aq8kuL3YpxT9RXE/y45IFUVuSc2kuUj683rEEgyD7XCf4QKzOw +XgnJcKFQiACpYAowhF/XNkMPQFspPNM1ChnIL5KWJdTp0DhW+WBeCnyCQ2pzeCzQlS/qfs3 dMLzzm9qCDrrDh/aEegMMZFO+reIgPZnInAcbHj3xUhz8p2dkExRMTnLry8XXkiMu9WpchHy XXWYxXbMnHkSRuT00lUfZAkYpMP7La2UudC/Uw9WqGuAQzTqhvE1kSQe0e11Uc+PqceLRHA2 bq/wz0cGriUrcCrnkzRmzYLoGXQHqRuZazMZn2/pSIMZdDxLbwARAQABiQI2BBgBCAAgFiEE JNYm8lO+nofmzlv0j/S40nFnVScFAlv/zPoCGwwACgkQj/S40nFnVSf4OhAAhWJPjgUu6VfS mV53AUGIyqpOynPvSaMoGJzhNsDeNUDfV5dEZN8K4qjuz2CTNvGIyt4DE/IJbtasvi5dW4wW Fl85bF6xeLM0qpCaZtXAsU5gzp3uT7ut++nTPYW+CpfYIlIpyOIzVAmw7rZbfgsId2Lj7g1w QCjvGHw19mq85/wiEiZZNHeJQ3GuAr/uMoiaRBnf6wVcdpUTFMXlkE8/tYHPWbW0YKcKFwJ3 uIsNxZUe6coNzYnL0d9GK2fkDoqKfKbFjNhW9TygfeL2Qhk949jMGQudFS3zlwvN9wwVaC0i KC/D303DiTnB0WFPT8CltMAZSbQ1WEWfwqxhY26di3k9pj+X3BfOmDL9GBlnRTSgwjqjqzpG VZsWouuTfXd9ZPPzvYdUBrlTKgojk1C8v4fhSqb+ard+bZcwNp8Tzl/EI9ygw6lYEATGCUYI Wco+fjehCgG1FWvWavMU+jLNs8/8uwj1u+BtRpWFj4ug/VaDDIuiApKPwl1Ge+zoC7TLMtyb c00W5/8EckjmNgLDIINEsOsidMH61ZOlwDKCxo2lbV+Ij078KHBIY76zuHlwonEQaHLCAdqm WiI95pYZNruAJEqZCpvXDdClmBVMZRDRePzSljCvoHxn7ArEt3F14mabn2RRq/hqB8IhC6ny xAEPQIZaxxginIFYEziOjR65AQ0EW//NCAEIALcJqSmQdkt04vIBD12dryF6WcVWYvVwhspt RlZbZ/NZ6nzarzEYPFcXaYOZCOCv+Xtm6hB8fh5XHd7Y8CWuZNDVp3ozuqwTkzQuux/aVdNb Fe4VNeKGN2FK1aNlguAXJNCDNRCpWgRHuU3rWwGUMgentJogARvxfex2/RV/5mzYG/N1DJKt F7g1zEcQD3JtK6WOwZXd+NDyke3tdG7vsNRFjMDkV4046bOOh1BKbWYu8nL3UtWBxhWKx3Pu 1VOBUVwL2MJKW6umk+WqUNgYc2bjelgcTSdz4A6ZhJxstUO4IUfjvYRjoqle+dQcx1u+mmCn 8EdKJlbAoR4NUFZy7WUAEQEAAYkDbAQYAQgAIBYhBCTWJvJTvp6H5s5b9I/0uNJxZ1UnBQJb /80IAhsCAUAJEI/0uNJxZ1UnwHQgBBkBCAAdFiEEGn3N4YVz0WNVyHskqDIjiipP6E8FAlv/ zQgACgkQqDIjiipP6E+FuggAl6lkO7BhTkrRbFhrcjCm0bEoYWnCkQtX9YFvElQeA7MhxznO BY/r1q2Uf6Ifr3YGEkLnME/tQQzUwznydM94CtRJ8KDSa1CxOseEsKq6B38xJtjgYSxNdgQb EIfCzUHIGfk94AFKPdV6pqqSU5VpPUagF+JxiAkoEPOdFiQCULFNRLMsOtG7yp8uSyJRp6Tz cQ+0+1QyX1krcHBUlNlvfdmL9DM+umPtbS9F6oRph15mvKVYiPObI1z8ymHoc68ReWjhUuHc IDQs4w9rJVAyLypQ0p+ySDcTc+AmPP6PGUayIHYX63Q0KhJFgpr1wH0pHKpC78DPtX1a7HGM 7MqzQ4NbD/4oLKKwByrIp12wLpSe3gDQPxLpfGgsJs6BBuAGVdkrdfIx2e6ENnwDoF0Veeji BGrVmjVgLUWV9nUP92zpyByzd8HkRSPNZNlisU4gnz1tKhQl+j6G/l2lDYsqKeRG55TXbu9M LqJYccPJ85B0PXcy63fL9U5DTysmxKQ5RgaxcxIZCM528ULFQs3dfEx5euWTWnnh7pN30RLg a+0AjSGd886Bh0kT1Dznrite0dzYlTHlacbITZG84yRk/gS7DkYQdjL8zgFr/pxH5CbYJDk0 tYUhisTESeesbvWSPO5uNqqy1dAFw+dqRcF5gXIh3NKX0gqiAA87NM7nL5ym/CNpJ7z7nRC8 qePOXubgouxumi5RQs1+crBmCDa/AyJHKdG2mqCt9fx5EPbDpw6Zzx7hgURh4ikHoS7/tLjK iqWjuat8/HWc01yEd8rtkGuUcMqbCi1XhcAmkaOnX8FYscMRoyyMrWClRZEQRokqZIj79+PR adkDXtr4MeL8BaB7Ij2oyRVjXUwhFQNKi5Z5Rve0a3zvGkkqw8Mz20BOksjSWjAF6g9byukl CUVjC03PdMSufNLK06x5hPc/c4tFR4J9cLrV+XxdCX7r0zGos9SzTPGNuIk1LK++S3EJhLFj 4eoWtNhMWc1uiTf9ENza0ntqH9XBWEQ6IA1gubCniGG+Xg== Message-ID: Date: Fri, 6 Sep 2019 07:14:15 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: <20190906014819.GB27116@ming.t460p> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, On 06/09/2019 03:48, Ming Lei wrote: [ ... ] >> You did not share yet the analysis of the problem (the kernel warnings >> give the symptoms) and gave the reasoning for the solution. It is hard >> to understand what you are looking for exactly and how to connect the dots. > > Let me explain it one more time:> > When one IRQ flood happens on one CPU: > > 1) softirq handling on this CPU can't make progress > > 2) kernel thread bound to this CPU can't make progress > > For example, network may require softirq to xmit packets, or another irq > thread for handling keyboards/mice or whatever, or rcu_sched may depend > on that CPU for making progress, then the irq flood stalls the whole > system. > >> >> AFAIU, there are fast medium where the responses to requests are faster >> than the time to process them, right? > > Usually medium may not be faster than CPU, now we are talking about > interrupts, which can be originated from lots of devices concurrently, > for example, in Long Li'test, there are 8 NVMe drives involved. > >> >> I don't see how detecting IRQ flooding and use a threaded irq is the >> solution, can you explain? > > When IRQ flood is detected, we reserve a bit little time for providing > chance to make softirq/threads scheduled by scheduler, then the above > problem can be avoided. > >> >> If the responses are coming at a very high rate, whatever the solution >> (interrupts, threaded interrupts, polling), we are still in the same >> situation. > > When we moving the interrupt handling into irq thread, other softirq/ > threaded interrupt/thread gets chance to be scheduled, so we can avoid > to stall the whole system. Ok, so the real problem is per-cpu bounded tasks. I share Thomas opinion about a NAPI like approach. I do believe you should also rely on the IRQ_TIME_ACCOUNTING (may be get it optimized) to contribute to the CPU load and enforce task migration at load balance. >> My suggestion was initially to see if the interrupt load will be taken >> into accounts in the cpu load and favorize task migration with the >> scheduler load balance to a less loaded CPU, thus the CPU processing >> interrupts will end up doing only that while other CPUs will handle the >> "threaded" side. >> >> Beside that, I'm wondering if the block scheduler should be somehow >> involved in that [1] > > For NVMe or any multi-queue storage, the default scheduler is 'none', > which basically does nothing except for submitting IO asap. > > > Thanks, > Ming > -- Linaro.org │ Open source software for ARM SoCs Follow Linaro: Facebook | Twitter | Blog