Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp5231389ybi; Tue, 4 Jun 2019 03:30:38 -0700 (PDT) X-Google-Smtp-Source: APXvYqx1XQUjZrmnGjBwDqouww/Dkngd6p7u9NP0/iNv3fU4mXmfcGhBy7PF2Ijb9VGDpnDCqG6m X-Received: by 2002:a17:90a:b00b:: with SMTP id x11mr7155932pjq.120.1559644238403; Tue, 04 Jun 2019 03:30:38 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1559644238; cv=none; d=google.com; s=arc-20160816; b=zfvZuXxqt9eDBY3pqmePH340AlGj7Kjp3qmQN/SlJ8gsUm7CVAZk2l3aYCdinXciYP BC4eBIXagLZzXTMne3sZgsKvoDs88t+HXyMIkT3AI3+GbLRiP501oXHg2x4q7YMAbHuA tkrT6zAf6/WQ5xsOQfS5or8KrKK6rGIwOKlPdxJ83ZHYcDopu0/zh9KdxYVvxgoMTftR NSLsS3f4rXkMXPkQ3Ddxn97wXAjfhosOHCWROf0DxPvCq97bNnjAoWsdXR0FpmPpM0kV 1QzUEJ3Fo7x8smilvNhCZxPlT+yswwIknL80dnB4rR0tPG2xS5ZFJ/9etNFkT2gMmlwK oKlQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:organization:autocrypt:openpgp:from:references:cc:to :subject; bh=eChA7OTvelImXD39SP3GaJJW7hxF+Qr+Bkmpg99Gq58=; b=x4YnyeBHQd2+1RpMH81/fBKUddQbCoNkTCB8LehKArVNC23+sV0AJJFp4NlkxmrBkG UbAI9w3bWNYpcCe77xSj3eEYG1UudHgY8wagxe5/EWIAtMjMomljRuUBL6wkCrS9P14o 46wABxeQnBX89uWq+NbCDNKtwzdGnJ8hx2y1LXC4z6hKBO0/DCgo7UU791VDUeDixl+m X/iyfOwAiFP2hGrSOmoeXsDFwNEHqU00tbZDNpiDn7fx+1i1J8A18lq1mECzlMyIVbwg bCdMgZGKepnNzo43oEHKhVC7qOZgmYtNqW5BS1pIuy0ClKvVwzB/p4dvZOeEQaVSZrAc /tyQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g13si21534463pgs.161.2019.06.04.03.30.18; Tue, 04 Jun 2019 03:30:38 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727207AbfFDK2m (ORCPT + 99 others); Tue, 4 Jun 2019 06:28:42 -0400 Received: from usa-sjc-mx-foss1.foss.arm.com ([217.140.101.70]:39888 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726877AbfFDK2l (ORCPT ); Tue, 4 Jun 2019 06:28:41 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id D6D1F80D; Tue, 4 Jun 2019 03:28:40 -0700 (PDT) Received: from [10.1.197.61] (usa-sjc-imap-foss1.foss.arm.com [10.72.51.249]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id E94E83F246; Tue, 4 Jun 2019 03:28:39 -0700 (PDT) Subject: Re: [RFC v2] irqchip/gic-its: fix command queue pointer comparison bug To: Heyi Guo , linux-kernel@vger.kernel.org Cc: wanghaibin.wang@huawei.com, Thomas Gleixner , Jason Cooper References: <1557747726-28283-1-git-send-email-guoheyi@huawei.com> From: Marc Zyngier Openpgp: preference=signencrypt Autocrypt: addr=marc.zyngier@arm.com; prefer-encrypt=mutual; keydata= mQINBE6Jf0UBEADLCxpix34Ch3kQKA9SNlVQroj9aHAEzzl0+V8jrvT9a9GkK+FjBOIQz4KE g+3p+lqgJH4NfwPm9H5I5e3wa+Scz9wAqWLTT772Rqb6hf6kx0kKd0P2jGv79qXSmwru28vJ t9NNsmIhEYwS5eTfCbsZZDCnR31J6qxozsDHpCGLHlYym/VbC199Uq/pN5gH+5JHZyhyZiNW ozUCjMqC4eNW42nYVKZQfbj/k4W9xFfudFaFEhAf/Vb1r6F05eBP1uopuzNkAN7vqS8XcgQH qXI357YC4ToCbmqLue4HK9+2mtf7MTdHZYGZ939OfTlOGuxFW+bhtPQzsHiW7eNe0ew0+LaL 3wdNzT5abPBscqXWVGsZWCAzBmrZato+Pd2bSCDPLInZV0j+rjt7MWiSxEAEowue3IcZA++7 ifTDIscQdpeKT8hcL+9eHLgoSDH62SlubO/y8bB1hV8JjLW/jQpLnae0oz25h39ij4ijcp8N t5slf5DNRi1NLz5+iaaLg4gaM3ywVK2VEKdBTg+JTg3dfrb3DH7ctTQquyKun9IVY8AsxMc6 lxl4HxrpLX7HgF10685GG5fFla7R1RUnW5svgQhz6YVU33yJjk5lIIrrxKI/wLlhn066mtu1 DoD9TEAjwOmpa6ofV6rHeBPehUwMZEsLqlKfLsl0PpsJwov8TQARAQABtCNNYXJjIFp5bmdp ZXIgPG1hcmMuenluZ2llckBhcm0uY29tPokCOwQTAQIAJQIbAwYLCQgHAwIGFQgCCQoLBBYC AwECHgECF4AFAk6NvYYCGQEACgkQI9DQutE9ekObww/+NcUATWXOcnoPflpYG43GZ0XjQLng LQFjBZL+CJV5+1XMDfz4ATH37cR+8gMO1UwmWPv5tOMKLHhw6uLxGG4upPAm0qxjRA/SE3LC 22kBjWiSMrkQgv5FDcwdhAcj8A+gKgcXBeyXsGBXLjo5UQOGvPTQXcqNXB9A3ZZN9vS6QUYN TXFjnUnzCJd+PVI/4jORz9EUVw1q/+kZgmA8/GhfPH3xNetTGLyJCJcQ86acom2liLZZX4+1 6Hda2x3hxpoQo7pTu+XA2YC4XyUstNDYIsE4F4NVHGi88a3N8yWE+Z7cBI2HjGvpfNxZnmKX 6bws6RQ4LHDPhy0yzWFowJXGTqM/e79c1UeqOVxKGFF3VhJJu1nMlh+5hnW4glXOoy/WmDEM UMbl9KbJUfo+GgIQGMp8mwgW0vK4HrSmevlDeMcrLdfbbFbcZLNeFFBn6KqxFZaTd+LpylIH bOPN6fy1Dxf7UZscogYw5Pt0JscgpciuO3DAZo3eXz6ffj2NrWchnbj+SpPBiH4srfFmHY+Y LBemIIOmSqIsjoSRjNEZeEObkshDVG5NncJzbAQY+V3Q3yo9og/8ZiaulVWDbcpKyUpzt7pv cdnY3baDE8ate/cymFP5jGJK++QCeA6u6JzBp7HnKbngqWa6g8qDSjPXBPCLmmRWbc5j0lvA 6ilrF8m5Ag0ETol/RQEQAM/2pdLYCWmf3rtIiP8Wj5NwyjSL6/UrChXtoX9wlY8a4h3EX6E3 64snIJVMLbyr4bwdmPKULlny7T/R8dx/mCOWu/DztrVNQiXWOTKJnd/2iQblBT+W5W8ep/nS w3qUIckKwKdplQtzSKeE+PJ+GMS+DoNDDkcrVjUnsoCEr0aK3cO6g5hLGu8IBbC1CJYSpple VVb/sADnWF3SfUvJ/l4K8Uk4B4+X90KpA7U9MhvDTCy5mJGaTsFqDLpnqp/yqaT2P7kyMG2E w+eqtVIqwwweZA0S+tuqput5xdNAcsj2PugVx9tlw/LJo39nh8NrMxAhv5aQ+JJ2I8UTiHLX QvoC0Yc/jZX/JRB5r4x4IhK34Mv5TiH/gFfZbwxd287Y1jOaD9lhnke1SX5MXF7eCT3cgyB+ hgSu42w+2xYl3+rzIhQqxXhaP232t/b3ilJO00ZZ19d4KICGcakeiL6ZBtD8TrtkRiewI3v0 o8rUBWtjcDRgg3tWx/PcJvZnw1twbmRdaNvsvnlapD2Y9Js3woRLIjSAGOijwzFXSJyC2HU1 AAuR9uo4/QkeIrQVHIxP7TJZdJ9sGEWdeGPzzPlKLHwIX2HzfbdtPejPSXm5LJ026qdtJHgz BAb3NygZG6BH6EC1NPDQ6O53EXorXS1tsSAgp5ZDSFEBklpRVT3E0NrDABEBAAGJAh8EGAEC AAkFAk6Jf0UCGwwACgkQI9DQutE9ekMLBQ//U+Mt9DtFpzMCIHFPE9nNlsCm75j22lNiw6mX mx3cUA3pl+uRGQr/zQC5inQNtjFUmwGkHqrAw+SmG5gsgnM4pSdYvraWaCWOZCQCx1lpaCOl MotrNcwMJTJLQGc4BjJyOeSH59HQDitKfKMu/yjRhzT8CXhys6R0kYMrEN0tbe1cFOJkxSbV 0GgRTDF4PKyLT+RncoKxQe8lGxuk5614aRpBQa0LPafkirwqkUtxsPnarkPUEfkBlnIhAR8L kmneYLu0AvbWjfJCUH7qfpyS/FRrQCoBq9QIEcf2v1f0AIpA27f9KCEv5MZSHXGCdNcbjKw1 39YxYZhmXaHFKDSZIC29YhQJeXWlfDEDq6nIhvurZy3mSh2OMQgaIoFexPCsBBOclH8QUtMk a3jW/qYyrV+qUq9Wf3SKPrXf7B3xB332jFCETbyZQXqmowV+2b3rJFRWn5hK5B+xwvuxKyGq qDOGjof2dKl2zBIxbFgOclV7wqCVkhxSJi/QaOj2zBqSNPXga5DWtX3ekRnJLa1+ijXxmdjz hApihi08gwvP5G9fNGKQyRETePEtEAWt0b7dOqMzYBYGRVr7uS4uT6WP7fzOwAJC4lU7ZYWZ yVshCa0IvTtp1085RtT3qhh9mobkcZ+7cQOY+Tx2RGXS9WeOh2jZjdoWUv6CevXNQyOUXMM= Organization: ARM Ltd Message-ID: <0723e23c-fd0e-366b-73ec-12cc59767a4e@arm.com> Date: Tue, 4 Jun 2019 11:28:38 +0100 User-Agent: Mozilla/5.0 (X11; Linux aarch64; rv:60.0) Gecko/20100101 Thunderbird/60.7.0 MIME-Version: 1.0 In-Reply-To: <1557747726-28283-1-git-send-email-guoheyi@huawei.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Heyi, On 13/05/2019 12:42, Heyi Guo wrote: > When we run several VMs with PCI passthrough and GICv4 enabled, not > pinning vCPUs, we will occasionally see below warnings in dmesg: > > ITS queue timeout (65440 65504 480) > ITS cmd its_build_vmovp_cmd failed > > The reason for the above issue is that in BUILD_SINGLE_CMD_FUNC: > 1. Post the write command. > 2. Release the lock. > 3. Start to read GITS_CREADR to get the reader pointer. > 4. Compare the reader pointer to the target pointer. > 5. If reader pointer does not reach the target, sleep 1us and continue > to try. > > If we have several processors running the above concurrently, other > CPUs will post write commands while the 1st CPU is waiting the > completion. So we may have below issue: > > phase 1: > ---rd_idx-----from_idx-----to_idx--0--------- > > wait 1us: > > phase 2: > --------------from_idx-----to_idx--0-rd_idx-- > > That is the rd_idx may fly ahead of to_idx, and if in case to_idx is > near the wrap point, rd_idx will wrap around. So the below condition > will not be met even after 1s: > > if (from_idx < to_idx && rd_idx >= to_idx) > > There is another theoretical issue. For a slow and busy ITS, the > initial rd_idx may fall behind from_idx a lot, just as below: > > ---rd_idx---0--from_idx-----to_idx----------- > > This will cause the wait function exit too early. > > Actually, it does not make much sense to use from_idx to judge if > to_idx is wrapped, but we need a initial rd_idx when lock is still > acquired, and it can be used to judge whether to_idx is wrapped and > the current rd_idx is wrapped. That's an interesting observation. Indeed, from_idx is pretty irrelevant here, and all we want to observe is the read pointer reaching the end of the command set. > > We switch to a method of calculating the delta of two adjacent reads > and accumulating it to get the sum, so that we can get the real rd_idx > from the wrapped value even when the queue is almost full. > > Cc: Thomas Gleixner > Cc: Jason Cooper > Cc: Marc Zyngier > > Signed-off-by: Heyi Guo > --- > drivers/irqchip/irq-gic-v3-its.c | 30 ++++++++++++++++++++---------- > 1 file changed, 20 insertions(+), 10 deletions(-) > > diff --git a/drivers/irqchip/irq-gic-v3-its.c b/drivers/irqchip/irq-gic-v3-its.c > index 7577755..f05acd4 100644 > --- a/drivers/irqchip/irq-gic-v3-its.c > +++ b/drivers/irqchip/irq-gic-v3-its.c > @@ -745,32 +745,40 @@ static void its_flush_cmd(struct its_node *its, struct its_cmd_block *cmd) > } > > static int its_wait_for_range_completion(struct its_node *its, > - struct its_cmd_block *from, > + u64 origin_rd_idx, > struct its_cmd_block *to) > { > - u64 rd_idx, from_idx, to_idx; > + u64 rd_idx, prev_idx, to_idx, sum; > + s64 delta; > u32 count = 1000000; /* 1s! */ > > - from_idx = its_cmd_ptr_to_offset(its, from); > to_idx = its_cmd_ptr_to_offset(its, to); > + if (to_idx < origin_rd_idx) > + to_idx += ITS_CMD_QUEUE_SZ; > + > + prev_idx = origin_rd_idx; I guess you could just rename origin_rd_idx to prev_idx and drop the extra declaration (the pr_err doesn't matter much). > + sum = origin_rd_idx; > > while (1) { > rd_idx = readl_relaxed(its->base + GITS_CREADR); > > - /* Direct case */ > - if (from_idx < to_idx && rd_idx >= to_idx) > - break; > + /* Wrap around for CREADR */ > + if (rd_idx >= prev_idx) > + delta = rd_idx - prev_idx; > + else > + delta = rd_idx + ITS_CMD_QUEUE_SZ - prev_idx; > > - /* Wrapped case */ > - if (from_idx >= to_idx && rd_idx >= to_idx && rd_idx < from_idx) > + sum += delta; So "sum" isn't quite saying what it represent. My understanding is that it is the linearized version of the read pointer, right? Just like you've linearized to_idx at the beginning of the function. > + if (sum >= to_idx) > break; > > count--; > if (!count) { > pr_err_ratelimited("ITS queue timeout (%llu %llu %llu)\n", > - from_idx, to_idx, rd_idx); > + origin_rd_idx, to_idx, sum); > return -1; > } > + prev_idx = rd_idx; > cpu_relax(); > udelay(1); > } > @@ -787,6 +795,7 @@ void name(struct its_node *its, \ > struct its_cmd_block *cmd, *sync_cmd, *next_cmd; \ > synctype *sync_obj; \ > unsigned long flags; \ > + u64 rd_idx; \ > \ > raw_spin_lock_irqsave(&its->lock, flags); \ > \ > @@ -808,10 +817,11 @@ void name(struct its_node *its, \ > } \ > \ > post: \ > + rd_idx = readl_relaxed(its->base + GITS_CREADR); \ > next_cmd = its_post_commands(its); \ > raw_spin_unlock_irqrestore(&its->lock, flags); \ > \ > - if (its_wait_for_range_completion(its, cmd, next_cmd)) \ > + if (its_wait_for_range_completion(its, rd_idx, next_cmd)) \ > pr_err_ratelimited("ITS cmd %ps failed\n", builder); \ > } > > If you agree with my comments above, I'm happy to take this patch and tidy it up myself. Now, I think there is still an annoying bug that can creep up if the queue wraps *twice* while you're sleeping (which could perfectly happen in a guest running on a busy host). Which means we need to account for wrapping generations... Thanks, M. -- Jazz is not dead. It just smells funny...