Date: Mon, 23 Apr 2018 15:35:26 +0200 (CEST)
From: Thomas Gleixner
To: "Wan, Kaike"
Cc: "Marciniszyn, Mike", "Dalessandro, Dennis", "Weiny, Ira", "Fleck, John", "linux-kernel@vger.kernel.org", "linux-rdma@vger.kernel.org"
Subject: RE: hrtimer (rdmavt RNR timer) was lost

On Mon, 23 Apr 2018, Wan, Kaike wrote:

> > Can you apply the following debug patch and enable the hrtimer_start trace
> > point and send me the full trace or upload it somewhere?
>
> The original trace was about 29GB and I filtered it with
> "0000000066dda1ea" (the offending base) to generate a 1.4GB file that I
> could open and investigate. I am not sure how I can send them to you. Do
> you have somewhere I can upload to?
>
> I can try your debug patch and again I am anticipating a big trace file.

Well, you can find the spot where the failure happens and then extract the
full trace from 2s before that point to 1s after. That should be reasonably
small and good enough.

Let me know when you have it and how big it is (compressed) and we'll
figure out how to transfer it.

Thanks,

	tglx
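
For illustration only, the window extraction suggested above (2s before the
failure to 1s after) could be sketched as follows. This is not part of the
original message. It assumes the trace is a text file in which each event
line carries a "seconds.microseconds:" timestamp field; the function name
extract_window and its parameters are hypothetical.

```python
import re
from typing import Iterable, Iterator

# Assumed line layout: "task-pid [cpu] flags 1234.567890: event: details".
# Only the "seconds.fraction:" field is relied upon here.
TS_RE = re.compile(r"\s(\d+\.\d+):\s")


def extract_window(lines: Iterable[str], fail_ts: float,
                   before: float = 2.0, after: float = 1.0) -> Iterator[str]:
    """Yield the lines whose timestamp lies in [fail_ts - before, fail_ts + after]."""
    lo, hi = fail_ts - before, fail_ts + after
    for line in lines:
        match = TS_RE.search(line)
        if match is None:
            continue  # header or comment line without a timestamp
        if lo <= float(match.group(1)) <= hi:
            yield line
```

Called on an open file object, with fail_ts set to the timestamp of the
failing event, this streams the trace line by line, so a multi-gigabyte
file does not have to fit in memory. The result can then be compressed
before it is sent.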