Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754112AbaJMOqW (ORCPT ); Mon, 13 Oct 2014 10:46:22 -0400 Received: from v094114.home.net.pl ([79.96.170.134]:52585 "HELO v094114.home.net.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with SMTP id S1753272AbaJMOqV (ORCPT ); Mon, 13 Oct 2014 10:46:21 -0400 From: "Rafael J. Wysocki" To: Pavel Machek , Wilmer van der Gaast Cc: bhelgaas@google.com, rafael.j.wysocki@intel.com, linux-kernel@vger.kernel.org Subject: Re: Machine crashes right *after* ~successful resume Date: Mon, 13 Oct 2014 17:06:35 +0200 Message-ID: <9751497.0fL0kvsyX7@vostro.rjw.lan> User-Agent: KMail/4.11.5 (Linux/3.16.0-rc5+; KDE/4.11.5; x86_64; ; ) In-Reply-To: <20141012204032.GA11171@amd> References: <54347520.3050109@gaast.net> <543AA2FE.20105@gaast.net> <20141012204032.GA11171@amd> MIME-Version: 1.0 Content-Transfer-Encoding: 7Bit Content-Type: text/plain; charset="utf-8" Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sunday, October 12, 2014 10:40:32 PM Pavel Machek wrote: > Bjorn, any ideas? > > Would it be feasible to revert 2e8b... to see if it fixes it on 3.17? That's a merge, isn't it? I'd rather check what the pci/misc branch was based on and then bisect that branch. If you do $ git show fed2451 you'll see (among other things) that this indeed is the PCI branch merged by that commit and that it is based on 3b2f64d00c46 Linux 3.11-rc2 So, you can do $ git bisect 3b2f64d00c46..fed2451 and see which of the commits in there introduced the problem you're seeing. Note: Test fed2451 itself *first* and if that is bad already, then the merge itself was problematic, in which case please let me know. > On Sun 2014-10-12 16:49:18, Wilmer van der Gaast wrote: > > Hello, > > > > Many thanks for your response! > > > > On 12-10-14 15:30, Pavel Machek wrote: > > > > > >Has it ever worked ok? ...aha, in 3.10, ok. > > > > > Correct. And I've tried a few more kernels now, compiled on my own. 3.17 > > still has this issue, 3.10 is completely fine all the way up to 3.10.57 > > (I've tested just under 50 cycles last night). 3.11 I tried but it seems to > > have other suspend-resume stability issues not present anymore in later > > kernels, I've mostly not used those results. > > > > git bisect: I've finally succeeded! I've tried automating it completely, but > > sadly Gigabyte couldn't be bothered wiring up the motherboard to make the > > watchdog work. :-( > > > > The culprit appears to be this one: 2e8b5f621dbe29425906852c6079afb6b28720cb > > > > Merge: 07f2daa fed2451 > > Author: Bjorn Helgaas > > Date: Wed Aug 28 20:55:41 2013 -0600 > > > > Merge branch 'pci/misc' into next > > > > * pci/misc: > > PCI: Remove pcie_cap_has_devctl() > > PCI: Support PCIe Capability Slot registers only for ports with slots > > PCI: Remove PCIe Capability version checks > > PCI: Allow PCIe Capability link-related register access for switches > > PCI: Add offsets of PCIe capability registers > > PCI: Tidy bitmasks and spacing of PCIe capability definitions > > PCI: Remove obsolete comment reference to pci_pcie_cap2() > > PCI: Clarify PCI_EXP_TYPE_PCI_BRIDGE comment > > PCI: Rename PCIe capability definitions to follow convention > > PCI: Disable decoding for BAR sizing only when it was actually enabled > > PCI: Add comment about needing pci_msi_off() even when > > CONFIG_PCI_MSI=n > > PCI: Add pcibios_pm_ops for optional arch-specific hibernate > > functionality > > > > I've then tried to narrow down which of the merged changes is my issue but > > with no luck, possibly because there's a problem with a combination of one > > of these changes, and a change that was not in the pci/misc branch at the > > time. I could do a manual test instead. > > > > >>I've already tried to skip the NVidia + VMware modules at boot time (as you > > >>can see from the logs they're not loaded at any point), but it didn't help. > > >>I could try omitting more modules. > > >Yes, try with minimal modules (and no s2ram) would be nice. > > > > > I've tried unloading a bunch of modules (sound and NIC IIRC), same results. > > I can try this again with an even more minimal set. If this improves the > > situation, I'll post again. > > > > > > Wilmer van der Gaast. > > > > -- I speak only for myself. Rafael J. Wysocki, Intel Open Source Technology Center. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/