From: Michael Ellerman
To: Tyrel Datwyler, Sachin Sant, linuxppc-dev@ozlabs.org
Cc: Nathan Fontenot, LKML
Subject: Re: WARN @lib/refcount.c:128 during hot unplug of I/O adapter.
In-Reply-To: <89aec36c-e352-e055-5e80-1235449762ce@linux.vnet.ibm.com>
References: <8760ig983f.fsf@concordia.ellerman.id.au> <89aec36c-e352-e055-5e80-1235449762ce@linux.vnet.ibm.com>
Date: Tue, 11 Apr 2017 19:00:08 +1000
Message-ID: <871sszwc87.fsf@concordia.ellerman.id.au>

Tyrel Datwyler writes:

> On 04/06/2017 09:04 PM, Michael Ellerman wrote:
>> Tyrel Datwyler writes:
>>
>>> On 04/06/2017 03:27 AM, Sachin Sant wrote:
>>>> On a POWER8 LPAR running 4.11.0-rc5, a hot unplug operation on
>>>> any I/O adapter results in the following warning
>>>>
>>>> This problem has been in the code for some time now. I had first seen this in
>>>> -next tree.
>>>>
>
>
>
>>>> Have attached the dmesg log from the system. Let me know if any additional
>>>> information is required to help debug this problem.
>>>
>>> I remember you mentioning this when the issue was brought up for CPUs. I
>>> assume the case is the same here where the issue is only seen with
>>> adapters that were hot-added after boot (i.e. hot-remove of adapter
>>> present at boot doesn't trip the warning)?
>>
>> So who's fixing this?
>
> I started looking at it when Bharata submitted a patch trying to fix the
> issue for CPUs, but got sidetracked by other things. I suspect that
> this underflow has actually been an issue for quite some time, and we
> are just now becoming aware of it thanks to the refcount_t patchset being
> merged.

Yes I agree. Which means it might be broken in existing distros.

> I'll look into it again this week.

Thanks.

cheers