Message-ID: <09f87692f844cdaac5b13a7b4ba25e658559f517.camel@redhat.com>
Subject: Power9 NV linux-next random process hang
From: Qian Cai
To: linuxppc-dev@lists.ozlabs.org
Cc: linux-kernel@vger.kernel.org, Michael Ellerman
Date: Tue, 05 Jan 2021 17:36:11 -0500

.config: https://cailca.coding.net/public/linux/mm/git/files/master/powerpc.config

Today's linux-next starts to generate random process hangs quite easily.
Yesterday's build seems to work fine. Sometimes, the process stack seems
corrupted while the process is running at 100% CPU, and gdb shows it has just
entered a subroutine where I really can't see why it would hang.

[ 6732.309621][T11627] task:ranbug state:R running task stack:24176 pid: 2893 ppid: 2867 flags:0x00040000
[ 6732.309779][T11627] Call Trace:
[ 6732.309826][T11627] [c00000006166fa30] [c00000006166fb60] 0xc00000006166fb60 (unreliable)

Also, running LTP syscalls ended up hanging with lots of zombie processes. Any
idea?

root      2023  0.0  0.0      0     0 ?        Zs   14:10   0:00 [login]
root     52052  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [recv01]
root     52054  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [recvfrom01]
root     52056  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [recvmsg01]
root     52155  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [rt_sigtimedwait]
root     52305  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [semctl01]
root     52362  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [send01]
root     52386  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04]
root     52387  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04]
root     52388  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04]
root     52389  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04]
root     52390  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04]
root     52392  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04_64]
root     52393  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04_64]
root     52394  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04_64]
root     52395  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04_64]
root     52396  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile04_64]
root     52398  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile05]
root     52400  0.0  0.0      0     0 pts/0    Z    15:03   0:00 [sendfile05_64]
root     52415  0.0  0.0      0     0 pts/0    Z    15:04   0:00 [sendmsg01]
root     53470  0.0  0.0      0     0 pts/0    Z    15:04   0:00 [sendto01]
root     53763  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53764  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53765  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53766  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53767  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53768  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53769  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53770  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53771  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53772  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53773  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53774  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53775  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53776  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53777  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53778  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53779  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53780  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
root     53782  0.0  0.0      0     0 pts/0    Z    15:06   0:00 [setrlimit01]
nobody   54290  0.0  0.0      0     0 pts/0    Z    15:07   0:00 [sysctl03]
root     56813  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56814  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56815  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56816  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56817  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56818  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56819  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56820  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56821  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56822  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56823  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56825  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56826  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56827  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56828  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56829  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56830  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56831  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56832  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56833  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56834  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56835  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56836  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid03]
root     56838  0.0  0.0      0     0 pts/0    Z    16:09   0:00 [waitpid04]
sshd     58675  0.0  0.0      0     0 ?        Z    17:21   0:00 [sshd]
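
In case it helps anyone trying to reproduce this, here is a minimal sketch for tallying zombie entries per test name from a listing like the one above. It assumes the lines follow the usual `ps aux` column order (USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND); the function name `count_zombies` and the sample lines are only for illustration and are not part of LTP or any existing tool.

```python
from collections import Counter


def count_zombies(ps_lines):
    """Count zombie processes per command from `ps aux`-style lines."""
    counts = Counter()
    for line in ps_lines:
        fields = line.split()
        # Assumed column order:
        # USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
        if len(fields) < 11:
            continue  # skip blank or truncated lines
        stat = fields[7]
        if stat.startswith("Z"):  # "Z", "Zs", ... all mark a zombie
            # Zombies are shown as "[name]"; drop the brackets.
            counts[fields[10].strip("[]")] += 1
    return counts


# Three lines taken from the listing above, used as sample input.
sample = [
    "root 2023 0.0 0.0 0 0 ? Zs 14:10 0:00 [login]",
    "root 52386 0.0 0.0 0 0 pts/0 Z 15:03 0:00 [sendfile04]",
    "root 52387 0.0 0.0 0 0 pts/0 Z 15:03 0:00 [sendfile04]",
]

for name, n in count_zombies(sample).most_common():
    print(f"{n:4d} {name}")
```

Feeding it the full listing groups the zombies by LTP test, which makes it easier to see which tests left the most unreaped children behind.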