Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp5963497imu; Wed, 30 Jan 2019 06:34:58 -0800 (PST) X-Google-Smtp-Source: ALg8bN7/iofnMkNeZtLGRgOfRgahknt0P3mp28pQa/Xl4xVRuvx0LrVUDNPcTGfwcvsTJfnTVZrS X-Received: by 2002:a63:2263:: with SMTP id t35mr27793801pgm.69.1548858898925; Wed, 30 Jan 2019 06:34:58 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1548858898; cv=none; d=google.com; s=arc-20160816; b=DD2HIUVHMLwxaYz5Q7N1qiny3BC7mhAfsv9Nchib4tx14HrNWnf+XNALybVd6HjGgz 7brtkM+/3hJfpAbLgU4jKERz+A6A+GV9A2eWoQsVLmHwJUxFvNSFcIi1kEGmOOpriHSH 3udxxkNgNgClJfjYSy7cA7jg2EivwlKfaUwPuRODPLpNRPgnEifzvCELXmKzdU+nrWCK 1sVhNxnd3Q6iJIdhBPGhh4Ya2I5+o3H6jfNDIjc3JpJ7lpTxClbvoLyHy7GuIqJMSIZe 9zkcUpr0FXNlaqA7JzUkIVCzwmLbVTVkFTTaEdcn//M60lIai/P0UHLH1Hf4zI8Hdha2 rmzQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:references :message-id:in-reply-to:subject:cc:to:from:date; bh=7OexsK22ug3d/gSBi4xfH4YjAgMQLRvbL/HQ6mUowG4=; b=BJo3ZTvqbqw3vRwO7D3D9wEoXI24w5BuMP2SPl6FtpRnVnQ2uc1b6lLwHUEL1bbPAy 4mT0dgKzf13YvvO9ur0jU4QWyg7IGYpfa9teyN51N7AQjnXSMBLTyavxNIHa9hqKap/q EDb8ISVNUbabmvB4kG8BQWO7ROX25S6o2Hrd3yy/9a3SqJApGtER/rLTBT6VR/2et4pX XuaI4iOAhrFNvEwthycixFCOGG3hKjycyBH9yzCn+H3Aol26lYkJMMESwtu/W+fY2pAZ 4nBIl3Bib1kwylUq6DINKA2txigvakIcB7XTIaWvTi9QDQfyCyibtomV/zRAArdbhMs4 WBvw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x23si1527068pgj.247.2019.01.30.06.34.41; Wed, 30 Jan 2019 06:34:58 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1731067AbfA3OeF (ORCPT + 99 others); Wed, 30 Jan 2019 09:34:05 -0500 Received: from Galois.linutronix.de ([146.0.238.70]:47344 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727789AbfA3OeF (ORCPT ); Wed, 30 Jan 2019 09:34:05 -0500 Received: from [2a01:598:b890:92b7:20d0:59ff:fe6e:bab9] (helo=nanos) by Galois.linutronix.de with esmtpsa (TLS1.2:DHE_RSA_AES_256_CBC_SHA256:256) (Exim 4.80) (envelope-from ) id 1goqwB-0002AI-Af; Wed, 30 Jan 2019 15:33:59 +0100 Date: Wed, 30 Jan 2019 15:33:47 +0100 (CET) From: Thomas Gleixner To: Sebastian Sewior cc: Heiko Carstens , Peter Zijlstra , Ingo Molnar , Martin Schwidefsky , LKML , linux-s390@vger.kernel.org, Stefan Liebler Subject: Re: WARN_ON_ONCE(!new_owner) within wake_futex_pi() triggered In-Reply-To: Message-ID: References: <20190129102409.GB26906@osiris> <20190129103557.GF28485@hirez.programming.kicks-ass.net> <20190129132303.GE26906@osiris> <20190129151058.GG26906@osiris> <20190129171653.ycl64psq2liy5o5c@linutronix.de> <20190130094913.GC5299@osiris> <20190130125955.GD5299@osiris> <20190130132420.spwrq2d4oxeydk5s@linutronix.de> User-Agent: Alpine 2.21 (DEB 202 2017-01-01) MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="8323329-1894196896-1548858839=:1950" X-Linutronix-Spam-Score: -1.0 X-Linutronix-Spam-Level: - X-Linutronix-Spam-Status: No , -1.0 points, 5.0 required, ALL_TRUSTED=-1,SHORTCIRCUIT=-0.0001 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org This message is in MIME format. The first part should be readable text, while the remaining parts are likely unreadable without MIME-aware tools. --8323329-1894196896-1548858839=:1950 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8BIT On Wed, 30 Jan 2019, Thomas Gleixner wrote: > On Wed, 30 Jan 2019, Sebastian Sewior wrote: > > > On 2019-01-30 13:59:55 [+0100], Heiko Carstens wrote: > > > Last lines of trace below (full log attached): > > > > <...>-56956 [005] .... 658.931364: handle_futex_death: uaddr: 3ff9e880c58 pi: 1 > > … > > <...>-56956 [005] .... 658.931369: handle_futex_death: uaddr: 3ff9e8808c0 success > > <...>-56956 [005] .... 658.931370: sched_process_exit: comm=ld64.so.1 pid=56956 prio=120 > > > > not including 3ff9e880140 > > > > <...>-56956 [005] .... 658.931370: sched_process_exit: comm=ld64.so.1 pid=56956 prio=120 > > > > <...>-56496 [001] .... 658.932404: sys_futex(uaddr: 3ff9e880140, op: 6, val: 1, utime: 0, uaddr2: 5, val3: 0) > > <...>-56496 [001] .... 658.932405: attach_to_pi_owner: Missing pid 56956 > > <...>-56496 [001] .... 658.932406: handle_exit_race: uval2 vs uval 8000de7c vs 8000de7c (-1) > > <...>-56496 [001] .... 658.932501: sys_futex -> 0xfffffffffffffffd > > … > > <...>-56496 [001] .... 659.020750: handle_futex_death: uaddr: 3ff9e880140 pi: 1 > > <...>-56496 [001] .... 659.020750: handle_futex_death: uaddr: 3ff9e880140 uval: 8000de7c > > <...>-56496 [001] .... 659.020750: handle_futex_death: uaddr: 3ff9e880140 success > > > > and here we have it. Well no. That futex is in the robust list of the dying task because it tried to lock it and then died probably before removing itself from the robust list. handle_futex_death() of that task does not touch it because the userspace TID value is de7c which is 56956. The last entries with that uaddr are: <...>-56956 [005] .... 658.923608: sys_futex(uaddr: 3ff9e880140, op: 7, val: 3ff00000007, utime: 3ff9b078910, uaddr2: 3ff9b078910, val3: 3ffea67e3f7) UNLOCK <...>-56945 [006] .... 658.923612: sys_futex(uaddr: 3ff9e880140, op: 6, val: 1, utime: 1003ff0, uaddr2: 3ff9e87f910, val3: 3ff0000de71) LOCK <...>-56956 [005] .... 658.923612: sys_futex(uaddr: 3ff9e880140, op: 7, val: 3ff00000007, utime: 3ff9b078910, uaddr2: 3ff9b078910, val3: 3ffea67e3f7) UNLOCK <...>-56945 [006] .... 658.923830: sys_futex(uaddr: 3ff9e880140, op: 7, val: 3ff00000007, utime: 3ff9e87f910, uaddr2: 3ff9e87f910, val3: 3ffea67e3f7) UNLOCK <...>-56496 [001] .... 658.932404: sys_futex(uaddr: 3ff9e880140, op: 6, val: 1, utime: 0, uaddr2: 5, val3: 0) LOCK which fails. This does not make any sense. The last kernel visible operation of 56956 on that uaddr is the UNLOCK above. I need to think some more about what might happen. Thanks, tglx --8323329-1894196896-1548858839=:1950--