Received: by 2002:ac0:a581:0:0:0:0:0 with SMTP id m1-v6csp3076968imm; Sun, 1 Jul 2018 11:43:31 -0700 (PDT) X-Google-Smtp-Source: ADUXVKKamUASXQYFKKhl3s88sgx4ZZiCvu80oUrA7U/DZppE92naj05Ppv1Xpo8yVHYdiHT6Uysr X-Received: by 2002:a17:902:6b86:: with SMTP id p6-v6mr23087706plk.75.1530470611618; Sun, 01 Jul 2018 11:43:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1530470611; cv=none; d=google.com; s=arc-20160816; b=0adML9RYZAPk9dJG9beJwemH6GoDRDI6F3kZVoh0e/lhpCmSUV8NjVB1Qx6UAx+Lem PA3dSntX2o6J5fqi06b9o/uBlLEy7ou7fmtKoUfR8tBpmRpL7KWF+NrHrOV7/GAZkoiW U9CcsnMvPoMRSgSXZO5m4kPmEubTPtUQVSyaiiGXGCPmkDTcLDdE7PL9WGYBR0OTrx5U kvzaUThPqqcLLHMJryvEVKRY9AZLZh5K+UZ8lalww6r28/5ZS362qzaFI6p+11jdyywV YtSB601M1ToycS8Xv4C0F91lhNR/rI43iyAo/D+TLe7JxV4fT/lI/Og0tDL9khLBZjKP p+Fg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:references :in-reply-to:message-id:date:subject:cc:to:from :arc-authentication-results; bh=+UxhoFKYRbmGfhBdYTd/xUWqKtyhYSbMgV3DY1R3dI4=; b=fONF2nsk5LJXTDWMmiJ41pvk+tgLt3jBbXe06GMZ5LVmR6JxT9yZoBf4bjktvJfXWZ oLfUC2b0ElDa8aRUN9N7lyBzzMtN7b7wrMgf6BYe5Gmwpj26xYK8J0GxZjo1zkQDCcz4 o2x6JZcIBAIwwsP/JbH2BfnVF5gt7SUXK87lme8rz9FqKVN9i8h9uC3BvaiTRtFCmjS4 NPXNhfBfSnD2OZq8OiHH5XG2XUuIA2EzNrtpYjCPpnCUxmzqG0co++nRnU93KmOvPNH3 k0ngwANgmFo2rOe0NueJQaB/Il90qgYD7QP4oIZwUUHshD7szqGbTioNBPnt/YR67S2n mxLQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j5-v6si5140081pgt.370.2018.07.01.11.43.17; Sun, 01 Jul 2018 11:43:31 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932823AbeGASml (ORCPT + 99 others); Sun, 1 Jul 2018 14:42:41 -0400 Received: from mail.linuxfoundation.org ([140.211.169.12]:60148 "EHLO mail.linuxfoundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932441AbeGAQNa (ORCPT ); Sun, 1 Jul 2018 12:13:30 -0400 Received: from localhost (LFbn-1-12247-202.w90-92.abo.wanadoo.fr [90.92.61.202]) by mail.linuxfoundation.org (Postfix) with ESMTPSA id 43EDC86D; Sun, 1 Jul 2018 16:13:29 +0000 (UTC) From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Gaurav Kohli , "Peter Zijlstra (Intel)" , Oleg Nesterov , Linus Torvalds , Thomas Gleixner , Ingo Molnar , Sasha Levin Subject: [PATCH 3.18 23/85] kthread, sched/wait: Fix kthread_parkme() wait-loop Date: Sun, 1 Jul 2018 18:01:41 +0200 Message-Id: <20180701153123.280903344@linuxfoundation.org> X-Mailer: git-send-email 2.18.0 In-Reply-To: <20180701153122.365061142@linuxfoundation.org> References: <20180701153122.365061142@linuxfoundation.org> User-Agent: quilt/0.65 X-stable: review MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 3.18-stable review patch. If anyone has any objections, please let me know. ------------------ From: Peter Zijlstra [ Upstream commit 741a76b350897604c48fb12beff1c9b77724dc96 ] Gaurav reported a problem with __kthread_parkme() where a concurrent try_to_wake_up() could result in competing stores to ->state which, when the TASK_PARKED store got lost bad things would happen. The comment near set_current_state() actually mentions this competing store, but only mentions the case against TASK_RUNNING. This same store, with different timing, can happen against a subsequent !RUNNING store. This normally is not a problem, because as per that same comment, the !RUNNING state store is inside a condition based wait-loop: for (;;) { set_current_state(TASK_UNINTERRUPTIBLE); if (!need_sleep) break; schedule(); } __set_current_state(TASK_RUNNING); If we loose the (first) TASK_UNINTERRUPTIBLE store to a previous (concurrent) wakeup, the schedule() will NO-OP and we'll go around the loop once more. The problem here is that the TASK_PARKED store is not inside the KTHREAD_SHOULD_PARK condition wait-loop. There is a genuine issue with sleeps that do not have a condition; this is addressed in a subsequent patch. Reported-by: Gaurav Kohli Signed-off-by: Peter Zijlstra (Intel) Reviewed-by: Oleg Nesterov Cc: Linus Torvalds Cc: Peter Zijlstra Cc: Thomas Gleixner Signed-off-by: Ingo Molnar Signed-off-by: Sasha Levin Signed-off-by: Greg Kroah-Hartman --- kernel/kthread.c | 7 ++++--- 1 file changed, 4 insertions(+), 3 deletions(-) --- a/kernel/kthread.c +++ b/kernel/kthread.c @@ -156,12 +156,13 @@ void *probe_kthread_data(struct task_str static void __kthread_parkme(struct kthread *self) { - __set_current_state(TASK_PARKED); - while (test_bit(KTHREAD_SHOULD_PARK, &self->flags)) { + for (;;) { + set_current_state(TASK_PARKED); + if (!test_bit(KTHREAD_SHOULD_PARK, &self->flags)) + break; if (!test_and_set_bit(KTHREAD_IS_PARKED, &self->flags)) complete(&self->parked); schedule(); - __set_current_state(TASK_PARKED); } clear_bit(KTHREAD_IS_PARKED, &self->flags); __set_current_state(TASK_RUNNING);