Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp2497251imu; Thu, 29 Nov 2018 06:03:39 -0800 (PST) X-Google-Smtp-Source: AFSGD/VuePDHAq3i30Za5KJtnfE9oMZPy7Tav+NCD2geMhPQTYwpIPisY3cuChGgOou4MOgMhbwz X-Received: by 2002:a63:295:: with SMTP id 143mr1307869pgc.362.1543500219206; Thu, 29 Nov 2018 06:03:39 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543500219; cv=none; d=google.com; s=arc-20160816; b=G4hlil9vUc9u90fmVHAVag6uwAQRIOX6R8Hflhv/VzCiJ0G7gKG8jb5lku+jsXiM2e TpKHFqZJEdQkvIKL0W8Y9dI+RmsRbwpg5WmsgqVxJLB6BLYVxDAqflaXMdaMf/rs2gF7 /BjffVfCsbxwJK82UtGHDLSUsvVInOYXEz2GC25vzN05knAoSm4cwZIS9EwDLBCRHJy5 61HdyHo3FnhYfDTrsVh/pdgEmoEluinv9vBG06Bl3Q+hlc+OjOKpHNREK7y7I0wYStbh r3JMqngysTYCCipHtzYrbdxv5TIXKyifxJpaIiV6dOsUUpt/c1c/yZpA7bKlLcxwuTrj APuQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=/jjV9KGWquWLOwtCollysS/p7Dp1rsTMNfcsFZuikHU=; b=p2go4ytQzvGO1fpIzbaA01nl7CBL1NagMB5cHSQlPZInlOF/O3W3DnliEqJh4spqzq 5FsJmxDFXd9Sz3zwr/uPCGXePuzZ93AaGRDDIjFD33D7Fl6PCPTEdRTXhxAwqRTzRmV1 RrSqvbcWp4k1YtVQmdZ+Va1fQd2pw7DXjLAxqUI1ucNyPAsj0Vd5knWfsnn57QBAgCo5 B8PizU+J+5n+CTgxxDXc5KlE1OrPNcnP1ZNb1QIJ72pZP5wtO88i6VwFjrr+2QhvJ1hb W4arih41RQH221e9g8EwBAW/lQdz2fjVzJxvP1gJ1VA8WH4V+kK+atFUlqSD3BVJZyfk Zshw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=GjfBNbjG; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c11si2146136pgj.255.2018.11.29.06.03.09; Thu, 29 Nov 2018 06:03:39 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=GjfBNbjG; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728290AbeK3BHp (ORCPT + 99 others); Thu, 29 Nov 2018 20:07:45 -0500 Received: from mail-yw1-f65.google.com ([209.85.161.65]:39323 "EHLO mail-yw1-f65.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727040AbeK3BHp (ORCPT ); Thu, 29 Nov 2018 20:07:45 -0500 Received: by mail-yw1-f65.google.com with SMTP id j6so784436ywj.6 for ; Thu, 29 Nov 2018 06:02:17 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=/jjV9KGWquWLOwtCollysS/p7Dp1rsTMNfcsFZuikHU=; b=GjfBNbjGV3galjxtE3pCfF1n4l2nktyw7vwRiOmahMqDnn3Yxhs2R95TDp7uQIBPlp ag/JF6XSSJyUjU3bJqDmfPrfudQ0IPIFDYMlw1OkSQAccGWym/k7nG+FRzbMz041gPbJ X6hboXpzdZZB+HtK62KagpQxB6J9SFtI2hjPjOg35RE0s8mUuXrZ0wCFSaL5IcK05xPH JYrKsg2x8h7qwkbRnce16NLGk826XR/i7tezEI7Bs6ijBU8pYMj7J3i8pRfffzNJTVkn ymq57Qcypz6qKGB2HNtuE3j/6ZKQ6ouK+PZrvyo2jk2FL/WnSi2O8NePsyN5V98+LaU2 hotg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=/jjV9KGWquWLOwtCollysS/p7Dp1rsTMNfcsFZuikHU=; b=gJ+ePwQlg0Vg8M3jPh+QChrtC3OMnZZ12aI45yE/zKAY7x85XL/GGFLrJbNy4UVipi 5Vo4kYyufdx01igeW/6OTx36cDD+eLNHnz8rDt5YyK1ceGN8ulQrADhGTpHvTOQsDOFQ ia1mmWgRcV1h1RAJfKoNhVARKq6FyTebZ3Mj3ApNwEQ3J5iyRRSlvNVkqDaiUObj6+3e ipPTTkgqqYf1mUCiiFH7+muFjrduN2e7/NaTw9VbsVzR48igkigsn+bMqV7QKxGSaqmg 3mQGtbAfNPP7xikg9SFEhk/0iyopSrYBiUnAVnKf05HgYnoFQKwSDAfXoPIMnV6oHDT5 Qs2w== X-Gm-Message-State: AA+aEWZrTewNPnYWxCkc0vSkjDbWRTXF45f2vrYj8knDg52pRsP/fuD4 o7fmBfTmgjtrjV+MzhH+kR77DlgaDlBmBbX80W0= X-Received: by 2002:a81:a70a:: with SMTP id e10mr1402739ywh.233.1543500137133; Thu, 29 Nov 2018 06:02:17 -0800 (PST) MIME-Version: 1.0 References: <1543495830-2644-1-git-send-email-xieyongji@baidu.com> <20181129131232.GN2131@hirez.programming.kicks-ass.net> <20181129134449.GH2149@hirez.programming.kicks-ass.net> In-Reply-To: <20181129134449.GH2149@hirez.programming.kicks-ass.net> From: Yongji Xie Date: Thu, 29 Nov 2018 22:02:03 +0800 Message-ID: Subject: Re: [RFC] locking/rwsem: Avoid issuing wakeup before setting the reader waiter to nil To: peterz@infradead.org Cc: mingo@redhat.com, will.deacon@arm.com, linux-kernel@vger.kernel.org, xieyongji@baidu.com, zhangyu31@baidu.com, liuqi16@baidu.com, yuanlinsi01@baidu.com, nixun@baidu.com, lilin24@baidu.com, dave@stgolabs.net, longman@redhat.com, tglx@linutronix.de Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 29 Nov 2018 at 21:45, Peter Zijlstra wrote: > > On Thu, Nov 29, 2018 at 02:12:32PM +0100, Peter Zijlstra wrote: > > > > +Cc davidlohr and waiman > > > Urgh; so the case where the cmpxchg() fails because it already has a > > wakeup in progress, which then 'violates' our expectation of when the > > wakeup happens. > > > > Yes, I think this is real, and worse, I think we need to go audit all > > wake_q_add() users and document this behaviour. > > > > In the ideal case we'd delay the actual wakeup to the last wake_up_q(), > > but I don't think we can easily fix that. > > See commit: 1d0dcb3ad9d3 ("futex: Implement lockless wakeups"), I think > that introduces the exact same bug. > Hmm...Yes, even the thread may be in futex's wake_q and lead to rwsem's wakeup missing. Seems like fix this problem casy by case and document the behaviour is easier than delay the actual wakeup to the last wake_up_q()... Thanks, Yongji > Something like the below perhaps, altough this pattern seems to want a > wake_a_add() variant that already assumes get_task_struct(). > > diff --git a/kernel/futex.c b/kernel/futex.c > index f423f9b6577e..d14971f6ed3d 100644 > --- a/kernel/futex.c > +++ b/kernel/futex.c > @@ -1387,11 +1387,7 @@ static void mark_wake_futex(struct wake_q_head *wake_q, struct futex_q *q) > if (WARN(q->pi_state || q->rt_waiter, "refusing to wake PI futex\n")) > return; > > - /* > - * Queue the task for later wakeup for after we've released > - * the hb->lock. wake_q_add() grabs reference to p. > - */ > - wake_q_add(wake_q, p); > + get_task_struct(p); > __unqueue_futex(q); > /* > * The waiting task can free the futex_q as soon as q->lock_ptr = NULL > @@ -1401,6 +1397,13 @@ static void mark_wake_futex(struct wake_q_head *wake_q, struct futex_q *q) > * plist_del in __unqueue_futex(). > */ > smp_store_release(&q->lock_ptr, NULL); > + > + /* > + * Queue the task for later wakeup for after we've released > + * the hb->lock. wake_q_add() grabs reference to p. > + */ > + wake_q_add(wake_q, p); > + put_task_struct(p); > } > > /*