by Boqun Feng

[permalink] [raw]

Subject: Re: [PATCH tip v4 4/5] rcu: Do not call rcu_nocb_gp_cleanup() while holding rnp->lock

Hi Daniel,

On Tue, Nov 24, 2015 at 02:03:06PM +0100, Daniel Wagner wrote:
> rcu_nocb_gp_cleanup() is called while holding rnp->lock. Currently,
> this is okay because the wake_up_all() in rcu_nocb_gp_cleanup() will
> not enable the IRQs. lockdep is happy.
>
> By switching over using swait this is not true anymore. swake_up_all()
> enables the IRQs while processing the waiters. __do_softirq() can now
> run and will eventually call rcu_process_callbacks() which wants to
> grap nrp->lock.
>
> Let's move the rcu_nocb_gp_cleanup() call outside the lock before we
> switch over to swait.
>

But you did introduce swait in this patch ;-)

[snip]

>
> Signed-off-by: Daniel Wagner <[email protected]>
> Cc: "Paul E. McKenney" <[email protected]>
> Cc: Peter Zijlstra <[email protected]>
> Cc: Thomas Gleixner <[email protected]>
> Cc: [email protected]
> ---
> kernel/rcu/tree.c | 4 +++-
> kernel/rcu/tree.h | 3 ++-
> kernel/rcu/tree_plugin.h | 16 +++++++++++++---
> 3 files changed, 18 insertions(+), 5 deletions(-)
>

So I tried to build this patch with a config having RCU_EXPERT=y and
RCU_NOCB_CPU=y, but I got:

In file included from include/linux/completion.h:11:0,
from include/linux/rcupdate.h:43,
from include/linux/sysctl.h:25,
from include/linux/timer.h:242,
from include/linux/workqueue.h:8,
from include/linux/pm.h:25,
from ./arch/x86/include/asm/apic.h:5,
from ./arch/x86/include/asm/smp.h:12,
from include/linux/smp.h:59,
from kernel/rcu/tree.c:34:
kernel/rcu/tree_plugin.h: In function ‘rcu_nocb_gp_cleanup’:
kernel/rcu/tree_plugin.h:1782:14: warning: passing argument 1 of ‘__wake_up’ from incompatible pointer type [-Wincompatible-pointer-types]
wake_up_all(sq);
^
include/linux/wait.h:168:36: note: in definition of macro ‘wake_up_all’
#define wake_up_all(x) __wake_up(x, TASK_NORMAL, 0, NULL)

(I also attach the configure file in case you need it)

I think the reason is that you introduced swait in this patch, but
you didn't use the proper function, e.g. swake_up_all(), and also you
didn't #include <linux/swait.h> ;-)

Regards,
Boqun

> diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
> index 775d36c..952536d 100644
> --- a/kernel/rcu/tree.c
> +++ b/kernel/rcu/tree.c
> @@ -1568,7 +1568,6 @@ static int rcu_future_gp_cleanup(struct rcu_state *rsp, struct rcu_node *rnp)
> int needmore;
> struct rcu_data *rdp = this_cpu_ptr(rsp->rda);
>
> - rcu_nocb_gp_cleanup(rsp, rnp);
> rnp->need_future_gp[c & 0x1] = 0;
> needmore = rnp->need_future_gp[(c + 1) & 0x1];
> trace_rcu_future_gp(rnp, rdp, c,
> @@ -1972,6 +1971,7 @@ static void rcu_gp_cleanup(struct rcu_state *rsp)
> int nocb = 0;
> struct rcu_data *rdp;
> struct rcu_node *rnp = rcu_get_root(rsp);
> + struct swait_queue_head *sq;
>
> WRITE_ONCE(rsp->gp_activity, jiffies);
> raw_spin_lock_irq(&rnp->lock);
> @@ -2010,7 +2010,9 @@ static void rcu_gp_cleanup(struct rcu_state *rsp)
> needgp = __note_gp_changes(rsp, rnp, rdp) || needgp;
> /* smp_mb() provided by prior unlock-lock pair. */
> nocb += rcu_future_gp_cleanup(rsp, rnp);
> + sq = rcu_nocb_gp_get(rnp);
> raw_spin_unlock_irq(&rnp->lock);
> + rcu_nocb_gp_cleanup(sq);
> cond_resched_rcu_qs();
> WRITE_ONCE(rsp->gp_activity, jiffies);
> rcu_gp_slow(rsp, gp_cleanup_delay);
> diff --git a/kernel/rcu/tree.h b/kernel/rcu/tree.h
> index 2e991f8..3dcf6368 100644
> --- a/kernel/rcu/tree.h
> +++ b/kernel/rcu/tree.h
> @@ -608,7 +608,8 @@ static void zero_cpu_stall_ticks(struct rcu_data *rdp);
> static void increment_cpu_stall_ticks(void);
> static bool rcu_nocb_cpu_needs_barrier(struct rcu_state *rsp, int cpu);
> static void rcu_nocb_gp_set(struct rcu_node *rnp, int nrq);
> -static void rcu_nocb_gp_cleanup(struct rcu_state *rsp, struct rcu_node *rnp);
> +static struct swait_queue_head *rcu_nocb_gp_get(struct rcu_node *rnp);
> +static void rcu_nocb_gp_cleanup(struct swait_queue_head *sq);
> static void rcu_init_one_nocb(struct rcu_node *rnp);
> static bool __call_rcu_nocb(struct rcu_data *rdp, struct rcu_head *rhp,
> bool lazy, unsigned long flags);
> diff --git a/kernel/rcu/tree_plugin.h b/kernel/rcu/tree_plugin.h
> index b2bf396..db4f357 100644
> --- a/kernel/rcu/tree_plugin.h
> +++ b/kernel/rcu/tree_plugin.h
> @@ -1777,9 +1777,9 @@ early_param("rcu_nocb_poll", parse_rcu_nocb_poll);
> * Wake up any no-CBs CPUs' kthreads that were waiting on the just-ended
> * grace period.
> */
> -static void rcu_nocb_gp_cleanup(struct rcu_state *rsp, struct rcu_node *rnp)
> +static void rcu_nocb_gp_cleanup(struct swait_queue_head *sq)
> {
> - wake_up_all(&rnp->nocb_gp_wq[rnp->completed & 0x1]);
> + wake_up_all(sq);
> }
>
> /*
> @@ -1795,6 +1795,11 @@ static void rcu_nocb_gp_set(struct rcu_node *rnp, int nrq)
> rnp->need_future_gp[(rnp->completed + 1) & 0x1] += nrq;
> }
>
> +static struct swait_queue_head *rcu_nocb_gp_get(struct rcu_node *rnp)
> +{
> + return &rnp->nocb_gp_wq[rnp->completed & 0x1];
> +}
> +
> static void rcu_init_one_nocb(struct rcu_node *rnp)
> {
> init_waitqueue_head(&rnp->nocb_gp_wq[0]);
> @@ -2469,7 +2474,7 @@ static bool rcu_nocb_cpu_needs_barrier(struct rcu_state *rsp, int cpu)
> return false;
> }
>
> -static void rcu_nocb_gp_cleanup(struct rcu_state *rsp, struct rcu_node *rnp)
> +static void rcu_nocb_gp_cleanup(struct swait_queue_head *sq)
> {
> }
>
> @@ -2477,6 +2482,11 @@ static void rcu_nocb_gp_set(struct rcu_node *rnp, int nrq)
> {
> }
>
> +static struct swait_queue_head *rcu_nocb_gp_get(struct rcu_node *rnp)
> +{
> + return NULL;
> +}
> +
> static void rcu_init_one_nocb(struct rcu_node *rnp)
> {
> }
> --
> 2.4.3
>

Attachments:

(No filename) (0.00 B)
signature.asc (473.00 B)
Download all attachments

2015-11-25 01:02:13

On Thu, 26 Nov 2015, Daniel Wagner wrote:
> On 11/24/2015 02:03 PM, Daniel Wagner wrote:
> > The API provided by wait.h and swait.h is very similiar. Most of the
> > time your are only one character away from either of it:
> >
> > wake_up() vs swake_up()
> >
> > This is on purpose so that we do not have two nearly identical bits of
> > infrastructre code with dissimilar names.
> >
> > A compile time type check assertion ensures that obvious wrong usage
> > is caught at early stage.
>
> Obviously, this didn't really work as one can see with patch #4. That
> one just compiled. So I wrapped almost all functions to get a better
> check coverage. woken_wake_function(), autoremove_wake_function() and
> wake_bit_function() can't be wrapped easily because DEFINE_WAIT and
> friends. I just left them out.
>
> The result looks pretty bad in my opinion. Probably it would be
> better do add -Werror=incompatible-pointer-types to the CFLAGS.

That's really bad.

If we can pull off the -Werror=incompatible-pointer-types trick, that
would solve it nicely.

Thanks,

tglx