LinuxLists.cc - [PATCH] time: Add locking to xtime access in get

2011-05-04 03:12:00

Subject: [PATCH] time: Add locking to xtime access in get_seconds()

From: John Stultz <[email protected]>

So get_seconds() has always been lock free, with the assumption
that accessing a long will be atomic.

However, recently I came across an odd bug where time() access could
occasionally be inconsistent, but only on power7 hardware. The
same code paths on power6 or x86 could not reproduce the issue.

After adding careful debugging checks to any xtime manipulation, and
not seeing any inconsistencies on the kernel side, I realized that
with no locking in the get_seconds path, its could be that two
sequential calls to time() could be executed out of order on newer
hardware, causing the inconsistency to appear in userland.

After adding the following locking, the issue cannot be reproduced.

Wanted to run this by the power guys to make sure the theory above
sounds sane.

CC: Paul Mackerras <[email protected]>
CC: Paul E. McKenney <[email protected]>
CC: Anton Blanchard <[email protected]>
CC: Thomas Gleixner <[email protected]>
Signed-off-by: John Stultz <[email protected]>
---
kernel/time/timekeeping.c | 10 +++++++++-
1 files changed, 9 insertions(+), 1 deletions(-)

diff --git a/kernel/time/timekeeping.c b/kernel/time/timekeeping.c
index 8ad5d57..89c7582 100644
--- a/kernel/time/timekeeping.c
+++ b/kernel/time/timekeeping.c
@@ -975,7 +975,15 @@ EXPORT_SYMBOL_GPL(monotonic_to_bootbased);

unsigned long get_seconds(void)
{
- return xtime.tv_sec;
+ unsigned long seq, now;
+
+ do {
+ seq = read_seqbegin(&xtime_lock);
+
+ now = xtime.tv_sec;
+ } while (read_seqretry(&xtime_lock, seq));
+
+ return now;
}
EXPORT_SYMBOL(get_seconds);

--
1.7.3.2.146.gca209

2011-05-04 03:52:58

by Andi Kleen

[permalink] [raw]

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

John Stultz <[email protected]> writes:

> From: John Stultz <[email protected]>
>
> So get_seconds() has always been lock free, with the assumption
> that accessing a long will be atomic.
>
> However, recently I came across an odd bug where time() access could
> occasionally be inconsistent, but only on power7 hardware. The

Shouldn't a single rmb() be enough to avoid that?

If not then I suspect there's a lot more code buggy on that CPU than
just the time.

-Andi

--
[email protected] -- Speaking for myself only

2011-05-04 16:50:45

by Max Asbock

[permalink] [raw]

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

On Tue, 2011-05-03 at 20:11 -0700, John Stultz wrote:
> From: John Stultz <[email protected]>
>
> So get_seconds() has always been lock free, with the assumption
> that accessing a long will be atomic.
>

get_seconds() is used in the x86 machine check handler and there is a
comment saying:
/* We hope get_seconds stays lockless */

This needs to be carefully looked at if locking is introduced to
get_seconds().

- Max

2011-05-04 21:05:47

by Andi Kleen

[permalink] [raw]

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Max Asbock <[email protected]> writes:

> On Tue, 2011-05-03 at 20:11 -0700, John Stultz wrote:
>> From: John Stultz <[email protected]>
>>
>> So get_seconds() has always been lock free, with the assumption
>> that accessing a long will be atomic.
>>
>
> get_seconds() is used in the x86 machine check handler and there is a
> comment saying:
> /* We hope get_seconds stays lockless */
>
> This needs to be carefully looked at if locking is introduced to
> get_seconds().

Yes the seqlock being interrupted by an MCE would deadlock.

-Andi
--
[email protected] -- Speaking for myself only

2011-05-04 23:06:05

by john stultz

[permalink] [raw]

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

On Wed, 2011-05-04 at 09:51 -0700, Max Asbock wrote:
> On Tue, 2011-05-03 at 20:11 -0700, John Stultz wrote:
> > From: John Stultz <[email protected]>
> >
> > So get_seconds() has always been lock free, with the assumption
> > that accessing a long will be atomic.
> >
>
> get_seconds() is used in the x86 machine check handler and there is a
> comment saying:
> /* We hope get_seconds stays lockless */
>
> This needs to be carefully looked at if locking is introduced to
> get_seconds().

Ah. Thanks for pointing this out Max.

I'll go ahead and use Andi's suggestion of the rmb();

Patch soon to follow.

thanks
-john

2011-05-05 02:54:59

by john stultz

[permalink] [raw]

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

On Tue, 2011-05-03 at 20:52 -0700, Andi Kleen wrote:
> John Stultz <[email protected]> writes:
>
> > From: John Stultz <[email protected]>
> >
> > So get_seconds() has always been lock free, with the assumption
> > that accessing a long will be atomic.
> >
> > However, recently I came across an odd bug where time() access could
> > occasionally be inconsistent, but only on power7 hardware. The
>
> Shouldn't a single rmb() be enough to avoid that?
>
> If not then I suspect there's a lot more code buggy on that CPU than
> just the time.

So interestingly, I've found that the issue was not as complex as I
first assumed. While the rmb() is probably a good idea for
get_seconds(), but it alone does not solve the issue I was seeing,
making it clear my theory wasn't correct.

The problem was reported against the 2.6.32-stable kernel, and had not
been seen in later kernels. I had assumed the change to logarithmic time
accumulation basically reduced the window for for the issue to be seen,
but it would likely still show up eventually.

When the rmb() alone did not solve this issue, I looked to see why the
locking did resolve it, and then it was clear: The old
update_xtime_cache() function doesn't set the xtime_cache values
atomically.

Now, the xtime_cache writing is done under the xtime_lock, so the
get_seconds() locking resolves the issue, but isn't appropriate since
get_seconds() is called from machine check handlers.

So the fix here for the 2.6.32-stable tree is to just update xtime_cache
in one go as done with the following patch.

I also added the rmb() for good measure, and the rmb() should probably
also go upstream since theoretically there maybe a platform that could
do out of order syscalls.

I suspect the reason this hasn't been triggered on x86 or power6 is due
to compiler or processor optimizations reordering the assignment to in
effect make it atomic. Or maybe the timing window to see the issue is
harder to observe?

Signed-off-by: John Stultz <[email protected]>

Index: linux-2.6.32.y/kernel/time/timekeeping.c
===================================================================
--- linux-2.6.32.y.orig/kernel/time/timekeeping.c 2011-05-04 19:34:21.604314152 -0700
+++ linux-2.6.32.y/kernel/time/timekeeping.c 2011-05-04 19:39:09.972203989 -0700
@@ -168,8 +168,10 @@ int __read_mostly timekeeping_suspended;
static struct timespec xtime_cache __attribute__ ((aligned (16)));
void update_xtime_cache(u64 nsec)
{
- xtime_cache = xtime;
- timespec_add_ns(&xtime_cache, nsec);
+ /* use temporary timespec so xtime_cache is updated atomically */
+ struct timespec ts = xtime;
+ timespec_add_ns(&ts, nsec);
+ xtime_cache = ts;
}

/* must hold xtime_lock */
@@ -859,6 +861,7 @@ EXPORT_SYMBOL_GPL(monotonic_to_bootbased

unsigned long get_seconds(void)
{
+ rmb();
return xtime_cache.tv_sec;
}
EXPORT_SYMBOL(get_seconds);

2011-05-05 05:44:12

Subject: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [PATCH] time: Add locking to xtime access in get_seconds()

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: Re: [RFC] time: xtime_lock is held too long

Subject: [PATCH] seqlock: don't smp_rmb in seqlock reader spin loop

Subject: Re: [PATCH] seqlock: don't smp_rmb in seqlock reader spin loop

Subject: Re: [PATCH] seqlock: don't smp_rmb in seqlock reader spin loop