by Rafael J. Wysocki

2008-02-24 21:42:41

by Pavel Machek

Subject: Re: [Bug 10030] Suspend doesn't work when SD card is inserted

On Sun 2008-02-24 15:33:01, Alan Stern wrote:
> On Sun, 24 Feb 2008, Pavel Machek wrote:
>
> > > > What locking protects this variable? What happens when suspending_task
> > > > exits? (Hmm, that would probably be a bug, anyway?)
> > >
> > > It's protected by whatever existing locking scheme allows only one
> > > task to start a system sleep at a time. For example, the suspending
> > > task has to get a write lock on pm_sleep_rwsem.
> >
> > And readers of suspending_task are protected by?
>
> I added a comment about that too.
>
> > At the very least, you'd need rmb() before reading it and wmb() after
> > writing to it, but I'm not sure if that's enough on every obscure
> > architecture out there.
>
> No, neither one is needed because of the way suspending_task is used.
>
> It's not necessary for a reader R to see the variable's actual value;
> all R needs to know is whether or not suspending_task is equal to R.
> Since the only process which can set suspending_task to R is R itself,
> and since R will set suspending_task back to NULL before releasing the
> write lock on pm_sleep_rwsem, there's never any ambiguity.

Subtle.

Very subtly wrong ;-).

Imagine suspending_task == 0xabcdef01. Now task "R" with current ==
0xabcd0000 reads suspending_task while the other CPU is writing to it,
and sees 0xabcd0000 (0xef01 was not yet written), and so mistakenly
believes that "R" == suspending_task.

I agree it is very unlikely, and it will not happen on i386. But what
about just using atomic_t suspending_task, and storing current->pid
into it?
Pavel
--
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
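
[Editorial illustration, not part of the original message.] The torn-read
scenario and the suggested fix can be sketched in userspace C11. This is
only an analogy, under stated assumptions: C11 atomic_int stands in for
the kernel's atomic_t, the helper names (mark_suspending, clear_suspending,
is_suspending_task, torn_value) are hypothetical, and 0 is assumed to be a
safe "nobody is suspending" value because no ordinary task has pid 0. It
does not reproduce the actual patch under discussion.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Assumption: pid 0 never belongs to a task that starts a suspend. */
#define NO_SUSPENDING_PID 0

/* Userspace analogue of "atomic_t suspending_task" holding a pid. */
static atomic_int suspending_pid = NO_SUSPENDING_PID;

/* Hypothetical helper: called by the task that starts the system sleep. */
static void mark_suspending(int pid)
{
    atomic_store(&suspending_pid, pid);
}

/* Hypothetical helper: called before the suspend lock is released. */
static void clear_suspending(void)
{
    atomic_store(&suspending_pid, NO_SUSPENDING_PID);
}

/* A reader only ever asks "is it me?". The atomic load cannot be torn. */
static bool is_suspending_task(int pid)
{
    return atomic_load(&suspending_pid) == pid;
}

/*
 * Models the torn read described above: the value a reader could observe
 * if a 32-bit store were split into two 16-bit halves and only the high
 * half of new_val had landed on top of old_val.
 */
static uint32_t torn_value(uint32_t old_val, uint32_t new_val)
{
    return (new_val & 0xffff0000u) | (old_val & 0x0000ffffu);
}
```

With old_val == 0 (NULL) and new_val == 0xabcdef01, torn_value() yields
0xabcd0000, which is exactly the value that a task whose pointer happens
to be 0xabcd0000 would mistake for itself. The atomic pid variant avoids
this because the load and store are each indivisible.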

2008-02-24 22:20:25

by Rafael J. Wysocki

On Thursday, 6 of March 2008, Zdenek Kabelac wrote:
> 2008/3/6, Alan Stern <[email protected]>:
> > On Thu, 6 Mar 2008, Pavel Machek wrote:
> >
> > > On Tue 2008-03-04 16:00:51, Alan Stern wrote:
> > > > On Tue, 4 Mar 2008, David Brownell wrote:
> > > >
> > > > > > What's wrong with a superfluous probe at resume time, besides the waste
> > > > > > of a few milliseconds?
> > > > >
> > > > > I'm more concerned with the undesirable removal of devices at suspend
> > > > > time ... ones with mounted filesystems etc.
> > > >
> > > > On that we can agree. The removal is done if the host doesn't define a
> > > > resume method. There doesn't seem to be any point to that, given that
> > > > the probing during resume will determine whether a card has in fact
> > > > been removed.
> > >
> > > Hmm, if the driver is sleeping too deeply, the user might have removed
> > > the card and put in a different one, without the driver noticing. That
> > > would be _bad_.
> >
> >
> > Ironically, the very same problem now exists with the USB mass-storage
> > driver. I don't see any way for the driver itself to solve it,
> > especially during a hibernation (which can be a _very_ deep sleep).
> >
> > One thing that could be done is for filesystems to verify, after a
> > system sleep, that their superblocks haven't changed. There could
> > still be issues with non-mounted partitions, if they have live entries
> > in the block cache, but it would be an improvement.
> >
> > Do you know the right people to mention this to? Anybody in filesystem
> > development interested in suspend/hibernation issues?

I don't really think so (but I may be wrong ...)

> IMHO the way would be to try to unmount the fs if possible. If not, the
> user should be notified on suspend/hibernation that the media must stay
> in place until resume, and after resume it should be checked and the
> user notified if different devices/fs were found...

This is a very long-standing issue which IMO can only be solved by making
filesystems suspend-aware.

Thanks,
Rafael