2013-09-03 14:00:31

by Manfred Spraul

[permalink] [raw]
Subject: [PATCH] ipc/msg.c: Fix lost wakeup in msgsnd().

The check if the queue is full and adding current to the wait queue of pending
msgsnd() operations (ss_add()) must be atomic.

Otherwise:
- the thread that performs msgsnd() finds a full queue and decides to sleep.
- the thread that performs msgrcv() calls first reads all messages from the
queue and then sleep, because the queue is empty.
- the msgrcv() calls do not perform any wakeups, because the msgsnd() task
has not yet called ss_add().
- then the msgsnd()-thread first calls ss_add() and then sleeps.
Net result: msgsnd() and msgrcv() both sleep forever.

Observed with msgctl08 from ltp with a preemptible kernel.

Fix: Call ipc_lock_object() before performing the check.

The patch also moves security_msg_queue_msgsnd() under ipc_lock_object:
- msgctl(IPC_SET) explicitely mentions that it tries to expunge any pending
operations that are not allowed anymore with the new permissions.
If security_msg_queue_msgsnd() is called without locks, then there might be
races.
- it makes the patch much simpler.

Reported-by: Vineet Gupta <[email protected]>
Signed-off-by: Manfred Spraul <[email protected]>
---
ipc/msg.c | 12 +++++-------
1 file changed, 5 insertions(+), 7 deletions(-)

diff --git a/ipc/msg.c b/ipc/msg.c
index 9f29d9e..b65fdf1 100644
--- a/ipc/msg.c
+++ b/ipc/msg.c
@@ -680,16 +680,18 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
goto out_unlock1;
}

+ ipc_lock_object(&msq->q_perm);
+
for (;;) {
struct msg_sender s;

err = -EACCES;
if (ipcperms(ns, &msq->q_perm, S_IWUGO))
- goto out_unlock1;
+ goto out_unlock0;

err = security_msg_queue_msgsnd(msq, msg, msgflg);
if (err)
- goto out_unlock1;
+ goto out_unlock0;

if (msgsz + msq->q_cbytes <= msq->q_qbytes &&
1 + msq->q_qnum <= msq->q_qbytes) {
@@ -699,10 +701,9 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
/* queue full, wait: */
if (msgflg & IPC_NOWAIT) {
err = -EAGAIN;
- goto out_unlock1;
+ goto out_unlock0;
}

- ipc_lock_object(&msq->q_perm);
ss_add(msq, &s);

if (!ipc_rcu_getref(msq)) {
@@ -730,10 +731,7 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
goto out_unlock0;
}

- ipc_unlock_object(&msq->q_perm);
}
-
- ipc_lock_object(&msq->q_perm);
msq->q_lspid = task_tgid_vnr(current);
msq->q_stime = get_seconds();

--
1.8.3.1


2013-09-03 15:09:07

by Rik van Riel

[permalink] [raw]
Subject: Re: [PATCH] ipc/msg.c: Fix lost wakeup in msgsnd().

On 09/03/2013 10:00 AM, Manfred Spraul wrote:
> The check if the queue is full and adding current to the wait queue of pending
> msgsnd() operations (ss_add()) must be atomic.
>
> Otherwise:
> - the thread that performs msgsnd() finds a full queue and decides to sleep.
> - the thread that performs msgrcv() calls first reads all messages from the
> queue and then sleep, because the queue is empty.
> - the msgrcv() calls do not perform any wakeups, because the msgsnd() task
> has not yet called ss_add().
> - then the msgsnd()-thread first calls ss_add() and then sleeps.
> Net result: msgsnd() and msgrcv() both sleep forever.
>
> Observed with msgctl08 from ltp with a preemptible kernel.
>
> Fix: Call ipc_lock_object() before performing the check.
>
> The patch also moves security_msg_queue_msgsnd() under ipc_lock_object:
> - msgctl(IPC_SET) explicitely mentions that it tries to expunge any pending
> operations that are not allowed anymore with the new permissions.
> If security_msg_queue_msgsnd() is called without locks, then there might be
> races.
> - it makes the patch much simpler.
>
> Reported-by: Vineet Gupta <[email protected]>
> Signed-off-by: Manfred Spraul <[email protected]>

Acked-by: Rik van Riel <[email protected]>


--
All rights reversed

2013-09-03 16:13:48

by Sedat Dilek

[permalink] [raw]
Subject: Re: [PATCH] ipc/msg.c: Fix lost wakeup in msgsnd().

On Tue, Sep 3, 2013 at 4:00 PM, Manfred Spraul <[email protected]> wrote:
> The check if the queue is full and adding current to the wait queue of pending
> msgsnd() operations (ss_add()) must be atomic.
>
> Otherwise:
> - the thread that performs msgsnd() finds a full queue and decides to sleep.
> - the thread that performs msgrcv() calls first reads all messages from the
> queue and then sleep, because the queue is empty.

reads -> sleeps

> - the msgrcv() calls do not perform any wakeups, because the msgsnd() task
> has not yet called ss_add().
> - then the msgsnd()-thread first calls ss_add() and then sleeps.
> Net result: msgsnd() and msgrcv() both sleep forever.
>

I don't know what and why "net result" - net in sense of networking?

> Observed with msgctl08 from ltp with a preemptible kernel.
>

...on ARC arch (that sounds funny somehow).

> Fix: Call ipc_lock_object() before performing the check.
>
> The patch also moves security_msg_queue_msgsnd() under ipc_lock_object:
> - msgctl(IPC_SET) explicitely mentions that it tries to expunge any pending
> operations that are not allowed anymore with the new permissions.
> If security_msg_queue_msgsnd() is called without locks, then there might be
> races.
> - it makes the patch much simpler.
>
> Reported-by: Vineet Gupta <[email protected]>
> Signed-off-by: Manfred Spraul <[email protected]>

I guess this is missing a "CC: stable" as Vineet reported against
Linux v3.11-rc7 (and should enter v3.11.1)?

- Sedat -

> ---
> ipc/msg.c | 12 +++++-------
> 1 file changed, 5 insertions(+), 7 deletions(-)
>
> diff --git a/ipc/msg.c b/ipc/msg.c
> index 9f29d9e..b65fdf1 100644
> --- a/ipc/msg.c
> +++ b/ipc/msg.c
> @@ -680,16 +680,18 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
> goto out_unlock1;
> }
>
> + ipc_lock_object(&msq->q_perm);
> +
> for (;;) {
> struct msg_sender s;
>
> err = -EACCES;
> if (ipcperms(ns, &msq->q_perm, S_IWUGO))
> - goto out_unlock1;
> + goto out_unlock0;
>
> err = security_msg_queue_msgsnd(msq, msg, msgflg);
> if (err)
> - goto out_unlock1;
> + goto out_unlock0;
>
> if (msgsz + msq->q_cbytes <= msq->q_qbytes &&
> 1 + msq->q_qnum <= msq->q_qbytes) {
> @@ -699,10 +701,9 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
> /* queue full, wait: */
> if (msgflg & IPC_NOWAIT) {
> err = -EAGAIN;
> - goto out_unlock1;
> + goto out_unlock0;
> }
>
> - ipc_lock_object(&msq->q_perm);
> ss_add(msq, &s);
>
> if (!ipc_rcu_getref(msq)) {
> @@ -730,10 +731,7 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
> goto out_unlock0;
> }
>
> - ipc_unlock_object(&msq->q_perm);
> }
> -
> - ipc_lock_object(&msq->q_perm);
> msq->q_lspid = task_tgid_vnr(current);
> msq->q_stime = get_seconds();
>
> --
> 1.8.3.1
>

2013-09-03 16:33:18

by Manfred Spraul

[permalink] [raw]
Subject: Re: [PATCH] ipc/msg.c: Fix lost wakeup in msgsnd().

Hi Sedat,

On 09/03/2013 06:13 PM, Sedat Dilek wrote:
> On Tue, Sep 3, 2013 at 4:00 PM, Manfred Spraul <[email protected]> wrote:
>> The check if the queue is full and adding current to the wait queue of pending
>> msgsnd() operations (ss_add()) must be atomic.
>>
>> Otherwise:
>> - the thread that performs msgsnd() finds a full queue and decides to sleep.
>> - the thread that performs msgrcv() calls first reads all messages from the
>> queue and then sleep, because the queue is empty.
> reads -> sleeps
Correct.
>> - the msgrcv() calls do not perform any wakeups, because the msgsnd() task
>> has not yet called ss_add().
>> - then the msgsnd()-thread first calls ss_add() and then sleeps.
>> Net result: msgsnd() and msgrcv() both sleep forever.
>>
> I don't know what and why "net result" - net in sense of networking?
http://en.wiktionary.org/wiki/net#Adjective
I.e.: Ignore/remove the "Net".

>> Observed with msgctl08 from ltp with a preemptible kernel.
>>
> ...on ARC arch (that sounds funny somehow).
>
>> Fix: Call ipc_lock_object() before performing the check.
>>
>> The patch also moves security_msg_queue_msgsnd() under ipc_lock_object:
>> - msgctl(IPC_SET) explicitely mentions that it tries to expunge any pending
>> operations that are not allowed anymore with the new permissions.
>> If security_msg_queue_msgsnd() is called without locks, then there might be
>> races.
>> - it makes the patch much simpler.
>>
>> Reported-by: Vineet Gupta <[email protected]>
>> Signed-off-by: Manfred Spraul <[email protected]>
> I guess this is missing a "CC: stable" as Vineet reported against
> Linux v3.11-rc7 (and should enter v3.11.1)?
Yes. I didn't notice that Linus already released 3.11.

--
Manfred
> - Sedat -
>
>> ---
>> ipc/msg.c | 12 +++++-------
>> 1 file changed, 5 insertions(+), 7 deletions(-)
>>
>> diff --git a/ipc/msg.c b/ipc/msg.c
>> index 9f29d9e..b65fdf1 100644
>> --- a/ipc/msg.c
>> +++ b/ipc/msg.c
>> @@ -680,16 +680,18 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
>> goto out_unlock1;
>> }
>>
>> + ipc_lock_object(&msq->q_perm);
>> +
>> for (;;) {
>> struct msg_sender s;
>>
>> err = -EACCES;
>> if (ipcperms(ns, &msq->q_perm, S_IWUGO))
>> - goto out_unlock1;
>> + goto out_unlock0;
>>
>> err = security_msg_queue_msgsnd(msq, msg, msgflg);
>> if (err)
>> - goto out_unlock1;
>> + goto out_unlock0;
>>
>> if (msgsz + msq->q_cbytes <= msq->q_qbytes &&
>> 1 + msq->q_qnum <= msq->q_qbytes) {
>> @@ -699,10 +701,9 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
>> /* queue full, wait: */
>> if (msgflg & IPC_NOWAIT) {
>> err = -EAGAIN;
>> - goto out_unlock1;
>> + goto out_unlock0;
>> }
>>
>> - ipc_lock_object(&msq->q_perm);
>> ss_add(msq, &s);
>>
>> if (!ipc_rcu_getref(msq)) {
>> @@ -730,10 +731,7 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
>> goto out_unlock0;
>> }
>>
>> - ipc_unlock_object(&msq->q_perm);
>> }
>> -
>> - ipc_lock_object(&msq->q_perm);
>> msq->q_lspid = task_tgid_vnr(current);
>> msq->q_stime = get_seconds();
>>
>> --
>> 1.8.3.1
>>

2013-09-03 18:07:13

by Davidlohr Bueso

[permalink] [raw]
Subject: Re: [PATCH] ipc/msg.c: Fix lost wakeup in msgsnd().

On Tue, 2013-09-03 at 16:00 +0200, Manfred Spraul wrote:
> The check if the queue is full and adding current to the wait queue of pending
> msgsnd() operations (ss_add()) must be atomic.
>
> Otherwise:
> - the thread that performs msgsnd() finds a full queue and decides to sleep.
> - the thread that performs msgrcv() calls first reads all messages from the
> queue and then sleep, because the queue is empty.
> - the msgrcv() calls do not perform any wakeups, because the msgsnd() task
> has not yet called ss_add().
> - then the msgsnd()-thread first calls ss_add() and then sleeps.
> Net result: msgsnd() and msgrcv() both sleep forever.
>
> Observed with msgctl08 from ltp with a preemptible kernel.

Good catch, thanks for looking into this Manfred.

FWIW similar changes that aim at reducing the kern_ipc_perm.lock
contention in shm have already been in linux-next for a good while and
should be going into 3.12. While both Sedat and I have tested them
through LTP, I will keep an eye open for regressions so that we don't
run into issues like this, late in the release cycle.

>
> Fix: Call ipc_lock_object() before performing the check.
>
> The patch also moves security_msg_queue_msgsnd() under ipc_lock_object:
> - msgctl(IPC_SET) explicitely mentions that it tries to expunge any pending
> operations that are not allowed anymore with the new permissions.
> If security_msg_queue_msgsnd() is called without locks, then there might be
> races.

Right.

> - it makes the patch much simpler.
>
> Reported-by: Vineet Gupta <[email protected]>
> Signed-off-by: Manfred Spraul <[email protected]>

Signed-off-by: Davidlohr Bueso <[email protected]>

> ---
> ipc/msg.c | 12 +++++-------
> 1 file changed, 5 insertions(+), 7 deletions(-)
>
> diff --git a/ipc/msg.c b/ipc/msg.c
> index 9f29d9e..b65fdf1 100644
> --- a/ipc/msg.c
> +++ b/ipc/msg.c
> @@ -680,16 +680,18 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
> goto out_unlock1;
> }
>
> + ipc_lock_object(&msq->q_perm);
> +
> for (;;) {
> struct msg_sender s;
>
> err = -EACCES;
> if (ipcperms(ns, &msq->q_perm, S_IWUGO))
> - goto out_unlock1;
> + goto out_unlock0;
>
> err = security_msg_queue_msgsnd(msq, msg, msgflg);
> if (err)
> - goto out_unlock1;
> + goto out_unlock0;
>
> if (msgsz + msq->q_cbytes <= msq->q_qbytes &&
> 1 + msq->q_qnum <= msq->q_qbytes) {
> @@ -699,10 +701,9 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
> /* queue full, wait: */
> if (msgflg & IPC_NOWAIT) {
> err = -EAGAIN;
> - goto out_unlock1;
> + goto out_unlock0;
> }
>
> - ipc_lock_object(&msq->q_perm);
> ss_add(msq, &s);
>
> if (!ipc_rcu_getref(msq)) {
> @@ -730,10 +731,7 @@ long do_msgsnd(int msqid, long mtype, void __user *mtext,
> goto out_unlock0;
> }
>
> - ipc_unlock_object(&msq->q_perm);
> }
> -
> - ipc_lock_object(&msq->q_perm);
> msq->q_lspid = task_tgid_vnr(current);
> msq->q_stime = get_seconds();
>

2013-09-03 21:56:26

by Sedat Dilek

[permalink] [raw]
Subject: Re: [PATCH] ipc/msg.c: Fix lost wakeup in msgsnd().

On Tue, Sep 3, 2013 at 6:33 PM, Manfred Spraul <[email protected]> wrote:
> Hi Sedat,
>
>
> On 09/03/2013 06:13 PM, Sedat Dilek wrote:
>>
>> On Tue, Sep 3, 2013 at 4:00 PM, Manfred Spraul <[email protected]>
>> wrote:
>>>
>>> The check if the queue is full and adding current to the wait queue of
>>> pending
>>> msgsnd() operations (ss_add()) must be atomic.
>>>
>>> Otherwise:
>>> - the thread that performs msgsnd() finds a full queue and decides to
>>> sleep.
>>> - the thread that performs msgrcv() calls first reads all messages from
>>> the
>>> queue and then sleep, because the queue is empty.
>>
>> reads -> sleeps
>
> Correct.
>
>>> - the msgrcv() calls do not perform any wakeups, because the msgsnd()
>>> task
>>> has not yet called ss_add().
>>> - then the msgsnd()-thread first calls ss_add() and then sleeps.
>>> Net result: msgsnd() and msgrcv() both sleep forever.
>>>
>> I don't know what and why "net result" - net in sense of networking?
>
> http://en.wiktionary.org/wiki/net#Adjective
> I.e.: Ignore/remove the "Net".
>
>
>>> Observed with msgctl08 from ltp with a preemptible kernel.
>>>
>> ...on ARC arch (that sounds funny somehow).
>>
>>> Fix: Call ipc_lock_object() before performing the check.
>>>
>>> The patch also moves security_msg_queue_msgsnd() under ipc_lock_object:
>>> - msgctl(IPC_SET) explicitely mentions that it tries to expunge any
>>> pending
>>> operations that are not allowed anymore with the new permissions.
>>> If security_msg_queue_msgsnd() is called without locks, then there
>>> might be
>>> races.
>>> - it makes the patch much simpler.
>>>
>>> Reported-by: Vineet Gupta <[email protected]>
>>> Signed-off-by: Manfred Spraul <[email protected]>
>>
>> I guess this is missing a "CC: stable" as Vineet reported against
>> Linux v3.11-rc7 (and should enter v3.11.1)?
>
> Yes. I didn't notice that Linus already released 3.11.
>

Linus pushed your patch upstream... with typos fixed and "CC: stable #3.11".
Thanks to all involved people!

- Sedat -

[1] http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=bebcb928c820d0ee83aca4b192adc195e43e66a2

> --
> Manfred
>
>> - Sedat -
>>
>>> ---
>>> ipc/msg.c | 12 +++++-------
>>> 1 file changed, 5 insertions(+), 7 deletions(-)
>>>
>>> diff --git a/ipc/msg.c b/ipc/msg.c
>>> index 9f29d9e..b65fdf1 100644
>>> --- a/ipc/msg.c
>>> +++ b/ipc/msg.c
>>> @@ -680,16 +680,18 @@ long do_msgsnd(int msqid, long mtype, void __user
>>> *mtext,
>>> goto out_unlock1;
>>> }
>>>
>>> + ipc_lock_object(&msq->q_perm);
>>> +
>>> for (;;) {
>>> struct msg_sender s;
>>>
>>> err = -EACCES;
>>> if (ipcperms(ns, &msq->q_perm, S_IWUGO))
>>> - goto out_unlock1;
>>> + goto out_unlock0;
>>>
>>> err = security_msg_queue_msgsnd(msq, msg, msgflg);
>>> if (err)
>>> - goto out_unlock1;
>>> + goto out_unlock0;
>>>
>>> if (msgsz + msq->q_cbytes <= msq->q_qbytes &&
>>> 1 + msq->q_qnum <= msq->q_qbytes) {
>>> @@ -699,10 +701,9 @@ long do_msgsnd(int msqid, long mtype, void __user
>>> *mtext,
>>> /* queue full, wait: */
>>> if (msgflg & IPC_NOWAIT) {
>>> err = -EAGAIN;
>>> - goto out_unlock1;
>>> + goto out_unlock0;
>>> }
>>>
>>> - ipc_lock_object(&msq->q_perm);
>>> ss_add(msq, &s);
>>>
>>> if (!ipc_rcu_getref(msq)) {
>>> @@ -730,10 +731,7 @@ long do_msgsnd(int msqid, long mtype, void __user
>>> *mtext,
>>> goto out_unlock0;
>>> }
>>>
>>> - ipc_unlock_object(&msq->q_perm);
>>> }
>>> -
>>> - ipc_lock_object(&msq->q_perm);
>>> msq->q_lspid = task_tgid_vnr(current);
>>> msq->q_stime = get_seconds();
>>>
>>> --
>>> 1.8.3.1
>>>
>