2024-04-22 08:05:22

by Yuxuan Hu

Subject: [PATCH] nfc: pn533: Fix null-ptr-deref in pn533_recv_frame()

From: Yuxuan Hu <[email protected]>

Our fuzzing tool found a null-ptr-deref in function pn533_recv_frame
(/drivers/nfc/pn533/pn533.c) in kernel 6.8.

(1.1) When the NFC_CMD_START_POLL command is executed via netlink, the
pn533_send_cmd_async function (/drivers/nfc/pn533/pn533.c: 1714) is
called, which sends the PN533_CMD_IN_AUTOPOLL command packet.

(2.1) If a pn533 response frame that does not match the command is
received, the following call sequence is executed:
pn533_recv_frame (/drivers/nfc/pn533/pn533.c: 2165)
pn533_rx_frame_is_cmd_response (/drivers/nfc/pn533/pn533.c: 2194)
pn533_wq_cmd_complete (/drivers/nfc/pn533/pn533.c: 2022)
pn533_send_async_complete (/drivers/nfc/pn533/pn533.c: 547)
pn533_autopoll_complete (/drivers/nfc/pn533/pn533.c: 414)

(2.2) After completing (2.1), dev->cmd is freed and set to null
(/drivers/nfc/pn533/pn533.c: 432-433).

(3.1) If another incorrect pn533 response frame is received while the
above is still in progress, (2.1) and (2.2) run concurrently for the two
frames. The first one setting dev->cmd to null then causes the second
one to trigger a null-ptr-deref in pn533_recv_frame.

Although pn533_recv_frame checks for dev->cmd at the beginning, it is
possible that dev->cmd is set to null after the check.
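
The window between the check and the later dereference can be modelled
in userspace. The following is a minimal single-threaded sketch, not
driver code: struct cmd, struct dev, complete_cmd() and
recv_frame_model() are hypothetical stand-ins, and the interleave hook
simply runs the completion at the point where a concurrent worker could
run.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical userspace stand-ins for the driver structures. */
struct cmd { int status; };
struct dev { struct cmd *cmd; };

/* Models the completion path: free the command, then clear the pointer. */
static void complete_cmd(struct dev *d)
{
	assert(d != NULL);
	free(d->cmd);
	d->cmd = NULL;
}

/*
 * Models the receive path. `interleave` runs between the early check and
 * the later use, standing in for a concurrently running completion.
 * Returns 0 if the status write is safe, -1 if it would have faulted.
 */
static int recv_frame_model(struct dev *d, void (*interleave)(struct dev *))
{
	assert(d != NULL);
	if (!d->cmd)		/* early check at the top of the function */
		return 0;
	if (interleave)
		interleave(d);	/* completion runs inside the window */
	if (!d->cmd)		/* the unpatched code does not re-check here */
		return -1;
	d->cmd->status = -5;	/* -EIO */
	return 0;
}
```

Calling recv_frame_model(&d, NULL) performs the write, while
recv_frame_model(&d, complete_cmd) reports that the pointer vanished
after the early check had already passed.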

In our testing, this race is triggered with high probability and needs
to be fixed.

Kernel print messages when null-ptr-deref is triggered (including PN533
packets, PN533 module errors, and KASAN reports) are as follows.
We added printk of the data packets, and printk before the relevant steps
in pn533_send_async_complete and pn533_recv_frame.

-> 00 00 FF 08 F8 D4 60 FF 03 00 11 12 04 A3 00
<- 00 00 FF 00 FF 00
<- 00 00 FF 0E F2 D5 86 01 10 09 01 00 20 08 04 9B 2C EE 9F 0A 00
tty tty60: NFC: It it not the response to the last command
arted polling nfc device
<- 00 00 FF 03 FD D5 41 00 EA 00
tty tty60: NFC: pn533_autopoll_complete autopoll complete error -5
tty tty60: NFC: It it not the response to the last command
tty tty60: NFC: Error -5 when running autopoll
tty tty60: NFC: autopoll operation has been stopped
pn533_send_async_complete: set dev->cmd to null!!!
pn533_recv_frame: dev->cmd is null!!!
BUG: kernel NULL pointer dereference, address: 0000000000000014
#PF: supervisor write access in kernel mode
#PF: error_code(0x0002) - not-present page
PGD 0 P4D 0
Oops: 0002 [#1] PREEMPT SMP KASAN PTI
CPU: 0 PID: 5541 Comm: kworker/0:0 Tainted: G O 6.8.0 #1
Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.13.0-1ubuntu1.1 04/01/2014
Workqueue: events nfcvirt_recv_work [nfcvirt]
RIP: 0010:pn533_recv_frame+0x18a/0x1e0 [pn533]
Code: 43 ff ff ff 48 8b bb 80 02 00 00 48 c7 c6 0b 02 46 c0 31 c0 e8 97 64 4f c4 48 83 bb b0 01 00 00 00 74 3f 48 8b 83 b0 01 00 00 <c7> 40 14 fb ff ff ff 48 8b 83 b0 01 00 00 48 85 c0 0f 85 3b ff ff
RSP: 0018:ffff88802665fc68 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff88804a13b800 RCX: ad381de3b3cd5e00
RDX: 1ffff11004ccbf38 RSI: 0000000000000008 RDI: ffff88802665f9e0
RBP: ffff88804fb25000 R08: ffff88802665f9e7 R09: 1ffff11004ccbf3c
R10: dffffc0000000000 R11: ffffed1004ccbf3d R12: 0000000000001950
R13: ffff88804ab80000 R14: ffff888021d22640 R15: ffff88802665fcb0
FS: 0000000000000000(0000) GS:ffff88806d200000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000014 CR3: 0000000004ebc000 CR4: 00000000000006f0
Call Trace:
<TASK>
? __die_body+0x62/0xb0
? page_fault_oops+0x421/0x740
? kernelmode_fixup_or_oops+0x1d0/0x1d0
? asan.module_ctor+0x10/0x10
? vprintk_emit+0x3f0/0x3f0
? kernelmode_fixup_or_oops+0x163/0x1d0
? do_user_addr_fault+0xb6c/0xde0
? irq_work_queue+0x54/0xa0
? do_kern_addr_fault+0x160/0x160
? __call_rcu_common+0x518/0xc30
? _dev_err+0x106/0x150
? exc_page_fault+0x66/0x1a0
? asm_exc_page_fault+0x22/0x30
? pn533_recv_frame+0x18a/0x1e0 [pn533]
? pn533_recv_frame+0x1d7/0x1e0 [pn533]
nfcvirt_recv_work+0x24e/0x320 [nfcvirt]
? wake_bit_function+0x230/0x230
process_one_work+0x4f0/0xab0
worker_thread+0x8af/0xee0
? process_one_work+0xab0/0xab0
kthread+0x275/0x300
? process_one_work+0xab0/0xab0
? kthread_blkcg+0xa0/0xa0
ret_from_fork+0x30/0x60
? kthread_blkcg+0xa0/0xa0
ret_from_fork_asm+0x11/0x20
</TASK>
Modules linked in: nfcvirt(O) pn533(O) nfc(O) ki_coverage(O) [last unloaded: pn533(O)]
CR2: 0000000000000014
---[ end trace 0000000000000000 ]---
RIP: 0010:pn533_recv_frame+0x18a/0x1e0 [pn533]
Code: 43 ff ff ff 48 8b bb 80 02 00 00 48 c7 c6 0b 02 46 c0 31 c0 e8 97 64 4f c4 48 83 bb b0 01 00 00 00 74 3f 48 8b 83 b0 01 00 00 <c7> 40 14 fb ff ff ff 48 8b 83 b0 01 00 00 48 85 c0 0f 85 3b ff ff
RSP: 0018:ffff88802665fc68 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff88804a13b800 RCX: ad381de3b3cd5e00
RDX: 1ffff11004ccbf38 RSI: 0000000000000008 RDI: ffff88802665f9e0
RBP: ffff88804fb25000 R08: ffff88802665f9e7 R09: 1ffff11004ccbf3c
R10: dffffc0000000000 R11: ffffed1004ccbf3d R12: 0000000000001950
R13: ffff88804ab80000 R14: ffff888021d22640 R15: ffff88802665fcb0
FS: 0000000000000000(0000) GS:ffff88806d200000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000014 CR3: 0000000004ebc000 CR4: 00000000000006f0

Signed-off-by: Yuxuan Hu <[email protected]>
---
drivers/nfc/pn533/pn533.c | 4 ++++
1 file changed, 4 insertions(+)

diff --git a/drivers/nfc/pn533/pn533.c b/drivers/nfc/pn533/pn533.c
index b19c39dcfbd9..a80a23332f59 100644
--- a/drivers/nfc/pn533/pn533.c
+++ b/drivers/nfc/pn533/pn533.c
@@ -2190,9 +2190,13 @@ void pn533_recv_frame(struct pn533 *dev, struct sk_buff *skb, int status)

 	if (!dev->ops->rx_is_frame_valid(skb->data, dev)) {
 		nfc_err(dev->dev, "Received an invalid frame\n");
+		if (!dev->cmd)
+			goto sched_wq;
 		dev->cmd->status = -EIO;
 	} else if (!pn533_rx_frame_is_cmd_response(dev, skb->data)) {
 		nfc_err(dev->dev, "It it not the response to the last command\n");
+		if (!dev->cmd)
+			goto sched_wq;
 		dev->cmd->status = -EIO;
 	}

--
2.25.1



2024-04-24 05:37:19

by Krzysztof Kozlowski

Subject: Re: [PATCH] nfc: pn533: Fix null-ptr-deref in pn533_recv_frame()

On 22/04/2024 10:04, Yuxuan Hu wrote:
> From: Yuxuan Hu <[email protected]>
>
> Our fuzzing tool found a null-ptr-deref in function pn533_recv_frame
> (/drivers/nfc/pn533/pn533.c) in kernel 6.8.
>
> (1.1) When the NFC_CMD_START_POLL command is executed via netlink, the
> pn533_send_cmd_async function (/drivers/nfc/pn533/pn533.c: 1714) is
> called, which sends the PN533_CMD_IN_AUTOPOLL command packet.
>
> (2.1) If a pn533 response frame that does not match the command is
> received, the following call sequence is executed:
> pn533_recv_frame (/drivers/nfc/pn533/pn533.c: 2165)
> pn533_rx_frame_is_cmd_response (/drivers/nfc/pn533/pn533.c: 2194)
> pn533_wq_cmd_complete (/drivers/nfc/pn533/pn533.c: 2022)
> pn533_send_async_complete (/drivers/nfc/pn533/pn533.c: 547)
> pn533_autopoll_complete (/drivers/nfc/pn533/pn533.c: 414)
>
> (2.2) After completing (2.1), dev->cmd is freed and set to null
> (/drivers/nfc/pn533/pn533.c: 432-433).
>
> (3.1) If another incorrect pn533 response frame is received while the
> above is still in progress, (2.1) and (2.2) run concurrently for the two
> frames. The first one setting dev->cmd to null then causes the second
> one to trigger a null-ptr-deref in pn533_recv_frame.
>
> Although pn533_recv_frame checks for dev->cmd at the beginning, it is
> possible that dev->cmd is set to null after the check.

That sounds reasonable... but the solution does not.

>
> In our testing, this race is triggered with high probability and needs
> to be fixed.
>
> Kernel print messages when null-ptr-deref is triggered (including PN533
> packets, PN533 module errors, and KASAN reports) are as follows.
> We added printk of the data packets, and printk before the relevant steps
> in pn533_send_async_complete and pn533_recv_frame.
>
> -> 00 00 FF 08 F8 D4 60 FF 03 00 11 12 04 A3 00
> <- 00 00 FF 00 FF 00
> <- 00 00 FF 0E F2 D5 86 01 10 09 01 00 20 08 04 9B 2C EE 9F 0A 00
> tty tty60: NFC: It it not the response to the last command
> arted polling nfc device
> <- 00 00 FF 03 FD D5 41 00 EA 00
> tty tty60: NFC: pn533_autopoll_complete autopoll complete error -5
> tty tty60: NFC: It it not the response to the last command
> tty tty60: NFC: Error -5 when running autopoll
> tty tty60: NFC: autopoll operation has been stopped
> pn533_send_async_complete: set dev->cmd to null!!!
> pn533_recv_frame: dev->cmd is null!!!
> BUG: kernel NULL pointer dereference, address: 0000000000000014
> #PF: supervisor write access in kernel mode
> #PF: error_code(0x0002) - not-present page
> PGD 0 P4D 0
> Oops: 0002 [#1] PREEMPT SMP KASAN PTI
> CPU: 0 PID: 5541 Comm: kworker/0:0 Tainted: G O 6.8.0 #1
> Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.13.0-1ubuntu1.1 04/01/2014
> Workqueue: events nfcvirt_recv_work [nfcvirt]
> RIP: 0010:pn533_recv_frame+0x18a/0x1e0 [pn533]
> Code: 43 ff ff ff 48 8b bb 80 02 00 00 48 c7 c6 0b 02 46 c0 31 c0 e8 97 64 4f c4 48 83 bb b0 01 00 00 00 74 3f 48 8b 83 b0 01 00 00 <c7> 40 14 fb ff ff ff 48 8b 83 b0 01 00 00 48 85 c0 0f 85 3b ff ff
> RSP: 0018:ffff88802665fc68 EFLAGS: 00010246
> RAX: 0000000000000000 RBX: ffff88804a13b800 RCX: ad381de3b3cd5e00
> RDX: 1ffff11004ccbf38 RSI: 0000000000000008 RDI: ffff88802665f9e0
> RBP: ffff88804fb25000 R08: ffff88802665f9e7 R09: 1ffff11004ccbf3c
> R10: dffffc0000000000 R11: ffffed1004ccbf3d R12: 0000000000001950
> R13: ffff88804ab80000 R14: ffff888021d22640 R15: ffff88802665fcb0
> FS: 0000000000000000(0000) GS:ffff88806d200000(0000) knlGS:0000000000000000
> CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> CR2: 0000000000000014 CR3: 0000000004ebc000 CR4: 00000000000006f0

Please trim the log from irrelevant register dumps.

> Call Trace:
> <TASK>
> ? __die_body+0x62/0xb0
> ? page_fault_oops+0x421/0x740
> ? kernelmode_fixup_or_oops+0x1d0/0x1d0
> ? asan.module_ctor+0x10/0x10
> ? vprintk_emit+0x3f0/0x3f0
> ? kernelmode_fixup_or_oops+0x163/0x1d0
> ? do_user_addr_fault+0xb6c/0xde0
> ? irq_work_queue+0x54/0xa0
> ? do_kern_addr_fault+0x160/0x160
> ? __call_rcu_common+0x518/0xc30
> ? _dev_err+0x106/0x150
> ? exc_page_fault+0x66/0x1a0
> ? asm_exc_page_fault+0x22/0x30
> ? pn533_recv_frame+0x18a/0x1e0 [pn533]
> ? pn533_recv_frame+0x1d7/0x1e0 [pn533]
> nfcvirt_recv_work+0x24e/0x320 [nfcvirt]
> ? wake_bit_function+0x230/0x230
> process_one_work+0x4f0/0xab0
> worker_thread+0x8af/0xee0
> ? process_one_work+0xab0/0xab0
> kthread+0x275/0x300
> ? process_one_work+0xab0/0xab0
> ? kthread_blkcg+0xa0/0xa0
> ret_from_fork+0x30/0x60
> ? kthread_blkcg+0xa0/0xa0
> ret_from_fork_asm+0x11/0x20
> </TASK>
> Modules linked in: nfcvirt(O) pn533(O) nfc(O) ki_coverage(O) [last unloaded: pn533(O)]
> CR2: 0000000000000014
> ---[ end trace 0000000000000000 ]---
> RIP: 0010:pn533_recv_frame+0x18a/0x1e0 [pn533]
> Code: 43 ff ff ff 48 8b bb 80 02 00 00 48 c7 c6 0b 02 46 c0 31 c0 e8 97 64 4f c4 48 83 bb b0 01 00 00 00 74 3f 48 8b 83 b0 01 00 00 <c7> 40 14 fb ff ff ff 48 8b 83 b0 01 00 00 48 85 c0 0f 85 3b ff ff
> RSP: 0018:ffff88802665fc68 EFLAGS: 00010246
> RAX: 0000000000000000 RBX: ffff88804a13b800 RCX: ad381de3b3cd5e00
> RDX: 1ffff11004ccbf38 RSI: 0000000000000008 RDI: ffff88802665f9e0
> RBP: ffff88804fb25000 R08: ffff88802665f9e7 R09: 1ffff11004ccbf3c
> R10: dffffc0000000000 R11: ffffed1004ccbf3d R12: 0000000000001950
> R13: ffff88804ab80000 R14: ffff888021d22640 R15: ffff88802665fcb0
> FS: 0000000000000000(0000) GS:ffff88806d200000(0000) knlGS:0000000000000000
> CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> CR2: 0000000000000014 CR3: 0000000004ebc000 CR4: 00000000000006f0

Same here.

This just makes the commit log less readable.

>
> Signed-off-by: Yuxuan Hu <[email protected]>
> ---
> drivers/nfc/pn533/pn533.c | 4 ++++
> 1 file changed, 4 insertions(+)
>
> diff --git a/drivers/nfc/pn533/pn533.c b/drivers/nfc/pn533/pn533.c
> index b19c39dcfbd9..a80a23332f59 100644
> --- a/drivers/nfc/pn533/pn533.c
> +++ b/drivers/nfc/pn533/pn533.c
> @@ -2190,9 +2190,13 @@ void pn533_recv_frame(struct pn533 *dev, struct sk_buff *skb, int status)
>
> if (!dev->ops->rx_is_frame_valid(skb->data, dev)) {
> nfc_err(dev->dev, "Received an invalid frame\n");

Imagine here dev->cmd != NULL...

> + if (!dev->cmd)
> + goto sched_wq;

... but here it is being NULL-ified by pn533_send_async_complete(). How
does your solution prevent anything? I assume pn533_recv_frame() will be
executed in parallel to the workqueue.

A slightly better solution would be to NULL-ify dev->cmd at the beginning
of pn533_send_async_complete(), because that seems logical. The complete
callback takes ownership of dev->cmd, so why does it perform the
assignment at the end?

However, even the above would keep the race open for a short period.
Some locking would probably solve it, or checking dev->cmd in a few
places with barriers.
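
A minimal userspace sketch of the "take ownership first" idea, assuming
C11 atomics stand in for whatever primitive the driver would actually
use. struct cmd, struct dev and complete_cmd() are hypothetical names,
not the driver's. It covers only the completion side: a receive path
that already loaded the old pointer could still write through it, so
the window is narrowed rather than closed.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical userspace stand-ins for the driver structures. */
struct cmd { int status; };
struct dev { _Atomic(struct cmd *) cmd; };

/*
 * Completion path that claims the command as its very first step.
 * The exchange returns the old pointer and leaves NULL behind, so a
 * second caller sees NULL instead of a pointer that is about to be
 * freed. Returns the command status, or 1 if there was nothing to do.
 */
static int complete_cmd(struct dev *d)
{
	struct cmd *c;
	int status;

	assert(d != NULL);
	c = atomic_exchange(&d->cmd, (struct cmd *)NULL);
	if (!c)
		return 1;

	status = c->status;	/* exclusively owned from here on */
	free(c);
	return status;
}
```

Two back-to-back calls show the effect: the first returns the stored
status, the second finds nothing to complete.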

Best regards,
Krzysztof


2024-04-24 05:39:13

by Krzysztof Kozlowski

Subject: Re: [PATCH] nfc: pn533: Fix null-ptr-deref in pn533_recv_frame()

On 22/04/2024 10:04, Yuxuan Hu wrote:
> ? kthread_blkcg+0xa0/0xa0
> ret_from_fork_asm+0x11/0x20
> </TASK>
> Modules linked in: nfcvirt(O) pn533(O) nfc(O) ki_coverage(O) [last unloaded: pn533(O)]
> CR2: 0000000000000014
> ---[ end trace 0000000000000000 ]---
> RIP: 0010:pn533_recv_frame+0x18a/0x1e0 [pn533]
> Code: 43 ff ff ff 48 8b bb 80 02 00 00 48 c7 c6 0b 02 46 c0 31 c0 e8 97 64 4f c4 48 83 bb b0 01 00 00 00 74 3f 48 8b 83 b0 01 00 00 <c7> 40 14 fb ff ff ff 48 8b 83 b0 01 00 00 48 85 c0 0f 85 3b ff ff
> RSP: 0018:ffff88802665fc68 EFLAGS: 00010246
> RAX: 0000000000000000 RBX: ffff88804a13b800 RCX: ad381de3b3cd5e00
> RDX: 1ffff11004ccbf38 RSI: 0000000000000008 RDI: ffff88802665f9e0
> RBP: ffff88804fb25000 R08: ffff88802665f9e7 R09: 1ffff11004ccbf3c
> R10: dffffc0000000000 R11: ffffed1004ccbf3d R12: 0000000000001950
> R13: ffff88804ab80000 R14: ffff888021d22640 R15: ffff88802665fcb0
> FS: 0000000000000000(0000) GS:ffff88806d200000(0000) knlGS:0000000000000000
> CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> CR2: 0000000000000014 CR3: 0000000004ebc000 CR4: 00000000000006f0
>
> Signed-off-by: Yuxuan Hu <[email protected]>

Also, please provide Fixes, Cc: stable and Closes tags here.

Best regards,
Krzysztof


2024-04-24 06:08:13

by Markus Elfring

Subject: Re: [PATCH] nfc: pn533: Fix null-ptr-deref in pn533_recv_frame()

> Our fuzzing tool found a null-ptr-deref in function pn533_recv_frame
> (/drivers/nfc/pn533/pn533.c) in kernel 6.8.


Would it be nicer to use the term “null pointer dereference” in the commit message here?

Regards,
Markus

2024-04-24 14:37:13

by Yuxuan-Hu

Subject: Re: [PATCH] nfc: pn533: Fix null-ptr-deref in pn533_recv_frame()

Hi Krzysztof,
Thanks for your reply.

On 2024/4/24 13:36, Krzysztof Kozlowski wrote:
> A slightly better solution would be to NULL-ify dev->cmd at the beginning
> of pn533_send_async_complete(), because that seems logical. The complete
> callback takes ownership of dev->cmd, so why does it perform the
> assignment at the end?
>
> However, even the above would keep the race open for a short period.
> Some locking would probably solve it, or checking dev->cmd in a few
> places with barriers.
>
> Best regards,
> Krzysztof

I think adding a lock is the better solution. However, acquiring a lock
on each access to dev->cmd does not seem to be an appropriate
implementation.
Would it be appropriate to acquire a lock at the beginning of
pn533_recv_frame() and release it when pn533_wq_cmd_complete() has
finished, thus ensuring that another work item does not start while one
pn533_wq_cmd_complete() work item is still running?
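
As a rough illustration of that idea, here is a userspace sketch using a
C11 atomic_flag as a try-lock that is taken on the receive side and
released by the completion side. struct dev_model, recv_begin() and
work_done() are hypothetical names. The try-lock form is an assumption
made for the sketch, since whether the receive path is allowed to block
has not been established here.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical model: one "busy" flag guards a whole receive+complete cycle. */
struct dev_model {
	atomic_flag busy;
	int completed;		/* number of finished completion cycles */
};

/*
 * Start of the receive path. Returns true if this caller acquired the
 * cycle, false if a previous frame's completion is still outstanding.
 */
static bool recv_begin(struct dev_model *d)
{
	assert(d != NULL);
	return !atomic_flag_test_and_set(&d->busy);
}

/* End of the completion work: account for it, then release the cycle. */
static void work_done(struct dev_model *d)
{
	assert(d != NULL);
	d->completed++;
	atomic_flag_clear(&d->busy);
}
```

The release happens in a different context than the acquire, which is
why the sketch uses a flag rather than a mutex: a mutex is expected to
be unlocked by the context that locked it.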

Best regards,
Yuxuan


2024-04-24 18:13:50

by Krzysztof Kozlowski

Subject: Re: [PATCH] nfc: pn533: Fix null-ptr-deref in pn533_recv_frame()

On 24/04/2024 16:00, Yuxuan-Hu wrote:
> Hi Krzysztof,
> Thanks for your reply.
>
> On 2024/4/24 13:36, Krzysztof Kozlowski wrote:
>> A slightly better solution would be to NULL-ify dev->cmd at the beginning
>> of pn533_send_async_complete(), because that seems logical. The complete
>> callback takes ownership of dev->cmd, so why does it perform the
>> assignment at the end?
>>
>> However, even the above would keep the race open for a short period.
>> Some locking would probably solve it, or checking dev->cmd in a few
>> places with barriers.
>>
>> Best regards,
>> Krzysztof
>
> I think adding a lock is the better solution. However, acquiring a lock
> on each access to dev->cmd does not seem to be an appropriate
> implementation.
> Would it be appropriate to acquire a lock at the beginning of
> pn533_recv_frame() and release it when pn533_wq_cmd_complete() has
> finished, thus ensuring that another work item does not start while one
> pn533_wq_cmd_complete() work item is still running?

I don't know this driver well enough, but it currently does not have any
locking around this, so it assumes that pn533_recv_frame() cannot start
concurrently, before the complete callback finishes.

This could be solved, if it makes sense.

However, maybe the following steps would be enough:
1. Move the contents of dev->cmd to another member in dev (e.g.
dev->cmd_in_progress), first checking whether that member is empty (then
maybe wait? or return -EBUSY?)
2. dev->cmd = NULL
3. queue_work (see also the explanation of barriers there)
4. The cmd_complete_work work item takes ownership of
dev->cmd_in_progress and NULL-ifies it at the end of the work + barrier.
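
The four steps could look roughly like this in a userspace model. This
is only a sketch under assumptions: C11 atomics replace the kernel
barriers, the -EBUSY variant of step 1 is picked, a single receive
context at a time is assumed, and recv_hand_off() and worker() are
hypothetical names. Only cmd_in_progress comes from the steps above.

```c
#include <assert.h>
#include <errno.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical userspace stand-ins for the driver structures. */
struct cmd { int status; };
struct dev {
	_Atomic(struct cmd *) cmd;		/* command awaiting a response */
	_Atomic(struct cmd *) cmd_in_progress;	/* command owned by the worker */
};

/*
 * Receive side. Returns 1 if a command was handed to the worker, 0 if
 * there was no command, -EBUSY if the previous completion is unfinished.
 */
static int recv_hand_off(struct dev *d, int status)
{
	struct cmd *c;

	assert(d != NULL);
	if (atomic_load(&d->cmd_in_progress))
		return -EBUSY;				/* step 1: slot not empty */
	c = atomic_exchange(&d->cmd, (struct cmd *)NULL); /* steps 1-2: move out, clear */
	if (!c)
		return 0;
	c->status = status;
	/* Sequentially consistent store: the status write is visible first. */
	atomic_store(&d->cmd_in_progress, c);
	/* step 3: the driver would call queue_work() at this point */
	return 1;
}

/* Worker side (step 4). Returns the status, or 1 if nothing was pending. */
static int worker(struct dev *d)
{
	struct cmd *c;
	int status;

	assert(d != NULL);
	c = atomic_load(&d->cmd_in_progress);
	if (!c)
		return 1;
	status = c->status;
	free(c);
	atomic_store(&d->cmd_in_progress, (struct cmd *)NULL); /* clear at the end */
	return status;
}
```

With this split, a second frame arriving before worker() has run gets
-EBUSY instead of touching a command that is being freed.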

Best regards,
Krzysztof