2023-06-23 21:18:48

by Julia Lawall

[permalink] [raw]
Subject: [PATCH 10/26] bus: mhi: host: use array_size

Use array_size to protect against multiplication overflows.

The changes were done using the following Coccinelle semantic patch:

// <smpl>
@@
expression E1, E2;
constant C1, C2;
identifier alloc = {vmalloc,vzalloc};
@@

(
alloc(C1 * C2,...)
|
alloc(
- (E1) * (E2)
+ array_size(E1, E2)
,...)
)
// </smpl>

Signed-off-by: Julia Lawall <[email protected]>

---
drivers/bus/mhi/host/init.c | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/drivers/bus/mhi/host/init.c b/drivers/bus/mhi/host/init.c
index f72fcb66f408..34a543a67068 100644
--- a/drivers/bus/mhi/host/init.c
+++ b/drivers/bus/mhi/host/init.c
@@ -759,8 +759,8 @@ static int parse_ch_cfg(struct mhi_controller *mhi_cntrl,
* so to avoid any memory possible allocation failures, vzalloc is
* used here
*/
- mhi_cntrl->mhi_chan = vzalloc(mhi_cntrl->max_chan *
- sizeof(*mhi_cntrl->mhi_chan));
+ mhi_cntrl->mhi_chan = vzalloc(array_size(mhi_cntrl->max_chan,
+ sizeof(*mhi_cntrl->mhi_chan)));
if (!mhi_cntrl->mhi_chan)
return -ENOMEM;




2023-06-23 21:39:57

by Jeffrey Hugo

[permalink] [raw]
Subject: Re: [PATCH 10/26] bus: mhi: host: use array_size

On 6/23/2023 3:14 PM, Julia Lawall wrote:
> Use array_size to protect against multiplication overflows.
>
> The changes were done using the following Coccinelle semantic patch:
>
> // <smpl>
> @@
> expression E1, E2;
> constant C1, C2;
> identifier alloc = {vmalloc,vzalloc};
> @@
>
> (
> alloc(C1 * C2,...)
> |
> alloc(
> - (E1) * (E2)
> + array_size(E1, E2)
> ,...)
> )
> // </smpl>
>
> Signed-off-by: Julia Lawall <[email protected]>
>
> ---
> drivers/bus/mhi/host/init.c | 4 ++--
> 1 file changed, 2 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/bus/mhi/host/init.c b/drivers/bus/mhi/host/init.c
> index f72fcb66f408..34a543a67068 100644
> --- a/drivers/bus/mhi/host/init.c
> +++ b/drivers/bus/mhi/host/init.c
> @@ -759,8 +759,8 @@ static int parse_ch_cfg(struct mhi_controller *mhi_cntrl,
> * so to avoid any memory possible allocation failures, vzalloc is
> * used here
> */
> - mhi_cntrl->mhi_chan = vzalloc(mhi_cntrl->max_chan *
> - sizeof(*mhi_cntrl->mhi_chan));
> + mhi_cntrl->mhi_chan = vzalloc(array_size(mhi_cntrl->max_chan,
> + sizeof(*mhi_cntrl->mhi_chan)));
> if (!mhi_cntrl->mhi_chan)
> return -ENOMEM;
>
>
>

This doesn't seem like a good fix.

If we've overflowed the multiplication, I don't think we should
continue, and the function should return an error. array_size() is
going to return SIZE_MAX, and it looks like it is possible that
vzalloc() may be able to allocate that successfully in some scenarios.
However, that is going to be less memory than parse_ch_cfg() expected to
allocate, so later on I expect the function will still corrupt memory -
basically the same result as what the unchecked overflow would do.

I'm not convinced the semantic patch is bringing value as I suspect most
of the code being patched is in the same situation.

-Jeff

2023-06-23 22:03:07

by Julia Lawall

[permalink] [raw]
Subject: Re: [PATCH 10/26] bus: mhi: host: use array_size



On Fri, 23 Jun 2023, Jeffrey Hugo wrote:

> On 6/23/2023 3:14 PM, Julia Lawall wrote:
> > Use array_size to protect against multiplication overflows.
> >
> > The changes were done using the following Coccinelle semantic patch:
> >
> > // <smpl>
> > @@
> > expression E1, E2;
> > constant C1, C2;
> > identifier alloc = {vmalloc,vzalloc};
> > @@
> > (
> > alloc(C1 * C2,...)
> > |
> > alloc(
> > - (E1) * (E2)
> > + array_size(E1, E2)
> > ,...)
> > )
> > // </smpl>
> >
> > Signed-off-by: Julia Lawall <[email protected]>
> >
> > ---
> > drivers/bus/mhi/host/init.c | 4 ++--
> > 1 file changed, 2 insertions(+), 2 deletions(-)
> >
> > diff --git a/drivers/bus/mhi/host/init.c b/drivers/bus/mhi/host/init.c
> > index f72fcb66f408..34a543a67068 100644
> > --- a/drivers/bus/mhi/host/init.c
> > +++ b/drivers/bus/mhi/host/init.c
> > @@ -759,8 +759,8 @@ static int parse_ch_cfg(struct mhi_controller
> > *mhi_cntrl,
> > * so to avoid any memory possible allocation failures, vzalloc is
> > * used here
> > */
> > - mhi_cntrl->mhi_chan = vzalloc(mhi_cntrl->max_chan *
> > - sizeof(*mhi_cntrl->mhi_chan));
> > + mhi_cntrl->mhi_chan = vzalloc(array_size(mhi_cntrl->max_chan,
> > + sizeof(*mhi_cntrl->mhi_chan)));
> > if (!mhi_cntrl->mhi_chan)
> > return -ENOMEM;
> >
> >
>
> This doesn't seem like a good fix.
>
> If we've overflowed the multiplication, I don't think we should continue, and
> the function should return an error. array_size() is going to return
> SIZE_MAX, and it looks like it is possible that vzalloc() may be able to
> allocate that successfully in some scenarios. However, that is going to be
> less memory than parse_ch_cfg() expected to allocate, so later on I expect the
> function will still corrupt memory - basically the same result as what the
> unchecked overflow would do.
>
> I'm not convinced the semantic patch is bringing value as I suspect most of
> the code being patched is in the same situation.

OK, this just brings the code in line with all the calls updated by Kees's
original patch, cited in the cover letter, which were all the
calls containing a multiplication that existed at the time.

42bc47b35320 ("treewide: Use array_size() in vmalloc()")
fad953ce0b22 ("treewide: Use array_size() in vzalloc()")

julia

>
> -Jeff
>

2023-06-23 22:24:47

by Jeffrey Hugo

[permalink] [raw]
Subject: Re: [PATCH 10/26] bus: mhi: host: use array_size

On 6/23/2023 3:45 PM, Julia Lawall wrote:
>
>
> On Fri, 23 Jun 2023, Jeffrey Hugo wrote:
>
>> On 6/23/2023 3:14 PM, Julia Lawall wrote:
>>> Use array_size to protect against multiplication overflows.
>>>
>>> The changes were done using the following Coccinelle semantic patch:
>>>
>>> // <smpl>
>>> @@
>>> expression E1, E2;
>>> constant C1, C2;
>>> identifier alloc = {vmalloc,vzalloc};
>>> @@
>>> (
>>> alloc(C1 * C2,...)
>>> |
>>> alloc(
>>> - (E1) * (E2)
>>> + array_size(E1, E2)
>>> ,...)
>>> )
>>> // </smpl>
>>>
>>> Signed-off-by: Julia Lawall <[email protected]>
>>>
>>> ---
>>> drivers/bus/mhi/host/init.c | 4 ++--
>>> 1 file changed, 2 insertions(+), 2 deletions(-)
>>>
>>> diff --git a/drivers/bus/mhi/host/init.c b/drivers/bus/mhi/host/init.c
>>> index f72fcb66f408..34a543a67068 100644
>>> --- a/drivers/bus/mhi/host/init.c
>>> +++ b/drivers/bus/mhi/host/init.c
>>> @@ -759,8 +759,8 @@ static int parse_ch_cfg(struct mhi_controller
>>> *mhi_cntrl,
>>> * so to avoid any memory possible allocation failures, vzalloc is
>>> * used here
>>> */
>>> - mhi_cntrl->mhi_chan = vzalloc(mhi_cntrl->max_chan *
>>> - sizeof(*mhi_cntrl->mhi_chan));
>>> + mhi_cntrl->mhi_chan = vzalloc(array_size(mhi_cntrl->max_chan,
>>> + sizeof(*mhi_cntrl->mhi_chan)));
>>> if (!mhi_cntrl->mhi_chan)
>>> return -ENOMEM;
>>>
>>>
>>
>> This doesn't seem like a good fix.
>>
>> If we've overflowed the multiplication, I don't think we should continue, and
>> the function should return an error. array_size() is going to return
>> SIZE_MAX, and it looks like it is possible that vzalloc() may be able to
>> allocate that successfully in some scenarios. However, that is going to be
>> less memory than parse_ch_cfg() expected to allocate, so later on I expect the
>> function will still corrupt memory - basically the same result as what the
>> unchecked overflow would do.
>>
>> I'm not convinced the semantic patch is bringing value as I suspect most of
>> the code being patched is in the same situation.
>
> OK, this just brings the code in line with all the calls updated by Kees's
> original patch, cited in the cover letter, which were all the
> calls containing a multiplication that existed at the time.
>
> 42bc47b35320 ("treewide: Use array_size() in vmalloc()")
> fad953ce0b22 ("treewide: Use array_size() in vzalloc()")

Eh. I "git show fad953ce0b22" and it doesn't really tell me much. The
commit asserts that uses of vzalloc() and multiplication need
array_size(), but doesn't really explain why.

This looks like a brute force automated update with no thought and I
fear the result of this change is the conclusion that we've solved
multiplication overflow, when it doesn't look like we've really done
much. Sure, the multiplication gets capped, but can the code actually
handle that?

I should probably run the numbers, but with the relevant spec capping
the number of channels at 256, I don't think we can realistically
approach overflow, even on a 32-bit system. However, having correct
code that is inherently safe seems like a good idea and so I feel this
function has an issue. I just don't think this automated conversion
meaningfully does anything to improve the code here.

Kees, would you please chime in and educate me here? I feel like I'm
missing something important here.

-Jeff

2023-06-24 00:03:54

by Kees Cook

[permalink] [raw]
Subject: Re: [PATCH 10/26] bus: mhi: host: use array_size

On Fri, Jun 23, 2023 at 04:09:46PM -0600, Jeffrey Hugo wrote:
> Kees, would you please chime in and educate me here? I feel like I'm
> missing something important here.

The array_size() family will saturate at SIZE_MAX (rather than potentially
wrapping around). No allocator can fulfil a 18446744073709551615 byte
(18 exabyte) allocation. :) So the NULL return value will (hopefully)
trigger an error path.

--
Kees Cook

2023-06-24 16:18:40

by Jeffrey Hugo

[permalink] [raw]
Subject: Re: [PATCH 10/26] bus: mhi: host: use array_size

On 6/23/2023 5:45 PM, Kees Cook wrote:
> On Fri, Jun 23, 2023 at 04:09:46PM -0600, Jeffrey Hugo wrote:
>> Kees, would you please chime in and educate me here? I feel like I'm
>> missing something important here.
>
> The array_size() family will saturate at SIZE_MAX (rather than potentially
> wrapping around). No allocator can fulfil a 18446744073709551615 byte
> (18 exabyte) allocation. :) So the NULL return value will (hopefully)
> trigger an error path.
>

Fair enough, that handles the 64-bit usecase. I'm guessing the
assumption is that on a 32-bit usecase where size_t is ~4GB, there won't
actually be 4GB to allocate and things will also fail. So far, so good.

What about a 32-bit system with something like ARM's LPAE (Large
Physical Address Extension) where the host is 32-bit, and so size_t
would be ~4GB (as far as I can tell) but phys_addr_t is larger than
that, and so we can have/access more than 4GB of resources? Lets see,
ignoring that its a 13 year old feature and probably not in circulation
anymore, probably still can't satisfy a 4GB allocation since you'd need
to map all of it to address it, and part of the address space is surely
reserved for other things.

Ok, I think I'm convinced. I'm going to sleep on it, but I suspect all
will still be good early next week.

Thank you for the explanation.

-Jeff

2023-06-26 12:20:03

by Dan Carpenter

[permalink] [raw]
Subject: Re: [PATCH 10/26] bus: mhi: host: use array_size

On Fri, Jun 23, 2023 at 03:30:36PM -0600, Jeffrey Hugo wrote:
> On 6/23/2023 3:14 PM, Julia Lawall wrote:
> > Use array_size to protect against multiplication overflows.
> >
> > The changes were done using the following Coccinelle semantic patch:
> >
> > // <smpl>
> > @@
> > expression E1, E2;
> > constant C1, C2;
> > identifier alloc = {vmalloc,vzalloc};
> > @@
> > (
> > alloc(C1 * C2,...)
> > |
> > alloc(
> > - (E1) * (E2)
> > + array_size(E1, E2)
> > ,...)
> > )
> > // </smpl>
> >
> > Signed-off-by: Julia Lawall <[email protected]>
> >
> > ---
> > drivers/bus/mhi/host/init.c | 4 ++--
> > 1 file changed, 2 insertions(+), 2 deletions(-)
> >
> > diff --git a/drivers/bus/mhi/host/init.c b/drivers/bus/mhi/host/init.c
> > index f72fcb66f408..34a543a67068 100644
> > --- a/drivers/bus/mhi/host/init.c
> > +++ b/drivers/bus/mhi/host/init.c
> > @@ -759,8 +759,8 @@ static int parse_ch_cfg(struct mhi_controller *mhi_cntrl,
> > * so to avoid any memory possible allocation failures, vzalloc is
> > * used here
> > */
> > - mhi_cntrl->mhi_chan = vzalloc(mhi_cntrl->max_chan *
> > - sizeof(*mhi_cntrl->mhi_chan));
> > + mhi_cntrl->mhi_chan = vzalloc(array_size(mhi_cntrl->max_chan,
> > + sizeof(*mhi_cntrl->mhi_chan)));
> > if (!mhi_cntrl->mhi_chan)
> > return -ENOMEM;
> >
> >
>
> This doesn't seem like a good fix.
>
> If we've overflowed the multiplication, I don't think we should continue,
> and the function should return an error. array_size() is going to return
> SIZE_MAX, and it looks like it is possible that vzalloc() may be able to
> allocate that successfully in some scenarios.

Nope. You can never allocate more that size_t because that's the
highest number that the kernel allocation functions can accept.

Obviously on 64bit size_t is unbelievably large. If I remember right,
on 32bit you didn't used to be able to allocate more than 2GB without
doing all sorts of tricks. And everyone deleted those tricks when 64bit
machines became super common.

regards,
dan carpenter


2023-06-26 15:31:57

by Jeffrey Hugo

[permalink] [raw]
Subject: Re: [PATCH 10/26] bus: mhi: host: use array_size

On 6/23/2023 3:14 PM, Julia Lawall wrote:
> Use array_size to protect against multiplication overflows.
>
> The changes were done using the following Coccinelle semantic patch:
>
> // <smpl>
> @@
> expression E1, E2;
> constant C1, C2;
> identifier alloc = {vmalloc,vzalloc};
> @@
>
> (
> alloc(C1 * C2,...)
> |
> alloc(
> - (E1) * (E2)
> + array_size(E1, E2)
> ,...)
> )
> // </smpl>
>
> Signed-off-by: Julia Lawall <[email protected]>
>

Reviewed-by: Jeffrey Hugo <[email protected]>
Tested-by: Jeffrey Hugo <[email protected]>