In ice_rebuild(), check whether AUX devices/drivers are supported or not
before calling ice_plug_aux_dev().
Fix the following call trace, if RDMA functionality is not available.
auxiliary ice.roce.0: adding auxiliary device failed!: -17
sysfs: cannot create duplicate filename '/bus/auxiliary/devices/ice.roce.0'
Workqueue: ice ice_service_task [ice]
Call Trace:
dump_stack_lvl+0x38/0x49
dump_stack+0x10/0x12
sysfs_warn_dup+0x5b/0x70
sysfs_do_create_link_sd.isra.2+0xc8/0xd0
sysfs_create_link+0x25/0x40
bus_add_device+0x6d/0x110
device_add+0x49d/0x940
? _printk+0x52/0x6e
? _printk+0x52/0x6e
__auxiliary_device_add+0x60/0xc0
ice_plug_aux_dev+0xd3/0xf0 [ice]
ice_rebuild+0x27d/0x510 [ice]
ice_do_reset+0x51/0xe0 [ice]
ice_service_task+0x108/0xe70 [ice]
? __switch_to+0x13b/0x510
process_one_work+0x1de/0x420
? apply_wqattrs_cleanup+0xc0/0xc0
worker_thread+0x34/0x400
? apply_wqattrs_cleanup+0xc0/0xc0
kthread+0x14d/0x180
? set_kthread_struct+0x40/0x40
ret_from_fork+0x1f/0x30
Fixes: f9f5301e7e2d ("ice: Register auxiliary device to provide RDMA")
Signed-off-by: Yongxin Liu <[email protected]>
---
drivers/net/ethernet/intel/ice/ice_main.c | 4 +++-
1 file changed, 3 insertions(+), 1 deletion(-)
diff --git a/drivers/net/ethernet/intel/ice/ice_main.c b/drivers/net/ethernet/intel/ice/ice_main.c
index 0d6c143f6653..98cc708e9517 100644
--- a/drivers/net/ethernet/intel/ice/ice_main.c
+++ b/drivers/net/ethernet/intel/ice/ice_main.c
@@ -6466,7 +6466,9 @@ static void ice_rebuild(struct ice_pf *pf, enum ice_reset_req reset_type)
/* if we get here, reset flow is successful */
clear_bit(ICE_RESET_FAILED, pf->state);
- ice_plug_aux_dev(pf);
+ if (ice_is_aux_ena(pf))
+ ice_plug_aux_dev(pf);
+
return;
err_vsi_rebuild:
--
2.14.5
On Fri, Sep 03, 2021 at 09:25:00AM +0800, Yongxin Liu wrote:
> In ice_rebuild(), check whether AUX devices/drivers are supported or not
> before calling ice_plug_aux_dev().
>
> Fix the following call trace, if RDMA functionality is not available.
>
> auxiliary ice.roce.0: adding auxiliary device failed!: -17
> sysfs: cannot create duplicate filename '/bus/auxiliary/devices/ice.roce.0'
> Workqueue: ice ice_service_task [ice]
> Call Trace:
> dump_stack_lvl+0x38/0x49
> dump_stack+0x10/0x12
> sysfs_warn_dup+0x5b/0x70
> sysfs_do_create_link_sd.isra.2+0xc8/0xd0
> sysfs_create_link+0x25/0x40
> bus_add_device+0x6d/0x110
> device_add+0x49d/0x940
> ? _printk+0x52/0x6e
> ? _printk+0x52/0x6e
> __auxiliary_device_add+0x60/0xc0
> ice_plug_aux_dev+0xd3/0xf0 [ice]
> ice_rebuild+0x27d/0x510 [ice]
> ice_do_reset+0x51/0xe0 [ice]
> ice_service_task+0x108/0xe70 [ice]
> ? __switch_to+0x13b/0x510
> process_one_work+0x1de/0x420
> ? apply_wqattrs_cleanup+0xc0/0xc0
> worker_thread+0x34/0x400
> ? apply_wqattrs_cleanup+0xc0/0xc0
> kthread+0x14d/0x180
> ? set_kthread_struct+0x40/0x40
> ret_from_fork+0x1f/0x30
>
> Fixes: f9f5301e7e2d ("ice: Register auxiliary device to provide RDMA")
> Signed-off-by: Yongxin Liu <[email protected]>
> ---
> drivers/net/ethernet/intel/ice/ice_main.c | 4 +++-
> 1 file changed, 3 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/net/ethernet/intel/ice/ice_main.c b/drivers/net/ethernet/intel/ice/ice_main.c
> index 0d6c143f6653..98cc708e9517 100644
> --- a/drivers/net/ethernet/intel/ice/ice_main.c
> +++ b/drivers/net/ethernet/intel/ice/ice_main.c
> @@ -6466,7 +6466,9 @@ static void ice_rebuild(struct ice_pf *pf, enum ice_reset_req reset_type)
> /* if we get here, reset flow is successful */
> clear_bit(ICE_RESET_FAILED, pf->state);
>
> - ice_plug_aux_dev(pf);
> + if (ice_is_aux_ena(pf))
> + ice_plug_aux_dev(pf);
> +
The change is ok, but it hints that auxiliary bus is used horribly wrong
in this driver. In proper implementation, which should rely on driver/core,
every subdriver like ice.eth, ice.roce e.t.c is supposed to be retriggered
by the code and shouldn't ave "if (ice_is_aux_ena(pf))" checks.
Thanks
> return;
>
> err_vsi_rebuild:
> --
> 2.14.5
>
> -----Original Message-----
> From: Leon Romanovsky <[email protected]>
> Sent: Sunday, September 5, 2021 12:24 AM
> To: Yongxin Liu <[email protected]>
> Cc: Ertman, David M <[email protected]>; Saleem, Shiraz
> <[email protected]>; Nguyen, Anthony L
> <[email protected]>; [email protected]; linux-
> [email protected]; [email protected]; Brandeburg, Jesse
> <[email protected]>; [email protected];
> [email protected]
> Subject: Re: [PATCH net] ice: check whether AUX devices/drivers are
> supported in ice_rebuild
>
> On Fri, Sep 03, 2021 at 09:25:00AM +0800, Yongxin Liu wrote:
> > In ice_rebuild(), check whether AUX devices/drivers are supported or not
> > before calling ice_plug_aux_dev().
> >
> > Fix the following call trace, if RDMA functionality is not available.
> >
> > auxiliary ice.roce.0: adding auxiliary device failed!: -17
> > sysfs: cannot create duplicate filename '/bus/auxiliary/devices/ice.roce.0'
> > Workqueue: ice ice_service_task [ice]
> > Call Trace:
> > dump_stack_lvl+0x38/0x49
> > dump_stack+0x10/0x12
> > sysfs_warn_dup+0x5b/0x70
> > sysfs_do_create_link_sd.isra.2+0xc8/0xd0
> > sysfs_create_link+0x25/0x40
> > bus_add_device+0x6d/0x110
> > device_add+0x49d/0x940
> > ? _printk+0x52/0x6e
> > ? _printk+0x52/0x6e
> > __auxiliary_device_add+0x60/0xc0
> > ice_plug_aux_dev+0xd3/0xf0 [ice]
> > ice_rebuild+0x27d/0x510 [ice]
> > ice_do_reset+0x51/0xe0 [ice]
> > ice_service_task+0x108/0xe70 [ice]
> > ? __switch_to+0x13b/0x510
> > process_one_work+0x1de/0x420
> > ? apply_wqattrs_cleanup+0xc0/0xc0
> > worker_thread+0x34/0x400
> > ? apply_wqattrs_cleanup+0xc0/0xc0
> > kthread+0x14d/0x180
> > ? set_kthread_struct+0x40/0x40
> > ret_from_fork+0x1f/0x30
> >
> > Fixes: f9f5301e7e2d ("ice: Register auxiliary device to provide RDMA")
> > Signed-off-by: Yongxin Liu <[email protected]>
> > ---
> > drivers/net/ethernet/intel/ice/ice_main.c | 4 +++-
> > 1 file changed, 3 insertions(+), 1 deletion(-)
> >
> > diff --git a/drivers/net/ethernet/intel/ice/ice_main.c
> b/drivers/net/ethernet/intel/ice/ice_main.c
> > index 0d6c143f6653..98cc708e9517 100644
> > --- a/drivers/net/ethernet/intel/ice/ice_main.c
> > +++ b/drivers/net/ethernet/intel/ice/ice_main.c
> > @@ -6466,7 +6466,9 @@ static void ice_rebuild(struct ice_pf *pf, enum
> ice_reset_req reset_type)
> > /* if we get here, reset flow is successful */
> > clear_bit(ICE_RESET_FAILED, pf->state);
> >
> > - ice_plug_aux_dev(pf);
> > + if (ice_is_aux_ena(pf))
> > + ice_plug_aux_dev(pf);
> > +
>
> The change is ok, but it hints that auxiliary bus is used horribly wrong
> in this driver. In proper implementation, which should rely on driver/core,
> every subdriver like ice.eth, ice.roce e.t.c is supposed to be retriggered
> by the code and shouldn't ave "if (ice_is_aux_ena(pf))" checks.
>
> Thanks
Hi Leon and Liu -
First of all, thanks Liu for tracking this down - it is an issue that needs to be fixed. The ice_is_aux_ena() functions only
purpose is to determine if this PF supports RDMA functionality. In probe(), the aux devices are not even initialized if
this test returns false. If this check fixed the issue for you, the PF you are on does not currently support RDMA.
The bit this test is based on is only set one place in the driver currently - at probe time when we are checking the
capabilities (common_caps) of the device.
That being said, the call to check this should be in ice_plug_aux_dev function itself. That way it is taken into account for all
attempts to create the auxiliary device. There is another consideration about disabling RDMA that needs to also needs to be
taken into account to avoid a similar situation.
If it is acceptable, I will create a new patch today and get it out either this afternoon or tomorrow.
Thanks,
Dave E
>
> > return;
> >
> > err_vsi_rebuild:
> > --
> > 2.14.5
> >
> -----Original Message-----
> From: Ertman, David M <[email protected]>
> Sent: Thursday, September 9, 2021 1:31 AM
> To: Leon Romanovsky <[email protected]>; Liu, Yongxin
> <[email protected]>
> Cc: Saleem, Shiraz <[email protected]>; Nguyen, Anthony L
> <[email protected]>; [email protected]; linux-
> [email protected]; [email protected]; Brandeburg, Jesse
> <[email protected]>; [email protected];
> [email protected]
> Subject: RE: [PATCH net] ice: check whether AUX devices/drivers are supported
> in ice_rebuild
>
> > -----Original Message-----
> > From: Leon Romanovsky <[email protected]>
> > Sent: Sunday, September 5, 2021 12:24 AM
> > To: Yongxin Liu <[email protected]>
> > Cc: Ertman, David M <[email protected]>; Saleem, Shiraz
> > <[email protected]>; Nguyen, Anthony L
> > <[email protected]>; [email protected]; linux-
> > [email protected]; [email protected]; Brandeburg, Jesse
> > <[email protected]>; [email protected];
> > [email protected]
> > Subject: Re: [PATCH net] ice: check whether AUX devices/drivers are
> > supported in ice_rebuild
> >
> > On Fri, Sep 03, 2021 at 09:25:00AM +0800, Yongxin Liu wrote:
> > > In ice_rebuild(), check whether AUX devices/drivers are supported or
> > > not before calling ice_plug_aux_dev().
> > >
> > > Fix the following call trace, if RDMA functionality is not available.
> > >
> > > auxiliary ice.roce.0: adding auxiliary device failed!: -17
> > > sysfs: cannot create duplicate filename
> '/bus/auxiliary/devices/ice.roce.0'
> > > Workqueue: ice ice_service_task [ice]
> > > Call Trace:
> > > dump_stack_lvl+0x38/0x49
> > > dump_stack+0x10/0x12
> > > sysfs_warn_dup+0x5b/0x70
> > > sysfs_do_create_link_sd.isra.2+0xc8/0xd0
> > > sysfs_create_link+0x25/0x40
> > > bus_add_device+0x6d/0x110
> > > device_add+0x49d/0x940
> > > ? _printk+0x52/0x6e
> > > ? _printk+0x52/0x6e
> > > __auxiliary_device_add+0x60/0xc0
> > > ice_plug_aux_dev+0xd3/0xf0 [ice]
> > > ice_rebuild+0x27d/0x510 [ice]
> > > ice_do_reset+0x51/0xe0 [ice]
> > > ice_service_task+0x108/0xe70 [ice]
> > > ? __switch_to+0x13b/0x510
> > > process_one_work+0x1de/0x420
> > > ? apply_wqattrs_cleanup+0xc0/0xc0
> > > worker_thread+0x34/0x400
> > > ? apply_wqattrs_cleanup+0xc0/0xc0
> > > kthread+0x14d/0x180
> > > ? set_kthread_struct+0x40/0x40
> > > ret_from_fork+0x1f/0x30
> > >
> > > Fixes: f9f5301e7e2d ("ice: Register auxiliary device to provide
> > > RDMA")
> > > Signed-off-by: Yongxin Liu <[email protected]>
> > > ---
> > > drivers/net/ethernet/intel/ice/ice_main.c | 4 +++-
> > > 1 file changed, 3 insertions(+), 1 deletion(-)
> > >
> > > diff --git a/drivers/net/ethernet/intel/ice/ice_main.c
> > b/drivers/net/ethernet/intel/ice/ice_main.c
> > > index 0d6c143f6653..98cc708e9517 100644
> > > --- a/drivers/net/ethernet/intel/ice/ice_main.c
> > > +++ b/drivers/net/ethernet/intel/ice/ice_main.c
> > > @@ -6466,7 +6466,9 @@ static void ice_rebuild(struct ice_pf *pf,
> > > enum
> > ice_reset_req reset_type)
> > > /* if we get here, reset flow is successful */
> > > clear_bit(ICE_RESET_FAILED, pf->state);
> > >
> > > - ice_plug_aux_dev(pf);
> > > + if (ice_is_aux_ena(pf))
> > > + ice_plug_aux_dev(pf);
> > > +
> >
> > The change is ok, but it hints that auxiliary bus is used horribly
> > wrong in this driver. In proper implementation, which should rely on
> > driver/core, every subdriver like ice.eth, ice.roce e.t.c is supposed
> > to be retriggered by the code and shouldn't ave "if (ice_is_aux_ena(pf))"
> checks.
> >
> > Thanks
>
> Hi Leon and Liu -
>
> First of all, thanks Liu for tracking this down - it is an issue that needs
> to be fixed. The ice_is_aux_ena() functions only purpose is to determine if
> this PF supports RDMA functionality. In probe(), the aux devices are not
> even initialized if this test returns false. If this check fixed the issue
> for you, the PF you are on does not currently support RDMA.
> The bit this test is based on is only set one place in the driver currently -
> at probe time when we are checking the capabilities (common_caps) of the
> device.
>
> That being said, the call to check this should be in ice_plug_aux_dev
> function itself. That way it is taken into account for all attempts to
> create the auxiliary device. There is another consideration about disabling
> RDMA that needs to also needs to be taken into account to avoid a similar
> situation.
>
> If it is acceptable, I will create a new patch today and get it out either
> this afternoon or tomorrow.
Thanks Leon and Dave E for your quick response and input.
Please go ahead and I am expecting this issue to be fixed in a more reasonable way.
Thanks,
Yongxin
>
> Thanks,
> Dave E
>
> >
> > > return;
> > >
> > > err_vsi_rebuild:
> > > --
> > > 2.14.5
> > >