Consider network topology as follows:
eth0 eth1
|_____|
|
bond0 --- br0
|
vlan0 --- br1
bond0 serves for both br0 and vlan0, if a vlan tagged packet was sent
to br1 through bond0, bridge handling code is seeing the packet on bond0
and handing it off to my "legacy" bridge before vlan_tx_tag_present
and vlan_hwaccel_do_receive even haven't a chance to look at it.
Moving the vlan_tx_tag_present before bridge/macvlan handling code could
cure this.
Signed-off-by: Xiaotian Feng <[email protected]>
Cc: "David S. Miller" <[email protected]>
Cc: Eric Dumazet <[email protected]>
Cc: Tom Herbert <[email protected]>
---
net/core/dev.c | 20 ++++++++++----------
1 files changed, 10 insertions(+), 10 deletions(-)
diff --git a/net/core/dev.c b/net/core/dev.c
index 8ae6631..d2d12c2 100644
--- a/net/core/dev.c
+++ b/net/core/dev.c
@@ -3079,27 +3079,27 @@ static int __netif_receive_skb(struct sk_buff *skb)
ncls:
#endif
- /* Handle special case of bridge or macvlan */
- rx_handler = rcu_dereference(skb->dev->rx_handler);
- if (rx_handler) {
+ if (vlan_tx_tag_present(skb)) {
if (pt_prev) {
ret = deliver_skb(skb, pt_prev, orig_dev);
pt_prev = NULL;
}
- skb = rx_handler(skb);
- if (!skb)
+ if (vlan_hwaccel_do_receive(&skb)) {
+ ret = __netif_receive_skb(skb);
+ goto out;
+ } else if (unlikely(!skb))
goto out;
}
- if (vlan_tx_tag_present(skb)) {
+ /* Handle special case of bridge or macvlan */
+ rx_handler = rcu_dereference(skb->dev->rx_handler);
+ if (rx_handler) {
if (pt_prev) {
ret = deliver_skb(skb, pt_prev, orig_dev);
pt_prev = NULL;
}
- if (vlan_hwaccel_do_receive(&skb)) {
- ret = __netif_receive_skb(skb);
- goto out;
- } else if (unlikely(!skb))
+ skb = rx_handler(skb);
+ if (!skb)
goto out;
}
--
1.7.1
On Thu, 2011-03-03 at 18:55 +0800, Xiaotian Feng wrote:
> Consider network topology as follows:
>
> eth0 eth1
> |_____|
> |
> bond0 --- br0
> |
> vlan0 --- br1
>
> bond0 serves for both br0 and vlan0, if a vlan tagged packet was sent
> to br1 through bond0, bridge handling code is seeing the packet on bond0
> and handing it off to my "legacy" bridge before vlan_tx_tag_present
> and vlan_hwaccel_do_receive even haven't a chance to look at it.
[...]
This used to work if the underlying device (bond0 in your example)
implemented VLAN tag extraction, because the VLAN group would be checked
before the bridge. But it never worked for devices without VLAN tag
extraction. Perhaps we should just prevent this configuration.
Ben.
--
Ben Hutchings, Senior Software Engineer, Solarflare Communications
Not speaking for my employer; that's the marketing department's job.
They asked us to note that Solarflare product names are trademarked.
Le 03/03/2011 14:53, Ben Hutchings a écrit :
> On Thu, 2011-03-03 at 18:55 +0800, Xiaotian Feng wrote:
>> Consider network topology as follows:
>>
>> eth0 eth1
>> |_____|
>> |
>> bond0 --- br0
>> |
>> vlan0 --- br1
>>
>> bond0 serves for both br0 and vlan0, if a vlan tagged packet was sent
>> to br1 through bond0, bridge handling code is seeing the packet on bond0
>> and handing it off to my "legacy" bridge before vlan_tx_tag_present
>> and vlan_hwaccel_do_receive even haven't a chance to look at it.
> [...]
>
> This used to work if the underlying device (bond0 in your example)
> implemented VLAN tag extraction, because the VLAN group would be checked
> before the bridge. But it never worked for devices without VLAN tag
> extraction. Perhaps we should just prevent this configuration.
If Jiri Pirko eventually move vlan processing to rx_handler, then the setup won't be possible,
because bond0 would require two rx_handlers : one for bridge (-> br0) and one for vlan (-> vlan0).
Or we need several rx_handlers per net_device, if the above setup is a real one.
Nicolas.
Thu, Mar 03, 2011 at 11:55:13AM CET, [email protected] wrote:
>Consider network topology as follows:
>
>eth0 eth1
> |_____|
> |
> bond0 --- br0
> |
> vlan0 --- br1
>
>bond0 serves for both br0 and vlan0, if a vlan tagged packet was sent
>to br1 through bond0, bridge handling code is seeing the packet on bond0
>and handing it off to my "legacy" bridge before vlan_tx_tag_present
>and vlan_hwaccel_do_receive even haven't a chance to look at it.
>
>Moving the vlan_tx_tag_present before bridge/macvlan handling code could
>cure this.
Wouldn't this break "eth0 - br0 - br0.5"?
>
>Signed-off-by: Xiaotian Feng <[email protected]>
>Cc: "David S. Miller" <[email protected]>
>Cc: Eric Dumazet <[email protected]>
>Cc: Tom Herbert <[email protected]>
>---
> net/core/dev.c | 20 ++++++++++----------
> 1 files changed, 10 insertions(+), 10 deletions(-)
>
>diff --git a/net/core/dev.c b/net/core/dev.c
>index 8ae6631..d2d12c2 100644
>--- a/net/core/dev.c
>+++ b/net/core/dev.c
>@@ -3079,27 +3079,27 @@ static int __netif_receive_skb(struct sk_buff *skb)
> ncls:
> #endif
>
>- /* Handle special case of bridge or macvlan */
>- rx_handler = rcu_dereference(skb->dev->rx_handler);
>- if (rx_handler) {
>+ if (vlan_tx_tag_present(skb)) {
> if (pt_prev) {
> ret = deliver_skb(skb, pt_prev, orig_dev);
> pt_prev = NULL;
> }
>- skb = rx_handler(skb);
>- if (!skb)
>+ if (vlan_hwaccel_do_receive(&skb)) {
>+ ret = __netif_receive_skb(skb);
>+ goto out;
>+ } else if (unlikely(!skb))
> goto out;
> }
>
>- if (vlan_tx_tag_present(skb)) {
>+ /* Handle special case of bridge or macvlan */
>+ rx_handler = rcu_dereference(skb->dev->rx_handler);
>+ if (rx_handler) {
> if (pt_prev) {
> ret = deliver_skb(skb, pt_prev, orig_dev);
> pt_prev = NULL;
> }
>- if (vlan_hwaccel_do_receive(&skb)) {
>- ret = __netif_receive_skb(skb);
>- goto out;
>- } else if (unlikely(!skb))
>+ skb = rx_handler(skb);
>+ if (!skb)
> goto out;
> }
>
>--
>1.7.1
>
>--
>To unsubscribe from this list: send the line "unsubscribe netdev" in
>the body of a message to [email protected]
>More majordomo info at http://vger.kernel.org/majordomo-info.html
Sat, Mar 05, 2011 at 02:53:47PM CET, [email protected] wrote:
>Le 05/03/2011 11:36, Jiri Pirko a ?crit :
>>Thu, Mar 03, 2011 at 11:55:13AM CET, [email protected] wrote:
>>>Consider network topology as follows:
>>>
>>>eth0 eth1
>>>|_____|
>>> |
>>> bond0 --- br0
>>> |
>>> vlan0 --- br1
>>>
>>>bond0 serves for both br0 and vlan0, if a vlan tagged packet was sent
>>>to br1 through bond0, bridge handling code is seeing the packet on bond0
>>>and handing it off to my "legacy" bridge before vlan_tx_tag_present
>>>and vlan_hwaccel_do_receive even haven't a chance to look at it.
>>>
>>>Moving the vlan_tx_tag_present before bridge/macvlan handling code could
>>>cure this.
>>
>>Wouldn't this break "eth0 - br0 - br0.5"?
>
>I think it would. One more reason to build a single interface stacking framework...
I plan to deal with vlans later. This kind of topology would not be
possible after that :)
>
> Nicolas.
Le 05/03/2011 11:36, Jiri Pirko a ?crit :
> Thu, Mar 03, 2011 at 11:55:13AM CET, [email protected] wrote:
>> Consider network topology as follows:
>>
>> eth0 eth1
>> |_____|
>> |
>> bond0 --- br0
>> |
>> vlan0 --- br1
>>
>> bond0 serves for both br0 and vlan0, if a vlan tagged packet was sent
>> to br1 through bond0, bridge handling code is seeing the packet on bond0
>> and handing it off to my "legacy" bridge before vlan_tx_tag_present
>> and vlan_hwaccel_do_receive even haven't a chance to look at it.
>>
>> Moving the vlan_tx_tag_present before bridge/macvlan handling code could
>> cure this.
>
> Wouldn't this break "eth0 - br0 - br0.5"?
I think it would. One more reason to build a single interface stacking framework...
Nicolas.