Hi all,
I am running 11.1-RELEASE-p1 on a server with 4 'oce' interfaces - oce0, oce1, oce2, oce3.
For that setup only oce3 is connected and used - oce2 will come later.
No firewall is enabled.
When bonding interfaces, the configuration on the switch is modified - bottom line, the problem does not seem to be related to lagg but more to bridge.
I started my troubleshooting journey with Scenario D being the target configuration and then try to isolate the problem by validating the other scenarios below.
Scenario A:
oce3 only + IP address on oce3 - connectivity: OK
Scenario B:
[oce3]-lagg0 + IP address on lagg0 - connectivity: OK
Scenario C:
oce3 only in a bridge0 + IP address on bridge0 - connectivity: NOK (ARP of the server is seen on the upstream router however FreeBSD sees the MAC address of the upstream router in the bridge MAC address table but won't have any ARP entry for the upstream router. Ping not successful...)
Scenario D:
[oce3]-lagg0 in a bridge0 + IP address on bridge0 - connectivity: NOK (ARP of the server is seen on the upstream router however FreeBSD sees the MAC address of the upstream router in the bridge MAC address table but won't have any ARP entry for the upstream router. Ping not successful...)
So far I have tried the following to try to identify what is wrong with both the scenarios C & D:
- tcpdump on bridge0 or oce3 shows nothing at all related to a ping issued from either router or server_bridge0_ip
- disabled TSO, LRO etc... on the oce3 interface -> no change
- change all the sysctl for net.link.bridge.* to 0 to prevent any unexpected filtering
- enable IP forwarding although it should not make any difference as my first ping is towards the router in the same connected network so no routing involved.
I have tried using an Ubuntu liveCD on top of which I installed the bridge-utils components and Scenario C worked. I did not push for Scenario D but it looks to me like the if_bridge has a bit of any issue.
What else should I consider to troubleshoot this issue?
I am running 11.1-RELEASE-p1 on a server with 4 'oce' interfaces - oce0, oce1, oce2, oce3.
For that setup only oce3 is connected and used - oce2 will come later.
No firewall is enabled.
When bonding interfaces, the configuration on the switch is modified - bottom line, the problem does not seem to be related to lagg but more to bridge.
I started my troubleshooting journey with Scenario D being the target configuration and then try to isolate the problem by validating the other scenarios below.
Scenario A:
oce3 only + IP address on oce3 - connectivity: OK
Scenario B:
[oce3]-lagg0 + IP address on lagg0 - connectivity: OK
Scenario C:
oce3 only in a bridge0 + IP address on bridge0 - connectivity: NOK (ARP of the server is seen on the upstream router however FreeBSD sees the MAC address of the upstream router in the bridge MAC address table but won't have any ARP entry for the upstream router. Ping not successful...)
Scenario D:
[oce3]-lagg0 in a bridge0 + IP address on bridge0 - connectivity: NOK (ARP of the server is seen on the upstream router however FreeBSD sees the MAC address of the upstream router in the bridge MAC address table but won't have any ARP entry for the upstream router. Ping not successful...)
So far I have tried the following to try to identify what is wrong with both the scenarios C & D:
- tcpdump on bridge0 or oce3 shows nothing at all related to a ping issued from either router or server_bridge0_ip
- disabled TSO, LRO etc... on the oce3 interface -> no change
- change all the sysctl for net.link.bridge.* to 0 to prevent any unexpected filtering
- enable IP forwarding although it should not make any difference as my first ping is towards the router in the same connected network so no routing involved.
I have tried using an Ubuntu liveCD on top of which I installed the bridge-utils components and Scenario C worked. I did not push for Scenario D but it looks to me like the if_bridge has a bit of any issue.
What else should I consider to troubleshoot this issue?