r/Network • u/Primary-Hyena-9124 • 14d ago
[Ruckus/Brocade ICX 7150] Two specific VLANs frozen, receiving "tx_dv_threshold_reached" errors (CPU at 1%, No L2 Loops)
Hello everyone,
I'm running into a recurring, critical issue with a Ruckus/Brocade ICX 7150 in a production environment. Traffic on two specific VLANs suddenly stops working, while the rest of the data VLANs continue to pass traffic perfectly fine.
I don't have an active support contract for this box, so I'm hoping to get some insights here.
Symptoms & Behaviors:
- The console gets flooded with the following log (counters increasing rapidly by thousands):
Error: fdry2bcm_tx_one() failed to tx pkt of len 64 ppcr 0 err -14 tx_dv_threshold_reached 476180 - Crucial Detail (The Weird Part): On the affected VLANs, ARP resolution works perfectly fine. I can see MAC addresses populating. However, ICMP Pings and all other actual traffic in two VLANS completely fail.
- Recurrence: This issue happens roughly once a week.
- Workaround: Rebooting the switch temporarily resolves the issue completely, until it freezes up again a week later.
Troubleshooting done so far:
- No L2 Loop: Verified there are no topology changes (no TC logs), no MAC flapping, and no abnormal broadcast/multicast spikes on edge ports. Drop counters on interfaces are at 0.
- CPU usage is only 1%. (So it's not a control-plane policing issue or broadcast storm).
Given that ARP survives but Ping fails, CPU is at 1%, and a reboot fixes it.
- Has anyone encountered this specific
tx_dv_threshold_reachederror on an ICX 7150 where ARP works but Ping drops? - Is this a known stuck queue/memory leak bug? If so, which FastIron version fixes it?
Thank you for reading
1
Upvotes