Troubleshooting

Packet Loss Detection and Troubleshooting: A Practical Guide

Packet loss degrades network performance silently. Learn how to detect, measure, and eliminate packet loss before it impacts your users.

What Is Packet Loss?

Packet loss occurs when data packets traveling across a network fail to reach their destination. Unlike latency (slow delivery), packet loss means the data never arrives at all. The sending system must detect the loss and retransmit, adding delays and consuming bandwidth.

Impact by Application

VoIP/Video: Choppy audio, frozen video, dropped calls. Even 1% loss is noticeable. Web/Email: Slower page loads, timeout errors. File transfers: Reduced throughput, failed uploads.

Common Causes of Packet Loss

Network Congestion

When traffic exceeds link capacity, routers and switches drop packets. This is the most common cause and typically affects multiple users simultaneously.

Hardware Failures

Failing NICs, bad cables, or overheating switches cause intermittent packet drops. Often affects specific ports or devices.

Software Bugs

Buggy firmware, driver issues, or misconfigured QoS policies can silently drop packets. May appear after updates.

Physical Layer Issues

Damaged cables, loose connections, electromagnetic interference, or fiber optic degradation. Often shows up as CRC errors.

Wireless Interference

Wi-Fi is inherently lossy. Channel congestion, signal weakness, and interference from other devices cause packet loss.

Detecting Packet Loss

Multiple methods help identify packet loss at different layers:

# Basic ping test - watch for packet loss %
ping -c 100 192.168.1.1
--- 192.168.1.1 ping statistics ---
100 packets transmitted, 98 received, 2% packet loss

# Extended ping with timestamps
ping -c 1000 -i 0.2 target.example.com

# MTR - combines ping and traceroute
mtr --report --report-cycles 100 target.example.com
Method What It Shows Limitation
ICMP PingEnd-to-end loss percentageICMP may be deprioritized
MTR/TracerouteLoss at each hopMPLS paths may hide hops
SNMP Interface StatsInput/output discards, errorsDevice-level only
Flow AnalysisApplication-level lossRequires flow collectors

SNMP Counters for Packet Loss

Interface MIB counters reveal where packets are being dropped:

CounterOIDMeaning
ifInDiscards1.3.6.1.2.1.2.2.1.13Inbound packets discarded (buffer full)
ifOutDiscards1.3.6.1.2.1.2.2.1.19Outbound packets discarded (congestion)
ifInErrors1.3.6.1.2.1.2.2.1.14Inbound errors (CRC, frame errors)
ifOutErrors1.3.6.1.2.1.2.2.1.20Outbound errors (collisions, carrier)

Tip: These are cumulative counters. Calculate the delta between polls and compare to total packets to get error rates. A few errors per million packets is normal; thousands per second indicates a problem.

Acceptable Packet Loss Thresholds

How much is too much depends on your applications:

Application Acceptable Degraded Critical
VoIP<1%1-3%>3%
Video streaming<0.5%0.5-2%>2%
Web traffic<2%2-5%>5%
File transfer<5%5-10%>10%

Troubleshooting Workflow

Systematic approach to finding the source of packet loss:

  • 1.Confirm the problem: Run extended ping tests (1000+ packets) to establish baseline loss rate.
  • 2.Isolate the path: Use MTR or traceroute to find which hop shows loss. Remember that some hops rate-limit ICMP responses.
  • 3.Check interface counters: Look for discards and errors on devices along the path. Rising error counts indicate hardware or cabling issues.
  • 4.Examine utilization: Is the link congested? Compare traffic levels to interface capacity.
  • 5.Check for patterns: Does loss correlate with time of day, specific traffic types, or certain source/dest pairs?

Fixing Common Causes

For Congestion

Add bandwidth, implement QoS to prioritize critical traffic, or optimize traffic patterns. Consider traffic shaping to prevent bursts.

For Hardware Issues

Replace cables, reseat connections, swap suspect NICs. For switches, try different ports. Check for overheating.

For Software Issues

Update firmware and drivers. Review recent config changes. Check buffer settings and QoS policies for misconfigurations.

For Wireless

Change channels to avoid interference. Upgrade to 5GHz or Wi-Fi 6. Add access points to improve coverage. Reduce client density.