Redundancy

The duplication of critical system components to increase reliability and eliminate single points of failure.

Redundancy is the practice of duplicating critical components, paths, or systems so that if one fails, another can take its place without service interruption. It is a fundamental principle of high-availability architecture and is applied at every layer: network links, servers, databases, data centers, and even geographic regions.

Types of redundancy include hardware redundancy (multiple servers, RAID storage), network redundancy (multiple ISPs, diverse routing), geographic redundancy (multiple data centers or cloud regions), and data redundancy (replicated databases, distributed storage). The level of redundancy should be proportional to the criticality of the service and the cost of downtime.

Redundancy increases availability but also increases cost and complexity. Each redundant component needs to be monitored, maintained, and tested regularly to ensure it will actually work when needed. A backup system that hasn't been tested may fail when called upon. Monitoring all components — including standby systems — with tools like Hyperping ensures your redundancy actually delivers the reliability you expect.

Hyperping monitoring dashboard

Related Terms

Failover
The automatic switching to a backup system when the primary system fails, ensuring service continuit...
Availability
The proportion of time a system is functional and accessible, often expressed as a percentage.
Five Nines (99.999% Uptime)
A reliability standard allowing no more than 5 minutes and 15 seconds of downtime per year.
Load Balancing
The distribution of incoming network traffic across multiple servers to ensure no single server is o...

Related Resources

Get started

Start monitoring in the next 5 minutes.

Stop letting customers discover your outages first. Set up monitoring, status pages, on-call, and alerts before your next coffee break.

14 days free trial — No card required