neteng.life · live

System Status

All Systems Operational

DNS Resolution

It's always this.

Degraded

BGP Sessions

Someone touched a route map on a Friday.

Partial Outage

OSPF Convergence

Reconverging. Has been since Tuesday.

Degraded

Spanning Tree

There is a loop. There is always a loop.

Major Outage

SNMP Polling

Timeouts. Root cause assumed to be DNS.

Degraded

Change Management

Maintenance window skipped. Again.

Degraded

Network Documentation

Was never operational.

Major Outage

On-Call Engineer

Responding. It is 3am.

Degraded

Vendor TAC

Case open since 2023. Awaiting callback.

Partial Outage

Coffee Supply

Down since the incident started. Situation worsening.

Critical

Incident Timeline

now · Incident ongoing. DNS blamed. Investigation continuing.
14 min ago · BGP session reset. Cause: unknown. Blamed: network.
1 hr ago · Ticket opened: 'is the network okay? our app is slow'.
3 hrs ago · Change deployed outside maintenance window. Peer review: skipped.
6 hrs ago · On-call engineer paged. Was asleep.
yesterday · Someone said 'it's been stable for months'. They jinxed it.
last week · Documentation update scheduled. Postponed indefinitely.
2019 · Root cause of current incident silently introduced to production.
2017 · Network documentation last updated. Author has since left.
2003 · The protocol responsible was designed. Known limitation accepted.

last updated: never · next update: also never · subscribe: there is no subscribe button