Digraph
Organize the world
Digraph
Search
Everything
This topic
Blog
Recent
Everything
Sign in
Software system incidents
Software system incidents
Software system incident postmortems
Parent topics
Software systems
This topic
Recent activity
You must be
signed in
to add and move topics and links.
2012-08 Knight Capital stock trading disruption
Software system incidents
2024-07 CrowdStrike outage
CrowdStrike
Microsoft Windows
Software system incidents
Computer security postmortems
Privacy, computer security, vulnerabilities and attacks
Software system incidents
5 Whys - Wikipedia
https://en.wikipedia.org/wiki/5_Whys
Software system incidents
Cloudflare outage on June 21, 2022
https://blog.cloudflare.com/cloudflare-outage-on-june-21-2022/
Software system incidents
Cloudflare outage on June 21, 2022 | Hacker News
https://news.ycombinator.com/item?id=31823132
Software system incidents
Details of the Cloudflare outage on July 2, 2019
https://blog.cloudflare.com/details-of-the-cloudflare-outage-on-july-2-2019/
Cloudflare
Kyoto Tycoon
Quicksilver (key-value store)
Software system incidents
GitHub - hjacobs/kubernetes-failure-stories: Compilation of public failure/horror stories related to Kubernetes
https://github.com/hjacobs/kubernetes-failure-stories
Kubernetes
Software system incidents
Google Cloud Status Dashboard
https://status.cloud.google.com/incident/storage/19002
Software system incidents
How to lose $172k per second for 45 minutes (2013) | Hacker News
https://news.ycombinator.com/item?id=19542766
Software system incidents
Kubernetes Failure Stories | Hacker News
https://news.ycombinator.com/item?id=20163500
Kubernetes
Software system incidents
Roblox Return to Service 10/28-10/31 2021 - Roblox Blog
https://blog.roblox.com/2022/01/roblox-return-to-service-10-28-10-31-2021/
Software system incidents
Root cause analysis: significantly elevated error rates on 2019‑07‑10
https://stripe.com/rcas/2019-07-10
Software system incidents
Root cause analysis: significantly elevated error rates on 2019‑07‑10 | Hacker News
https://news.ycombinator.com/item?id=20422337
Software system incidents
Route Leak Impacting Cloudflare | Hacker News
https://news.ycombinator.com/item?id=20262214
Border Gateway Protocol (BGP)
Cloudflare
Software system incidents
Summary of the AWS Service Event in the Northern Virginia (US-EAST-1) Region
https://aws.amazon.com/message/12721/
Software system incidents
System separation in the Continental Europe Synchronous Area on 8 January 2021 – 2nd update
https://www.entsoe.eu/news/2021/01/26/system-separation-in-the-continental-europe-synchronous-area-on-8-january-2021-2nd-update/
Software system incidents
Today's Outage Post Mortem
https://blog.cloudflare.com/todays-outage-post-mortem-82515/
Cloudflare
Software system incidents
Update to Security Incident [May 17, 2019] - Stack Overflow Blog
https://stackoverflow.blog/2019/05/17/update-to-security-incident-may-17-2019/
Software system incidents
StackOverflow
Update to Security Incident | Hacker News
https://news.ycombinator.com/item?id=19941797
Software system incidents
StackOverflow
Verizon and a BGP Optimizer Knocked Large Parts of the Internet Offline | Hacker News
https://news.ycombinator.com/item?id=20267790
Border Gateway Protocol (BGP)
Cloudflare
Software system incidents
python sweetness — How to lose $172,222 per second for 45 minutes
https://sweetness.hmmz.org/2013-10-22-how-to-lose-172222-a-second-for-45-minutes.html
2012-08 Knight Capital stock trading disruption
Software system incidents