Turning it off and then back on again probably fixes the issue. There’s very unl...

koolba · on Nov 22, 2022

> Turning it off and then back on again probably fixes the issue.

Turning a large scale system entirely off and on is never simple. Invariably you’ll run into some kind of circular dependency that must be manually investigated. And even tracking those down becomes tricky.

Classic examples are things like DNS, service locators, or authentication systems. And large tech companies are notorious for NIH-syndrome for all of those.

nemo44x · on Nov 22, 2022

There’s so much redundancy built into modern distributed systems that you can reliably bounce a VM without issue. You can reliably roll bounce a series of VMs.

Twitter doesn’t have unique scale problems by todays standards.