A few days ago, a client’s data center (well, actually a server room) "vanished" overnight. My monitoring showed that all devices were unreachable. Not even the ISP routers responded, so I assumed a sudden connectivity drop. The strange part? Not even via 4G.
I then suspected a power failure, but the UPS should have sent an alert.
The office was closed for the holidays, but I contacted the IT manager anyway. He was home sick with a serious family issue, but he got moving.
To make a long story short: the company deals in gold and precious metals. They have an underground bunker with two-meter thick walls. They were targeted by a professional gang. They used a tactic seen in similar hits: they identify the main power line, tamper with it at night, and send a massive voltage spike through it.
The goal is to fry all alarm and surveillance systems. Even if battery-backed, they rarely survive a surge like that. Thieves count on the fact that during holidays, owners are away and fried systems can't send alerts. Monitoring companies often have reduced staff and might not notice the "silence" immediately.
That is exactly what happened here. But there is a "but": they didn't account for my Uptime Kuma instance monitoring their MikroTik router, installed just weeks ago. Since it is an external check, it flagged the lack of response from all IPs without needing an internal alert to be triggered from the inside.
The team rushed to the site and found the mess. Luckily, they found an emergency electrical crew to bypass the damage and restore the cameras and alarms. They swapped the fried server UPS with a spare and everything came back up.
The police warned that the chances of the crew returning the next night to "finish" the job were high, though seeing the systems back online would likely make them move on. They also warned that thieves sometimes break in just to destroy servers to wipe any video evidence.
Nothing happened in the end. But in the meantime, I had to sync all their data off-site (thankfully they have dual 1Gbps FTTH), set up an emergency cluster, and ensure everything was redundant.
Never rely only on internal monitoring. Never.
#IT #SysAdmin #HorrorStories #ITHorrorStories #Monitoring
Josef Zettl alias Princezna
in reply to Schmaker • • •Ne, ale když jí rozkliknu, tak jo.
Schmaker likes this.
Friendica Support reshared this.
Schmaker
in reply to Schmaker • •Friendica Support reshared this.
Montag
in reply to Schmaker • • •Schmaker
in reply to Montag • •Someone who expect things to "just work" 😀
I can imagine use cases where you actually want picture this big, but these are corner cases. Yet I'd totally expect both sides to handle it somehow (resizing, linking to original, whatever).
Friendica Support reshared this.
Schmaker
in reply to Schmaker • •What is the proper way to bump here? 😀
I would like to know where to report the bug - it's Friendica or Mastodon side bug?
Friendica Support reshared this.