On January 19th, 2022 at 17:00 UTC, Datto RMM Partners on the Concord platform experienced a service interruption which caused 504 Errors when logging in or navigating, and alert delays.
The root cause for this service interruption was identified to be a slowdown in our backend REST service, our backend alerts cluster and other related servers/instances became stressed.
While REST recovered by itself, a double failover of the alerts cluster was performed, and other related servers/instances were restarted. The service was confirmed to be restored January 19th, 2022 at 17:45 UTC.