On September 3, 2020, from 10:15 UTC to 10:38 UTC, partners with PSA databases in the LON datacenter experienced a service interruption that caused an error page to be displayed instead of the normal login page. Users with established connections would have received errors and been disconnected from their sessions.
The root cause of this service interruption was an error in a framework component on one or more servers. The error caused a race condition, which generated Send To Dev Errors (STDEs) and prevented users from logging in.
Engineers troubleshooting the issue restarted the web server application pools to reset all worker processes, and site behaviour returned to normal.
Server logs and STDE logs were inspected to determine the root cause, but engineers found no correlation between the log data and the behaviour of the affected server(s). We have put monitors in place to alert us if a similar condition occurs again.