Dates:
Saturday September 15th, 2024 17:02 - 20:15 UTC
What happened:
Our Real Device Cloud (RDC) availability dropped below 80% across our US West and US East regions.
Why it happened:
Our certificate renewal process, which is set to run automatically, failed to run as expected, so the necessary certificates were not renewed.
How we fixed it:
We fixed the problem by restarting the impacted service, which automatically renewed the necessary certificates. This brought our device availability back to normal.
What we are doing to prevent it from happening again:
We have opened an investigation to review and fix the renewal script so that certificates are renewed automatically as intended.
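As an illustration only, one kind of safeguard such a review could consider is an independent check that flags a certificate when it gets close to expiry, so that a missed renewal is noticed before it affects availability. The sketch below is not our renewal script: the 30-day window, the function names, and the status labels are all assumptions made for the example.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical threshold: flag a certificate once this little validity remains.
RENEWAL_WINDOW = timedelta(days=30)


def time_remaining(not_after: datetime, now: datetime) -> timedelta:
    """Return how long the certificate stays valid (negative if expired)."""
    return not_after - now


def renewal_status(not_after: datetime, now: datetime,
                   window: timedelta = RENEWAL_WINDOW) -> str:
    """Classify a certificate as 'ok', 'renewal_due', or 'expired'."""
    remaining = time_remaining(not_after, now)
    if remaining <= timedelta(0):
        return "expired"
    if remaining <= window:
        return "renewal_due"
    return "ok"


if __name__ == "__main__":
    # Made-up expiry date, used only to exercise the check.
    now = datetime(2024, 9, 1, tzinfo=timezone.utc)
    expiry = datetime(2024, 9, 20, tzinfo=timezone.utc)
    print(renewal_status(expiry, now))
```

A check like this would run on its own schedule, separate from the renewal job, so that a renewal job that silently fails to run still produces an alert.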