Root cause:
After analyzing both issues, we found the root cause. One of the API servers appeared to be in an unhealthy state and used a low CPU. Unfortunately, the load balancing system sent the customer requests to the unhealthy server.
Impact:
In both incidents, some of our API customers experienced instability in our system.
Actions:
Current status:
We are continuously monitoring the servers now and they appear to be healthy since the changes