Downtime

Elevated rate of retries due to transient timeouts on LLM providers

Apr 9, 2026 at 8:24am UTC
Affected services
Write service

Resolved
Apr 9, 2026 at 8:48am UTC

We experienced an elevated rate of retries due to transient timeouts from an upstream LLM provider. The service remained available throughout, and all queries were completed successfully after retry. Our monitoring incorrectly flagged this as downtime. We are adjusting our thresholds to better distinguish between degraded upstream latency and actual outages.
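One way to draw that distinction is sketched below in Python. It is a minimal illustration under stated assumptions, not the actual monitoring configuration: the `WindowStats` shape, the `classify_window` function, and both threshold values are hypothetical. The idea is that only requests that still fail after all retries count toward an outage, while a high retry rate alone is reported as degradation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WindowStats:
    """Request counts for one monitoring window (hypothetical shape)."""
    total: int    # requests observed in the window
    retried: int  # requests that needed at least one retry
    failed: int   # requests that still failed after all retries


def classify_window(stats, max_retry_rate=0.05, max_failure_rate=0.01):
    """Classify a window as 'ok', 'degraded', 'outage', or 'unknown'.

    Both thresholds are illustrative assumptions, not real settings.
    """
    if stats.total == 0:
        # No traffic in the window, so there is nothing to judge.
        return "unknown"
    if stats.failed / stats.total > max_failure_rate:
        # Requests are failing even after retry: treat as an outage.
        return "outage"
    if stats.retried / stats.total > max_retry_rate:
        # Requests succeed, but only after retrying: upstream is slow.
        return "degraded"
    return "ok"
```

Under these assumptions, a window with many retries and no final failures, such as `WindowStats(total=1000, retried=400, failed=0)`, is classified as degraded and not as an outage.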

Updated
Apr 9, 2026 at 8:26am UTC

Write service recovered.

Created
Apr 9, 2026 at 8:24am UTC

Write service went down.