DataRobot Outage History
Uptime record, past incidents, and downtime history for DataRobot.
Checking current status...
90-Day Trend
Monthly Uptime
| Month | Uptime | Days Tracked | Days with Issues |
|---|---|---|---|
| May 2026 | 92% | 25 | 2 |
| April 2026 | 93.3% | 30 | 2 |
| March 2026 | 71% | 31 | 9 |
| February 2026 | 100% | 5 | 0 |
Uptime is calculated from daily worst-status snapshots. A day with any non-operational status counts as a day with issues.
Daily Status (Last 91 Days)
Feb 24
Today
Operational
Degraded
Partial Outage
Major Outage
Maintenance
No Data
Incident History
May 2026
Widespread intermittent service issues for new workloads in US Production
Started: May 8, 12:15 AM
monitoring
Engineering resolved the underlying issue with workload scheduling and is monitoring the cluster.
May 8, 1:40 PM
identified
We are continuing to experience issues launching new workloads for Custom Models and Custom Applications in US Production.
This is connected to an ongoing AWS outage. Our team is exploring multiple mitigation options.
May 8, 6:13 AM
identified
We are continuing to experience issues launching new workloads for Custom Models and Custom Applications in US Production.
This is connected to an ongoing AWS outage. Our team is exploring multiple mitigation options.
May 8, 5:47 AM
monitoring
We are currently experiencing intermittent service issues in US Production, which are primarily affecting the launch of new workloads for Notebooks, Custom models, and Custom Applications. This issue does not impact existing workloads. This disruption is strongly correlated with an ongoing AWS Availability Zone outage (https://health.aws.amazon.com/health/status), causing resource allocation failures. The team is actively monitoring the situation and tracking updates from AWS.
May 8, 3:53 AM
April 2026
Delay in processing actual messages
Started: Apr 10, 10:24 AM
monitoring
Processing actual messages on JP MTS is delayed due to autoscaling malfunction.
Engineering scaled up the deployment to alleviate the issue. Root cause mitigation in progress
Apr 10, 10:24 AM
Elevated Errors on Managed AI Cloud
Started: Apr 9, 9:56 PM
monitoring
Engineering has applied changes to mitigate the elevated error rates. Services are now operating normally. We are continuing to monitor the system while investigating the cause of the issue.
Apr 10, 11:31 AM
monitoring
A fix has been implemented and we are monitoring the results.
Apr 9, 10:56 PM
investigating
We're experiencing an elevated level of errors and are currently looking into the issue.
Apr 9, 9:56 PM
March 2026
Degraded Performance on DataRobot MTS due to Quay outage
Started: Mar 30, 8:43 PM
identified
We are continuing to work on a fix for this issue.
Mar 30, 8:44 PM
identified
Our engineering team has found the the Quay outage currently happening is causing degraded performance across the DataRobot platform. Engineering is currently monitoring the situation.
Mar 30, 8:43 PM
Performance Degradation on Managed AI Cloud
Started: Mar 13, 5:49 PM
monitoring
A fix has been implemented and we are monitoring the results.
Mar 13, 6:31 PM
investigating
We are experiencing performance degradation on Managed AI Cloud.
Mar 13, 5:49 PM
Intermittent UI disruptions on Managed AI Cloud
Started: Mar 11, 8:05 PM
monitoring
A fix has been implemented and we are monitoring the results.
Mar 11, 8:22 PM
investigating
We are currently investigating this issue.
Mar 11, 8:05 PM