New AI Incident Response, Multi-Region Agents, and Custom-Domain Status Pages — May 2026
Services Pricing Dashboard

DataRobot Outage History

Uptime record, past incidents, and downtime history for DataRobot.

Checking current status...
85.7% uptime over 91 days
99.9% ✗ 99.5% ✗ 99% ✗ 95% ✗

90-Day Trend

Feb 25May 25

Monthly Uptime

Month Uptime Days Tracked Days with Issues
May 2026 92% 25 2
April 2026 93.3% 30 2
March 2026 71% 31 9
February 2026 100% 5 0

Uptime is calculated from daily worst-status snapshots. A day with any non-operational status counts as a day with issues.

Daily Status (Last 91 Days)

Feb 24 Today
Operational Degraded Partial Outage Major Outage Maintenance No Data

Incident History

May 2026
Widespread intermittent service issues for new workloads in US Production
minor

Started: May 8, 12:15 AM

monitoring
Engineering resolved the underlying issue with workload scheduling and is monitoring the cluster.
May 8, 1:40 PM
identified
We are continuing to experience issues launching new workloads for Custom Models and Custom Applications in US Production. This is connected to an ongoing AWS outage. Our team is exploring multiple mitigation options.
May 8, 6:13 AM
identified
We are continuing to experience issues launching new workloads for Custom Models and Custom Applications in US Production. This is connected to an ongoing AWS outage. Our team is exploring multiple mitigation options.
May 8, 5:47 AM
monitoring
We are currently experiencing intermittent service issues in US Production, which are primarily affecting the launch of new workloads for Notebooks, Custom models, and Custom Applications. This issue does not impact existing workloads. This disruption is strongly correlated with an ongoing AWS Availability Zone outage (https://health.aws.amazon.com/health/status), causing resource allocation failures. The team is actively monitoring the situation and tracking updates from AWS.
May 8, 3:53 AM
April 2026
Delay in processing actual messages
minor

Started: Apr 10, 10:24 AM

monitoring
Processing actual messages on JP MTS is delayed due to autoscaling malfunction. Engineering scaled up the deployment to alleviate the issue. Root cause mitigation in progress
Apr 10, 10:24 AM
Elevated Errors on Managed AI Cloud
major

Started: Apr 9, 9:56 PM

monitoring
Engineering has applied changes to mitigate the elevated error rates. Services are now operating normally. We are continuing to monitor the system while investigating the cause of the issue.
Apr 10, 11:31 AM
monitoring
A fix has been implemented and we are monitoring the results.
Apr 9, 10:56 PM
investigating
We're experiencing an elevated level of errors and are currently looking into the issue.
Apr 9, 9:56 PM
March 2026
Degraded Performance on DataRobot MTS due to Quay outage
minor

Started: Mar 30, 8:43 PM

identified
We are continuing to work on a fix for this issue.
Mar 30, 8:44 PM
identified
Our engineering team has found the the Quay outage currently happening is causing degraded performance across the DataRobot platform. Engineering is currently monitoring the situation.
Mar 30, 8:43 PM
Performance Degradation on Managed AI Cloud
minor

Started: Mar 13, 5:49 PM

monitoring
A fix has been implemented and we are monitoring the results.
Mar 13, 6:31 PM
investigating
We are experiencing performance degradation on Managed AI Cloud.
Mar 13, 5:49 PM
Intermittent UI disruptions on Managed AI Cloud
minor

Started: Mar 11, 8:05 PM

monitoring
A fix has been implemented and we are monitoring the results.
Mar 11, 8:22 PM
investigating
We are currently investigating this issue.
Mar 11, 8:05 PM