Fly.io Outage History
Uptime record, past incidents, and downtime history for Fly.io.
Checking current status...
90-Day Trend
Monthly Uptime
| Month | Uptime | Days Tracked | Days with Issues |
|---|---|---|---|
| April 2026 | 60% | 10 | 4 |
| March 2026 | 51.6% | 31 | 15 |
| February 2026 | 43.5% | 23 | 13 |
Uptime is calculated from daily worst-status snapshots. A day with any non-operational status counts as a day with issues.
Daily Status (Last 64 Days)
Feb 6
Today
Operational
Degraded
Partial Outage
Major Outage
Maintenance
No Data
Incident History
April 2026
Unavailable hosts in ORD region
Started: Apr 9, 7:29 PM
investigating
Some hosts in our Chicago (ORD) region are currently inaccessible. We are working with our provider to resolve this issue.
To see if you are affected, please visit the personalized status page: https://fly.io/status
A small amount of Managed Postgres clusters may also be inaccessible at this time.
Apr 9, 7:29 PM
Managed Postgres Control Plane Issues in SYD
Started: Apr 9, 3:50 AM
identified
We are seeing an improvement in control plane performance in the SYD region. Some clusters in the region currently are showing degraded standby nodes and we are working to bring those back to full health.
Apr 9, 4:12 AM
investigating
We are investigating elevated control plane issues for Managed Postgres clusters in SYD.
The majority of clusters appear to be running fine, but new creates, backup restores, and upgrades may show errors or take longer than usual to complete. Some clusters will have seen a failover event from primary to standby.
Apr 9, 3:50 AM
Metrics currently experiencing issues
Started: Apr 8, 8:34 AM
monitoring
We are continuing to monitor for any further issues.
Apr 8, 11:02 AM
monitoring
We have implemented a fix. We're monitoring the cluster for further issues.
Apr 8, 11:00 AM
investigating
We are currently investigating an issue with our metrics cluster.
Apr 8, 8:34 AM
GraphQL API / Dashboard Issues
Started: Apr 7, 3:08 PM
monitoring
A fix has been implemented and we are monitoring the results.
Apr 7, 3:39 PM
identified
We have restored GraphQL and dashboard availability, but some actions (e.g. app state updates) may still be delayed.
Apr 7, 3:17 PM
investigating
We are investigating issues with our GraphQL API and web dashboard
Apr 7, 3:08 PM
March 2026
Low Capacity in SIN and AMS regions
Started: Mar 29, 3:00 PM
monitoring
We've freed up additional room in the SIN and AMS regions and are monitoring capacity.
Mar 29, 3:35 PM
monitoring
We've freed up additional room in the SIN and AMS regions and are monitoring capacity.
Mar 29, 3:33 PM
identified
We are currently investigating capacity issues in SIN and AMS regions that are affecting:
- Machine Create and Start events
- Deployments, due to affected, degraded Remote Builders
- Sprite startup from cold state
Mar 29, 3:19 PM
identified
This may also affect:
- Remote builders in AMS and SIN regions, which could currently be experiencing degraded performance or failures.
- Sprites starting from a cold state, which may experience failures in starting
Mar 29, 3:13 PM
identified
We are currently investigating elevated errors when creating and starting machines in the SIN and AMS regions. Choosing other regions to create or deploy may help in the meantime
Mar 29, 3:00 PM
Low capacity in IAD
Started: Mar 27, 6:08 PM
monitoring
With the additional capacity we've brought online, machine start failure rates in IAD have now recovered. We'll continue to monitor IAD capacity.
Mar 27, 9:09 PM
identified
We've brought some additional capacity online in IAD and are seeing improvements, and we're continuing to work on adding more and freeing up additional room.
Mar 27, 7:21 PM
investigating
We're continuing to evaluate our options for increasing short-term capacity in the IAD region.
Mar 27, 6:47 PM
investigating
We're currently investigating capacity issues in IAD that is preventing machine starts (machine creates are currently unaffected). This may result in deploys failing to complete (even for apps outside of the IAD region). As a workaround, using legacy Fly builders explicitly located in another region (i.e., `FLY_REMOTE_BUILDER_REGION=lhr fly deploy --depot=false --recreate-builder`) may help in the meantime.
Mar 27, 6:08 PM
Machine Creates Failing in ORD Region
Started: Mar 26, 3:21 PM
monitoring
We've implemented a fix and have seen error rates for machine creates in ORD drop off. We're continuing to monitor the results.
Mar 26, 5:28 PM
identified
We've identified the cause of this increased failure rate and a fix is in progress. We are seeing most creates in ORD succeed at this time, though failure rate is still above baseline.
Mar 26, 4:50 PM
investigating
We are continuing to investigate this issue. We are seeing 408 errors decreasing in ORD, though still above baseline.
Mar 26, 4:08 PM
investigating
We are currently investigating elevated errors creating machines in the ORD (Chicago, Illinois) region. Users may see `failed to launch VM: request returned non-2xx status: 408` errors when creating, updating, or scaling machines in ORD.
Existing, already running machines in the ORD region continue to run as normal.
Mar 26, 3:21 PM
Network issues in FRA region
Started: Mar 26, 12:37 PM
identified
Some Managed Postgres clusters in FRA region are still unreachable, we are investigating this issue.
Mar 26, 1:16 PM
monitoring
Apps and Managed Postgres clusters in FRA region should be back online at this time. We are monitoring for any further issues.
Mar 26, 1:14 PM
investigating
We are investigating network issues in FRA region. Apps and/or Managed Postgres clusters in the region may be inaccessible at this time.
Mar 26, 12:37 PM
Backend errors when trying to use Grafana to view logs
Started: Mar 23, 3:18 PM
monitoring
We've deployed a fix and are monitoring the results. Logs are now be visible on Grafana.
Mar 23, 3:55 PM
identified
Using the Logs panel in Grafana at https://fly-metrics.net/ will show a 502 error from the backend and won't show any logs. You can use `fly logs` or the live log viewer directly on https://fly.io/dashboard to view streaming logs for the time being.
Mar 23, 3:41 PM
investigating
Using the Logs panel in Grafana at https://fly-metrics.net/ will show a 502 error from the backend and won't show any logs. You can use `fly logs` or the live log viewer directly on https://fly.io/dashboard to view streaming logs for the time being.
Mar 23, 3:18 PM
Machines failing to start in DFW
Started: Mar 20, 7:26 AM
monitoring
Machine start success rates in DFW have improved but we are continuing to monitor and make further adjustments. We will provide updates as the situation progresses.
Mar 21, 8:26 AM
monitoring
In addition to freeing up existing capacity, the team has provisioned new capacity in DFW and we are monitoring the results.
Mar 20, 12:45 PM
monitoring
We freed up some capacity on our workers to allow for successful Machine starts.
Mar 20, 8:08 AM
investigating
The Machines start failure rate is elevated in DFW.
Mar 20, 7:26 AM
Metrics currently experiencing issues
Started: Mar 19, 6:28 AM
monitoring
We have implemented a fix. There has been approximately 1h of lost metrics from 06:07UTC. We're monitoring the cluster for further issues
Mar 19, 7:12 AM
investigating
We are currently investigating an issue with our metrics cluster.
Mar 19, 6:28 AM
IPv6 networking issues in SJC region
Started: Mar 18, 4:12 PM
monitoring
A fix has been implemented and we are monitoring the results.
Mar 18, 4:31 PM
investigating
We are investigating intermittent network issues in SJC region impacting outbound public IPv6 access from Machines. Connecting to IPv6 internet resources from apps hosted in SJC region may be slow or fail at this time.
IPv4 access, as well as 6PN private networking, are unaffected.
Mar 18, 4:12 PM
Fly ssh console command failing
Started: Mar 18, 2:12 PM
identified
We have identified an issue causing new `fly ssh console` connections to fail with 500 errors. A fix is in progress.
Mar 18, 2:12 PM
Connection Issues in SJC
Started: Mar 18, 2:07 PM
monitoring
Between 13:55 and 14:03 UTC machines and MPG clusters hosted in the SJC region saw elevated connection errors. Users may have seen errors connecting to or from most machines in the region, as well as with deployments or updates to machines in the region.
Networking has returned to normal in the region, and we are continuing to monitor closely to ensure stable recovery.
Mar 18, 2:07 PM
Machines failing to start in DFW
Started: Mar 18, 9:58 AM
monitoring
A fix has been implemented and we are monitoring the results.
Mar 18, 12:40 PM
identified
The team is currently rolling out additional capacity in DFW which should help ease Machine start failures across the region.
Mar 18, 11:44 AM
investigating
We are investigating reports of machines failing to start in the DFW (Dallas) region with "insufficient memory" errors. This may cause deployment failures for applications running in DFW.
Our team is actively working to restore full capacity in the region. If you are affected, deploying to an alternate region may serve as a temporary workaround.
We will provide updates as the situation progresses.
Mar 18, 9:58 AM
Elevated 502 errors when starting Sprites in LAX and ORD
Started: Mar 16, 9:59 PM
investigating
We're currently investigating an elevated number of 502 errors when attempting to start Sprites in LAX and ORD.
Mar 16, 9:59 PM
Sprite Operations: 401 errors for certain organizations
Started: Mar 14, 1:33 PM
monitoring
A fix has been implemented and we are monitoring the results.
Mar 14, 1:45 PM
investigating
Organizations with numerical prefixes might experience failing sprite operations ( like creating a sprite, listing sprites, etc... ) due to 401 errors
Mar 14, 1:44 PM
monitoring
Root cause has been identified and a fix has been applied
Mar 14, 1:42 PM
investigating
Organizations with numerical prefixes might experience failing sprite operations ( like creating a sprite, listing sprites, etc... ) due to 401 errors
Mar 14, 1:41 PM
monitoring
Organizations with numerical prefixes might experience failing sprite operations ( like creating a sprite, listing sprites, etc... ) due to 401 errors
Mar 14, 1:33 PM
Sprites Operations: 401 errors for certain organizations
Started: Mar 14, 12:30 PM
monitoring
Organizations with names prefixed with numerical digits may experience 401 errors. Affected operations include actions such as Sprite creation, listing, etc...
A fix has been implemented and we are monitoring the results!
Mar 14, 1:55 PM