Grafana Cloud Outage History
Uptime record, past incidents, and downtime history for Grafana Cloud.
90-Day Trend
Monthly Uptime
| Month | Uptime | Days Tracked | Days with Issues |
|---|---|---|---|
| April 2026 | 0% | 10 | 10 |
| March 2026 | 0% | 31 | 31 |
| February 2026 | 47.8% | 23 | 12 |
Uptime is calculated from daily worst-status snapshots. A day with any non-operational status counts as a day with issues.
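The calculation described above can be sketched as follows. This is a minimal illustration, not the page's actual implementation: the function names, the lowercase status strings, and the rounding to one decimal place are assumptions chosen to match the figures in the table.

```python
from typing import Iterable


def monthly_uptime(days_tracked: int, days_with_issues: int) -> float:
    """Percentage of tracked days whose worst status was operational.

    Rounding to one decimal is an assumption made to match the table.
    """
    if days_tracked <= 0:
        raise ValueError("days_tracked must be positive")
    if not 0 <= days_with_issues <= days_tracked:
        raise ValueError("days_with_issues must be between 0 and days_tracked")
    clean_days = days_tracked - days_with_issues
    return round(100 * clean_days / days_tracked, 1)


def uptime_from_daily(worst_statuses: Iterable[str]) -> float:
    """Compute uptime from one worst-status snapshot per tracked day.

    Any status other than "operational" marks the day as a day with issues,
    following the rule stated above. The status labels are hypothetical.
    """
    statuses = list(worst_statuses)
    issues = sum(1 for status in statuses if status != "operational")
    return monthly_uptime(len(statuses), issues)


if __name__ == "__main__":
    # February 2026 row: 23 days tracked, 12 days with issues.
    print(monthly_uptime(23, 12))
    # March 2026 row: every tracked day had an issue.
    print(monthly_uptime(31, 31))
```

Under these assumptions, the February row works out to 11 clean days out of 23, which agrees with the 47.8% shown in the table, and a month in which every tracked day had an issue yields 0%.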
Daily Status (Last 64 Days): Feb 6 to today. Legend: Operational, Degraded, Partial Outage, Major Outage, Maintenance, No Data.
Incident History
April 2026
K6 Browser Testing/Timeline Not Available
Started: Apr 9, 5:34 PM
investigating
We’re currently investigating an issue affecting browser testing.
Users running browser tests will not be able to see the browser timeline.
Our team is actively working to identify the cause and will share an update within two hours.
Thank you for your patience.
Apr 9, 5:34 PM
Unable to Edit Notification Policies
Started: Apr 7, 3:17 PM
identified
We’ve identified the cause of the issue impacting notification policies. Our team is currently implementing a fix. We’ll provide another update in 2 hours or sooner if the situation changes.
Apr 7, 6:03 PM
identified
We’ve identified the cause of the issue impacting notification policies. Our team is currently implementing a fix. We’ll provide another update in 2 hours or sooner if the situation changes.
Apr 7, 4:52 PM
investigating
We’re currently investigating an issue affecting notification policies. Our team is actively working to identify the cause and will share an update within 2 hours. Thank you for your patience.
Apr 7, 3:17 PM
Notification Policies and Contact Points Missing in UI on the Slow Release Channel
Started: Apr 6, 2:48 PM
monitoring
We’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time. We’ll update again within 2 hours.
Apr 6, 11:58 PM
identified
We’ve identified the cause of the issue impacting the Notification Policy and Contact Point UI. Our team is currently implementing a fix.
We’ll provide another update when the fix is deployed and we monitor the expected improvement.
Apr 6, 9:04 PM
investigating
We’re continuing to investigate the issue with the alerting UI. While we don’t have new information to share yet, our team is working to identify the root cause. Next update in 2 hours.
Apr 6, 4:13 PM
investigating
We’re currently investigating an issue affecting notification policies and contact points for instances on the slow release channel. Alerting API calls for contact points and notification policies return data as expected, so this appears to be limited to the UI.
Our team is actively working to identify the cause and will share an update within 1-2 hours. Thank you for your patience.
Apr 6, 2:48 PM
Partial K6 Test Run Outage
Started: Apr 3, 3:29 PM
investigating
We're experiencing an outage affecting test runs that use k6 extensions. The issue prevents users from executing these types of test runs both locally and in Grafana Cloud.
Test runs that do not use extensions are not affected by this incident.
Apr 3, 3:29 PM
AWS integration Degraded Performance
Started: Apr 1, 8:17 PM
investigating
We are investigating a noticeable drop in active series for the AWS integration that began around 18:15 UTC.
This issue may cause scrapes to hit rate limits, which can result in individual data points not being collected for the serverless integration. The impact is intermittent and may affect any customer using the AWS integration, regardless of region.
We are currently working to identify the cause and will provide an update as soon as we have more information.
Apr 1, 8:17 PM
Query degradation and possible rule evaluation failure on prod-eu-west-0.cortex-prod-01
Started: Apr 1, 9:56 AM
monitoring
A fix has been implemented and we are monitoring the results.
Apr 1, 10:12 AM
investigating
We are continuing to investigate this issue.
Apr 1, 10:11 AM
investigating
We are currently observing delays in ingesting data, which may cause partial query results and failed rule evaluations for the prod-eu-west-0.cortex-prod-01 metrics cell.
Apr 1, 9:56 AM
March 2026
Some of the CloudWatch queries are failing
Started: Mar 31, 9:48 AM
monitoring
We are continuing to monitor for any further issues.
Mar 31, 9:49 AM
monitoring
Some of the CloudWatch queries were failing.
Started at 08:37 UTC
Monitoring from 09:21 UTC
Mar 31, 9:48 AM
Some Grafana Instances Unavailable
Started: Mar 27, 1:36 PM
monitoring
We’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time. We’ll update again in 1 hour.
Mar 27, 8:16 PM
identified
We’ve identified the cause of the issue impacting the instances. Our team is currently implementing a fix. We’ll provide another update in 1–2 hours, or sooner, if the situation changes.
Mar 27, 6:10 PM
investigating
We’re continuing to investigate the issue with Grafana instances. While we don’t have new information to share yet, our team is working to identify the root cause. Next update in 1-2 hours.
Mar 27, 4:36 PM
investigating
We’re continuing to investigate the issue with Grafana instances. While we don’t have new information to share yet, our team is working to identify the root cause. Next update in 1-2 hours.
Mar 27, 2:51 PM
investigating
We’re currently investigating an issue which is affecting primarily users on the Free tier. Impacted users will be met with a "your Grafana instance is loading" message indefinitely. Our team is actively working to identify the cause and will share an update within 1-2 hours. Thank you for your patience.
Mar 27, 1:36 PM
Prometheus writes in prod-eu-west-3 are degraded
Started: Mar 25, 2:11 PM
monitoring
We are still seeing intermittent issues and continue to seek a resolution.
Apr 8, 8:32 PM
monitoring
We are continuing to monitor for any further issues.
Apr 2, 9:38 PM
monitoring
We are continuing to monitor this through the weekend.
Mar 27, 9:05 PM
monitoring
We are continuing to monitor the previously impacted environments.
Mar 26, 5:45 PM
monitoring
A fix has been implemented and we are monitoring the results.
Mar 26, 12:04 PM
investigating
We are continuing to investigate this issue.
Mar 25, 9:35 PM
investigating
The metric writes issue reported in https://status.grafana.com/incidents/gfshj17lxj5z is still ongoing.
Our Engineering team is actively investigating this and we will provide further updates as our investigation progresses.
Mar 25, 2:11 PM
Prometheus writes, Logs, and Synthetic Monitoring in prod-eu-west-3 are degraded
Started: Mar 24, 9:08 AM
investigating
This is also now impacting Logs and Synthetic Monitoring in prod-eu-west-3.
For Synthetic Monitoring, users might observe errors pushing check execution metrics, and this can eventually lead to missing data.
In addition, users might observe errors evaluating Synthetic Monitoring provisioned alert rules, and this can lead to missed alerts.
For Logs, there is no immediate impact on alerts; however, remote writes to Mimir are delayed, which means users may see gaps in their recordin...
Mar 25, 7:43 AM
investigating
We are moving this back to 'Investigating' as we are now observing a substantial drop in successful ingestion, an increase in write path errors, and elevated rule evaluation latency and errors. Reads are mostly fine. Our Engineering team is actively investigating this and we will provide further updates as our investigation progresses.
Mar 25, 7:04 AM
monitoring
We have not observed any recent errors, but we will continue to monitor while we work with our CSP.
Mar 24, 9:23 PM
monitoring
A fix has been implemented and we are monitoring the results.
Mar 24, 9:19 AM
investigating
We are currently experiencing degraded writes for mimir-prod-22 in prod-eu-west-3 since 08:45Z.
Mar 24, 9:08 AM
Grafana Assistant Unavailable in prod-us-east-0
Started: Mar 23, 5:03 PM
identified
The issue has been identified, and we are implementing a fix.
Mar 23, 6:25 PM
investigating
The impact extends beyond the TOS check. Assistant is completely unavailable in the impacted region.
Mar 23, 6:07 PM
investigating
We are continuing to investigate this issue.
Mar 23, 6:01 PM
investigating
We are aware of an issue currently impacting Grafana Assistant. Impacted users are met with a request to accept the TOS; however, the plugin fails upon accepting. Our engineering team is currently investigating this issue.
Mar 23, 5:03 PM
Authentication API Database Down in prod-eu-west-2 and prod-eu-west-4
Started: Mar 20, 3:00 PM
investigating
We have observed impact in prod-eu-west-4 as well.
Mar 20, 3:08 PM
investigating
We are currently investigating an issue impacting the main database for the Authentication APIs in the prod-eu-west-2 region. Writes are currently failing, but reads are operational.
Mar 20, 3:00 PM
Various Datasource Issues
Started: Mar 19, 4:46 PM
monitoring
We are continuing to monitor for any further issues.
Mar 19, 5:56 PM
monitoring
We have observed recovery for the Cloudwatch Datasource.
We are now seeing failures for the following Datasources:
Aurora
Opensearch
X-Ray
Timestream
Redshift
Sitewise
A fix for the above is being rolled out now, and we will monitor progress.
We will also change the name of this incident from "Cloudwatch Datasource Issues" to "Various Datasource Issues" to more accurately reflect impact.
Mar 19, 5:56 PM
monitoring
We have identified the issue, and are rolling out the fix. We are already seeing improvements and will continue to monitor progress.
Mar 19, 5:13 PM
investigating
We are currently investigating an issue impacting the CloudWatch Datasource causing failures.
Mar 19, 4:46 PM
Degraded performance of Grafana Cloud k6 test runs
Started: Mar 19, 11:17 AM
investigating
Some customers are seeing degraded performance and errors from certain v6 API endpoints. We are investigating the issue.
Mar 19, 11:17 AM
Grafana Cloud Logs - Write degradation in Azure Netherlands (eu-west-3)
Started: Mar 13, 10:28 AM
investigating
We are continuing to investigate this issue with our CSP, and will provide updates as they become available.
Mar 13, 9:22 PM
investigating
We are seeing issues on the write path for Loki in cluster Azure Netherlands (eu-west-3). Impact will reflect in degradation of logs ingestion on that cluster. Our engineering team is already working on restoring the service.
Mar 13, 10:28 AM
Increased number of Aborted-by-Systems with k6 binary building errors
Started: Mar 13, 7:41 AM
monitoring
A fix has been implemented and we are monitoring the results.
Mar 13, 12:49 PM
identified
The issue has been identified and a fix is being implemented.
Mar 13, 8:45 AM
investigating
We are seeing an increased number of Aborted-by-Systems with a k6 binary building error. We are investigating the issue.
The first occurrence was on March 9, and it has now been identified as a blocking issue for some customers.
Mar 13, 7:41 AM
Rule Evaluation Outage in prod-us-west-0
Started: Mar 11, 5:10 PM
monitoring
A fix has been implemented and we are monitoring the results.
Mar 11, 6:02 PM
investigating
We are currently investigating an issue impacting rule evaluation for a subset of customers in the prod-us-west-0 region. We will provide updates as they become available.
Mar 11, 5:10 PM
Grafana Cloud Logs - Write degradation in Azure Netherlands (eu-west-3)
Started: Mar 11, 8:31 AM
investigating
We are also reporting impact to Faro performance in the same region. We are continuing to investigate this issue.
Mar 11, 9:13 AM
investigating
We are seeing issues on the write path for Loki in cluster Azure Netherlands (eu-west-3). Impact will reflect in degradation of logs ingestion on that cluster. Our engineering team is already working on restoring the service.
Mar 11, 8:31 AM
Complete outage in prod-me-central-1
Started: Mar 2, 6:43 AM
investigating
We have not received any further updates from AWS at this time. However, we are actively monitoring the outage and will provide additional information as it becomes available.
Also, please continue to refer to the AWS status page for more detailed updates.
https://health.aws.amazon.com/health/status
All the guidance previously included about stack migration is still relevant. Please reach out to our Support team if you have any questions.
Mar 19, 12:13 PM
investigating
We are actively monitoring the situation, but at this time there are no new updates to share. The next update will be provided once we have more information to share. Please reach out to our Support team if you have any questions.
Mar 4, 10:22 PM
investigating
We are continuing to investigate this issue.
Mar 4, 10:28 AM
investigating
Please continue to refer to the AWS status page for more detailed updates specific to AWS.
https://health.aws.amazon.com/health/status
AWS are recommending that affected customers move workloads to alternate regions, and we are recommending the same.
Customers who are impacted and who cannot wait for a restoration of service are asked to:
1. Create a Grafana Cloud stack in an alternate region
2. Update clients to send telemetry to the new region, if using Grafana Alloy then you can use Fle...
Mar 2, 10:18 PM
investigating
AWS are recommending that affected customers move workloads to alternate regions https://health.aws.amazon.com/health/status and we are recommending the same.
Customers who are impacted and who cannot wait for a restoration of service are asked to:
1. Create a Grafana Cloud stack in an alternate region
2. Update clients to send telemetry to the new region, if using Grafana Alloy then you can use Fleet Management https://grafana.com/docs/grafana-cloud/send-data/fleet-management/introduction/...
Mar 2, 10:31 AM
investigating
We recommend that customers configure a new blank stack in an alternative Grafana Cloud region and reconfigure their clients (such as Grafana Alloy) to send telemetry to that region. Fleet Management can be used for this purpose: https://grafana.com/docs/grafana-cloud/send-data/fleet-management/introduction/
Mar 2, 10:04 AM
investigating
We are updating this incident to reflect a complete outage in prod-me-central-1, due to an ongoing AWS UAE data center issue. We will provide further updates accordingly.
Mar 2, 8:36 AM
investigating
We are observing write and read outage errors across all databases (metrics, logs, traces) in prod-me-central-1, due to an ongoing AWS UAE data center issue. We will provide further updates accordingly.
Mar 2, 8:21 AM
investigating
We are observing write and read outage errors across all databases (metrics, logs, traces) in prod-me-central-1, due to an ongoing AWS UAE data center issue. We will provide further updates accordingly.
Mar 2, 8:14 AM
investigating
We are seeing elevated write and read path errors in prod-me-central-1, due to an ongoing AWS UAE data center issue. We will provide further updates accordingly.
Mar 2, 6:43 AM
February 2026
Grafana Cloud Metrics - Intermittent Write Latency in prod-us-central-0, prod-us-central-5, and prod-eu-west-0
Started: Feb 25, 7:54 PM
monitoring
We are rolling out a mitigation across the environments in these regions, and preemptively where possible to ensure it doesn’t spread elsewhere.
Mar 6, 9:44 PM
monitoring
We have seen an increase in latency in our cloud provider's services, and are rolling out a change to mitigate the issue. We are monitoring.
Mar 6, 8:53 PM
monitoring
We are continuing to investigate this issue alongside the CSP, and have taken steps to escalate through the appropriate channels. The mitigation in place continues to work as expected, and any notable updates will continue to be shared here for tracking.
Mar 5, 10:22 PM
monitoring
We are continuing to investigate this issue alongside the CSP. Any notable updates will continue to be shared here for tracking.
Feb 27, 10:05 PM
monitoring
We've put a mitigation in place and are continuing to monitor and investigate this issue.
Feb 27, 2:55 PM
investigating
We have begun rolling out mitigation steps to reduce write latency in the prod-us-central-0 and prod-us-central-5 regions. While these measures are expected to improve performance, we are continuing to investigate the underlying root cause of the issue. We will provide additional updates as more information becomes available.
Feb 26, 4:23 PM
investigating
Since February 19, we have been investigating an intermittent issue causing increased write latency in the prod-us-central-0 and prod-us-central-5 regions. The issue does not affect all traffic but may result in delayed write operations for some customers. Our engineering team is actively working to identify the root cause and stabilize performance. We will share additional updates as progress is made.
Feb 25, 7:54 PM