GitHub SLA Tracker

2025-Q3

Jul 1, 2025 - Sep 30, 2025

SLA Violation

Total Downtime

1d 5h
Weighted by impact

Total Incidents

44
In this quarter (19 tracked)

Worst Component

Issues
99.742% uptime

Service Features

Time-based uptime calculation for the 132,480 minutes in this quarter

Calculation Method: (Total minutes - Downtime) / Total minutes × 100
Downtime Definition: Minutes with >5% error rate (approximated from incident data)
Component        Uptime %   Downtime   Incidents   Status      Service Credit
Git Operations   99.8564%   3h 10m     5           Violation   10%
API Requests     99.7551%   5h 25m     5           Violation   10%
Issues           99.7424%   5h 41m     7           Violation   10%
Pull Requests    99.7660%   5h 10m     5           Violation   10%
Webhooks         99.9530%   1h 2m      2           Pass        None
Pages            99.9974%   3m         1           Pass        None
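
As a rough illustration of the calculation method above, here is a minimal Python sketch of the per-component math. The 99.9% uptime threshold and the flat 10% credit tier are assumptions inferred from the Pass/Violation and Service Credit columns; they are not stated elsewhere in this report.

```python
# Sketch of the quarterly uptime and service-credit arithmetic described above.
# Assumptions (not stated in the report): the SLA threshold is 99.9% uptime and
# every violating component earns a flat 10% service credit.

QUARTER_MINUTES = 132_480   # Jul 1 - Sep 30, 2025: 92 days x 1,440 minutes
SLA_TARGET_PERCENT = 99.9   # assumed threshold implied by the Pass/Violation column


def uptime_percent(downtime_minutes: float) -> float:
    """(Total minutes - Downtime) / Total minutes x 100."""
    return (QUARTER_MINUTES - downtime_minutes) / QUARTER_MINUTES * 100


def service_credit(downtime_minutes: float) -> str:
    """Return the credit tier for a component under the assumed flat 10% policy."""
    return "10%" if uptime_percent(downtime_minutes) < SLA_TARGET_PERCENT else "None"


# Example: Git Operations with 3h 10m (190 minutes) of downtime.
print(round(uptime_percent(190), 4))  # 99.8566 -- close to the table's 99.8564%,
                                      # which presumably uses sub-minute precision
print(service_credit(190))            # 10%
```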

Incidents in 2025-Q3

44 incidents occurred during this quarter

Created:
Resolved:
Duration: 33m
Weighted Downtime: 8.25m
4 updates
resolved

On September 29, 2025, between 17:53 and 18:42 UTC, the Copilot service experienced a degradation of the Gemini 2.5 model due to an issue with our upstream provider. Approximately 24% of requests failed, affecting 56% of users during this period. No other models were impacted.

GitHub notified the upstream provider of the problem as soon as it was detected. The issue was resolved after the upstream provider rolled back a recent change that caused the disruption. GitHub will continue to enhance our monitoring and alerting systems to reduce the time it takes to detect and mitigate similar issues in the future.

investigating

The upstream model provider has resolved the issue and we are seeing full availability for Gemini 2.5 Pro and Gemini 2.0 Flash.

investigating

We are experiencing degraded availability for the Gemini 2.5 Pro & Gemini 2.0 Flash models in Copilot Chat, VS Code and other Copilot products. This is due to an issue with an upstream model provider. We are working with them to resolve the issue.

Other models are available and working as expected.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 49m
Weighted Downtime: 36.75m
3 updates
resolved

On September 29, 2025 between 16:26 UTC and 17:33 UTC the Copilot API experienced a partial degradation causing intermittent erroneous 404 responses for an average of 0.2% of GitHub MCP server requests, peaking at times around 2% of requests. The issue stemmed from an upgrade of an internal dependency which exposed a misconfiguration in the service.

We resolved the incident by rolling back the upgrade to address the misconfiguration. We fixed the configuration issue and will improve our documentation and rollout process to prevent similar issues.

investigating

Customers are getting 404 responses when connecting to the GitHub MCP server. We have reverted a change we believe is contributing to the impact, and are seeing resolution in deployed environments.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 36m
Weighted Downtime: 9m
3 updates
resolved

On September 26, 2025 between 16:22 UTC and 18:32 UTC raw file access was degraded for a small set of four repositories. On average, raw file access error rate was 0.01% and peaked at 0.16% of requests. This was due to a caching bug exposed by excessive traffic to a handful of repositories.

We mitigated the incident by resetting the state of the cache for raw file access and are working to improve cache usage and testing to prevent issues like this in the future.

investigating

We are seeing issues related to our ability to serve raw file access across a small percentage of our requests.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 50m
Weighted Downtime: 37.5m
3 updates
resolved

On September 23, 2025, between 15:29 UTC and 17:38 UTC and also on September 24, 2025 between 15:02 UTC and 15:12 UTC, email deliveries were delayed up to 50 minutes which resulted in significant delays for most types of email notifications. This occurred due to an unusually high volume of traffic which caused resource contention on some of our outbound email servers.

We have updated the configuration we use to better allocate capacity when there is a high volume of traffic and are also updating our monitors so we can detect this type of issue before it becomes a customer impacting incident.

investigating

We are seeing delays in email delivery, which is impacting notifications and user signup email verification. We are investigating and working on mitigation.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 0m
Weighted Downtime: 0m
3 updates
resolved

On September 24th, 2025, between 08:02 UTC and 09:11 UTC the Copilot service was degraded for Claude Opus 4 and Claude Opus 4.1 requests. On average, 22% of requests failed for Claude Opus 4 and 80% of requests for Claude Opus 4.1. This was due to an upstream provider returning elevated errors on Claude Opus 4 and Opus 4.1. We mitigated the issue by directing users to select other models and by monitoring recovery. To resolve the issue, we are expanding failover capabilities by integrating with additional infrastructure providers.

investigating

Between around 8:16 UTC and 8:51 UTC we saw elevated errors on Claude Opus 4 and Opus 4.1, up to 49% of requests were failing. This has recovered to around 4% of requests failing, we are monitoring recovery.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 2h 4m
Weighted Downtime: 31m
Affected Components: Copilot
4 updates
resolved

Between 20:06 UTC September 23 and 04:58 UTC September 24, 2025, the Copilot service experienced degraded availability for Claude Sonnet 4 and 3.7 model requests.

During this period, 0.46% of Claude 4 requests and 7.83% of Claude 3.7 requests failed.

The reduced availability resulted from Copilot disabling routing to an upstream provider that was experiencing issues and reallocating capacity to other providers to manage requests for Claude Sonnet 3.7 and 4. We are continuing to investigate the source of the issues with this provider and will provide an update as more information becomes available.

investigating

The issues with our upstream model provider have been resolved, and Claude Sonnet 3.7 and Claude Sonnet 4 are once again available in Copilot Chat, VS Code and other Copilot products.

We will continue monitoring to ensure stability, but mitigation is complete.

investigating

We are experiencing degraded availability for the Claude Sonnet 3.7 and Claude Sonnet 4 models in Copilot Chat, VS Code and other Copilot products. This is due to an issue with an upstream model provider. We are working with them to resolve the issue.

Other models are available and working as expected.

investigating

We are investigating reports of degraded performance for Copilot

Created:
Resolved:
Duration: 14m
Weighted Downtime: 3.5m
Affected Components: Actions and Pages
3 updates
resolved

On September 23, between 17:11 and 17:40 UTC, customers experienced failures and delays when running workflows on GitHub Actions and building or deploying GitHub Pages. The issue was caused by a faulty configuration change that disrupted service to service communication in GitHub Actions. During this period, in-progress jobs were delayed and new jobs would not start due to a failure to acquire runners, and about 30% of all jobs failed. GitHub Pages users were unable to build or deploy their Pages during this period.

The offending change was rolled back within 15 minutes of its deployment, after which Actions workflows and Pages deployments began to succeed. Actions customers continued to experience delays for about 15 minutes after the rollback was completed while services worked through the backlog of queued jobs. We are planning to implement additional rollout checks to help detect and prevent similar issues in the future.

investigating

We are investigating delays in Actions Workflows.

investigating

We are investigating reports of degraded performance for Actions and Pages

Created:
Resolved:
Duration: 0m
Weighted Downtime: 0m
3 updates
resolved

On September 23, 2025, between 15:29 UTC and 17:38 UTC and also on September 24, 2025 between 14:02 UTC and 15:12 UTC, email deliveries were delayed up to 50 minutes which resulted in significant delays for most types of email notifications. This occurred due to an unusually high volume of traffic which caused resource contention on some of our outbound email servers.

We have updated the configuration we use to better allocate capacity when there is a high volume of traffic and are also updating our monitors so we can detect this type of issue before it becomes a customer impacting incident.

investigating

We're seeing delays related to outbound emails and are investigating.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 2h 51m
Weighted Downtime: 42.75m
Affected Components: Codespaces
6 updates
resolved

On September 17, 2025 between 13:23 and 16:51 UTC some users in West Europe experienced issues with Codespaces that had shut down due to network disconnections and subsequently failed to restart. Codespace creations and resumes were failed over to another region at 15:01 UTC. While many of the impacted instances self-recovered after mitigation efforts, approximately 2,000 codespaces remained stuck in a "shutting down" state while the team evaluated possible methods to recover unpushed data from the latest active session of affected codespaces. Unfortunately, recovery of that data was not possible. We unblocked shutdown of those codespaces, with all instances either shut down or available by 8:26 UTC on September 19.

The disconnects were triggered by an exhaustion of resources in the network relay infrastructure in that region, but the lack of self-recovery was caused by an unhandled error impacting the local agent, which led to an unclean shutdown.

We are improving the resilience of the local agent to disconnect events to ensure shutdown of codespaces is always clean without data loss. We have also addressed the exhausted resources in the network relay and will be investing in improved detection and resilience to reduce the impact of similar events in the future.

investigating

We have confirmed the original mitigation to failover has resolved the issue causing Codespaces to become unavailable. We are evaluating if there is a path to recover unpushed data from the approximately 2000 Codespaces that are currently in the shutting down state. We will be resolving this incident and will detail the next steps in our public summary.

investigating

For Codespaces that were stuck in the shutting down state and have been resumed, we've identified an issue that is causing the contents of the Codespace to be irrecoverably lost, which has impacted approximately 250 Codespaces. We are actively working on a mitigation to prevent any more Codespaces currently in this state from being forced to shut down to prevent the potential data loss.

investigating

We're continuing to see improvement with Codespaces that were stuck in the shutting down state and we anticipate the remaining should self-resolve in about an hour.

investigating

Some users with Codespaces in West Europe were unable to connect to Codespaces. We have failed over that region, and users should be able to create new Codespaces. If a user has a Codespace in a shutting down state, we are still investigating potential fixes and mitigations.

investigating

We are investigating reports of degraded performance for Codespaces

Created:
Resolved:
Duration: 35m
Weighted Downtime: 8.75m
Affected Components: Git Operations
5 updates
resolved

Between 16:26 UTC on September 15th and 18:30 UTC on September 16th, anonymous REST API calls to approximately 20 endpoints were incorrectly rejected because they were not authenticated. While this caused unauthenticated requests to be rejected by these endpoints, all authenticated requests were unaffected, and no protected endpoints were exposed.

This resulted in 100% of requests to these endpoints failing at peak, representing less than 0.1% of GitHub’s overall request volume. On average, the error rate for these endpoints was less than 50% and peaked at 100% for about 26 hours over September 16th. API requests to the impacted endpoints were rejected with a 401 error code. This was due to a mismatch in authentication policies, for specific endpoints, during a system migration.

The failure to detect the errors was the result of the issue occurring for a low percentage of traffic.

We mitigated the incident by reverting the policy in question, and correcting the logic associated with the degraded endpoints. We are working to improve our test suite to further validate mismatches, and refining our monitors for proactive detection.
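
One of the updates below noted that retrying the affected anonymous calls should eventually succeed. A minimal client-side sketch of that workaround follows, assuming Python with the requests library; the endpoint and retry policy are illustrative only and are not taken from the incident record.

```python
# Illustrative retry-with-backoff for an anonymous GitHub REST API call that is
# intermittently rejected with 401, per the retry guidance in the update below.
import time

import requests


def get_with_retry(url: str, attempts: int = 5, backoff_seconds: float = 2.0) -> requests.Response:
    """Retry an unauthenticated GET while it is incorrectly rejected with 401."""
    for attempt in range(attempts):
        resp = requests.get(url, headers={"Accept": "application/vnd.github+json"})
        if resp.status_code != 401:  # any other status is returned to the caller
            return resp
        time.sleep(backoff_seconds * (attempt + 1))  # simple linear backoff
    return resp


# Example: a public repository endpoint that required no authentication before the incident.
response = get_with_retry("https://api.github.com/repos/octocat/Hello-World")
print(response.status_code)
```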

investigating

We have mitigated the issue and are monitoring the results

investigating

Git Operations is experiencing degraded performance. We are continuing to investigate.

investigating

A recent change to our API routing inadvertently added an authentication requirement to the anonymous route for LFS requests. We're in the process of fixing the change, but in the interim retrying should eventually succeed.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 31m
Weighted Downtime: 7.75m
4 updates
resolved

Between 16:26 UTC on September 15th and 18:30 UTC on September 16th, anonymous REST API calls to approximately 20 endpoints were incorrectly rejected because they were not authenticated. While this caused unauthenticated requests to be rejected by these endpoints, all authenticated requests were unaffected, and no protected endpoints were exposed.

This resulted in 100% of requests to these endpoints failing at peak, representing less than 0.1% of GitHub’s overall request volume. On average, the error rate for these endpoints was less than 50% and peaked at 100% for about 26 hours over September 16th. API requests to the impacted endpoints were rejected with a 401 error code. This was due to a mismatch in authentication policies, for specific endpoints, during a system migration.

The failure to detect the errors was the result of the issue occurring for a low percentage of traffic.

We mitigated the incident by reverting the policy in question, and correcting the logic associated with the degraded endpoints. We are working to improve our test suite to further validate mismatches, and refining our monitors for proactive detection.

investigating

We have mitigated the issue and are monitoring the results

investigating

A recent change to our API routing inadvertently added an authentication requirement to the anonymous route for creating GitHub apps. We're in the process of fixing the change, but in the interim retrying should eventually succeed.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 7m
Weighted Downtime: 5.25m
2 updates
resolved

On September 15th between 17:55 and 18:20 UTC, Copilot experienced degraded availability for all features. This was due to a partial deployment of a feature flag to a global rate limiter. The flag triggered behavior that unintentionally rate limited all requests, resulting in 100% of them returning 403 errors. The issue was resolved by reverting the feature flag which resulted in immediate recovery.

The root cause of the incident was an undetected edge case in our rate limiting logic. The flag was meant to scale down rate limiting for a subset of users, but unintentionally put our rate limiting configuration into an invalid state.

To prevent this from happening again, we have addressed the bug with our rate limiting. We are also adding additional monitors to detect anomalies in our traffic patterns, which will allow us to identify similar issues during future deployments. Furthermore, we are exploring ways to test our rate limit scaling in our internal environment to enhance our pre-production validation process.
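
To make the failure mode concrete, here is a purely illustrative sketch of how a partially deployed scaling flag can collapse a rate limit to zero and reject every request with 403. All names, values, and logic are hypothetical and do not reflect GitHub's actual rate-limiting implementation.

```python
# Hypothetical illustration only: a feature flag that scales a global rate limit
# has no guard for an unset/zero scale factor, so the effective limit becomes 0
# and every request is rejected with 403 -- the symptom described in this incident.

BASE_LIMIT_PER_MINUTE = 600  # hypothetical baseline limit


def effective_limit(scale_factor: float) -> int:
    # Bug: no lower bound, so a zero or unset factor collapses the limit to 0.
    return int(BASE_LIMIT_PER_MINUTE * scale_factor)


def handle_request(requests_this_minute: int, scale_factor: float) -> int:
    """Return an HTTP status code: 200 if under the effective limit, 403 otherwise."""
    return 200 if requests_this_minute < effective_limit(scale_factor) else 403


print(handle_request(1, scale_factor=1.0))  # 200: fully configured flag
print(handle_request(1, scale_factor=0.0))  # 403: partial rollout left the factor unset
```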

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 2d 8h
Weighted Downtime: 14h 4.25m
4 updates
resolved

At around 18:45 UTC on Friday, September 12, 2025, a change was deployed that unintentionally affected search index management. As a result, approximately 25% of repositories were temporarily missing from search results.

By 12:45 UTC on Saturday, September 14, most missing repositories were restored from an earlier search index snapshot, and repositories updated between the snapshot and the restoration were reindexed. This backfill was completed at 21:25 UTC.

After these repairs, about 98.5% of repositories were once again searchable. We are performing a full reconciliation of the search index and customers can expect to see records being updated and content becoming searchable for all repos again between now and Sept 25.

NOTE: Users who notice missing or outdated repositories in search results can force reindexing by starring or un-starring the repository. Other repository actions such as adding topics, or updating the repository description, will also result in reindexing. In general, changes to searchable artifacts in GitHub will also update their respective search index in near-real time.

User impact has been mitigated with the exception of the 1.5% of repos that are missing from the search index. The change responsible for the search issue has been reverted, and full reconciliation of the search index is underway, expected to complete by September 23. We have added additional checks to our indexing model to ensure this failure does not happen again. We are also investigating faster repair alternatives.

To avoid resource contention and possible further issues we are currently not repairing repositories or organizations individually at this time. No repository data was lost, and other search types were not affected.
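
The NOTE above describes forcing a reindex by starring and un-starring a repository. A small sketch of doing that through the REST API is below, assuming a personal access token is available in the GITHUB_TOKEN environment variable; it uses the documented PUT and DELETE /user/starred/{owner}/{repo} endpoints. Note that a repository you had already starred will be left un-starred afterwards.

```python
# Sketch of the reindexing workaround from the NOTE above: star then un-star a
# repository via the REST API so its search index entry is refreshed.
# Assumes a token with permission to star repositories is set in GITHUB_TOKEN.
import os

import requests

HEADERS = {
    "Authorization": f"Bearer {os.environ['GITHUB_TOKEN']}",
    "Accept": "application/vnd.github+json",
}


def nudge_reindex(owner: str, repo: str) -> None:
    """Star and immediately un-star a repository (both calls return 204 on success)."""
    url = f"https://api.github.com/user/starred/{owner}/{repo}"
    requests.put(url, headers=HEADERS).raise_for_status()     # star
    requests.delete(url, headers=HEADERS).raise_for_status()  # un-star


nudge_reindex("octocat", "Hello-World")
```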

investigating

Most searchable repositories should again be visible in search results. Up to 1.5% of repositories may still be missing from search results.

Many different actions synchronize the repository state with the search index, so we expect natural recovery for repositories that see more frequent user and API-driven interactions.

A complete index reconciliation is underway to restore stagnant repositories that were deleted from the index. We will update again once we have a clear timeline of when we expect full recovery for those missing search results.

investigating

Customers are not seeing repositories they expect to see in search results. We have restored a snapshot of this search index from Fri 12 Sep at 21:00 UTC. Changes made since then will be unavailable while we work to backfill the rest of the search index. Any new changes will be available in near-real time as expected.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 39m
Weighted Downtime: 9.75m
Affected Components: Actions
3 updates
resolved

On September 10, 2025 between 13:00 and 14:15 UTC, Actions users experienced failed jobs and run start delays for Ubuntu 24 and Ubuntu 22 jobs on standard runners in private repositories. Additionally, larger runner customers experienced run start delays for runner groups with private networking configured in the eastus2 region. This was due to an outage in an underlying compute service provider in eastus2. 1.06% of Ubuntu 24 jobs and 0.16% of Ubuntu 22 jobs failed during this period. Jobs for larger runners using private networking in the eastus2 region were unable to start for the duration of the incident. We have identified and are working on improvements in our resilience to single partner region outages for standard runners so impact is reduced in similar scenarios in the future.

update

Actions hosted runners are taking longer to come online, leading to high wait times or job failures.

investigating

We are investigating reports of degraded performance for Actions

Created:
Resolved:
Duration: 2h 9m
Weighted Downtime: 32.25m
Affected Components: API Requests
7 updates
resolved

On September 4, 2025 between 15:30 UTC and 20:00 UTC the REST API endpoints git/refs, git/refs/*, and git/matching-refs/* were degraded and returned elevated errors for repositories with reference counts over 22k. On average, the request error rate to these specific endpoints was 0.5%. Overall REST API availability remained 99.9999%. This was due to the introduction of a code change that added latency to reference evaluations and overly affected repositories with many branches, tags, or other references. We mitigated the incident by reverting the new code. We are working to improve performance testing and to reduce our time to detection and mitigation of issues like this one in the future.

update

The deployment has completed and we expect customers who have been impacted to see recovery. We are continuing to monitor.

update

We are in the process of deploying the PR to revert the change that was causing timeouts to this endpoint. We will update again once that deployment is complete.

update

We have identified a deployed change that correlates with the increase in 5XX errors to the GitRefs REST API. This is particularly affecting requests for repos with very large numbers of commits. We are working on rolling back this change which we expect will resolve the issue.

update

API Requests is experiencing degraded performance. We are continuing to investigate.

update

Customers are experiencing 504 responses for some API requests regarding repo refs/tags. We are investigating.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 27m
Weighted Downtime: 6.75m
2 updates
resolved

Between August 21, 2025 at 15:00 UTC, and September 2, 2025 at 15:22 UTC, the avatars.githubusercontent.com image service was degraded and failed to display user avatars for users in the Middle East. During this time, avatar images appeared broken on github.com for affected users. On average, this impacted about 82% of users routed through one of our Middle East-based points-of-presence, which represents about 0.14% of global users. This was due to a configuration change within GitHub's edge infrastructure in the affected region, causing HTTP requests to fail. As a result, image requests returned HTTP 503 errors. The failure to detect the issues was the result of an alerting threshold set too low. We mitigated the incident by removing the affected site from service, which restored avatar serving for impacted users. To prevent this from recurring, we have tuned configuration defaults for graceful degradation. We also added new health checks to automatically shift traffic from impacted sites. We are updating our monitoring to prevent undetected errors like this in the future.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 46m
Weighted Downtime: 34.5m
Affected Components: API Requests and Issues
9 updates
update

API Requests and Issues are operating normally.

resolved

On August 27, 2025 between 20:35 and 21:17 UTC, Copilot, Web and REST API traffic experienced degraded performance. Copilot saw an average of 36% of requests fail with a peak failure rate of 77%. Approximately 2% of all non-Copilot Web and REST API traffic requests failed. This incident occurred after we initiated a production database migration to drop a column from a table backing Copilot functionality. While the column was no longer in direct use, our ORM continued to reference the dropped column. This led to a large number of 5xx responses and was similar to the incident on August 5th. At 21:15 UTC, we applied a fix to the production schema and by 21:17 UTC, all services had fully recovered. While repairs were in progress to avoid this situation, they were not completed quickly enough to prevent a second incident. We have now implemented a temporary block for all drop column operations as an immediate solution while we add more safeguards to prevent similar issues from occurring in the future. We are also implementing graceful degradation so that Copilot issues will not impact other features of our product.

update

We've discovered the cause of the service disruption and applied a mitigation.

update

We are continuing to investigate this issue.

update

API Requests is experiencing degraded performance. We are continuing to investigate.

update

The team is aware of the root cause of this issue and is working to mitigate the issue quickly.

update

Issues is experiencing degraded performance. We are continuing to investigate.

update

API Requests is experiencing degraded availability. We are continuing to investigate.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 2h 19m
Weighted Downtime: 34.75m
Affected Components: Actions
6 updates
resolved

On August 21, 2025, from approximately 15:37 UTC to 18:10 UTC, customers experienced increased delays and failures when starting jobs on GitHub Actions using standard hosted runners. This was caused by connectivity issues in our East US region, which prevented runners from retrieving jobs and sending progress updates. As a result, capacity was significantly reduced, especially for busier configurations, leading to queuing and service interruptions. Approximately 8.05% of jobs on public standard Ubuntu24 runners and 3.4% of jobs on private standard Ubuntu24 runners did not start as expected. By 18:10 UTC, we had mitigated the issue by provisioning additional resources in the affected region and burning down the backlog of queued runner assignments. By the end of that day, we deployed changes to improve runner connectivity resilience and graceful degradation in similar situations. We are also taking further steps to improve system resiliency by enhancing observability of network connection health with runners and improving load distribution and failover handling to help prevent similar issues in the future.

update

We've applied a mitigation to fix the issues with queuing and running Actions jobs. We are seeing improvements in telemetry and are monitoring for full recovery.

update

The team continues to investigate issues with some Actions jobs on Hosted Runners being queued for a long time and a percentage of jobs failing. We are increasing runner capacity and will continue providing updates on the progress towards mitigation.

update

The team continues to investigate issues with some Actions jobs on Hosted Runners being queued for a long time and a percentage of jobs failing. We will continue providing updates on the progress towards mitigation.

update

We are investigating reports of slow queue times for Hosted Runners, leading to high wait times.

investigating

We are investigating reports of degraded performance for Actions

Created:
Resolved:
Duration: 33m
Weighted Downtime: 8.25m
Affected Components: Git Operations and Issues
7 updates
update

The errors in our database infrastructure were related to some maintenance events that had more impact than expected. We will provide more details and follow ups when we post a public summary for this incident in the coming days. All impact to customers is resolved.

update

Issues is operating normally.

update

Git Operations is operating normally.

resolved

On August 21st, 2025, between 6:15am UTC and 6:25am UTC Git and Web operations were degraded and saw intermittent errors. On average, the error rate was 1% for API and Web requests. This was due to database infrastructure automated maintenance reducing capacity below our tolerated threshold. The incident was resolved when the impacted infrastructure self-healed and returned to normal operating capacity. We are adding guardrails to reduce the impact of this type of maintenance in the future.

update

We saw a brief spike in failures related to some of our database infrastructure. Everything has recovered but we are continuing to investigate to ensure we don't see any recurrence.

update

Approximately 1% of API and web requests are seeing intermittent errors. Some customers may see some push errors. We are currently investigating.

investigating

We are investigating reports of degraded performance for Git Operations and Issues

Created:
Resolved:
Duration: 23m
Weighted Downtime: 5.75m
4 updates
update

We have verified that we fixed the sign up flow and are working to ensure we don't introduce an issue like this in the future.

resolved

Between 15:49 and 16:37 UTC on 20 Aug 2025, creating a new GitHub account via the web signup page consistently returned server errors, and users were unable to complete signup during this 48-minute window. We detected the issue at 16:04 UTC and restored normal signup functionality by 16:37 UTC. A recent change to signup flow logic caused all attempts to error. The change was rolled back to restore service. This exposed a gap in our test coverage that we are fixing.

update

Customers may experience issues when signing up for new GitHub accounts. We are actively working on a mitigation and will post an update within 30 minutes.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 1h 7m
Weighted Downtime: 16.75m
Affected Components: Issues and Actions
9 updates
update

Issues is operating normally.

update

Actions is operating normally.

resolved

On August 19, 2025, between 13:35 UTC and 14:33 UTC, GitHub search was in a degraded state. When searching for pull requests, issues, and workflow runs, users would have seen some slow, empty or incomplete results. In some cases, pull requests failed to load. The incident was triggered by intermittent connectivity issues between our load balancers and search hosts. While retry logic initially masked these problems, retry queues eventually overwhelmed the load balancers, causing failure. The incident was mitigated at 14:33 UTC by throttling our search index pipeline. Our automated alerting and internal retries reduced the impact of this event significantly. As a result of this incident we believe we have identified a faster way to mitigate it in the future. We are also working on multiple solutions to resolve the underlying connectivity issues.

update

We were able to mitigate the slowness by throttling some search indexing and will work on the issues created by the increased search indexing so they do not have latency impact.

update

We are seeing slightly elevated latency on some Issues endpoints and searches for workflow runs in Actions may not return quickly.

update

Actions is experiencing degraded performance. We are continuing to investigate.

update

Issues is experiencing degraded performance. We are continuing to investigate.

investigating

We are currently investigating this issue.

update

Issues with timeouts when searching

Created:
Resolved:
Duration: 31m
Weighted Downtime: 7.75m
Affected Components: Packages
4 updates
update

The NPM registry has now returned to normal functioning.

resolved

On August 14, 2025, between 17:50 UTC and 18:08 UTC, the Packages NPM Registry service was degraded. During this period, NPM package uploads were unavailable and approximately 50% of download requests failed. We identified the root cause as a sudden spike in Packages publishing activity that exceeded our service capacity limits. We are implementing better guardrails to protect the service against unexpected traffic surges and improving our incident response runbooks to ensure faster mitigation of similar issues.

update

The NPM registry service is currently experiencing intermittent availability issues. Other package registries should be unaffected. Investigations are ongoing.

investigating

We are investigating reports of degraded performance for Packages

Created:
Resolved:
Duration: 1h 20m
Weighted Downtime: 20m
Affected Components: Actions
3 updates
resolved

On August 14, 2025, between 02:30 UTC and 06:14 UTC, GitHub Actions was degraded. On average, 3% of workflow runs were delayed by at least 5 minutes. The incident was caused by an outage in a downstream dependency that led to failures in backend service connectivity in one region. At 03:59 UTC, we evacuated a majority of services in the impacted region, but some users may have seen ongoing impact until all services were fully evacuated at 06:14 UTC. We are working to improve monitoring and processes of failover to reduce our time to detection and mitigation of issues like this one in the future.

update

We are investigating reports of issues with service(s): Actions. We will continue to keep users updated on progress towards mitigation.

investigating

We are investigating reports of degraded performance for Actions

Created:
Resolved:
Duration: 3h 44m
Weighted Downtime: 2h 48m
Affected Components: API Requests, Issues, Pull Requests, Actions, and Packages
8 updates
resolved

On August 12, 2025, between 13:30 UTC and 17:14 UTC, GitHub search was in a degraded state. Users experienced inaccurate or incomplete results, failures to load certain pages (like Issues, Pull Requests, Projects, and Deployments), and broken components like Actions workflow and label filters. Most user impact occurred between 14:00 UTC and 15:30 UTC, when up to 75% of search queries failed, and updates to search results were delayed by up to 100 minutes. The incident was triggered by intermittent connectivity issues between our load balancers and search hosts. While retry logic initially masked these problems, retry queues eventually overwhelmed the load balancers, causing failure. The query failures were mitigated at 15:30 UTC after throttling our search indexing pipeline to reduce load and stabilize retries. The connectivity failures were resolved at 17:14 UTC after the automated reboot of a search host, causing the rest of the system to recover. We have improved internal monitors and playbooks, and tuned our search cluster load balancer to further mitigate the recurrence of this failure mode. We continue to invest in resolving the underlying connectivity issues.

update

Service availability has been mostly restored, but some users will continue to see increased request latency and stale search results. We are still working towards full recovery.

update

Service availability has been mostly restored, but increased load/query latency and stale search results persist. We continue to work towards full mitigation.

update

We are seeing partial recovery in service availability, but still see inconsistent experiences and stale search data across services. Investigation and mitigations are underway.

update

We are experiencing increased latency in our API layers and inconsistently degraded experiences when loading or querying issues, pull requests, labels, packages, releases, workflow runs, projects, and repositories, among others. Investigation is underway.

update

We are investigating reports of degraded performance in services backed by search. The team continues to investigate why requests are failing to reach our search clusters.

update

Packages is experiencing degraded performance. We are continuing to investigate.

investigating

We are investigating reports of degraded performance for API Requests, Actions, Issues and Pull Requests

Created:
Resolved:
Duration: 6m
Weighted Downtime: 1.5m
3 updates
resolved

On August 11, 2025, from 18:41 to 18:57 UTC, GitHub customers experienced errors and increased latency when loading GitHub’s web interface. During this time, a configuration change to improve our UI deployment system caused a surge in requests to a backend datastore. This unexpected spike in connection attempts saturated the datastore's connection backlog and caused intermittent failures to serve required UI content, resulting in elevated error rates for frontend requests. The incident was mitigated by reverting the configuration, which restored normal service. Following mitigation, we are evaluating improvements to our alerting thresholds and exploring architectural changes to reduce load to this datastore and improve the resilience of our UI delivery pipeline.

update

Logged out users may see intermittent errors when loading github.com webpages. Investigation is ongoing.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 1h 53m
Weighted Downtime: 28.25m
Affected Components: Pull Requests
7 updates
update

Pull Requests is operating normally.

resolved

At 15:33 UTC on August 5, 2025, we initiated a production database migration to drop a column from a table backing pull request functionality. While the column was no longer in direct use, our ORM continued to reference the dropped column in a subset of pull request queries. As a result, there were elevated error rates across pushes, webhooks, notifications, and pull requests with impact peaking at approximately 4% of all web and REST API traffic. We mitigated the issue by deploying a change that instructed the ORM to ignore the removed column. Most affected services recovered by 16:13 UTC. However, that fix was applied only to our largest production environment. An update to some of our custom and canary environments did not pick up the fix and this triggered a secondary incident affecting ~0.1% of pull request traffic, which was fully resolved by 19:45 UTC. While migrations have protections such as progressive roll-out first targeting validation environments and acknowledge gates, this incident identified a gap in application monitoring that, had it been covered, would have prevented continued rollout when impact was observed. We will add additional automation and safeguards to prevent future incidents without requiring human intervention. We are also already working on a way to streamline some types of changes across environments, which would have prevented the second incident from occurring.

update

We continue to investigate issues with PRs. Impact remains limited to less than 2% of users.

update

We continue to investigate issues with PRs impacting less than 2% of customers.

update

We continue to investigate issues with PRs impacting less than 2% of customers.

update

We're seeing issues related to PRs and are investigating. Less than 2% of users are impacted.

investigating

We are investigating reports of degraded performance for Pull Requests

Created:
Resolved:
Duration: 32m
Weighted Downtime: 24m
Affected Components: Git Operations, Webhooks, Issues, Pull Requests, and Actions
16 updates
update

Webhooks is operating normally.

update

Issues is operating normally.

update

Pull Requests is operating normally.

update

Actions is operating normally.

resolved

At 15:33 UTC on August 5, 2025, we initiated a production database migration to drop a column from a table backing pull request functionality. While the column was no longer in direct use, our ORM continued to reference the dropped column in a subset of pull request queries. As a result, there were elevated error rates across pushes, webhooks, notifications, and pull requests with impact peaking at approximately 4% of all web and REST API traffic. We mitigated the issue by deploying a change that instructed the ORM to ignore the removed column. Most affected services recovered by 16:13 UTC. However, that fix was applied only to our largest production environment. An update to some of our custom and canary environments did not pick up the fix and this triggered a secondary incident affecting ~0.1% of pull request traffic, which was fully resolved by 19:45 UTC. While migrations have protections such as progressive roll-out first targeting validation environments and acknowledge gates, this incident identified a gap in application monitoring that, had it been covered, would have prevented continued rollout when impact was observed. We will add additional automation and safeguards to prevent future incidents without requiring human intervention. We are also already working on a way to streamline some types of changes across environments, which would have prevented the second incident from occurring.

update

We have fully mitigated this issue and all services are operating normally.

update

Git Operations is operating normally.

update

Webhooks is experiencing degraded performance. We are continuing to investigate.

update

Pull Requests is experiencing degraded performance. We are continuing to investigate.

update

We have identified a change that was made in the Pull Request area of GitHub. Users may be unable to use certain pull request and issues features and may see some webhooks impacted. We have identified the issue and applied a mitigation, and are starting to see recovery, but will continue to monitor and post updates as we have them.

update

Webhooks is experiencing degraded availability. We are continuing to investigate.

update

Git Operations is experiencing degraded performance. We are continuing to investigate.

update

Pull Requests is experiencing degraded availability. We are continuing to investigate.

update

Pull Requests is experiencing degraded performance. We are continuing to investigate.

update

Actions is experiencing degraded performance. We are continuing to investigate.

investigating

We are investigating reports of degraded performance for Issues and Webhooks

Created:
Resolved:
Duration: 1h 35m
Weighted Downtime: 23.75m
4 updates
resolved

Between 06:04 UTC and 10:55 UTC on August 1, 2025, 100% of users attempting to sign up with an email and password experienced errors. Social signup was not affected. Once the problem became clear, the offending code was identified and a change was deployed to resolve the issue. We are adding additional monitoring to our sign-up process to improve our time to detection.

update

We are working on a mitigation to an issue preventing some users from signing up with email and password. Social signup methods remain available.

update

We have identified an issue preventing some new users from signing up. We are working to mitigate.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 1h 24m
Weighted Downtime: 21m
Affected Components: Git Operations and Actions
5 updates
update

Continued monitoring is showing that impact has been mitigated and normal service operation has been restored. We are going to resolve the incident at this time. Thank you for your patience as we investigated this problem.

resolved

Between July 28, 2025 16:31 UTC and July 29, 2025 12:05 UTC, users saw degraded Git Operations for raw file downloads. On average, the error rate was 0.005%, with a peak error rate of 3.89%. This was due to a sustained increase in unauthenticated repository traffic. We mitigated the incident by applying regional rate limiting, rolling back a service that was unable to scale with the additional traffic, and addressing a bug that impacted the caching of raw requests. Additionally, we horizontally scaled several dependencies of the service to appropriately handle the increase in traffic. We are working on improving our time to detection and have implemented controls to prevent similar incidents in the future.

update

We identified and removed unhealthy hosts from our system. This has led to a reduction of 429s and a return to normal operating conditions. We are continuing to monitor recovery and will resolve the incident once we are confident the impact has been mitigated.

update

We are seeing an increase in 429s when retrieving git artifacts from GitHub.com. This is manifesting in many ways, for example, in failed GitHub Actions workflow runs. We have our engineers working on mitigation and we will provide more information as we have it. Thank you for your patience.

investigating

We are investigating reports of degraded performance for Actions and Git Operations

Created:
Resolved:
Duration: 3h 26m
Weighted Downtime: 51.5m
Affected Components: API Requests, Issues, and Pull Requests
10 updates
resolved

Between July 28, 2025, 22:23:00 UTC and July 29, 2025 02:06:00 UTC, GitHub experienced degraded performance across multiple services including API, Issues, GraphQL and Pull Requests. During this time, approximately 4% of Web and API requests resulted in 500 errors. This incident was caused by DNS resolution failure while decommissioning infrastructure hosts. We resolved the incident by removing references to the stale hosts. We are working to improve our host replacement process by correcting our automatic host ejection behavior and by ensuring configuration is updated before hosts are decommissioned. This will prevent similar issues in the future.

update

Pull Requests is operating normally.

update

Mitigation has deployed. We are seeing recovery across all impacted services.

update

Issues is operating normally.

update

Team is deploying a mitigation for this incident. We will update again once we have verified the fix.

update

Approximately 4% of requests to impacted services continue to error. The team is continuing its work to mitigate this incident.

update

Team is continuing to look into networking issues. We will keep users updated on progress towards mitigation.

update

Some GitHub services continue to experience degraded performance. Team is looking into networking issues. We will continue to keep users updated on progress towards mitigation.

update

Some GitHub services are experiencing degraded performance. Team is currently investigating to determine a cause and mitigation.

investigating

We are investigating reports of degraded performance for API Requests, Issues and Pull Requests

Created:
Resolved:
Duration: 5h 34m
Weighted Downtime: 4h 10.5m
9 updates
update

This incident is resolved; we will follow up with a detailed root cause analysis as soon as possible.

As part of mitigation, some existing IP ranges were replaced. Migrations with customer-owned storage that have IP allow lists enabled will require adding new IP ranges to your IP allow lists to prevent migrations from failing.

- 20.99.172.64/28
- 135.234.59.224/28

resolved

Between approximately 21:41 UTC July 28th and 03:15 UTC July 29th, GitHub Enterprise Importer (GEI) operated in a degraded state where migrations could not be processed. Our investigation found that a component of the GEI infrastructure had been improperly taken out of service and could not be restored to its previous configuration. This necessitated the provisioning of new resources to resolve the incident.

As a result, customers will need to add our new IP ranges to the following IP allow lists, if enabled:

- The IP allow list on your destination GitHub.com organization or enterprise
- If you're running migrations from GitHub.com, the IP allow list on your source GitHub.com organization or enterprise
- If you're running migrations from a GitHub Enterprise Server, Bitbucket Server or Bitbucket Data Center instance, the allow list on your configured Azure Blob Storage or Amazon S3 storage account
- If you're running migrations from Azure DevOps, the allow list on your Azure DevOps organization

The new GEI IP ranges for inclusion in applicable IP allow lists are:

- 20.99.172.64/28
- 135.234.59.224/28

The following IP ranges are no longer used by GEI and can be removed from all applicable IP allow lists:

- 40.71.233.224/28
- 20.125.12.8/29

Users who have run migrations using GitHub Enterprise Importer in the past 90 days will receive email alerts about this change.
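
As a convenience for auditing the change above, the following sketch checks a local copy of an allow list against the new and retired GEI ranges. It runs entirely locally with Python's standard ipaddress module; the current_allow_list contents are a made-up example, and no GitHub API is called.

```python
# Local audit of an IP allow list against the GEI address changes listed above.
# No GitHub API calls are made; `current_allow_list` below is a hypothetical example.
from ipaddress import ip_network

NEW_GEI_RANGES = [ip_network("20.99.172.64/28"), ip_network("135.234.59.224/28")]
RETIRED_GEI_RANGES = [ip_network("40.71.233.224/28"), ip_network("20.125.12.8/29")]

current_allow_list = [ip_network("40.71.233.224/28"), ip_network("203.0.113.0/24")]

# New ranges not yet covered by any existing entry must be added.
missing = [str(n) for n in NEW_GEI_RANGES
           if not any(n.subnet_of(entry) for entry in current_allow_list)]

# Retired ranges that still appear within an existing entry can be removed.
stale = [str(n) for n in RETIRED_GEI_RANGES
         if any(n.subnet_of(entry) for entry in current_allow_list)]

print("Add to allow lists:", missing)
print("No longer needed:", stale)
```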

update

We have deployed mitigations and are working to verify.

update

The team is continuing its work to mitigate this incident.

update

We're continuing to work to mitigate this issue; customers will continue to see stalled migrations in the meantime.

update

We continue to work to mitigate this issue.

update

We are still working to mitigate the issue.

update

We have identified the issue and we're working to mitigate.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 8h 33m
Weighted Downtime: 2h 8.25m
Affected Components: Git Operations
13 updates
resolved

Between July 28, 2025 16:31 UTC and July 29, 2025 12:05 UTC, users saw degraded Git Operations for raw file downloads. On average, the error rate was 0.005%, with a peak error rate of 3.89%. This was due to a sustained increase in unauthenticated repository traffic. We mitigated the incident by applying regional rate limiting, rolling back a service that was unable to scale with the additional traffic, and addressing a bug that impacted the caching of raw requests. Additionally, we horizontally scaled several dependencies of the service to appropriately handle the increase in traffic. We are working on improving our time to detection and have implemented controls to prevent similar incidents in the future.

update

We continue to work to mitigate this issue.

update

We are investigating additional ways to mitigate this issue.

update

We continue to work to mitigate this issue.

update

Some customers continue to experience errors when accessing raw files and archives. We are working on a mitigation to address the issue.

update

We are actively setting up additional rate limiting to address increased requests from scraping and investigating the need to add additional hosts.

update

We are seeing more of https://github.blog/changelog/2025-05-08-updated-rate-limits-for-unauthenticated-requests/ and working to mitigate it.

update

Provisioning of new hosts is underway. We are still investigating other fixes.

update

We are adding additional capacity to our infrastructure to mitigate this issue while still investigating.

update

We are still actively investigating this issue.

update

Git Operations is experiencing degraded performance. We are continuing to investigate.

update

We are investigating errors affecting some archive and raw file downloads. Users may experience rate limit warnings or server errors until this is resolved.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 59m
Weighted Downtime: 14.75m
Affected Components: Actions
4 updates
resolved

On July 23rd, 2025, from approximately 14:30 to 16:30 UTC, GitHub Actions experienced delayed job starts for workflows in private repos using Ubuntu-24 standard hosted runners. This was due to resource provisioning failures in one of our datacenter regions. During this period, approximately 2% of Ubuntu-24 hosted runner jobs on private repos were delayed. Other hosted runners, self-hosted runners, and public repo workflows were unaffected. To mitigate the issue, additional worker capacity was added from a different datacenter region at 15:35 UTC and further increased at 16:00 UTC. By 16:30 UTC, job queues were healthy and service was operating normally. Since the incident, we have deployed changes to improve how regional health is accounted for when allocating new runners, and we are investigating further improvements to our automated capacity scaling logic and manual overrides to prevent a recurrence.

update

We are applying mitigations to increase Actions Hosted Runners capacity, and are starting to see recovery. We’re monitoring to ensure continued stability.

update

We're investigating delays provisioning Actions Hosted Runners. Customers may see delays over 5 minutes for jobs starting.

investigating

We are investigating reports of degraded performance for Actions

Created:
Resolved:
Duration: 14m
Weighted Downtime: 3.5m
Affected Components: Copilot
3 updates
resolved

On July 22nd, 2025, between 17:58 and 18:35 UTC, the Copilot service experienced degraded availability for Claude Sonnet 4 requests. 4.7% of Claude 4 requests failed during this time. No other models were impacted. The issue was caused by an upstream problem affecting our ability to serve requests. We mitigated by rerouting capacity and monitoring recovery. We are improving detection and mitigation to reduce future impact.

investigating

We are investigating reports of degraded performance for Copilot

update

We are experiencing degraded availability for the Claude Sonnet 4 model in Copilot Chat, VS Code and other Copilot products. This is due to an issue with an upstream model provider. We are working with them to resolve the issue. Other models are available and working as expected.

Created:
Resolved:
Duration: 14m
Weighted Downtime: 3.5m
2 updates
resolved

On July 21, 2025, between 07:20 UTC and 08:00 UTC, the Copilot service experienced degraded availability for Claude 4 requests. 2% of Claude 4 requests failed during this time. The issue was caused by an upstream problem affecting our ability to serve requests. We mitigated by rerouting capacity and monitoring recovery. We are improving detection and mitigation to reduce future impact.

investigating

We are currently investigating this issue.

Incident with Issues

minor resolved
Created:
Resolved:
Duration: 2h 33m
Weighted Downtime: 38.25m
Affected Components: Webhooks, API Requests, Issues, Pull Requests, Packages, Codespaces, and Copilot
17 updates
resolved

On July 21st, 2025, between 07:00 UTC and 09:45 UTC the API, Codespaces, Copilot, Issues, Package Registry, Pull Requests and Webhook services were degraded and experienced dropped requests and increased latency. At the peak of this incident (a 2-minute period around 07:00 UTC) error rates reached 11% and dropped shortly after. Average web request load times rose to 1 second during this same time frame. After this period, traffic gradually recovered but error rate and latency remained slightly elevated until the end of the incident. This incident was triggered by a kernel bug that caused a crash of some of our load balancers during a scheduled process after a kernel upgrade. In order to mitigate the incident, we halted the roll out of our upgrades, and rolled back the impacted instances. We are working to make sure the kernel version is fully removed from our fleet. As a precaution, we temporarily paused the scheduled process to prevent any unintended use in the affected kernel. We also tuned our alerting so we can more quickly detect and mitigate future instances where we experience a sudden drop in load-balancing capacity.

update

API Requests and Codespaces are operating normally.

update

Copilot is operating normally.

update

Webhooks is operating normally.

update

Mitigations have been applied and we are seeing recovery. We are continuing to closely monitor the situation to ensure complete recovery has been achieved.

update

Issues is operating normally.

update

Packages is operating normally.

update

We are currently implementing mitigations for this issue.

update

Copilot is experiencing degraded performance. We are continuing to investigate.

update

We continue to investigate reports of degraded performance and intermittent timeouts across GitHub.com.

update

Pull Requests is operating normally.

update

We're continuing to investigate reports of degraded performance and intermittent timeouts across GitHub.com.

update

Pull Requests is experiencing degraded performance. We are continuing to investigate.

update

Packages is experiencing degraded performance. We are continuing to investigate.

update

API Requests is experiencing degraded performance. We are continuing to investigate.

update

Codespaces is experiencing degraded performance. We are continuing to investigate.

investigating

We are investigating reports of degraded performance for Issues and Webhooks

Created:
Resolved:
Duration: 42m
Weighted Downtime: 10.5m
4 updates
resolved

On July 16, 2025, between 05:20 UTC and 08:30 UTC, the Copilot service experienced degraded availability for Claude 3.7 requests. Around 10% of Claude 3.7 requests failed during this time. The issue was caused by an upstream problem affecting our ability to serve requests. We mitigated by rerouting capacity and monitoring recovery. We are improving detection and mitigation to reduce future impact.

update

We have seen recovery on our provider's side but have not yet confirmed if the issue is fully resolved. We will update our status in the next 20 minutes as we know more.

update

We are experiencing degraded availability for the Claude 3.7 Sonnet model in Copilot Chat, VS Code and other Copilot products. This is due to an issue with an upstream model provider. We are working with them to resolve the issue.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 0m
Weighted Downtime: 0m
1 update
resolved

On 15 July, between 19:55 and 19:58 UTC, requests to GitHub had a high failure rate while successful requests suffered up to 10x expected latency. Browser-based requests saw a failure rate of up to 20%, GraphQL had up to a 9% failure rate and 2% of REST API requests failed. Any downstream service dependent on GitHub APIs was also affected during this short window. The failure stemmed from a database query change, and was rolled back by our deployment tooling, which automatically detected the issue. We will continue to invest in automated detection and rollback with a goal of minimizing time to recovery.

Created:
Resolved:
Duration: 39m
Weighted Downtime: 9.75m
Affected Components: Actions
5 updates
resolved

On July 8, 2025, between 14:20 UTC and 16:30 UTC, the GitHub Actions service experienced degraded performance, leading to delays in updates to Actions workflow runs, including missing logs and delayed run statuses. During this period, 1.07% of workflow runs experienced delayed updates, and 0.34% of runs completed but showed as canceled. The incident was caused by imbalanced load in our underlying service infrastructure and was mitigated by scaling up our services and tuning resource thresholds. We are improving our resilience to load spikes and our capacity planning to prevent similar issues, and we are implementing more robust monitoring to reduce detection and mitigation time for similar incidents in the future.
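As a loose illustration of threshold-driven scale-out (the metric, target, and limits below are assumptions, not Actions internals), a service might add capacity when run-update delays exceed a target:

```python
# Hypothetical sketch: grow replica count when workflow-run updates lag.
def desired_replicas(current: int, p95_update_delay_s: float,
                     target_delay_s: float = 60.0, max_replicas: int = 64) -> int:
    """Scale out roughly in proportion to how far delays exceed the target."""
    if p95_update_delay_s <= target_delay_s:
        return current
    factor = p95_update_delay_s / target_delay_s
    return min(max_replicas, max(current + 1, round(current * factor)))

# Example: updates lagging at 3x the target roughly triples capacity.
assert desired_replicas(current=8, p95_update_delay_s=180.0) == 24
```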

update

We are seeing complete recovery for Actions. New jobs will run as normal. Some runs initiated during the incident will be left in a stuck state and will not complete.

update

We have scaled out our capacity and customers will begin to see timely updates.

update

Some customers are seeing delays in updates to their runs resulting in missing logs and delayed run status updates. We are investigating the cause of the issue.

investigating

We are investigating reports of degraded performance for Actions.

Created:
Resolved:
Duration: 5m
Weighted Downtime: 1.25m
3 updates
resolved

On July 7, 2025, between 21:14 UTC and 22:34 UTC, Copilot Coding Agent was degraded and unresponsive to issue assignment. Impact was limited to internal GitHub staff because the feature flag gating a newly released feature was enabled only on internal development setups and not in GitHub's global production environments. The incident was mitigated by disabling the feature flag for all users. While our existing safeguards worked as intended (the feature flag allowed immediate mitigation, and the limited scope prevented broader impact), we are enhancing our monitoring to better detect issues that affect smaller user segments and reviewing our internal testing processes to identify similar edge cases before they reach production.
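The staff-only gating described above follows a standard feature-flag ring pattern. This is a minimal sketch with invented names, not the actual flag service or flag:

```python
# Hypothetical sketch of staff-scoped feature flagging; names are invented.
class FeatureFlags:
    def __init__(self, staff_only_flags: set):
        self._staff_only = staff_only_flags

    def enabled(self, flag: str, is_staff: bool) -> bool:
        # Flags still in the staff-only ring are invisible to production users,
        # and removing a flag from the set disables it for everyone at once.
        return is_staff and flag in self._staff_only

flags = FeatureFlags({"coding_agent_issue_assignment"})
assert flags.enabled("coding_agent_issue_assignment", is_staff=True)
assert not flags.enabled("coding_agent_issue_assignment", is_staff=False)
```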

update

We are investigating reports of degraded performance for the Copilot Coding Agent service.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 19m
Weighted Downtime: 4.75m
3 updates
resolved

On July 7, 2025, between 18:20 UTC and 22:10 UTC, the Actions service was degraded for GitHub Larger Hosted and scale set runners. During this time window, 9% of GitHub Larger Hosted Runner and scale set jobs saw a delay of at least 5 minutes before being assigned to a runner. Impact was more apparent to customers who didn't have pre-scaled runner pools or who infrequently queued jobs during the incident window. This was due to a change that unintentionally decreased the rate at which we notified our backend that new scale set runners were coming online, and it was mitigated by reverting that change. To reduce the likelihood and impact time of a similar issue in the future, we are improving our detection of this failure mode so we catch it in earlier stages of development and rollout.

update

We are investigating reports of degraded performance for larger runners.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 1h 33m
Weighted Downtime: 23.25m
6 updates
resolved

On July 3, 2025, between 03:22 and 07:12 UTC, customers were prevented from SSO authorizing Personal Access Tokens and SSH keys via the GitHub UI. Approximately 1,300 users were impacted. A code change modified the content type of the response returned by the server, causing a lazily loaded dropdown to fail to render and preventing the user from proceeding with authorization. No authorization systems were impacted during the incident, only the UI component. We mitigated the incident by reverting the code change that introduced the problem. We are making improvements to our release process and test coverage to catch this class of error earlier in our deployment pipeline, and we are improving monitoring to reduce our time to detect and mitigate issues like this in the future.
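One way the test-coverage improvement could look, sketched with a hypothetical endpoint path and a pytest-style client fixture (not GitHub's actual routes or test suite), is a regression test that pins the fragment's content type:

```python
# Hypothetical regression test: the lazily loaded dropdown endpoint must
# keep returning an HTML fragment; a content-type change would break rendering.
def test_sso_authorization_dropdown_is_html(client):
    response = client.get("/settings/tokens/sso_authorization_dropdown")
    assert response.status_code == 200
    assert response.headers["Content-Type"].startswith("text/html")
```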

update

The rollback has been deployed successfully on all environments. Customers should now be able to SSO authorize their Classic Personal Access Tokens and SSH keys on their GitHub organizations.

update

The rollback of the change that caused the rendering bug preventing customers from SSO authorizing Personal Access Tokens and SSH keys has started rolling out. We are continuing to monitor this rollback.

update

We have identified the root cause of the rendering bug that prevented customers from SSO authorizing Personal Access Tokens and SSH keys. The changes that caused the issue are being rolled back.

update

We are investigating an issue with SSO authorizing Classic Personal Access Tokens and SSH keys.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 14m
Weighted Downtime: 3.5m
4 updates
update

We're down to a healthy level of queued migrations, and the system is processing migrations at normal concurrency levels.

resolved

On July 2, 2025, between 01:35 UTC and 16:23 UTC, the GitHub Enterprise Importer (GEI) migration service experienced degraded performance and slower-than-normal migration queue processing times. The incident was triggered by a migration that included an abnormally large number of repositories, which overwhelmed the queue and slowed processing for all migrations. We mitigated the incident by removing the problematic migrations from the queue, and service returned to normal operation as queue volume decreased. To ensure system stability, we have introduced additional concurrency controls that limit the number of queued repositories per organization migration, helping to prevent similar incidents in the future.
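A minimal sketch of the kind of per-organization concurrency control described above, using invented names rather than the actual GEI implementation:

```python
# Hypothetical per-organization cap on queued repository migrations.
from collections import defaultdict

class MigrationQueue:
    def __init__(self, max_queued_per_org: int = 100):
        self.max_queued_per_org = max_queued_per_org
        self._queued = defaultdict(list)

    def enqueue(self, org: str, repo: str) -> bool:
        """Refuse new work once an organization already has too many
        repositories waiting, so one huge migration cannot starve the queue."""
        if len(self._queued[org]) >= self.max_queued_per_org:
            return False
        self._queued[org].append(repo)
        return True

queue = MigrationQueue(max_queued_per_org=2)
assert queue.enqueue("acme", "repo-1")
assert queue.enqueue("acme", "repo-2")
assert not queue.enqueue("acme", "repo-3")  # over the per-org limit
```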

update

Repository migrations are experiencing delayed processing times. Mitigation has been implemented and migration times are recovering.

investigating

We are currently investigating this issue.

Created:
Resolved:
Duration: 22m
Weighted Downtime: 5.5m
4 updates
update

We are no longer experiencing degradation; Claude Sonnet 4 is once again available in Copilot Chat and across IDE integrations. We will continue monitoring to ensure stability, but mitigation is complete.

resolved

On July 2, 2025, between approximately 08:40 and 10:16 UTC, the Copilot service experienced degradation due to an infrastructure issue that impacted the Claude Sonnet 4 model, leading to a spike in errors. No other models were impacted. The issue was mitigated by rebalancing load within our infrastructure. GitHub is working to further improve the resiliency of the service to prevent similar incidents in the future.

update

We are experiencing degraded availability for the Claude Sonnet 4 model in Copilot Chat, VS Code and other Copilot products. This is due to an issue with an upstream model provider. We are working with them to resolve the issue. Other models are available and working as expected.

investigating

We are currently investigating this issue.