惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

A
Arctic Wolf
T
The Blog of Author Tim Ferriss
月光博客
月光博客
Recent Announcements
Recent Announcements
V
V2EX
Microsoft Azure Blog
Microsoft Azure Blog
博客园 - 三生石上(FineUI控件)
P
Proofpoint News Feed
The Register - Security
The Register - Security
博客园 - 叶小钗
博客园 - Franky
The Cloudflare Blog
雷峰网
雷峰网
罗磊的独立博客
M
MIT News - Artificial intelligence
I
InfoQ
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
博客园 - 【当耐特】
Engineering at Meta
Engineering at Meta
N
Netflix TechBlog - Medium
爱范儿
爱范儿
博客园 - 司徒正美
Recorded Future
Recorded Future
酷 壳 – CoolShell
酷 壳 – CoolShell
Google DeepMind News
Google DeepMind News
Martin Fowler
Martin Fowler
Microsoft Security Blog
Microsoft Security Blog
F
Full Disclosure
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
B
Blog
大猫的无限游戏
大猫的无限游戏
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
腾讯CDC
WordPress大学
WordPress大学
小众软件
小众软件
K
Kaspersky official blog
Attack and Defense Labs
Attack and Defense Labs
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
Forbes - Security
Forbes - Security
aimingoo的专栏
aimingoo的专栏
IT之家
IT之家
The Last Watchdog
The Last Watchdog
N
News and Events Feed by Topic
B
Blog RSS Feed
S
Security @ Cisco Blogs
美团技术团队
量子位
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Cloudbric
Cloudbric
Hacker News - Newest:
Hacker News - Newest: "LLM"

GitHub Status - Incident History

Disruption with some GitHub services Disruption with some GitHub services Degradation with Webhooks, Pull Requests and Actions We are seeing elevated errors with Next Edit Suggestions and Completions Disruption with Copilot next edit suggestions Incident With Webhooks Incident with Copilot Availability Disruption with some GitHub services Multiple services have elevated errors and endpoint failures when checking feature flags Increased latency with webhooks Incident with Webhooks Authentication issues related to API requests Degraded availability for GitHub.com, GraphQL API, and Webhooks UI/API Disruption with Claude Opus 4.7 Pull Requests and Issues unavailable for signed-out users EU Network Maintenance Disruption with some GitHub services in the EU region Auth issue resulting in API impacts, including some Slack and Teams channel subscriptions Live updates degraded Elevated error rates across multiple services Disruption with OpenAI Models Webhook APIs and UI Degraded Elevated rate of Git push errors Intermittent errors with app installation token authentication Incident with Copilot Actions is experiencing degraded availability [Retroactive] Incident with GitHub.com Incident with CodeQL Incident with CodeQL, Webhooks, Notifications, and Slack Integration Incident with high errors on Git Operations CCR and CCA failing to start for PR comments Incident with Actions, we are investigating reports of degraded availability Increased Latency and Failures for SSH Git Operations Incident with Issues and Webhooks Incomplete pull request results in repositories GitHub search is degraded Delays with Actions Jobs for Larger Runners using VNet Injection in the East US region Delays with Code Scanning and Billing Incident with Pull Requests, Issues, Git Operations and API Requests Disruption with some GitHub services Incident with Actions and Pages Incident with Pull Requests Disruption with some GitHub services Incident with Actions Disruption with some GitHub services Disruption with some GitHub services Copilot Code Review Failing Disruption with some GitHub services
Incident with Actions
2022-08-01 · via GitHub Status - Incident History

Resolved

On May 20, 2026, between 16:00 UTC and 17:45 UTC, GitHub Actions customers experienced run start delays exceeding 5 minutes. Approximately 4.5% of all runs were delayed during the impact window, with scale set jobs disproportionately affected. 30% of scale set jobs were delayed and 4% failed to start entirely.

The incident was caused by a misconfigured health check on an internal service that assigns jobs to runners. A brief latency spike in an upstream dependency triggered health check failures across several pods, removing them from service and concentrating load on the remaining capacity. The added load drove memory pressure that escalated into a cascading failure in one regional cluster, leaving it unable to self-recover.

Responders mitigated the incident by scaling capacity in the healthy regional clusters and draining traffic away from the impaired one, after which run start latency recovered. To prevent recurrence, we are strengthening our health check configuration to avoid cascading failure scenarios and evaluating automated mitigations to rebalance traffic when a region is degraded.

Posted May 20, 2026 - 20:14 UTC

Update

Customer impact has fully subsided. We are maintaining yellow status while we deploy a permanent fix to prevent recurrence.

Posted May 20, 2026 - 19:41 UTC

Update

We've applied a mitigation to fix the issues with queuing and running Actions jobs. We are seeing improvements in telemetry and are monitoring for full recovery.

Posted May 20, 2026 - 18:17 UTC

Monitoring

The degradation affecting Actions has been mitigated. We are monitoring to ensure stability.

Posted May 20, 2026 - 17:52 UTC

Update

A subset of runners are taking longer than expected to connect, which may delay some jobs from beginning execution. We are actively working to mitigate the issue.

Posted May 20, 2026 - 17:46 UTC

Investigating

We are investigating reports of degraded performance for Actions

Posted May 20, 2026 - 16:58 UTC

This incident affected: Actions.