InertiaRSS Track and read blogs, news, and tech you care about
Read Original Open in InertiaRSS

Recommended Feeds

博客园 - 司徒正美
MyScale Blog
MyScale Blog
雷峰网
雷峰网
IT之家
IT之家
L
LangChain Blog
T
Tailwind CSS Blog
博客园 - Franky
宝玉的分享
宝玉的分享
The GitHub Blog
The GitHub Blog
MongoDB | Blog
MongoDB | Blog
量子位
美团技术团队
Blog — PlanetScale
Blog — PlanetScale
A
About on SuperTechFans
Vercel News
Vercel News
酷 壳 – CoolShell
酷 壳 – CoolShell
Google DeepMind News
Google DeepMind News
Stack Overflow Blog
Stack Overflow Blog
M
MIT News - Artificial intelligence
S
SegmentFault 最新的问题

InfoQ - 促进软件开发领域知识与创新的传播

Meta 收购 Manus 这事儿泡汤了 5.5万 Star 开源项目 Ghostty 被迫出走,GitHub 正在终结一代技术人的乌托邦 Slack 长时运行多智能体系统的上下文管理方案 从 T+1 到分钟级:金城银行基于 Apache Doris 构建高可靠、强一致的实时数据平台 谷歌云推出 Agents CLI,简化 AI 智能体开发全流程 Claude官方击穿高薪、高学历的安全防线!Anthropic点名10大高危职业,但有群人暂时稳了 亚马逊云科技终止 WorkMail 服务,并将 App Runner 转入维护模式 OPPO小布记忆:全模态碎片化内容的理解与智能整理实践|AICon上海 模力工场038周AI应用周榜:工具在消失,工作流在出现 Akamai CEO Tom Leighton:Agent 时代来临,云基础设施正从“中心化”转向“分布式边缘” 日均数百亿入库背后:从“人肉调度”到K8s弹性架构,度小满金融基于OceanBase重构入库架构实践 百度文库网盘发布GenFlow 4.0:月活用户超1亿,要把网盘变成全端AI工作台 Altman 投的 Agent 终端 Warp 开源了!斩获3.5万star 哪些客户需要拒, 敢让龙虾决定吗?_AI&大模型_InfoQ 中文站_InfoQ精选视频 从开发到生产:为什么越来越多的机器学习团队纷纷迁移到 Snowflake | BUILD 2025_AI&大模型_王玮_InfoQ精选视频 探索多智能体工作流:LangGraph Snowflake Cortex AI | BUILD 2025_AI&大模型_王玮_InfoQ精选视频 腾讯云分布式缓存数据库:AI Agent - 从提示词工程到 Harness 工程 | 腾讯云数据库 DBTalk_腾讯_凌敏_InfoQ精选视频 基于 Streamlit 为 CSV 数据构建分析智能体 | BUILD 2025_AI&大模型_王玮_InfoQ精选视频 AI 智能体:告别文档缺漏 | BUILD 2025_AI&大模型_王玮_InfoQ精选视频 构建 AI 驱动的数据管道:深度探讨 Snowflake Openflow 与非结构化数据 | BUILD 2025_AI&大模型_王玮_InfoQ精选视频 云端太贵、本地不够聪明,英特尔押注“端云混合AI”:智能体PC会替人完成工作 不到10%的存储投入,可能拖垮90%的GPU投资!IBM把AI Agent塞进存储系统,算清企业最容易忽略的一笔账 Snowpark 上手实战 | BUILD 2025_大数据_王玮_InfoQ精选视频 ClickHouse + Langfuse,构建 Agent 可观测基石 腾讯云分布式缓存数据库:Cluster Proxy 共享连接架构深度解析 | 腾讯云数据库 DBTalk_腾讯_凌敏_InfoQ精选视频 AI 写代码太烧钱了:Copilot、Claude 一起涨价,不如把程序员请回来? 英特尔发布至强600系列工作站处理器与锐炫Pro B70 GPU,全新AI工作站来了 腾讯云分布式缓存数据库:从 Redis 到 Valkey - 开源社区如何快速创新 | 腾讯云数据库 DBTalk_腾讯_凌敏_InfoQ精选视频 印奇这次要“从0重做”智驾模型!首谈阶跃和千里双公司布局:中国AI商业闭环要靠车跑出来 从Cursor返聘归来,90后华裔女高管带Claude开启日更模式:token成本比工程师工资低多了! 从 Coding 到 Agent:QCon 北京 2026 全景复盘,优秀出品人 & 明星讲师名单揭晓 全链路支撑大模型国产化“Day 0适配”,商汤大装置构建全栈能力底座 凌晨,OpenAI 与亚马逊云科技史上最大联合发布来了 HashiCorp Vault 2.0 发布:引入新身份联邦机制,迈入 IBM 生命周期体系 Yelp 实现超 1,000 个 Cassandra 节点零停机升级 写了 17 年开源代码,我为什么认为 Coding Agents 堆功能是在瞎折腾? 基于 Apache Camel 编排智能体与多模态 AI 管道 面向智能体与人类用户的AI记忆系统:架构设计与核心场景实践|AICon上海 Anthropic 推出 Managed Agents,简化 AI 代理部署流程 阿里HappyHorse开启灰测,720P视频生成低至0.44元/秒 讯飞联合清华团队押注量子AI:不看营收、不设KPI,一群“无人区”科学家,抢夺下代AI算力入口 小米万亿模型全面开源:MIT 协议、1M 上下文,但还是打不过 DeepSeek Cortex Code 入门指南:面向数据工程师的实践路径 | 技术实践 openJiuwen社区首发Team Skills,定义Coordination Engineering新范式 用 Snowflake Cortex Agents 释放结构化数据的最大价值 | 技术实践 Grafana 利用 Kafka 对 Loki 进行了架构重构,并发布了一款命令行工具,旨在将可观测性引入编码代理 ClickHouse重构全文索引:对象存储上跑出高性能 Full-Text Search 可观测性和遥测技术如何提升软件工程实践 Dropbox 与 GitHub 合作,将单体库大小从 87GB 缩减至 20GB Agent 的下一站:基于长期记忆系统 EverOS 的自我演进|AICon上海
GitHub uses eBPF to eliminate deployment risks and prevent cascading failures caused by circular dependencies.
作者:Craig Ris · 2026-05-10 · via InfoQ - 促进软件开发领域知识与创新的传播

GitHub recently announced a new deployment security solution that uses eBPF to detect and block hidden circular dependencies, preventing the system from losing recovery capabilities during failures. According to GitHub's latest engineering blog, this technology monitors and restricts the network behavior of deployment processes at the kernel level, ensuring that critical systems can still complete updates and repairs even when some platform services are unavailable.

This innovation primarily addresses a long-standing risk in large systems: circular dependencies. A circular dependency occurs when a deployment tool directly or indirectly depends on the service it is supposed to fix. GitHub cites examples where deployment scripts might attempt to download binaries, call internal services, or trigger background update tasks that depend on GitHub itself. Once the platform enters a failure state, these dependencies can create cascading issues, hindering the repair process and prolonging downtime. By using eBPF to isolate deployment processes and control their outbound network access, GitHub can proactively block such calls and expose them to engineering teams before they escalate into incidents.

The core of this solution lies in eBPF's ability to run custom programs within the Linux kernel and attach them to low-level system events such as network requests. GitHub leverages this capability to place deployment scripts in a controlled environment (cGroups), where network traffic can be inspected, filtered, or blocked according to predefined rules. This allows the platform to implement fine-grained, per-process network policies without affecting the overall system or production traffic.

To address management challenges in dynamic infrastructure environments, GitHub further extended this solution by introducing a filtering mechanism with DNS awareness. By intercepting DNS queries and routing them to a proxy layer, the system can evaluate outbound requests based on domain names rather than static IP addresses, making it more adaptable in large-scale, frequently changing environments. At the same time, the system can map blocked requests back to specific processes and commands, giving teams clear insight into what triggered the issue and how to fix it.

The identification of traditional circular dependencies often relies on manual effort and typically only becomes apparent after an incident occurs. GitHub's approach transforms this process into proactive detection: whenever a deployment introduces a risky dependency—whether direct, hidden, or transient—the system immediately issues a warning. This not only reduces the likelihood of deployment failures during incidents but also improves the Mean Time to Recovery (MTTR) by ensuring that fix paths remain available at all times.

After a six-month gradual rollout, the system is now used to protect deployment processes within GitHub's infrastructure. Additionally, it has brought extra benefits, including auditing outbound requests during deployments and using resource limits to prevent runaway scripts from affecting production workloads.

GitHub's application of eBPF also reflects a broader industry trend: as system complexity continues to increase, more organizations are turning to kernel-level observability and control capabilities. Today, eBPF is no longer used solely for monitoring but also for runtime policy enforcement, security hardening, and real-time system behavior management. This approach enables platform teams to go beyond the limitations of traditional application-layer controls and gain a deeper understanding of how systems operate in real-world environments.

This practice also highlights an important evolution in deployment philosophy: ensuring not only that the system operates normally, but also that it remains capable of recovery after a failure. As the degree of coupling between platforms continues to increase, hidden dependencies may lead to unpredictable failure modes. By embedding protection mechanisms directly into the operating system layer, GitHub demonstrates how modern infrastructure can enhance resilience, ensuring that the tools used to repair the system do not themselves depend on the system being repaired.

Other large platforms also face hidden dependencies and deployment security issues, and have adopted similar but not identical approaches. For example,GoogleFor a long time, in internal systems (such asBazel) emphasizes dependency isolation andSealed construction"(hermetic builds)", ensuring that the build and deployment process does not rely on external states or runtime environments that may fail during failures. This design naturally reduces the risk of circular dependencies because the deployment process itself is reproducible and self-contained. Similarly,Amazon Web Services (AWS) promotes a Cell-based architecture pattern , dividing services into mutually isolated units to limit the propagation scope of failures and their dependencies, thereby ensuring that deployment and recovery paths remain available even when parts of the system degrade.

In the cloud-native ecosystem, Kubernetes and network-layer projects such as   Cilium are also evolving toward runtime policy control and observability at the kernel and network layers, similar to GitHub's direction with eBPF. Meanwhile, GitLab focuses more on pipeline isolation and dependency control, advocating practices such as artifact pinning, offline runners, and restricted network access during CI/CD execution.

Behind these different approaches, a common trend can be observed: leading platforms no longer rely solely on process norms or documentation to avoid circular dependencies, but instead embed protective mechanisms directly into the infrastructure and execution environment, ensuring that the deployment system remains reliable even under fault conditions.

Original link:

https://www.infoq.com/news/2026/04/github-ebpf-deployment/