TestSprite launches an open-source command-line tool to help AI agents check their own work
by Kyt Dotson · 2026-06-11 · via SiliconANGLE

TestSprite Inc., maker of an autonomous artificial intelligence-powered software testing tool, today announced that it has open-sourced its command-line interface tool, which allows AI coding agents to verify their own work.

As the AI coding revolution has rolled in, autonomous coding tools have become smarter and enabled developers to prompt their way to entire applications overnight. Code gets written faster, but software can also come off the digital assembly line with unseen bugs that the unit tests run by agentic tools fail to catch.

In too many cases, an AI agent might report a feature complete even though some of the tests failed, were written incorrectly, were incomplete or were simply skipped. Other times a coding agent might write a function that appears to run correctly but hides a bug that triggers only in an edge case a customer hits under particular circumstances (even 1 in 1,000 is too often) – or, in the worst-case scenario, it breaks some other part of the codebase altogether.

“That’s exactly what’s driving developers crazy,” said founder and Chief Executive Yunhao Jiao. “You use AI, you ship something new, you fix one thing and then boom, another thing crashes. Even the best agent in our competition broke 12% of the features that already worked. That’s the gap a verifier closes.”

TestSprite said today’s release provides a command-line interface, a space on the terminal, that gives coding agents a real quality assurance loop, not a spot check.

The coding agent describes a behavior once. TestSprite then runs it in the cloud the way a real user might, driving a live browser or hitting a live application programming interface, never using mock protocols. It then returns a single, self-consistent failure mode: the failing step and its neighbors, screenshots, a Document Object Model manifest, the test source, a root cause hypothesis and a recommended fix.

The AI coding agent can then read the data, fix the code and rerun.
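
The describe, run, read, fix and rerun cycle described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not TestSprite's actual interface: the `FailureReport` class, its field names and the `verify_fix_loop` function are hypothetical, with the fields chosen to mirror the items the company says it returns. The test runner and the agent's fix step are passed in as plain callables.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class FailureReport:
    """Hypothetical container mirroring the fields listed in the article."""
    failing_step: str
    neighboring_steps: list[str]
    screenshots: list[str]
    dom_manifest: str
    test_source: str
    root_cause_hypothesis: str
    recommended_fix: str


def verify_fix_loop(
    run_test: Callable[[], Optional[FailureReport]],
    apply_fix: Callable[[FailureReport], None],
    max_attempts: int = 5,
) -> tuple[bool, int]:
    """Run the test; on failure, let the agent fix the code, then rerun.

    Returns (passed, number_of_test_runs).
    """
    for attempt in range(1, max_attempts + 1):
        report = run_test()          # None means the behavior passed
        if report is None:
            return True, attempt
        if attempt < max_attempts:   # no point fixing if we will not rerun
            apply_fix(report)
    return False, max_attempts


# Stand-in "application" with one bug that the first fix removes.
state = {"bug": True}


def fake_run_test() -> Optional[FailureReport]:
    if not state["bug"]:
        return None
    return FailureReport(
        failing_step="click 'Save'",
        neighboring_steps=["fill form", "check confirmation banner"],
        screenshots=["step-3.png"],
        dom_manifest="<form id='profile'>...</form>",
        test_source="(test source omitted in this sketch)",
        root_cause_hypothesis="submit handler is not attached",
        recommended_fix="attach the handler when the form mounts",
    )


def fake_apply_fix(report: FailureReport) -> None:
    state["bug"] = False             # pretend the agent patched the code


result = verify_fix_loop(fake_run_test, fake_apply_fix)
print(result)  # (True, 2)
```

In the stand-in run, the first test run fails, the fix is applied and the second run passes, so the loop reports success after two runs. Capping the number of attempts keeps an agent from looping forever on a failure it cannot repair.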

This becomes the test loop. Every time the agent runs a phase of work, TestSprite adds dozens of new tests, so coverage grows alongside the codebase. The growing suite acts as a safety net that covers potential gaps and catches regressions as the application becomes more complex.

The TestSprite CLI is open source under the Apache 2.0 license and available today. Installation is simple using “npm install -g @testsprite/cli” for machines with Node.js 2.0 or higher. Documentation and reference are available on GitHub.

CoderCup: Publicly refereed AI agent coding battle

In addition to the CLI open-source announcement, TestSprite launched CoderCup, a public competition and leaderboard in which AI agents built and deployed the same app under one clock.

The company used its newly open-sourced CLI as a neutral referee, in a nod to soccer's World Cup, which also kicked off today. The test agent scored each phase and linked each score to the public evidence supporting it.

In the first event, several frontier agents went head-to-head, including Anthropic PBC’s Claude Code, OpenAI Group PBC’s Codex and Google LLC’s Antigravity, with TestSprite publishing the full results and per-phase scores openly at codercup.ai.

“Most benchmarks score AI coding agents on a single number, but that’s not what developers actually feel,” Jiao said. “What matters day to day is stuff no leaderboard captures.”

Those metrics include what agents get right the first time, how often they break something that used to work, and whether they can recover on their own.
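
As a rough illustration of how three such measures could be computed from per-phase tallies, consider the sketch below. Everything in it is an assumption made for the example: the `PhaseResult` fields, the `summarize` function and the sample numbers are invented, and they are not CoderCup's scoring method or its published results.

```python
from dataclasses import dataclass


@dataclass
class PhaseResult:
    """Hypothetical per-phase tallies; not CoderCup's actual schema."""
    new_features: int        # features attempted in this phase
    passed_first_try: int    # of those, how many passed on the first run
    previously_passing: int  # features that passed before this phase began
    broken_this_phase: int   # of those, how many now fail (regressions)
    recovered_unaided: int   # regressions the agent later fixed by itself


def summarize(phases: list[PhaseResult]) -> dict[str, float]:
    """Aggregate the tallies into three rates, guarding against zero division."""
    new = sum(p.new_features for p in phases)
    first = sum(p.passed_first_try for p in phases)
    prior = sum(p.previously_passing for p in phases)
    broken = sum(p.broken_this_phase for p in phases)
    recovered = sum(p.recovered_unaided for p in phases)
    return {
        "first_pass_rate": first / new if new else 0.0,
        "regression_rate": broken / prior if prior else 0.0,
        "self_recovery_rate": recovered / broken if broken else 0.0,
    }


# Invented numbers, for illustration only.
phases = [
    PhaseResult(new_features=10, passed_first_try=8,
                previously_passing=0, broken_this_phase=0, recovered_unaided=0),
    PhaseResult(new_features=10, passed_first_try=7,
                previously_passing=10, broken_this_phase=2, recovered_unaided=1),
    PhaseResult(new_features=5, passed_first_try=5,
                previously_passing=15, broken_this_phase=1, recovered_unaided=1),
]

summary = summarize(phases)
for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

Keeping the three rates separate, rather than folding them into one score, reflects Jiao's point that a single number hides what developers actually experience.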

The frontier players each took to the field with distinct strengths and weaknesses. Claude Code stood out for consistency, whereas Codex and Antigravity were the quickest overall, each finishing in under 100 cumulative minutes.

Beijing Moonshot AI Technology Co. Ltd.’s Kimi strolled in the opposite direction: It was the slowest on the clock, at around 350 minutes, but that slow roll paid off. Although smaller and cheaper, Kimi posted the highest correctness in the field at 0.89 and the lowest total cost, outclassing agents many times its size.

Agents that ran the fastest were rarely the ones that made the grade. Every agent, even the most stalwart, kept breaking work it had already completed.

“We built CoderCup to make those things visible. The soccer faceoff is the fun part; the metrics underneath are the real point,” Jiao added.

Image: SiliconANGLE/Microsoft Designer
