惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

爱范儿
爱范儿
Know Your Adversary
Know Your Adversary
Google DeepMind News
Google DeepMind News
A
Arctic Wolf
P
Privacy & Cybersecurity Law Blog
云风的 BLOG
云风的 BLOG
Stack Overflow Blog
Stack Overflow Blog
V
Visual Studio Blog
Project Zero
Project Zero
L
LangChain Blog
N
News and Events Feed by Topic
博客园 - Franky
Last Week in AI
Last Week in AI
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
T
The Blog of Author Tim Ferriss
宝玉的分享
宝玉的分享
Scott Helme
Scott Helme
T
The Exploit Database - CXSecurity.com
P
Proofpoint News Feed
Blog — PlanetScale
Blog — PlanetScale
www.infosecurity-magazine.com
www.infosecurity-magazine.com
W
WeLiveSecurity
月光博客
月光博客
博客园_首页
美团技术团队
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
腾讯CDC
Latest news
Latest news
WordPress大学
WordPress大学
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Spread Privacy
Spread Privacy
Attack and Defense Labs
Attack and Defense Labs
量子位
L
LINUX DO - 热门话题
C
CERT Recently Published Vulnerability Notes
Webroot Blog
Webroot Blog
L
Lohrmann on Cybersecurity
aimingoo的专栏
aimingoo的专栏
T
Troy Hunt's Blog
Security Latest
Security Latest
小众软件
小众软件
Cloudbric
Cloudbric
Hacker News: Ask HN
Hacker News: Ask HN
S
Secure Thoughts
雷峰网
雷峰网
T
Threat Research - Cisco Blogs
H
Hacker News: Front Page
IT之家
IT之家
Simon Willison's Weblog
Simon Willison's Weblog

BankInfoSecurity.com RSS Syndication

OnDemand | Why Cloud Intrusions Still Evade Detection Bank information security news, training, education Bank information security news, training, education Bank information security news, training, education Bank information security news, training, education Startup Geordie AI Lands $30M to Secure Enterprise AI Agents AI Exploit Risks Pushing Healthcare Security Shift Miasma Worm Hits Microsoft's AI Coding Ecosystem Senate Committee Leader Seeks Answers on NYC Health Hack Webinar | Securing the Agentic Enterprise: An Integrated Policy Framework for Enterprise AI Security Webinar | Securing the Agentic Enterprise: An Integrated Policy Framework for Enterprise AI Security AI Generated Code Is Expanding the Attack Surface What DORA, AI Oversight, and Cloud Dependency Mean for Business and Risk Leaders Why Hospitals Must Rethink Cyber Resilience Why The Privacy Risks of Embedded, Shadow AI in Healthcare The End of Static Security: Why AI Demands Real-Time Microsegmentation Anthropic Submits Pre-IPO SEC Filing, Leads Market Cap Fight AI Agents Are the New Insiders Demystifying Claude: Signal vs. Speculation Integrity or Innovation? Mixed Signals in Trump's Exec Orders Health Cyberthreat Sharing Is Advancing But Gaps Persist AI Is Reshaping Cybersecurity Training Priorities Claude Mythos 5 Can Build Exploits But Can't Power Campaigns Are Small Models Closing the Gap on Frontier AI Cyber Tools? Securing AI in Financial Services with Zero Trust Beyond the Inbox: Defending Against AI-Enabled Social Engineering Webinar | 6 Layers Standing Between Your Enterprise and AI Risk Webinar | 6 Layers Standing Between Your Enterprise and AI Risk How AI Governance Protects Patient Care and Sensitive Data Election Systems Are Now a Persistent Cyber Target DOJ, FBI Seize 13 Domains in Chinese Recruitment Op A Security Gets $37M to Thwart Weaponized AI With Automation Breach Roundup: CISA Says Agencies Should 'Patch Smarter' Joint Commission Certification Targets Healthcare AI Risks German Court: Google Liable for AI Summaries Google Sues Chinese Phishing Service Over Gemini Abuse Policy as Code: From Documents to Machine Intelligence Ozempic Drug Maker Loses Clinical Trial Data in Hack ISMG Editors: Anthropic Unleashes Claude Mythos 5 ISACA Survey: AI Adoption Is Rising, Visibility Is Not Anthropic Limits on OT Access to Mythos Draw Criticism Webinar | Frontier AI and Identity Security in Financial Services US Pulls the Plug on Anthropic 1Password Buys Apono to Expand AI Access Governance US Anthropic Export Controls Sparks Sharp EU Reaction GovSec Summit USA 2026: Cyber Resilience Amid Fiscal Reality Why AI Defenses Fail Without Data and Identity Fundamentals Geopolitics Is Now a Cybersecurity Problem Mythos Shutdown Contains a Message: Don ShinyHunters Hits Universities Via Oracle Zero-Day Labcorp Agrees to Pay $35M to Settle AMCA Data Breach US FCC Eases Router Ban for Cable ISPs How FDA Chinese Hacking Firm Upgrades With New Windows Backdoor South Korea Fines Coupang $409M Over Massive Data Breach Cyber Resilience Summit Dallas Prioritizes Risk Management Hacker: Restore Fable and Mythos Access, Cybersecurity Leaders Urge Live Webinar | Behind Dell’s AI Infrastructure Performance Rokarolla Android Banking Trojan Enables Device Takeover Ent Raises $100M to Reinvent Endpoint Security for AI Era The AI Accountability Gap CIOs Can Chinese Espionage Actor Abuses Email Rules to Steal Research Data AWS Unveils Continuum to Fight Vulnerability Backlog SpaceX Bets Big on AI Coding With $60B Cursor Deal Quantum-Safe Cryptography Isn Heart Monitoring Firm Tells SEC Hackers Stole Sensitive Data Mastra AI Framework Poisoned in npm Supply-Chain Attack Cyberspace Locked in a Nation-State Contest, Says NCSC CEO Webinar | The Future of SASE: Top 5 Predictions and Trends The Gentlemen Ransomware Gang Standardizes EDR Killing CISA Urges OT Resilience in Dark Remarks About Cyberattacks Attackers Steal Salesforce Data From Klue Battlecards Users Crime Gang Sells Access to 74,000 Fortinet Firewall Devices JPMorgan Pulls Anthropic Claude Access in Hong Kong Webinar | From SBOM to Submission: Operationalizing CRA Vulnerability Handling 6 Ways to Contain Enterprise Risk in Model Context Protocol Breach Roundup: ShinyHunters Leaks 26M MSG Records AI Inherits People Accenture Buys Majority Stake in Dragos in $4.2B Deal Multimillion-Dollar Settlement Reached in MCNA Dental Hack Addressing Quantum Readiness in Healthcare Security Klue Confirms OAuth Token Theft Led to Salesforce Data Heist Cybercrime Initial Access Service SocGholish Disrupted Experts Warn of From Reflection to Shadow: AI, Us and the Space in Between ISMG Editors: Cyber Backlash Over the US Ban on Anthropic AI France and Germany Boost Digital Sovereignty Push North Korean IT Workers Try, Try, Try Again HIPAA Europe Seeks to Advance 6G Security, Privacy No Zero-Day Tied to 80,000 Harvested Fortinet Credentials Is It Time to Put Some Teeth in Post-Quantum Guidelines? New AI Model Aims to Transform Behavioral Health Lawsuits Already Getting Filed in Drug Maker Sakana AI Bets on Agent Orchestration Over Frontier Models OpenAI Lets Cyber Vendors Embed GPT-5.5 in Defenses AryStinger Botnet Converts Legacy Routers to Global Proxies Trump Executive Order Accelerates Post-Quantum Security Push
OpenAI Unveils
Emilia David · 2026-06-25 · via BankInfoSecurity.com RSS Syndication

Artificial Intelligence & Machine Learning , Next-Generation Technologies & Secure Development

Custom Silicon Advances Firm's Push Toward a Full AI Stack June 24, 2026    
OpenAI Unveils 'Jalapeño' Inference Chip
Image: Shutterstock/ISMG

OpenAI took its first step towards becoming a full-stack, end-to-end artificial intelligence company after it announced its first inference chip, Jalapeño.

See Also: Accelerate Vector Search for enterprise-scale AI with Elastic and NVIDIA

The company developed Jalapeño in partnership with Broadcom and Canadian electronics manufacturer Celestica. OpenAI will mostly use Jalapeño for model responses and actions rather than training large language models, for which it uses Nvidia chips and GPUs. Inference chips tend to use less energy than training chips and often run faster.

In a blog post, OpenAI said Jalapeño is built "for modern LLM inference, not a general purpose accelerator adapted from earlier AI workloads." While Jalapeño will be deployed mostly for current and future AI models, much of its design was informed by how OpenAI's models and products work.

"The goal is to combine the power and throughput of today's leading AI accelerators with latency closer to the fastest specialized inference systems, making Jalapeño well-suited for interactive LLM products at scale," OpenAI said.

OpenAI plans to offer the chip to data center partners this year, though the timeline is unclear.

The inference space accelerated in the past few years as established companies like AMD, Intel, Google and AWS began designing silicon specifically for inferencing tasks. Investment in smaller competitors such as SambaNova and Groq also grew. Making any chip is difficult, but inferencing handles a narrower, more predictable computational workload than a chip focused on training.

Jalapeño already runs workloads at "production target frequency and power," including GPT-5.3-Codex-Spark. OpenAI said it's still measuring the chip's performance metrics, but early tests showed it delivers "better than the current state of the art." The company explained that one reason for its strong performance is the chip's architecture, which reduces data movement and balances computing.

The AI company's first chip "was designed from the ground up for LLM inference using detailed insights from our close collaboration with OpenAI researchers," said OpenAI hardware lead Richard Ho in the blog post. "We optimized the architecture around the kernels, memory movement, networking and serving patterns that matter most for frontier AI models. Based on early testing, Jalapeño will efficiently execute our most important workloads close to the hardware's theoretical limits."

The biggest impact Jalapeño has on OpenAI is the ability to move away from third-party inference chips. While its dependence on Nvidia GPUs to train its AI models is likely to continue, OpenAI can begin lessening its reliance on Nvidia or Google for inference tasks. OpenAI has used a mix of different inference silicon over the years, inking deals with Nvidia, AMD and Google's tensor processing units to further refine its models, agents and platforms.

OpenAI calls this the full-stack advantage. Should the chips become a product success, it will have greater control over costs, potentially lowering the cost of model development. OpenAI's Stargate project, a network of data centers across Texas, New Mexico and the Midwest, will also give the company more control over this area of the AI stack.

OpenAI seizing more control over its tech stack won't automatically mean lower costs for end users. Vertical integration does equate to less vendor dependence, but it requires significant upfront capital expenditures without a guarantee of a return. OpenAI has already raised a lot of money and filed the necessary forms to go public, just to fund many of its infrastructure projects. Other companies that are more vertically integrated, such as Google - which makes its own chips, develops models, runs its own data centers, hosts its services in the cloud and has a built-in distribution system for its products - still price their LLMs at a premium.